BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 023868
         (276 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|449457015|ref|XP_004146244.1| PREDICTED: uncharacterized protein LOC101221005 [Cucumis sativus]
 gi|449495512|ref|XP_004159863.1| PREDICTED: uncharacterized LOC101221005 [Cucumis sativus]
          Length = 282

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 174/283 (61%), Positives = 205/283 (72%), Gaps = 15/283 (5%)

Query: 1   MLQTHHLLSLNFP-FTVSHHPQKLN---FLQKPTISLSAFPRR------RPLIEPYCLAQ 50
           M  T +LLS NFP FT+S  P   N   FL  PT +LS+  R        PL    C   
Sbjct: 1   MWHTQNLLSSNFPLFTLS--PPTYNHKLFLSPPT-TLSSLHRPITFHSVSPLTNHRCFCL 57

Query: 51  AQEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
            Q  T    A    + GPVELP +IFATTD+PSSLQVATSVLLTGAISVFLFR+LRRRAK
Sbjct: 58  PQF-TDLADATFLDDNGPVELPPTIFATTDDPSSLQVATSVLLTGAISVFLFRSLRRRAK 116

Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKF 170
           R KELKFRS G KKSLK+EA+D+LKA+ +  I++K  PSP+QA LG + AGVIA+ILYKF
Sbjct: 117 RVKELKFRSGGVKKSLKEEAMDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKF 176

Query: 171 TTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSF 230
           TTTIEA+LNRQT+SDNFSVRQ+TITIRTIVNGLCYLATFVFGIN++GLFLYSGQLA+NS 
Sbjct: 177 TTTIEASLNRQTVSDNFSVRQLTITIRTIVNGLCYLATFVFGINAIGLFLYSGQLAMNSV 236

Query: 231 TEDSSSSKETENIGEQQSGSLNSTAENATD-TELNSGKEDQSS 272
            E+ S   E +   ++Q     STAE   D TE ++ K+DQSS
Sbjct: 237 MEEGSKDTEPKAKSDEQVSPPTSTAETTLDSTESSNSKDDQSS 279


>gi|225461564|ref|XP_002282834.1| PREDICTED: uncharacterized protein LOC100267434 [Vitis vinifera]
 gi|302142945|emb|CBI20240.3| unnamed protein product [Vitis vinifera]
          Length = 277

 Score =  292 bits (747), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 174/275 (63%), Positives = 205/275 (74%), Gaps = 21/275 (7%)

Query: 6   HLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSE 65
           H   L+ PF     P  L+      +S     R  PL+       AQ P T  TAP   E
Sbjct: 20  HNPCLSPPFCSLSRPTSLHL-----VSAHLRTRAEPLL-------AQVPDTTTTAP---E 64

Query: 66  EGPVELP---QSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGA 122
           EGP+ELP    SIFAT D+P+ LQVATSVLLTGAISVFLFR++RRR KRAKEL+FRSSG 
Sbjct: 65  EGPIELPPSSSSIFATNDDPTPLQVATSVLLTGAISVFLFRSIRRRVKRAKELRFRSSGV 124

Query: 123 KKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQT 182
           KK+LK+EALD+LKA+GS S+ A  PPSPVQALLGG+TAGVIA+ILYKFT TIEA+LNRQT
Sbjct: 125 KKTLKEEALDSLKAMGSGSVKA-APPSPVQALLGGITAGVIALILYKFTITIEASLNRQT 183

Query: 183 ISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETEN 242
           +SDNFSVRQITITIRTI+NGLCYLATFVFGINSVGLFLYSGQLALNSF E  S+S+ET+ 
Sbjct: 184 VSDNFSVRQITITIRTIINGLCYLATFVFGINSVGLFLYSGQLALNSFME-GSTSEETQI 242

Query: 243 IGEQQSGSLNSTAENATD-TELNSGKEDQSSDSRQ 276
            GE+Q  + N+  ++  D T+LNS + DQ +   Q
Sbjct: 243 KGEEQLSTPNTLTDSPPDGTQLNSSEGDQRAQDEQ 277


>gi|255564541|ref|XP_002523266.1| conserved hypothetical protein [Ricinus communis]
 gi|223537479|gb|EEF39105.1| conserved hypothetical protein [Ricinus communis]
          Length = 285

 Score =  286 bits (733), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 177/288 (61%), Positives = 211/288 (73%), Gaps = 15/288 (5%)

Query: 1   MLQTHHLLSLN--FPFTVSHHPQKLNF-----LQKPTISLSAFPRRRPLIEPYCLAQAQE 53
           ML  HH +     FP    HH             KPT SLS F    P  +P     ++ 
Sbjct: 1   MLHCHHYIPSPQFFPRNHHHHTSTPYLSLTSSFSKPT-SLSTF-SALPPSKPDIWLASEP 58

Query: 54  PTTNITAPTTSEEGPVELP----QSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRA 109
                 +P+  E+GP+ELP     SIFATTD+PS +QVATSVLLTG+I VFLFRALRRRA
Sbjct: 59  TPVTPPSPSPEEDGPIELPFPSSPSIFATTDDPSPIQVATSVLLTGSIGVFLFRALRRRA 118

Query: 110 KRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYK 169
           KRAKELKFRSSGAKK++K+EALD+LKA+GS+SIDA+ PPSP+Q  LGG++A VIA+ILYK
Sbjct: 119 KRAKELKFRSSGAKKTMKEEALDSLKAMGSASIDAQKPPSPLQTFLGGVSAAVIALILYK 178

Query: 170 FTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNS 229
           FTTTIEAALNRQT+SDNFSVRQITIT+RTI+NGLCYLATFVFG+NSVGLFLYSGQLA+NS
Sbjct: 179 FTTTIEAALNRQTVSDNFSVRQITITVRTIINGLCYLATFVFGLNSVGLFLYSGQLAINS 238

Query: 230 FTEDSSSSKETENIGEQQSGSLNSTAENATD-TELNSGKEDQSSDSRQ 276
           F E S+S +     G +QSGSLNS  E+  D TE NS + DQ+ D+ Q
Sbjct: 239 FMEGSTSEESESK-GNEQSGSLNSAPESPIDSTESNSSQGDQNPDNSQ 285


>gi|356544064|ref|XP_003540475.1| PREDICTED: uncharacterized protein LOC100799393 [Glycine max]
          Length = 277

 Score =  285 bits (730), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 176/288 (61%), Positives = 209/288 (72%), Gaps = 34/288 (11%)

Query: 5   HHLLSLNFPFTVSHHPQK--LNFL----QKPTI----SLSAFPRRRPLIEPYCLAQAQEP 54
           HHLL   FP T++HH  K  L+ L     KPTI    S SA P       P+    A   
Sbjct: 8   HHLL---FPLTLAHHKVKPPLSILPPLHHKPTILHSVSASAIP-------PWAAQLADAA 57

Query: 55  TTNITAPTTSEEGPVELP----QSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
               T      EGPVELP     SIFAT+D+PS +QVA+SVLLTGA+SVFLFR+LRRRA+
Sbjct: 58  ADIDT------EGPVELPFSSTPSIFATSDDPSPIQVASSVLLTGAVSVFLFRSLRRRAQ 111

Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAK-GPPSPVQALLGGLTAGVIAIILYK 169
           R K+ +FRSSG +KS+K+EALD+LKA+GS+S+ AK    SPVQALLGG++AGVIA+ILYK
Sbjct: 112 RVKQSQFRSSG-EKSIKEEALDSLKAMGSASVKAKEDSASPVQALLGGISAGVIALILYK 170

Query: 170 FTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNS 229
           F TTIEAALNRQTISDNFSVRQITITIRTIVNGL YLATFVFG+NS+GLFLYSGQLAL S
Sbjct: 171 FATTIEAALNRQTISDNFSVRQITITIRTIVNGLTYLATFVFGLNSLGLFLYSGQLALKS 230

Query: 230 FTEDSSSSKETENIGEQQSGSLNSTAENATD-TELNSGKEDQSSDSRQ 276
              D S+ KETE+    Q    NS+ E++TD TEL+S KE+QSS+  Q
Sbjct: 231 IMGD-STEKETESKSTDQPNLSNSSNESSTDKTELSSRKEEQSSNDAQ 277


>gi|224116710|ref|XP_002317372.1| predicted protein [Populus trichocarpa]
 gi|222860437|gb|EEE97984.1| predicted protein [Populus trichocarpa]
          Length = 211

 Score =  275 bits (704), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 148/197 (75%), Positives = 172/197 (87%), Gaps = 2/197 (1%)

Query: 74  SIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDN 133
           SIFAT+D+PSS+QVATSVLLTGAISVFLFR+LRRRAKR+KELKFRSSGAKK+LK+EALD+
Sbjct: 16  SIFATSDDPSSIQVATSVLLTGAISVFLFRSLRRRAKRSKELKFRSSGAKKTLKEEALDS 75

Query: 134 LKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQIT 193
           LK  GS+ ID K PPSPVQA LG ++AGVIA+ILYKF+TTIEA+LNRQT+SDNFSVRQIT
Sbjct: 76  LKTFGSAPIDVKKPPSPVQAFLGAISAGVIALILYKFSTTIEASLNRQTVSDNFSVRQIT 135

Query: 194 ITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETENIGEQQSGSLNS 253
           IT+RTIVNGLCYLATFVFGINSVGLFLYS QLA NS  E + + + TE+    QSGS NS
Sbjct: 136 ITVRTIVNGLCYLATFVFGINSVGLFLYSAQLAFNSLME-AGTDEVTESTTSGQSGSSNS 194

Query: 254 TAENATD-TELNSGKED 269
           T+++ATD  EL S + D
Sbjct: 195 TSDSATDGAELTSSQGD 211


>gi|118488906|gb|ABK96262.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 238

 Score =  261 bits (666), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 144/225 (64%), Positives = 168/225 (74%), Gaps = 13/225 (5%)

Query: 2   LQTHHLLSLN---FPFT---VSHHPQKLNFLQKPTI----SLSAFPRRRPLIEPYCLAQA 51
           LQ H+   L+   FP +    +HH   L FL +P I    +LSA PR RP   P  LAQ 
Sbjct: 4   LQCHNFHPLSPQIFPLSHHHTNHHLYPLKFLYRPPIFTPFTLSALPRSRP-DPPLWLAQL 62

Query: 52  QEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKR 111
            E               +    SIFAT+D+PSS+QVATSVLLTGAISVFLFR+LRRRAKR
Sbjct: 63  PEQAPEPEE--EGPIEILRSSPSIFATSDDPSSIQVATSVLLTGAISVFLFRSLRRRAKR 120

Query: 112 AKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFT 171
           +KELKFRSSGAKK+LK+EALD+LK  GS+ ID K PPSPVQA LG ++AGVIA+ILYKFT
Sbjct: 121 SKELKFRSSGAKKTLKEEALDSLKTFGSAPIDVKKPPSPVQAFLGAISAGVIALILYKFT 180

Query: 172 TTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSV 216
           TTIEA+LNRQT+SDNFSVRQIT+T+RTIVNGLCYLATFVFGINS+
Sbjct: 181 TTIEASLNRQTVSDNFSVRQITVTVRTIVNGLCYLATFVFGINSL 225


>gi|21617902|gb|AAM66952.1| unknown [Arabidopsis thaliana]
          Length = 266

 Score =  258 bits (659), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 153/285 (53%), Positives = 192/285 (67%), Gaps = 36/285 (12%)

Query: 1   MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITA 60
           +LQ+H  L  + P+ +   P +L     P  SLS+F R RP I    L+  +E       
Sbjct: 3   VLQSHQCL-FSLPYRL--RPTRL---ISPIHSLSSFTRIRPGI--IRLSAVKEIAD---V 51

Query: 61  PTTSEEGPVELPQS----------IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
               E+GP+ELP S          IFAT+D+P+ LQ+ATSVLLTGAI+VFL R++RRRAK
Sbjct: 52  AEVEEDGPIELPTSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAK 111

Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYK 169
           RAKEL+FRS+GAKKSLK+EA+DNLKAL S+ I+     PS  QA LG + AGVIA+ILYK
Sbjct: 112 RAKELQFRSTGAKKSLKEEAMDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYK 171

Query: 170 FTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNS 229
           FT T+E+ LNRQTISDNFSVRQIT+T+RTI+NG+CYLATFVFG+N+ GL LYSGQLA N 
Sbjct: 172 FTVTVESGLNRQTISDNFSVRQITVTVRTIINGICYLATFVFGLNAFGLLLYSGQLAFN- 230

Query: 230 FTEDSSSS--KETENIGEQQSGSLNSTAENATDTELNSGKEDQSS 272
             EDS+    K T   G+  SG          ++E+N   EDQSS
Sbjct: 231 --EDSAEENMKATTQPGDSSSGD---------NSEVNKSNEDQSS 264


>gi|18400636|ref|NP_566500.1| uncharacterized protein [Arabidopsis thaliana]
 gi|87116616|gb|ABD19672.1| At3g15110 [Arabidopsis thaliana]
 gi|332642098|gb|AEE75619.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 266

 Score =  257 bits (656), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 153/285 (53%), Positives = 191/285 (67%), Gaps = 36/285 (12%)

Query: 1   MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITA 60
           +LQ+H  L  + P+ +   P +L     P  SLS+F R RP I    L+  +E       
Sbjct: 3   VLQSHQCL-FSLPYRL--RPTRL---ISPIHSLSSFTRIRPGI--IRLSAVKEIAD---V 51

Query: 61  PTTSEEGPVELPQS----------IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
               E+GP+ELP S          IFAT+D+P+ LQ+ATSVLLTGAI+VFL R++RRRAK
Sbjct: 52  AEVEEDGPIELPTSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAK 111

Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYK 169
           RAKEL FRS+GAKKSLK+EA+DNLKAL S+ I+     PS  QA LG + AGVIA+ILYK
Sbjct: 112 RAKELTFRSTGAKKSLKEEAMDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYK 171

Query: 170 FTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNS 229
           FT T+E+ LNRQTISDNFSVRQIT+T+RTI+NG+CYLATFVFG+N+ GL LYSGQLA N 
Sbjct: 172 FTVTVESGLNRQTISDNFSVRQITVTVRTIINGICYLATFVFGLNAFGLLLYSGQLAFN- 230

Query: 230 FTEDSSSS--KETENIGEQQSGSLNSTAENATDTELNSGKEDQSS 272
             EDS+    K T   G+  SG          ++E+N   EDQSS
Sbjct: 231 --EDSAEENMKATTQPGDSSSGD---------NSEVNKSNEDQSS 264


>gi|297834372|ref|XP_002885068.1| hypothetical protein ARALYDRAFT_478942 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330908|gb|EFH61327.1| hypothetical protein ARALYDRAFT_478942 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 261

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 147/282 (52%), Positives = 187/282 (66%), Gaps = 34/282 (12%)

Query: 1   MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITA 60
           +LQ+H        FT+ H  +    +  P  SLS+F R RP  +   ++  +E       
Sbjct: 3   VLQSHQC-----RFTLPHRLRPTRLIS-PIHSLSSFTRIRP--DRIRVSVVKEIAD---V 51

Query: 61  PTTSEEGPVELPQS----------IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
               E+GP+ELP S          IFAT+D+P+ LQ+ATSVLLTGAI+VFL R++RRRAK
Sbjct: 52  AEVEEDGPIELPPSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAK 111

Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYK 169
           RAKEL+FRS+GAKKSLK+EA+DNLKALGS+ I+     PS  QA LG + AGVIA+ILYK
Sbjct: 112 RAKELRFRSTGAKKSLKEEAMDNLKALGSTPIEGGNSTPSAAQAFLGAIAAGVIALILYK 171

Query: 170 FTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNS 229
           FT T+E+ LNRQTISDNFSVRQIT+T+RTI+NG+CYLATFVFG+N+ GL LY+GQLA N 
Sbjct: 172 FTVTVESGLNRQTISDNFSVRQITVTVRTIINGICYLATFVFGLNAFGLLLYAGQLAFN- 230

Query: 230 FTEDSSSSKETENIGEQQSGSLNSTAENATDTELNSGKEDQS 271
             EDS    +     E  SG          ++E+N   EDQS
Sbjct: 231 --EDSDEDMKATTQPEDSSGD---------NSEVNKSNEDQS 261


>gi|217072908|gb|ACJ84814.1| unknown [Medicago truncatula]
          Length = 271

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 132/217 (60%), Positives = 167/217 (76%), Gaps = 8/217 (3%)

Query: 66  EGPVELPQ----SIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSG 121
           +GP+E+P     S+ ++TD+PS +Q+A S+LLTGAISV LFR+ RRRAKR K+ +FRSSG
Sbjct: 57  DGPIEIPYDSTPSLLSSTDDPSFIQIAASLLLTGAISVLLFRSFRRRAKRLKQTQFRSSG 116

Query: 122 AKKSLKDEALDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
            +KS+K+EAL+ LKA G++SI+  KGPPSPVQ  LG ++AGVI++ILYKF T IEA L+R
Sbjct: 117 -EKSVKEEALETLKATGTASIETTKGPPSPVQTFLGAISAGVISLILYKFATIIEAGLSR 175

Query: 181 QTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKET 240
           QTISD+FS RQITIT+RTI+NGL YLATF+FG+NS+GL LYSGQLAL SFT D S  KET
Sbjct: 176 QTISDDFSARQITITVRTIINGLTYLATFIFGLNSLGLLLYSGQLALKSFTGD-SIEKET 234

Query: 241 ENIGEQQSGSLNSTAE-NATDTELNSGKEDQSSDSRQ 276
           EN    QS   N + E    D EL+   E+QSS+  Q
Sbjct: 235 ENKSADQSSLSNLSVETRINDAELSGRSEEQSSNDAQ 271


>gi|388511081|gb|AFK43606.1| unknown [Lotus japonicus]
          Length = 251

 Score =  239 bits (611), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 143/278 (51%), Positives = 177/278 (63%), Gaps = 52/278 (18%)

Query: 9   SLNFPFTVSHH---------PQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNIT 59
           +L  P T++HH            L+FL +PTI     P     I P   +   E TT   
Sbjct: 15  NLPVPLTLTHHHSSFSHPPPSHLLHFLHRPTI----IPHTISAIPPTLSSWLTELTTG-- 68

Query: 60  APTTSEEGPVELPQSIFATTDEPSS-LQVATSVLLTGAISVFLFRALRRRAKRAKELKFR 118
                              TD+PSS +QV +++L+TGAI++F FR L+RRAKRAKELKFR
Sbjct: 69  -------------------TDDPSSTIQVTSTILVTGAITLFFFRTLQRRAKRAKELKFR 109

Query: 119 SSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAAL 178
           SSG  KSL            S+SI AK  PSP QALLG + AGVIA+ILY+FTT++EA L
Sbjct: 110 SSGVNKSL-----------ASTSIKAKKRPSPDQALLGAIIAGVIAVILYRFTTSVEATL 158

Query: 179 NRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSK 238
            RQTISDNFSVRQ+TITIRTI+NGLCYLATFV+GINSVGL LYSGQLA+N+F E   S K
Sbjct: 159 YRQTISDNFSVRQVTITIRTIINGLCYLATFVYGINSVGLLLYSGQLAMNTF-EGGPSGK 217

Query: 239 ETENIGEQQSGSLNSTAENATDTELNSGKEDQSSDSRQ 276
           ETE    +QSGS NS+ +     +LNS KED++S++ Q
Sbjct: 218 ETEKRILEQSGSSNSSTD-----KLNSTKEDENSNNSQ 250


>gi|357497633|ref|XP_003619105.1| hypothetical protein MTR_6g042030 [Medicago truncatula]
 gi|355494120|gb|AES75323.1| hypothetical protein MTR_6g042030 [Medicago truncatula]
          Length = 254

 Score =  221 bits (562), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 120/204 (58%), Positives = 148/204 (72%), Gaps = 20/204 (9%)

Query: 75  IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNL 134
           IF   DE SS QV  SV+L+ AI+VFLF  ++RR KRAK+LK+RSSG KKS    +L++ 
Sbjct: 69  IFDAIDESSSNQVTISVVLSVAIAVFLFPTIQRRIKRAKQLKYRSSGVKKS----SLNST 124

Query: 135 KAL-GSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQIT 193
           K L GS+   +K PPSP +ALLG + AGVIA+ILY+FTTTIEA+L RQTISDNFSVRQIT
Sbjct: 125 KGLMGSTRKKSKNPPSPDKALLGAVMAGVIAVILYRFTTTIEASLYRQTISDNFSVRQIT 184

Query: 194 ITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETENIGEQQSGSLNS 253
           ITIRT++NGLCYLATFV+GINS GL LYSGQLALN++ + SSS K+ E            
Sbjct: 185 ITIRTVINGLCYLATFVYGINSFGLLLYSGQLALNTYEKGSSSGKKIE------------ 232

Query: 254 TAENATDTELNSGK-EDQSSDSRQ 276
                 +  +NS K EDQSS++ Q
Sbjct: 233 --SKIMENHINSNKNEDQSSNNSQ 254


>gi|356514641|ref|XP_003526013.1| PREDICTED: uncharacterized protein LOC100783357 [Glycine max]
          Length = 276

 Score =  211 bits (536), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 146/273 (53%), Positives = 181/273 (66%), Gaps = 24/273 (8%)

Query: 10  LNFPFTVSHHPQKL---NFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEE 66
           L    T +HH   L   +FL +P I+ +        I P     A+         TT + 
Sbjct: 18  LRLSLTTTHHNSSLPSLHFLHRPIITTTTTTTTVSAIPPTAAWLAKL--------TTQDF 69

Query: 67  GPVELPQS-IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKS 125
              ELP S +F  TD+PS++QVA SVLLTGAI+VF FR+++RRAKRAKELKFRSSG  KS
Sbjct: 70  AHFELPSSHLF--TDDPSTIQVACSVLLTGAITVFFFRSVQRRAKRAKELKFRSSGVNKS 127

Query: 126 LKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISD 185
           L     DNLKA+ SSSI AK PPSP QALLG + AGVIA++LY+FTT+IEA L RQT+SD
Sbjct: 128 L-----DNLKAISSSSIKAKSPPSPDQALLGAIIAGVIAVVLYRFTTSIEATLGRQTLSD 182

Query: 186 NFSVRQ-ITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETENIG 244
           NFSVRQ       TI+NGLCYLATFV+GINSVGLFLYSGQLA+++     SS KETE+  
Sbjct: 183 NFSVRQITITIRETIINGLCYLATFVYGINSVGLFLYSGQLAMDTM-GGGSSGKETESKI 241

Query: 245 EQQSG---SLNSTAENATDTELNSGKEDQSSDS 274
            QQ G   +    + N  + +L+S KEDQSS++
Sbjct: 242 TQQPGLSNASVENSNNNNNNKLSSTKEDQSSNN 274


>gi|357144153|ref|XP_003573191.1| PREDICTED: uncharacterized protein LOC100830225 [Brachypodium
           distachyon]
          Length = 284

 Score =  211 bits (536), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 135/221 (61%), Positives = 161/221 (72%), Gaps = 9/221 (4%)

Query: 35  AFPRRRPLI-----EPYCLAQAQEPTTNITAPTTSEEGPVEL-PQSIFATTDEPSSLQVA 88
           A P RRP       EP     A    +  T     EEGPVEL   ++F+  D P+SLQVA
Sbjct: 28  ARPGRRPTALRAEPEPQSPPPASAAASPDTTDDGGEEGPVELRAPTLFSVDDNPTSLQVA 87

Query: 89  TSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK--SLKDEALDNLKALGSSSIDAKG 146
           TSVLLTGAISVFLFR+LRRRA+RAKEL+ RS G +K  +L +EAL+ L+ + +S ++ + 
Sbjct: 88  TSVLLTGAISVFLFRSLRRRARRAKELRVRSGGLEKPKNLSEEALEALRMVSTSPVETEK 147

Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYL 206
           PPSPVQALLGG+ AGVIA+ LYKF TTIEA+LNRQTISDNFSVRQIT+TIRTI+NG+CYL
Sbjct: 148 PPSPVQALLGGIAAGVIALFLYKFATTIEASLNRQTISDNFSVRQITVTIRTIINGICYL 207

Query: 207 ATFVFGINSVGLFLYSGQLALNSFTEDSSSSKE-TENIGEQ 246
           ATFVFGIN+VGL LYS QL  NS  +D  SS    E I EQ
Sbjct: 208 ATFVFGINAVGLILYSLQLTFNSIMDDDDSSSSPVEKISEQ 248


>gi|326511271|dbj|BAJ87649.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326513094|dbj|BAK06787.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326529293|dbj|BAK01040.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 281

 Score =  209 bits (533), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 123/186 (66%), Positives = 149/186 (80%), Gaps = 3/186 (1%)

Query: 64  SEEGPVEL-PQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGA 122
           S +GPVE+   ++FA  D P+ LQVATSV+LTGAISVFLFR+LRRRA+RAKEL+ RS G 
Sbjct: 63  SGDGPVEIRAPTLFAVDDNPTPLQVATSVMLTGAISVFLFRSLRRRARRAKELRVRSGGV 122

Query: 123 KK--SLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
           +K  +L +EAL+ L+ + +S ++   PPSPVQALLGG+ AGVIA+ LYKF +T+EA+LNR
Sbjct: 123 EKPRNLSEEALEALRMVSTSPVETNKPPSPVQALLGGIAAGVIALFLYKFASTVEASLNR 182

Query: 181 QTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKET 240
           QTISDNFSVRQIT+TIRTI+NGLCYLATFVFGIN VGL LYS QL  NS  +D SSS   
Sbjct: 183 QTISDNFSVRQITVTIRTIINGLCYLATFVFGINGVGLILYSLQLTFNSLMDDDSSSSSV 242

Query: 241 ENIGEQ 246
           E I EQ
Sbjct: 243 EKISEQ 248


>gi|115445727|ref|NP_001046643.1| Os02g0307800 [Oryza sativa Japonica Group]
 gi|46390708|dbj|BAD16208.1| unknown protein [Oryza sativa Japonica Group]
 gi|113536174|dbj|BAF08557.1| Os02g0307800 [Oryza sativa Japonica Group]
 gi|125581805|gb|EAZ22736.1| hypothetical protein OsJ_06408 [Oryza sativa Japonica Group]
 gi|215692736|dbj|BAG88156.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215694440|dbj|BAG89457.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 273

 Score =  203 bits (516), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 132/224 (58%), Positives = 157/224 (70%), Gaps = 9/224 (4%)

Query: 29  PTISLSAFPRRRPLI---EPYCLAQAQEPTTNITAPTTSEEGPVEL-PQSIFATTDEPSS 84
           P  S SA   RR +    EP     A EP           +GPVEL   ++F+  + P+ 
Sbjct: 26  PAASASALRLRRSVAVRAEPELSTSAAEPPPGDDG---EGDGPVELRTPTLFSIDENPTP 82

Query: 85  LQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK--SLKDEALDNLKALGSSSI 142
           LQ ATSVLLTGAISVFLFR++RRR +RAKEL+ RS G +K  +L  EAL+ L+ + +S I
Sbjct: 83  LQTATSVLLTGAISVFLFRSIRRRVRRAKELRVRSGGVEKPNNLSKEALEGLRLVSASPI 142

Query: 143 DAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNG 202
           +   PPSPVQALLGG+ AGVIA+ILYKFTTTIEAALNRQTISD+FSVRQITITIRTI+NG
Sbjct: 143 EVDKPPSPVQALLGGIAAGVIALILYKFTTTIEAALNRQTISDSFSVRQITITIRTIING 202

Query: 203 LCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETENIGEQ 246
           +CYLATFVFGINS+GL LY  QL   S   D +SS   E I EQ
Sbjct: 203 ICYLATFVFGINSIGLVLYGLQLTFASIMGDDNSSSAAEKISEQ 246


>gi|388496044|gb|AFK36088.1| unknown [Medicago truncatula]
          Length = 230

 Score =  197 bits (500), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 97/152 (63%), Positives = 126/152 (82%), Gaps = 6/152 (3%)

Query: 66  EGPVELPQ----SIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSG 121
           +GP+E+P     S+ ++TD+PS +Q+A S+LLTGAISV LFR+ RRRAKR K+ +FRSSG
Sbjct: 57  DGPIEIPYDSTPSLLSSTDDPSFIQIAASLLLTGAISVLLFRSFRRRAKRLKQTQFRSSG 116

Query: 122 AKKSLKDEALDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
            +KS+K+EAL+ LKA+G++SI+  KGPPSPVQ  LG ++AGVI++ILYKF T IEA L+R
Sbjct: 117 -EKSVKEEALETLKAMGTASIETTKGPPSPVQTFLGAISAGVISLILYKFATIIEAGLSR 175

Query: 181 QTISDNFSVRQITITIRTIVNGLCYLATFVFG 212
           QTISD+FS RQITIT+RTI+NGL YLATF F 
Sbjct: 176 QTISDDFSARQITITVRTIINGLTYLATFYFW 207


>gi|218190566|gb|EEC72993.1| hypothetical protein OsI_06901 [Oryza sativa Indica Group]
          Length = 325

 Score =  187 bits (474), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 122/214 (57%), Positives = 145/214 (67%), Gaps = 33/214 (15%)

Query: 66  EGPVEL-PQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKF------- 117
           +GPVEL   ++F+  + P+ LQ ATSVLLTGAISVFLFR++RRRA+RAKEL         
Sbjct: 85  DGPVELRTPTLFSIDENPTPLQTATSVLLTGAISVFLFRSIRRRARRAKELVIPSLLIHL 144

Query: 118 -----------------------RSSGAKK--SLKDEALDNLKALGSSSIDAKGPPSPVQ 152
                                  RS G +K  +L  EAL+ L+ + +S I+   PPSPVQ
Sbjct: 145 RCAFSTGEHTWSTNPISFRDTEGRSGGVEKPNNLSKEALEGLRLVSASPIEVDKPPSPVQ 204

Query: 153 ALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFG 212
           ALLGG+ AGVIA+ILYKFTTTIEAALNRQTISD+FSVRQITITIRTI+NG+CYLATFVFG
Sbjct: 205 ALLGGIAAGVIALILYKFTTTIEAALNRQTISDSFSVRQITITIRTIINGICYLATFVFG 264

Query: 213 INSVGLFLYSGQLALNSFTEDSSSSKETENIGEQ 246
           INS+GL LY  QL   S   D +SS   E I EQ
Sbjct: 265 INSIGLVLYGLQLTFASIMGDDNSSSAAEKISEQ 298


>gi|242065424|ref|XP_002454001.1| hypothetical protein SORBIDRAFT_04g022880 [Sorghum bicolor]
 gi|241933832|gb|EES06977.1| hypothetical protein SORBIDRAFT_04g022880 [Sorghum bicolor]
          Length = 293

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 122/190 (64%), Positives = 147/190 (77%), Gaps = 5/190 (2%)

Query: 45  PYCLAQAQEPTTNITAPTTSEEGPVEL-PQSIFATTDEPSSLQVATSVLLTGAISVFLFR 103
           P   A+ + P     A     +GPVEL   ++F+T D P+ LQ ATS+LLTGAISVFLFR
Sbjct: 55  PVSAAEPEPPDAGAVA-EAEGQGPVELRAPTLFSTDDNPTPLQTATSLLLTGAISVFLFR 113

Query: 104 ALRRRAKRAKELKFRSSGAKK---SLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTA 160
           +LRRRA+RAKEL+ RSSG KK   +L +EAL+ L+ + +S I+ + PPSPVQ LLGG+ A
Sbjct: 114 SLRRRARRAKELRVRSSGVKKKPNNLTEEALEGLRLVSASPIETEKPPSPVQTLLGGIAA 173

Query: 161 GVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFL 220
           GVIA+ILYKF+TTIEAALNRQTISD+FSVRQITITIRTI+ GLCYLAT VFGIN+VGL L
Sbjct: 174 GVIAVILYKFSTTIEAALNRQTISDSFSVRQITITIRTIITGLCYLATSVFGINAVGLVL 233

Query: 221 YSGQLALNSF 230
           YS QL L S 
Sbjct: 234 YSLQLTLQSI 243


>gi|226494037|ref|NP_001143499.1| uncharacterized protein LOC100276178 [Zea mays]
 gi|195621548|gb|ACG32604.1| hypothetical protein [Zea mays]
 gi|224036011|gb|ACN37081.1| unknown [Zea mays]
 gi|413936044|gb|AFW70595.1| hypothetical protein ZEAMMB73_826587 [Zea mays]
          Length = 285

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 126/208 (60%), Positives = 157/208 (75%), Gaps = 7/208 (3%)

Query: 26  LQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVEL-PQSIFATTDEPSS 84
           +++PT   +    R PL  P   A+++ P  +  A     +GPVEL   ++F+T D P+ 
Sbjct: 32  VRRPTAIRAEAEVRPPL--PVSAAESEPP--DAGAVEAEGQGPVELRAPTLFSTDDNPTP 87

Query: 85  LQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK--SLKDEALDNLKALGSSSI 142
           LQ ATS+LLTGAISVFLFRALRRRA+RAKEL+ RSSG KK  +L +EAL+ L+ + +S I
Sbjct: 88  LQTATSLLLTGAISVFLFRALRRRARRAKELRVRSSGLKKPNNLTEEALERLRLMSASPI 147

Query: 143 DAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNG 202
           + +   SP+QALLGG+ AGVIA+ILYKF+TT+EAALNRQTISD+FSVRQITITIRTIV G
Sbjct: 148 ETEKATSPIQALLGGIAAGVIALILYKFSTTVEAALNRQTISDSFSVRQITITIRTIVTG 207

Query: 203 LCYLATFVFGINSVGLFLYSGQLALNSF 230
           LCYLAT VFGIN+VGL LYS QL L S 
Sbjct: 208 LCYLATSVFGINAVGLVLYSLQLTLQSI 235


>gi|11994500|dbj|BAB02565.1| unnamed protein product [Arabidopsis thaliana]
          Length = 135

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 84/145 (57%), Positives = 103/145 (71%), Gaps = 15/145 (10%)

Query: 131 LDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV 189
           +DNLKAL S+ I+     PS  QA LG + AGVIA+ILYKFT T+E+ LNRQTISDNFSV
Sbjct: 1   MDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYKFTVTVESGLNRQTISDNFSV 60

Query: 190 RQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSS--SSKETENIGEQQ 247
           RQIT+T+RTI+NG+CYLATFVFG+N+ GL LYSGQLA   F EDS+  + K T   G+  
Sbjct: 61  RQITVTVRTIINGICYLATFVFGLNAFGLLLYSGQLA---FNEDSAEENMKATTQPGDSS 117

Query: 248 SGSLNSTAENATDTELNSGKEDQSS 272
           SG          ++E+N   EDQSS
Sbjct: 118 SGD---------NSEVNKSNEDQSS 133


>gi|168035549|ref|XP_001770272.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678489|gb|EDQ64947.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 277

 Score =  144 bits (362), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 105/153 (68%), Gaps = 6/153 (3%)

Query: 66  EGPVELPQSIF--ATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAK 123
           +GP+ELP  +       E S LQVA SV+LTG I+V L R+LRRR+K+AKE +FRS+G  
Sbjct: 88  DGPIELPPELLDPLAIPEASPLQVAASVVLTGLITVLLIRSLRRRSKKAKETRFRSTG-- 145

Query: 124 KSLKDEALDNLKALGSSSIDAKGPP-SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQT 182
             +K++A  +  AL + + + + PP S +Q   G + AG IA++LYKFT T+E +   + 
Sbjct: 146 -EIKEDARKSAMALLNKAPEVETPPPSALQTFSGAVVAGFIALVLYKFTVTVEGSFTGKA 204

Query: 183 ISDNFSVRQITITIRTIVNGLCYLATFVFGINS 215
           +S N+S+R +TIT+RTIV GLCYLATFVF  NS
Sbjct: 205 VSMNYSIRNLTITVRTIVTGLCYLATFVFAANS 237


>gi|302785097|ref|XP_002974320.1| hypothetical protein SELMODRAFT_442467 [Selaginella moellendorffii]
 gi|300157918|gb|EFJ24542.1| hypothetical protein SELMODRAFT_442467 [Selaginella moellendorffii]
          Length = 1433

 Score =  124 bits (311), Expect = 4e-26,   Method: Composition-based stats.
 Identities = 78/179 (43%), Positives = 112/179 (62%), Gaps = 13/179 (7%)

Query: 51   AQEPTTNITAPTTSEEGPVELPQSIFATTDEPSS-LQVATSVLLTGAISVFLFRALRRRA 109
            AQ     I   ++S +GP+ELP  +    ++P+S LQ+A SV+LTGAISV L R+LRRR 
Sbjct: 1244 AQVAEQAIADASSSLDGPIELPGDL----EQPTSPLQIAGSVVLTGAISVLLLRSLRRRY 1299

Query: 110  KRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYK 169
            K AKE        KK LK++A  N+ A  ++ +  K PPS  Q L G L AG IA +LY+
Sbjct: 1300 KEAKE--------KKPLKEQAKQNVLASLANQVVEKPPPSFEQTLYGALLAGAIAFVLYQ 1351

Query: 170  FTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALN 228
             TT +  +   + +   ++ R I +T+RTI++G+CYLATFVF  +++GL LY  QL +N
Sbjct: 1352 CTTAVTGSFASKPLPTEYATRNIAVTVRTIISGICYLATFVFAADAIGLALYGLQLLVN 1410


>gi|145346872|ref|XP_001417906.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578134|gb|ABO96199.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 181

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/152 (35%), Positives = 88/152 (57%), Gaps = 8/152 (5%)

Query: 79  TDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALG 138
           T+E + LQ    VL TG    +  RA+++R+ +A+E  FR +      +   +D  +A  
Sbjct: 9   TEEATPLQNVFGVLFTGFSFWYFARAVQKRSGKARE--FRVANRLPEEERAKIDEERAKK 66

Query: 139 SSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRT 198
           +  + A      +Q+  GGLT   I+++LY F + + A+ + + + ++ +VRQI+IT+RT
Sbjct: 67  TKELTA------MQSFTGGLTGIGISVVLYAFASKVSASFDGKALPESETVRQISITVRT 120

Query: 199 IVNGLCYLATFVFGINSVGLFLYSGQLALNSF 230
           IV GL YLATFV+  N VGL   +GQ  L+ F
Sbjct: 121 IVEGLTYLATFVYAANGVGLIALAGQKTLDYF 152


>gi|159491108|ref|XP_001703515.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280439|gb|EDP06197.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 206

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 52/129 (40%), Positives = 77/129 (59%), Gaps = 10/129 (7%)

Query: 91  VLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK-SLKDEALDNLKALGSSSIDAKGPPS 149
           +LL G   +FLFR  R+RA +A+  K     A   S+ DE  + L A        +G  +
Sbjct: 86  ILLVG---IFLFRLFRKRAAKARSEKIAGQLATGPSIFDELREKLGA------PKRGKAT 136

Query: 150 PVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATF 209
           P+ A LG + A  +A  L+ F+T ++A++  Q++ D ++ R I IT+RTI+ GLCYLATF
Sbjct: 137 PMNAFLGAVQALGLAAGLWFFSTKVQASIEAQSLPDGYTARNIAITVRTIILGLCYLATF 196

Query: 210 VFGINSVGL 218
           +FG N VGL
Sbjct: 197 IFGANGVGL 205


>gi|303283914|ref|XP_003061248.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457599|gb|EEH54898.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 332

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 81/159 (50%), Gaps = 13/159 (8%)

Query: 80  DEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSS-------GAKKSLKDEALD 132
           DE + LQ A     T   + +  R +++R  RAKE +  ++       G     + E  D
Sbjct: 145 DEATPLQNAFGFAFTIFCAWYFVRVVKKRGNRAKEFRVANALPVRVRAGRPSKEERERRD 204

Query: 133 NLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQI 192
             +   S  + AK       A +GG+T   I+  LY+F   I  +   + + +++  RQI
Sbjct: 205 AEQLAKSKKLSAK------DAFVGGVTGLAISFALYQFAVVIGESFEGRLMPESYQARQI 258

Query: 193 TITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFT 231
           TIT+RTIVNGL YLATFV+  N  GL L + + AL+S T
Sbjct: 259 TITVRTIVNGLAYLATFVYAANGTGLVLLAARKALDSLT 297


>gi|302848926|ref|XP_002955994.1| hypothetical protein VOLCADRAFT_121472 [Volvox carteri f.
           nagariensis]
 gi|300258720|gb|EFJ42954.1| hypothetical protein VOLCADRAFT_121472 [Volvox carteri f.
           nagariensis]
          Length = 325

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 53/138 (38%), Positives = 75/138 (54%), Gaps = 10/138 (7%)

Query: 91  VLLTGAISVFLFRALRRRAKRAKELKFRSSG-AKKSLKDEALDNLKALGSSSIDAKGPPS 149
           +LL G   +FLFR  R+RA +AK  +      +  +  DE    + A   S        +
Sbjct: 108 ILLVG---IFLFRLFRKRAAKAKTERIAGQAPSGPTFFDELRQKVGAPKQSKA------T 158

Query: 150 PVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATF 209
           PV A +G   A V+A  L+ FTT ++ ++  Q + D ++ R I IT+RTI+ GLCYLATF
Sbjct: 159 PVNAFIGCGQALVLAYGLWFFTTKVQGSIEGQALPDGYTARNIAITVRTIILGLCYLATF 218

Query: 210 VFGINSVGLFLYSGQLAL 227
           +F  N VGL   S QL L
Sbjct: 219 IFAANGVGLAGLSVQLLL 236


>gi|384247605|gb|EIE21091.1| hypothetical protein COCSUDRAFT_33788 [Coccomyxa subellipsoidea
           C-169]
          Length = 232

 Score = 83.6 bits (205), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 73/134 (54%), Gaps = 16/134 (11%)

Query: 97  ISVFLFRALRRRAKRAKELKFRSS--GAKKSLKDEALDNLKALGSSSIDAKGPPSPVQAL 154
           ++VF  R L +RAK A E +  S+  G  +   D ++  +              +P+Q+L
Sbjct: 84  LAVFAVRLLTKRAKFATEERLASAAKGENERPPDPSIKRV--------------TPLQSL 129

Query: 155 LGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGIN 214
              + AGV A +LY F++ ++    RQ + D ++ R IT+ I TIV GL YL TF+FG N
Sbjct: 130 WAAVQAGVFAYLLYLFSSNVDGYFERQPLPDQYTARNITVAISTIVRGLSYLITFIFGAN 189

Query: 215 SVGLFLYSGQLALN 228
           +VGL   + QL +N
Sbjct: 190 AVGLSALTVQLFIN 203


>gi|255071307|ref|XP_002507735.1| predicted protein [Micromonas sp. RCC299]
 gi|226523010|gb|ACO68993.1| predicted protein [Micromonas sp. RCC299]
          Length = 292

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 91/162 (56%), Gaps = 8/162 (4%)

Query: 81  EPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALGSS 140
           E + LQ A  +  T   + +  R +R+R   AK+ +  +S +K+  +   L+ LK +   
Sbjct: 121 EATPLQNAFGLTFTIFSAWYFLRVVRKRGNAAKKFRVANSLSKEERERRDLEELKRV--K 178

Query: 141 SIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIV 200
            ++A+      Q+ +GG+TA  I+I+LY+F   I+A+ + + +  ++ +RQI+ T+RTIV
Sbjct: 179 KLNAQ------QSFVGGITAIGISIVLYEFAQNIQASFDSRPVPVSYQIRQISNTVRTIV 232

Query: 201 NGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETEN 242
            GL YLATF++  N  GL   S Q  L++     ++  + EN
Sbjct: 233 EGLAYLATFIYAANGTGLVALSMQKLLDTAVPRDTTKTDGEN 274


>gi|302818397|ref|XP_002990872.1| hypothetical protein SELMODRAFT_49523 [Selaginella moellendorffii]
 gi|300141433|gb|EFJ08145.1| hypothetical protein SELMODRAFT_49523 [Selaginella moellendorffii]
          Length = 81

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 53/81 (65%)

Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLA 207
           PS  Q L G L AG IA +LY+ TT +  +   + +   ++ R I +T+RTI++G+CYLA
Sbjct: 1   PSFEQTLYGALLAGAIAFVLYQCTTAVTGSFASKPLPTEYATRNIAVTVRTIISGICYLA 60

Query: 208 TFVFGINSVGLFLYSGQLALN 228
           TFVF  +++GL LY  QL +N
Sbjct: 61  TFVFAADAIGLALYGLQLLVN 81


>gi|412988915|emb|CCO15506.1| predicted protein [Bathycoccus prasinos]
          Length = 203

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 72/138 (52%), Gaps = 8/138 (5%)

Query: 79  TDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALG 138
           T+E + LQ     L T     + +R +++R KRA+   FR +      + +  +  + L 
Sbjct: 73  TEEATPLQNLFGFLFTAFCGWYFWRVVKKRGKRAQ--TFRVANQLSPEELKEREEERKLK 130

Query: 139 SSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRT 198
              ++A       QA  GG+T   I+  LY F + I    +  ++ ++++ RQI+IT+RT
Sbjct: 131 DKKLNAS------QAFTGGITGIAISAGLYIFASKISTGFDNSSLPESYTARQISITVRT 184

Query: 199 IVNGLCYLATFVFGINSV 216
           IV GL YLATFV+  N V
Sbjct: 185 IVEGLVYLATFVYAANGV 202


>gi|307106044|gb|EFN54291.1| expressed protein [Chlorella variabilis]
          Length = 223

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 57/216 (26%), Positives = 96/216 (44%), Gaps = 37/216 (17%)

Query: 79  TDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRS-------SGAKKSLKDEAL 131
           T    +L+    V   G +  F  R  +RRA++A   +  S        G   S  +E  
Sbjct: 8   TGGSDTLKTVAGVAYIGVVIFFFLRLFQRRAQKATSERIASVADVAAYGGGADSDDEEDE 67

Query: 132 DNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISD---NFS 188
           +  KA       A    +P+Q+ +G   AGVI  +L++ +T ++A  +RQ + D    ++
Sbjct: 68  EEQKA---KQQQAATEVTPLQSFIGTAQAGVICYLLFQASTAVDAYFDRQNLPDITTQYT 124

Query: 189 VRQITITIRTIVNGLCYLATFVFGINSVG---------LFLYSG-----------QLALN 228
              + + ++T+  GL YL TF+FG N++G         LF  SG           QL   
Sbjct: 125 AHNVAVLVQTVCRGLVYLITFIFGANALGLAGLTIQLLLFPESGREEGGTGKRAPQLPKV 184

Query: 229 SFTED----SSSSKETENIGEQQSGSLNSTAENATD 260
           S T+D      + +E E +G+QQ+      A+ + D
Sbjct: 185 SVTDDVFALRRAFQEAERMGQQQAQKEIRKAQQSGD 220


>gi|443311517|ref|ZP_21041144.1| Protein of unknown function (DUF3082) [Synechocystis sp. PCC 7509]
 gi|442778396|gb|ELR88662.1| Protein of unknown function (DUF3082) [Synechocystis sp. PCC 7509]
          Length = 106

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 36/95 (37%), Positives = 56/95 (58%), Gaps = 1/95 (1%)

Query: 143 DAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVN 201
           D K  P+P+++ LG + A  +AI LY+ T+ I  +  ++ + SDN  V +ITI +RT+V 
Sbjct: 12  DLKPLPTPLRSSLGAVIASGLAIALYRVTSAISLSFAQKPLHSDNALVVKITIAVRTLVV 71

Query: 202 GLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSS 236
           GL  LATF+F I +  LF  + Q  +    +  SS
Sbjct: 72  GLSTLATFIFAIVAFALFALAIQATIQKLKQSPSS 106


>gi|291567877|dbj|BAI90149.1| hypothetical protein [Arthrospira platensis NIES-39]
          Length = 133

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/92 (40%), Positives = 51/92 (55%), Gaps = 1/92 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLA 207
           +P Q L+GGL AG  AI +Y  T  I      Q I SDN +  Q++I IRT+V G   L 
Sbjct: 42  NPFQVLMGGLLAGGFAIAMYSLTAAIAQTFANQNIYSDNQTALQLSIAIRTLVIGGAALV 101

Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSSSSKE 239
           T VF I S+GL   + Q+     T  +S+S++
Sbjct: 102 TGVFTIASLGLLALTVQVLFVRLTHKNSASQK 133


>gi|409994073|ref|ZP_11277194.1| hypothetical protein APPUASWS_23188 [Arthrospira platensis str.
           Paraca]
 gi|409935065|gb|EKN76608.1| hypothetical protein APPUASWS_23188 [Arthrospira platensis str.
           Paraca]
          Length = 122

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/91 (40%), Positives = 50/91 (54%), Gaps = 1/91 (1%)

Query: 150 PVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLAT 208
           P Q L+GGL AG  AI +Y  T  I      Q I SDN +  Q++I IRT+V G   L T
Sbjct: 32  PFQVLMGGLLAGGFAIAMYSLTAAIAQTFANQNIYSDNQTALQLSIAIRTLVIGGAALVT 91

Query: 209 FVFGINSVGLFLYSGQLALNSFTEDSSSSKE 239
            VF I S+GL   + Q+     T  +S+S++
Sbjct: 92  GVFTIASLGLLALTVQVLFVRLTHKNSASQK 122


>gi|209526258|ref|ZP_03274788.1| hypothetical protein AmaxDRAFT_3612 [Arthrospira maxima CS-328]
 gi|376007925|ref|ZP_09785107.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423062823|ref|ZP_17051613.1| hypothetical protein SPLC1_S051400 [Arthrospira platensis C1]
 gi|209493355|gb|EDZ93680.1| hypothetical protein AmaxDRAFT_3612 [Arthrospira maxima CS-328]
 gi|375323718|emb|CCE20860.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406715779|gb|EKD10932.1| hypothetical protein SPLC1_S051400 [Arthrospira platensis C1]
          Length = 122

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/91 (39%), Positives = 51/91 (56%), Gaps = 1/91 (1%)

Query: 150 PVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLAT 208
           P Q L+GGL +G  AI +Y  T  I      Q I SDN +  QI+I IRT+V G   L T
Sbjct: 32  PFQVLMGGLFSGGFAIAMYSLTAAIAHTFANQNIYSDNRTALQISIAIRTLVIGGAALVT 91

Query: 209 FVFGINSVGLFLYSGQLALNSFTEDSSSSKE 239
            +F I S+GL   + Q+     T+ +S+S++
Sbjct: 92  GIFTIASLGLLALTVQVLFVRLTQKNSASEK 122


>gi|428226471|ref|YP_007110568.1| hypothetical protein GEI7407_3046 [Geitlerinema sp. PCC 7407]
 gi|427986372|gb|AFY67516.1| hypothetical protein GEI7407_3046 [Geitlerinema sp. PCC 7407]
          Length = 108

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 45/77 (58%), Gaps = 1/77 (1%)

Query: 143 DAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVN 201
           +   PP  +Q L   L AG IA+++YK T  I A      + SDN++ ++I   +RT+V 
Sbjct: 10  ETNAPPGVLQCLTSALIAGTIALLMYKVTGGIAANFASHPLHSDNYTAQRIASAVRTLVI 69

Query: 202 GLCYLATFVFGINSVGL 218
           G+C L + +FG  ++GL
Sbjct: 70  GMCALGSGIFGFAALGL 86


>gi|428208447|ref|YP_007092800.1| hypothetical protein Chro_3473 [Chroococcidiopsis thermalis PCC
           7203]
 gi|428010368|gb|AFY88931.1| hypothetical protein Chro_3473 [Chroococcidiopsis thermalis PCC
           7203]
          Length = 101

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 48/86 (55%), Gaps = 1/86 (1%)

Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYL 206
           P+P++ L G + +  IA   Y  T  I     R+ I SDNF+V  I   +RT+V G+  L
Sbjct: 12  PTPLRCLSGAVISAGIAFAAYSLTAAIAQTFARKPIHSDNFTVVNIAAAVRTLVVGIMAL 71

Query: 207 ATFVFGINSVGLFLYSGQLALNSFTE 232
            T +FGI ++GL   + QL++   T+
Sbjct: 72  GTGIFGIVAIGLVALAIQLSIQKLTK 97


>gi|443324810|ref|ZP_21053537.1| Protein of unknown function (DUF3082) [Xenococcus sp. PCC 7305]
 gi|442795591|gb|ELS04951.1| Protein of unknown function (DUF3082) [Xenococcus sp. PCC 7305]
          Length = 120

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/93 (34%), Positives = 53/93 (56%), Gaps = 1/93 (1%)

Query: 145 KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQ-TISDNFSVRQITITIRTIVNGL 203
           K   +P++ LL  + AGV+A  +Y   + I    + +  +SDN  V ++TI +RT+V G+
Sbjct: 28  KQEVTPLKCLLSSVFAGVMAFGVYHLMSAIVTTYSTKPIVSDNIFVLKLTIAVRTLVIGV 87

Query: 204 CYLATFVFGINSVGLFLYSGQLALNSFTEDSSS 236
             L T VFG  ++GLFL   Q+    F++  S+
Sbjct: 88  AALGTGVFGFVALGLFLLGLQITFKDFSKKFSN 120


>gi|428203436|ref|YP_007082025.1| hypothetical protein Ple7327_3245 [Pleurocapsa sp. PCC 7327]
 gi|427980868|gb|AFY78468.1| Protein of unknown function (DUF3082) [Pleurocapsa sp. PCC 7327]
          Length = 114

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 60/103 (58%), Gaps = 4/103 (3%)

Query: 140 SSIDA-KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDN--FSVRQITITI 196
           SS++A +   +P++ LLG + +G +AI +Y  T +I      + I+     ++R I+  +
Sbjct: 12  SSVEANEQKVTPLRCLLGSVLSGSLAIAMYFLTYSIAYTFATKPITSTSQLAIR-ISAAV 70

Query: 197 RTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKE 239
           RT+V G+  L TF+FG  ++GL L + QL L  F E  SSS +
Sbjct: 71  RTLVVGVASLGTFIFGFVTLGLILLAIQLTLQIFKEKLSSSSD 113


>gi|427728680|ref|YP_007074917.1| hypothetical protein Nos7524_1442 [Nostoc sp. PCC 7524]
 gi|427364599|gb|AFY47320.1| Protein of unknown function (DUF3082) [Nostoc sp. PCC 7524]
          Length = 108

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 49/91 (53%), Gaps = 1/91 (1%)

Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCY 205
           PP+P++ + G L +G +   LY     I     ++ I SDN  V  +T  +RT+V G+  
Sbjct: 17  PPTPLRCVTGALISGGLGYALYSLMIAIATTFAKKPIHSDNQLVVNLTSAVRTLVVGIVA 76

Query: 206 LATFVFGINSVGLFLYSGQLALNSFTEDSSS 236
           L T +FGI ++GLF  + QL   + T+  S 
Sbjct: 77  LGTGIFGIVAIGLFALAIQLLFQNLTKSKSG 107


>gi|427737778|ref|YP_007057322.1| hypothetical protein Riv7116_4348 [Rivularia sp. PCC 7116]
 gi|427372819|gb|AFY56775.1| Protein of unknown function (DUF3082) [Rivularia sp. PCC 7116]
          Length = 109

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 44/85 (51%), Gaps = 1/85 (1%)

Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYL 206
           P+P + + G L  G +   +YK   +I     N+   SDN  V  I+  +RT+V G+  L
Sbjct: 18  PTPTRCITGALMCGGLGFAMYKLMVSIATTFANKPVTSDNPMVVSISSAVRTLVVGVVAL 77

Query: 207 ATFVFGINSVGLFLYSGQLALNSFT 231
            T +FGI  VGLF  + QL +   T
Sbjct: 78  GTGIFGIVGVGLFALAIQLVIKQLT 102


>gi|428304799|ref|YP_007141624.1| hypothetical protein Cri9333_1213 [Crinalium epipsammum PCC 9333]
 gi|428246334|gb|AFZ12114.1| hypothetical protein Cri9333_1213 [Crinalium epipsammum PCC 9333]
          Length = 115

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 52/89 (58%), Gaps = 1/89 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV-RQITITIRTIVNGLCYLA 207
           SP++ L G + +G +AI LY  T++I  +   + ++ + +V   I + +RT+V G   LA
Sbjct: 23  SPLRCLTGAMISGGMAIALYSLTSSIAQSFAAKPVTSSSTVAHNIAVAVRTLVTGTAALA 82

Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSSS 236
           T VF + +VGL   + Q  +  FT+ SS+
Sbjct: 83  TGVFSLVTVGLVALAIQTVIQRFTKPSST 111


>gi|428315435|ref|YP_007113317.1| hypothetical protein Osc7112_0273 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428239115|gb|AFZ04901.1| hypothetical protein Osc7112_0273 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 115

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 55/97 (56%), Gaps = 3/97 (3%)

Query: 139 SSSIDAKGPPSP--VQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITIT 195
           +S+ DA  P SP  ++ L G L A   A+ LY  T +I  +   + + SDN +  +I++ 
Sbjct: 15  TSAEDALPPKSPTVLRCLTGSLMAASFAVPLYALTMSIAQSFASKPVHSDNITAIKISVA 74

Query: 196 IRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTE 232
           +RT+V G+  L T +F + +VGL L + Q+ + +  +
Sbjct: 75  VRTLVVGIAALGTGIFALAAVGLMLLAVQILIQNLVK 111


>gi|254422677|ref|ZP_05036395.1| hypothetical protein S7335_2829 [Synechococcus sp. PCC 7335]
 gi|196190166|gb|EDX85130.1| hypothetical protein S7335_2829 [Synechococcus sp. PCC 7335]
          Length = 118

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 52/96 (54%), Gaps = 4/96 (4%)

Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCY 205
           P +P++   G L A  ++++ Y+ T +I A   ++  +SDN     I+  +RT+V G+  
Sbjct: 26  PATPLRCFTGALMAATLSLLAYRLTISIAATFASKPIVSDNPVAVNISAAVRTLVTGIAA 85

Query: 206 LATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETE 241
           L   VFG+ ++GL     QL    F  D+ SS+ T+
Sbjct: 86  LGAGVFGVAALGLTALGIQLI---FKPDAQSSEATD 118


>gi|16331459|ref|NP_442187.1| hypothetical protein slr0438 [Synechocystis sp. PCC 6803]
 gi|383323200|ref|YP_005384054.1| hypothetical protein SYNGTI_2292 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383326369|ref|YP_005387223.1| hypothetical protein SYNPCCP_2291 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383492253|ref|YP_005409930.1| hypothetical protein SYNPCCN_2291 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384437521|ref|YP_005652246.1| hypothetical protein SYNGTS_2293 [Synechocystis sp. PCC 6803]
 gi|451815611|ref|YP_007452063.1| hypothetical protein MYO_123170 [Synechocystis sp. PCC 6803]
 gi|1001118|dbj|BAA10257.1| slr0438 [Synechocystis sp. PCC 6803]
 gi|339274554|dbj|BAK51041.1| hypothetical protein SYNGTS_2293 [Synechocystis sp. PCC 6803]
 gi|359272520|dbj|BAL30039.1| hypothetical protein SYNGTI_2292 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359275690|dbj|BAL33208.1| hypothetical protein SYNPCCN_2291 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359278860|dbj|BAL36377.1| hypothetical protein SYNPCCP_2291 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407961156|dbj|BAM54396.1| hypothetical protein BEST7613_5465 [Bacillus subtilis BEST7613]
 gi|451781580|gb|AGF52549.1| hypothetical protein MYO_123170 [Synechocystis sp. PCC 6803]
          Length = 107

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 47/83 (56%), Gaps = 1/83 (1%)

Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCY 205
           PP+P++ L+G   +  +A  LY  T+ I  +  ++  +SD   V++I   +RT+V GL  
Sbjct: 15  PPTPLRCLVGSGISATLAWGLYLLTSAIAVSFASKPVLSDKPIVQRIASAVRTLVLGLAS 74

Query: 206 LATFVFGINSVGLFLYSGQLALN 228
           + TF+F   ++GL L   QL L 
Sbjct: 75  MGTFIFAFVAIGLILLMVQLILQ 97


>gi|22299383|ref|NP_682630.1| hypothetical protein tsr1840 [Thermosynechococcus elongatus BP-1]
 gi|22295566|dbj|BAC09392.1| tsr1840 [Thermosynechococcus elongatus BP-1]
          Length = 97

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 46/85 (54%), Gaps = 1/85 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFS-VRQITITIRTIVNGLCYLA 207
           SP++ L G L AG + ++LY+ T  I        +S +   V  + + +RT+V GLC LA
Sbjct: 9   SPLRCLTGALIAGTLGLLLYRLTGAIAYLFATHAVSAHHQLVYSLAVAVRTLVVGLCTLA 68

Query: 208 TFVFGINSVGLFLYSGQLALNSFTE 232
           T VF I ++GL   + QL    + +
Sbjct: 69  TGVFSIIALGLVALTLQLLWERWVQ 93


>gi|119485099|ref|ZP_01619484.1| hypothetical protein L8106_06614 [Lyngbya sp. PCC 8106]
 gi|119457327|gb|EAW38452.1| hypothetical protein L8106_06614 [Lyngbya sp. PCC 8106]
          Length = 113

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 49/93 (52%), Gaps = 1/93 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLA 207
           +P + L      G IA  LY  T ++      + I SDN +V Q+++T+RT+V GLC + 
Sbjct: 21  TPFRCLSNAAVLGSIATALYFLTRSVAQTYAAKPIHSDNLAVVQMSVTVRTLVTGLCMMG 80

Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSSSSKET 240
           T VF   + GL   + QL++    +  + S ET
Sbjct: 81  TGVFAFLTCGLIFVAIQLSIQRIFKQDNPSSET 113


>gi|428297938|ref|YP_007136244.1| hypothetical protein Cal6303_1211 [Calothrix sp. PCC 6303]
 gi|428234482|gb|AFZ00272.1| hypothetical protein Cal6303_1211 [Calothrix sp. PCC 6303]
          Length = 101

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 47/89 (52%), Gaps = 1/89 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLA 207
           SP++   G L +G +   LY    +I      Q I SDN  V +IT+ +RT++ G+  + 
Sbjct: 11  SPLRCFTGSLISGGMGYALYNVMVSIATKFANQPIHSDNEIVVRITVAVRTLIVGVFGMG 70

Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSSS 236
             +FG+ ++GLF    QL +  FT   SS
Sbjct: 71  MGIFGLVALGLFALGIQLGIEKFTNKESS 99


>gi|427419539|ref|ZP_18909722.1| Protein of unknown function (DUF3082) [Leptolyngbya sp. PCC 7375]
 gi|425762252|gb|EKV03105.1| Protein of unknown function (DUF3082) [Leptolyngbya sp. PCC 7375]
          Length = 112

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 44/84 (52%), Gaps = 1/84 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQ-TISDNFSVRQITITIRTIVNGLCYLA 207
           SP+    G   AG ++++ Y+    I  A   +  +SD   V  I++ +RT+V G+  L 
Sbjct: 23  SPLSCFSGSFLAGTLSVLCYRMMVAIVTAFAAKPVLSDKTIVLNISVAVRTLVAGIVALG 82

Query: 208 TFVFGINSVGLFLYSGQLALNSFT 231
             VFG+ +VGLFL   QL +   T
Sbjct: 83  AGVFGLTAVGLFLLGIQLIVRKMT 106


>gi|428219444|ref|YP_007103909.1| hypothetical protein Pse7367_3236 [Pseudanabaena sp. PCC 7367]
 gi|427991226|gb|AFY71481.1| hypothetical protein Pse7367_3236 [Pseudanabaena sp. PCC 7367]
          Length = 133

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/85 (37%), Positives = 48/85 (56%), Gaps = 5/85 (5%)

Query: 145 KGPPSP----VQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTI 199
           K PPS     +Q L G L AG IA+ LY FT ++   L +  I S N    +++ T+RT 
Sbjct: 38  KAPPSTMSMVIQNLSGVLMAGAIAVGLYFFTNSVAVKLAQNPITSSNTLAMRVSTTVRTF 97

Query: 200 VNGLCYLATFVFGINSVGLFLYSGQ 224
           +  L   AT +FG+ ++G+FL + Q
Sbjct: 98  LLALGTGATMIFGVVALGIFLLTIQ 122


>gi|170077404|ref|YP_001734042.1| hypothetical protein SYNPCC7002_A0781 [Synechococcus sp. PCC 7002]
 gi|169885073|gb|ACA98786.1| conserved hypothetical membrane protein [Synechococcus sp. PCC
           7002]
          Length = 105

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 8/98 (8%)

Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLA 207
           PSP++   G   AG ++   Y  T  + A    +  + N    QI IT+RT++ GL  +A
Sbjct: 15  PSPLRCWYGTAVAGSMSFAAYLLTNKVIAGFASKPPTGNGLAIQIGITVRTLIMGLFTMA 74

Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSSSSKETENIGE 245
           T VFG+ +VGLF+    L L S      S K+T + G+
Sbjct: 75  TGVFGLIAVGLFI----LGLQSLL----SGKQTTSPGD 104


>gi|434398956|ref|YP_007132960.1| hypothetical protein Sta7437_2460 [Stanieria cyanosphaera PCC 7437]
 gi|428270053|gb|AFZ35994.1| hypothetical protein Sta7437_2460 [Stanieria cyanosphaera PCC 7437]
          Length = 110

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 48/88 (54%), Gaps = 1/88 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTI-EAALNRQTISDNFSVRQITITIRTIVNGLCYLA 207
           +P++   G + +G +A  LY   ++I +    +  ++DN  V +IT+ +RT+V G+  L 
Sbjct: 23  TPIRCFFGTVVSGGLAFGLYSLMSSIVQTYATKPVVADNLLVLRITVAVRTLVIGVAALG 82

Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSS 235
             VF   ++GL L   QL + S  E SS
Sbjct: 83  AGVFAFVALGLMLLGIQLTIQSLKEVSS 110


>gi|119511390|ref|ZP_01630502.1| hypothetical protein N9414_19894 [Nodularia spumigena CCY9414]
 gi|119463935|gb|EAW44860.1| hypothetical protein N9414_19894 [Nodularia spumigena CCY9414]
          Length = 106

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 48/91 (52%), Gaps = 1/91 (1%)

Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCY 205
           PP+P++ + G + AG +   +Y    +I      + I SDN  V  I   +RT+V G+  
Sbjct: 16  PPTPLRCITGAVFAGGLGYAMYSLMISIATTFAAKPIHSDNPMVVNIGSAVRTLVVGVVA 75

Query: 206 LATFVFGINSVGLFLYSGQLALNSFTEDSSS 236
           L T +FGI ++GL   + QL +   T+  +S
Sbjct: 76  LGTGIFGIVAIGLLALAIQLVVQQLTKPKNS 106


>gi|56751369|ref|YP_172070.1| hypothetical protein syc1360_d [Synechococcus elongatus PCC 6301]
 gi|81298956|ref|YP_399164.1| hypothetical protein Synpcc7942_0145 [Synechococcus elongatus PCC
           7942]
 gi|56686328|dbj|BAD79550.1| hypothetical protein [Synechococcus elongatus PCC 6301]
 gi|81167837|gb|ABB56177.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 107

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 1/90 (1%)

Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCY 205
           PP+P+    G   +G IA+  Y  T +I ++   + + + N   + I + +RT++ G+  
Sbjct: 18  PPTPLSCWSGSAVSGAIAVGAYFLTVSIASSFAAKPLPTGNAFAQNIAVAVRTLIVGMGA 77

Query: 206 LATFVFGINSVGLFLYSGQLALNSFTEDSS 235
           LAT +FGI ++GL L   Q+ L    +  S
Sbjct: 78  LATGIFGIVALGLLLLGIQVGLQGLRKAQS 107


>gi|428310658|ref|YP_007121635.1| hypothetical protein Mic7113_2427 [Microcoleus sp. PCC 7113]
 gi|428252270|gb|AFZ18229.1| Protein of unknown function (DUF3082) [Microcoleus sp. PCC 7113]
          Length = 115

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/84 (40%), Positives = 48/84 (57%), Gaps = 6/84 (7%)

Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYL 206
           PSP++ L G L +G  AI LY  T++I     N+   S N +  QI+I +RT+V G+  L
Sbjct: 22  PSPLKCLSGALISGGFAIALYFLTSSIGNTFANKPLPSSNQTAIQISIAVRTLVVGISTL 81

Query: 207 ATFVFGINSVGL-----FLYSGQL 225
           AT +F I  VGL     +L+  QL
Sbjct: 82  ATALFTIIGVGLMGVTIYLFVQQL 105


>gi|332712237|ref|ZP_08432165.1| hypothetical protein LYNGBM3L_71070 [Moorea producens 3L]
 gi|332349043|gb|EGJ28655.1| hypothetical protein LYNGBM3L_71070 [Moorea producens 3L]
          Length = 108

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 43/72 (59%), Gaps = 1/72 (1%)

Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYL 206
           PSP + L G L +G +   +Y  T +I     N+   S++ +  +I I +RT+V G+  L
Sbjct: 18  PSPWRCLTGALISGGLTSAIYFITASIATTFANKPLTSNSQTALKIAIAVRTLVVGVSTL 77

Query: 207 ATFVFGINSVGL 218
           ATF+FGI ++GL
Sbjct: 78  ATFIFGITTIGL 89


>gi|359462355|ref|ZP_09250918.1| hypothetical protein ACCM5_26742 [Acaryochloris sp. CCMEE 5410]
          Length = 115

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 54/100 (54%), Gaps = 5/100 (5%)

Query: 139 SSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIR 197
           S S +   PPS  + L G + AG IA  L K ++ I  +       SDN +  +I I ++
Sbjct: 4   SPSTNKPTPPSIWRTLSGAVVAGTIAFPLAKLSSKIAQSFAEHPFTSDNQTAVKIAIALK 63

Query: 198 TIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSS 237
           T+V GL  LAT +F + ++GL    G L++  F ++ +SS
Sbjct: 64  TLVVGLSTLATGIFCMAALGL----GALSIQIFWQNLTSS 99


>gi|158338025|ref|YP_001519201.1| hypothetical protein AM1_4912 [Acaryochloris marina MBIC11017]
 gi|158308266|gb|ABW29883.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 115

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 54/100 (54%), Gaps = 5/100 (5%)

Query: 139 SSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIR 197
           S S +   PPS  + L G + AG IA  L K ++ I  +       SDN +  +I I ++
Sbjct: 4   SPSTNKPTPPSIWRTLSGAVVAGTIAFPLAKLSSKIAQSFAEHPFTSDNQTAVKIAIALK 63

Query: 198 TIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSS 237
           T+V GL  LAT +F + ++GL    G L++  F ++ +SS
Sbjct: 64  TLVVGLSTLATGIFCMAALGL----GALSIQIFWQNLTSS 99


>gi|428214090|ref|YP_007087234.1| hypothetical protein Oscil6304_3756 [Oscillatoria acuminata PCC
           6304]
 gi|428002471|gb|AFY83314.1| Protein of unknown function (DUF3082) [Oscillatoria acuminata PCC
           6304]
          Length = 109

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 1/88 (1%)

Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYL 206
           P+ ++  +G L +G  A      T+ I +   N+   SDN     I+  +RT+V G+  L
Sbjct: 22  PTLMRCFMGSLISGGFAYACMMLTSAIASTFANKPIHSDNPIAVNISAAVRTLVTGVSTL 81

Query: 207 ATFVFGINSVGLFLYSGQLALNSFTEDS 234
           ATFVF   ++GLF    QL    FT+ S
Sbjct: 82  ATFVFAFAALGLFCLGLQLLFQKFTKKS 109


>gi|440680351|ref|YP_007155146.1| hypothetical protein Anacy_0644 [Anabaena cylindrica PCC 7122]
 gi|428677470|gb|AFZ56236.1| hypothetical protein Anacy_0644 [Anabaena cylindrica PCC 7122]
          Length = 109

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 7/105 (6%)

Query: 141 SIDAKG--PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIR 197
           S D  G  PP+P++ + G + +G +   +Y     I      + I SDN    +I+  +R
Sbjct: 9   STDTSGQVPPTPLRCITGAIMSGGLGFAMYSLMIAIATTFAAKPIHSDNLLAIKISAAVR 68

Query: 198 TIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETEN 242
           T+V G+  L   +FGI  +GL   + QL     T+     KET +
Sbjct: 69  TLVVGIVALGAGIFGIVGIGLLALAIQLVFQQLTKP----KETNS 109


>gi|434387822|ref|YP_007098433.1| Protein of unknown function (DUF3082) [Chamaesiphon minutus PCC
           6605]
 gi|428018812|gb|AFY94906.1| Protein of unknown function (DUF3082) [Chamaesiphon minutus PCC
           6605]
          Length = 98

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 45/83 (54%), Gaps = 3/83 (3%)

Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI--SDNFSVRQITITIRTIVNGLCY 205
           PSP++ L G L AG ++I +Y  T+ I      + +    N +V  I   +RT+V G   
Sbjct: 10  PSPLRCLTGSLIAGAMSIAMYMMTSKIATTFATKPMVQKTNMAV-NIATALRTLVVGSTA 68

Query: 206 LATFVFGINSVGLFLYSGQLALN 228
           L   VFGI ++GL L + Q+A N
Sbjct: 69  LGAAVFGIIAIGLILLAIQVAFN 91


>gi|218441368|ref|YP_002379697.1| hypothetical protein PCC7424_4465 [Cyanothece sp. PCC 7424]
 gi|218174096|gb|ACK72829.1| hypothetical protein PCC7424_4465 [Cyanothece sp. PCC 7424]
          Length = 115

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 46/96 (47%), Gaps = 1/96 (1%)

Query: 145 KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGL 203
           K   +P + LLG   AG++   +Y  T  I     N+   S N  V  I+  +RT+V G+
Sbjct: 19  KNEVTPQKCLLGAAMAGILGTGMYYLTYAIGTTFANKPITSSNQLVINISSAVRTLVVGV 78

Query: 204 CYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKE 239
             LATF+F   S GL L + QL L        SS +
Sbjct: 79  ASLATFIFAFVSFGLVLLAIQLTLQGIKGRLKSSSD 114


>gi|334120141|ref|ZP_08494223.1| hypothetical protein MicvaDRAFT_4862 [Microcoleus vaginatus FGP-2]
 gi|333456929|gb|EGK85556.1| hypothetical protein MicvaDRAFT_4862 [Microcoleus vaginatus FGP-2]
          Length = 117

 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 47/93 (50%), Gaps = 1/93 (1%)

Query: 134 LKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQI 192
           L + G ++  +  PP+ ++ ++G L AG  A  LY  T +I        I SDN +   I
Sbjct: 14  LSSAGDATSASSTPPTILRCVMGSLIAGGFATALYALTLSIVQTFANSPIHSDNVTAINI 73

Query: 193 TITIRTIVNGLCYLATFVFGINSVGLFLYSGQL 225
              +RT+V G+  L   VF   S+GL L + QL
Sbjct: 74  GAAVRTLVMGIVALGAGVFAFASLGLMLLAVQL 106


>gi|428780609|ref|YP_007172395.1| hypothetical protein Dacsa_2441 [Dactylococcopsis salina PCC 8305]
 gi|428694888|gb|AFZ51038.1| Protein of unknown function (DUF3082) [Dactylococcopsis salina PCC
           8305]
          Length = 95

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 46/85 (54%), Gaps = 1/85 (1%)

Query: 153 ALLGGLTAGVIAIILYKFTTTIEAALNRQTISD-NFSVRQITITIRTIVNGLCYLATFVF 211
           ALLG + AG ++  LY+ T  I A    Q + D N +   I + +RT+V G+  LAT +F
Sbjct: 11  ALLGSVIAGGLSFPLYRLTIAIIARYAEQPLPDTNQTAATIAVAVRTLVMGMGVLATAIF 70

Query: 212 GINSVGLFLYSGQLALNSFTEDSSS 236
            + +VGLFL + Q  +    +   S
Sbjct: 71  ILVAVGLFLLAVQTGIQWLQDQQQS 95


>gi|254415378|ref|ZP_05029139.1| hypothetical protein MC7420_3302 [Coleofasciculus chthonoplastes
           PCC 7420]
 gi|196177853|gb|EDX72856.1| hypothetical protein MC7420_3302 [Coleofasciculus chthonoplastes
           PCC 7420]
          Length = 105

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 48/85 (56%), Gaps = 1/85 (1%)

Query: 151 VQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYLATF 209
           ++ L+  L +G +AI  Y  TT I  +L N+   S N +   I I +RT+V G+  LAT 
Sbjct: 20  LRCLISALISGSLAIAGYWLTTAIAQSLANKPLPSGNVTAINIAIAVRTLVVGISTLATA 79

Query: 210 VFGINSVGLFLYSGQLALNSFTEDS 234
           VF + +VGL   + QL +  FT+ +
Sbjct: 80  VFSLVAVGLVALAIQLTVQRFTQGT 104


>gi|307150620|ref|YP_003886004.1| hypothetical protein Cyan7822_0697 [Cyanothece sp. PCC 7822]
 gi|306980848|gb|ADN12729.1| conserved hypothetical protein [Cyanothece sp. PCC 7822]
          Length = 113

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 49/91 (53%), Gaps = 1/91 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLA 207
           +P++ LLG   +  +   +Y  T +I +   ++ I S N  V  I   +RT+V G+  LA
Sbjct: 23  TPLRCLLGAAISATLGTGMYYLTYSIGSTFAQKPITSSNQLVINIGSAVRTLVVGVASLA 82

Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSSSSK 238
           TF+F   S GL L + QL L +  +  SSS+
Sbjct: 83  TFIFAFVSFGLVLLAIQLTLQAIKQRLSSSE 113


>gi|434393091|ref|YP_007128038.1| hypothetical protein Glo7428_2367 [Gloeocapsa sp. PCC 7428]
 gi|428264932|gb|AFZ30878.1| hypothetical protein Glo7428_2367 [Gloeocapsa sp. PCC 7428]
          Length = 104

 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 49/88 (55%), Gaps = 1/88 (1%)

Query: 151 VQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLATF 209
           ++ L G L +G +AI LY  T+ I      + I SDN +V  I   +RT+V G+  L T 
Sbjct: 17  LRCLTGSLISGGLAIALYSLTSAIAQTFAAKPIHSDNPAVINIASAVRTLVVGITALGTG 76

Query: 210 VFGINSVGLFLYSGQLALNSFTEDSSSS 237
           +FG+ ++GL   + QL +   T+ SS S
Sbjct: 77  IFGLVALGLIGLAIQLLIQQATKKSSPS 104


>gi|428774211|ref|YP_007165999.1| hypothetical protein Cyast_2404 [Cyanobacterium stanieri PCC 7202]
 gi|428688490|gb|AFZ48350.1| hypothetical protein Cyast_2404 [Cyanobacterium stanieri PCC 7202]
          Length = 114

 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 47/85 (55%), Gaps = 7/85 (8%)

Query: 138 GSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFS---VRQITI 194
            S SID   P +P +   GG  +G++AI  Y  T ++   L    +  NF+     +I  
Sbjct: 17  ASESIDK--PITPTRCFTGGAISGLLAIAAYLLTKSV--VLTYADVPINFNNPMAARIAS 72

Query: 195 TIRTIVNGLCYLATFVFGINSVGLF 219
           T+RT+V G+  +ATFVF + ++GLF
Sbjct: 73  TVRTLVMGITTMATFVFVMVTIGLF 97


>gi|411117762|ref|ZP_11390143.1| Protein of unknown function (DUF3082) [Oscillatoriales
           cyanobacterium JSC-12]
 gi|410711486|gb|EKQ68992.1| Protein of unknown function (DUF3082) [Oscillatoriales
           cyanobacterium JSC-12]
          Length = 127

 Score = 45.4 bits (106), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 143 DAKGPP-SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIV 200
           D+  PP +P++ L+G + A V+A  +Y  T  I  +     I S +  V++I+  +RT+V
Sbjct: 24  DSDNPPITPLRCLIGSMLATVLAYAMYLMTHAIAQSFAAHKIQSTSLIVQRISSAVRTLV 83

Query: 201 NGLCYLATFVFGINSV 216
            G+  L T VFGI ++
Sbjct: 84  VGMTTLGTGVFGIAAI 99


>gi|284929157|ref|YP_003421679.1| hypothetical protein UCYN_05970 [cyanobacterium UCYN-A]
 gi|284809616|gb|ADB95321.1| hypothetical protein UCYN_05970 [cyanobacterium UCYN-A]
          Length = 113

 Score = 43.9 bits (102), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 45/81 (55%), Gaps = 1/81 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALN-RQTISDNFSVRQITITIRTIVNGLCYLA 207
           +P   L+G   AG++   LY  T++I  + N +  IS N  + ++   ++T+V G+  LA
Sbjct: 28  NPFSCLIGSFLAGILGYGLYMLTSSIIHSFNAKPIISSNLLIIKLGSLVKTLVMGVASLA 87

Query: 208 TFVFGINSVGLFLYSGQLALN 228
           TF+    S GL L + QL LN
Sbjct: 88  TFMCFFISFGLILLACQLFLN 108


>gi|218246460|ref|YP_002371831.1| hypothetical protein PCC8801_1624 [Cyanothece sp. PCC 8801]
 gi|257059503|ref|YP_003137391.1| hypothetical protein Cyan8802_1651 [Cyanothece sp. PCC 8802]
 gi|218166938|gb|ACK65675.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
 gi|256589669|gb|ACV00556.1| conserved hypothetical protein [Cyanothece sp. PCC 8802]
          Length = 104

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 47/83 (56%), Gaps = 1/83 (1%)

Query: 145 KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGL 203
           K   +P++ L+G   +G +AI LY  T +I  +   + + S+N  V +I   +RT+V G+
Sbjct: 16  KEKVTPLRCLIGSTISGTLAIALYSLTASIFHSFATKPLTSNNALVLRIGSLVRTLVMGV 75

Query: 204 CYLATFVFGINSVGLFLYSGQLA 226
             L TF+F   + GL L + QL+
Sbjct: 76  ASLGTFIFAFVAFGLILLAIQLS 98


>gi|443318284|ref|ZP_21047540.1| Protein of unknown function (DUF3082) [Leptolyngbya sp. PCC 6406]
 gi|442782090|gb|ELR92174.1| Protein of unknown function (DUF3082) [Leptolyngbya sp. PCC 6406]
          Length = 133

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 47/89 (52%), Gaps = 1/89 (1%)

Query: 152 QALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYLATFV 210
           +A LG   A  +A++LY+    I     N+   SDN +V  ++  +RT+V G+  + T V
Sbjct: 42  RAFLGSGIATTLALLLYRMLAGIALTFANKPVHSDNVTVVNLSAAVRTLVLGVVAMGTGV 101

Query: 211 FGINSVGLFLYSGQLALNSFTEDSSSSKE 239
           FGI ++GL L   Q+     T  S ++ E
Sbjct: 102 FGIAALGLLLLGLQMVWQRLTGKSPATPE 130


>gi|427725819|ref|YP_007073096.1| hypothetical protein Lepto7376_4133 [Leptolyngbya sp. PCC 7376]
 gi|427357539|gb|AFY40262.1| hypothetical protein Lepto7376_4133 [Leptolyngbya sp. PCC 7376]
          Length = 109

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 37/70 (52%)

Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLA 207
           PSP++   G   AG ++   Y  T  +         + N    +I IT+RT++ G+C +A
Sbjct: 18  PSPLRCWTGTAVAGSMSFAAYLVTRNVIINFANNPPTGNTVAMRIAITVRTLLMGICTMA 77

Query: 208 TFVFGINSVG 217
           T VFG+ ++G
Sbjct: 78  TAVFGMIAIG 87


>gi|300864421|ref|ZP_07109292.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300337565|emb|CBN54440.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 113

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYLA 207
           S ++ + G + AG  A  LY  T +I     N+ T SDN +V +I   +RT+V G+  L 
Sbjct: 21  SILRCITGSMIAGAFASGLYLLTASIAQTFANKPTHSDNITVIKIASAVRTLVVGMAALG 80

Query: 208 TFVFGINSVGLFLYSGQL 225
           T +F + ++GL   + QL
Sbjct: 81  TGIFALAALGLMGLAVQL 98


>gi|427714245|ref|YP_007062869.1| hypothetical protein Syn6312_3283 [Synechococcus sp. PCC 6312]
 gi|427378374|gb|AFY62326.1| Protein of unknown function (DUF3082) [Synechococcus sp. PCC 6312]
          Length = 100

 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 21/66 (31%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQ-ITITIRTIVNGLCYLA 207
           +P++ L G L AG + I+LY+ T  I ++     +     + Q +++ IRT+V GL  +A
Sbjct: 9   TPLRCLTGALVAGTLGILLYRLTQAIASSFATHPLHTQSQIAQSLSVAIRTLVVGLSTMA 68

Query: 208 TFVFGI 213
           T +F +
Sbjct: 69  TGIFAL 74


>gi|116074646|ref|ZP_01471907.1| hypothetical protein RS9916_28969 [Synechococcus sp. RS9916]
 gi|116067868|gb|EAU73621.1| hypothetical protein RS9916_28969 [Synechococcus sp. RS9916]
          Length = 124

 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 38/72 (52%), Gaps = 1/72 (1%)

Query: 150 PVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQ-ITITIRTIVNGLCYLAT 208
           P+  L G  T+ V+A   + F+  +       +   + S+ Q I   ++T++ G+C+LAT
Sbjct: 38  PLSFLSGSATSLVLAWFSWGFSKRMVLYFAEHSTHFSSSIAQSIATALKTLMTGMCFLAT 97

Query: 209 FVFGINSVGLFL 220
           F F    +GLFL
Sbjct: 98  FSFAFIGLGLFL 109


>gi|126660189|ref|ZP_01731307.1| hypothetical protein CY0110_08921 [Cyanothece sp. CCY0110]
 gi|126618554|gb|EAZ89305.1| hypothetical protein CY0110_08921 [Cyanothece sp. CCY0110]
          Length = 105

 Score = 41.2 bits (95), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 1/79 (1%)

Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTI-EAALNRQTISDNFSVRQITITIRTIVNGLCYL 206
           P+P   L+G   +G +   LY  T +I +    +   S N  V +I   +RT+V G+  L
Sbjct: 17  PTPANCLIGATISGALGYALYTLTASIIQTFATKPLTSSNTLVLRIGSLVRTLVMGVASL 76

Query: 207 ATFVFGINSVGLFLYSGQL 225
            TF+F   + GL L + QL
Sbjct: 77  GTFIFFFVAFGLILLAIQL 95


>gi|354565157|ref|ZP_08984332.1| hypothetical protein FJSC11DRAFT_0538 [Fischerella sp. JSC-11]
 gi|353549116|gb|EHC18558.1| hypothetical protein FJSC11DRAFT_0538 [Fischerella sp. JSC-11]
          Length = 101

 Score = 41.2 bits (95), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 46/82 (56%), Gaps = 1/82 (1%)

Query: 145 KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGL 203
           +   +P+++L+G + +G +A   Y   + I  +   + + SDN  V +I+  +RT+V G+
Sbjct: 10  QAKATPLRSLIGAVISGSLAFAAYFLMSAIATSFAAKPLHSDNLIVLRISSAVRTLVLGI 69

Query: 204 CYLATFVFGINSVGLFLYSGQL 225
             L + VF I ++GL   + QL
Sbjct: 70  AALGSGVFAIVAIGLVALAIQL 91


>gi|186685003|ref|YP_001868199.1| hypothetical protein Npun_R4910 [Nostoc punctiforme PCC 73102]
 gi|186467455|gb|ACC83256.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
          Length = 109

 Score = 41.2 bits (95), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 46/89 (51%), Gaps = 1/89 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLA 207
           SP++ L+G + +G +   LY     I      + + S N  V +I+  +RT+V G+  L 
Sbjct: 19  SPLRCLIGAVISGGMGYALYSLMIAIATNFAAKPLHSINPLVIKISSAVRTLVVGVVALG 78

Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSSS 236
           + +FGI ++GL     QL +   ++  SS
Sbjct: 79  SGIFGIVAIGLLALGVQLLMQQLSKQKSS 107


>gi|282899537|ref|ZP_06307501.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
 gi|281195416|gb|EFA70349.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
          Length = 100

 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 1/76 (1%)

Query: 143 DAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVN 201
           D   PP+P++ + G   +G +A  +Y     I      + I SDN  V  I   +RT+V 
Sbjct: 10  DTNAPPTPLRCITGATISGGLAFAMYSLMIAIATTFASKPIHSDNQIVINIASAVRTLVV 69

Query: 202 GLCYLATFVFGINSVG 217
           G+  L   +F I ++G
Sbjct: 70  GVVALGMGIFSIVAIG 85


>gi|342318945|gb|EGU10900.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
          Length = 1616

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 49/98 (50%), Gaps = 18/98 (18%)

Query: 17  SHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVELPQSIF 76
           + HP K+   + P +S S F RRRP   P+C          IT+PT S         S+ 
Sbjct: 224 TRHPPKV---RPPRLSPSLFARRRPSFIPHCRC--------ITSPTPSHSHRTTPCNSLL 272

Query: 77  ATTDEPSSLQVATSVLLTGAISVFL-FRALRRRAKRAK 113
                P  L++A  VL+ GA++V L  + LRRR +RA+
Sbjct: 273 -----PMLLRLAV-VLVYGAVAVSLPLQPLRRRVRRAE 304


>gi|394988291|ref|ZP_10381129.1| hypothetical protein SCD_00693 [Sulfuricella denitrificans skB26]
 gi|393792749|dbj|GAB70768.1| hypothetical protein SCD_00693 [Sulfuricella denitrificans skB26]
          Length = 196

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 31/117 (26%), Positives = 56/117 (47%), Gaps = 7/117 (5%)

Query: 64  SEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAK 123
           ++E  +E+ + IFA  D    L++A+    TG  +V+  RAL     +  EL    +G +
Sbjct: 11  NQEPILEVLKEIFA--DRQRVLEIASG---TGQHAVYFGRALPHLTWQTSELAQNHAGIQ 65

Query: 124 KSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
             L D  L N+  L   +ID   P  P++++     A  + I+ ++   T+ A + R
Sbjct: 66  AWLDDSLLPNV--LPPVAIDVSAPKWPIESVDAVFNANTVHIVAWQEVETMFAGIAR 120


>gi|331671448|ref|ZP_08372246.1| pentapeptide repeats (8 copies) (3 repeats) [Escherichia coli
           TA280]
 gi|331071293|gb|EGI42650.1| pentapeptide repeats (8 copies) (3 repeats) [Escherichia coli
           TA280]
          Length = 704

 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 2/82 (2%)

Query: 193 TITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETENIGEQQSGSLN 252
           TI I TI   +C+    V  ++    +LY+  +  N    DS+  + +++I +Q S S +
Sbjct: 571 TIFIDTIFKNVCFFNAKVNNVSLKNAYLYNDNI--NKKANDSAEKQASDSIEKQASDSTD 628

Query: 253 STAENATDTELNSGKEDQSSDS 274
             A N+TD + N   + Q++DS
Sbjct: 629 KQANNSTDKQANDSTDKQANDS 650


>gi|17230723|ref|NP_487271.1| hypothetical protein alr3231 [Nostoc sp. PCC 7120]
 gi|17132326|dbj|BAB74930.1| alr3231 [Nostoc sp. PCC 7120]
          Length = 105

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 1/72 (1%)

Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCY 205
           PP+P++ + G   +G +   LY    +I     ++ I SDN  V  +T  +RT+V G+  
Sbjct: 15  PPTPLRCITGAGISGGMGYALYLLMISIATTFAKKPIHSDNQLVISLTSAVRTLVVGIVA 74

Query: 206 LATFVFGINSVG 217
           L T +F I ++G
Sbjct: 75  LGTGIFAIVTIG 86


>gi|434403925|ref|YP_007146810.1| Protein of unknown function (DUF3082) [Cylindrospermum stagnale PCC
           7417]
 gi|428258180|gb|AFZ24130.1| Protein of unknown function (DUF3082) [Cylindrospermum stagnale PCC
           7417]
          Length = 109

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 37/72 (51%), Gaps = 1/72 (1%)

Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCY 205
           PP+P++ + G   +G +   +Y     I      + I SDN  V  I   +RT+V G+  
Sbjct: 17  PPTPLRCVTGAAFSGGLGYAMYTLMIAIATTFATKPIRSDNPMVVNIGSAVRTLVVGVVA 76

Query: 206 LATFVFGINSVG 217
           L T +FG+ +VG
Sbjct: 77  LGTGIFGLVAVG 88


>gi|75911084|ref|YP_325380.1| hypothetical protein Ava_4888 [Anabaena variabilis ATCC 29413]
 gi|75704809|gb|ABA24485.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
          Length = 105

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 1/72 (1%)

Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCY 205
           PP+P++ + G   +G +   LY    +I     ++ I SDN  V  +T  +RT+V G+  
Sbjct: 15  PPTPLRCITGAGISGGMGYALYLLMISIATTFAKKPIHSDNQLVISLTSAVRTLVVGIVA 74

Query: 206 LATFVFGINSVG 217
           L T +F I ++G
Sbjct: 75  LGTGIFAIVTIG 86


>gi|220906290|ref|YP_002481601.1| hypothetical protein Cyan7425_0854 [Cyanothece sp. PCC 7425]
 gi|219862901|gb|ACL43240.1| hypothetical protein Cyan7425_0854 [Cyanothece sp. PCC 7425]
          Length = 108

 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 48/83 (57%), Gaps = 1/83 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV-RQITITIRTIVNGLCYLA 207
           +P + L G + A ++A++LY+ TT I  +     IS +  +   +++ +RT+V GL  +A
Sbjct: 19  TPWKCLRGSVLAAILALLLYRLTTAIAQSFAAHPISTHNQITYNLSVAVRTLVVGLSTMA 78

Query: 208 TFVFGINSVGLFLYSGQLALNSF 230
           T +F I ++GL   + QL +  F
Sbjct: 79  TAIFAIAALGLVGLAIQLVVQKF 101


>gi|87302770|ref|ZP_01085581.1| hypothetical protein WH5701_13495 [Synechococcus sp. WH 5701]
 gi|87282653|gb|EAQ74611.1| hypothetical protein WH5701_13495 [Synechococcus sp. WH 5701]
          Length = 104

 Score = 38.5 bits (88), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 5/78 (6%)

Query: 147 PP--SPVQALLGGLTAGVIAIILYKFTTTIEA--ALNRQTISDNFSVRQITITIRTIVNG 202
           PP   P+  L G LT+ ++A +    +  +    +L+    S  F+ + I   ++T+V G
Sbjct: 13  PPRKGPLSFLSGALTSALLAWVCLGLSQRVVGWYSLHPPHYSKAFA-QSIGTAMKTLVVG 71

Query: 203 LCYLATFVFGINSVGLFL 220
           + +LATF FG  ++GLFL
Sbjct: 72  MSFLATFTFGFVALGLFL 89


>gi|427709929|ref|YP_007052306.1| hypothetical protein Nos7107_4628 [Nostoc sp. PCC 7107]
 gi|427362434|gb|AFY45156.1| hypothetical protein Nos7107_4628 [Nostoc sp. PCC 7107]
          Length = 105

 Score = 38.5 bits (88), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 24/85 (28%), Positives = 44/85 (51%), Gaps = 1/85 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLA 207
           SP++ + G + +  +A  LY     I  +   + + S N  V +IT  +RT+V G+  L 
Sbjct: 17  SPLRCVTGAIISSGMAYALYSLMIAIATSFASKPVHSHNELVIKITSAVRTLVVGIVALG 76

Query: 208 TFVFGINSVGLFLYSGQLALNSFTE 232
           + +FGI ++GL     QL +   ++
Sbjct: 77  SGIFGIVAIGLIALGVQLLIQQLSQ 101


>gi|113475976|ref|YP_722037.1| hypothetical protein Tery_2344 [Trichodesmium erythraeum IMS101]
 gi|110167024|gb|ABG51564.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 109

 Score = 38.5 bits (88), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 23/86 (26%), Positives = 43/86 (50%), Gaps = 1/86 (1%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLA 207
           +P+Q L   + +G +A + Y    ++      + I SDNF V  ++  +RT+V G+  L 
Sbjct: 17  NPLQCLASAVISGSLATVTYSLMQSVAKTFASKPITSDNFLVVNLSGAVRTLVVGVIALV 76

Query: 208 TFVFGINSVGLFLYSGQLALNSFTED 233
             +  I ++GL   + Q  ++ FT  
Sbjct: 77  ACICAIVTLGLIALAIQTIIHRFTNK 102


>gi|116794213|gb|ABK27047.1| unknown [Picea sitchensis]
          Length = 321

 Score = 37.7 bits (86), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 21/75 (28%), Positives = 37/75 (49%)

Query: 152 QALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVF 211
           QA +G + A    I+ +K +T +++         N +   +  ++R  +  LCY  T + 
Sbjct: 243 QAYIGSIIALAFGILSWKLSTGVQSVPKDSLQYANENALLLGNSLRGTLLALCYSCTLLS 302

Query: 212 GINSVGLFLYSGQLA 226
           G+  VGLFL   QL+
Sbjct: 303 GVTMVGLFLIGRQLS 317


>gi|428221170|ref|YP_007105340.1| hypothetical protein Syn7502_01096 [Synechococcus sp. PCC 7502]
 gi|427994510|gb|AFY73205.1| Protein of unknown function (DUF3082) [Synechococcus sp. PCC 7502]
          Length = 106

 Score = 37.7 bits (86), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 45/92 (48%), Gaps = 19/92 (20%)

Query: 143 DAKGPP--------SPVQALLGGLTAGVIAIILYKFTTTIEAAL------NRQTISDNFS 188
           + K PP        SP+  ++G + + ++A  LY FT  +   L      N QT++D   
Sbjct: 5   NPKSPPNSQSQENISPISGIIGAVISAILAYGLYNFTLGVARKLALSPFQNAQTLAD--- 61

Query: 189 VRQITITIRTIVNGLCYLATFVFGINSVGLFL 220
             +I   IRTI+  +    T +FG+ ++GL L
Sbjct: 62  --RIGAAIRTILLAMGSGITIIFGVIAIGLVL 91


>gi|282896424|ref|ZP_06304445.1| conserved hypothetical protein [Raphidiopsis brookii D9]
 gi|281198712|gb|EFA73592.1| conserved hypothetical protein [Raphidiopsis brookii D9]
          Length = 100

 Score = 37.4 bits (85), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 1/72 (1%)

Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCY 205
           PP+P++ + G   +G +A  +Y     I      + I SDN  V  I   +RT+V G+  
Sbjct: 14  PPTPLRCITGATISGGLAFAMYSLMIAIATTFASKPIHSDNQIVINIASAVRTLVVGVVA 73

Query: 206 LATFVFGINSVG 217
           L   +F I ++G
Sbjct: 74  LGMGIFSIVTIG 85


>gi|113954978|ref|YP_730797.1| hypothetical protein sync_1592 [Synechococcus sp. CC9311]
 gi|113882329|gb|ABI47287.1| Uncharacterized membrane protein [Synechococcus sp. CC9311]
          Length = 105

 Score = 37.4 bits (85), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 150 PVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYLAT 208
           P+  L G LT+ +++ + ++ +T +     NR +   + + + I   ++T++ G+C+LAT
Sbjct: 19  PISFLSGSLTSLLMSWLSFRLSTGMVTYFANRPSTFSSATAQSIASALKTLLIGMCFLAT 78

Query: 209 FVFGINSVGLFL 220
           F      +GLFL
Sbjct: 79  FSCAFVGIGLFL 90


>gi|427701773|ref|YP_007044995.1| hypothetical protein Cyagr_0462 [Cyanobium gracile PCC 6307]
 gi|427344941|gb|AFY27654.1| Protein of unknown function (DUF3082) [Cyanobium gracile PCC 6307]
          Length = 100

 Score = 37.4 bits (85), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 40/77 (51%), Gaps = 3/77 (3%)

Query: 147 PP--SPVQALLGGLTAGVIAIILYKFTT-TIEAALNRQTISDNFSVRQITITIRTIVNGL 203
           PP   P+  L G +T+G++A +    +   +   +      D+   + I   ++T++ G+
Sbjct: 9   PPRKGPLSFLSGAITSGLLAWLSLGVSQRVVGWYVTHPPHFDSAFAQSIATAMKTLMVGM 68

Query: 204 CYLATFVFGINSVGLFL 220
           C+LATF F    +GLFL
Sbjct: 69  CFLATFTFAFIGLGLFL 85


>gi|428768855|ref|YP_007160645.1| hypothetical protein Cyan10605_0459 [Cyanobacterium aponinum PCC
           10605]
 gi|428683134|gb|AFZ52601.1| hypothetical protein Cyan10605_0459 [Cyanobacterium aponinum PCC
           10605]
          Length = 109

 Score = 37.0 bits (84), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 149 SPVQALLGGLTAGVIAIILYKFT-TTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLA 207
           +P++ + G   +G +A+  Y  T + I   +N     +N     I  T+RT++ G+  +A
Sbjct: 22  TPLRCMTGSAISGGLAVAAYLLTKSVILTYVNMPIKFNNPLAANIASTVRTLIMGVTTMA 81

Query: 208 TFVFGINSVGLFLYSGQLALNSFTED 233
           TF+F + +VGLF     L +  F E+
Sbjct: 82  TFLFLMVAVGLF----ALGVKRFIEE 103


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.128    0.349 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,684,161,189
Number of Sequences: 23463169
Number of extensions: 138269352
Number of successful extensions: 433430
Number of sequences better than 100.0: 145
Number of HSP's better than 100.0 without gapping: 63
Number of HSP's successfully gapped in prelim test: 82
Number of HSP's that attempted gapping in prelim test: 433167
Number of HSP's gapped (non-prelim): 249
length of query: 276
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 136
effective length of database: 9,074,351,707
effective search space: 1234111832152
effective search space used: 1234111832152
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 76 (33.9 bits)