BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 024212
         (271 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224129560|ref|XP_002320616.1| predicted protein [Populus trichocarpa]
 gi|222861389|gb|EEE98931.1| predicted protein [Populus trichocarpa]
          Length = 300

 Score =  327 bits (839), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 158/208 (75%), Positives = 186/208 (89%), Gaps = 2/208 (0%)

Query: 65  SNITCSATDRSDP--EISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGA 122
           S ITCSA ++  P  E+SS A+IR+EVLSPFR+VRMFFYLAF+ASG+LGGLIA T+LI A
Sbjct: 30  STITCSAANKPSPSTEVSSTAKIRSEVLSPFRTVRMFFYLAFLASGALGGLIATTRLIAA 89

Query: 123 LANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDE 182
           LANP+RAA+VPEILKGL ID+ A ++FAFLY RENKAKN QLARLSREE+LSNLKLRVDE
Sbjct: 90  LANPSRAAEVPEILKGLGIDIGAAAIFAFLYYRENKAKNVQLARLSREENLSNLKLRVDE 149

Query: 183 KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF 242
           KKIISV++LRG+ARLVIC+GPA +I+ESFK SEPFT+SLL+RGVLVV F ++GN+PSFEF
Sbjct: 150 KKIISVSSLRGVARLVICAGPAPFILESFKLSEPFTQSLLDRGVLVVPFASNGNLPSFEF 209

Query: 243 EESEEMKEFTRKRKRLWQLTPVYVSEWS 270
           +E EEMKE T KRKRLWQL P+YVSEWS
Sbjct: 210 DEGEEMKELTTKRKRLWQLAPIYVSEWS 237


>gi|356548113|ref|XP_003542448.1| PREDICTED: uncharacterized protein LOC100799768 [Glycine max]
          Length = 326

 Score =  309 bits (791), Expect = 9e-82,   Method: Compositional matrix adjust.
 Identities = 159/219 (72%), Positives = 190/219 (86%), Gaps = 5/219 (2%)

Query: 55  TWTAKRSPLCSNITCSATDR--SDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGG 112
           +++ +R+   + I CSA ++  S  EISS A+IR+EVLSPFRSVRMFFY+AFVASG+LGG
Sbjct: 49  SYSHQRASSFATIVCSAANKPSSSSEISSTAKIRSEVLSPFRSVRMFFYIAFVASGALGG 108

Query: 113 LIAITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREES 172
            IA TQLIGALAN +RA++VPEILKGL ID++AVSLFAFLY RENKAKNAQ+ARLSREES
Sbjct: 109 FIATTQLIGALANSSRASEVPEILKGLGIDLAAVSLFAFLYFRENKAKNAQVARLSREES 168

Query: 173 LSNLKLRVDEKKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFT 232
           LSNLKLRVDEKKII VN+LRGIARLVIC+GPAS++ ESFKRSEPFT+SLL+RGVLVV F 
Sbjct: 169 LSNLKLRVDEKKIIPVNSLRGIARLVICAGPASFVTESFKRSEPFTDSLLDRGVLVVPFV 228

Query: 233 TDGNIPSFEFEESEEMKEFTRKRKRLWQLTPVYVSEWSK 271
           TDGN P+ EFEE+EE+     +RKRLWQL PVY++EWS+
Sbjct: 229 TDGNSPALEFEETEEL---ATRRKRLWQLAPVYINEWSE 264


>gi|359480082|ref|XP_002267644.2| PREDICTED: uncharacterized protein LOC100253269 [Vitis vinifera]
          Length = 328

 Score =  308 bits (789), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 148/189 (78%), Positives = 170/189 (89%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           A+IR+EVLSPFR+VRMFFYLAF+ASG+LGGLIA TQLI AL N +RA  VP+ILKGL ID
Sbjct: 77  AKIRSEVLSPFRTVRMFFYLAFIASGALGGLIATTQLIAALTNSSRAPLVPDILKGLGID 136

Query: 143 VSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSG 202
           + AV++FAFLYSRE+ AKNAQLARL+REESLSNLKLRVDEKK+ISVN+LRGIARLVIC+G
Sbjct: 137 IGAVAIFAFLYSRESSAKNAQLARLTREESLSNLKLRVDEKKVISVNDLRGIARLVICAG 196

Query: 203 PASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFTRKRKRLWQLT 262
           PA +I ESFK S+PFT+ LL+RGVLVV F TDG +PSFE+EESEEMK+ T KRKRLWQL 
Sbjct: 197 PAPFIAESFKLSQPFTQGLLDRGVLVVPFVTDGKLPSFEYEESEEMKDITTKRKRLWQLV 256

Query: 263 PVYVSEWSK 271
           PVYVSEWSK
Sbjct: 257 PVYVSEWSK 265


>gi|297744261|emb|CBI37231.3| unnamed protein product [Vitis vinifera]
          Length = 346

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 148/189 (78%), Positives = 170/189 (89%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           A+IR+EVLSPFR+VRMFFYLAF+ASG+LGGLIA TQLI AL N +RA  VP+ILKGL ID
Sbjct: 95  AKIRSEVLSPFRTVRMFFYLAFIASGALGGLIATTQLIAALTNSSRAPLVPDILKGLGID 154

Query: 143 VSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSG 202
           + AV++FAFLYSRE+ AKNAQLARL+REESLSNLKLRVDEKK+ISVN+LRGIARLVIC+G
Sbjct: 155 IGAVAIFAFLYSRESSAKNAQLARLTREESLSNLKLRVDEKKVISVNDLRGIARLVICAG 214

Query: 203 PASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFTRKRKRLWQLT 262
           PA +I ESFK S+PFT+ LL+RGVLVV F TDG +PSFE+EESEEMK+ T KRKRLWQL 
Sbjct: 215 PAPFIAESFKLSQPFTQGLLDRGVLVVPFVTDGKLPSFEYEESEEMKDITTKRKRLWQLV 274

Query: 263 PVYVSEWSK 271
           PVYVSEWSK
Sbjct: 275 PVYVSEWSK 283


>gi|388514469|gb|AFK45296.1| unknown [Lotus japonicus]
          Length = 334

 Score =  304 bits (779), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 159/245 (64%), Positives = 187/245 (76%), Gaps = 5/245 (2%)

Query: 32  RQILNKYRS--RTTPGLSISLPLHSTWTAKRSPLCSNITCSATDRSDPEISS---AARIR 86
           R ++N +R+      G+S    L  T    R    + I CSA+++  P  S     ARIR
Sbjct: 28  RNLINNHRNFFSGNKGVSYGHNLFYTDPRTRRASLATILCSASNKPSPSSSEISSTARIR 87

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
            EVLSPFRS+RMFFY+AF+AS SLG  IA TQLIGALAN +RA+ VPEILKGL ID+ AV
Sbjct: 88  TEVLSPFRSLRMFFYVAFIASASLGAFIAATQLIGALANSSRASQVPEILKGLGIDIGAV 147

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
           S+FAFLY R+NKAKNAQ ARLSREE LSNLKLRVDEKKII V++LRGIARLVIC+GP S+
Sbjct: 148 SIFAFLYFRDNKAKNAQEARLSREEFLSNLKLRVDEKKIIPVSSLRGIARLVICAGPESF 207

Query: 207 IMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFTRKRKRLWQLTPVYV 266
           I ESFKRSEPFTE L++RGVLVV F TDGN P  EFEE+EEMK+   +RKRLWQLTPVY+
Sbjct: 208 ITESFKRSEPFTEGLMDRGVLVVPFVTDGNSPDLEFEETEEMKQLATRRKRLWQLTPVYI 267

Query: 267 SEWSK 271
           +EWS 
Sbjct: 268 TEWSN 272


>gi|449468602|ref|XP_004152010.1| PREDICTED: uncharacterized protein LOC101206246 [Cucumis sativus]
          Length = 338

 Score =  298 bits (764), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 146/206 (70%), Positives = 175/206 (84%), Gaps = 2/206 (0%)

Query: 65  SNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALA 124
           +N  CSA D+  PEISS A+IR+EVLSPFRSVRMFFYL F+ASG+LGGLIA TQL+GALA
Sbjct: 71  ANFICSAADK--PEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLGALA 128

Query: 125 NPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKK 184
           N +RA +VP+IL+GL +D  AV+LFAFLY REN AKNAQLARLSREESLSNLKLRVD+ K
Sbjct: 129 NSSRADEVPDILEGLGVDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNK 188

Query: 185 IISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEE 244
           +I ++ LRGIARLVIC+GP S+I+E+FK SEPFTE LLERGVLVV   TD    +FEF++
Sbjct: 189 VIPISILRGIARLVICAGPESFIIEAFKSSEPFTERLLERGVLVVPLATDVTTLNFEFDD 248

Query: 245 SEEMKEFTRKRKRLWQLTPVYVSEWS 270
            EE+K+ T KRKRLW+LTPVY++EWS
Sbjct: 249 REEVKDITTKRKRLWRLTPVYMTEWS 274


>gi|255560289|ref|XP_002521162.1| conserved hypothetical protein [Ricinus communis]
 gi|223539731|gb|EEF41313.1| conserved hypothetical protein [Ricinus communis]
          Length = 331

 Score =  286 bits (732), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 161/209 (77%), Positives = 184/209 (88%), Gaps = 2/209 (0%)

Query: 65  SNITCSATDRSDP--EISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGA 122
           S+I CSA ++  P  +ISS A+IR+EVLSPFRSVRMFFYLAF+ASG LG LIA TQLI A
Sbjct: 60  SSIACSAANKPSPPTDISSTAKIRSEVLSPFRSVRMFFYLAFIASGGLGALIATTQLIAA 119

Query: 123 LANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDE 182
           LANPARAA+VPEILKGL ID+ AVS+FAFLY REN AK AQ+ARLSREESLSNLKLRVD+
Sbjct: 120 LANPARAAEVPEILKGLGIDIGAVSIFAFLYYRENNAKQAQVARLSREESLSNLKLRVDD 179

Query: 183 KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF 242
           KKIIS+++LRGIARLVIC+GP SY++ESF+ SEPFT+SLL+RGVLVV F TDGN P+FEF
Sbjct: 180 KKIISISSLRGIARLVICAGPGSYVLESFRLSEPFTQSLLDRGVLVVPFATDGNFPNFEF 239

Query: 243 EESEEMKEFTRKRKRLWQLTPVYVSEWSK 271
           EESEEMKE T KRKRLWQL P +VSEWSK
Sbjct: 240 EESEEMKEITSKRKRLWQLNPTFVSEWSK 268


>gi|297803184|ref|XP_002869476.1| hypothetical protein ARALYDRAFT_913640 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297315312|gb|EFH45735.1| hypothetical protein ARALYDRAFT_913640 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 328

 Score =  283 bits (723), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 148/249 (59%), Positives = 188/249 (75%), Gaps = 12/249 (4%)

Query: 25  YHASRFCRQILNKYRSRTTPGLSISLPLHSTWTAKRSPLCSNITCSATDRSDPEISSAAR 84
           Y   RF R   ++ R   +P L+I        T K  PL  N  C A D    EIS+ AR
Sbjct: 28  YSPRRFSRHN-HRERLDFSPRLTI--------TGKHQPLSFNTVCFAADEPS-EISADAR 77

Query: 85  IRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVS 144
           IR+EVLSPFRSVRMFFYLAF+ASGSLGGLIA ++LIGALANPAR+ +V EI+KGL ID+ 
Sbjct: 78  IRSEVLSPFRSVRMFFYLAFIASGSLGGLIATSRLIGALANPARSGEVLEIVKGLGIDIG 137

Query: 145 AVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDE-KKIISVNNLRGIARLVICSGP 203
           A SLFAFLY  ENK KNAQ+ARLSREE+L+ LK+RV+E  K+ISV +LRGIARLVIC+GP
Sbjct: 138 AASLFAFLYFNENKTKNAQMARLSREENLAKLKMRVEENNKVISVGDLRGIARLVICAGP 197

Query: 204 ASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEM-KEFTRKRKRLWQLT 262
           A +I E+F RS+ FT+ L+ERGV+VV++ TDGN P  EF+E++ + +E +++RKRLW++T
Sbjct: 198 AEFIEEAFNRSKEFTQGLVERGVVVVAYATDGNSPVLEFDEADIVDEEISQRRKRLWRVT 257

Query: 263 PVYVSEWSK 271
           PV+V EW K
Sbjct: 258 PVFVPEWEK 266


>gi|18417257|ref|NP_567817.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332660136|gb|AEE85536.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 347

 Score =  280 bits (717), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 138/219 (63%), Positives = 177/219 (80%), Gaps = 3/219 (1%)

Query: 55  TWTAKRSPLCSNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLI 114
           T T K+ PL  N  C A D    EIS+ ARIR+EVLSPFRSVRMFFYLAF+ASGSLGGLI
Sbjct: 68  TITRKQQPLSFNTVCFAADEPS-EISADARIRSEVLSPFRSVRMFFYLAFIASGSLGGLI 126

Query: 115 AITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLS 174
           A ++LIGALANPAR+ +V EI+KGL +D+ A SLFAFLY  ENK KNAQ+ARLSREE+L 
Sbjct: 127 ATSRLIGALANPARSGEVLEIVKGLGVDIGAASLFAFLYFNENKTKNAQMARLSREENLG 186

Query: 175 NLKLRVDE-KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTT 233
            LK+RV+E  K+ISV +LRG+ARLVIC+GPA +I E+FKRS+ +T+ L+ERGV+VV++ T
Sbjct: 187 KLKMRVEENNKVISVGDLRGVARLVICAGPAEFIEEAFKRSKEYTQGLVERGVVVVAYAT 246

Query: 234 DGNIPSFEFEESE-EMKEFTRKRKRLWQLTPVYVSEWSK 271
           DGN P  EF+E++   +E +++RK+LW++TPV+V EW K
Sbjct: 247 DGNSPVLEFDETDIADEEMSQRRKKLWRVTPVFVPEWEK 285


>gi|4218124|emb|CAA22978.1| hypothetical protein [Arabidopsis thaliana]
 gi|7269729|emb|CAB81462.1| hypothetical protein [Arabidopsis thaliana]
 gi|21593096|gb|AAM65045.1| unknown [Arabidopsis thaliana]
          Length = 331

 Score =  280 bits (717), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 138/219 (63%), Positives = 177/219 (80%), Gaps = 3/219 (1%)

Query: 55  TWTAKRSPLCSNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLI 114
           T T K+ PL  N  C A D    EIS+ ARIR+EVLSPFRSVRMFFYLAF+ASGSLGGLI
Sbjct: 52  TITRKQQPLSFNTVCFAADEPS-EISADARIRSEVLSPFRSVRMFFYLAFIASGSLGGLI 110

Query: 115 AITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLS 174
           A ++LIGALANPAR+ +V EI+KGL +D+ A SLFAFLY  ENK KNAQ+ARLSREE+L 
Sbjct: 111 ATSRLIGALANPARSGEVLEIVKGLGVDIGAASLFAFLYFNENKTKNAQMARLSREENLG 170

Query: 175 NLKLRVDE-KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTT 233
            LK+RV+E  K+ISV +LRG+ARLVIC+GPA +I E+FKRS+ +T+ L+ERGV+VV++ T
Sbjct: 171 KLKMRVEENNKVISVGDLRGVARLVICAGPAEFIEEAFKRSKEYTQGLVERGVVVVAYAT 230

Query: 234 DGNIPSFEFEESE-EMKEFTRKRKRLWQLTPVYVSEWSK 271
           DGN P  EF+E++   +E +++RK+LW++TPV+V EW K
Sbjct: 231 DGNSPVLEFDETDIADEEMSQRRKKLWRVTPVFVPEWEK 269


>gi|449530478|ref|XP_004172222.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101223919, partial [Cucumis sativus]
          Length = 254

 Score =  277 bits (709), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 135/190 (71%), Positives = 161/190 (84%)

Query: 81  SAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLS 140
           S A+IR+EVLSPFRSVRMFFYL F+ASG+LGGLIA TQL+GALAN +RA +VP+IL+GL 
Sbjct: 1   STAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLGALANSSRADEVPDILEGLG 60

Query: 141 IDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVIC 200
           ID  AV+LFAFLY REN AKNAQLARLSREESLSNLKLRVD+ K+I ++ LRGIARLVIC
Sbjct: 61  IDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVIPISILRGIARLVIC 120

Query: 201 SGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFTRKRKRLWQ 260
           +GP S+I+E+FK SEPFTE LLERG   V   TD    +FEF++ EE+K+ T KRKRLW+
Sbjct: 121 AGPESFIIEAFKSSEPFTERLLERGGFFVPLATDVTTLNFEFDDREEVKDITTKRKRLWR 180

Query: 261 LTPVYVSEWS 270
           LTPVY++EWS
Sbjct: 181 LTPVYMTEWS 190


>gi|15081662|gb|AAK82486.1| AT4g28740/F16A16_150 [Arabidopsis thaliana]
 gi|20147175|gb|AAM10304.1| AT4g28740/F16A16_150 [Arabidopsis thaliana]
          Length = 347

 Score =  276 bits (705), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 136/219 (62%), Positives = 175/219 (79%), Gaps = 3/219 (1%)

Query: 55  TWTAKRSPLCSNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLI 114
           T T K+ PL  N  C A D    EIS+ ARIR+EVLSPFRSVRMFFYLAF+ASGSLG LI
Sbjct: 68  TITRKQQPLSFNTVCFAADEPS-EISADARIRSEVLSPFRSVRMFFYLAFIASGSLGELI 126

Query: 115 AITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLS 174
           A ++LIG LANPAR+ +V EI+KGL +D+ A SLFAFLY  ENK KNAQ+ARLSREE+L 
Sbjct: 127 ATSRLIGTLANPARSGEVLEIVKGLGVDIGAASLFAFLYFNENKTKNAQMARLSREENLG 186

Query: 175 NLKLRVDE-KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTT 233
            LK+RV+E  K+ISV +LRG+ARLVIC+GPA +I E+FKRS+ +T+ L+ERGV+VV++ T
Sbjct: 187 KLKMRVEENNKVISVGDLRGVARLVICAGPAEFIEEAFKRSKEYTQGLVERGVVVVAYAT 246

Query: 234 DGNIPSFEFEESE-EMKEFTRKRKRLWQLTPVYVSEWSK 271
           DGN P  EF+E++   +E +++RK+LW++TPV+V EW K
Sbjct: 247 DGNSPVLEFDETDIADEEMSQRRKKLWRVTPVFVPEWEK 285


>gi|326520740|dbj|BAJ92733.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 334

 Score =  256 bits (653), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 128/198 (64%), Positives = 155/198 (78%), Gaps = 6/198 (3%)

Query: 80  SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGL 139
           SS A+IR+EVLSPFRSVRMFFYLAF+AS  LG LIA+TQL+ AL NPARAA VPE LKGL
Sbjct: 75  SSMAKIRSEVLSPFRSVRMFFYLAFMASAGLGSLIALTQLLPALGNPARAAGVPETLKGL 134

Query: 140 SIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRV-DEKKIISVNNLRGIARLV 198
            IDV+AV++FAFLYSR+ KA +AQ+ARL+REE+LS LKLRV D  +++ ++ LRG ARLV
Sbjct: 135 GIDVAAVAVFAFLYSRDRKASDAQVARLTREENLSRLKLRVGDGGRVVPLSELRGSARLV 194

Query: 199 ICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEES-----EEMKEFTR 253
           I +GPA ++ ESF+RS PF   L+ERGVLV+ F TDGN P+ EF E       E  E  R
Sbjct: 195 IVAGPAEFVAESFRRSRPFLRDLMERGVLVLPFPTDGNAPALEFGEDADGGEAEDDEVAR 254

Query: 254 KRKRLWQLTPVYVSEWSK 271
           K +RLWQLTPVY SEW+K
Sbjct: 255 KSRRLWQLTPVYTSEWAK 272


>gi|326517956|dbj|BAK07230.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 273

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 128/198 (64%), Positives = 155/198 (78%), Gaps = 6/198 (3%)

Query: 80  SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGL 139
           SS A+IR+EVLSPFRSVRMFFYLAF+AS  LG LIA+TQL+ AL NPARAA VPE LKGL
Sbjct: 72  SSMAKIRSEVLSPFRSVRMFFYLAFMASAGLGSLIALTQLLPALGNPARAAGVPETLKGL 131

Query: 140 SIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRV-DEKKIISVNNLRGIARLV 198
            IDV+AV++FAFLYSR+ KA +AQ+ARL+REE+LS LKLRV D  +++ ++ LRG ARLV
Sbjct: 132 GIDVAAVAVFAFLYSRDRKASDAQVARLTREENLSRLKLRVGDGGRVVPLSELRGSARLV 191

Query: 199 ICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEES-----EEMKEFTR 253
           I +GPA ++ ESF+RS PF   L+ERGVLV+ F TDGN P+ EF E       E  E  R
Sbjct: 192 IVAGPAEFVAESFRRSRPFLRDLMERGVLVLPFPTDGNAPALEFGEDADGGEAEDDEVAR 251

Query: 254 KRKRLWQLTPVYVSEWSK 271
           K +RLWQLTPVY SEW+K
Sbjct: 252 KSRRLWQLTPVYTSEWAK 269


>gi|32489514|emb|CAE04717.1| OSJNBa0043L24.5 [Oryza sativa Japonica Group]
 gi|38567844|emb|CAE05687.2| OSJNBb0002J11.14 [Oryza sativa Japonica Group]
 gi|116310319|emb|CAH67335.1| OSIGBa0157A06.4 [Oryza sativa Indica Group]
 gi|125590937|gb|EAZ31287.1| hypothetical protein OsJ_15394 [Oryza sativa Japonica Group]
 gi|215692749|dbj|BAG88169.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 330

 Score =  244 bits (622), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 128/222 (57%), Positives = 169/222 (76%), Gaps = 14/222 (6%)

Query: 64  CSNITCSATDRSDP------EISSA--ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIA 115
           C  + CSA ++  P      E+SS   A+IR+EVLSPFRSVRMFFYLAF+AS  LG LIA
Sbjct: 46  CRGVRCSAANKPSPSTAPGTEVSSTSMAKIRSEVLSPFRSVRMFFYLAFMASAGLGALIA 105

Query: 116 ITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSN 175
           +TQLI AL++PARAA   E LKGL IDV+AVS+FAFLY RE+KAK+AQ+A+L+REE+LS 
Sbjct: 106 LTQLIPALSSPARAAAAGETLKGLGIDVAAVSVFAFLYWRESKAKDAQVAKLTREENLSR 165

Query: 176 LKLRVDE-KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTD 234
           L++R  E +  + +  LRG ARLVI +GPA+++ ESF+RS+PF + L+ERGVLVV F+TD
Sbjct: 166 LRIRAGEGRPPVPLGELRGTARLVIVAGPAAFVTESFRRSKPFLKDLMERGVLVVPFSTD 225

Query: 235 GNIPSFEFEESEEMKEFT-----RKRKRLWQLTPVYVSEWSK 271
           GN P  +F+E++E +E       + ++RLWQLTPVY SEW+K
Sbjct: 226 GNAPDLQFDEADEEEEEAAAAAGKMKRRLWQLTPVYTSEWAK 267


>gi|357167936|ref|XP_003581403.1| PREDICTED: uncharacterized protein LOC100835810 [Brachypodium
           distachyon]
          Length = 326

 Score =  238 bits (606), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 124/217 (57%), Positives = 158/217 (72%), Gaps = 14/217 (6%)

Query: 69  CSATDRSDP---EIS-----SAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLI 120
           CSA ++  P   E+S     S A+IR+EVLSPFRSVRMFFYLAF+AS  LG LIA+TQL+
Sbjct: 48  CSAANKPSPPPPEVSNSSSNSMAKIRSEVLSPFRSVRMFFYLAFMASAGLGSLIALTQLV 107

Query: 121 GALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRV 180
            AL NPAR A   E L+GL IDV+AV++FAFLYSR+ KA +AQ+ARL+REE+LS LKLRV
Sbjct: 108 PALTNPARLAGAAETLRGLGIDVAAVAVFAFLYSRDRKASDAQVARLTREENLSRLKLRV 167

Query: 181 DE-KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPS 239
               +++ +  LRG ARLVI +GPA ++ ESF+RS+PF + L++RGVLV+ F TDGN P+
Sbjct: 168 GAGGRVVPLGELRGSARLVIVAGPAEFVEESFRRSKPFLKDLMDRGVLVLPFPTDGNTPA 227

Query: 240 FEFE-----ESEEMKEFTRKRKRLWQLTPVYVSEWSK 271
            +F+        E      K KRLWQLTPVY SEW+K
Sbjct: 228 LQFDGEAEEGENEGDGIGEKSKRLWQLTPVYTSEWAK 264


>gi|215694761|dbj|BAG89952.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 250

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 120/195 (61%), Positives = 157/195 (80%), Gaps = 6/195 (3%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           A+IR+EVLSPFRSVRMFFYLAF+AS  LG LIA+TQLI AL++PARAA   E LKGL ID
Sbjct: 2   AKIRSEVLSPFRSVRMFFYLAFMASAGLGALIALTQLIPALSSPARAAAAGETLKGLGID 61

Query: 143 VSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDE-KKIISVNNLRGIARLVICS 201
           V+AVS+FAFLY RE+KAK+AQ+A+L+REE+LS L++R  E +  + +  LRG ARLVI +
Sbjct: 62  VAAVSVFAFLYWRESKAKDAQVAKLTREENLSRLRIRAGEGRPPVPLGELRGTARLVIVA 121

Query: 202 GPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFT-----RKRK 256
           GPA+++ ESF+RS+PF + L+ERGVLVV F+TDGN P  +F+E++E +E       + ++
Sbjct: 122 GPAAFVTESFRRSKPFLKDLMERGVLVVPFSTDGNAPDLQFDEADEEEEEAAAAAGKMKR 181

Query: 257 RLWQLTPVYVSEWSK 271
           RLWQLTPVY SEW+K
Sbjct: 182 RLWQLTPVYTSEWAK 196


>gi|242076406|ref|XP_002448139.1| hypothetical protein SORBIDRAFT_06g021960 [Sorghum bicolor]
 gi|241939322|gb|EES12467.1| hypothetical protein SORBIDRAFT_06g021960 [Sorghum bicolor]
          Length = 332

 Score =  234 bits (597), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 132/266 (49%), Positives = 177/266 (66%), Gaps = 27/266 (10%)

Query: 28  SRFCRQILNKYRSRTTPGLSISLPLHSTWTAKRSPLC-SNITCSATDRSDP--------- 77
           S  C  +    R RT   +S  +P     +  R+  C + + CSA ++  P         
Sbjct: 9   SAVCVPVPCAERLRTPRPISRRVPHKPALSCSRA--CRAGVRCSAANKPSPPPPTTPDSS 66

Query: 78  EISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLI----GALANPARAADVP 133
           E+SS A+IR+EVLSPFRSVRMFFYLAF+ASG+LGGLIA+ +L+    G+ ++PARAAD  
Sbjct: 67  EVSSMAKIRSEVLSPFRSVRMFFYLAFMASGTLGGLIALARLLPLLSGSASDPARAADT- 125

Query: 134 EILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVD--EKKIISVNNL 191
             LKGL IDV+AVSLFAFLY+RE+KAK+AQ+ARL+REE LS L+LRV   E +  +++ L
Sbjct: 126 --LKGLGIDVAAVSLFAFLYARESKAKDAQVARLAREERLSRLRLRVGAAEGRPFTLSEL 183

Query: 192 RGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEM--- 248
           RG ARLVI +GPA ++ ESF+RS+PF   L ER VL V F TDGN P    ++  +    
Sbjct: 184 RGTARLVIVAGPADFVAESFRRSQPFLRELAERAVLAVPFATDGNTPELRLDDGGDEDVI 243

Query: 249 ---KEFTRKRKRLWQLTPVYVSEWSK 271
               +  R+ KRLWQLTPVY++EW++
Sbjct: 244 DGGDDVARRSKRLWQLTPVYITEWAQ 269


>gi|414586451|tpg|DAA37022.1| TPA: hypothetical protein ZEAMMB73_248209 [Zea mays]
          Length = 282

 Score =  196 bits (499), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 116/209 (55%), Positives = 148/209 (70%), Gaps = 18/209 (8%)

Query: 78  EISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGAL--ANPARAADVPEI 135
           E+SS A+IR+EVLSPFRSVRMFFYLAF+ASG+LGGLIA+T+L+ AL  ++PA AA     
Sbjct: 59  EVSSIAKIRSEVLSPFRSVRMFFYLAFMASGTLGGLIALTRLLPALSGSDPASAAST--- 115

Query: 136 LKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLR--------VDEKKIIS 187
           LKGL IDV+AVSLFAFLYSRE+KAK+AQ+ARL+REE LS L+LR            +  +
Sbjct: 116 LKGLGIDVAAVSLFAFLYSRESKAKDAQVARLAREERLSRLRLRLRAGAAEGAGSSRPFT 175

Query: 188 VNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF----- 242
           +  LRG  RLVI +GPA ++ ESF+RS+PF   L ER VL V F TDG+ P         
Sbjct: 176 LGELRGTTRLVIVAGPAEFVAESFRRSQPFLRELAERAVLAVPFATDGSTPELRLDDGGD 235

Query: 243 EESEEMKEFTRKRKRLWQLTPVYVSEWSK 271
           +E     +  R+ KRLWQLTPV+ +EW++
Sbjct: 236 DEDGVDDDVARRSKRLWQLTPVFTTEWAQ 264


>gi|414586452|tpg|DAA37023.1| TPA: hypothetical protein ZEAMMB73_248209 [Zea mays]
          Length = 327

 Score =  196 bits (498), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 116/209 (55%), Positives = 148/209 (70%), Gaps = 18/209 (8%)

Query: 78  EISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGAL--ANPARAADVPEI 135
           E+SS A+IR+EVLSPFRSVRMFFYLAF+ASG+LGGLIA+T+L+ AL  ++PA AA     
Sbjct: 59  EVSSIAKIRSEVLSPFRSVRMFFYLAFMASGTLGGLIALTRLLPALSGSDPASAAST--- 115

Query: 136 LKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLR--------VDEKKIIS 187
           LKGL IDV+AVSLFAFLYSRE+KAK+AQ+ARL+REE LS L+LR            +  +
Sbjct: 116 LKGLGIDVAAVSLFAFLYSRESKAKDAQVARLAREERLSRLRLRLRAGAAEGAGSSRPFT 175

Query: 188 VNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF----- 242
           +  LRG  RLVI +GPA ++ ESF+RS+PF   L ER VL V F TDG+ P         
Sbjct: 176 LGELRGTTRLVIVAGPAEFVAESFRRSQPFLRELAERAVLAVPFATDGSTPELRLDDGGD 235

Query: 243 EESEEMKEFTRKRKRLWQLTPVYVSEWSK 271
           +E     +  R+ KRLWQLTPV+ +EW++
Sbjct: 236 DEDGVDDDVARRSKRLWQLTPVFTTEWAQ 264


>gi|226496251|ref|NP_001140255.1| uncharacterized protein LOC100272297 [Zea mays]
 gi|194698712|gb|ACF83440.1| unknown [Zea mays]
 gi|414586449|tpg|DAA37020.1| TPA: hypothetical protein ZEAMMB73_248209 [Zea mays]
          Length = 229

 Score =  193 bits (490), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 115/207 (55%), Positives = 146/207 (70%), Gaps = 18/207 (8%)

Query: 80  SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGAL--ANPARAADVPEILK 137
           SS A+IR+EVLSPFRSVRMFFYLAF+ASG+LGGLIA+T+L+ AL  ++PA AA     LK
Sbjct: 8   SSIAKIRSEVLSPFRSVRMFFYLAFMASGTLGGLIALTRLLPALSGSDPASAAST---LK 64

Query: 138 GLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLR--------VDEKKIISVN 189
           GL IDV+AVSLFAFLYSRE+KAK+AQ+ARL+REE LS L+LR            +  ++ 
Sbjct: 65  GLGIDVAAVSLFAFLYSRESKAKDAQVARLAREERLSRLRLRLRAGAAEGAGSSRPFTLG 124

Query: 190 NLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF-----EE 244
            LRG  RLVI +GPA ++ ESF+RS+PF   L ER VL V F TDG+ P         +E
Sbjct: 125 ELRGTTRLVIVAGPAEFVAESFRRSQPFLRELAERAVLAVPFATDGSTPELRLDDGGDDE 184

Query: 245 SEEMKEFTRKRKRLWQLTPVYVSEWSK 271
                +  R+ KRLWQLTPV+ +EW++
Sbjct: 185 DGVDDDVARRSKRLWQLTPVFTTEWAQ 211


>gi|168020188|ref|XP_001762625.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686033|gb|EDQ72424.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 261

 Score =  193 bits (490), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 97/202 (48%), Positives = 141/202 (69%), Gaps = 16/202 (7%)

Query: 80  SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGL 139
           SS A+IR+EV++PFR++RMFFY AF+ASG +GGLIAIT+LI AL N   + D  EI KG+
Sbjct: 4   SSDAKIRSEVIAPFRALRMFFYGAFMASGGVGGLIAITRLIAALNNAPGSGDPIEIAKGI 63

Query: 140 SIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVI 199
           +ID  AV +FA LY  ++KA++  LA+++REE+LSNL+L +  ++++++  LRG ARLVI
Sbjct: 64  AIDFGAVLIFALLYRSDSKARDKSLAKIAREENLSNLRLELSNRRVVTIGELRGTARLVI 123

Query: 200 CSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFE----------FEESEEMK 249
            +GP SYI E+FK+SEP+ + LL+RGVLV ++ TDG   SF             +S  + 
Sbjct: 124 LAGPGSYIEEAFKQSEPYKQELLDRGVLVAAYATDG--ASFADYSSVSSSIESNQSVAVA 181

Query: 250 EFTRKRKRLWQLTPVYVSEWSK 271
           +  +K    W+ TP+Y SEW +
Sbjct: 182 DLEKK----WRATPIYTSEWGR 199


>gi|414586450|tpg|DAA37021.1| TPA: hypothetical protein ZEAMMB73_248209 [Zea mays]
          Length = 274

 Score =  192 bits (489), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 115/207 (55%), Positives = 146/207 (70%), Gaps = 18/207 (8%)

Query: 80  SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGAL--ANPARAADVPEILK 137
           SS A+IR+EVLSPFRSVRMFFYLAF+ASG+LGGLIA+T+L+ AL  ++PA AA     LK
Sbjct: 8   SSIAKIRSEVLSPFRSVRMFFYLAFMASGTLGGLIALTRLLPALSGSDPASAAST---LK 64

Query: 138 GLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLR--------VDEKKIISVN 189
           GL IDV+AVSLFAFLYSRE+KAK+AQ+ARL+REE LS L+LR            +  ++ 
Sbjct: 65  GLGIDVAAVSLFAFLYSRESKAKDAQVARLAREERLSRLRLRLRAGAAEGAGSSRPFTLG 124

Query: 190 NLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF-----EE 244
            LRG  RLVI +GPA ++ ESF+RS+PF   L ER VL V F TDG+ P         +E
Sbjct: 125 ELRGTTRLVIVAGPAEFVAESFRRSQPFLRELAERAVLAVPFATDGSTPELRLDDGGDDE 184

Query: 245 SEEMKEFTRKRKRLWQLTPVYVSEWSK 271
                +  R+ KRLWQLTPV+ +EW++
Sbjct: 185 DGVDDDVARRSKRLWQLTPVFTTEWAQ 211


>gi|414586447|tpg|DAA37018.1| TPA: hypothetical protein ZEAMMB73_248209 [Zea mays]
          Length = 204

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 100/189 (52%), Positives = 129/189 (68%), Gaps = 18/189 (9%)

Query: 98  MFFYLAFVASGSLGGLIAITQLIGALA--NPARAADVPEILKGLSIDVSAVSLFAFLYSR 155
           MFFYLAF+ASG+LGGLIA+T+L+ AL+  +PA AA     LKGL IDV+AVSLFAFLYSR
Sbjct: 1   MFFYLAFMASGTLGGLIALTRLLPALSGSDPASAAST---LKGLGIDVAAVSLFAFLYSR 57

Query: 156 ENKAKNAQLARLSREESLSNLKLR--------VDEKKIISVNNLRGIARLVICSGPASYI 207
           E+KAK+AQ+ARL+REE LS L+LR            +  ++  LRG  RLVI +GPA ++
Sbjct: 58  ESKAKDAQVARLAREERLSRLRLRLRAGAAEGAGSSRPFTLGELRGTTRLVIVAGPAEFV 117

Query: 208 MESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF-----EESEEMKEFTRKRKRLWQLT 262
            ESF+RS+PF   L ER VL V F TDG+ P         +E     +  R+ KRLWQLT
Sbjct: 118 AESFRRSQPFLRELAERAVLAVPFATDGSTPELRLDDGGDDEDGVDDDVARRSKRLWQLT 177

Query: 263 PVYVSEWSK 271
           PV+ +EW++
Sbjct: 178 PVFTTEWAQ 186


>gi|194691534|gb|ACF79851.1| unknown [Zea mays]
 gi|414586448|tpg|DAA37019.1| TPA: hypothetical protein ZEAMMB73_248209 [Zea mays]
          Length = 249

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 100/189 (52%), Positives = 129/189 (68%), Gaps = 18/189 (9%)

Query: 98  MFFYLAFVASGSLGGLIAITQLIGALA--NPARAADVPEILKGLSIDVSAVSLFAFLYSR 155
           MFFYLAF+ASG+LGGLIA+T+L+ AL+  +PA AA     LKGL IDV+AVSLFAFLYSR
Sbjct: 1   MFFYLAFMASGTLGGLIALTRLLPALSGSDPASAAST---LKGLGIDVAAVSLFAFLYSR 57

Query: 156 ENKAKNAQLARLSREESLSNLKLR--------VDEKKIISVNNLRGIARLVICSGPASYI 207
           E+KAK+AQ+ARL+REE LS L+LR            +  ++  LRG  RLVI +GPA ++
Sbjct: 58  ESKAKDAQVARLAREERLSRLRLRLRAGAAEGAGSSRPFTLGELRGTTRLVIVAGPAEFV 117

Query: 208 MESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF-----EESEEMKEFTRKRKRLWQLT 262
            ESF+RS+PF   L ER VL V F TDG+ P         +E     +  R+ KRLWQLT
Sbjct: 118 AESFRRSQPFLRELAERAVLAVPFATDGSTPELRLDDGGDDEDGVDDDVARRSKRLWQLT 177

Query: 263 PVYVSEWSK 271
           PV+ +EW++
Sbjct: 178 PVFTTEWAQ 186


>gi|302811321|ref|XP_002987350.1| hypothetical protein SELMODRAFT_125820 [Selaginella moellendorffii]
 gi|300144985|gb|EFJ11665.1| hypothetical protein SELMODRAFT_125820 [Selaginella moellendorffii]
          Length = 287

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 86/204 (42%), Positives = 132/204 (64%), Gaps = 12/204 (5%)

Query: 80  SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA-----ADVPE 134
           S  ARIR EV+SPFRS+R+F Y A +AS ++  L A+ +LI ALAN   A     A + +
Sbjct: 22  SREARIREEVISPFRSLRLFLYPALMASATIAALTALPRLIAALANSPNATVENNASLGQ 81

Query: 135 ILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGI 194
           +L+GL IDV+A+ +FA LY  + KA++ QLA+LSREESL++L++ +  K++ ++  +RG 
Sbjct: 82  VLEGLGIDVAAIVVFALLYRSDTKARDLQLAKLSREESLASLRVELGNKRVATLEQMRGS 141

Query: 195 ARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTT-------DGNIPSFEFEESEE 247
           AR VI +GP S++ ++ + S PF E LLERGV +V F T       +    SF+ ++S+ 
Sbjct: 142 ARFVIIAGPKSHVDQALELSLPFQEQLLERGVTLVPFVTPPAASTEESQSSSFQAQKSQA 201

Query: 248 MKEFTRKRKRLWQLTPVYVSEWSK 271
             + +    R W + P+Y SEW+K
Sbjct: 202 KDDDSAAAARRWIVLPIYTSEWTK 225


>gi|218195173|gb|EEC77600.1| hypothetical protein OsI_16566 [Oryza sativa Indica Group]
          Length = 805

 Score =  163 bits (413), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 82/144 (56%), Positives = 113/144 (78%), Gaps = 6/144 (4%)

Query: 134 EILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDE-KKIISVNNLR 192
           E LKGL IDV+AVS+FAFLY RE+KAK+AQ+A+L+REE+LS L++R  E +  + +  LR
Sbjct: 292 ETLKGLGIDVAAVSVFAFLYWRESKAKDAQVAKLTREENLSRLRIRAGEGRPPVPLGELR 351

Query: 193 GIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFT 252
           G ARLVI +GPA+++ ESF+RS+PF + L+ERGVLVV F+TDGN P  +F+E++E +E  
Sbjct: 352 GTARLVIVAGPAAFVTESFRRSKPFLKDLMERGVLVVPFSTDGNAPDLQFDEADEEEEEA 411

Query: 253 -----RKRKRLWQLTPVYVSEWSK 271
                + ++RLWQLTPVY SEW+K
Sbjct: 412 AAAAGKMKRRLWQLTPVYTSEWAK 435



 Score =  163 bits (413), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 82/144 (56%), Positives = 113/144 (78%), Gaps = 6/144 (4%)

Query: 134 EILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDE-KKIISVNNLR 192
           E LKGL IDV+AVS+FAFLY RE+KAK+AQ+A+L+REE+LS L++R  E +  + +  LR
Sbjct: 599 ETLKGLGIDVAAVSVFAFLYWRESKAKDAQVAKLTREENLSRLRIRAGEGRPPVPLGELR 658

Query: 193 GIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFT 252
           G ARLVI +GPA+++ ESF+RS+PF + L+ERGVLVV F+TDGN P  +F+E++E +E  
Sbjct: 659 GTARLVIVAGPAAFVTESFRRSKPFLKDLMERGVLVVPFSTDGNAPDLQFDEADEEEEEA 718

Query: 253 -----RKRKRLWQLTPVYVSEWSK 271
                + ++RLWQLTPVY SEW+K
Sbjct: 719 AAAAGKMKRRLWQLTPVYTSEWAK 742


>gi|302814937|ref|XP_002989151.1| hypothetical protein SELMODRAFT_129404 [Selaginella moellendorffii]
 gi|300143051|gb|EFJ09745.1| hypothetical protein SELMODRAFT_129404 [Selaginella moellendorffii]
          Length = 290

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 86/207 (41%), Positives = 132/207 (63%), Gaps = 15/207 (7%)

Query: 80  SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA-----ADVPE 134
           S  ARIR EV+SPFRS+R+F Y A +AS ++  L A+ +LI ALAN   A     A + +
Sbjct: 22  SREARIREEVISPFRSLRLFLYPALMASATIAALTALPRLIAALANSPNATVENNASLGQ 81

Query: 135 ILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGI 194
           +L+GL IDV+A+ +FA LY  + KA++ QLA+LSREESL++L++ +  K++ ++  +RG 
Sbjct: 82  VLEGLGIDVTAIVIFALLYRSDTKARDLQLAKLSREESLASLRVELGNKRVATLEQMRGS 141

Query: 195 ARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTT-------DGNIPSFEFEESEE 247
           AR VI +GP S++ ++ + S PF E LLERGV +V F T       +    SF+ ++S+ 
Sbjct: 142 ARFVIIAGPKSHVDQALELSLPFQEQLLERGVTLVPFVTPPAASTEESQSSSFQAQKSQA 201

Query: 248 MKEFT---RKRKRLWQLTPVYVSEWSK 271
             + +       R W + P+Y SEW+K
Sbjct: 202 KDDDSAAAAAAARRWIVLPIYTSEWTK 228


>gi|255088718|ref|XP_002506281.1| predicted protein [Micromonas sp. RCC299]
 gi|226521553|gb|ACO67539.1| predicted protein [Micromonas sp. RCC299]
          Length = 244

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 67/191 (35%), Positives = 106/191 (55%), Gaps = 16/191 (8%)

Query: 85  IRNEVLSPFRSVRMFFYLAFVASGSLG----GLIAITQLIGALANPARAADVPEILKGLS 140
           +R+E  +PFR+ RMF + AF A+ +LG     L A+T+ +GA + P     + + L+ + 
Sbjct: 1   MRSEAQAPFRTARMFVFGAFAANAALGLSIATLQAVTKALGAPSAPP----LDQSLQNIG 56

Query: 141 IDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVIC 200
           ID+ A   F +LY ++ + +  Q+AR+SREE LS LK  +   K++S+ +LRG +R+VI 
Sbjct: 57  IDLGAALFFGYLYKKDAENREKQMARISREERLSALKCELSSGKVVSLFDLRGFSRVVIA 116

Query: 201 SGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFTRKRKRLWQ 260
           +G A Y   S   +E   + LLERGVLV+     G   S +  +    + F        +
Sbjct: 117 AGDADYCNASIAAAEAVKDPLLERGVLVIPLVLGGAAESVDAPDDATDRRF--------R 168

Query: 261 LTPVYVSEWSK 271
            TPVY  +W K
Sbjct: 169 ATPVYTDKWVK 179


>gi|302828698|ref|XP_002945916.1| hypothetical protein VOLCADRAFT_120237 [Volvox carteri f.
           nagariensis]
 gi|300268731|gb|EFJ52911.1| hypothetical protein VOLCADRAFT_120237 [Volvox carteri f.
           nagariensis]
          Length = 327

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 72/205 (35%), Positives = 109/205 (53%), Gaps = 22/205 (10%)

Query: 81  SAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLS 140
            A ++R E  +PFRS+R+ F    V S  LG LI+I QLIGAL     A  + ++ + L+
Sbjct: 64  DAVQMRAEAEAPFRSLRLVFSGFSVVSAGLGFLISIPQLIGALGGARSALPLEQVQQNLA 123

Query: 141 IDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVIC 200
           ID  AV++FAFL+  +  A+  QLARL REE+L  L +R+   K + + +LRG AR+V+ 
Sbjct: 124 IDAGAVAVFAFLFKLDWDARTKQLARLQREETLGALPIRLASGKSLRLGDLRGSARVVLV 183

Query: 201 SGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF---------------EES 245
           +G    +  + + +E   E L++RGVLVV          FE                EE 
Sbjct: 184 AGTRRQVAAALREAESLREELVKRGVLVVPVVL------FELASSGAGGGAAGGGATEEP 237

Query: 246 EEMKEFTRKRKRLWQLTPVYVSEWS 270
           EE+ + T   +R  +  P  ++EW 
Sbjct: 238 EELPQLTPADQRF-RAEPQRLAEWQ 261


>gi|145355072|ref|XP_001421795.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582033|gb|ABP00089.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 290

 Score =  107 bits (267), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 105/196 (53%), Gaps = 13/196 (6%)

Query: 75  SDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPE 134
            D   S+  ++R+E  +PFR  R F Y A  AS ++G  IA  Q     A    A  +  
Sbjct: 40  DDARYSNETKLRSEAQAPFRVARQFLYGACAASATIGFGIATIQAATKAAGAPNAPPLEG 99

Query: 135 ILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGI 194
            L+ L+ID +A++ FA+LY+RE  A+  Q+AR+ REE L  L++ +   K + + +LR  
Sbjct: 100 SLENLAIDGAALATFAWLYAREEAARERQMARIGREERLGRLRVELAGGKTVRLEDLRSF 159

Query: 195 ARLVICSGPASYIMESFKRSEPFTESLLERGVLVVS-FTTDGNIPSFEFEESEEMKEFTR 253
           +R+V+ SG  +Y   + + +E   E+L+ERGVLVV     DG I +   E+         
Sbjct: 160 SRVVVVSGDEAYARTALEDAEDVREALIERGVLVVPVIRGDGAIEAPSAED--------- 210

Query: 254 KRKRLWQLTPVYVSEW 269
              R ++ TP+  ++W
Sbjct: 211 ---RKFRCTPLRANDW 223


>gi|303288507|ref|XP_003063542.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455374|gb|EEH52678.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 348

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 57/160 (35%), Positives = 96/160 (60%), Gaps = 4/160 (2%)

Query: 80  SSAARIRNEVLSPFRSVRMFFYLAFVASGSLG-GLIAITQLIGALANPARAADVPEILKG 138
           S+  ++R+E  +PFR+ RMF + AF A+ +LG G+  +  +  AL  P+ A  + + L+ 
Sbjct: 89  SNETKMRSEAQAPFRTARMFLFGAFAANATLGLGIATLQAVTKALGAPS-APPLEDSLQN 147

Query: 139 LSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRV--DEKKIISVNNLRGIAR 196
           + +D+   + FAF Y ++ + ++ Q+AR+SREE L  LK ++     K +S+  LRG +R
Sbjct: 148 MGVDLLCAAFFAFFYKKDAEGRDKQMARISREERLGALKCQLGGGSGKRVSMAQLRGFSR 207

Query: 197 LVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGN 236
           +VI +G A + +ES   +E   E L  RGVL+V   T G+
Sbjct: 208 VVIAAGSAEHCVESVNAAEAHREELTRRGVLLVPVVTSGD 247


>gi|302820375|ref|XP_002991855.1| hypothetical protein SELMODRAFT_134250 [Selaginella moellendorffii]
 gi|300140393|gb|EFJ07117.1| hypothetical protein SELMODRAFT_134250 [Selaginella moellendorffii]
          Length = 422

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 51/143 (35%), Positives = 87/143 (60%)

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
           +EV +PFR VR FFY+AF  +  +   I + +LI AL     A  V E  + L+ID++ +
Sbjct: 161 SEVQAPFRGVRKFFYVAFTIAAGIATFITVPRLIRALQGGEDAPSVTETAQNLAIDLAGI 220

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
             F  L+  ++K +  Q+++++R E+LS L +R+   +I+ V  LRGI R VI +G    
Sbjct: 221 VAFVSLFIWDSKKEETQMSQITRNETLSRLPVRLGTGRIVEVVQLRGITRPVILAGTKES 280

Query: 207 IMESFKRSEPFTESLLERGVLVV 229
           I  + +++E + + LL+RGVL++
Sbjct: 281 IKGALEKAERYRKDLLQRGVLLI 303


>gi|302822713|ref|XP_002993013.1| hypothetical protein SELMODRAFT_162731 [Selaginella moellendorffii]
 gi|300139213|gb|EFJ05959.1| hypothetical protein SELMODRAFT_162731 [Selaginella moellendorffii]
          Length = 422

 Score =  100 bits (249), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 51/143 (35%), Positives = 87/143 (60%)

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
           +EV +PFR VR FFY+AF  +  +   I + +LI AL     A  V E  + L+ID++ +
Sbjct: 161 SEVQAPFRGVRKFFYVAFTIAAGIATFITVPRLIRALQGGEDAPSVTETAQNLAIDLAGI 220

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
             F  L+  ++K +  Q+++++R E+LS L +R+   +I+ V  LRGI R VI +G    
Sbjct: 221 VAFVSLFIWDSKKEETQMSQITRNETLSRLPVRLGTGRIVEVVQLRGITRPVILAGTKES 280

Query: 207 IMESFKRSEPFTESLLERGVLVV 229
           I  + +++E + + LL+RGVL++
Sbjct: 281 IKGALEKAERYRKDLLQRGVLLI 303


>gi|412985100|emb|CCO20125.1| predicted protein [Bathycoccus prasinos]
          Length = 261

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 49/157 (31%), Positives = 91/157 (57%)

Query: 78  EISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILK 137
           + S+  ++R+E  +PFR  R F Y +   S ++G  IA  QL+  LA    +    + + 
Sbjct: 4   KFSNETKLRSEAQAPFRVARQFLYASCALSATIGFGIATIQLLTGLAGAPNSPPFDQSVS 63

Query: 138 GLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARL 197
            + ID +  +LF FL+ ++ +AK  Q++R+ REE L  LK+ +   KI+ + +LRG +R+
Sbjct: 64  NVGIDGACAALFVFLWKKDEEAKQKQMSRIGREERLGGLKVELTTGKIVRMQSLRGFSRV 123

Query: 198 VICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTD 234
           V+ +G   Y+ +S + +E + + L+++GV +V    D
Sbjct: 124 VLFAGDLEYLEKSLEVAEEYKDELIKKGVFLVPVAMD 160


>gi|384250888|gb|EIE24367.1| hypothetical protein COCSUDRAFT_83671 [Coccomyxa subellipsoidea
           C-169]
          Length = 253

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/177 (33%), Positives = 94/177 (53%), Gaps = 1/177 (0%)

Query: 94  RSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLY 153
           R++R+  +  F  S S+G LIA TQ+I +L    RA  + ++ + L ID++A   FAF  
Sbjct: 12  RALRLVLFGFFGISASVGALIATTQVIASLGGAQRALPLADVAQSLGIDLAAAGTFAFFL 71

Query: 154 SRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASYIMESFKR 213
            R+ +A++ Q+ARL+REE L  L L +   + + V  LR  +R+VI +G    +  + + 
Sbjct: 72  RRDLQARDKQIARLTREERLGALPLSLANGRRVKVGQLRSFSRVVIAAGSPEQVAAAVQA 131

Query: 214 SEPFTESLLERGVLVVSFTT-DGNIPSFEFEESEEMKEFTRKRKRLWQLTPVYVSEW 269
           +EP  E L ERGVLVV      G+  S       E    +      W+ TP+ + EW
Sbjct: 132 AEPLREQLQERGVLVVPLPIFPGDPASSNGAAPAESSGSSSADDLRWKATPLRLEEW 188


>gi|307110349|gb|EFN58585.1| hypothetical protein CHLNCDRAFT_140754 [Chlorella variabilis]
          Length = 272

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 90/151 (59%), Gaps = 1/151 (0%)

Query: 82  AARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALAN-PARAADVPEILKGLS 140
            A +R E  +PFRS+R+F + A  AS +L  L  +  LIGAL+  P     + E L+ L+
Sbjct: 11  GASLRAEAEAPFRSLRLFLFGAGAASAALATLFGLPSLIGALSGAPGATKSIAEALQDLA 70

Query: 141 IDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVIC 200
           I+V +VS+  +L  R+ +A++ Q+ARL RE+ L   +L +   +++ +  LRG AR+V+ 
Sbjct: 71  INVGSVSVLGYLVVRDLQARDIQMARLMREDELGACQLELANGRVLRLAQLRGFARVVLM 130

Query: 201 SGPASYIMESFKRSEPFTESLLERGVLVVSF 231
           +G    +  +   +EP+ + L ERGVLVV  
Sbjct: 131 AGTPQQVAAALAGAEPYKQQLQERGVLVVPL 161


>gi|449484299|ref|XP_004156843.1| PREDICTED: uncharacterized LOC101208184 [Cucumis sativus]
          Length = 435

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 56/159 (35%), Positives = 88/159 (55%), Gaps = 1/159 (0%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++ +EV +PFR VR FFY+A  A+  +  L  I +L  A+     A DV E    L+++V
Sbjct: 168 KLISEVQAPFRGVRKFFYVALSAAAGISLLFNIPRLFRAIQGGDGAPDVWETAGNLAVNV 227

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
             + +F  L+  +NK +  QLA++SR E+LS L LR+   +I+ +  LR   R VI +G 
Sbjct: 228 GGIIVFVALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGK 287

Query: 204 ASYIMESFKRSEPFTESLLERGVLVVSFT-TDGNIPSFE 241
              +  + +++E F   LL RGVL+V     +G  P  E
Sbjct: 288 KETVSSAIQKAERFRTELLRRGVLLVPVIWGEGREPQIE 326


>gi|115436512|ref|NP_001043014.1| Os01g0358300 [Oryza sativa Japonica Group]
 gi|53791615|dbj|BAD52962.1| unknown protein [Oryza sativa Japonica Group]
 gi|113532545|dbj|BAF04928.1| Os01g0358300 [Oryza sativa Japonica Group]
 gi|215765732|dbj|BAG87429.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218188197|gb|EEC70624.1| hypothetical protein OsI_01883 [Oryza sativa Indica Group]
 gi|222618419|gb|EEE54551.1| hypothetical protein OsJ_01736 [Oryza sativa Japonica Group]
          Length = 455

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 51/143 (35%), Positives = 83/143 (58%)

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
           +EV +PFR VR FFY+A  A+  +     I +LI A+     A D+ E +   +I++  +
Sbjct: 193 SEVQAPFRGVRRFFYVALTAAAGISTFFTIPRLILAVQGGDGAPDLLETVGNAAINIGGI 252

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
            +   LY  ENK +  Q+ ++SR E+LS L +R+   +II +  LR I R VI +G  + 
Sbjct: 253 VVLVALYFWENKKEEQQITQISRNETLSRLPVRLSTNRIIELVQLRDITRPVILAGSKAS 312

Query: 207 IMESFKRSEPFTESLLERGVLVV 229
           I  + +R+E +   LL+RGVL++
Sbjct: 313 ITRAMQRAERYRTELLKRGVLLI 335


>gi|326496368|dbj|BAJ94646.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 448

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 50/143 (34%), Positives = 82/143 (57%)

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
           +EV +PFR VR FFY+AF+A+  +     I +LI AL     A D  E     +I++  +
Sbjct: 186 SEVQAPFRGVRRFFYVAFIAAAGISTFFTIPRLILALQGGDGAPDFLETAGNAAINIGGI 245

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
            +   L+  ENK +  Q+  +SR E+LS L +R+   +I  +  LR I R VI +G  + 
Sbjct: 246 VVLVALFVWENKKEEEQITNISRNETLSRLPVRLSTNRITELVQLRDITRPVILAGSKAS 305

Query: 207 IMESFKRSEPFTESLLERGVLVV 229
           + ++ +R+E +   LL+RGVL++
Sbjct: 306 VTQAMQRAERYRTELLKRGVLLI 328


>gi|225437302|ref|XP_002263425.1| PREDICTED: uncharacterized protein LOC100245310 [Vitis vinifera]
 gi|297743856|emb|CBI36826.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 50/143 (34%), Positives = 81/143 (56%)

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
           +EV +PFR VR FFY+AF A+  +     I +L  A+     A D+ E     +I++  +
Sbjct: 173 SEVQAPFRGVRRFFYVAFSAAAGISTFFTIPRLFRAIKGGDGAPDIWETAGNAAINIGGI 232

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
                L+  +NK +  QLA++SR+E+LS L LR+   +++ +  LR   R VI +G    
Sbjct: 233 IALVLLFFWDNKKEEEQLAQISRDETLSRLPLRLSTNRVVELVQLRDTVRPVILAGTKET 292

Query: 207 IMESFKRSEPFTESLLERGVLVV 229
           +  + K++E F   LL+RGVL+V
Sbjct: 293 VSLAIKKAERFRTELLKRGVLLV 315


>gi|223972901|gb|ACN30638.1| unknown [Zea mays]
          Length = 437

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 49/143 (34%), Positives = 82/143 (57%)

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
           +EV +PFR VR FFY+A  A+  +     I +L+ A+     A D+ E     +I++  +
Sbjct: 175 SEVQAPFRGVRRFFYVALTAAAGISTFFTIPRLVLAVRGGDGAPDLLETAGNAAINIGGI 234

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
            +   LY  ENK +  Q+ ++SR E+LS L +R+   +II +  LR I R VI +G  + 
Sbjct: 235 VVLVALYFWENKKEEKQITQISRNETLSRLPVRLSTNRIIELVQLRDITRPVILAGSKAS 294

Query: 207 IMESFKRSEPFTESLLERGVLVV 229
           +  + +R+E +   LL+RGVL++
Sbjct: 295 VTRALQRAERYRTELLKRGVLLI 317


>gi|414877423|tpg|DAA54554.1| TPA: hypothetical protein ZEAMMB73_812331, partial [Zea mays]
          Length = 374

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 49/143 (34%), Positives = 82/143 (57%)

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
           +EV +PFR VR FFY+A  A+  +     I +L+ A+     A D+ E     +I++  +
Sbjct: 112 SEVQAPFRGVRRFFYVALTAAAGISTFFTIPRLVLAVRGGDGAPDLLETAGNAAINIGGI 171

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
            +   LY  ENK +  Q+ ++SR E+LS L +R+   +II +  LR I R VI +G  + 
Sbjct: 172 VVLVALYFWENKKEEKQITQISRNETLSRLPVRLSTNRIIELVQLRDITRPVILAGSKAS 231

Query: 207 IMESFKRSEPFTESLLERGVLVV 229
           +  + +R+E +   LL+RGVL++
Sbjct: 232 VTRALQRAERYRTELLKRGVLLI 254


>gi|194707430|gb|ACF87799.1| unknown [Zea mays]
          Length = 450

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 49/143 (34%), Positives = 82/143 (57%)

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
           +EV +PFR VR FFY+A  A+  +     I +L+ A+     A D+ E     +I++  +
Sbjct: 188 SEVQAPFRGVRRFFYVALTAAAGISTFFTIPRLVLAVRGGDGAPDLLETAGNAAINIGGI 247

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
            +   LY  ENK +  Q+ ++SR E+LS L +R+   +II +  LR I R VI +G  + 
Sbjct: 248 VVLVALYFWENKKEEKQITQISRNETLSRLPVRLSTNRIIELVQLRDITRPVILAGSKAS 307

Query: 207 IMESFKRSEPFTESLLERGVLVV 229
           +  + +R+E +   LL+RGVL++
Sbjct: 308 VTRALQRAERYRTELLKRGVLLI 330


>gi|388497672|gb|AFK36902.1| unknown [Medicago truncatula]
          Length = 446

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 82/143 (57%)

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
           +EV +PFR +R FFY+AF A+  +     + +++ A+     A DV E     ++++  +
Sbjct: 183 SEVQAPFRGIRRFFYVAFTAAAGISLFFNVPRILRAIQGGDGAPDVLETAGNAAVNIGGI 242

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
            +F  L+  +NK +  Q+A++SR E+LS L LR+   +++ +  LR   R VI +G    
Sbjct: 243 VVFVALFLWDNKKEEEQIAQISRNETLSRLPLRLSTNRVVELVQLRDTVRPVIIAGKKET 302

Query: 207 IMESFKRSEPFTESLLERGVLVV 229
           +  + +++E F   LL RGVL+V
Sbjct: 303 VTLALRKAERFRTDLLRRGVLLV 325


>gi|357511233|ref|XP_003625905.1| hypothetical protein MTR_7g108590 [Medicago truncatula]
 gi|355500920|gb|AES82123.1| hypothetical protein MTR_7g108590 [Medicago truncatula]
          Length = 446

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 82/143 (57%)

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
           +EV +PFR +R FFY+AF A+  +     + +++ A+     A DV E     ++++  +
Sbjct: 183 SEVQAPFRGIRRFFYVAFTAAAGISLFFNVPRILRAIQGGDGAPDVLETAGNAAVNIGGI 242

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
            +F  L+  +NK +  Q+A++SR E+LS L LR+   +++ +  LR   R VI +G    
Sbjct: 243 VVFVALFLWDNKKEEEQIAQISRNETLSRLPLRLSTNRVVELVQLRDTVRPVIIAGKKET 302

Query: 207 IMESFKRSEPFTESLLERGVLVV 229
           +  + +++E F   LL RGVL+V
Sbjct: 303 VTLALRKAERFRTDLLRRGVLLV 325


>gi|147785370|emb|CAN77388.1| hypothetical protein VITISV_033366 [Vitis vinifera]
          Length = 467

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 50/146 (34%), Positives = 81/146 (55%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++ +EV +PFR VR FFY+AF A+  +     I +L  A+     A D+ E     +I++
Sbjct: 201 KLISEVQAPFRGVRRFFYVAFSAAAGISTFFTIPRLFRAIKGGDGAPDIWETAGNAAINI 260

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
             +     L   +NK +  QLA++SR+E+LS L LR+   +++ +  LR   R VI +G 
Sbjct: 261 GGIIALVLLXFWDNKKEEEQLAQISRDETLSRLPLRLSTNRVVELVQLRDTVRPVILAGT 320

Query: 204 ASYIMESFKRSEPFTESLLERGVLVV 229
              +  + K +E F   LL+RGVL+V
Sbjct: 321 KETVSLAIKXAERFRTELLKRGVLLV 346


>gi|168023978|ref|XP_001764514.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162684378|gb|EDQ70781.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 458

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 84/146 (57%)

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
           +EV +PFR +R FFY+A   +  +  L  + + I A+     A  V E ++ L+I++   
Sbjct: 202 SEVQAPFRGIRKFFYVALFLAAGISTLFTVPRFILAIQGGDGAPGVLETVQNLAINIGGG 261

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
           + F  +Y  ENK +  Q++R++R+E+LS L +R+   + + ++ LR   R VI +G A  
Sbjct: 262 AAFIAIYLWENKKEEEQISRITRDETLSRLPVRLANNRTVELSALRENTRPVIIAGTAEN 321

Query: 207 IMESFKRSEPFTESLLERGVLVVSFT 232
           +  S + S+ + E LL+RGVL++   
Sbjct: 322 VERSIRASQKYREDLLKRGVLIIPLV 347


>gi|357128170|ref|XP_003565748.1| PREDICTED: uncharacterized protein LOC100821693 [Brachypodium
           distachyon]
          Length = 454

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 83/143 (58%)

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
           +EV +PFR VR FFY+AF+A+  +     I +LI A+     A D+ +     +I++  +
Sbjct: 192 SEVQAPFRGVRRFFYVAFIAAAGISTFFTIPRLILAVQGGDGAPDLLQTAGNAAINIGGI 251

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
            +   L+  EN+ +  Q+  +SR E+LS L +R+   +I  +  LR I R VI +G  + 
Sbjct: 252 VVLVALFIWENRKEEQQITVISRNETLSRLPVRLSTNRITELVQLRDITRPVILAGSKAS 311

Query: 207 IMESFKRSEPFTESLLERGVLVV 229
           + ++ +R+E +   LL+RGVL++
Sbjct: 312 VTKATQRAERYRTELLKRGVLLI 334


>gi|356572722|ref|XP_003554515.1| PREDICTED: uncharacterized protein LOC100793204 [Glycine max]
          Length = 437

 Score = 86.7 bits (213), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 82/146 (56%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++ +EV +PFR VR FFY+AF A+  +     + +L+ A+     A D+       +I++
Sbjct: 171 KLISEVQAPFRGVRRFFYVAFTAAAGISLFFTVPRLLRAINGGDGAPDLLATAGNAAINI 230

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
             + +   L+  +NK +  QLA++SR+E+LS L L +   +++ +  LR   R VI +G 
Sbjct: 231 GGIVVLVALFFWDNKKEEEQLAQISRDETLSRLPLCLSTNRVVELVQLRDTVRPVILAGK 290

Query: 204 ASYIMESFKRSEPFTESLLERGVLVV 229
              +  + +R+E F   LL RGVL+V
Sbjct: 291 KETVSLAMQRAERFQTELLRRGVLLV 316


>gi|356505558|ref|XP_003521557.1| PREDICTED: uncharacterized protein LOC100784188 [Glycine max]
          Length = 437

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 82/146 (56%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++ +EV +PFR VR FFY+AF A+  +     + +L+ A+     A D+       +I++
Sbjct: 171 KLISEVQAPFRGVRRFFYVAFTAAAGISLFFTLPRLLRAINGGDGAPDLLATAGNAAINI 230

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
             + +   L+  +NK +  QLA++SR+E+LS L L +   +++ +  LR   R VI +G 
Sbjct: 231 GGIVVLVALFFWDNKKEEEQLAQISRDETLSRLPLCLSTNRVVELVQLRDTVRPVILAGK 290

Query: 204 ASYIMESFKRSEPFTESLLERGVLVV 229
              +  + +R+E F   LL RGVL+V
Sbjct: 291 KETVSLAMQRAERFRTELLRRGVLLV 316


>gi|255582166|ref|XP_002531877.1| conserved hypothetical protein [Ricinus communis]
 gi|223528485|gb|EEF30514.1| conserved hypothetical protein [Ricinus communis]
          Length = 369

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 82/146 (56%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++ +EV +PFR +R FFY+AF A+  +  L  + +L  A+     A D+       +I++
Sbjct: 98  KLISEVQAPFRGIRRFFYVAFSAAAGISLLFTVPRLFRAIKGGDGAPDLWNTAGNAAINI 157

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
             + +   L   +NK +  Q+A+++R+E+LS L LR+   +++ +  LR   R VI +G 
Sbjct: 158 GGIVVLVALLFWDNKKEEEQIAQITRDETLSRLPLRLSTNRVVELVQLRDTVRPVILAGK 217

Query: 204 ASYIMESFKRSEPFTESLLERGVLVV 229
              +  + +++E F   LL RGVL+V
Sbjct: 218 KETVSLAMQKAERFRTELLRRGVLLV 243


>gi|449469098|ref|XP_004152258.1| PREDICTED: uncharacterized protein LOC101208184 [Cucumis sativus]
          Length = 427

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 56/159 (35%), Positives = 85/159 (53%), Gaps = 9/159 (5%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++ +EV +PFR VR FFY+A  A+  +  L  I +L  A+     A DV E    L+++V
Sbjct: 168 KLISEVQAPFRGVRKFFYVALSAAAGISLLFNIPRLFRAIQGGDGAPDVWETAGNLAVNV 227

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
             + +F  L+  +NK +  QLA++SR E+LS L LR+   +I+ +  LR   R VI +G 
Sbjct: 228 GGIIVFVALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGK 287

Query: 204 ASYIMESFKRSEPFTESLLERGVLVVSFT-TDGNIPSFE 241
              +  +       TE LL RGVL+V     +G  P  E
Sbjct: 288 KETVSSA-------TE-LLRRGVLLVPVIWGEGREPQIE 318


>gi|307104280|gb|EFN52535.1| hypothetical protein CHLNCDRAFT_58882 [Chlorella variabilis]
          Length = 445

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 45/153 (29%), Positives = 78/153 (50%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R+R+E  +PFR  R+ F        +LG  I  ++L+ AL     A D+ E  +   I+ 
Sbjct: 168 RLRSEAKAPFRLTRIIFLGGLAVGAALGLFIITSRLVAALQGGEGAPDLQETAQNFGINA 227

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
           +A+ +  F   R+  ++N     + REE+L  L++ +  ++++ +   RG  R VI +G 
Sbjct: 228 AALGVLGFFVYRDLSSQNKDQQVILREEALGRLQVSLGGERVLPLAAFRGTTRPVIIAGT 287

Query: 204 ASYIMESFKRSEPFTESLLERGVLVVSFTTDGN 236
            + +  +   +EPF E+L ERGV VV     G 
Sbjct: 288 RTQLSRALAGAEPFQEALRERGVSVVPVQLAGE 320


>gi|308812838|ref|XP_003083726.1| unnamed protein product [Ostreococcus tauri]
 gi|116055607|emb|CAL58275.1| unnamed protein product [Ostreococcus tauri]
          Length = 273

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 74/139 (53%), Gaps = 17/139 (12%)

Query: 136 LKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVD-EKKIISVNNLRGI 194
           L+ +++D  A+  F  LY RE+ A+  Q+AR+ REE L  L+  +    K + +  LRG 
Sbjct: 80  LENVAVDGVALPAFGTLYRREDDARARQMARIGREERLGRLRCELGGNGKTVRLEQLRGF 139

Query: 195 ARLVICSGPASYIMESFKRSEPFTESLLERGVLVV----SFTTDGNIPSFEFEESEEMKE 250
           +R+V+ SG A Y+ ++   +E   E+L+ERGVL+V      + DG +     E+      
Sbjct: 140 SRVVVVSGDAEYVNDAIADAEAVREALMERGVLIVPVLNGASADGEVAPVGGED------ 193

Query: 251 FTRKRKRLWQLTPVYVSEW 269
                 R ++ TP+  +EW
Sbjct: 194 ------RKFRATPLRANEW 206


>gi|326487892|dbj|BAJ89785.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 359

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 43/127 (33%), Positives = 69/127 (54%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++ +EV +PFR VR FFY+AF+A+  +     I +LI AL     A D  E     +I++
Sbjct: 211 KLISEVQAPFRGVRRFFYVAFIAAAGISTFFTIPRLILALQGGDGAPDFLETAGNAAINI 270

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
             + +   L+  ENK +  Q+  +SR E+LS L +R+   +I  +  LR I R V     
Sbjct: 271 GGIVVLVALFVWENKKEEEQITNISRNETLSRLPVRLSTNRITELVQLRDITRPVSIILL 330

Query: 204 ASYIMES 210
           AS+I+ +
Sbjct: 331 ASFILTT 337


>gi|326510437|dbj|BAJ87435.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 345

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 43/127 (33%), Positives = 69/127 (54%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++ +EV +PFR VR FFY+AF+A+  +     I +LI AL     A D  E     +I++
Sbjct: 197 KLISEVQAPFRGVRRFFYVAFIAAAGISTFFTIPRLILALQGGDGAPDFLETAGNAAINI 256

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
             + +   L+  ENK +  Q+  +SR E+LS L +R+   +I  +  LR I R V     
Sbjct: 257 GGIVVLVALFVWENKKEEEQITNISRNETLSRLPVRLSTNRITELVQLRDITRPVSIILL 316

Query: 204 ASYIMES 210
           AS+I+ +
Sbjct: 317 ASFILTT 323


>gi|422293631|gb|EKU20931.1| hypothetical protein NGA_0121000 [Nannochloropsis gaditana CCMP526]
          Length = 392

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 60/208 (28%), Positives = 102/208 (49%), Gaps = 18/208 (8%)

Query: 78  EISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILK 137
           + S   R+R E  SPFRS+R   Y    ASG++G L A+T  + A +    A  + E  K
Sbjct: 126 DYSVMTRLREEAESPFRSLRFLIYGGVGASGAIGALTALTGALAAFSGVRGAVPLSESGK 185

Query: 138 GLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVD------EKKIISVNNL 191
            ++ID+ A+ L A L++ +++ K  +L RLSR   +++LK++V       E+ ++S+ + 
Sbjct: 186 DIAIDLGAIGLAALLWNLDSQTKARKLGRLSRGAKIASLKVQVQDGGGAGERPVLSLADF 245

Query: 192 ---RGIA-RLVICSGPASYIMESFKRS--EPFTESLLERGVLVVSFT--TDG-NIPSFEF 242
              RG+A R+VI +G    +  +   +      E L+    LVV      DG + PS   
Sbjct: 246 RRDRGMARRVVIFAGGEDVVQAALNTAGEAGMKERLIRNAFLVVPLVLPADGEDNPSVLG 305

Query: 243 EESEEMKEFTRKRKRLWQLTPVYVSEWS 270
              E ++    +    W   P+ + EW 
Sbjct: 306 LAQERLQGAQDEPHVGW---PLLLREWQ 330


>gi|159476322|ref|XP_001696260.1| hypothetical protein CHLREDRAFT_142189 [Chlamydomonas reinhardtii]
 gi|158282485|gb|EDP08237.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 222

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/125 (35%), Positives = 67/125 (53%), Gaps = 26/125 (20%)

Query: 105 VASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQL 164
           V S  LG LI+I QLIGAL     A  + ++                           Q+
Sbjct: 8   VVSAGLGFLISIPQLIGALGGARSALPLEQV--------------------------EQI 41

Query: 165 ARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLER 224
           ARL+REE+L +L +R+   K++S+N+LRG AR+V+ +G  + +  + + +EP  E L +R
Sbjct: 42  ARLTREETLGSLPIRLSSGKMLSLNDLRGTARVVLAAGTRAQVAAALREAEPLREELQKR 101

Query: 225 GVLVV 229
           GVLVV
Sbjct: 102 GVLVV 106


>gi|298710249|emb|CBJ26324.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 345

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 79/153 (51%), Gaps = 7/153 (4%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R+R E  SPFR VR+F + +FV S ++G L+A T+ I       +   + E+L  +++DV
Sbjct: 85  RLRVEAESPFRKVRIFLFSSFVLSATVGLLVAGTRTIALTQGIDQGQSMTELLTNVAVDV 144

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGI-------AR 196
           +A+ +   L  R+  A+  ++AR+     L+ LK+R+  K  +S   L  +        R
Sbjct: 145 TAIVVCVGLVKRDLDAQEGRMARMEIGAKLAGLKVRLQTKDDLSTVTLSALRRDRGRDKR 204

Query: 197 LVICSGPASYIMESFKRSEPFTESLLERGVLVV 229
           + +  G A  +  S + + P+  +L    +L+V
Sbjct: 205 VAVLVGGAEAVQSSLESALPYGAALETSDILIV 237


>gi|255086543|ref|XP_002509238.1| predicted protein [Micromonas sp. RCC299]
 gi|226524516|gb|ACO70496.1| predicted protein [Micromonas sp. RCC299]
          Length = 476

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 82/151 (54%), Gaps = 1/151 (0%)

Query: 81  SAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLS 140
           S  +++ E   PFR  +++ +    A   +G +I + +L  A+     A ++ E +K L 
Sbjct: 189 SVQKLKAEAQEPFRFFKLYAFGGLGAGAFIGLIIILARLSAAIKGGDDAPELGETIKNLL 248

Query: 141 IDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDE-KKIISVNNLRGIARLVI 199
           I++ AVS FA+L+S E K +     +++REE L  L++ ++E K+ + V+ LRG  RL I
Sbjct: 249 INIVAVSAFAYLFSNELKQREKTEFKVAREEELGRLRMAIEESKEDVLVSQLRGNYRLFI 308

Query: 200 CSGPASYIMESFKRSEPFTESLLERGVLVVS 230
            +G   +I  +  +   +   L E+ V++ +
Sbjct: 309 IAGSDEHIDNAMGQLVKYKAKLKEQNVVIAT 339


>gi|302845050|ref|XP_002954064.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300260563|gb|EFJ44781.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 1122

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 78/150 (52%)

Query: 80  SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGL 139
           S+   +R E  +PFR  R+  +   + S ++G +I + +L+ AL     A D+ E L  L
Sbjct: 164 STKVDLRTEAKAPFRLPRIILFGGLLGSAAIGLVIIVFRLVKALQGGPEAPDLNESLTNL 223

Query: 140 SIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVI 199
            ++  AV++ ++L  R+   K   +   +REE +  L++ +   +++ +   RG  R VI
Sbjct: 224 GVNSVAVAVLSYLLYRDVSQKQEAVRITTREELMGRLQIDLGNDRVLPLLKFRGQVRPVI 283

Query: 200 CSGPASYIMESFKRSEPFTESLLERGVLVV 229
            +G  S++  + K ++    +L +R V V+
Sbjct: 284 VAGTRSFVERAIKEADSQYINLRDRAVSVI 313


>gi|15218638|ref|NP_171790.1| tetratricopeptide repeat (TPR)-containing protein [Arabidopsis
           thaliana]
 gi|6056411|gb|AAF02875.1|AC009525_9 Hypothetical protein [Arabidopsis thaliana]
 gi|332189374|gb|AEE27495.1| tetratricopeptide repeat (TPR)-containing protein [Arabidopsis
           thaliana]
          Length = 453

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 44/143 (30%), Positives = 80/143 (55%)

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
           +EV +PFR VR FFY AF A+  +     + +L+ A+     A ++ E     +I++  +
Sbjct: 190 SEVRAPFRGVRKFFYFAFAAAAGISMFFTVPRLVQAIRGGDGAPNLLETTGNAAINIGGI 249

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
            +   L+  ENK +  Q+ +++R+E+LS L LR+   +++ +  LR   R VI +G    
Sbjct: 250 VVMVSLFLWENKKEEEQMVQITRDETLSRLPLRLSTNRVVELVQLRDTVRPVILAGKKET 309

Query: 207 IMESFKRSEPFTESLLERGVLVV 229
           +  + ++++ F   LL RGVL+V
Sbjct: 310 VTLAMQKADRFRTELLRRGVLLV 332


>gi|119387903|gb|ABL73934.1| photosystem-II repair protein [Chlamydomonas reinhardtii]
          Length = 449

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 46/150 (30%), Positives = 84/150 (56%)

Query: 80  SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGL 139
            + A +R E  +PFR  R+  +   +A   LG ++ +T+L  +L     A D+ E L  L
Sbjct: 162 QTKADLRTEAKAPFRLPRLILFGGLLAGAGLGLMVILTRLAKSLQGGPDAPDLNESLTNL 221

Query: 140 SIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVI 199
            I+V+AV++ ++L  R+  +K A+L   +REE L  L++ + + +++ +   RG  R VI
Sbjct: 222 GINVTAVAVLSWLVYRDVSSKQAELRITTREEGLGRLQVDLGKDRVLPLLKFRGQVRPVI 281

Query: 200 CSGPASYIMESFKRSEPFTESLLERGVLVV 229
            +G  S++ ++ K +E    +L +R V VV
Sbjct: 282 VAGSRSFVEKAIKEAEGQYLNLRDRAVSVV 311


>gi|297848508|ref|XP_002892135.1| hypothetical protein ARALYDRAFT_887439 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337977|gb|EFH68394.1| hypothetical protein ARALYDRAFT_887439 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score = 68.2 bits (165), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 79/143 (55%)

Query: 87  NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
           +EV +PFR  R FFY AF A+  +     + +LI A+     A D+ E     +I++  +
Sbjct: 190 SEVRAPFRGFRKFFYFAFAAAAGISTFFTVPRLIQAIRGGDGAPDLLETTGNAAINIGGI 249

Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
            +   L+  ENK +  Q+ +++R+E+LS L LR+   +++ +  LR   R VI +G    
Sbjct: 250 VVLVSLFLWENKKEEEQMVQITRDETLSRLPLRLSTNRVVELVQLRDTVRPVILAGKKET 309

Query: 207 IMESFKRSEPFTESLLERGVLVV 229
           +  + ++++ F   LL RGVL+V
Sbjct: 310 VTLAMQKADRFRTELLRRGVLLV 332


>gi|384251485|gb|EIE24963.1| hypothetical protein COCSUDRAFT_83655 [Coccomyxa subellipsoidea
           C-169]
          Length = 435

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 45/154 (29%), Positives = 79/154 (51%)

Query: 86  RNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSA 145
           R E  +PFR  R+F +    A  +LG ++   +LI AL     A ++ E L   +I+ +A
Sbjct: 162 RAEAKAPFRLARLFIFGGLGAGAALGLVVIAGRLIAALKGGEGAPELTESLLNFTINAAA 221

Query: 146 VSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPAS 205
           ++ F++L+SR+ ++       + REE+L  L ++V   + + + + RG AR V+ +G + 
Sbjct: 222 LAGFSWLFSRDYQSAEKDRRMVDREEALGRLLVQVGADREVPLASFRGAARPVLLAGSSG 281

Query: 206 YIMESFKRSEPFTESLLERGVLVVSFTTDGNIPS 239
           YI  + K SE     L  RG+ V+        P 
Sbjct: 282 YISRALKASELHQTELRLRGISVIPLVLSSKDPG 315


>gi|449015970|dbj|BAM79372.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 371

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 49/202 (24%), Positives = 88/202 (43%), Gaps = 16/202 (7%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++R E  +PFR  R F Y   VAS ++  +IA++ L+    +        +  + L IDV
Sbjct: 114 QLREEAEAPFRKARQFLYAGGVASATIASVIALSSLVALTGSATATESWWQAARNLGIDV 173

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVD--EKKIISVNNLRGIARLVICS 201
           ++  L A LY  ++ A   +L R+ +   L+ L+L  +   +    + +LRG  R++I  
Sbjct: 174 TSGVLCAALYRWDDVAAKRRLERMQQGARLAALQLESEPGPRWRFPLASLRGQYRVIITV 233

Query: 202 GPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFT--------- 252
                  E   ++EP +++L++R V+ V    D    +     S E +  +         
Sbjct: 234 TSLEKAREWLAQAEPHSDALIQRSVVWVPVLWDAARQTPTVPSSAETRSASGVLDAWLVA 293

Query: 253 -----RKRKRLWQLTPVYVSEW 269
                R+    W  TP    EW
Sbjct: 294 LDAARRRANAFWIATPANPQEW 315


>gi|303284455|ref|XP_003061518.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456848|gb|EEH54148.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 514

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 39/147 (26%), Positives = 77/147 (52%), Gaps = 1/147 (0%)

Query: 85  IRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVS 144
           +R E   PFR  +++ +    A   +G +I +++L  AL     A ++ E LK L+++  
Sbjct: 218 LRAEAKEPFRFFKLYLFGGLGAGAFIGLIIILSRLAAALQGGDDAPELNETLKNLAVNAV 277

Query: 145 AVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEK-KIISVNNLRGIARLVICSGP 203
           AV+ F FL   E   +     +++REE +  L++ VDE+   ++++ LRG  R+ I +G 
Sbjct: 278 AVTTFGFLLRGELAEREKTQEKVAREEEIGRLRVTVDERGDDVTLSALRGNYRVFIIAGS 337

Query: 204 ASYIMESFKRSEPFTESLLERGVLVVS 230
             ++ E+      +   L E+ V++ +
Sbjct: 338 DEHVEETVAGLNKYKALLKEKDVVIAT 364


>gi|428171606|gb|EKX40521.1| hypothetical protein GUITHDRAFT_113307 [Guillardia theta CCMP2712]
          Length = 332

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 57/202 (28%), Positives = 93/202 (46%), Gaps = 17/202 (8%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R R E+ SPFR +R+F + A  AS +LG  IA T+L+ A++      D  + L+   IDV
Sbjct: 70  RFREEIESPFRKIRLFLFPAAAASAALGAFIAGTRLL-AISTGVTGYDFDQTLQNFGIDV 128

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRV----DEKKIISVNNLRGIARLVI 199
              + FA LY  +  A++  L R++R   LS L L++      +  + +   R   R+ I
Sbjct: 129 LGATGFALLYWNDLNARDRDLVRIARSGKLSTLTLKIVSETGREASLPMKAFRKQRRIAI 188

Query: 200 CSGPASYIMESFKRSEPFTESLLERGVLVVSFTTD--GNIPSFEFE---------ESEEM 248
            +G    +  + + +    E L + G +V+    D  G +   E E         E+ E 
Sbjct: 189 IAGGKDVVQVAREMARAGAEDLTKVGAIVLPLQLDQVGELDPAEVEGLRRAVGVPEAAEG 248

Query: 249 KEFTRKRKRLWQL-TPVYVSEW 269
           KE       L  L  P++V +W
Sbjct: 249 KEEADSESLLPCLAAPLFVEKW 270


>gi|443323256|ref|ZP_21052264.1| Protein of unknown function (DUF3493) [Gloeocapsa sp. PCC 73106]
 gi|442786994|gb|ELR96719.1| Protein of unknown function (DUF3493) [Gloeocapsa sp. PCC 73106]
          Length = 82

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 32/85 (37%), Positives = 47/85 (55%), Gaps = 7/85 (8%)

Query: 72  TDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAAD 131
           + R  PE    AR++ E+ +P++ +R F Y+AF ASG LG  + +TQL       A   D
Sbjct: 4   SSRIPPESEKYARLKAELKTPYKGLRKFVYIAFGASGLLGAFVFLTQL-------AAGQD 56

Query: 132 VPEILKGLSIDVSAVSLFAFLYSRE 156
           V  IL  L+I +  V+L  +LY  E
Sbjct: 57  VGTILPNLAIQIGVVALMIWLYRLE 81


>gi|219117770|ref|XP_002179674.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408727|gb|EEC48660.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 342

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 50/219 (22%), Positives = 91/219 (41%), Gaps = 39/219 (17%)

Query: 41  RTTPGLSISLPLHSTWTAKRSPLCSNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFF 100
           R++P   ++ PL +     + P    +                ++ +E ++P+R++R+F 
Sbjct: 38  RSSPNALLTTPLFAEKEKSKKPFDEGLRT--------------KLVSESIAPWRTLRLFL 83

Query: 101 YLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYS------ 154
           Y A  A   +GGL+ ++    A++      D+      L+ID  AV++FAFL        
Sbjct: 84  YTALGAGAFIGGLVNLSGTAAAISGARSDVDMNTEYINLAIDFGAVAVFAFLTKFDLDKQ 143

Query: 155 ---RENKAKNAQLARLS---------REESLSNLKLRVD-----EKKIISVNNLRGIAR- 196
               EN  K  +L +           RE+ L+ L + +      + K  SV  L+  AR 
Sbjct: 144 KDLNENVEKKIELKKQQKQLVKGMKEREKQLATLNVEIQVSATGDTKQASVRQLQAGARQ 203

Query: 197 -LVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTD 234
            +++ +GP     ++   +           VLVV F TD
Sbjct: 204 HMIVVAGPKKATRDALIGANLLKMDFAMSNVLVVPFETD 242


>gi|411120457|ref|ZP_11392829.1| Protein of unknown function (DUF3493) [Oscillatoriales
           cyanobacterium JSC-12]
 gi|410709126|gb|EKQ66641.1| Protein of unknown function (DUF3493) [Oscillatoriales
           cyanobacterium JSC-12]
          Length = 95

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 47/81 (58%), Gaps = 7/81 (8%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           AR++ E ++P+R +R F Y AF ASG++GGL+ + QL+          +V E L   ++ 
Sbjct: 22  ARLKAEAIAPYRGLRRFIYGAFAASGAIGGLVFLAQLLA-------GQNVQEALPNFALQ 74

Query: 143 VSAVSLFAFLYSRENKAKNAQ 163
           +  V+L  +L   ENKA+  Q
Sbjct: 75  LGVVALMIWLIRLENKAERKQ 95


>gi|170077218|ref|YP_001733856.1| hypothetical protein SYNPCC7002_A0593 [Synechococcus sp. PCC 7002]
 gi|169884887|gb|ACA98600.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
          Length = 88

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 30/80 (37%), Positives = 44/80 (55%), Gaps = 7/80 (8%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           AR+R E  +P+RS+R F YL F  SG +GG+I + Q+       A   D+ + +  L++ 
Sbjct: 11  ARLRAEAKNPYRSLRQFIYLCFGLSGGIGGMIFLLQI-------AAGRDLDQAIPNLAVQ 63

Query: 143 VSAVSLFAFLYSRENKAKNA 162
              V L  FL+  ENK K A
Sbjct: 64  AGVVGLMIFLWRWENKNKLA 83


>gi|434399338|ref|YP_007133342.1| hypothetical protein Sta7437_2856 [Stanieria cyanosphaera PCC 7437]
 gi|428270435|gb|AFZ36376.1| hypothetical protein Sta7437_2856 [Stanieria cyanosphaera PCC 7437]
          Length = 96

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 52/103 (50%), Gaps = 9/103 (8%)

Query: 63  LCSNITCSATDRSDPEIS--SAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLI 120
           +  + T   T R   ++S    AR+R E  +P+R +R FFY+AF ASG +G  + + QL 
Sbjct: 1   MSDSSTKKPTSRPKSKLSPEKYARLRAEAKAPYRGLRKFFYVAFGASGLIGAFVFLAQL- 59

Query: 121 GALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQ 163
                 A   +V   L  L + V  V+L  +L+  E K +N Q
Sbjct: 60  ------AAGREVSSTLTNLGLQVGLVALVVWLFRWEEKQENKQ 96


>gi|298706793|emb|CBJ29716.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 334

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 40/146 (27%), Positives = 72/146 (49%), Gaps = 9/146 (6%)

Query: 70  SATDRSDPEISSAA----RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALAN 125
           S  D  D EI++ A    +++ E  +PFR+ R+F Y AF  S  +GG+ ++TQL   L++
Sbjct: 33  SQADEEDDEIATPAFNIDKLKAETKTPFRTFRIFIYGAFGLSALIGGITSLTQLAATLSD 92

Query: 126 PARAADVPEILKGLSIDVSAVSLFAFLY----SRENKAKNAQLARLSREESLSNLKLRVD 181
              A ++ ++L  L +D   ++  AF Y    S++   +  + A   + + L   K+  D
Sbjct: 93  QPGALELQKVLINLLVDFGVMAAAAFCYNFETSQQEDLEQVEAANREKRQKLKATKITGD 152

Query: 182 EKKIISVNNLRGIARLVICSGPASYI 207
                 V  LR ++  V  SG  + +
Sbjct: 153 -VNADRVKQLRALSVKVPGSGSGAEL 177


>gi|425440324|ref|ZP_18820629.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
 gi|389719275|emb|CCH96867.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
          Length = 84

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 31/90 (34%), Positives = 48/90 (53%), Gaps = 9/90 (10%)

Query: 70  SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA 129
           S + R DP+    AR++ E  +P+R +R F YL   ASG +G ++ + QL       A A
Sbjct: 2   SKSPRLDPQ--KYARLKEEAANPYRGLRQFIYLGLGASGFIGAVVFLAQL-------AAA 52

Query: 130 ADVPEILKGLSIDVSAVSLFAFLYSRENKA 159
            DV   L   ++ +  V+L  +L+ RE KA
Sbjct: 53  RDVATALPNFALQIGVVALMIWLFRREKKA 82


>gi|166364756|ref|YP_001657029.1| hypothetical protein MAE_20150 [Microcystis aeruginosa NIES-843]
 gi|390438651|ref|ZP_10227099.1| conserved hypothetical protein [Microcystis sp. T1-4]
 gi|166087129|dbj|BAG01837.1| hypothetical protein MAE_20150 [Microcystis aeruginosa NIES-843]
 gi|389837940|emb|CCI31223.1| conserved hypothetical protein [Microcystis sp. T1-4]
          Length = 84

 Score = 53.9 bits (128), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 31/90 (34%), Positives = 48/90 (53%), Gaps = 9/90 (10%)

Query: 70  SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA 129
           S + R DP+    AR++ E  +P+R +R F YL   ASG +G ++ + QL       A A
Sbjct: 2   SKSPRLDPQ--KYARLKAEAANPYRGLRQFIYLGLGASGFIGAVVFLAQL-------AAA 52

Query: 130 ADVPEILKGLSIDVSAVSLFAFLYSRENKA 159
            DV   L   ++ +  V+L  +L+ RE KA
Sbjct: 53  RDVATALPNFALQIGVVALMIWLFRREKKA 82


>gi|425444905|ref|ZP_18824946.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
 gi|425455682|ref|ZP_18835397.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
 gi|389735246|emb|CCI01219.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
 gi|389803405|emb|CCI17680.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
          Length = 84

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 9/90 (10%)

Query: 70  SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA 129
           S + R DP+    AR++ E  +P+R +R F YL   ASG +G ++ + QL       A  
Sbjct: 2   SKSPRLDPQ--KYARLKEEAANPYRGLRQFLYLGLGASGFIGAVVFLAQL-------AAG 52

Query: 130 ADVPEILKGLSIDVSAVSLFAFLYSRENKA 159
            DV   L   ++ +  V+L  +L+ RE KA
Sbjct: 53  RDVATALPNFALQIGVVALMIWLFRREKKA 82


>gi|223998698|ref|XP_002289022.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976130|gb|EED94458.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1250

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 46/171 (26%), Positives = 76/171 (44%), Gaps = 25/171 (14%)

Query: 88   EVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAVS 147
            E ++PFR +R+FFY AF +   +GGLI  + +  AL+      D+      L ID  AV 
Sbjct: 973  ETIAPFRGLRLFFYAAFASGAFVGGLITASGVAAALSGVRTDVDLNAEYLNLGIDFGAVL 1032

Query: 148  LFAFL------YSRENKAK------------NAQLARLSREESLSNLKLRV-----DEKK 184
            LFA L         E KA+                A   RE+  + L+L V      E +
Sbjct: 1033 LFAVLAKFDLDKGEELKAQVEQKIENKKKQKKITKAMREREKYFAELELDVRVSDDGETR 1092

Query: 185  IISVNNLRGIAR--LVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTT 233
              SV  ++  A+  +++ +GP   I ++ + ++    +     +LVV + T
Sbjct: 1093 RASVGTMQDKAKQHVILVAGPGRVIRDALRGAQLNKVNFAMTNILVVPYET 1143


>gi|425434849|ref|ZP_18815313.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
 gi|425448846|ref|ZP_18828690.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
 gi|159026365|emb|CAO88916.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|389675573|emb|CCH95320.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
 gi|389767707|emb|CCI06979.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
          Length = 84

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 9/90 (10%)

Query: 70  SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA 129
           S + R DP+    AR++ E  +P+R +R F YL   ASG +G ++ + QL       A  
Sbjct: 2   SKSPRLDPQ--KYARLKEEAANPYRGLRQFIYLGLGASGFIGAVVFLAQL-------AAG 52

Query: 130 ADVPEILKGLSIDVSAVSLFAFLYSRENKA 159
            DV   L   ++ +  V+L  +L+ RE KA
Sbjct: 53  RDVATALPNFALQIGVVALMIWLFRREKKA 82


>gi|425469435|ref|ZP_18848372.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
 gi|389880879|emb|CCI38493.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
          Length = 84

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 9/90 (10%)

Query: 70  SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA 129
           S + R DP+    AR++ E  +P+R +R F YL   ASG +G ++ + QL       A  
Sbjct: 2   SKSTRLDPQ--KYARLKEEAANPYRGLRQFIYLGLGASGFIGAVVFLAQL-------AVG 52

Query: 130 ADVPEILKGLSIDVSAVSLFAFLYSRENKA 159
            DV   L   ++ +  V+L  +L+ RE KA
Sbjct: 53  RDVATALPNFALQIGVVALMIWLFRREKKA 82


>gi|422302284|ref|ZP_16389647.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
 gi|425466926|ref|ZP_18846220.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
 gi|389788535|emb|CCI15744.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
 gi|389830425|emb|CCI27635.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
          Length = 84

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 9/90 (10%)

Query: 70  SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA 129
           S + R DP+    AR++ E  +P+R +R F YL   ASG +G ++ + QL       A  
Sbjct: 2   SKSPRLDPQ--KYARLKAEAANPYRGLRQFIYLGLGASGFIGAVVFLAQL-------AAG 52

Query: 130 ADVPEILKGLSIDVSAVSLFAFLYSRENKA 159
            DV   L   ++ +  V+L  +L+ RE KA
Sbjct: 53  RDVATALPNFALQIGVVALMIWLFRREKKA 82


>gi|149391123|gb|ABR25579.1| tpr domain containing protein [Oryza sativa Indica Group]
          Length = 192

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 45/72 (62%)

Query: 158 KAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASYIMESFKRSEPF 217
           K +  Q+ ++SR E+LS L +R+   +II +  LR I R VI +G  + I  + +R+E +
Sbjct: 1   KKEEQQITQISRNETLSRLPVRLSTNRIIELVQLRDITRPVILAGSKASITRAMQRAERY 60

Query: 218 TESLLERGVLVV 229
              LL+RGVL++
Sbjct: 61  RTELLKRGVLLI 72


>gi|307151324|ref|YP_003886708.1| hypothetical protein Cyan7822_1436 [Cyanothece sp. PCC 7822]
 gi|306981552|gb|ADN13433.1| conserved hypothetical protein [Cyanothece sp. PCC 7822]
          Length = 91

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 48/85 (56%), Gaps = 9/85 (10%)

Query: 76  DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
           DPE    AR++ E  +P++  R+FFY AF ASG +G ++ + +L       A  A+V E 
Sbjct: 16  DPE--KLARLKAEASAPYKGFRIFFYFAFGASGLIGAVVFLAKL-------AAGANVTEA 66

Query: 136 LKGLSIDVSAVSLFAFLYSRENKAK 160
           L   ++ V  V+L  +L+  E K+K
Sbjct: 67  LPNFALQVGLVALMIWLFRLEQKSK 91


>gi|428298237|ref|YP_007136543.1| hypothetical protein Cal6303_1524 [Calothrix sp. PCC 6303]
 gi|428234781|gb|AFZ00571.1| hypothetical protein Cal6303_1524 [Calothrix sp. PCC 6303]
          Length = 97

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 9/89 (10%)

Query: 65  SNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALA 124
           SN       + DPE    AR++ E  +PFRS+R FFY +F ASG +G  I + Q +    
Sbjct: 4   SNSQNRNPKKLDPE--KYARLKAEATAPFRSLRKFFYFSFGASGMIGAFIFLMQFLA--- 58

Query: 125 NPARAADVPEILKGLSIDVSAVSLFAFLY 153
                 +  + L+ L++ +  V L  FL+
Sbjct: 59  ----GRNTEDTLQNLALQIGVVFLMGFLW 83


>gi|219119559|ref|XP_002180537.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408010|gb|EEC47945.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 395

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 72/136 (52%), Gaps = 17/136 (12%)

Query: 80  SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA-----ADVP- 133
           ++  R+R EV SPFR+VR+FF+ +   S       A+T L  +L N  +A      D P 
Sbjct: 116 TAKQRLREEVESPFRTVRLFFFGSSTGS-------ALTALYFSLLNILKAKSGLYTDGPS 168

Query: 134 --EILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNL 191
             + L+  +I+V+AV +  +L  R+ KA    LAR+++  +L+ L +     + ++  + 
Sbjct: 169 LDDALQSTAINVAAVLICGYLTYRDWKAGETNLARIAKGGALAQLVVNGPSPRTLA--SY 226

Query: 192 RGIARLVICSGPASYI 207
           R  AR++I  G   YI
Sbjct: 227 RRNARVLIAVGGDGYI 242


>gi|223992929|ref|XP_002286148.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977463|gb|EED95789.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 337

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/133 (32%), Positives = 67/133 (50%), Gaps = 7/133 (5%)

Query: 80  SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVP---EIL 136
           ++  R+R E+ SPFR VR+ F+    AS S+   +  + L    AN    +DVP   E L
Sbjct: 60  TTKQRLREEIESPFRKVRLAFFSFSGASASVA--LYFSALAALKANMGGFSDVPPLNEAL 117

Query: 137 KGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLR--VDEKKIISVNNLRGI 194
           +  +I++     F  L  RE K   A L R+++   L+ L +   VD +   S+   R  
Sbjct: 118 ETCAINLGGAVGFGALAVRELKVGQANLERIAKGGLLARLIVESGVDGESKRSLKEYRRA 177

Query: 195 ARLVICSGPASYI 207
           +R+VI +G A YI
Sbjct: 178 SRVVIAAGGADYI 190


>gi|254410762|ref|ZP_05024540.1| hypothetical protein MC7420_240 [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196182117|gb|EDX77103.1| hypothetical protein MC7420_240 [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 99

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 44/78 (56%), Gaps = 7/78 (8%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           AR++ E  +P+R +R F Y+AF ASG +GGL+ + QLI          +V      L++ 
Sbjct: 26  ARLKAEAKAPYRGLRKFIYVAFAASGFIGGLVFLAQLIA-------GREVGTAFPNLALQ 78

Query: 143 VSAVSLFAFLYSRENKAK 160
           V  V+L  +L+  E +A+
Sbjct: 79  VGIVALMIWLFRLEQRAE 96


>gi|218441759|ref|YP_002380088.1| hypothetical protein PCC7424_4863 [Cyanothece sp. PCC 7424]
 gi|218174487|gb|ACK73220.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
          Length = 91

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 46/84 (54%), Gaps = 9/84 (10%)

Query: 76  DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
           DPE    AR++ E  +P++  RMFFY+AF ASG +G +I + QL+          +V   
Sbjct: 16  DPE--KYARLKAEAKAPYKGFRMFFYIAFGASGFIGAVIFLAQLVA-------GRNVSSA 66

Query: 136 LKGLSIDVSAVSLFAFLYSRENKA 159
           L   ++ V  V+L  +L+  E K+
Sbjct: 67  LPNFALQVGLVALMIWLFRLEQKS 90


>gi|416394633|ref|ZP_11686216.1| hypothetical protein CWATWH0003_3017 [Crocosphaera watsonii WH
           0003]
 gi|357263229|gb|EHJ12263.1| hypothetical protein CWATWH0003_3017 [Crocosphaera watsonii WH
           0003]
          Length = 86

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 28/83 (33%), Positives = 45/83 (54%), Gaps = 9/83 (10%)

Query: 76  DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
           DP+  +A  +R E  +P++ +R FFY+AF ASG +G L+ + QL       A   DV   
Sbjct: 13  DPQKYAA--LRAEAKAPYKGLRKFFYIAFAASGFVGALVFLAQL-------AAGKDVTTA 63

Query: 136 LKGLSIDVSAVSLFAFLYSRENK 158
               ++ +  V+L  FL+  E+K
Sbjct: 64  FPNFALQIGVVALMIFLFRLESK 86


>gi|67925300|ref|ZP_00518658.1| hypothetical protein CwatDRAFT_1425 [Crocosphaera watsonii WH 8501]
 gi|67852844|gb|EAM48245.1| hypothetical protein CwatDRAFT_1425 [Crocosphaera watsonii WH 8501]
          Length = 86

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 28/83 (33%), Positives = 45/83 (54%), Gaps = 9/83 (10%)

Query: 76  DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
           DP+  +A  +R E  +P++ +R FFY+AF ASG +G L+ + QL       A   DV   
Sbjct: 13  DPQKYAA--LRAEAKAPYKGLRKFFYIAFAASGFVGALVFLAQL-------AAGKDVTTA 63

Query: 136 LKGLSIDVSAVSLFAFLYSRENK 158
               ++ +  V+L  FL+  E+K
Sbjct: 64  FPNFALQIGVVALMIFLFRLESK 86


>gi|425459047|ref|ZP_18838533.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
 gi|389823287|emb|CCI28619.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
          Length = 84

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 7/77 (9%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           AR++ E  +P+R +R F YL   ASG +G ++ + QL       A   DV   L   ++ 
Sbjct: 13  ARLKEEAANPYRGLRQFIYLGLGASGFIGAVVFLAQL-------AAGRDVATALPNFALQ 65

Query: 143 VSAVSLFAFLYSRENKA 159
           +  V+L  +L+ RE KA
Sbjct: 66  IGVVALMIWLFRREKKA 82


>gi|186685813|ref|YP_001869009.1| hypothetical protein Npun_F5768 [Nostoc punctiforme PCC 73102]
 gi|186468265|gb|ACC84066.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
          Length = 97

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 48/92 (52%), Gaps = 9/92 (9%)

Query: 67  ITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANP 126
           +  +  +R +PE    A ++ E+ +P+R +R F Y+AF ASGS+G  +   Q++      
Sbjct: 2   VDQNLKNRLNPE--QYASLKAEIAAPYRGLRQFIYIAFGASGSIGAFVFFFQVLA----- 54

Query: 127 ARAADVPEILKGLSIDVSAVSLFAFLYSRENK 158
               DV   L  L++ V  V+L  FL+  E +
Sbjct: 55  --GRDVESALPSLALQVGIVALMVFLWRWEQR 84


>gi|427724820|ref|YP_007072097.1| hypothetical protein Lepto7376_3022 [Leptolyngbya sp. PCC 7376]
 gi|427356540|gb|AFY39263.1| hypothetical protein Lepto7376_3022 [Leptolyngbya sp. PCC 7376]
          Length = 85

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 7/77 (9%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R+R E  +P+R++R F Y+ F  SGS+GGL+ I Q++          D+   +    I  
Sbjct: 12  RLRTEAKNPYRTLRQFVYVCFGISGSVGGLVFIGQILA-------GRDLANAIPNFGIQA 64

Query: 144 SAVSLFAFLYSRENKAK 160
             V L  FL+  E+K K
Sbjct: 65  GVVGLMIFLWRWEDKNK 81


>gi|428184948|gb|EKX53802.1| hypothetical protein GUITHDRAFT_160931, partial [Guillardia theta
           CCMP2712]
          Length = 244

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 61/128 (47%), Gaps = 4/128 (3%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++  E  SP R  R FFY A   S SLG LI+  ++  AL        + E     +I++
Sbjct: 109 KLLEEQASPLRKFRQFFYAAACISASLGTLISGARIFAALQGIQGVQPLSESSTNTAINL 168

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSRE----ESLSNLKLRVDEKKIISVNNLRGIARLVI 199
             V   A L+  E+      L  L  +    E L+NL L + +  + ++ ++R  +RLVI
Sbjct: 169 GVVVTSAALWIWEDGKGKETLKLLQDKSGMREKLANLNLELADGTLKTMRDMRDESRLVI 228

Query: 200 CSGPASYI 207
            +G A+ +
Sbjct: 229 LAGTAAEV 236


>gi|119494064|ref|ZP_01624616.1| hypothetical protein L8106_04221 [Lyngbya sp. PCC 8106]
 gi|119452201|gb|EAW33403.1| hypothetical protein L8106_04221 [Lyngbya sp. PCC 8106]
          Length = 98

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 7/77 (9%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           AR++ E+ +P+R +R F Y+ F ASG +G L+ + Q++          +V   L  L++ 
Sbjct: 21  ARLKAELKTPYRGLRQFIYVGFGASGFIGALVFLAQVLA-------GREVASALPNLALQ 73

Query: 143 VSAVSLFAFLYSRENKA 159
           V  V+L  +L+  EN+A
Sbjct: 74  VGVVALMVWLFRLENRA 90


>gi|332706278|ref|ZP_08426346.1| hypothetical protein LYNGBM3L_16170 [Moorea producens 3L]
 gi|332354983|gb|EGJ34455.1| hypothetical protein LYNGBM3L_16170 [Moorea producens 3L]
          Length = 99

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 48/88 (54%), Gaps = 9/88 (10%)

Query: 73  DRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADV 132
            + +PE    AR++ E   P+R +R F Y+AF ASG +G ++ + QL    A  A A  +
Sbjct: 18  QKINPE--KYARLKAEAQHPYRGLRKFIYIAFGASGFIGAMVFLAQLA---AGKAVATAM 72

Query: 133 PEILKGLSIDVSAVSLFAFLYSRENKAK 160
           P     L++ +  V L  +L+  ENKA+
Sbjct: 73  PN----LALQIGVVMLMVWLFRLENKAE 96


>gi|440755472|ref|ZP_20934674.1| putative membrane protein [Microcystis aeruginosa TAIHU98]
 gi|440175678|gb|ELP55047.1| putative membrane protein [Microcystis aeruginosa TAIHU98]
          Length = 84

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 7/77 (9%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           AR++ E  +P+R +R F YL   ASG +G ++ + QL       A   DV   L   ++ 
Sbjct: 13  ARLKEEAANPYRGLRQFIYLGLGASGFIGAVVFLAQL-------AAGRDVATALPNFALQ 65

Query: 143 VSAVSLFAFLYSRENKA 159
           +  V+L  +L+ RE KA
Sbjct: 66  IGVVALMIWLFRREKKA 82


>gi|282896663|ref|ZP_06304671.1| hypothetical protein CRD_01368 [Raphidiopsis brookii D9]
 gi|281198381|gb|EFA73269.1| hypothetical protein CRD_01368 [Raphidiopsis brookii D9]
          Length = 79

 Score = 49.3 bits (116), Expect = 0.002,   Method: Composition-based stats.
 Identities = 25/85 (29%), Positives = 44/85 (51%), Gaps = 9/85 (10%)

Query: 76  DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
           DP I +  R++ E+ SP++ +R F Y+   ASG +G  +   Q++          D+ + 
Sbjct: 3   DPNIKN--RLKAEIASPYKGLRQFVYIGMGASGLIGAFVFFFQILA-------GRDIDQA 53

Query: 136 LKGLSIDVSAVSLFAFLYSRENKAK 160
           L   ++ V  V++ AFL+  E K K
Sbjct: 54  LPNFALQVGIVAIMAFLWKWERKGK 78


>gi|387202097|gb|AFJ68933.1| hypothetical protein NGATSA_2062300, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 213

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 49/101 (48%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++R+E  +PFRS+R+F Y       SLGG  A+TQL+ +L +   A  + + +  + +D 
Sbjct: 105 KLRSEATTPFRSLRLFIYGGIGVGASLGGYTAVTQLVQSLQHRPDALPLSQSILNVGVDF 164

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKK 184
              +  A  +  EN+ K      +    +    +L  D KK
Sbjct: 165 GVAAAAAGAWVFENRLKQVTDKEMQALRAKEPFRLTFDMKK 205


>gi|428210032|ref|YP_007094385.1| hypothetical protein Chro_5144 [Chroococcidiopsis thermalis PCC
           7203]
 gi|428011953|gb|AFY90516.1| hypothetical protein Chro_5144 [Chroococcidiopsis thermalis PCC
           7203]
          Length = 92

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 9/92 (9%)

Query: 67  ITCSATDRSDPEISSA--ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALA 124
           I  +  +R D  ++S   AR+R E  +P+R +R F Y +F ASG +G +I + QL     
Sbjct: 2   IEQNPKNRGDKRLTSEQYARLRAEAAAPYRGLRKFIYFSFGASGLIGAVIFLAQL----- 56

Query: 125 NPARAADVPEILKGLSIDVSAVSLFAFLYSRE 156
             A   +V   L   ++ V  V+L  FL+  E
Sbjct: 57  --AAGQNVSNALPNFALQVGIVALMIFLFRLE 86


>gi|428774687|ref|YP_007166474.1| hypothetical protein PCC7418_0001 [Halothece sp. PCC 7418]
 gi|428688966|gb|AFZ42260.1| hypothetical protein PCC7418_0001 [Halothece sp. PCC 7418]
          Length = 90

 Score = 47.8 bits (112), Expect = 0.005,   Method: Composition-based stats.
 Identities = 24/78 (30%), Positives = 44/78 (56%), Gaps = 7/78 (8%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R++ E  +P+R +R   Y+AF ASG +G  + + QL       A   +V E L  L++ V
Sbjct: 19  RLQAEAKAPYRGLRKTIYVAFAASGLIGAFVMVAQL-------AAGQNVTETLPNLALQV 71

Query: 144 SAVSLFAFLYSRENKAKN 161
             V+L  +L+ ++ +A++
Sbjct: 72  GVVALMVWLFRQDTEAED 89


>gi|218247693|ref|YP_002373064.1| hypothetical protein PCC8801_2911 [Cyanothece sp. PCC 8801]
 gi|257060970|ref|YP_003138858.1| hypothetical protein Cyan8802_3185 [Cyanothece sp. PCC 8802]
 gi|218168171|gb|ACK66908.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
 gi|256591136|gb|ACV02023.1| hypothetical protein Cyan8802_3185 [Cyanothece sp. PCC 8802]
          Length = 91

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 47/95 (49%), Gaps = 9/95 (9%)

Query: 65  SNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALA 124
           SN       + DPE    AR++ E  +P++ +R FFY+AF ASG +G L+   +L     
Sbjct: 6   SNKPRPPLSQQDPE--KYARLKAEAQAPYKVLRQFFYIAFGASGFIGALVFFAKL----- 58

Query: 125 NPARAADVPEILKGLSIDVSAVSLFAFLYSRENKA 159
             A      E L   ++ V  V+L  +L+  E K+
Sbjct: 59  --ASGRGGVEALPNFALQVGVVALMVWLFRLEQKS 91


>gi|428305277|ref|YP_007142102.1| hypothetical protein Cri9333_1705 [Crinalium epipsammum PCC 9333]
 gi|428246812|gb|AFZ12592.1| hypothetical protein Cri9333_1705 [Crinalium epipsammum PCC 9333]
          Length = 88

 Score = 47.4 bits (111), Expect = 0.007,   Method: Composition-based stats.
 Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 9/85 (10%)

Query: 74  RSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVP 133
           +++PE    AR++ E+ SP++ +R   YLA  AS  +GG+I I Q+       A    V 
Sbjct: 9   KTNPE--QYARLKAEIASPYKGLRQVIYLACAASSFIGGMIFIVQI-------AAGRQVE 59

Query: 134 EILKGLSIDVSAVSLFAFLYSRENK 158
             L  L++ +  ++L  FL+  E K
Sbjct: 60  SALPNLAVQIGVLALMIFLFRLEQK 84


>gi|428225951|ref|YP_007110048.1| hypothetical protein GEI7407_2521 [Geitlerinema sp. PCC 7407]
 gi|427985852|gb|AFY66996.1| hypothetical protein GEI7407_2521 [Geitlerinema sp. PCC 7407]
          Length = 101

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 46/88 (52%), Gaps = 10/88 (11%)

Query: 76  DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
           DPE  +  R+R E  +P++ +R F Y+AF ASG +G  I +TQ++          DV   
Sbjct: 17  DPE--AFERLRAEAAAPYKGLRQFIYVAFAASGGIGSFIFLTQILA-------GRDVSSA 67

Query: 136 LKGLSIDVSAVSLFAFLYSRENKA-KNA 162
           L   ++ V  V+L   L   E ++ KNA
Sbjct: 68  LPNFALQVGVVALMIALLRWERRSPKNA 95


>gi|209525694|ref|ZP_03274231.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|376003116|ref|ZP_09780931.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423067100|ref|ZP_17055890.1| hypothetical protein SPLC1_S531800 [Arthrospira platensis C1]
 gi|209493863|gb|EDZ94181.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|375328441|emb|CCE16684.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406711386|gb|EKD06587.1| hypothetical protein SPLC1_S531800 [Arthrospira platensis C1]
          Length = 93

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 47/88 (53%), Gaps = 9/88 (10%)

Query: 76  DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
           DPE+    R++ E  +P+R +R F Y++F ASG +G ++ + QL+          DV   
Sbjct: 15  DPELYE--RLKAESKAPYRGLRQFIYVSFAASGLIGAVVFLAQLLS-------GRDVATA 65

Query: 136 LKGLSIDVSAVSLFAFLYSRENKAKNAQ 163
           L   ++ V  V+L  +L+  E ++  ++
Sbjct: 66  LPNFALQVGVVALMVWLFRLEQRSPKSK 93


>gi|218197863|gb|EEC80290.1| hypothetical protein OsI_22302 [Oryza sativa Indica Group]
          Length = 120

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 35/62 (56%), Gaps = 6/62 (9%)

Query: 206 YIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFTRKRKRLWQLTPVY 265
           ++ ESF RS+P    L+E GVL V F+TDGN P     + +      +K+  +   +PVY
Sbjct: 10  FVTESFHRSKPLLTDLMELGVLAVPFSTDGNTPDLRLLDGQ------KKQANVSPDSPVY 63

Query: 266 VS 267
           +S
Sbjct: 64  LS 65


>gi|172036842|ref|YP_001803343.1| hypothetical protein cce_1927 [Cyanothece sp. ATCC 51142]
 gi|354554648|ref|ZP_08973952.1| Protein of unknown function DUF3493 [Cyanothece sp. ATCC 51472]
 gi|171698296|gb|ACB51277.1| conserved hypothetical protein [Cyanothece sp. ATCC 51142]
 gi|353553457|gb|EHC22849.1| Protein of unknown function DUF3493 [Cyanothece sp. ATCC 51472]
          Length = 89

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 45/83 (54%), Gaps = 9/83 (10%)

Query: 76  DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
           DPE    A ++ E  +P++ +R FFY+AF ASG +G ++ + +L       A   +V   
Sbjct: 16  DPE--KYASLKAEAEAPYKGLRKFFYIAFGASGFIGAVVFLAKL-------AAGKEVAST 66

Query: 136 LKGLSIDVSAVSLFAFLYSRENK 158
           L   ++ V  V+L  FL+  E+K
Sbjct: 67  LPNFALQVGVVALMIFLFRLESK 89


>gi|409992301|ref|ZP_11275500.1| hypothetical protein APPUASWS_14543 [Arthrospira platensis str.
           Paraca]
 gi|291566639|dbj|BAI88911.1| hypothetical protein [Arthrospira platensis NIES-39]
 gi|409936855|gb|EKN78320.1| hypothetical protein APPUASWS_14543 [Arthrospira platensis str.
           Paraca]
          Length = 93

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 47/88 (53%), Gaps = 9/88 (10%)

Query: 76  DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
           DPE+    R++ E  +P+R +R F Y++F ASG +G ++ + QL+          DV   
Sbjct: 15  DPELYE--RLKAESKAPYRGLRQFIYVSFAASGLIGAVVFLAQLLS-------GRDVGTA 65

Query: 136 LKGLSIDVSAVSLFAFLYSRENKAKNAQ 163
           L   ++ V  V+L  +L+  E ++  ++
Sbjct: 66  LPNFALQVGVVALMVWLFRLEQRSPKSK 93


>gi|126658399|ref|ZP_01729548.1| hypothetical protein CY0110_27605 [Cyanothece sp. CCY0110]
 gi|126620331|gb|EAZ91051.1| hypothetical protein CY0110_27605 [Cyanothece sp. CCY0110]
          Length = 89

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 44/83 (53%), Gaps = 9/83 (10%)

Query: 76  DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
           DPE    A ++ E  +P++  R FFY+AF ASG +G ++ + +L       A   +V   
Sbjct: 16  DPE--KYASLKAEAQAPYKGFRKFFYIAFAASGFIGAVVFLAKL-------AAGKEVATA 66

Query: 136 LKGLSIDVSAVSLFAFLYSRENK 158
           L   ++ +  V+L  FL+  E+K
Sbjct: 67  LPNFALQIGVVALMIFLFRLESK 89


>gi|427734970|ref|YP_007054514.1| hypothetical protein Riv7116_1404 [Rivularia sp. PCC 7116]
 gi|427370011|gb|AFY53967.1| Protein of unknown function (DUF3493) [Rivularia sp. PCC 7116]
          Length = 94

 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 39/71 (54%), Gaps = 6/71 (8%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           AR++ E  SP+R +R F YL+F ASG +G  I + QL+      A    +   L  L++ 
Sbjct: 21  ARLKAEAASPYRGLRKFVYLSFGASGLIGAFIFVLQLL------AHKTSLEVGLPSLALQ 74

Query: 143 VSAVSLFAFLY 153
           V  V+L  FL+
Sbjct: 75  VGVVTLMIFLW 85


>gi|323451949|gb|EGB07824.1| hypothetical protein AURANDRAFT_71677 [Aureococcus anophagefferens]
          Length = 780

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 61/116 (52%), Gaps = 6/116 (5%)

Query: 79  ISSA--ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEIL 136
           IS+A  +++  E  +P+R+VR FFY AF  S ++GG+ A+ Q   ++A    A  +P   
Sbjct: 46  ISAAQKSKLLQEAQTPWRTVRTFFYGAFGLSATVGGITALAQTAASVAGQPDA--LPLSQ 103

Query: 137 KGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLR 192
            G+++ V A  + A LY   N   N + A ++  +  + L   V E +  +V  LR
Sbjct: 104 CGVNVLVDAGVVAACLYG--NSVDNGEAADITVADETAPLADDVAEARRDAVAKLR 157


>gi|452825495|gb|EME32491.1| hypothetical protein Gasu_02660 [Galdieria sulphuraria]
          Length = 295

 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 31/129 (24%), Positives = 63/129 (48%), Gaps = 9/129 (6%)

Query: 67  ITCSATDRSDPEISSAA---------RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAIT 117
           +  +A+D  DP++ S++         R++ E  +PFR  RMF Y     S  +G LI+  
Sbjct: 45  MKTNASDSDDPKVYSSSESTSPSMFLRLQEEAQAPFRKFRMFIYGGAFVSSVIGFLISSL 104

Query: 118 QLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLK 177
           QL+  +     +  + E +  + I+   ++  + LY  + +A   +L RLS   S+  L 
Sbjct: 105 QLLSGILGTTTSYPIRESVYNMLINGIVIATASLLYFFDYQAGKQRLDRLSVLSSIRMLP 164

Query: 178 LRVDEKKII 186
           +  ++++ +
Sbjct: 165 VEYNQRQYL 173


>gi|397642957|gb|EJK75565.1| hypothetical protein THAOC_02707 [Thalassiosira oceanica]
          Length = 354

 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 50/222 (22%), Positives = 95/222 (42%), Gaps = 31/222 (13%)

Query: 40  SRTTPGLSISLPLHSTWTAKRSPLCSNITCSATDRS---DPEISSAARIRNEVLSPFRSV 96
           SR    L++++P  +T + +R P  + +      RS   D  +S+  ++  E ++P+R +
Sbjct: 26  SRRRLSLNLNVPTAATASPRRRP-TTELNQEDKKRSGGFDKGVSN--KLLEETVAPWRGL 82

Query: 97  RMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFL---- 152
           R+F Y A  +   + G+   + +I A+       D+      L+I+  AV LFA L    
Sbjct: 83  RLFLYGAAASGAFVSGMFTGSGVIAAMGGVRDDVDLNAEYVNLAINFGAVLLFAVLAKLD 142

Query: 153 --------------YSRENKAKNAQLARLSREESLSNLKL--RVDE-----KKIISVNNL 191
                           R+ K KN   +   RE  L +L L  RV E     +  + V   
Sbjct: 143 LDKGAELSASIEEKVERKKKIKNMTKSMRERERQLRDLSLNVRVTEDGELREAPVGVMQD 202

Query: 192 RGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTT 233
           +    +++ +GP   I ++ + ++    +     +LVV + T
Sbjct: 203 KAKQHIILVAGPGRAIRDALRGAQLNKVNFAMTNILVVPYET 244


>gi|428203982|ref|YP_007082571.1| hypothetical protein Ple7327_3856 [Pleurocapsa sp. PCC 7327]
 gi|427981414|gb|AFY79014.1| Protein of unknown function (DUF3493) [Pleurocapsa sp. PCC 7327]
          Length = 95

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 46/92 (50%), Gaps = 10/92 (10%)

Query: 65  SNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALA 124
           S  T S   R DP+    AR+  E  +P++ +R F YLAF +SG +G L+ + QL     
Sbjct: 11  SKATNSLKQR-DPQ--KYARLMAEAQAPYKGLRAFIYLAFGSSGLIGALVFLAQL----- 62

Query: 125 NPARAADVPEILKGLSIDVSAVSLFAFLYSRE 156
             A   DV   L   ++ V  V+L  +L+  E
Sbjct: 63  --AAGKDVTSALPNFALQVGLVALMVWLFRWE 92


>gi|434388883|ref|YP_007099494.1| Protein of unknown function (DUF3493) [Chamaesiphon minutus PCC
           6605]
 gi|428019873|gb|AFY95967.1| Protein of unknown function (DUF3493) [Chamaesiphon minutus PCC
           6605]
          Length = 89

 Score = 46.2 bits (108), Expect = 0.014,   Method: Composition-based stats.
 Identities = 30/89 (33%), Positives = 48/89 (53%), Gaps = 10/89 (11%)

Query: 73  DRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADV 132
           D  DPE+    R++ E  SPFR +R F Y++   SG++GGL+   +L+          ++
Sbjct: 11  DSIDPELWR--RLQAEAKSPFRGLRQFVYVSCAISGAVGGLVFFFKLLA-------GREL 61

Query: 133 PEILKGLSIDVSAVSLFAFLYSRENKAKN 161
              +  L+I V  V+L  FL  R +KAK+
Sbjct: 62  ETTIPNLAIQVGVVALMVFLL-RIDKAKD 89


>gi|434392334|ref|YP_007127281.1| hypothetical protein Glo7428_1560 [Gloeocapsa sp. PCC 7428]
 gi|428264175|gb|AFZ30121.1| hypothetical protein Glo7428_1560 [Gloeocapsa sp. PCC 7428]
          Length = 98

 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 26/102 (25%), Positives = 53/102 (51%), Gaps = 9/102 (8%)

Query: 61  SPLCSNITCSATDRSDPEIS--SAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQ 118
           +   S +  ++ +RS+   +    AR++ E  +P+R +R F Y++F ASG +G L+ + Q
Sbjct: 4   AAFFSMVKPTSKNRSNKRFTPEQYARLKAEAAAPYRGLRKFIYISFGASGFIGALVFLAQ 63

Query: 119 LIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAK 160
           L       A   DV   L   ++ +  ++L  FL+  + +++
Sbjct: 64  L-------AAGRDVETALPNFALQLGVIALMVFLFRWDRESR 98


>gi|56752086|ref|YP_172787.1| hypothetical protein syc2077_c [Synechococcus elongatus PCC 6301]
 gi|81300827|ref|YP_401035.1| hypothetical protein Synpcc7942_2018 [Synechococcus elongatus PCC
           7942]
 gi|56687045|dbj|BAD80267.1| hypothetical protein [Synechococcus elongatus PCC 6301]
 gi|81169708|gb|ABB58048.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 81

 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 46/83 (55%), Gaps = 9/83 (10%)

Query: 76  DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
           DP+    AR++ EV++P+R +R  FY  F ASG++GG I    L+ A+A  + A  +P  
Sbjct: 5   DPQ--QRARLQAEVVAPYRGLRRTFYAVFAASGAIGGFIF---LLKAIAGESLATTLPN- 58

Query: 136 LKGLSIDVSAVSLFAFLYSRENK 158
              L++ +  V+L   L   E K
Sbjct: 59  ---LALQLGVVALMVSLLRLEQK 78


>gi|428772409|ref|YP_007164197.1| hypothetical protein Cyast_0570 [Cyanobacterium stanieri PCC 7202]
 gi|428686688|gb|AFZ46548.1| hypothetical protein Cyast_0570 [Cyanobacterium stanieri PCC 7202]
          Length = 95

 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 9/91 (9%)

Query: 68  TCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPA 127
             S   ++DPE    A ++ E  +P+R +R F Y+ F ASG++G  I   Q++       
Sbjct: 14  NGSNLKQTDPE--KYAYLKAEAQAPYRGLRKFIYIGFGASGAIGAFIFFAQILA------ 65

Query: 128 RAADVPEILKGLSIDVSAVSLFAFLYSRENK 158
              +V + +  L + +  + L  FL+  E K
Sbjct: 66  -GKNVQDNISNLLVQLGVIGLMIFLFRLEKK 95


>gi|443311291|ref|ZP_21040921.1| Protein of unknown function (DUF3493) [Synechocystis sp. PCC 7509]
 gi|442778614|gb|ELR88877.1| Protein of unknown function (DUF3493) [Synechocystis sp. PCC 7509]
          Length = 91

 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 40/74 (54%), Gaps = 5/74 (6%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R++ E  +P+R +R F Y+AF ASG +G LI + Q+          +D P     L I V
Sbjct: 17  RLKAEAAAPYRGLRRFIYIAFGASGFVGALIFLAQIAA-----GNHSDAPLSNLALQIGV 71

Query: 144 SAVSLFAFLYSREN 157
            A+ +F F + +E+
Sbjct: 72  VAIMIFLFRWDKES 85


>gi|334118614|ref|ZP_08492703.1| hypothetical protein MicvaDRAFT_3339 [Microcoleus vaginatus FGP-2]
 gi|333459621|gb|EGK88234.1| hypothetical protein MicvaDRAFT_3339 [Microcoleus vaginatus FGP-2]
          Length = 102

 Score = 45.4 bits (106), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 7/77 (9%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R+  E+ +P+R +R F YLA  ASG +GGLI + ++       A   ++   L   ++ V
Sbjct: 26  RLTAEMATPYRPLRQFVYLACGASGFIGGLIFLAKI-------ASGREIGSALPNFALQV 78

Query: 144 SAVSLFAFLYSRENKAK 160
             V+L  FL+  E +A+
Sbjct: 79  GVVALMVFLFRWEQRAE 95


>gi|428312657|ref|YP_007123634.1| hypothetical protein Mic7113_4545 [Microcoleus sp. PCC 7113]
 gi|428254269|gb|AFZ20228.1| Protein of unknown function (DUF3493) [Microcoleus sp. PCC 7113]
          Length = 99

 Score = 44.7 bits (104), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 7/78 (8%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           A++R E  +P+R +R F YLAF ASG +G  + + QL       A   +V   L   ++ 
Sbjct: 26  AKLRAEAKAPYRVLRQFIYLAFGASGFIGAFVFLGQL-------AAGHEVATALPNFALQ 78

Query: 143 VSAVSLFAFLYSRENKAK 160
           +  V+L  +L+  + +A+
Sbjct: 79  IGVVALMVWLFRVDRQAE 96


>gi|300865884|ref|ZP_07110626.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300336124|emb|CBN55784.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 112

 Score = 44.7 bits (104), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 22/79 (27%), Positives = 46/79 (58%), Gaps = 7/79 (8%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R++ E+++P+R++R F Y+A  ASG +GGL+ + QL+          +V   +   ++ V
Sbjct: 37  RLKAEMVAPYRTLRQFVYVACGASGFIGGLVFLGQLM-------VGREVGSAIPNFALQV 89

Query: 144 SAVSLFAFLYSRENKAKNA 162
             V+L  +L+  E +++ +
Sbjct: 90  GVVALMVWLFRLEQRSRRS 108


>gi|428770088|ref|YP_007161878.1| hypothetical protein Cyan10605_1731 [Cyanobacterium aponinum PCC
           10605]
 gi|428684367|gb|AFZ53834.1| hypothetical protein Cyan10605_1731 [Cyanobacterium aponinum PCC
           10605]
          Length = 90

 Score = 44.3 bits (103), Expect = 0.060,   Method: Composition-based stats.
 Identities = 21/78 (26%), Positives = 41/78 (52%), Gaps = 7/78 (8%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           A ++ E  +P++ +R F YL   ASG++G  I   QL+          +V + +  L + 
Sbjct: 19  AYLKAEAQAPYKGLRKFIYLGLGASGAIGAFIFFAQLLA-------GKNVSDNIYNLFVQ 71

Query: 143 VSAVSLFAFLYSRENKAK 160
           ++ +++  FLY  E++ K
Sbjct: 72  ITVIAIMVFLYRWEDRKK 89


>gi|254423312|ref|ZP_05037030.1| hypothetical protein S7335_3468 [Synechococcus sp. PCC 7335]
 gi|196190801|gb|EDX85765.1| hypothetical protein S7335_3468 [Synechococcus sp. PCC 7335]
          Length = 123

 Score = 43.5 bits (101), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 46/88 (52%), Gaps = 10/88 (11%)

Query: 74  RSDPEISSAARIRN---EVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAA 130
           ++ P+  S AR      E  +P++ +R F Y AF ASG++GG +  T+++          
Sbjct: 38  KTPPKGMSQARYDKLVAESQAPYKGLRKFMYAAFGASGAVGGFVFFTKILAGR------- 90

Query: 131 DVPEILKGLSIDVSAVSLFAFLYSRENK 158
           D+   L  L++ +  V+L  FL+  E++
Sbjct: 91  DLAASLPNLALQLGVVALMVFLFRWEDR 118


>gi|37523365|ref|NP_926742.1| hypothetical protein gsr3796 [Gloeobacter violaceus PCC 7421]
 gi|35214369|dbj|BAC91737.1| gsr3796 [Gloeobacter violaceus PCC 7421]
          Length = 98

 Score = 43.5 bits (101), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 46/90 (51%), Gaps = 7/90 (7%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           A +R E   P+R +R+  YLAF ASG +G  +A  +++          ++P  L  L++ 
Sbjct: 11  ASLRVESRHPWRKLRLGLYLAFGASGVVGFFVAFFRVLA-------GRELPVSLTNLAVQ 63

Query: 143 VSAVSLFAFLYSRENKAKNAQLARLSREES 172
              ++L  FL+  E + +NA   R   +E+
Sbjct: 64  SGVIALMVFLWRLETRKENALFERYLAQET 93


>gi|428320326|ref|YP_007118208.1| hypothetical protein Osc7112_5569 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428244006|gb|AFZ09792.1| hypothetical protein Osc7112_5569 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 102

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 42/78 (53%), Gaps = 7/78 (8%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R+  E+ +P+R +R F Y+A  ASG +GGL+ + ++       A   ++   L   ++ V
Sbjct: 26  RLTAEMSTPYRPLRQFVYVACGASGFIGGLVFLAKI-------ASGREIGSALPNFALQV 78

Query: 144 SAVSLFAFLYSRENKAKN 161
             V+L  FL+  E +A+ 
Sbjct: 79  GVVALMVFLFRWEQRAEG 96


>gi|282901335|ref|ZP_06309260.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
 gi|281193614|gb|EFA68586.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
          Length = 77

 Score = 43.1 bits (100), Expect = 0.12,   Method: Composition-based stats.
 Identities = 24/85 (28%), Positives = 43/85 (50%), Gaps = 11/85 (12%)

Query: 76  DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
           DP I +  R++ E+ SP++ +R F Y+   ASG +G  +   Q++         A   + 
Sbjct: 3   DPNIKN--RLKAEIASPYKGLRQFIYIGMGASGIIGAFVFFFQIL---------AGRNQA 51

Query: 136 LKGLSIDVSAVSLFAFLYSRENKAK 160
           L   ++ V  V++ A L+  E K+K
Sbjct: 52  LPNFALQVGIVAIMALLWKWERKSK 76


>gi|427415605|ref|ZP_18905788.1| Protein of unknown function (DUF3493) [Leptolyngbya sp. PCC 7375]
 gi|425758318|gb|EKU99170.1| Protein of unknown function (DUF3493) [Leptolyngbya sp. PCC 7375]
          Length = 97

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 7/77 (9%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R+  E  SPF+ +R   Y A  ASG LG  + +TQL+          DV   L  L++ +
Sbjct: 25  RLVQETQSPFKGMRKVIYGAVGASGLLGAFVFLTQLMA-------GKDVETALPNLAVQM 77

Query: 144 SAVSLFAFLYSRENKAK 160
             + +  FL+  E + K
Sbjct: 78  GVIGIVVFLFWLEGRGK 94


>gi|443647614|ref|ZP_21129750.1| hypothetical protein C789_290 [Microcystis aeruginosa DIANCHI905]
 gi|443335468|gb|ELS49939.1| hypothetical protein C789_290 [Microcystis aeruginosa DIANCHI905]
          Length = 59

 Score = 42.7 bits (99), Expect = 0.17,   Method: Composition-based stats.
 Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 2/53 (3%)

Query: 70  SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGA 122
           S + R DP+    AR++ E  +P+R +R F YL   ASG +G ++ + QL G 
Sbjct: 2   SKSPRLDPQ--KYARLKEEAANPYRGLRQFIYLGLGASGFIGAVVFLAQLAGG 52


>gi|219111561|ref|XP_002177532.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217412067|gb|EEC51995.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 321

 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 42/165 (25%), Positives = 72/165 (43%), Gaps = 22/165 (13%)

Query: 85  IRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVS 144
           +R E  SPFR VR FFY +   +G+L  L      + A         + E    + +DV+
Sbjct: 68  LREEAESPFRKVRFFFYYSL-GAGALTSLAVSASRVAAALAGINTDLLQESAINVGVDVA 126

Query: 145 AVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKII------------------ 186
            + +    + ++ +A++++L R S+   L+ L +R   K II                  
Sbjct: 127 GLVVLGLAFQKDKEAQDSRLKRASKGAELARLTVR-GSKSIITGDLPNVNEASETFTTSL 185

Query: 187 -SVNNLRGI-ARLVICSGPASYIMESFKRSEPFTESLLERGVLVV 229
            S+   RGI  R+VI +     I E  + S    +SL+   +L+V
Sbjct: 186 ASMRRGRGIEKRVVIAAAGKEKIAEILEDSIKLADSLVMSDLLIV 230


>gi|443315905|ref|ZP_21045373.1| Protein of unknown function (DUF3493) [Leptolyngbya sp. PCC 6406]
 gi|442784478|gb|ELR94350.1| Protein of unknown function (DUF3493) [Leptolyngbya sp. PCC 6406]
          Length = 98

 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 7/76 (9%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           AR+  E  +P+R  R F Y AF  SG++G  + ++Q+       A   DV   L  L + 
Sbjct: 28  ARLVAEAKAPYRGFRKFIYGAFGISGAIGAFVFLSQM-------AAGRDVANALPNLLLQ 80

Query: 143 VSAVSLFAFLYSRENK 158
           V  V+L  FL+  E++
Sbjct: 81  VGLVALMVFLFRLESR 96


>gi|428221458|ref|YP_007105628.1| hypothetical protein Syn7502_01415 [Synechococcus sp. PCC 7502]
 gi|427994798|gb|AFY73493.1| Protein of unknown function (DUF3493) [Synechococcus sp. PCC 7502]
          Length = 84

 Score = 42.0 bits (97), Expect = 0.26,   Method: Composition-based stats.
 Identities = 23/70 (32%), Positives = 38/70 (54%), Gaps = 6/70 (8%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           +++ E+ +PF  +R F YLA   SG+LG  I +TQL+      A  ++V   L  L++ +
Sbjct: 11  KLKAELKTPFLGLRRFIYLACGTSGALGAYIFLTQLL------AGKSEVSTTLSNLAVQM 64

Query: 144 SAVSLFAFLY 153
             + L   LY
Sbjct: 65  GVIGLMILLY 74


>gi|427729974|ref|YP_007076211.1| hypothetical protein Nos7524_2788 [Nostoc sp. PCC 7524]
 gi|427365893|gb|AFY48614.1| Protein of unknown function (DUF3493) [Nostoc sp. PCC 7524]
          Length = 88

 Score = 41.6 bits (96), Expect = 0.33,   Method: Composition-based stats.
 Identities = 24/89 (26%), Positives = 42/89 (47%), Gaps = 9/89 (10%)

Query: 73  DRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADV 132
           +R +PE     R++ E+++P+R +R F Y    ASG +G      +L+          D+
Sbjct: 8   NRLNPE--QYNRLKAEMVAPYRGLRQFIYFGVGASGFIGAFTFFFKLLA-------GKDI 58

Query: 133 PEILKGLSIDVSAVSLFAFLYSRENKAKN 161
              L   ++ V  V L  FL+  E + +N
Sbjct: 59  EHTLPNFALQVGIVILMIFLWRWEQRRQN 87


>gi|414079532|ref|YP_007000956.1| hypothetical protein ANA_C20461 [Anabaena sp. 90]
 gi|413972811|gb|AFW96899.1| hypothetical protein ANA_C20461 [Anabaena sp. 90]
          Length = 87

 Score = 41.6 bits (96), Expect = 0.35,   Method: Composition-based stats.
 Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 7/74 (9%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R++ E  +P+R +R F Y+   ASG +G  +   QL+          D+ +     +I +
Sbjct: 17  RLKAEAAAPYRGLRQFIYIGVGASGFIGAFVFFFQLLA-------GRDIEQAASNFAIQL 69

Query: 144 SAVSLFAFLYSREN 157
             + L  FL+  EN
Sbjct: 70  GVIGLMVFLWKWEN 83


>gi|323453284|gb|EGB09156.1| hypothetical protein AURANDRAFT_63641 [Aureococcus anophagefferens]
          Length = 340

 Score = 41.2 bits (95), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 2/126 (1%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R+R E+ SPFR VR+ F+ A   S  L    ++     A A    A  + + L+   I++
Sbjct: 63  RLREEIDSPFRKVRLLFFGASTGSAGLAFYFSLLNAFKANAGFRDAPPLSDALQSCGINL 122

Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLR--VDEKKIISVNNLRGIARLVICS 201
            A+   A +   + +A  A L R+++   L+ L +      +K   + + R  +R+VI  
Sbjct: 123 VAIVACAAVTYNDWRAGQANLERIAQGGRLAKLVVSPAAAPEKRAPLADYRRTSRVVIAC 182

Query: 202 GPASYI 207
           G  +Y+
Sbjct: 183 GGRAYV 188


>gi|428778508|ref|YP_007170294.1| hypothetical protein Dacsa_0122 [Dactylococcopsis salina PCC 8305]
 gi|428692787|gb|AFZ48937.1| Protein of unknown function (DUF3493) [Dactylococcopsis salina PCC
           8305]
          Length = 90

 Score = 40.4 bits (93), Expect = 0.71,   Method: Composition-based stats.
 Identities = 21/77 (27%), Positives = 39/77 (50%), Gaps = 7/77 (9%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R++ E  +P+R +R   Y+ F ASG +G  + +++L       A   ++ E L  L + V
Sbjct: 19  RLQAEANAPYRGLRKTIYVVFAASGLMGAFVMLSKL-------AAGTNIGETLPNLGLQV 71

Query: 144 SAVSLFAFLYSRENKAK 160
             V+L  +L+  +   K
Sbjct: 72  GIVALMVWLFRLDQDKK 88


>gi|220910557|ref|YP_002485868.1| hypothetical protein Cyan7425_5214 [Cyanothece sp. PCC 7425]
 gi|219867168|gb|ACL47507.1| hypothetical protein Cyan7425_5214 [Cyanothece sp. PCC 7425]
          Length = 99

 Score = 40.4 bits (93), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 28/108 (25%), Positives = 51/108 (47%), Gaps = 22/108 (20%)

Query: 53  HSTWTAKRSPLCSNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGG 112
           HS+ + K++P            +DP      R+R E+ +P+R +R   YL F AS ++G 
Sbjct: 7   HSSSSRKKAP------------ADPRTE---RLRAELKAPYRGLRQVTYLVFGASAAIGA 51

Query: 113 LIAITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAK 160
            + + +L+   A       V   L  L++ +  ++L   L+  E K+K
Sbjct: 52  FVFVLKLLAGEA-------VETNLANLALQIGVLALMVGLFRLETKSK 92


>gi|224064428|ref|XP_002301471.1| predicted protein [Populus trichocarpa]
 gi|222843197|gb|EEE80744.1| predicted protein [Populus trichocarpa]
          Length = 273

 Score = 40.4 bits (93), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 34/62 (54%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++ +EV +PFR VR FFY+AF A+  +  L  I +L  A+     A D+ E     + ++
Sbjct: 176 KLISEVQAPFRGVRRFFYVAFSAAAGISLLFTIPRLFLAIKGGGDAPDLWETAGNAATNI 235

Query: 144 SA 145
            A
Sbjct: 236 GA 237


>gi|113477411|ref|YP_723472.1| hypothetical protein Tery_3974 [Trichodesmium erythraeum IMS101]
 gi|110168459|gb|ABG52999.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 87

 Score = 40.0 bits (92), Expect = 0.95,   Method: Composition-based stats.
 Identities = 16/40 (40%), Positives = 26/40 (65%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGA 122
           AR++ E+ +P+R +R F Y+AF  SG +G  I + Q+I  
Sbjct: 21  ARLKAELDAPYRGLRKFIYVAFAISGFIGAFIYLAQIIAG 60


>gi|407961699|dbj|BAM54939.1| hypothetical protein BEST7613_6008 [Bacillus subtilis BEST7613]
          Length = 86

 Score = 40.0 bits (92), Expect = 0.96,   Method: Composition-based stats.
 Identities = 25/82 (30%), Positives = 40/82 (48%), Gaps = 10/82 (12%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R++ E  +P+R  R F YL   ASG +G  I   +L+          D  EIL  L++ +
Sbjct: 10  RLKAEAEAPYRGFRKFIYLGLGASGLIGVFIFSLRLLAGY-------DPTEILPNLALQL 62

Query: 144 SAVSLFAFLY---SRENKAKNA 162
             V+L  +L+   S +N+  N 
Sbjct: 63  GVVALMTWLFRWESSKNQGNNG 84


>gi|427709388|ref|YP_007051765.1| hypothetical protein Nos7107_4063 [Nostoc sp. PCC 7107]
 gi|427361893|gb|AFY44615.1| hypothetical protein Nos7107_4063 [Nostoc sp. PCC 7107]
          Length = 89

 Score = 40.0 bits (92), Expect = 1.1,   Method: Composition-based stats.
 Identities = 23/85 (27%), Positives = 43/85 (50%), Gaps = 9/85 (10%)

Query: 74  RSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVP 133
           R +PE    AR++ E  +P+R +R F Y+   ASG +G  +   +++          ++ 
Sbjct: 9   RLNPE--QYARLKAEAATPYRGLRQFIYIGVGASGFIGAFVFFFRVLA-------GRELE 59

Query: 134 EILKGLSIDVSAVSLFAFLYSRENK 158
             L  L++ +  V+L AFL+  E +
Sbjct: 60  STLPNLALQLGIVALMAFLWRWEKR 84


>gi|414877422|tpg|DAA54553.1| TPA: hypothetical protein ZEAMMB73_812331 [Zea mays]
          Length = 113

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 35/63 (55%)

Query: 134 EILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRG 193
           E+L  +    + + +   LY  ENK +  Q+ ++SR E+LS L +R+   +II +  LR 
Sbjct: 11  EMLPSILEYAAGIVVLVALYFWENKKEEKQITQISRNETLSRLPVRLSTNRIIELVQLRD 70

Query: 194 IAR 196
           I R
Sbjct: 71  ITR 73


>gi|119511048|ref|ZP_01630168.1| hypothetical protein N9414_09896 [Nodularia spumigena CCY9414]
 gi|119464299|gb|EAW45216.1| hypothetical protein N9414_09896 [Nodularia spumigena CCY9414]
          Length = 89

 Score = 39.7 bits (91), Expect = 1.5,   Method: Composition-based stats.
 Identities = 22/79 (27%), Positives = 39/79 (49%), Gaps = 8/79 (10%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           AR++ E+ +P+R  R F Y+ F AS  +G  I   Q++          D+      L++ 
Sbjct: 16  ARLKAEMATPYRGFRQFIYIGFGASALIGAFIFFFQVLA-------GRDIDNAAPNLALQ 68

Query: 143 VSAVSLFAFLYSRE-NKAK 160
           +  ++L  FL+  E N+ K
Sbjct: 69  IGVLALMIFLWRWEQNRQK 87


>gi|395848982|ref|XP_003797116.1| PREDICTED: protein FAM176C [Otolemur garnettii]
          Length = 441

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 34/145 (23%), Positives = 62/145 (42%), Gaps = 9/145 (6%)

Query: 77  PEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAIT-QLIGALANPARAADVPEI 135
           P+++    +R +V  P + +R+F+    V S  +  L   +  L   L N    A   + 
Sbjct: 13  PQLAQHPGLRRQVEQPGQLLRLFYCTVLVCSKEIAALSDFSGYLTKLLQNHTAYACDGDH 72

Query: 136 LKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIA 195
           L       S +S+ +  Y ++ +  N+Q     RE+SL+ +     +K +    N R   
Sbjct: 73  LSLQCPRHSTISVQSAFYGQDYQMCNSQQPASQREDSLTCVAPTTFQKVLDECQNQRACH 132

Query: 196 RLV--------ICSGPASYIMESFK 212
            LV        +C G + Y++ SFK
Sbjct: 133 LLVNSRVFGPDLCPGSSKYLLVSFK 157


>gi|423063691|ref|ZP_17052481.1| multi-sensor signal transduction histidine kinase [Arthrospira
            platensis C1]
 gi|406715123|gb|EKD10281.1| multi-sensor signal transduction histidine kinase [Arthrospira
            platensis C1]
          Length = 1413

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 64/123 (52%), Gaps = 9/123 (7%)

Query: 106  ASGSLGGL-IAITQ---LIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKN 161
            A G++GG  I   Q   + G   + +   D  E+L+ L+  V+     A LY +E+KAK 
Sbjct: 1066 AGGTIGGFEIGRVQSDCIKGCECDRSWLDDEIELLQNLATQVAIAIYQAQLY-QESKAKG 1124

Query: 162  AQLARLSREESLSNLKLRVDEKKIISVNNL-RGIARLVICSGPASYIMESFKRSEPFTES 220
             +L +  RE   + ++L +  +K+ S+  L  G+A  +  + P S+I  +   +  +TES
Sbjct: 1125 KELQKAYRELQETQVQL-IQAEKMSSLGQLVAGVAHEI--NNPVSFIYGNLTHTSTYTES 1181

Query: 221  LLE 223
            LLE
Sbjct: 1182 LLE 1184


>gi|376003846|ref|ZP_09781647.1| putative two-component sensor histidine kinase [Arthrospira sp. PCC
            8005]
 gi|375327788|emb|CCE17400.1| putative two-component sensor histidine kinase [Arthrospira sp. PCC
            8005]
          Length = 1385

 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 64/123 (52%), Gaps = 9/123 (7%)

Query: 106  ASGSLGGL-IAITQ---LIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKN 161
            A G++GG  I   Q   + G   + +   D  E+L+ L+  V+     A LY +E+KAK 
Sbjct: 1038 AGGTIGGFEIGRVQSDCIKGCECDRSWLDDEIELLQNLATQVAIAIYQAQLY-QESKAKG 1096

Query: 162  AQLARLSREESLSNLKLRVDEKKIISVNNL-RGIARLVICSGPASYIMESFKRSEPFTES 220
             +L +  RE   + ++L +  +K+ S+  L  G+A  +  + P S+I  +   +  +TES
Sbjct: 1097 KELQKAYRELQETQVQL-IQAEKMSSLGQLVAGVAHEI--NNPVSFIYGNLTHTSTYTES 1153

Query: 221  LLE 223
            LLE
Sbjct: 1154 LLE 1156


>gi|209523990|ref|ZP_03272542.1| multi-sensor signal transduction histidine kinase [Arthrospira maxima
            CS-328]
 gi|209495662|gb|EDZ95965.1| multi-sensor signal transduction histidine kinase [Arthrospira maxima
            CS-328]
          Length = 1411

 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 64/123 (52%), Gaps = 9/123 (7%)

Query: 106  ASGSLGGL-IAITQ---LIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKN 161
            A G++GG  I   Q   + G   + +   D  E+L+ L+  V+     A LY +E+KAK 
Sbjct: 1064 AGGTIGGFEIGRVQSDCIKGCECDRSWLDDEIELLQNLATQVAIAIYQAQLY-QESKAKG 1122

Query: 162  AQLARLSREESLSNLKLRVDEKKIISVNNL-RGIARLVICSGPASYIMESFKRSEPFTES 220
             +L +  RE   + ++L +  +K+ S+  L  G+A  +  + P S+I  +   +  +TES
Sbjct: 1123 KELQKAYRELQETQVQL-IQAEKMSSLGQLVAGVAHEI--NNPVSFIYGNLTHTSTYTES 1179

Query: 221  LLE 223
            LLE
Sbjct: 1180 LLE 1182


>gi|298490036|ref|YP_003720213.1| hypothetical protein Aazo_0602 ['Nostoc azollae' 0708]
 gi|298231954|gb|ADI63090.1| conserved hypothetical protein ['Nostoc azollae' 0708]
          Length = 89

 Score = 38.9 bits (89), Expect = 2.3,   Method: Composition-based stats.
 Identities = 19/76 (25%), Positives = 37/76 (48%), Gaps = 7/76 (9%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           A+++ E+ +P+R +R F Y+   ASG +G  +   Q++          ++   L   +++
Sbjct: 16  AQLKAEITAPYRGLRQFIYVGIGASGFIGAFVFSFQILA-------GRNIDSALPNFTLE 68

Query: 143 VSAVSLFAFLYSRENK 158
              V L  FLY  E +
Sbjct: 69  AGIVGLMVFLYRWEQR 84


>gi|428218811|ref|YP_007103276.1| hypothetical protein Pse7367_2592 [Pseudanabaena sp. PCC 7367]
 gi|427990593|gb|AFY70848.1| hypothetical protein Pse7367_2592 [Pseudanabaena sp. PCC 7367]
          Length = 103

 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 21/74 (28%), Positives = 39/74 (52%), Gaps = 6/74 (8%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           AR++ E  +P+R++R F YL+  ASG +G    + +++    N      + E L  L++ 
Sbjct: 30  ARLKAESKAPYRNLRKFVYLSLAASGLIGAFTFLMRILAGKGN------LTEDLGSLALQ 83

Query: 143 VSAVSLFAFLYSRE 156
           V+ V L    + R+
Sbjct: 84  VAIVVLMILAFRRD 97


>gi|443479266|ref|ZP_21068879.1| hypothetical protein Pse7429DRAFT_4585 [Pseudanabaena biceps PCC
           7429]
 gi|443015234|gb|ELS30286.1| hypothetical protein Pse7429DRAFT_4585 [Pseudanabaena biceps PCC
           7429]
          Length = 80

 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 38/70 (54%), Gaps = 6/70 (8%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           R++ E  SP+R +R+F Y AF  SG +G ++ + +LI      A   ++   L   ++ +
Sbjct: 11  RLKLEAKSPYRGLRIFIYAAFAGSGFIGAVVFLARLI------AGKGELEANLGSFALQI 64

Query: 144 SAVSLFAFLY 153
             ++L  +LY
Sbjct: 65  GVLALMVWLY 74


>gi|440682401|ref|YP_007157196.1| hypothetical protein Anacy_2855 [Anabaena cylindrica PCC 7122]
 gi|428679520|gb|AFZ58286.1| hypothetical protein Anacy_2855 [Anabaena cylindrica PCC 7122]
          Length = 89

 Score = 38.5 bits (88), Expect = 3.5,   Method: Composition-based stats.
 Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 7/74 (9%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           A ++ E+ +P+R +R F Y+   ASG +G      QL+          ++   L   ++ 
Sbjct: 16  AHLKAEIAAPYRGLRQFIYIGVGASGFIGAFTFFFQLLA-------GKNIDSTLPNFALQ 68

Query: 143 VSAVSLFAFLYSRE 156
           V  V+L  FL+  E
Sbjct: 69  VGIVALMIFLWKWE 82


>gi|226501606|ref|NP_001143133.1| uncharacterized protein LOC100275611 [Zea mays]
 gi|195614826|gb|ACG29243.1| hypothetical protein [Zea mays]
          Length = 273

 Score = 38.1 bits (87), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 6/81 (7%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++ +EV +PFR VR FFY+A  A+  +     I +L+ A+     A D+ E     +I++
Sbjct: 185 KLISEVQAPFRGVRRFFYVALTAAAGISTFFTIPRLVLAVRGGDDAPDLLETAGNAAINI 244

Query: 144 S------AVSLFAFLYSRENK 158
                      F FL  +E +
Sbjct: 245 GVCRRYCCAGSFVFLGKQERR 265


>gi|359457400|ref|ZP_09245963.1| hypothetical protein ACCM5_01647 [Acaryochloris sp. CCMEE 5410]
          Length = 92

 Score = 38.1 bits (87), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 15/39 (38%), Positives = 25/39 (64%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGA 122
           ++R E+ SP+R +R FFY+AF  S  +G ++   QL+  
Sbjct: 21  QLRAELKSPYRGLRQFFYIAFAGSAFVGAVVFFFQLLAG 59


>gi|427711370|ref|YP_007059994.1| hypothetical protein Syn6312_0210 [Synechococcus sp. PCC 6312]
 gi|427375499|gb|AFY59451.1| Protein of unknown function (DUF3493) [Synechococcus sp. PCC 6312]
          Length = 89

 Score = 38.1 bits (87), Expect = 4.1,   Method: Composition-based stats.
 Identities = 24/89 (26%), Positives = 42/89 (47%), Gaps = 9/89 (10%)

Query: 70  SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA 129
           S   R++ E     R+  E  +P+R +R   YL F ASG +G +I + +LI   A+    
Sbjct: 5   SPQTRAEAEFRQ--RLLKEAQAPYRGLRQVMYLVFAASGLIGAMIFLLKLIAGQAD---- 58

Query: 130 ADVPEILKGLSIDVSAVSLFAFLYSRENK 158
               E+    ++ +  ++L   L+  E K
Sbjct: 59  ---QELWGNFALQLGVIALMISLWRWEQK 84


>gi|428215965|ref|YP_007089109.1| hypothetical protein Oscil6304_5714 [Oscillatoria acuminata PCC
           6304]
 gi|428004346|gb|AFY85189.1| Protein of unknown function (DUF3493) [Oscillatoria acuminata PCC
           6304]
          Length = 101

 Score = 37.7 bits (86), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 20/75 (26%), Positives = 37/75 (49%), Gaps = 7/75 (9%)

Query: 83  ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
           AR+R E  +P++ +R F Y+   ASG +G  + + QL+          +V   L   ++ 
Sbjct: 32  ARLRAEAAAPYKGLRKFVYVVCGASGGIGAFVFLAQLLA-------GREVGAALPNFALQ 84

Query: 143 VSAVSLFAFLYSREN 157
           +  V+L  +L+  E 
Sbjct: 85  LGVVALMVWLFRLEK 99


>gi|17231101|ref|NP_487649.1| hypothetical protein asl3609 [Nostoc sp. PCC 7120]
 gi|17132742|dbj|BAB75308.1| asl3609 [Nostoc sp. PCC 7120]
          Length = 88

 Score = 37.7 bits (86), Expect = 4.6,   Method: Composition-based stats.
 Identities = 28/83 (33%), Positives = 41/83 (49%), Gaps = 9/83 (10%)

Query: 74  RSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVP 133
           R +PE    AR++ E+ +P+R +R  FY+ F ASG +G  I   QL+          DV 
Sbjct: 9   RLNPE--QYARLKAEMATPYRGLRQLFYIGFGASGFIGAFIFFFQLLA-------GRDVE 59

Query: 134 EILKGLSIDVSAVSLFAFLYSRE 156
             L   ++ V  V L  FL+  E
Sbjct: 60  TALPNFALQVGVVGLMIFLWRWE 82


>gi|414877424|tpg|DAA54555.1| TPA: hypothetical protein ZEAMMB73_812331, partial [Zea mays]
          Length = 197

 Score = 37.7 bits (86), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 19/61 (31%), Positives = 33/61 (54%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++ +EV +PFR VR FFY+A  A+  +     I +L+ A+     A D+ E     +I++
Sbjct: 109 KLISEVQAPFRGVRRFFYVALTAAAGISTFFTIPRLVLAVRGGDGAPDLLETAGNAAINI 168

Query: 144 S 144
            
Sbjct: 169 G 169


>gi|194697836|gb|ACF83002.1| unknown [Zea mays]
          Length = 273

 Score = 37.7 bits (86), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 6/81 (7%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
           ++ +EV +PFR VR FFY+A  A+  +     I +L+ A+     A D+ E     +I++
Sbjct: 185 KLISEVQAPFRGVRRFFYVALTAAAGISTFFTIPRLVLAVRGGDGAPDLLETAGNAAINI 244

Query: 144 SAV------SLFAFLYSRENK 158
                      F FL  +E +
Sbjct: 245 GVCRRYCCSGSFVFLGKQERR 265


>gi|158334971|ref|YP_001516143.1| hypothetical protein AM1_1807 [Acaryochloris marina MBIC11017]
 gi|158305212|gb|ABW26829.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 92

 Score = 37.4 bits (85), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 15/39 (38%), Positives = 25/39 (64%)

Query: 84  RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGA 122
           ++R E+ SP+R +R FFY+AF  S  +G ++   QL+  
Sbjct: 21  QLRAELKSPYRGLRRFFYIAFAGSAFVGAVVFFFQLLAG 59


>gi|125974000|ref|YP_001037910.1| methyl-accepting chemotaxis sensory transducer [Clostridium
           thermocellum ATCC 27405]
 gi|125714225|gb|ABN52717.1| methyl-accepting chemotaxis sensory transducer [Clostridium
           thermocellum ATCC 27405]
          Length = 758

 Score = 37.0 bits (84), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 23/88 (26%), Positives = 47/88 (53%), Gaps = 4/88 (4%)

Query: 183 KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTES---LLERGVLVVSFTTDGNIPS 239
           ++IISV+N + I   +IC   + YI++ ++  +   ++   +L+   LV+S  +D  +  
Sbjct: 255 RRIISVSNAKTIG-YIICVMKSDYILKQYEGIDLGEDTDIFILDSNGLVISSNSDEFVAG 313

Query: 240 FEFEESEEMKEFTRKRKRLWQLTPVYVS 267
            EF + + +KE T K +      P Y++
Sbjct: 314 LEFPDKDFVKELTLKNEEKINAFPAYIN 341


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.133    0.385 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,969,147,834
Number of Sequences: 23463169
Number of extensions: 148312490
Number of successful extensions: 402426
Number of sequences better than 100.0: 173
Number of HSP's better than 100.0 without gapping: 126
Number of HSP's successfully gapped in prelim test: 47
Number of HSP's that attempted gapping in prelim test: 402221
Number of HSP's gapped (non-prelim): 177
length of query: 271
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 131
effective length of database: 9,074,351,707
effective search space: 1188740073617
effective search space used: 1188740073617
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 75 (33.5 bits)