BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 024212
(271 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224129560|ref|XP_002320616.1| predicted protein [Populus trichocarpa]
gi|222861389|gb|EEE98931.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 327 bits (839), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 158/208 (75%), Positives = 186/208 (89%), Gaps = 2/208 (0%)
Query: 65 SNITCSATDRSDP--EISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGA 122
S ITCSA ++ P E+SS A+IR+EVLSPFR+VRMFFYLAF+ASG+LGGLIA T+LI A
Sbjct: 30 STITCSAANKPSPSTEVSSTAKIRSEVLSPFRTVRMFFYLAFLASGALGGLIATTRLIAA 89
Query: 123 LANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDE 182
LANP+RAA+VPEILKGL ID+ A ++FAFLY RENKAKN QLARLSREE+LSNLKLRVDE
Sbjct: 90 LANPSRAAEVPEILKGLGIDIGAAAIFAFLYYRENKAKNVQLARLSREENLSNLKLRVDE 149
Query: 183 KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF 242
KKIISV++LRG+ARLVIC+GPA +I+ESFK SEPFT+SLL+RGVLVV F ++GN+PSFEF
Sbjct: 150 KKIISVSSLRGVARLVICAGPAPFILESFKLSEPFTQSLLDRGVLVVPFASNGNLPSFEF 209
Query: 243 EESEEMKEFTRKRKRLWQLTPVYVSEWS 270
+E EEMKE T KRKRLWQL P+YVSEWS
Sbjct: 210 DEGEEMKELTTKRKRLWQLAPIYVSEWS 237
>gi|356548113|ref|XP_003542448.1| PREDICTED: uncharacterized protein LOC100799768 [Glycine max]
Length = 326
Score = 309 bits (791), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 159/219 (72%), Positives = 190/219 (86%), Gaps = 5/219 (2%)
Query: 55 TWTAKRSPLCSNITCSATDR--SDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGG 112
+++ +R+ + I CSA ++ S EISS A+IR+EVLSPFRSVRMFFY+AFVASG+LGG
Sbjct: 49 SYSHQRASSFATIVCSAANKPSSSSEISSTAKIRSEVLSPFRSVRMFFYIAFVASGALGG 108
Query: 113 LIAITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREES 172
IA TQLIGALAN +RA++VPEILKGL ID++AVSLFAFLY RENKAKNAQ+ARLSREES
Sbjct: 109 FIATTQLIGALANSSRASEVPEILKGLGIDLAAVSLFAFLYFRENKAKNAQVARLSREES 168
Query: 173 LSNLKLRVDEKKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFT 232
LSNLKLRVDEKKII VN+LRGIARLVIC+GPAS++ ESFKRSEPFT+SLL+RGVLVV F
Sbjct: 169 LSNLKLRVDEKKIIPVNSLRGIARLVICAGPASFVTESFKRSEPFTDSLLDRGVLVVPFV 228
Query: 233 TDGNIPSFEFEESEEMKEFTRKRKRLWQLTPVYVSEWSK 271
TDGN P+ EFEE+EE+ +RKRLWQL PVY++EWS+
Sbjct: 229 TDGNSPALEFEETEEL---ATRRKRLWQLAPVYINEWSE 264
>gi|359480082|ref|XP_002267644.2| PREDICTED: uncharacterized protein LOC100253269 [Vitis vinifera]
Length = 328
Score = 308 bits (789), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 148/189 (78%), Positives = 170/189 (89%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
A+IR+EVLSPFR+VRMFFYLAF+ASG+LGGLIA TQLI AL N +RA VP+ILKGL ID
Sbjct: 77 AKIRSEVLSPFRTVRMFFYLAFIASGALGGLIATTQLIAALTNSSRAPLVPDILKGLGID 136
Query: 143 VSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSG 202
+ AV++FAFLYSRE+ AKNAQLARL+REESLSNLKLRVDEKK+ISVN+LRGIARLVIC+G
Sbjct: 137 IGAVAIFAFLYSRESSAKNAQLARLTREESLSNLKLRVDEKKVISVNDLRGIARLVICAG 196
Query: 203 PASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFTRKRKRLWQLT 262
PA +I ESFK S+PFT+ LL+RGVLVV F TDG +PSFE+EESEEMK+ T KRKRLWQL
Sbjct: 197 PAPFIAESFKLSQPFTQGLLDRGVLVVPFVTDGKLPSFEYEESEEMKDITTKRKRLWQLV 256
Query: 263 PVYVSEWSK 271
PVYVSEWSK
Sbjct: 257 PVYVSEWSK 265
>gi|297744261|emb|CBI37231.3| unnamed protein product [Vitis vinifera]
Length = 346
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 148/189 (78%), Positives = 170/189 (89%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
A+IR+EVLSPFR+VRMFFYLAF+ASG+LGGLIA TQLI AL N +RA VP+ILKGL ID
Sbjct: 95 AKIRSEVLSPFRTVRMFFYLAFIASGALGGLIATTQLIAALTNSSRAPLVPDILKGLGID 154
Query: 143 VSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSG 202
+ AV++FAFLYSRE+ AKNAQLARL+REESLSNLKLRVDEKK+ISVN+LRGIARLVIC+G
Sbjct: 155 IGAVAIFAFLYSRESSAKNAQLARLTREESLSNLKLRVDEKKVISVNDLRGIARLVICAG 214
Query: 203 PASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFTRKRKRLWQLT 262
PA +I ESFK S+PFT+ LL+RGVLVV F TDG +PSFE+EESEEMK+ T KRKRLWQL
Sbjct: 215 PAPFIAESFKLSQPFTQGLLDRGVLVVPFVTDGKLPSFEYEESEEMKDITTKRKRLWQLV 274
Query: 263 PVYVSEWSK 271
PVYVSEWSK
Sbjct: 275 PVYVSEWSK 283
>gi|388514469|gb|AFK45296.1| unknown [Lotus japonicus]
Length = 334
Score = 304 bits (779), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 159/245 (64%), Positives = 187/245 (76%), Gaps = 5/245 (2%)
Query: 32 RQILNKYRS--RTTPGLSISLPLHSTWTAKRSPLCSNITCSATDRSDPEISS---AARIR 86
R ++N +R+ G+S L T R + I CSA+++ P S ARIR
Sbjct: 28 RNLINNHRNFFSGNKGVSYGHNLFYTDPRTRRASLATILCSASNKPSPSSSEISSTARIR 87
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
EVLSPFRS+RMFFY+AF+AS SLG IA TQLIGALAN +RA+ VPEILKGL ID+ AV
Sbjct: 88 TEVLSPFRSLRMFFYVAFIASASLGAFIAATQLIGALANSSRASQVPEILKGLGIDIGAV 147
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
S+FAFLY R+NKAKNAQ ARLSREE LSNLKLRVDEKKII V++LRGIARLVIC+GP S+
Sbjct: 148 SIFAFLYFRDNKAKNAQEARLSREEFLSNLKLRVDEKKIIPVSSLRGIARLVICAGPESF 207
Query: 207 IMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFTRKRKRLWQLTPVYV 266
I ESFKRSEPFTE L++RGVLVV F TDGN P EFEE+EEMK+ +RKRLWQLTPVY+
Sbjct: 208 ITESFKRSEPFTEGLMDRGVLVVPFVTDGNSPDLEFEETEEMKQLATRRKRLWQLTPVYI 267
Query: 267 SEWSK 271
+EWS
Sbjct: 268 TEWSN 272
>gi|449468602|ref|XP_004152010.1| PREDICTED: uncharacterized protein LOC101206246 [Cucumis sativus]
Length = 338
Score = 298 bits (764), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 146/206 (70%), Positives = 175/206 (84%), Gaps = 2/206 (0%)
Query: 65 SNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALA 124
+N CSA D+ PEISS A+IR+EVLSPFRSVRMFFYL F+ASG+LGGLIA TQL+GALA
Sbjct: 71 ANFICSAADK--PEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLGALA 128
Query: 125 NPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKK 184
N +RA +VP+IL+GL +D AV+LFAFLY REN AKNAQLARLSREESLSNLKLRVD+ K
Sbjct: 129 NSSRADEVPDILEGLGVDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNK 188
Query: 185 IISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEE 244
+I ++ LRGIARLVIC+GP S+I+E+FK SEPFTE LLERGVLVV TD +FEF++
Sbjct: 189 VIPISILRGIARLVICAGPESFIIEAFKSSEPFTERLLERGVLVVPLATDVTTLNFEFDD 248
Query: 245 SEEMKEFTRKRKRLWQLTPVYVSEWS 270
EE+K+ T KRKRLW+LTPVY++EWS
Sbjct: 249 REEVKDITTKRKRLWRLTPVYMTEWS 274
>gi|255560289|ref|XP_002521162.1| conserved hypothetical protein [Ricinus communis]
gi|223539731|gb|EEF41313.1| conserved hypothetical protein [Ricinus communis]
Length = 331
Score = 286 bits (732), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 161/209 (77%), Positives = 184/209 (88%), Gaps = 2/209 (0%)
Query: 65 SNITCSATDRSDP--EISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGA 122
S+I CSA ++ P +ISS A+IR+EVLSPFRSVRMFFYLAF+ASG LG LIA TQLI A
Sbjct: 60 SSIACSAANKPSPPTDISSTAKIRSEVLSPFRSVRMFFYLAFIASGGLGALIATTQLIAA 119
Query: 123 LANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDE 182
LANPARAA+VPEILKGL ID+ AVS+FAFLY REN AK AQ+ARLSREESLSNLKLRVD+
Sbjct: 120 LANPARAAEVPEILKGLGIDIGAVSIFAFLYYRENNAKQAQVARLSREESLSNLKLRVDD 179
Query: 183 KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF 242
KKIIS+++LRGIARLVIC+GP SY++ESF+ SEPFT+SLL+RGVLVV F TDGN P+FEF
Sbjct: 180 KKIISISSLRGIARLVICAGPGSYVLESFRLSEPFTQSLLDRGVLVVPFATDGNFPNFEF 239
Query: 243 EESEEMKEFTRKRKRLWQLTPVYVSEWSK 271
EESEEMKE T KRKRLWQL P +VSEWSK
Sbjct: 240 EESEEMKEITSKRKRLWQLNPTFVSEWSK 268
>gi|297803184|ref|XP_002869476.1| hypothetical protein ARALYDRAFT_913640 [Arabidopsis lyrata subsp.
lyrata]
gi|297315312|gb|EFH45735.1| hypothetical protein ARALYDRAFT_913640 [Arabidopsis lyrata subsp.
lyrata]
Length = 328
Score = 283 bits (723), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 148/249 (59%), Positives = 188/249 (75%), Gaps = 12/249 (4%)
Query: 25 YHASRFCRQILNKYRSRTTPGLSISLPLHSTWTAKRSPLCSNITCSATDRSDPEISSAAR 84
Y RF R ++ R +P L+I T K PL N C A D EIS+ AR
Sbjct: 28 YSPRRFSRHN-HRERLDFSPRLTI--------TGKHQPLSFNTVCFAADEPS-EISADAR 77
Query: 85 IRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVS 144
IR+EVLSPFRSVRMFFYLAF+ASGSLGGLIA ++LIGALANPAR+ +V EI+KGL ID+
Sbjct: 78 IRSEVLSPFRSVRMFFYLAFIASGSLGGLIATSRLIGALANPARSGEVLEIVKGLGIDIG 137
Query: 145 AVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDE-KKIISVNNLRGIARLVICSGP 203
A SLFAFLY ENK KNAQ+ARLSREE+L+ LK+RV+E K+ISV +LRGIARLVIC+GP
Sbjct: 138 AASLFAFLYFNENKTKNAQMARLSREENLAKLKMRVEENNKVISVGDLRGIARLVICAGP 197
Query: 204 ASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEM-KEFTRKRKRLWQLT 262
A +I E+F RS+ FT+ L+ERGV+VV++ TDGN P EF+E++ + +E +++RKRLW++T
Sbjct: 198 AEFIEEAFNRSKEFTQGLVERGVVVVAYATDGNSPVLEFDEADIVDEEISQRRKRLWRVT 257
Query: 263 PVYVSEWSK 271
PV+V EW K
Sbjct: 258 PVFVPEWEK 266
>gi|18417257|ref|NP_567817.1| uncharacterized protein [Arabidopsis thaliana]
gi|332660136|gb|AEE85536.1| uncharacterized protein [Arabidopsis thaliana]
Length = 347
Score = 280 bits (717), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 138/219 (63%), Positives = 177/219 (80%), Gaps = 3/219 (1%)
Query: 55 TWTAKRSPLCSNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLI 114
T T K+ PL N C A D EIS+ ARIR+EVLSPFRSVRMFFYLAF+ASGSLGGLI
Sbjct: 68 TITRKQQPLSFNTVCFAADEPS-EISADARIRSEVLSPFRSVRMFFYLAFIASGSLGGLI 126
Query: 115 AITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLS 174
A ++LIGALANPAR+ +V EI+KGL +D+ A SLFAFLY ENK KNAQ+ARLSREE+L
Sbjct: 127 ATSRLIGALANPARSGEVLEIVKGLGVDIGAASLFAFLYFNENKTKNAQMARLSREENLG 186
Query: 175 NLKLRVDE-KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTT 233
LK+RV+E K+ISV +LRG+ARLVIC+GPA +I E+FKRS+ +T+ L+ERGV+VV++ T
Sbjct: 187 KLKMRVEENNKVISVGDLRGVARLVICAGPAEFIEEAFKRSKEYTQGLVERGVVVVAYAT 246
Query: 234 DGNIPSFEFEESE-EMKEFTRKRKRLWQLTPVYVSEWSK 271
DGN P EF+E++ +E +++RK+LW++TPV+V EW K
Sbjct: 247 DGNSPVLEFDETDIADEEMSQRRKKLWRVTPVFVPEWEK 285
>gi|4218124|emb|CAA22978.1| hypothetical protein [Arabidopsis thaliana]
gi|7269729|emb|CAB81462.1| hypothetical protein [Arabidopsis thaliana]
gi|21593096|gb|AAM65045.1| unknown [Arabidopsis thaliana]
Length = 331
Score = 280 bits (717), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 138/219 (63%), Positives = 177/219 (80%), Gaps = 3/219 (1%)
Query: 55 TWTAKRSPLCSNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLI 114
T T K+ PL N C A D EIS+ ARIR+EVLSPFRSVRMFFYLAF+ASGSLGGLI
Sbjct: 52 TITRKQQPLSFNTVCFAADEPS-EISADARIRSEVLSPFRSVRMFFYLAFIASGSLGGLI 110
Query: 115 AITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLS 174
A ++LIGALANPAR+ +V EI+KGL +D+ A SLFAFLY ENK KNAQ+ARLSREE+L
Sbjct: 111 ATSRLIGALANPARSGEVLEIVKGLGVDIGAASLFAFLYFNENKTKNAQMARLSREENLG 170
Query: 175 NLKLRVDE-KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTT 233
LK+RV+E K+ISV +LRG+ARLVIC+GPA +I E+FKRS+ +T+ L+ERGV+VV++ T
Sbjct: 171 KLKMRVEENNKVISVGDLRGVARLVICAGPAEFIEEAFKRSKEYTQGLVERGVVVVAYAT 230
Query: 234 DGNIPSFEFEESE-EMKEFTRKRKRLWQLTPVYVSEWSK 271
DGN P EF+E++ +E +++RK+LW++TPV+V EW K
Sbjct: 231 DGNSPVLEFDETDIADEEMSQRRKKLWRVTPVFVPEWEK 269
>gi|449530478|ref|XP_004172222.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101223919, partial [Cucumis sativus]
Length = 254
Score = 277 bits (709), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 135/190 (71%), Positives = 161/190 (84%)
Query: 81 SAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLS 140
S A+IR+EVLSPFRSVRMFFYL F+ASG+LGGLIA TQL+GALAN +RA +VP+IL+GL
Sbjct: 1 STAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLGALANSSRADEVPDILEGLG 60
Query: 141 IDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVIC 200
ID AV+LFAFLY REN AKNAQLARLSREESLSNLKLRVD+ K+I ++ LRGIARLVIC
Sbjct: 61 IDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVIPISILRGIARLVIC 120
Query: 201 SGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFTRKRKRLWQ 260
+GP S+I+E+FK SEPFTE LLERG V TD +FEF++ EE+K+ T KRKRLW+
Sbjct: 121 AGPESFIIEAFKSSEPFTERLLERGGFFVPLATDVTTLNFEFDDREEVKDITTKRKRLWR 180
Query: 261 LTPVYVSEWS 270
LTPVY++EWS
Sbjct: 181 LTPVYMTEWS 190
>gi|15081662|gb|AAK82486.1| AT4g28740/F16A16_150 [Arabidopsis thaliana]
gi|20147175|gb|AAM10304.1| AT4g28740/F16A16_150 [Arabidopsis thaliana]
Length = 347
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 136/219 (62%), Positives = 175/219 (79%), Gaps = 3/219 (1%)
Query: 55 TWTAKRSPLCSNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLI 114
T T K+ PL N C A D EIS+ ARIR+EVLSPFRSVRMFFYLAF+ASGSLG LI
Sbjct: 68 TITRKQQPLSFNTVCFAADEPS-EISADARIRSEVLSPFRSVRMFFYLAFIASGSLGELI 126
Query: 115 AITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLS 174
A ++LIG LANPAR+ +V EI+KGL +D+ A SLFAFLY ENK KNAQ+ARLSREE+L
Sbjct: 127 ATSRLIGTLANPARSGEVLEIVKGLGVDIGAASLFAFLYFNENKTKNAQMARLSREENLG 186
Query: 175 NLKLRVDE-KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTT 233
LK+RV+E K+ISV +LRG+ARLVIC+GPA +I E+FKRS+ +T+ L+ERGV+VV++ T
Sbjct: 187 KLKMRVEENNKVISVGDLRGVARLVICAGPAEFIEEAFKRSKEYTQGLVERGVVVVAYAT 246
Query: 234 DGNIPSFEFEESE-EMKEFTRKRKRLWQLTPVYVSEWSK 271
DGN P EF+E++ +E +++RK+LW++TPV+V EW K
Sbjct: 247 DGNSPVLEFDETDIADEEMSQRRKKLWRVTPVFVPEWEK 285
>gi|326520740|dbj|BAJ92733.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 334
Score = 256 bits (653), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 128/198 (64%), Positives = 155/198 (78%), Gaps = 6/198 (3%)
Query: 80 SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGL 139
SS A+IR+EVLSPFRSVRMFFYLAF+AS LG LIA+TQL+ AL NPARAA VPE LKGL
Sbjct: 75 SSMAKIRSEVLSPFRSVRMFFYLAFMASAGLGSLIALTQLLPALGNPARAAGVPETLKGL 134
Query: 140 SIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRV-DEKKIISVNNLRGIARLV 198
IDV+AV++FAFLYSR+ KA +AQ+ARL+REE+LS LKLRV D +++ ++ LRG ARLV
Sbjct: 135 GIDVAAVAVFAFLYSRDRKASDAQVARLTREENLSRLKLRVGDGGRVVPLSELRGSARLV 194
Query: 199 ICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEES-----EEMKEFTR 253
I +GPA ++ ESF+RS PF L+ERGVLV+ F TDGN P+ EF E E E R
Sbjct: 195 IVAGPAEFVAESFRRSRPFLRDLMERGVLVLPFPTDGNAPALEFGEDADGGEAEDDEVAR 254
Query: 254 KRKRLWQLTPVYVSEWSK 271
K +RLWQLTPVY SEW+K
Sbjct: 255 KSRRLWQLTPVYTSEWAK 272
>gi|326517956|dbj|BAK07230.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 273
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 128/198 (64%), Positives = 155/198 (78%), Gaps = 6/198 (3%)
Query: 80 SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGL 139
SS A+IR+EVLSPFRSVRMFFYLAF+AS LG LIA+TQL+ AL NPARAA VPE LKGL
Sbjct: 72 SSMAKIRSEVLSPFRSVRMFFYLAFMASAGLGSLIALTQLLPALGNPARAAGVPETLKGL 131
Query: 140 SIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRV-DEKKIISVNNLRGIARLV 198
IDV+AV++FAFLYSR+ KA +AQ+ARL+REE+LS LKLRV D +++ ++ LRG ARLV
Sbjct: 132 GIDVAAVAVFAFLYSRDRKASDAQVARLTREENLSRLKLRVGDGGRVVPLSELRGSARLV 191
Query: 199 ICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEES-----EEMKEFTR 253
I +GPA ++ ESF+RS PF L+ERGVLV+ F TDGN P+ EF E E E R
Sbjct: 192 IVAGPAEFVAESFRRSRPFLRDLMERGVLVLPFPTDGNAPALEFGEDADGGEAEDDEVAR 251
Query: 254 KRKRLWQLTPVYVSEWSK 271
K +RLWQLTPVY SEW+K
Sbjct: 252 KSRRLWQLTPVYTSEWAK 269
>gi|32489514|emb|CAE04717.1| OSJNBa0043L24.5 [Oryza sativa Japonica Group]
gi|38567844|emb|CAE05687.2| OSJNBb0002J11.14 [Oryza sativa Japonica Group]
gi|116310319|emb|CAH67335.1| OSIGBa0157A06.4 [Oryza sativa Indica Group]
gi|125590937|gb|EAZ31287.1| hypothetical protein OsJ_15394 [Oryza sativa Japonica Group]
gi|215692749|dbj|BAG88169.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 330
Score = 244 bits (622), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 128/222 (57%), Positives = 169/222 (76%), Gaps = 14/222 (6%)
Query: 64 CSNITCSATDRSDP------EISSA--ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIA 115
C + CSA ++ P E+SS A+IR+EVLSPFRSVRMFFYLAF+AS LG LIA
Sbjct: 46 CRGVRCSAANKPSPSTAPGTEVSSTSMAKIRSEVLSPFRSVRMFFYLAFMASAGLGALIA 105
Query: 116 ITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSN 175
+TQLI AL++PARAA E LKGL IDV+AVS+FAFLY RE+KAK+AQ+A+L+REE+LS
Sbjct: 106 LTQLIPALSSPARAAAAGETLKGLGIDVAAVSVFAFLYWRESKAKDAQVAKLTREENLSR 165
Query: 176 LKLRVDE-KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTD 234
L++R E + + + LRG ARLVI +GPA+++ ESF+RS+PF + L+ERGVLVV F+TD
Sbjct: 166 LRIRAGEGRPPVPLGELRGTARLVIVAGPAAFVTESFRRSKPFLKDLMERGVLVVPFSTD 225
Query: 235 GNIPSFEFEESEEMKEFT-----RKRKRLWQLTPVYVSEWSK 271
GN P +F+E++E +E + ++RLWQLTPVY SEW+K
Sbjct: 226 GNAPDLQFDEADEEEEEAAAAAGKMKRRLWQLTPVYTSEWAK 267
>gi|357167936|ref|XP_003581403.1| PREDICTED: uncharacterized protein LOC100835810 [Brachypodium
distachyon]
Length = 326
Score = 238 bits (606), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 124/217 (57%), Positives = 158/217 (72%), Gaps = 14/217 (6%)
Query: 69 CSATDRSDP---EIS-----SAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLI 120
CSA ++ P E+S S A+IR+EVLSPFRSVRMFFYLAF+AS LG LIA+TQL+
Sbjct: 48 CSAANKPSPPPPEVSNSSSNSMAKIRSEVLSPFRSVRMFFYLAFMASAGLGSLIALTQLV 107
Query: 121 GALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRV 180
AL NPAR A E L+GL IDV+AV++FAFLYSR+ KA +AQ+ARL+REE+LS LKLRV
Sbjct: 108 PALTNPARLAGAAETLRGLGIDVAAVAVFAFLYSRDRKASDAQVARLTREENLSRLKLRV 167
Query: 181 DE-KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPS 239
+++ + LRG ARLVI +GPA ++ ESF+RS+PF + L++RGVLV+ F TDGN P+
Sbjct: 168 GAGGRVVPLGELRGSARLVIVAGPAEFVEESFRRSKPFLKDLMDRGVLVLPFPTDGNTPA 227
Query: 240 FEFE-----ESEEMKEFTRKRKRLWQLTPVYVSEWSK 271
+F+ E K KRLWQLTPVY SEW+K
Sbjct: 228 LQFDGEAEEGENEGDGIGEKSKRLWQLTPVYTSEWAK 264
>gi|215694761|dbj|BAG89952.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 250
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 120/195 (61%), Positives = 157/195 (80%), Gaps = 6/195 (3%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
A+IR+EVLSPFRSVRMFFYLAF+AS LG LIA+TQLI AL++PARAA E LKGL ID
Sbjct: 2 AKIRSEVLSPFRSVRMFFYLAFMASAGLGALIALTQLIPALSSPARAAAAGETLKGLGID 61
Query: 143 VSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDE-KKIISVNNLRGIARLVICS 201
V+AVS+FAFLY RE+KAK+AQ+A+L+REE+LS L++R E + + + LRG ARLVI +
Sbjct: 62 VAAVSVFAFLYWRESKAKDAQVAKLTREENLSRLRIRAGEGRPPVPLGELRGTARLVIVA 121
Query: 202 GPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFT-----RKRK 256
GPA+++ ESF+RS+PF + L+ERGVLVV F+TDGN P +F+E++E +E + ++
Sbjct: 122 GPAAFVTESFRRSKPFLKDLMERGVLVVPFSTDGNAPDLQFDEADEEEEEAAAAAGKMKR 181
Query: 257 RLWQLTPVYVSEWSK 271
RLWQLTPVY SEW+K
Sbjct: 182 RLWQLTPVYTSEWAK 196
>gi|242076406|ref|XP_002448139.1| hypothetical protein SORBIDRAFT_06g021960 [Sorghum bicolor]
gi|241939322|gb|EES12467.1| hypothetical protein SORBIDRAFT_06g021960 [Sorghum bicolor]
Length = 332
Score = 234 bits (597), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 132/266 (49%), Positives = 177/266 (66%), Gaps = 27/266 (10%)
Query: 28 SRFCRQILNKYRSRTTPGLSISLPLHSTWTAKRSPLC-SNITCSATDRSDP--------- 77
S C + R RT +S +P + R+ C + + CSA ++ P
Sbjct: 9 SAVCVPVPCAERLRTPRPISRRVPHKPALSCSRA--CRAGVRCSAANKPSPPPPTTPDSS 66
Query: 78 EISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLI----GALANPARAADVP 133
E+SS A+IR+EVLSPFRSVRMFFYLAF+ASG+LGGLIA+ +L+ G+ ++PARAAD
Sbjct: 67 EVSSMAKIRSEVLSPFRSVRMFFYLAFMASGTLGGLIALARLLPLLSGSASDPARAADT- 125
Query: 134 EILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVD--EKKIISVNNL 191
LKGL IDV+AVSLFAFLY+RE+KAK+AQ+ARL+REE LS L+LRV E + +++ L
Sbjct: 126 --LKGLGIDVAAVSLFAFLYARESKAKDAQVARLAREERLSRLRLRVGAAEGRPFTLSEL 183
Query: 192 RGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEM--- 248
RG ARLVI +GPA ++ ESF+RS+PF L ER VL V F TDGN P ++ +
Sbjct: 184 RGTARLVIVAGPADFVAESFRRSQPFLRELAERAVLAVPFATDGNTPELRLDDGGDEDVI 243
Query: 249 ---KEFTRKRKRLWQLTPVYVSEWSK 271
+ R+ KRLWQLTPVY++EW++
Sbjct: 244 DGGDDVARRSKRLWQLTPVYITEWAQ 269
>gi|414586451|tpg|DAA37022.1| TPA: hypothetical protein ZEAMMB73_248209 [Zea mays]
Length = 282
Score = 196 bits (499), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 116/209 (55%), Positives = 148/209 (70%), Gaps = 18/209 (8%)
Query: 78 EISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGAL--ANPARAADVPEI 135
E+SS A+IR+EVLSPFRSVRMFFYLAF+ASG+LGGLIA+T+L+ AL ++PA AA
Sbjct: 59 EVSSIAKIRSEVLSPFRSVRMFFYLAFMASGTLGGLIALTRLLPALSGSDPASAAST--- 115
Query: 136 LKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLR--------VDEKKIIS 187
LKGL IDV+AVSLFAFLYSRE+KAK+AQ+ARL+REE LS L+LR + +
Sbjct: 116 LKGLGIDVAAVSLFAFLYSRESKAKDAQVARLAREERLSRLRLRLRAGAAEGAGSSRPFT 175
Query: 188 VNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF----- 242
+ LRG RLVI +GPA ++ ESF+RS+PF L ER VL V F TDG+ P
Sbjct: 176 LGELRGTTRLVIVAGPAEFVAESFRRSQPFLRELAERAVLAVPFATDGSTPELRLDDGGD 235
Query: 243 EESEEMKEFTRKRKRLWQLTPVYVSEWSK 271
+E + R+ KRLWQLTPV+ +EW++
Sbjct: 236 DEDGVDDDVARRSKRLWQLTPVFTTEWAQ 264
>gi|414586452|tpg|DAA37023.1| TPA: hypothetical protein ZEAMMB73_248209 [Zea mays]
Length = 327
Score = 196 bits (498), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 116/209 (55%), Positives = 148/209 (70%), Gaps = 18/209 (8%)
Query: 78 EISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGAL--ANPARAADVPEI 135
E+SS A+IR+EVLSPFRSVRMFFYLAF+ASG+LGGLIA+T+L+ AL ++PA AA
Sbjct: 59 EVSSIAKIRSEVLSPFRSVRMFFYLAFMASGTLGGLIALTRLLPALSGSDPASAAST--- 115
Query: 136 LKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLR--------VDEKKIIS 187
LKGL IDV+AVSLFAFLYSRE+KAK+AQ+ARL+REE LS L+LR + +
Sbjct: 116 LKGLGIDVAAVSLFAFLYSRESKAKDAQVARLAREERLSRLRLRLRAGAAEGAGSSRPFT 175
Query: 188 VNNLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF----- 242
+ LRG RLVI +GPA ++ ESF+RS+PF L ER VL V F TDG+ P
Sbjct: 176 LGELRGTTRLVIVAGPAEFVAESFRRSQPFLRELAERAVLAVPFATDGSTPELRLDDGGD 235
Query: 243 EESEEMKEFTRKRKRLWQLTPVYVSEWSK 271
+E + R+ KRLWQLTPV+ +EW++
Sbjct: 236 DEDGVDDDVARRSKRLWQLTPVFTTEWAQ 264
>gi|226496251|ref|NP_001140255.1| uncharacterized protein LOC100272297 [Zea mays]
gi|194698712|gb|ACF83440.1| unknown [Zea mays]
gi|414586449|tpg|DAA37020.1| TPA: hypothetical protein ZEAMMB73_248209 [Zea mays]
Length = 229
Score = 193 bits (490), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 115/207 (55%), Positives = 146/207 (70%), Gaps = 18/207 (8%)
Query: 80 SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGAL--ANPARAADVPEILK 137
SS A+IR+EVLSPFRSVRMFFYLAF+ASG+LGGLIA+T+L+ AL ++PA AA LK
Sbjct: 8 SSIAKIRSEVLSPFRSVRMFFYLAFMASGTLGGLIALTRLLPALSGSDPASAAST---LK 64
Query: 138 GLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLR--------VDEKKIISVN 189
GL IDV+AVSLFAFLYSRE+KAK+AQ+ARL+REE LS L+LR + ++
Sbjct: 65 GLGIDVAAVSLFAFLYSRESKAKDAQVARLAREERLSRLRLRLRAGAAEGAGSSRPFTLG 124
Query: 190 NLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF-----EE 244
LRG RLVI +GPA ++ ESF+RS+PF L ER VL V F TDG+ P +E
Sbjct: 125 ELRGTTRLVIVAGPAEFVAESFRRSQPFLRELAERAVLAVPFATDGSTPELRLDDGGDDE 184
Query: 245 SEEMKEFTRKRKRLWQLTPVYVSEWSK 271
+ R+ KRLWQLTPV+ +EW++
Sbjct: 185 DGVDDDVARRSKRLWQLTPVFTTEWAQ 211
>gi|168020188|ref|XP_001762625.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686033|gb|EDQ72424.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 261
Score = 193 bits (490), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 97/202 (48%), Positives = 141/202 (69%), Gaps = 16/202 (7%)
Query: 80 SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGL 139
SS A+IR+EV++PFR++RMFFY AF+ASG +GGLIAIT+LI AL N + D EI KG+
Sbjct: 4 SSDAKIRSEVIAPFRALRMFFYGAFMASGGVGGLIAITRLIAALNNAPGSGDPIEIAKGI 63
Query: 140 SIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVI 199
+ID AV +FA LY ++KA++ LA+++REE+LSNL+L + ++++++ LRG ARLVI
Sbjct: 64 AIDFGAVLIFALLYRSDSKARDKSLAKIAREENLSNLRLELSNRRVVTIGELRGTARLVI 123
Query: 200 CSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFE----------FEESEEMK 249
+GP SYI E+FK+SEP+ + LL+RGVLV ++ TDG SF +S +
Sbjct: 124 LAGPGSYIEEAFKQSEPYKQELLDRGVLVAAYATDG--ASFADYSSVSSSIESNQSVAVA 181
Query: 250 EFTRKRKRLWQLTPVYVSEWSK 271
+ +K W+ TP+Y SEW +
Sbjct: 182 DLEKK----WRATPIYTSEWGR 199
>gi|414586450|tpg|DAA37021.1| TPA: hypothetical protein ZEAMMB73_248209 [Zea mays]
Length = 274
Score = 192 bits (489), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 115/207 (55%), Positives = 146/207 (70%), Gaps = 18/207 (8%)
Query: 80 SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGAL--ANPARAADVPEILK 137
SS A+IR+EVLSPFRSVRMFFYLAF+ASG+LGGLIA+T+L+ AL ++PA AA LK
Sbjct: 8 SSIAKIRSEVLSPFRSVRMFFYLAFMASGTLGGLIALTRLLPALSGSDPASAAST---LK 64
Query: 138 GLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLR--------VDEKKIISVN 189
GL IDV+AVSLFAFLYSRE+KAK+AQ+ARL+REE LS L+LR + ++
Sbjct: 65 GLGIDVAAVSLFAFLYSRESKAKDAQVARLAREERLSRLRLRLRAGAAEGAGSSRPFTLG 124
Query: 190 NLRGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF-----EE 244
LRG RLVI +GPA ++ ESF+RS+PF L ER VL V F TDG+ P +E
Sbjct: 125 ELRGTTRLVIVAGPAEFVAESFRRSQPFLRELAERAVLAVPFATDGSTPELRLDDGGDDE 184
Query: 245 SEEMKEFTRKRKRLWQLTPVYVSEWSK 271
+ R+ KRLWQLTPV+ +EW++
Sbjct: 185 DGVDDDVARRSKRLWQLTPVFTTEWAQ 211
>gi|414586447|tpg|DAA37018.1| TPA: hypothetical protein ZEAMMB73_248209 [Zea mays]
Length = 204
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 100/189 (52%), Positives = 129/189 (68%), Gaps = 18/189 (9%)
Query: 98 MFFYLAFVASGSLGGLIAITQLIGALA--NPARAADVPEILKGLSIDVSAVSLFAFLYSR 155
MFFYLAF+ASG+LGGLIA+T+L+ AL+ +PA AA LKGL IDV+AVSLFAFLYSR
Sbjct: 1 MFFYLAFMASGTLGGLIALTRLLPALSGSDPASAAST---LKGLGIDVAAVSLFAFLYSR 57
Query: 156 ENKAKNAQLARLSREESLSNLKLR--------VDEKKIISVNNLRGIARLVICSGPASYI 207
E+KAK+AQ+ARL+REE LS L+LR + ++ LRG RLVI +GPA ++
Sbjct: 58 ESKAKDAQVARLAREERLSRLRLRLRAGAAEGAGSSRPFTLGELRGTTRLVIVAGPAEFV 117
Query: 208 MESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF-----EESEEMKEFTRKRKRLWQLT 262
ESF+RS+PF L ER VL V F TDG+ P +E + R+ KRLWQLT
Sbjct: 118 AESFRRSQPFLRELAERAVLAVPFATDGSTPELRLDDGGDDEDGVDDDVARRSKRLWQLT 177
Query: 263 PVYVSEWSK 271
PV+ +EW++
Sbjct: 178 PVFTTEWAQ 186
>gi|194691534|gb|ACF79851.1| unknown [Zea mays]
gi|414586448|tpg|DAA37019.1| TPA: hypothetical protein ZEAMMB73_248209 [Zea mays]
Length = 249
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 100/189 (52%), Positives = 129/189 (68%), Gaps = 18/189 (9%)
Query: 98 MFFYLAFVASGSLGGLIAITQLIGALA--NPARAADVPEILKGLSIDVSAVSLFAFLYSR 155
MFFYLAF+ASG+LGGLIA+T+L+ AL+ +PA AA LKGL IDV+AVSLFAFLYSR
Sbjct: 1 MFFYLAFMASGTLGGLIALTRLLPALSGSDPASAAST---LKGLGIDVAAVSLFAFLYSR 57
Query: 156 ENKAKNAQLARLSREESLSNLKLR--------VDEKKIISVNNLRGIARLVICSGPASYI 207
E+KAK+AQ+ARL+REE LS L+LR + ++ LRG RLVI +GPA ++
Sbjct: 58 ESKAKDAQVARLAREERLSRLRLRLRAGAAEGAGSSRPFTLGELRGTTRLVIVAGPAEFV 117
Query: 208 MESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF-----EESEEMKEFTRKRKRLWQLT 262
ESF+RS+PF L ER VL V F TDG+ P +E + R+ KRLWQLT
Sbjct: 118 AESFRRSQPFLRELAERAVLAVPFATDGSTPELRLDDGGDDEDGVDDDVARRSKRLWQLT 177
Query: 263 PVYVSEWSK 271
PV+ +EW++
Sbjct: 178 PVFTTEWAQ 186
>gi|302811321|ref|XP_002987350.1| hypothetical protein SELMODRAFT_125820 [Selaginella moellendorffii]
gi|300144985|gb|EFJ11665.1| hypothetical protein SELMODRAFT_125820 [Selaginella moellendorffii]
Length = 287
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 86/204 (42%), Positives = 132/204 (64%), Gaps = 12/204 (5%)
Query: 80 SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA-----ADVPE 134
S ARIR EV+SPFRS+R+F Y A +AS ++ L A+ +LI ALAN A A + +
Sbjct: 22 SREARIREEVISPFRSLRLFLYPALMASATIAALTALPRLIAALANSPNATVENNASLGQ 81
Query: 135 ILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGI 194
+L+GL IDV+A+ +FA LY + KA++ QLA+LSREESL++L++ + K++ ++ +RG
Sbjct: 82 VLEGLGIDVAAIVVFALLYRSDTKARDLQLAKLSREESLASLRVELGNKRVATLEQMRGS 141
Query: 195 ARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTT-------DGNIPSFEFEESEE 247
AR VI +GP S++ ++ + S PF E LLERGV +V F T + SF+ ++S+
Sbjct: 142 ARFVIIAGPKSHVDQALELSLPFQEQLLERGVTLVPFVTPPAASTEESQSSSFQAQKSQA 201
Query: 248 MKEFTRKRKRLWQLTPVYVSEWSK 271
+ + R W + P+Y SEW+K
Sbjct: 202 KDDDSAAAARRWIVLPIYTSEWTK 225
>gi|218195173|gb|EEC77600.1| hypothetical protein OsI_16566 [Oryza sativa Indica Group]
Length = 805
Score = 163 bits (413), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 82/144 (56%), Positives = 113/144 (78%), Gaps = 6/144 (4%)
Query: 134 EILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDE-KKIISVNNLR 192
E LKGL IDV+AVS+FAFLY RE+KAK+AQ+A+L+REE+LS L++R E + + + LR
Sbjct: 292 ETLKGLGIDVAAVSVFAFLYWRESKAKDAQVAKLTREENLSRLRIRAGEGRPPVPLGELR 351
Query: 193 GIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFT 252
G ARLVI +GPA+++ ESF+RS+PF + L+ERGVLVV F+TDGN P +F+E++E +E
Sbjct: 352 GTARLVIVAGPAAFVTESFRRSKPFLKDLMERGVLVVPFSTDGNAPDLQFDEADEEEEEA 411
Query: 253 -----RKRKRLWQLTPVYVSEWSK 271
+ ++RLWQLTPVY SEW+K
Sbjct: 412 AAAAGKMKRRLWQLTPVYTSEWAK 435
Score = 163 bits (413), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 82/144 (56%), Positives = 113/144 (78%), Gaps = 6/144 (4%)
Query: 134 EILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDE-KKIISVNNLR 192
E LKGL IDV+AVS+FAFLY RE+KAK+AQ+A+L+REE+LS L++R E + + + LR
Sbjct: 599 ETLKGLGIDVAAVSVFAFLYWRESKAKDAQVAKLTREENLSRLRIRAGEGRPPVPLGELR 658
Query: 193 GIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFT 252
G ARLVI +GPA+++ ESF+RS+PF + L+ERGVLVV F+TDGN P +F+E++E +E
Sbjct: 659 GTARLVIVAGPAAFVTESFRRSKPFLKDLMERGVLVVPFSTDGNAPDLQFDEADEEEEEA 718
Query: 253 -----RKRKRLWQLTPVYVSEWSK 271
+ ++RLWQLTPVY SEW+K
Sbjct: 719 AAAAGKMKRRLWQLTPVYTSEWAK 742
>gi|302814937|ref|XP_002989151.1| hypothetical protein SELMODRAFT_129404 [Selaginella moellendorffii]
gi|300143051|gb|EFJ09745.1| hypothetical protein SELMODRAFT_129404 [Selaginella moellendorffii]
Length = 290
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 86/207 (41%), Positives = 132/207 (63%), Gaps = 15/207 (7%)
Query: 80 SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA-----ADVPE 134
S ARIR EV+SPFRS+R+F Y A +AS ++ L A+ +LI ALAN A A + +
Sbjct: 22 SREARIREEVISPFRSLRLFLYPALMASATIAALTALPRLIAALANSPNATVENNASLGQ 81
Query: 135 ILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGI 194
+L+GL IDV+A+ +FA LY + KA++ QLA+LSREESL++L++ + K++ ++ +RG
Sbjct: 82 VLEGLGIDVTAIVIFALLYRSDTKARDLQLAKLSREESLASLRVELGNKRVATLEQMRGS 141
Query: 195 ARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTT-------DGNIPSFEFEESEE 247
AR VI +GP S++ ++ + S PF E LLERGV +V F T + SF+ ++S+
Sbjct: 142 ARFVIIAGPKSHVDQALELSLPFQEQLLERGVTLVPFVTPPAASTEESQSSSFQAQKSQA 201
Query: 248 MKEFT---RKRKRLWQLTPVYVSEWSK 271
+ + R W + P+Y SEW+K
Sbjct: 202 KDDDSAAAAAAARRWIVLPIYTSEWTK 228
>gi|255088718|ref|XP_002506281.1| predicted protein [Micromonas sp. RCC299]
gi|226521553|gb|ACO67539.1| predicted protein [Micromonas sp. RCC299]
Length = 244
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/191 (35%), Positives = 106/191 (55%), Gaps = 16/191 (8%)
Query: 85 IRNEVLSPFRSVRMFFYLAFVASGSLG----GLIAITQLIGALANPARAADVPEILKGLS 140
+R+E +PFR+ RMF + AF A+ +LG L A+T+ +GA + P + + L+ +
Sbjct: 1 MRSEAQAPFRTARMFVFGAFAANAALGLSIATLQAVTKALGAPSAPP----LDQSLQNIG 56
Query: 141 IDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVIC 200
ID+ A F +LY ++ + + Q+AR+SREE LS LK + K++S+ +LRG +R+VI
Sbjct: 57 IDLGAALFFGYLYKKDAENREKQMARISREERLSALKCELSSGKVVSLFDLRGFSRVVIA 116
Query: 201 SGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFTRKRKRLWQ 260
+G A Y S +E + LLERGVLV+ G S + + + F +
Sbjct: 117 AGDADYCNASIAAAEAVKDPLLERGVLVIPLVLGGAAESVDAPDDATDRRF--------R 168
Query: 261 LTPVYVSEWSK 271
TPVY +W K
Sbjct: 169 ATPVYTDKWVK 179
>gi|302828698|ref|XP_002945916.1| hypothetical protein VOLCADRAFT_120237 [Volvox carteri f.
nagariensis]
gi|300268731|gb|EFJ52911.1| hypothetical protein VOLCADRAFT_120237 [Volvox carteri f.
nagariensis]
Length = 327
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 109/205 (53%), Gaps = 22/205 (10%)
Query: 81 SAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLS 140
A ++R E +PFRS+R+ F V S LG LI+I QLIGAL A + ++ + L+
Sbjct: 64 DAVQMRAEAEAPFRSLRLVFSGFSVVSAGLGFLISIPQLIGALGGARSALPLEQVQQNLA 123
Query: 141 IDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVIC 200
ID AV++FAFL+ + A+ QLARL REE+L L +R+ K + + +LRG AR+V+
Sbjct: 124 IDAGAVAVFAFLFKLDWDARTKQLARLQREETLGALPIRLASGKSLRLGDLRGSARVVLV 183
Query: 201 SGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEF---------------EES 245
+G + + + +E E L++RGVLVV FE EE
Sbjct: 184 AGTRRQVAAALREAESLREELVKRGVLVVPVVL------FELASSGAGGGAAGGGATEEP 237
Query: 246 EEMKEFTRKRKRLWQLTPVYVSEWS 270
EE+ + T +R + P ++EW
Sbjct: 238 EELPQLTPADQRF-RAEPQRLAEWQ 261
>gi|145355072|ref|XP_001421795.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582033|gb|ABP00089.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 290
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 105/196 (53%), Gaps = 13/196 (6%)
Query: 75 SDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPE 134
D S+ ++R+E +PFR R F Y A AS ++G IA Q A A +
Sbjct: 40 DDARYSNETKLRSEAQAPFRVARQFLYGACAASATIGFGIATIQAATKAAGAPNAPPLEG 99
Query: 135 ILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGI 194
L+ L+ID +A++ FA+LY+RE A+ Q+AR+ REE L L++ + K + + +LR
Sbjct: 100 SLENLAIDGAALATFAWLYAREEAARERQMARIGREERLGRLRVELAGGKTVRLEDLRSF 159
Query: 195 ARLVICSGPASYIMESFKRSEPFTESLLERGVLVVS-FTTDGNIPSFEFEESEEMKEFTR 253
+R+V+ SG +Y + + +E E+L+ERGVLVV DG I + E+
Sbjct: 160 SRVVVVSGDEAYARTALEDAEDVREALIERGVLVVPVIRGDGAIEAPSAED--------- 210
Query: 254 KRKRLWQLTPVYVSEW 269
R ++ TP+ ++W
Sbjct: 211 ---RKFRCTPLRANDW 223
>gi|303288507|ref|XP_003063542.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455374|gb|EEH52678.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 348
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 96/160 (60%), Gaps = 4/160 (2%)
Query: 80 SSAARIRNEVLSPFRSVRMFFYLAFVASGSLG-GLIAITQLIGALANPARAADVPEILKG 138
S+ ++R+E +PFR+ RMF + AF A+ +LG G+ + + AL P+ A + + L+
Sbjct: 89 SNETKMRSEAQAPFRTARMFLFGAFAANATLGLGIATLQAVTKALGAPS-APPLEDSLQN 147
Query: 139 LSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRV--DEKKIISVNNLRGIAR 196
+ +D+ + FAF Y ++ + ++ Q+AR+SREE L LK ++ K +S+ LRG +R
Sbjct: 148 MGVDLLCAAFFAFFYKKDAEGRDKQMARISREERLGALKCQLGGGSGKRVSMAQLRGFSR 207
Query: 197 LVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTDGN 236
+VI +G A + +ES +E E L RGVL+V T G+
Sbjct: 208 VVIAAGSAEHCVESVNAAEAHREELTRRGVLLVPVVTSGD 247
>gi|302820375|ref|XP_002991855.1| hypothetical protein SELMODRAFT_134250 [Selaginella moellendorffii]
gi|300140393|gb|EFJ07117.1| hypothetical protein SELMODRAFT_134250 [Selaginella moellendorffii]
Length = 422
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 87/143 (60%)
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
+EV +PFR VR FFY+AF + + I + +LI AL A V E + L+ID++ +
Sbjct: 161 SEVQAPFRGVRKFFYVAFTIAAGIATFITVPRLIRALQGGEDAPSVTETAQNLAIDLAGI 220
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
F L+ ++K + Q+++++R E+LS L +R+ +I+ V LRGI R VI +G
Sbjct: 221 VAFVSLFIWDSKKEETQMSQITRNETLSRLPVRLGTGRIVEVVQLRGITRPVILAGTKES 280
Query: 207 IMESFKRSEPFTESLLERGVLVV 229
I + +++E + + LL+RGVL++
Sbjct: 281 IKGALEKAERYRKDLLQRGVLLI 303
>gi|302822713|ref|XP_002993013.1| hypothetical protein SELMODRAFT_162731 [Selaginella moellendorffii]
gi|300139213|gb|EFJ05959.1| hypothetical protein SELMODRAFT_162731 [Selaginella moellendorffii]
Length = 422
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 87/143 (60%)
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
+EV +PFR VR FFY+AF + + I + +LI AL A V E + L+ID++ +
Sbjct: 161 SEVQAPFRGVRKFFYVAFTIAAGIATFITVPRLIRALQGGEDAPSVTETAQNLAIDLAGI 220
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
F L+ ++K + Q+++++R E+LS L +R+ +I+ V LRGI R VI +G
Sbjct: 221 VAFVSLFIWDSKKEETQMSQITRNETLSRLPVRLGTGRIVEVVQLRGITRPVILAGTKES 280
Query: 207 IMESFKRSEPFTESLLERGVLVV 229
I + +++E + + LL+RGVL++
Sbjct: 281 IKGALEKAERYRKDLLQRGVLLI 303
>gi|412985100|emb|CCO20125.1| predicted protein [Bathycoccus prasinos]
Length = 261
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 91/157 (57%)
Query: 78 EISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILK 137
+ S+ ++R+E +PFR R F Y + S ++G IA QL+ LA + + +
Sbjct: 4 KFSNETKLRSEAQAPFRVARQFLYASCALSATIGFGIATIQLLTGLAGAPNSPPFDQSVS 63
Query: 138 GLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARL 197
+ ID + +LF FL+ ++ +AK Q++R+ REE L LK+ + KI+ + +LRG +R+
Sbjct: 64 NVGIDGACAALFVFLWKKDEEAKQKQMSRIGREERLGGLKVELTTGKIVRMQSLRGFSRV 123
Query: 198 VICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTD 234
V+ +G Y+ +S + +E + + L+++GV +V D
Sbjct: 124 VLFAGDLEYLEKSLEVAEEYKDELIKKGVFLVPVAMD 160
>gi|384250888|gb|EIE24367.1| hypothetical protein COCSUDRAFT_83671 [Coccomyxa subellipsoidea
C-169]
Length = 253
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 94/177 (53%), Gaps = 1/177 (0%)
Query: 94 RSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLY 153
R++R+ + F S S+G LIA TQ+I +L RA + ++ + L ID++A FAF
Sbjct: 12 RALRLVLFGFFGISASVGALIATTQVIASLGGAQRALPLADVAQSLGIDLAAAGTFAFFL 71
Query: 154 SRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASYIMESFKR 213
R+ +A++ Q+ARL+REE L L L + + + V LR +R+VI +G + + +
Sbjct: 72 RRDLQARDKQIARLTREERLGALPLSLANGRRVKVGQLRSFSRVVIAAGSPEQVAAAVQA 131
Query: 214 SEPFTESLLERGVLVVSFTT-DGNIPSFEFEESEEMKEFTRKRKRLWQLTPVYVSEW 269
+EP E L ERGVLVV G+ S E + W+ TP+ + EW
Sbjct: 132 AEPLREQLQERGVLVVPLPIFPGDPASSNGAAPAESSGSSSADDLRWKATPLRLEEW 188
>gi|307110349|gb|EFN58585.1| hypothetical protein CHLNCDRAFT_140754 [Chlorella variabilis]
Length = 272
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 90/151 (59%), Gaps = 1/151 (0%)
Query: 82 AARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALAN-PARAADVPEILKGLS 140
A +R E +PFRS+R+F + A AS +L L + LIGAL+ P + E L+ L+
Sbjct: 11 GASLRAEAEAPFRSLRLFLFGAGAASAALATLFGLPSLIGALSGAPGATKSIAEALQDLA 70
Query: 141 IDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVIC 200
I+V +VS+ +L R+ +A++ Q+ARL RE+ L +L + +++ + LRG AR+V+
Sbjct: 71 INVGSVSVLGYLVVRDLQARDIQMARLMREDELGACQLELANGRVLRLAQLRGFARVVLM 130
Query: 201 SGPASYIMESFKRSEPFTESLLERGVLVVSF 231
+G + + +EP+ + L ERGVLVV
Sbjct: 131 AGTPQQVAAALAGAEPYKQQLQERGVLVVPL 161
>gi|449484299|ref|XP_004156843.1| PREDICTED: uncharacterized LOC101208184 [Cucumis sativus]
Length = 435
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 88/159 (55%), Gaps = 1/159 (0%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++ +EV +PFR VR FFY+A A+ + L I +L A+ A DV E L+++V
Sbjct: 168 KLISEVQAPFRGVRKFFYVALSAAAGISLLFNIPRLFRAIQGGDGAPDVWETAGNLAVNV 227
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
+ +F L+ +NK + QLA++SR E+LS L LR+ +I+ + LR R VI +G
Sbjct: 228 GGIIVFVALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGK 287
Query: 204 ASYIMESFKRSEPFTESLLERGVLVVSFT-TDGNIPSFE 241
+ + +++E F LL RGVL+V +G P E
Sbjct: 288 KETVSSAIQKAERFRTELLRRGVLLVPVIWGEGREPQIE 326
>gi|115436512|ref|NP_001043014.1| Os01g0358300 [Oryza sativa Japonica Group]
gi|53791615|dbj|BAD52962.1| unknown protein [Oryza sativa Japonica Group]
gi|113532545|dbj|BAF04928.1| Os01g0358300 [Oryza sativa Japonica Group]
gi|215765732|dbj|BAG87429.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218188197|gb|EEC70624.1| hypothetical protein OsI_01883 [Oryza sativa Indica Group]
gi|222618419|gb|EEE54551.1| hypothetical protein OsJ_01736 [Oryza sativa Japonica Group]
Length = 455
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 83/143 (58%)
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
+EV +PFR VR FFY+A A+ + I +LI A+ A D+ E + +I++ +
Sbjct: 193 SEVQAPFRGVRRFFYVALTAAAGISTFFTIPRLILAVQGGDGAPDLLETVGNAAINIGGI 252
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
+ LY ENK + Q+ ++SR E+LS L +R+ +II + LR I R VI +G +
Sbjct: 253 VVLVALYFWENKKEEQQITQISRNETLSRLPVRLSTNRIIELVQLRDITRPVILAGSKAS 312
Query: 207 IMESFKRSEPFTESLLERGVLVV 229
I + +R+E + LL+RGVL++
Sbjct: 313 ITRAMQRAERYRTELLKRGVLLI 335
>gi|326496368|dbj|BAJ94646.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 82/143 (57%)
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
+EV +PFR VR FFY+AF+A+ + I +LI AL A D E +I++ +
Sbjct: 186 SEVQAPFRGVRRFFYVAFIAAAGISTFFTIPRLILALQGGDGAPDFLETAGNAAINIGGI 245
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
+ L+ ENK + Q+ +SR E+LS L +R+ +I + LR I R VI +G +
Sbjct: 246 VVLVALFVWENKKEEEQITNISRNETLSRLPVRLSTNRITELVQLRDITRPVILAGSKAS 305
Query: 207 IMESFKRSEPFTESLLERGVLVV 229
+ ++ +R+E + LL+RGVL++
Sbjct: 306 VTQAMQRAERYRTELLKRGVLLI 328
>gi|225437302|ref|XP_002263425.1| PREDICTED: uncharacterized protein LOC100245310 [Vitis vinifera]
gi|297743856|emb|CBI36826.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 81/143 (56%)
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
+EV +PFR VR FFY+AF A+ + I +L A+ A D+ E +I++ +
Sbjct: 173 SEVQAPFRGVRRFFYVAFSAAAGISTFFTIPRLFRAIKGGDGAPDIWETAGNAAINIGGI 232
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
L+ +NK + QLA++SR+E+LS L LR+ +++ + LR R VI +G
Sbjct: 233 IALVLLFFWDNKKEEEQLAQISRDETLSRLPLRLSTNRVVELVQLRDTVRPVILAGTKET 292
Query: 207 IMESFKRSEPFTESLLERGVLVV 229
+ + K++E F LL+RGVL+V
Sbjct: 293 VSLAIKKAERFRTELLKRGVLLV 315
>gi|223972901|gb|ACN30638.1| unknown [Zea mays]
Length = 437
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 82/143 (57%)
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
+EV +PFR VR FFY+A A+ + I +L+ A+ A D+ E +I++ +
Sbjct: 175 SEVQAPFRGVRRFFYVALTAAAGISTFFTIPRLVLAVRGGDGAPDLLETAGNAAINIGGI 234
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
+ LY ENK + Q+ ++SR E+LS L +R+ +II + LR I R VI +G +
Sbjct: 235 VVLVALYFWENKKEEKQITQISRNETLSRLPVRLSTNRIIELVQLRDITRPVILAGSKAS 294
Query: 207 IMESFKRSEPFTESLLERGVLVV 229
+ + +R+E + LL+RGVL++
Sbjct: 295 VTRALQRAERYRTELLKRGVLLI 317
>gi|414877423|tpg|DAA54554.1| TPA: hypothetical protein ZEAMMB73_812331, partial [Zea mays]
Length = 374
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 82/143 (57%)
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
+EV +PFR VR FFY+A A+ + I +L+ A+ A D+ E +I++ +
Sbjct: 112 SEVQAPFRGVRRFFYVALTAAAGISTFFTIPRLVLAVRGGDGAPDLLETAGNAAINIGGI 171
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
+ LY ENK + Q+ ++SR E+LS L +R+ +II + LR I R VI +G +
Sbjct: 172 VVLVALYFWENKKEEKQITQISRNETLSRLPVRLSTNRIIELVQLRDITRPVILAGSKAS 231
Query: 207 IMESFKRSEPFTESLLERGVLVV 229
+ + +R+E + LL+RGVL++
Sbjct: 232 VTRALQRAERYRTELLKRGVLLI 254
>gi|194707430|gb|ACF87799.1| unknown [Zea mays]
Length = 450
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 82/143 (57%)
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
+EV +PFR VR FFY+A A+ + I +L+ A+ A D+ E +I++ +
Sbjct: 188 SEVQAPFRGVRRFFYVALTAAAGISTFFTIPRLVLAVRGGDGAPDLLETAGNAAINIGGI 247
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
+ LY ENK + Q+ ++SR E+LS L +R+ +II + LR I R VI +G +
Sbjct: 248 VVLVALYFWENKKEEKQITQISRNETLSRLPVRLSTNRIIELVQLRDITRPVILAGSKAS 307
Query: 207 IMESFKRSEPFTESLLERGVLVV 229
+ + +R+E + LL+RGVL++
Sbjct: 308 VTRALQRAERYRTELLKRGVLLI 330
>gi|388497672|gb|AFK36902.1| unknown [Medicago truncatula]
Length = 446
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 82/143 (57%)
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
+EV +PFR +R FFY+AF A+ + + +++ A+ A DV E ++++ +
Sbjct: 183 SEVQAPFRGIRRFFYVAFTAAAGISLFFNVPRILRAIQGGDGAPDVLETAGNAAVNIGGI 242
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
+F L+ +NK + Q+A++SR E+LS L LR+ +++ + LR R VI +G
Sbjct: 243 VVFVALFLWDNKKEEEQIAQISRNETLSRLPLRLSTNRVVELVQLRDTVRPVIIAGKKET 302
Query: 207 IMESFKRSEPFTESLLERGVLVV 229
+ + +++E F LL RGVL+V
Sbjct: 303 VTLALRKAERFRTDLLRRGVLLV 325
>gi|357511233|ref|XP_003625905.1| hypothetical protein MTR_7g108590 [Medicago truncatula]
gi|355500920|gb|AES82123.1| hypothetical protein MTR_7g108590 [Medicago truncatula]
Length = 446
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 82/143 (57%)
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
+EV +PFR +R FFY+AF A+ + + +++ A+ A DV E ++++ +
Sbjct: 183 SEVQAPFRGIRRFFYVAFTAAAGISLFFNVPRILRAIQGGDGAPDVLETAGNAAVNIGGI 242
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
+F L+ +NK + Q+A++SR E+LS L LR+ +++ + LR R VI +G
Sbjct: 243 VVFVALFLWDNKKEEEQIAQISRNETLSRLPLRLSTNRVVELVQLRDTVRPVIIAGKKET 302
Query: 207 IMESFKRSEPFTESLLERGVLVV 229
+ + +++E F LL RGVL+V
Sbjct: 303 VTLALRKAERFRTDLLRRGVLLV 325
>gi|147785370|emb|CAN77388.1| hypothetical protein VITISV_033366 [Vitis vinifera]
Length = 467
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 81/146 (55%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++ +EV +PFR VR FFY+AF A+ + I +L A+ A D+ E +I++
Sbjct: 201 KLISEVQAPFRGVRRFFYVAFSAAAGISTFFTIPRLFRAIKGGDGAPDIWETAGNAAINI 260
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
+ L +NK + QLA++SR+E+LS L LR+ +++ + LR R VI +G
Sbjct: 261 GGIIALVLLXFWDNKKEEEQLAQISRDETLSRLPLRLSTNRVVELVQLRDTVRPVILAGT 320
Query: 204 ASYIMESFKRSEPFTESLLERGVLVV 229
+ + K +E F LL+RGVL+V
Sbjct: 321 KETVSLAIKXAERFRTELLKRGVLLV 346
>gi|168023978|ref|XP_001764514.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162684378|gb|EDQ70781.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 458
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 84/146 (57%)
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
+EV +PFR +R FFY+A + + L + + I A+ A V E ++ L+I++
Sbjct: 202 SEVQAPFRGIRKFFYVALFLAAGISTLFTVPRFILAIQGGDGAPGVLETVQNLAINIGGG 261
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
+ F +Y ENK + Q++R++R+E+LS L +R+ + + ++ LR R VI +G A
Sbjct: 262 AAFIAIYLWENKKEEEQISRITRDETLSRLPVRLANNRTVELSALRENTRPVIIAGTAEN 321
Query: 207 IMESFKRSEPFTESLLERGVLVVSFT 232
+ S + S+ + E LL+RGVL++
Sbjct: 322 VERSIRASQKYREDLLKRGVLIIPLV 347
>gi|357128170|ref|XP_003565748.1| PREDICTED: uncharacterized protein LOC100821693 [Brachypodium
distachyon]
Length = 454
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 83/143 (58%)
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
+EV +PFR VR FFY+AF+A+ + I +LI A+ A D+ + +I++ +
Sbjct: 192 SEVQAPFRGVRRFFYVAFIAAAGISTFFTIPRLILAVQGGDGAPDLLQTAGNAAINIGGI 251
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
+ L+ EN+ + Q+ +SR E+LS L +R+ +I + LR I R VI +G +
Sbjct: 252 VVLVALFIWENRKEEQQITVISRNETLSRLPVRLSTNRITELVQLRDITRPVILAGSKAS 311
Query: 207 IMESFKRSEPFTESLLERGVLVV 229
+ ++ +R+E + LL+RGVL++
Sbjct: 312 VTKATQRAERYRTELLKRGVLLI 334
>gi|356572722|ref|XP_003554515.1| PREDICTED: uncharacterized protein LOC100793204 [Glycine max]
Length = 437
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 82/146 (56%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++ +EV +PFR VR FFY+AF A+ + + +L+ A+ A D+ +I++
Sbjct: 171 KLISEVQAPFRGVRRFFYVAFTAAAGISLFFTVPRLLRAINGGDGAPDLLATAGNAAINI 230
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
+ + L+ +NK + QLA++SR+E+LS L L + +++ + LR R VI +G
Sbjct: 231 GGIVVLVALFFWDNKKEEEQLAQISRDETLSRLPLCLSTNRVVELVQLRDTVRPVILAGK 290
Query: 204 ASYIMESFKRSEPFTESLLERGVLVV 229
+ + +R+E F LL RGVL+V
Sbjct: 291 KETVSLAMQRAERFQTELLRRGVLLV 316
>gi|356505558|ref|XP_003521557.1| PREDICTED: uncharacterized protein LOC100784188 [Glycine max]
Length = 437
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 82/146 (56%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++ +EV +PFR VR FFY+AF A+ + + +L+ A+ A D+ +I++
Sbjct: 171 KLISEVQAPFRGVRRFFYVAFTAAAGISLFFTLPRLLRAINGGDGAPDLLATAGNAAINI 230
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
+ + L+ +NK + QLA++SR+E+LS L L + +++ + LR R VI +G
Sbjct: 231 GGIVVLVALFFWDNKKEEEQLAQISRDETLSRLPLCLSTNRVVELVQLRDTVRPVILAGK 290
Query: 204 ASYIMESFKRSEPFTESLLERGVLVV 229
+ + +R+E F LL RGVL+V
Sbjct: 291 KETVSLAMQRAERFRTELLRRGVLLV 316
>gi|255582166|ref|XP_002531877.1| conserved hypothetical protein [Ricinus communis]
gi|223528485|gb|EEF30514.1| conserved hypothetical protein [Ricinus communis]
Length = 369
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 82/146 (56%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++ +EV +PFR +R FFY+AF A+ + L + +L A+ A D+ +I++
Sbjct: 98 KLISEVQAPFRGIRRFFYVAFSAAAGISLLFTVPRLFRAIKGGDGAPDLWNTAGNAAINI 157
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
+ + L +NK + Q+A+++R+E+LS L LR+ +++ + LR R VI +G
Sbjct: 158 GGIVVLVALLFWDNKKEEEQIAQITRDETLSRLPLRLSTNRVVELVQLRDTVRPVILAGK 217
Query: 204 ASYIMESFKRSEPFTESLLERGVLVV 229
+ + +++E F LL RGVL+V
Sbjct: 218 KETVSLAMQKAERFRTELLRRGVLLV 243
>gi|449469098|ref|XP_004152258.1| PREDICTED: uncharacterized protein LOC101208184 [Cucumis sativus]
Length = 427
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 85/159 (53%), Gaps = 9/159 (5%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++ +EV +PFR VR FFY+A A+ + L I +L A+ A DV E L+++V
Sbjct: 168 KLISEVQAPFRGVRKFFYVALSAAAGISLLFNIPRLFRAIQGGDGAPDVWETAGNLAVNV 227
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
+ +F L+ +NK + QLA++SR E+LS L LR+ +I+ + LR R VI +G
Sbjct: 228 GGIIVFVALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGK 287
Query: 204 ASYIMESFKRSEPFTESLLERGVLVVSFT-TDGNIPSFE 241
+ + TE LL RGVL+V +G P E
Sbjct: 288 KETVSSA-------TE-LLRRGVLLVPVIWGEGREPQIE 318
>gi|307104280|gb|EFN52535.1| hypothetical protein CHLNCDRAFT_58882 [Chlorella variabilis]
Length = 445
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 78/153 (50%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R+R+E +PFR R+ F +LG I ++L+ AL A D+ E + I+
Sbjct: 168 RLRSEAKAPFRLTRIIFLGGLAVGAALGLFIITSRLVAALQGGEGAPDLQETAQNFGINA 227
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
+A+ + F R+ ++N + REE+L L++ + ++++ + RG R VI +G
Sbjct: 228 AALGVLGFFVYRDLSSQNKDQQVILREEALGRLQVSLGGERVLPLAAFRGTTRPVIIAGT 287
Query: 204 ASYIMESFKRSEPFTESLLERGVLVVSFTTDGN 236
+ + + +EPF E+L ERGV VV G
Sbjct: 288 RTQLSRALAGAEPFQEALRERGVSVVPVQLAGE 320
>gi|308812838|ref|XP_003083726.1| unnamed protein product [Ostreococcus tauri]
gi|116055607|emb|CAL58275.1| unnamed protein product [Ostreococcus tauri]
Length = 273
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 74/139 (53%), Gaps = 17/139 (12%)
Query: 136 LKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVD-EKKIISVNNLRGI 194
L+ +++D A+ F LY RE+ A+ Q+AR+ REE L L+ + K + + LRG
Sbjct: 80 LENVAVDGVALPAFGTLYRREDDARARQMARIGREERLGRLRCELGGNGKTVRLEQLRGF 139
Query: 195 ARLVICSGPASYIMESFKRSEPFTESLLERGVLVV----SFTTDGNIPSFEFEESEEMKE 250
+R+V+ SG A Y+ ++ +E E+L+ERGVL+V + DG + E+
Sbjct: 140 SRVVVVSGDAEYVNDAIADAEAVREALMERGVLIVPVLNGASADGEVAPVGGED------ 193
Query: 251 FTRKRKRLWQLTPVYVSEW 269
R ++ TP+ +EW
Sbjct: 194 ------RKFRATPLRANEW 206
>gi|326487892|dbj|BAJ89785.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 359
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 69/127 (54%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++ +EV +PFR VR FFY+AF+A+ + I +LI AL A D E +I++
Sbjct: 211 KLISEVQAPFRGVRRFFYVAFIAAAGISTFFTIPRLILALQGGDGAPDFLETAGNAAINI 270
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
+ + L+ ENK + Q+ +SR E+LS L +R+ +I + LR I R V
Sbjct: 271 GGIVVLVALFVWENKKEEEQITNISRNETLSRLPVRLSTNRITELVQLRDITRPVSIILL 330
Query: 204 ASYIMES 210
AS+I+ +
Sbjct: 331 ASFILTT 337
>gi|326510437|dbj|BAJ87435.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 345
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 69/127 (54%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++ +EV +PFR VR FFY+AF+A+ + I +LI AL A D E +I++
Sbjct: 197 KLISEVQAPFRGVRRFFYVAFIAAAGISTFFTIPRLILALQGGDGAPDFLETAGNAAINI 256
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGP 203
+ + L+ ENK + Q+ +SR E+LS L +R+ +I + LR I R V
Sbjct: 257 GGIVVLVALFVWENKKEEEQITNISRNETLSRLPVRLSTNRITELVQLRDITRPVSIILL 316
Query: 204 ASYIMES 210
AS+I+ +
Sbjct: 317 ASFILTT 323
>gi|422293631|gb|EKU20931.1| hypothetical protein NGA_0121000 [Nannochloropsis gaditana CCMP526]
Length = 392
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 102/208 (49%), Gaps = 18/208 (8%)
Query: 78 EISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILK 137
+ S R+R E SPFRS+R Y ASG++G L A+T + A + A + E K
Sbjct: 126 DYSVMTRLREEAESPFRSLRFLIYGGVGASGAIGALTALTGALAAFSGVRGAVPLSESGK 185
Query: 138 GLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVD------EKKIISVNNL 191
++ID+ A+ L A L++ +++ K +L RLSR +++LK++V E+ ++S+ +
Sbjct: 186 DIAIDLGAIGLAALLWNLDSQTKARKLGRLSRGAKIASLKVQVQDGGGAGERPVLSLADF 245
Query: 192 ---RGIA-RLVICSGPASYIMESFKRS--EPFTESLLERGVLVVSFT--TDG-NIPSFEF 242
RG+A R+VI +G + + + E L+ LVV DG + PS
Sbjct: 246 RRDRGMARRVVIFAGGEDVVQAALNTAGEAGMKERLIRNAFLVVPLVLPADGEDNPSVLG 305
Query: 243 EESEEMKEFTRKRKRLWQLTPVYVSEWS 270
E ++ + W P+ + EW
Sbjct: 306 LAQERLQGAQDEPHVGW---PLLLREWQ 330
>gi|159476322|ref|XP_001696260.1| hypothetical protein CHLREDRAFT_142189 [Chlamydomonas reinhardtii]
gi|158282485|gb|EDP08237.1| predicted protein [Chlamydomonas reinhardtii]
Length = 222
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 67/125 (53%), Gaps = 26/125 (20%)
Query: 105 VASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQL 164
V S LG LI+I QLIGAL A + ++ Q+
Sbjct: 8 VVSAGLGFLISIPQLIGALGGARSALPLEQV--------------------------EQI 41
Query: 165 ARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTESLLER 224
ARL+REE+L +L +R+ K++S+N+LRG AR+V+ +G + + + + +EP E L +R
Sbjct: 42 ARLTREETLGSLPIRLSSGKMLSLNDLRGTARVVLAAGTRAQVAAALREAEPLREELQKR 101
Query: 225 GVLVV 229
GVLVV
Sbjct: 102 GVLVV 106
>gi|298710249|emb|CBJ26324.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 345
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 79/153 (51%), Gaps = 7/153 (4%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R+R E SPFR VR+F + +FV S ++G L+A T+ I + + E+L +++DV
Sbjct: 85 RLRVEAESPFRKVRIFLFSSFVLSATVGLLVAGTRTIALTQGIDQGQSMTELLTNVAVDV 144
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGI-------AR 196
+A+ + L R+ A+ ++AR+ L+ LK+R+ K +S L + R
Sbjct: 145 TAIVVCVGLVKRDLDAQEGRMARMEIGAKLAGLKVRLQTKDDLSTVTLSALRRDRGRDKR 204
Query: 197 LVICSGPASYIMESFKRSEPFTESLLERGVLVV 229
+ + G A + S + + P+ +L +L+V
Sbjct: 205 VAVLVGGAEAVQSSLESALPYGAALETSDILIV 237
>gi|255086543|ref|XP_002509238.1| predicted protein [Micromonas sp. RCC299]
gi|226524516|gb|ACO70496.1| predicted protein [Micromonas sp. RCC299]
Length = 476
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 82/151 (54%), Gaps = 1/151 (0%)
Query: 81 SAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLS 140
S +++ E PFR +++ + A +G +I + +L A+ A ++ E +K L
Sbjct: 189 SVQKLKAEAQEPFRFFKLYAFGGLGAGAFIGLIIILARLSAAIKGGDDAPELGETIKNLL 248
Query: 141 IDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDE-KKIISVNNLRGIARLVI 199
I++ AVS FA+L+S E K + +++REE L L++ ++E K+ + V+ LRG RL I
Sbjct: 249 INIVAVSAFAYLFSNELKQREKTEFKVAREEELGRLRMAIEESKEDVLVSQLRGNYRLFI 308
Query: 200 CSGPASYIMESFKRSEPFTESLLERGVLVVS 230
+G +I + + + L E+ V++ +
Sbjct: 309 IAGSDEHIDNAMGQLVKYKAKLKEQNVVIAT 339
>gi|302845050|ref|XP_002954064.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300260563|gb|EFJ44781.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 1122
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 78/150 (52%)
Query: 80 SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGL 139
S+ +R E +PFR R+ + + S ++G +I + +L+ AL A D+ E L L
Sbjct: 164 STKVDLRTEAKAPFRLPRIILFGGLLGSAAIGLVIIVFRLVKALQGGPEAPDLNESLTNL 223
Query: 140 SIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVI 199
++ AV++ ++L R+ K + +REE + L++ + +++ + RG R VI
Sbjct: 224 GVNSVAVAVLSYLLYRDVSQKQEAVRITTREELMGRLQIDLGNDRVLPLLKFRGQVRPVI 283
Query: 200 CSGPASYIMESFKRSEPFTESLLERGVLVV 229
+G S++ + K ++ +L +R V V+
Sbjct: 284 VAGTRSFVERAIKEADSQYINLRDRAVSVI 313
>gi|15218638|ref|NP_171790.1| tetratricopeptide repeat (TPR)-containing protein [Arabidopsis
thaliana]
gi|6056411|gb|AAF02875.1|AC009525_9 Hypothetical protein [Arabidopsis thaliana]
gi|332189374|gb|AEE27495.1| tetratricopeptide repeat (TPR)-containing protein [Arabidopsis
thaliana]
Length = 453
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 80/143 (55%)
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
+EV +PFR VR FFY AF A+ + + +L+ A+ A ++ E +I++ +
Sbjct: 190 SEVRAPFRGVRKFFYFAFAAAAGISMFFTVPRLVQAIRGGDGAPNLLETTGNAAINIGGI 249
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
+ L+ ENK + Q+ +++R+E+LS L LR+ +++ + LR R VI +G
Sbjct: 250 VVMVSLFLWENKKEEEQMVQITRDETLSRLPLRLSTNRVVELVQLRDTVRPVILAGKKET 309
Query: 207 IMESFKRSEPFTESLLERGVLVV 229
+ + ++++ F LL RGVL+V
Sbjct: 310 VTLAMQKADRFRTELLRRGVLLV 332
>gi|119387903|gb|ABL73934.1| photosystem-II repair protein [Chlamydomonas reinhardtii]
Length = 449
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 84/150 (56%)
Query: 80 SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGL 139
+ A +R E +PFR R+ + +A LG ++ +T+L +L A D+ E L L
Sbjct: 162 QTKADLRTEAKAPFRLPRLILFGGLLAGAGLGLMVILTRLAKSLQGGPDAPDLNESLTNL 221
Query: 140 SIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVI 199
I+V+AV++ ++L R+ +K A+L +REE L L++ + + +++ + RG R VI
Sbjct: 222 GINVTAVAVLSWLVYRDVSSKQAELRITTREEGLGRLQVDLGKDRVLPLLKFRGQVRPVI 281
Query: 200 CSGPASYIMESFKRSEPFTESLLERGVLVV 229
+G S++ ++ K +E +L +R V VV
Sbjct: 282 VAGSRSFVEKAIKEAEGQYLNLRDRAVSVV 311
>gi|297848508|ref|XP_002892135.1| hypothetical protein ARALYDRAFT_887439 [Arabidopsis lyrata subsp.
lyrata]
gi|297337977|gb|EFH68394.1| hypothetical protein ARALYDRAFT_887439 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 79/143 (55%)
Query: 87 NEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAV 146
+EV +PFR R FFY AF A+ + + +LI A+ A D+ E +I++ +
Sbjct: 190 SEVRAPFRGFRKFFYFAFAAAAGISTFFTVPRLIQAIRGGDGAPDLLETTGNAAINIGGI 249
Query: 147 SLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASY 206
+ L+ ENK + Q+ +++R+E+LS L LR+ +++ + LR R VI +G
Sbjct: 250 VVLVSLFLWENKKEEEQMVQITRDETLSRLPLRLSTNRVVELVQLRDTVRPVILAGKKET 309
Query: 207 IMESFKRSEPFTESLLERGVLVV 229
+ + ++++ F LL RGVL+V
Sbjct: 310 VTLAMQKADRFRTELLRRGVLLV 332
>gi|384251485|gb|EIE24963.1| hypothetical protein COCSUDRAFT_83655 [Coccomyxa subellipsoidea
C-169]
Length = 435
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 79/154 (51%)
Query: 86 RNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSA 145
R E +PFR R+F + A +LG ++ +LI AL A ++ E L +I+ +A
Sbjct: 162 RAEAKAPFRLARLFIFGGLGAGAALGLVVIAGRLIAALKGGEGAPELTESLLNFTINAAA 221
Query: 146 VSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPAS 205
++ F++L+SR+ ++ + REE+L L ++V + + + + RG AR V+ +G +
Sbjct: 222 LAGFSWLFSRDYQSAEKDRRMVDREEALGRLLVQVGADREVPLASFRGAARPVLLAGSSG 281
Query: 206 YIMESFKRSEPFTESLLERGVLVVSFTTDGNIPS 239
YI + K SE L RG+ V+ P
Sbjct: 282 YISRALKASELHQTELRLRGISVIPLVLSSKDPG 315
>gi|449015970|dbj|BAM79372.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 371
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/202 (24%), Positives = 88/202 (43%), Gaps = 16/202 (7%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++R E +PFR R F Y VAS ++ +IA++ L+ + + + L IDV
Sbjct: 114 QLREEAEAPFRKARQFLYAGGVASATIASVIALSSLVALTGSATATESWWQAARNLGIDV 173
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVD--EKKIISVNNLRGIARLVICS 201
++ L A LY ++ A +L R+ + L+ L+L + + + +LRG R++I
Sbjct: 174 TSGVLCAALYRWDDVAAKRRLERMQQGARLAALQLESEPGPRWRFPLASLRGQYRVIITV 233
Query: 202 GPASYIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFT--------- 252
E ++EP +++L++R V+ V D + S E + +
Sbjct: 234 TSLEKAREWLAQAEPHSDALIQRSVVWVPVLWDAARQTPTVPSSAETRSASGVLDAWLVA 293
Query: 253 -----RKRKRLWQLTPVYVSEW 269
R+ W TP EW
Sbjct: 294 LDAARRRANAFWIATPANPQEW 315
>gi|303284455|ref|XP_003061518.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456848|gb|EEH54148.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 514
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 77/147 (52%), Gaps = 1/147 (0%)
Query: 85 IRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVS 144
+R E PFR +++ + A +G +I +++L AL A ++ E LK L+++
Sbjct: 218 LRAEAKEPFRFFKLYLFGGLGAGAFIGLIIILSRLAAALQGGDDAPELNETLKNLAVNAV 277
Query: 145 AVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEK-KIISVNNLRGIARLVICSGP 203
AV+ F FL E + +++REE + L++ VDE+ ++++ LRG R+ I +G
Sbjct: 278 AVTTFGFLLRGELAEREKTQEKVAREEEIGRLRVTVDERGDDVTLSALRGNYRVFIIAGS 337
Query: 204 ASYIMESFKRSEPFTESLLERGVLVVS 230
++ E+ + L E+ V++ +
Sbjct: 338 DEHVEETVAGLNKYKALLKEKDVVIAT 364
>gi|428171606|gb|EKX40521.1| hypothetical protein GUITHDRAFT_113307 [Guillardia theta CCMP2712]
Length = 332
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 93/202 (46%), Gaps = 17/202 (8%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R R E+ SPFR +R+F + A AS +LG IA T+L+ A++ D + L+ IDV
Sbjct: 70 RFREEIESPFRKIRLFLFPAAAASAALGAFIAGTRLL-AISTGVTGYDFDQTLQNFGIDV 128
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRV----DEKKIISVNNLRGIARLVI 199
+ FA LY + A++ L R++R LS L L++ + + + R R+ I
Sbjct: 129 LGATGFALLYWNDLNARDRDLVRIARSGKLSTLTLKIVSETGREASLPMKAFRKQRRIAI 188
Query: 200 CSGPASYIMESFKRSEPFTESLLERGVLVVSFTTD--GNIPSFEFE---------ESEEM 248
+G + + + + E L + G +V+ D G + E E E+ E
Sbjct: 189 IAGGKDVVQVAREMARAGAEDLTKVGAIVLPLQLDQVGELDPAEVEGLRRAVGVPEAAEG 248
Query: 249 KEFTRKRKRLWQL-TPVYVSEW 269
KE L L P++V +W
Sbjct: 249 KEEADSESLLPCLAAPLFVEKW 270
>gi|443323256|ref|ZP_21052264.1| Protein of unknown function (DUF3493) [Gloeocapsa sp. PCC 73106]
gi|442786994|gb|ELR96719.1| Protein of unknown function (DUF3493) [Gloeocapsa sp. PCC 73106]
Length = 82
Score = 57.8 bits (138), Expect = 5e-06, Method: Composition-based stats.
Identities = 32/85 (37%), Positives = 47/85 (55%), Gaps = 7/85 (8%)
Query: 72 TDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAAD 131
+ R PE AR++ E+ +P++ +R F Y+AF ASG LG + +TQL A D
Sbjct: 4 SSRIPPESEKYARLKAELKTPYKGLRKFVYIAFGASGLLGAFVFLTQL-------AAGQD 56
Query: 132 VPEILKGLSIDVSAVSLFAFLYSRE 156
V IL L+I + V+L +LY E
Sbjct: 57 VGTILPNLAIQIGVVALMIWLYRLE 81
>gi|219117770|ref|XP_002179674.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408727|gb|EEC48660.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 342
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/219 (22%), Positives = 91/219 (41%), Gaps = 39/219 (17%)
Query: 41 RTTPGLSISLPLHSTWTAKRSPLCSNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFF 100
R++P ++ PL + + P + ++ +E ++P+R++R+F
Sbjct: 38 RSSPNALLTTPLFAEKEKSKKPFDEGLRT--------------KLVSESIAPWRTLRLFL 83
Query: 101 YLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYS------ 154
Y A A +GGL+ ++ A++ D+ L+ID AV++FAFL
Sbjct: 84 YTALGAGAFIGGLVNLSGTAAAISGARSDVDMNTEYINLAIDFGAVAVFAFLTKFDLDKQ 143
Query: 155 ---RENKAKNAQLARLS---------REESLSNLKLRVD-----EKKIISVNNLRGIAR- 196
EN K +L + RE+ L+ L + + + K SV L+ AR
Sbjct: 144 KDLNENVEKKIELKKQQKQLVKGMKEREKQLATLNVEIQVSATGDTKQASVRQLQAGARQ 203
Query: 197 -LVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTTD 234
+++ +GP ++ + VLVV F TD
Sbjct: 204 HMIVVAGPKKATRDALIGANLLKMDFAMSNVLVVPFETD 242
>gi|411120457|ref|ZP_11392829.1| Protein of unknown function (DUF3493) [Oscillatoriales
cyanobacterium JSC-12]
gi|410709126|gb|EKQ66641.1| Protein of unknown function (DUF3493) [Oscillatoriales
cyanobacterium JSC-12]
Length = 95
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 47/81 (58%), Gaps = 7/81 (8%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
AR++ E ++P+R +R F Y AF ASG++GGL+ + QL+ +V E L ++
Sbjct: 22 ARLKAEAIAPYRGLRRFIYGAFAASGAIGGLVFLAQLLA-------GQNVQEALPNFALQ 74
Query: 143 VSAVSLFAFLYSRENKAKNAQ 163
+ V+L +L ENKA+ Q
Sbjct: 75 LGVVALMIWLIRLENKAERKQ 95
>gi|170077218|ref|YP_001733856.1| hypothetical protein SYNPCC7002_A0593 [Synechococcus sp. PCC 7002]
gi|169884887|gb|ACA98600.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
Length = 88
Score = 55.5 bits (132), Expect = 2e-05, Method: Composition-based stats.
Identities = 30/80 (37%), Positives = 44/80 (55%), Gaps = 7/80 (8%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
AR+R E +P+RS+R F YL F SG +GG+I + Q+ A D+ + + L++
Sbjct: 11 ARLRAEAKNPYRSLRQFIYLCFGLSGGIGGMIFLLQI-------AAGRDLDQAIPNLAVQ 63
Query: 143 VSAVSLFAFLYSRENKAKNA 162
V L FL+ ENK K A
Sbjct: 64 AGVVGLMIFLWRWENKNKLA 83
>gi|434399338|ref|YP_007133342.1| hypothetical protein Sta7437_2856 [Stanieria cyanosphaera PCC 7437]
gi|428270435|gb|AFZ36376.1| hypothetical protein Sta7437_2856 [Stanieria cyanosphaera PCC 7437]
Length = 96
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 52/103 (50%), Gaps = 9/103 (8%)
Query: 63 LCSNITCSATDRSDPEIS--SAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLI 120
+ + T T R ++S AR+R E +P+R +R FFY+AF ASG +G + + QL
Sbjct: 1 MSDSSTKKPTSRPKSKLSPEKYARLRAEAKAPYRGLRKFFYVAFGASGLIGAFVFLAQL- 59
Query: 121 GALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQ 163
A +V L L + V V+L +L+ E K +N Q
Sbjct: 60 ------AAGREVSSTLTNLGLQVGLVALVVWLFRWEEKQENKQ 96
>gi|298706793|emb|CBJ29716.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 334
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 72/146 (49%), Gaps = 9/146 (6%)
Query: 70 SATDRSDPEISSAA----RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALAN 125
S D D EI++ A +++ E +PFR+ R+F Y AF S +GG+ ++TQL L++
Sbjct: 33 SQADEEDDEIATPAFNIDKLKAETKTPFRTFRIFIYGAFGLSALIGGITSLTQLAATLSD 92
Query: 126 PARAADVPEILKGLSIDVSAVSLFAFLY----SRENKAKNAQLARLSREESLSNLKLRVD 181
A ++ ++L L +D ++ AF Y S++ + + A + + L K+ D
Sbjct: 93 QPGALELQKVLINLLVDFGVMAAAAFCYNFETSQQEDLEQVEAANREKRQKLKATKITGD 152
Query: 182 EKKIISVNNLRGIARLVICSGPASYI 207
V LR ++ V SG + +
Sbjct: 153 -VNADRVKQLRALSVKVPGSGSGAEL 177
>gi|425440324|ref|ZP_18820629.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389719275|emb|CCH96867.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 84
Score = 54.7 bits (130), Expect = 4e-05, Method: Composition-based stats.
Identities = 31/90 (34%), Positives = 48/90 (53%), Gaps = 9/90 (10%)
Query: 70 SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA 129
S + R DP+ AR++ E +P+R +R F YL ASG +G ++ + QL A A
Sbjct: 2 SKSPRLDPQ--KYARLKEEAANPYRGLRQFIYLGLGASGFIGAVVFLAQL-------AAA 52
Query: 130 ADVPEILKGLSIDVSAVSLFAFLYSRENKA 159
DV L ++ + V+L +L+ RE KA
Sbjct: 53 RDVATALPNFALQIGVVALMIWLFRREKKA 82
>gi|166364756|ref|YP_001657029.1| hypothetical protein MAE_20150 [Microcystis aeruginosa NIES-843]
gi|390438651|ref|ZP_10227099.1| conserved hypothetical protein [Microcystis sp. T1-4]
gi|166087129|dbj|BAG01837.1| hypothetical protein MAE_20150 [Microcystis aeruginosa NIES-843]
gi|389837940|emb|CCI31223.1| conserved hypothetical protein [Microcystis sp. T1-4]
Length = 84
Score = 53.9 bits (128), Expect = 6e-05, Method: Composition-based stats.
Identities = 31/90 (34%), Positives = 48/90 (53%), Gaps = 9/90 (10%)
Query: 70 SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA 129
S + R DP+ AR++ E +P+R +R F YL ASG +G ++ + QL A A
Sbjct: 2 SKSPRLDPQ--KYARLKAEAANPYRGLRQFIYLGLGASGFIGAVVFLAQL-------AAA 52
Query: 130 ADVPEILKGLSIDVSAVSLFAFLYSRENKA 159
DV L ++ + V+L +L+ RE KA
Sbjct: 53 RDVATALPNFALQIGVVALMIWLFRREKKA 82
>gi|425444905|ref|ZP_18824946.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
gi|425455682|ref|ZP_18835397.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
gi|389735246|emb|CCI01219.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
gi|389803405|emb|CCI17680.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
Length = 84
Score = 53.5 bits (127), Expect = 9e-05, Method: Composition-based stats.
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 9/90 (10%)
Query: 70 SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA 129
S + R DP+ AR++ E +P+R +R F YL ASG +G ++ + QL A
Sbjct: 2 SKSPRLDPQ--KYARLKEEAANPYRGLRQFLYLGLGASGFIGAVVFLAQL-------AAG 52
Query: 130 ADVPEILKGLSIDVSAVSLFAFLYSRENKA 159
DV L ++ + V+L +L+ RE KA
Sbjct: 53 RDVATALPNFALQIGVVALMIWLFRREKKA 82
>gi|223998698|ref|XP_002289022.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976130|gb|EED94458.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1250
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 46/171 (26%), Positives = 76/171 (44%), Gaps = 25/171 (14%)
Query: 88 EVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAVS 147
E ++PFR +R+FFY AF + +GGLI + + AL+ D+ L ID AV
Sbjct: 973 ETIAPFRGLRLFFYAAFASGAFVGGLITASGVAAALSGVRTDVDLNAEYLNLGIDFGAVL 1032
Query: 148 LFAFL------YSRENKAK------------NAQLARLSREESLSNLKLRV-----DEKK 184
LFA L E KA+ A RE+ + L+L V E +
Sbjct: 1033 LFAVLAKFDLDKGEELKAQVEQKIENKKKQKKITKAMREREKYFAELELDVRVSDDGETR 1092
Query: 185 IISVNNLRGIAR--LVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTT 233
SV ++ A+ +++ +GP I ++ + ++ + +LVV + T
Sbjct: 1093 RASVGTMQDKAKQHVILVAGPGRVIRDALRGAQLNKVNFAMTNILVVPYET 1143
>gi|425434849|ref|ZP_18815313.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|425448846|ref|ZP_18828690.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
gi|159026365|emb|CAO88916.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|389675573|emb|CCH95320.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|389767707|emb|CCI06979.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
Length = 84
Score = 53.5 bits (127), Expect = 9e-05, Method: Composition-based stats.
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 9/90 (10%)
Query: 70 SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA 129
S + R DP+ AR++ E +P+R +R F YL ASG +G ++ + QL A
Sbjct: 2 SKSPRLDPQ--KYARLKEEAANPYRGLRQFIYLGLGASGFIGAVVFLAQL-------AAG 52
Query: 130 ADVPEILKGLSIDVSAVSLFAFLYSRENKA 159
DV L ++ + V+L +L+ RE KA
Sbjct: 53 RDVATALPNFALQIGVVALMIWLFRREKKA 82
>gi|425469435|ref|ZP_18848372.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
gi|389880879|emb|CCI38493.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
Length = 84
Score = 53.5 bits (127), Expect = 1e-04, Method: Composition-based stats.
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 9/90 (10%)
Query: 70 SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA 129
S + R DP+ AR++ E +P+R +R F YL ASG +G ++ + QL A
Sbjct: 2 SKSTRLDPQ--KYARLKEEAANPYRGLRQFIYLGLGASGFIGAVVFLAQL-------AVG 52
Query: 130 ADVPEILKGLSIDVSAVSLFAFLYSRENKA 159
DV L ++ + V+L +L+ RE KA
Sbjct: 53 RDVATALPNFALQIGVVALMIWLFRREKKA 82
>gi|422302284|ref|ZP_16389647.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|425466926|ref|ZP_18846220.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389788535|emb|CCI15744.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|389830425|emb|CCI27635.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 84
Score = 52.8 bits (125), Expect = 1e-04, Method: Composition-based stats.
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 9/90 (10%)
Query: 70 SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA 129
S + R DP+ AR++ E +P+R +R F YL ASG +G ++ + QL A
Sbjct: 2 SKSPRLDPQ--KYARLKAEAANPYRGLRQFIYLGLGASGFIGAVVFLAQL-------AAG 52
Query: 130 ADVPEILKGLSIDVSAVSLFAFLYSRENKA 159
DV L ++ + V+L +L+ RE KA
Sbjct: 53 RDVATALPNFALQIGVVALMIWLFRREKKA 82
>gi|149391123|gb|ABR25579.1| tpr domain containing protein [Oryza sativa Indica Group]
Length = 192
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 45/72 (62%)
Query: 158 KAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIARLVICSGPASYIMESFKRSEPF 217
K + Q+ ++SR E+LS L +R+ +II + LR I R VI +G + I + +R+E +
Sbjct: 1 KKEEQQITQISRNETLSRLPVRLSTNRIIELVQLRDITRPVILAGSKASITRAMQRAERY 60
Query: 218 TESLLERGVLVV 229
LL+RGVL++
Sbjct: 61 RTELLKRGVLLI 72
>gi|307151324|ref|YP_003886708.1| hypothetical protein Cyan7822_1436 [Cyanothece sp. PCC 7822]
gi|306981552|gb|ADN13433.1| conserved hypothetical protein [Cyanothece sp. PCC 7822]
Length = 91
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 48/85 (56%), Gaps = 9/85 (10%)
Query: 76 DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
DPE AR++ E +P++ R+FFY AF ASG +G ++ + +L A A+V E
Sbjct: 16 DPE--KLARLKAEASAPYKGFRIFFYFAFGASGLIGAVVFLAKL-------AAGANVTEA 66
Query: 136 LKGLSIDVSAVSLFAFLYSRENKAK 160
L ++ V V+L +L+ E K+K
Sbjct: 67 LPNFALQVGLVALMIWLFRLEQKSK 91
>gi|428298237|ref|YP_007136543.1| hypothetical protein Cal6303_1524 [Calothrix sp. PCC 6303]
gi|428234781|gb|AFZ00571.1| hypothetical protein Cal6303_1524 [Calothrix sp. PCC 6303]
Length = 97
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 9/89 (10%)
Query: 65 SNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALA 124
SN + DPE AR++ E +PFRS+R FFY +F ASG +G I + Q +
Sbjct: 4 SNSQNRNPKKLDPE--KYARLKAEATAPFRSLRKFFYFSFGASGMIGAFIFLMQFLA--- 58
Query: 125 NPARAADVPEILKGLSIDVSAVSLFAFLY 153
+ + L+ L++ + V L FL+
Sbjct: 59 ----GRNTEDTLQNLALQIGVVFLMGFLW 83
>gi|219119559|ref|XP_002180537.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408010|gb|EEC47945.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 395
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 72/136 (52%), Gaps = 17/136 (12%)
Query: 80 SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA-----ADVP- 133
++ R+R EV SPFR+VR+FF+ + S A+T L +L N +A D P
Sbjct: 116 TAKQRLREEVESPFRTVRLFFFGSSTGS-------ALTALYFSLLNILKAKSGLYTDGPS 168
Query: 134 --EILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNL 191
+ L+ +I+V+AV + +L R+ KA LAR+++ +L+ L + + ++ +
Sbjct: 169 LDDALQSTAINVAAVLICGYLTYRDWKAGETNLARIAKGGALAQLVVNGPSPRTLA--SY 226
Query: 192 RGIARLVICSGPASYI 207
R AR++I G YI
Sbjct: 227 RRNARVLIAVGGDGYI 242
>gi|223992929|ref|XP_002286148.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977463|gb|EED95789.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 337
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 67/133 (50%), Gaps = 7/133 (5%)
Query: 80 SSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVP---EIL 136
++ R+R E+ SPFR VR+ F+ AS S+ + + L AN +DVP E L
Sbjct: 60 TTKQRLREEIESPFRKVRLAFFSFSGASASVA--LYFSALAALKANMGGFSDVPPLNEAL 117
Query: 137 KGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLR--VDEKKIISVNNLRGI 194
+ +I++ F L RE K A L R+++ L+ L + VD + S+ R
Sbjct: 118 ETCAINLGGAVGFGALAVRELKVGQANLERIAKGGLLARLIVESGVDGESKRSLKEYRRA 177
Query: 195 ARLVICSGPASYI 207
+R+VI +G A YI
Sbjct: 178 SRVVIAAGGADYI 190
>gi|254410762|ref|ZP_05024540.1| hypothetical protein MC7420_240 [Coleofasciculus chthonoplastes PCC
7420]
gi|196182117|gb|EDX77103.1| hypothetical protein MC7420_240 [Coleofasciculus chthonoplastes PCC
7420]
Length = 99
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 44/78 (56%), Gaps = 7/78 (8%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
AR++ E +P+R +R F Y+AF ASG +GGL+ + QLI +V L++
Sbjct: 26 ARLKAEAKAPYRGLRKFIYVAFAASGFIGGLVFLAQLIA-------GREVGTAFPNLALQ 78
Query: 143 VSAVSLFAFLYSRENKAK 160
V V+L +L+ E +A+
Sbjct: 79 VGIVALMIWLFRLEQRAE 96
>gi|218441759|ref|YP_002380088.1| hypothetical protein PCC7424_4863 [Cyanothece sp. PCC 7424]
gi|218174487|gb|ACK73220.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
Length = 91
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 46/84 (54%), Gaps = 9/84 (10%)
Query: 76 DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
DPE AR++ E +P++ RMFFY+AF ASG +G +I + QL+ +V
Sbjct: 16 DPE--KYARLKAEAKAPYKGFRMFFYIAFGASGFIGAVIFLAQLVA-------GRNVSSA 66
Query: 136 LKGLSIDVSAVSLFAFLYSRENKA 159
L ++ V V+L +L+ E K+
Sbjct: 67 LPNFALQVGLVALMIWLFRLEQKS 90
>gi|416394633|ref|ZP_11686216.1| hypothetical protein CWATWH0003_3017 [Crocosphaera watsonii WH
0003]
gi|357263229|gb|EHJ12263.1| hypothetical protein CWATWH0003_3017 [Crocosphaera watsonii WH
0003]
Length = 86
Score = 50.8 bits (120), Expect = 6e-04, Method: Composition-based stats.
Identities = 28/83 (33%), Positives = 45/83 (54%), Gaps = 9/83 (10%)
Query: 76 DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
DP+ +A +R E +P++ +R FFY+AF ASG +G L+ + QL A DV
Sbjct: 13 DPQKYAA--LRAEAKAPYKGLRKFFYIAFAASGFVGALVFLAQL-------AAGKDVTTA 63
Query: 136 LKGLSIDVSAVSLFAFLYSRENK 158
++ + V+L FL+ E+K
Sbjct: 64 FPNFALQIGVVALMIFLFRLESK 86
>gi|67925300|ref|ZP_00518658.1| hypothetical protein CwatDRAFT_1425 [Crocosphaera watsonii WH 8501]
gi|67852844|gb|EAM48245.1| hypothetical protein CwatDRAFT_1425 [Crocosphaera watsonii WH 8501]
Length = 86
Score = 50.8 bits (120), Expect = 6e-04, Method: Composition-based stats.
Identities = 28/83 (33%), Positives = 45/83 (54%), Gaps = 9/83 (10%)
Query: 76 DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
DP+ +A +R E +P++ +R FFY+AF ASG +G L+ + QL A DV
Sbjct: 13 DPQKYAA--LRAEAKAPYKGLRKFFYIAFAASGFVGALVFLAQL-------AAGKDVTTA 63
Query: 136 LKGLSIDVSAVSLFAFLYSRENK 158
++ + V+L FL+ E+K
Sbjct: 64 FPNFALQIGVVALMIFLFRLESK 86
>gi|425459047|ref|ZP_18838533.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
gi|389823287|emb|CCI28619.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
Length = 84
Score = 50.8 bits (120), Expect = 7e-04, Method: Composition-based stats.
Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 7/77 (9%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
AR++ E +P+R +R F YL ASG +G ++ + QL A DV L ++
Sbjct: 13 ARLKEEAANPYRGLRQFIYLGLGASGFIGAVVFLAQL-------AAGRDVATALPNFALQ 65
Query: 143 VSAVSLFAFLYSRENKA 159
+ V+L +L+ RE KA
Sbjct: 66 IGVVALMIWLFRREKKA 82
>gi|186685813|ref|YP_001869009.1| hypothetical protein Npun_F5768 [Nostoc punctiforme PCC 73102]
gi|186468265|gb|ACC84066.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 97
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 48/92 (52%), Gaps = 9/92 (9%)
Query: 67 ITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANP 126
+ + +R +PE A ++ E+ +P+R +R F Y+AF ASGS+G + Q++
Sbjct: 2 VDQNLKNRLNPE--QYASLKAEIAAPYRGLRQFIYIAFGASGSIGAFVFFFQVLA----- 54
Query: 127 ARAADVPEILKGLSIDVSAVSLFAFLYSRENK 158
DV L L++ V V+L FL+ E +
Sbjct: 55 --GRDVESALPSLALQVGIVALMVFLWRWEQR 84
>gi|427724820|ref|YP_007072097.1| hypothetical protein Lepto7376_3022 [Leptolyngbya sp. PCC 7376]
gi|427356540|gb|AFY39263.1| hypothetical protein Lepto7376_3022 [Leptolyngbya sp. PCC 7376]
Length = 85
Score = 50.4 bits (119), Expect = 8e-04, Method: Composition-based stats.
Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 7/77 (9%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R+R E +P+R++R F Y+ F SGS+GGL+ I Q++ D+ + I
Sbjct: 12 RLRTEAKNPYRTLRQFVYVCFGISGSVGGLVFIGQILA-------GRDLANAIPNFGIQA 64
Query: 144 SAVSLFAFLYSRENKAK 160
V L FL+ E+K K
Sbjct: 65 GVVGLMIFLWRWEDKNK 81
>gi|428184948|gb|EKX53802.1| hypothetical protein GUITHDRAFT_160931, partial [Guillardia theta
CCMP2712]
Length = 244
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 61/128 (47%), Gaps = 4/128 (3%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++ E SP R R FFY A S SLG LI+ ++ AL + E +I++
Sbjct: 109 KLLEEQASPLRKFRQFFYAAACISASLGTLISGARIFAALQGIQGVQPLSESSTNTAINL 168
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSRE----ESLSNLKLRVDEKKIISVNNLRGIARLVI 199
V A L+ E+ L L + E L+NL L + + + ++ ++R +RLVI
Sbjct: 169 GVVVTSAALWIWEDGKGKETLKLLQDKSGMREKLANLNLELADGTLKTMRDMRDESRLVI 228
Query: 200 CSGPASYI 207
+G A+ +
Sbjct: 229 LAGTAAEV 236
>gi|119494064|ref|ZP_01624616.1| hypothetical protein L8106_04221 [Lyngbya sp. PCC 8106]
gi|119452201|gb|EAW33403.1| hypothetical protein L8106_04221 [Lyngbya sp. PCC 8106]
Length = 98
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 7/77 (9%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
AR++ E+ +P+R +R F Y+ F ASG +G L+ + Q++ +V L L++
Sbjct: 21 ARLKAELKTPYRGLRQFIYVGFGASGFIGALVFLAQVLA-------GREVASALPNLALQ 73
Query: 143 VSAVSLFAFLYSRENKA 159
V V+L +L+ EN+A
Sbjct: 74 VGVVALMVWLFRLENRA 90
>gi|332706278|ref|ZP_08426346.1| hypothetical protein LYNGBM3L_16170 [Moorea producens 3L]
gi|332354983|gb|EGJ34455.1| hypothetical protein LYNGBM3L_16170 [Moorea producens 3L]
Length = 99
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 48/88 (54%), Gaps = 9/88 (10%)
Query: 73 DRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADV 132
+ +PE AR++ E P+R +R F Y+AF ASG +G ++ + QL A A A +
Sbjct: 18 QKINPE--KYARLKAEAQHPYRGLRKFIYIAFGASGFIGAMVFLAQLA---AGKAVATAM 72
Query: 133 PEILKGLSIDVSAVSLFAFLYSRENKAK 160
P L++ + V L +L+ ENKA+
Sbjct: 73 PN----LALQIGVVMLMVWLFRLENKAE 96
>gi|440755472|ref|ZP_20934674.1| putative membrane protein [Microcystis aeruginosa TAIHU98]
gi|440175678|gb|ELP55047.1| putative membrane protein [Microcystis aeruginosa TAIHU98]
Length = 84
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 7/77 (9%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
AR++ E +P+R +R F YL ASG +G ++ + QL A DV L ++
Sbjct: 13 ARLKEEAANPYRGLRQFIYLGLGASGFIGAVVFLAQL-------AAGRDVATALPNFALQ 65
Query: 143 VSAVSLFAFLYSRENKA 159
+ V+L +L+ RE KA
Sbjct: 66 IGVVALMIWLFRREKKA 82
>gi|282896663|ref|ZP_06304671.1| hypothetical protein CRD_01368 [Raphidiopsis brookii D9]
gi|281198381|gb|EFA73269.1| hypothetical protein CRD_01368 [Raphidiopsis brookii D9]
Length = 79
Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats.
Identities = 25/85 (29%), Positives = 44/85 (51%), Gaps = 9/85 (10%)
Query: 76 DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
DP I + R++ E+ SP++ +R F Y+ ASG +G + Q++ D+ +
Sbjct: 3 DPNIKN--RLKAEIASPYKGLRQFVYIGMGASGLIGAFVFFFQILA-------GRDIDQA 53
Query: 136 LKGLSIDVSAVSLFAFLYSRENKAK 160
L ++ V V++ AFL+ E K K
Sbjct: 54 LPNFALQVGIVAIMAFLWKWERKGK 78
>gi|387202097|gb|AFJ68933.1| hypothetical protein NGATSA_2062300, partial [Nannochloropsis
gaditana CCMP526]
Length = 213
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 49/101 (48%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++R+E +PFRS+R+F Y SLGG A+TQL+ +L + A + + + + +D
Sbjct: 105 KLRSEATTPFRSLRLFIYGGIGVGASLGGYTAVTQLVQSLQHRPDALPLSQSILNVGVDF 164
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKK 184
+ A + EN+ K + + +L D KK
Sbjct: 165 GVAAAAAGAWVFENRLKQVTDKEMQALRAKEPFRLTFDMKK 205
>gi|428210032|ref|YP_007094385.1| hypothetical protein Chro_5144 [Chroococcidiopsis thermalis PCC
7203]
gi|428011953|gb|AFY90516.1| hypothetical protein Chro_5144 [Chroococcidiopsis thermalis PCC
7203]
Length = 92
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 9/92 (9%)
Query: 67 ITCSATDRSDPEISSA--ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALA 124
I + +R D ++S AR+R E +P+R +R F Y +F ASG +G +I + QL
Sbjct: 2 IEQNPKNRGDKRLTSEQYARLRAEAAAPYRGLRKFIYFSFGASGLIGAVIFLAQL----- 56
Query: 125 NPARAADVPEILKGLSIDVSAVSLFAFLYSRE 156
A +V L ++ V V+L FL+ E
Sbjct: 57 --AAGQNVSNALPNFALQVGIVALMIFLFRLE 86
>gi|428774687|ref|YP_007166474.1| hypothetical protein PCC7418_0001 [Halothece sp. PCC 7418]
gi|428688966|gb|AFZ42260.1| hypothetical protein PCC7418_0001 [Halothece sp. PCC 7418]
Length = 90
Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats.
Identities = 24/78 (30%), Positives = 44/78 (56%), Gaps = 7/78 (8%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R++ E +P+R +R Y+AF ASG +G + + QL A +V E L L++ V
Sbjct: 19 RLQAEAKAPYRGLRKTIYVAFAASGLIGAFVMVAQL-------AAGQNVTETLPNLALQV 71
Query: 144 SAVSLFAFLYSRENKAKN 161
V+L +L+ ++ +A++
Sbjct: 72 GVVALMVWLFRQDTEAED 89
>gi|218247693|ref|YP_002373064.1| hypothetical protein PCC8801_2911 [Cyanothece sp. PCC 8801]
gi|257060970|ref|YP_003138858.1| hypothetical protein Cyan8802_3185 [Cyanothece sp. PCC 8802]
gi|218168171|gb|ACK66908.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
gi|256591136|gb|ACV02023.1| hypothetical protein Cyan8802_3185 [Cyanothece sp. PCC 8802]
Length = 91
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 47/95 (49%), Gaps = 9/95 (9%)
Query: 65 SNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALA 124
SN + DPE AR++ E +P++ +R FFY+AF ASG +G L+ +L
Sbjct: 6 SNKPRPPLSQQDPE--KYARLKAEAQAPYKVLRQFFYIAFGASGFIGALVFFAKL----- 58
Query: 125 NPARAADVPEILKGLSIDVSAVSLFAFLYSRENKA 159
A E L ++ V V+L +L+ E K+
Sbjct: 59 --ASGRGGVEALPNFALQVGVVALMVWLFRLEQKS 91
>gi|428305277|ref|YP_007142102.1| hypothetical protein Cri9333_1705 [Crinalium epipsammum PCC 9333]
gi|428246812|gb|AFZ12592.1| hypothetical protein Cri9333_1705 [Crinalium epipsammum PCC 9333]
Length = 88
Score = 47.4 bits (111), Expect = 0.007, Method: Composition-based stats.
Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 74 RSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVP 133
+++PE AR++ E+ SP++ +R YLA AS +GG+I I Q+ A V
Sbjct: 9 KTNPE--QYARLKAEIASPYKGLRQVIYLACAASSFIGGMIFIVQI-------AAGRQVE 59
Query: 134 EILKGLSIDVSAVSLFAFLYSRENK 158
L L++ + ++L FL+ E K
Sbjct: 60 SALPNLAVQIGVLALMIFLFRLEQK 84
>gi|428225951|ref|YP_007110048.1| hypothetical protein GEI7407_2521 [Geitlerinema sp. PCC 7407]
gi|427985852|gb|AFY66996.1| hypothetical protein GEI7407_2521 [Geitlerinema sp. PCC 7407]
Length = 101
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 46/88 (52%), Gaps = 10/88 (11%)
Query: 76 DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
DPE + R+R E +P++ +R F Y+AF ASG +G I +TQ++ DV
Sbjct: 17 DPE--AFERLRAEAAAPYKGLRQFIYVAFAASGGIGSFIFLTQILA-------GRDVSSA 67
Query: 136 LKGLSIDVSAVSLFAFLYSRENKA-KNA 162
L ++ V V+L L E ++ KNA
Sbjct: 68 LPNFALQVGVVALMIALLRWERRSPKNA 95
>gi|209525694|ref|ZP_03274231.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|376003116|ref|ZP_09780931.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423067100|ref|ZP_17055890.1| hypothetical protein SPLC1_S531800 [Arthrospira platensis C1]
gi|209493863|gb|EDZ94181.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|375328441|emb|CCE16684.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406711386|gb|EKD06587.1| hypothetical protein SPLC1_S531800 [Arthrospira platensis C1]
Length = 93
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 47/88 (53%), Gaps = 9/88 (10%)
Query: 76 DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
DPE+ R++ E +P+R +R F Y++F ASG +G ++ + QL+ DV
Sbjct: 15 DPELYE--RLKAESKAPYRGLRQFIYVSFAASGLIGAVVFLAQLLS-------GRDVATA 65
Query: 136 LKGLSIDVSAVSLFAFLYSRENKAKNAQ 163
L ++ V V+L +L+ E ++ ++
Sbjct: 66 LPNFALQVGVVALMVWLFRLEQRSPKSK 93
>gi|218197863|gb|EEC80290.1| hypothetical protein OsI_22302 [Oryza sativa Indica Group]
Length = 120
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 35/62 (56%), Gaps = 6/62 (9%)
Query: 206 YIMESFKRSEPFTESLLERGVLVVSFTTDGNIPSFEFEESEEMKEFTRKRKRLWQLTPVY 265
++ ESF RS+P L+E GVL V F+TDGN P + + +K+ + +PVY
Sbjct: 10 FVTESFHRSKPLLTDLMELGVLAVPFSTDGNTPDLRLLDGQ------KKQANVSPDSPVY 63
Query: 266 VS 267
+S
Sbjct: 64 LS 65
>gi|172036842|ref|YP_001803343.1| hypothetical protein cce_1927 [Cyanothece sp. ATCC 51142]
gi|354554648|ref|ZP_08973952.1| Protein of unknown function DUF3493 [Cyanothece sp. ATCC 51472]
gi|171698296|gb|ACB51277.1| conserved hypothetical protein [Cyanothece sp. ATCC 51142]
gi|353553457|gb|EHC22849.1| Protein of unknown function DUF3493 [Cyanothece sp. ATCC 51472]
Length = 89
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 45/83 (54%), Gaps = 9/83 (10%)
Query: 76 DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
DPE A ++ E +P++ +R FFY+AF ASG +G ++ + +L A +V
Sbjct: 16 DPE--KYASLKAEAEAPYKGLRKFFYIAFGASGFIGAVVFLAKL-------AAGKEVAST 66
Query: 136 LKGLSIDVSAVSLFAFLYSRENK 158
L ++ V V+L FL+ E+K
Sbjct: 67 LPNFALQVGVVALMIFLFRLESK 89
>gi|409992301|ref|ZP_11275500.1| hypothetical protein APPUASWS_14543 [Arthrospira platensis str.
Paraca]
gi|291566639|dbj|BAI88911.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409936855|gb|EKN78320.1| hypothetical protein APPUASWS_14543 [Arthrospira platensis str.
Paraca]
Length = 93
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 47/88 (53%), Gaps = 9/88 (10%)
Query: 76 DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
DPE+ R++ E +P+R +R F Y++F ASG +G ++ + QL+ DV
Sbjct: 15 DPELYE--RLKAESKAPYRGLRQFIYVSFAASGLIGAVVFLAQLLS-------GRDVGTA 65
Query: 136 LKGLSIDVSAVSLFAFLYSRENKAKNAQ 163
L ++ V V+L +L+ E ++ ++
Sbjct: 66 LPNFALQVGVVALMVWLFRLEQRSPKSK 93
>gi|126658399|ref|ZP_01729548.1| hypothetical protein CY0110_27605 [Cyanothece sp. CCY0110]
gi|126620331|gb|EAZ91051.1| hypothetical protein CY0110_27605 [Cyanothece sp. CCY0110]
Length = 89
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 44/83 (53%), Gaps = 9/83 (10%)
Query: 76 DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
DPE A ++ E +P++ R FFY+AF ASG +G ++ + +L A +V
Sbjct: 16 DPE--KYASLKAEAQAPYKGFRKFFYIAFAASGFIGAVVFLAKL-------AAGKEVATA 66
Query: 136 LKGLSIDVSAVSLFAFLYSRENK 158
L ++ + V+L FL+ E+K
Sbjct: 67 LPNFALQIGVVALMIFLFRLESK 89
>gi|427734970|ref|YP_007054514.1| hypothetical protein Riv7116_1404 [Rivularia sp. PCC 7116]
gi|427370011|gb|AFY53967.1| Protein of unknown function (DUF3493) [Rivularia sp. PCC 7116]
Length = 94
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 39/71 (54%), Gaps = 6/71 (8%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
AR++ E SP+R +R F YL+F ASG +G I + QL+ A + L L++
Sbjct: 21 ARLKAEAASPYRGLRKFVYLSFGASGLIGAFIFVLQLL------AHKTSLEVGLPSLALQ 74
Query: 143 VSAVSLFAFLY 153
V V+L FL+
Sbjct: 75 VGVVTLMIFLW 85
>gi|323451949|gb|EGB07824.1| hypothetical protein AURANDRAFT_71677 [Aureococcus anophagefferens]
Length = 780
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 61/116 (52%), Gaps = 6/116 (5%)
Query: 79 ISSA--ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEIL 136
IS+A +++ E +P+R+VR FFY AF S ++GG+ A+ Q ++A A +P
Sbjct: 46 ISAAQKSKLLQEAQTPWRTVRTFFYGAFGLSATVGGITALAQTAASVAGQPDA--LPLSQ 103
Query: 137 KGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLR 192
G+++ V A + A LY N N + A ++ + + L V E + +V LR
Sbjct: 104 CGVNVLVDAGVVAACLYG--NSVDNGEAADITVADETAPLADDVAEARRDAVAKLR 157
>gi|452825495|gb|EME32491.1| hypothetical protein Gasu_02660 [Galdieria sulphuraria]
Length = 295
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 31/129 (24%), Positives = 63/129 (48%), Gaps = 9/129 (6%)
Query: 67 ITCSATDRSDPEISSAA---------RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAIT 117
+ +A+D DP++ S++ R++ E +PFR RMF Y S +G LI+
Sbjct: 45 MKTNASDSDDPKVYSSSESTSPSMFLRLQEEAQAPFRKFRMFIYGGAFVSSVIGFLISSL 104
Query: 118 QLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLK 177
QL+ + + + E + + I+ ++ + LY + +A +L RLS S+ L
Sbjct: 105 QLLSGILGTTTSYPIRESVYNMLINGIVIATASLLYFFDYQAGKQRLDRLSVLSSIRMLP 164
Query: 178 LRVDEKKII 186
+ ++++ +
Sbjct: 165 VEYNQRQYL 173
>gi|397642957|gb|EJK75565.1| hypothetical protein THAOC_02707 [Thalassiosira oceanica]
Length = 354
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 50/222 (22%), Positives = 95/222 (42%), Gaps = 31/222 (13%)
Query: 40 SRTTPGLSISLPLHSTWTAKRSPLCSNITCSATDRS---DPEISSAARIRNEVLSPFRSV 96
SR L++++P +T + +R P + + RS D +S+ ++ E ++P+R +
Sbjct: 26 SRRRLSLNLNVPTAATASPRRRP-TTELNQEDKKRSGGFDKGVSN--KLLEETVAPWRGL 82
Query: 97 RMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFL---- 152
R+F Y A + + G+ + +I A+ D+ L+I+ AV LFA L
Sbjct: 83 RLFLYGAAASGAFVSGMFTGSGVIAAMGGVRDDVDLNAEYVNLAINFGAVLLFAVLAKLD 142
Query: 153 --------------YSRENKAKNAQLARLSREESLSNLKL--RVDE-----KKIISVNNL 191
R+ K KN + RE L +L L RV E + + V
Sbjct: 143 LDKGAELSASIEEKVERKKKIKNMTKSMRERERQLRDLSLNVRVTEDGELREAPVGVMQD 202
Query: 192 RGIARLVICSGPASYIMESFKRSEPFTESLLERGVLVVSFTT 233
+ +++ +GP I ++ + ++ + +LVV + T
Sbjct: 203 KAKQHIILVAGPGRAIRDALRGAQLNKVNFAMTNILVVPYET 244
>gi|428203982|ref|YP_007082571.1| hypothetical protein Ple7327_3856 [Pleurocapsa sp. PCC 7327]
gi|427981414|gb|AFY79014.1| Protein of unknown function (DUF3493) [Pleurocapsa sp. PCC 7327]
Length = 95
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 46/92 (50%), Gaps = 10/92 (10%)
Query: 65 SNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALA 124
S T S R DP+ AR+ E +P++ +R F YLAF +SG +G L+ + QL
Sbjct: 11 SKATNSLKQR-DPQ--KYARLMAEAQAPYKGLRAFIYLAFGSSGLIGALVFLAQL----- 62
Query: 125 NPARAADVPEILKGLSIDVSAVSLFAFLYSRE 156
A DV L ++ V V+L +L+ E
Sbjct: 63 --AAGKDVTSALPNFALQVGLVALMVWLFRWE 92
>gi|434388883|ref|YP_007099494.1| Protein of unknown function (DUF3493) [Chamaesiphon minutus PCC
6605]
gi|428019873|gb|AFY95967.1| Protein of unknown function (DUF3493) [Chamaesiphon minutus PCC
6605]
Length = 89
Score = 46.2 bits (108), Expect = 0.014, Method: Composition-based stats.
Identities = 30/89 (33%), Positives = 48/89 (53%), Gaps = 10/89 (11%)
Query: 73 DRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADV 132
D DPE+ R++ E SPFR +R F Y++ SG++GGL+ +L+ ++
Sbjct: 11 DSIDPELWR--RLQAEAKSPFRGLRQFVYVSCAISGAVGGLVFFFKLLA-------GREL 61
Query: 133 PEILKGLSIDVSAVSLFAFLYSRENKAKN 161
+ L+I V V+L FL R +KAK+
Sbjct: 62 ETTIPNLAIQVGVVALMVFLL-RIDKAKD 89
>gi|434392334|ref|YP_007127281.1| hypothetical protein Glo7428_1560 [Gloeocapsa sp. PCC 7428]
gi|428264175|gb|AFZ30121.1| hypothetical protein Glo7428_1560 [Gloeocapsa sp. PCC 7428]
Length = 98
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 26/102 (25%), Positives = 53/102 (51%), Gaps = 9/102 (8%)
Query: 61 SPLCSNITCSATDRSDPEIS--SAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQ 118
+ S + ++ +RS+ + AR++ E +P+R +R F Y++F ASG +G L+ + Q
Sbjct: 4 AAFFSMVKPTSKNRSNKRFTPEQYARLKAEAAAPYRGLRKFIYISFGASGFIGALVFLAQ 63
Query: 119 LIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAK 160
L A DV L ++ + ++L FL+ + +++
Sbjct: 64 L-------AAGRDVETALPNFALQLGVIALMVFLFRWDRESR 98
>gi|56752086|ref|YP_172787.1| hypothetical protein syc2077_c [Synechococcus elongatus PCC 6301]
gi|81300827|ref|YP_401035.1| hypothetical protein Synpcc7942_2018 [Synechococcus elongatus PCC
7942]
gi|56687045|dbj|BAD80267.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169708|gb|ABB58048.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 81
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 46/83 (55%), Gaps = 9/83 (10%)
Query: 76 DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
DP+ AR++ EV++P+R +R FY F ASG++GG I L+ A+A + A +P
Sbjct: 5 DPQ--QRARLQAEVVAPYRGLRRTFYAVFAASGAIGGFIF---LLKAIAGESLATTLPN- 58
Query: 136 LKGLSIDVSAVSLFAFLYSRENK 158
L++ + V+L L E K
Sbjct: 59 ---LALQLGVVALMVSLLRLEQK 78
>gi|428772409|ref|YP_007164197.1| hypothetical protein Cyast_0570 [Cyanobacterium stanieri PCC 7202]
gi|428686688|gb|AFZ46548.1| hypothetical protein Cyast_0570 [Cyanobacterium stanieri PCC 7202]
Length = 95
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 9/91 (9%)
Query: 68 TCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPA 127
S ++DPE A ++ E +P+R +R F Y+ F ASG++G I Q++
Sbjct: 14 NGSNLKQTDPE--KYAYLKAEAQAPYRGLRKFIYIGFGASGAIGAFIFFAQILA------ 65
Query: 128 RAADVPEILKGLSIDVSAVSLFAFLYSRENK 158
+V + + L + + + L FL+ E K
Sbjct: 66 -GKNVQDNISNLLVQLGVIGLMIFLFRLEKK 95
>gi|443311291|ref|ZP_21040921.1| Protein of unknown function (DUF3493) [Synechocystis sp. PCC 7509]
gi|442778614|gb|ELR88877.1| Protein of unknown function (DUF3493) [Synechocystis sp. PCC 7509]
Length = 91
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 40/74 (54%), Gaps = 5/74 (6%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R++ E +P+R +R F Y+AF ASG +G LI + Q+ +D P L I V
Sbjct: 17 RLKAEAAAPYRGLRRFIYIAFGASGFVGALIFLAQIAA-----GNHSDAPLSNLALQIGV 71
Query: 144 SAVSLFAFLYSREN 157
A+ +F F + +E+
Sbjct: 72 VAIMIFLFRWDKES 85
>gi|334118614|ref|ZP_08492703.1| hypothetical protein MicvaDRAFT_3339 [Microcoleus vaginatus FGP-2]
gi|333459621|gb|EGK88234.1| hypothetical protein MicvaDRAFT_3339 [Microcoleus vaginatus FGP-2]
Length = 102
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 7/77 (9%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R+ E+ +P+R +R F YLA ASG +GGLI + ++ A ++ L ++ V
Sbjct: 26 RLTAEMATPYRPLRQFVYLACGASGFIGGLIFLAKI-------ASGREIGSALPNFALQV 78
Query: 144 SAVSLFAFLYSRENKAK 160
V+L FL+ E +A+
Sbjct: 79 GVVALMVFLFRWEQRAE 95
>gi|428312657|ref|YP_007123634.1| hypothetical protein Mic7113_4545 [Microcoleus sp. PCC 7113]
gi|428254269|gb|AFZ20228.1| Protein of unknown function (DUF3493) [Microcoleus sp. PCC 7113]
Length = 99
Score = 44.7 bits (104), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 7/78 (8%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
A++R E +P+R +R F YLAF ASG +G + + QL A +V L ++
Sbjct: 26 AKLRAEAKAPYRVLRQFIYLAFGASGFIGAFVFLGQL-------AAGHEVATALPNFALQ 78
Query: 143 VSAVSLFAFLYSRENKAK 160
+ V+L +L+ + +A+
Sbjct: 79 IGVVALMVWLFRVDRQAE 96
>gi|300865884|ref|ZP_07110626.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300336124|emb|CBN55784.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 112
Score = 44.7 bits (104), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 46/79 (58%), Gaps = 7/79 (8%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R++ E+++P+R++R F Y+A ASG +GGL+ + QL+ +V + ++ V
Sbjct: 37 RLKAEMVAPYRTLRQFVYVACGASGFIGGLVFLGQLM-------VGREVGSAIPNFALQV 89
Query: 144 SAVSLFAFLYSRENKAKNA 162
V+L +L+ E +++ +
Sbjct: 90 GVVALMVWLFRLEQRSRRS 108
>gi|428770088|ref|YP_007161878.1| hypothetical protein Cyan10605_1731 [Cyanobacterium aponinum PCC
10605]
gi|428684367|gb|AFZ53834.1| hypothetical protein Cyan10605_1731 [Cyanobacterium aponinum PCC
10605]
Length = 90
Score = 44.3 bits (103), Expect = 0.060, Method: Composition-based stats.
Identities = 21/78 (26%), Positives = 41/78 (52%), Gaps = 7/78 (8%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
A ++ E +P++ +R F YL ASG++G I QL+ +V + + L +
Sbjct: 19 AYLKAEAQAPYKGLRKFIYLGLGASGAIGAFIFFAQLLA-------GKNVSDNIYNLFVQ 71
Query: 143 VSAVSLFAFLYSRENKAK 160
++ +++ FLY E++ K
Sbjct: 72 ITVIAIMVFLYRWEDRKK 89
>gi|254423312|ref|ZP_05037030.1| hypothetical protein S7335_3468 [Synechococcus sp. PCC 7335]
gi|196190801|gb|EDX85765.1| hypothetical protein S7335_3468 [Synechococcus sp. PCC 7335]
Length = 123
Score = 43.5 bits (101), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 46/88 (52%), Gaps = 10/88 (11%)
Query: 74 RSDPEISSAARIRN---EVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAA 130
++ P+ S AR E +P++ +R F Y AF ASG++GG + T+++
Sbjct: 38 KTPPKGMSQARYDKLVAESQAPYKGLRKFMYAAFGASGAVGGFVFFTKILAGR------- 90
Query: 131 DVPEILKGLSIDVSAVSLFAFLYSRENK 158
D+ L L++ + V+L FL+ E++
Sbjct: 91 DLAASLPNLALQLGVVALMVFLFRWEDR 118
>gi|37523365|ref|NP_926742.1| hypothetical protein gsr3796 [Gloeobacter violaceus PCC 7421]
gi|35214369|dbj|BAC91737.1| gsr3796 [Gloeobacter violaceus PCC 7421]
Length = 98
Score = 43.5 bits (101), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 46/90 (51%), Gaps = 7/90 (7%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
A +R E P+R +R+ YLAF ASG +G +A +++ ++P L L++
Sbjct: 11 ASLRVESRHPWRKLRLGLYLAFGASGVVGFFVAFFRVLA-------GRELPVSLTNLAVQ 63
Query: 143 VSAVSLFAFLYSRENKAKNAQLARLSREES 172
++L FL+ E + +NA R +E+
Sbjct: 64 SGVIALMVFLWRLETRKENALFERYLAQET 93
>gi|428320326|ref|YP_007118208.1| hypothetical protein Osc7112_5569 [Oscillatoria nigro-viridis PCC
7112]
gi|428244006|gb|AFZ09792.1| hypothetical protein Osc7112_5569 [Oscillatoria nigro-viridis PCC
7112]
Length = 102
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 42/78 (53%), Gaps = 7/78 (8%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R+ E+ +P+R +R F Y+A ASG +GGL+ + ++ A ++ L ++ V
Sbjct: 26 RLTAEMSTPYRPLRQFVYVACGASGFIGGLVFLAKI-------ASGREIGSALPNFALQV 78
Query: 144 SAVSLFAFLYSRENKAKN 161
V+L FL+ E +A+
Sbjct: 79 GVVALMVFLFRWEQRAEG 96
>gi|282901335|ref|ZP_06309260.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281193614|gb|EFA68586.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 77
Score = 43.1 bits (100), Expect = 0.12, Method: Composition-based stats.
Identities = 24/85 (28%), Positives = 43/85 (50%), Gaps = 11/85 (12%)
Query: 76 DPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEI 135
DP I + R++ E+ SP++ +R F Y+ ASG +G + Q++ A +
Sbjct: 3 DPNIKN--RLKAEIASPYKGLRQFIYIGMGASGIIGAFVFFFQIL---------AGRNQA 51
Query: 136 LKGLSIDVSAVSLFAFLYSRENKAK 160
L ++ V V++ A L+ E K+K
Sbjct: 52 LPNFALQVGIVAIMALLWKWERKSK 76
>gi|427415605|ref|ZP_18905788.1| Protein of unknown function (DUF3493) [Leptolyngbya sp. PCC 7375]
gi|425758318|gb|EKU99170.1| Protein of unknown function (DUF3493) [Leptolyngbya sp. PCC 7375]
Length = 97
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 7/77 (9%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R+ E SPF+ +R Y A ASG LG + +TQL+ DV L L++ +
Sbjct: 25 RLVQETQSPFKGMRKVIYGAVGASGLLGAFVFLTQLMA-------GKDVETALPNLAVQM 77
Query: 144 SAVSLFAFLYSRENKAK 160
+ + FL+ E + K
Sbjct: 78 GVIGIVVFLFWLEGRGK 94
>gi|443647614|ref|ZP_21129750.1| hypothetical protein C789_290 [Microcystis aeruginosa DIANCHI905]
gi|443335468|gb|ELS49939.1| hypothetical protein C789_290 [Microcystis aeruginosa DIANCHI905]
Length = 59
Score = 42.7 bits (99), Expect = 0.17, Method: Composition-based stats.
Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 70 SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGA 122
S + R DP+ AR++ E +P+R +R F YL ASG +G ++ + QL G
Sbjct: 2 SKSPRLDPQ--KYARLKEEAANPYRGLRQFIYLGLGASGFIGAVVFLAQLAGG 52
>gi|219111561|ref|XP_002177532.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217412067|gb|EEC51995.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 321
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 72/165 (43%), Gaps = 22/165 (13%)
Query: 85 IRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDVS 144
+R E SPFR VR FFY + +G+L L + A + E + +DV+
Sbjct: 68 LREEAESPFRKVRFFFYYSL-GAGALTSLAVSASRVAAALAGINTDLLQESAINVGVDVA 126
Query: 145 AVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKII------------------ 186
+ + + ++ +A++++L R S+ L+ L +R K II
Sbjct: 127 GLVVLGLAFQKDKEAQDSRLKRASKGAELARLTVR-GSKSIITGDLPNVNEASETFTTSL 185
Query: 187 -SVNNLRGI-ARLVICSGPASYIMESFKRSEPFTESLLERGVLVV 229
S+ RGI R+VI + I E + S +SL+ +L+V
Sbjct: 186 ASMRRGRGIEKRVVIAAAGKEKIAEILEDSIKLADSLVMSDLLIV 230
>gi|443315905|ref|ZP_21045373.1| Protein of unknown function (DUF3493) [Leptolyngbya sp. PCC 6406]
gi|442784478|gb|ELR94350.1| Protein of unknown function (DUF3493) [Leptolyngbya sp. PCC 6406]
Length = 98
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 7/76 (9%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
AR+ E +P+R R F Y AF SG++G + ++Q+ A DV L L +
Sbjct: 28 ARLVAEAKAPYRGFRKFIYGAFGISGAIGAFVFLSQM-------AAGRDVANALPNLLLQ 80
Query: 143 VSAVSLFAFLYSRENK 158
V V+L FL+ E++
Sbjct: 81 VGLVALMVFLFRLESR 96
>gi|428221458|ref|YP_007105628.1| hypothetical protein Syn7502_01415 [Synechococcus sp. PCC 7502]
gi|427994798|gb|AFY73493.1| Protein of unknown function (DUF3493) [Synechococcus sp. PCC 7502]
Length = 84
Score = 42.0 bits (97), Expect = 0.26, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 38/70 (54%), Gaps = 6/70 (8%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
+++ E+ +PF +R F YLA SG+LG I +TQL+ A ++V L L++ +
Sbjct: 11 KLKAELKTPFLGLRRFIYLACGTSGALGAYIFLTQLL------AGKSEVSTTLSNLAVQM 64
Query: 144 SAVSLFAFLY 153
+ L LY
Sbjct: 65 GVIGLMILLY 74
>gi|427729974|ref|YP_007076211.1| hypothetical protein Nos7524_2788 [Nostoc sp. PCC 7524]
gi|427365893|gb|AFY48614.1| Protein of unknown function (DUF3493) [Nostoc sp. PCC 7524]
Length = 88
Score = 41.6 bits (96), Expect = 0.33, Method: Composition-based stats.
Identities = 24/89 (26%), Positives = 42/89 (47%), Gaps = 9/89 (10%)
Query: 73 DRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADV 132
+R +PE R++ E+++P+R +R F Y ASG +G +L+ D+
Sbjct: 8 NRLNPE--QYNRLKAEMVAPYRGLRQFIYFGVGASGFIGAFTFFFKLLA-------GKDI 58
Query: 133 PEILKGLSIDVSAVSLFAFLYSRENKAKN 161
L ++ V V L FL+ E + +N
Sbjct: 59 EHTLPNFALQVGIVILMIFLWRWEQRRQN 87
>gi|414079532|ref|YP_007000956.1| hypothetical protein ANA_C20461 [Anabaena sp. 90]
gi|413972811|gb|AFW96899.1| hypothetical protein ANA_C20461 [Anabaena sp. 90]
Length = 87
Score = 41.6 bits (96), Expect = 0.35, Method: Composition-based stats.
Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 7/74 (9%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R++ E +P+R +R F Y+ ASG +G + QL+ D+ + +I +
Sbjct: 17 RLKAEAAAPYRGLRQFIYIGVGASGFIGAFVFFFQLLA-------GRDIEQAASNFAIQL 69
Query: 144 SAVSLFAFLYSREN 157
+ L FL+ EN
Sbjct: 70 GVIGLMVFLWKWEN 83
>gi|323453284|gb|EGB09156.1| hypothetical protein AURANDRAFT_63641 [Aureococcus anophagefferens]
Length = 340
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 2/126 (1%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R+R E+ SPFR VR+ F+ A S L ++ A A A + + L+ I++
Sbjct: 63 RLREEIDSPFRKVRLLFFGASTGSAGLAFYFSLLNAFKANAGFRDAPPLSDALQSCGINL 122
Query: 144 SAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLR--VDEKKIISVNNLRGIARLVICS 201
A+ A + + +A A L R+++ L+ L + +K + + R +R+VI
Sbjct: 123 VAIVACAAVTYNDWRAGQANLERIAQGGRLAKLVVSPAAAPEKRAPLADYRRTSRVVIAC 182
Query: 202 GPASYI 207
G +Y+
Sbjct: 183 GGRAYV 188
>gi|428778508|ref|YP_007170294.1| hypothetical protein Dacsa_0122 [Dactylococcopsis salina PCC 8305]
gi|428692787|gb|AFZ48937.1| Protein of unknown function (DUF3493) [Dactylococcopsis salina PCC
8305]
Length = 90
Score = 40.4 bits (93), Expect = 0.71, Method: Composition-based stats.
Identities = 21/77 (27%), Positives = 39/77 (50%), Gaps = 7/77 (9%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R++ E +P+R +R Y+ F ASG +G + +++L A ++ E L L + V
Sbjct: 19 RLQAEANAPYRGLRKTIYVVFAASGLMGAFVMLSKL-------AAGTNIGETLPNLGLQV 71
Query: 144 SAVSLFAFLYSRENKAK 160
V+L +L+ + K
Sbjct: 72 GIVALMVWLFRLDQDKK 88
>gi|220910557|ref|YP_002485868.1| hypothetical protein Cyan7425_5214 [Cyanothece sp. PCC 7425]
gi|219867168|gb|ACL47507.1| hypothetical protein Cyan7425_5214 [Cyanothece sp. PCC 7425]
Length = 99
Score = 40.4 bits (93), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 28/108 (25%), Positives = 51/108 (47%), Gaps = 22/108 (20%)
Query: 53 HSTWTAKRSPLCSNITCSATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGG 112
HS+ + K++P +DP R+R E+ +P+R +R YL F AS ++G
Sbjct: 7 HSSSSRKKAP------------ADPRTE---RLRAELKAPYRGLRQVTYLVFGASAAIGA 51
Query: 113 LIAITQLIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAK 160
+ + +L+ A V L L++ + ++L L+ E K+K
Sbjct: 52 FVFVLKLLAGEA-------VETNLANLALQIGVLALMVGLFRLETKSK 92
>gi|224064428|ref|XP_002301471.1| predicted protein [Populus trichocarpa]
gi|222843197|gb|EEE80744.1| predicted protein [Populus trichocarpa]
Length = 273
Score = 40.4 bits (93), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 34/62 (54%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++ +EV +PFR VR FFY+AF A+ + L I +L A+ A D+ E + ++
Sbjct: 176 KLISEVQAPFRGVRRFFYVAFSAAAGISLLFTIPRLFLAIKGGGDAPDLWETAGNAATNI 235
Query: 144 SA 145
A
Sbjct: 236 GA 237
>gi|113477411|ref|YP_723472.1| hypothetical protein Tery_3974 [Trichodesmium erythraeum IMS101]
gi|110168459|gb|ABG52999.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 87
Score = 40.0 bits (92), Expect = 0.95, Method: Composition-based stats.
Identities = 16/40 (40%), Positives = 26/40 (65%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGA 122
AR++ E+ +P+R +R F Y+AF SG +G I + Q+I
Sbjct: 21 ARLKAELDAPYRGLRKFIYVAFAISGFIGAFIYLAQIIAG 60
>gi|407961699|dbj|BAM54939.1| hypothetical protein BEST7613_6008 [Bacillus subtilis BEST7613]
Length = 86
Score = 40.0 bits (92), Expect = 0.96, Method: Composition-based stats.
Identities = 25/82 (30%), Positives = 40/82 (48%), Gaps = 10/82 (12%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R++ E +P+R R F YL ASG +G I +L+ D EIL L++ +
Sbjct: 10 RLKAEAEAPYRGFRKFIYLGLGASGLIGVFIFSLRLLAGY-------DPTEILPNLALQL 62
Query: 144 SAVSLFAFLY---SRENKAKNA 162
V+L +L+ S +N+ N
Sbjct: 63 GVVALMTWLFRWESSKNQGNNG 84
>gi|427709388|ref|YP_007051765.1| hypothetical protein Nos7107_4063 [Nostoc sp. PCC 7107]
gi|427361893|gb|AFY44615.1| hypothetical protein Nos7107_4063 [Nostoc sp. PCC 7107]
Length = 89
Score = 40.0 bits (92), Expect = 1.1, Method: Composition-based stats.
Identities = 23/85 (27%), Positives = 43/85 (50%), Gaps = 9/85 (10%)
Query: 74 RSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVP 133
R +PE AR++ E +P+R +R F Y+ ASG +G + +++ ++
Sbjct: 9 RLNPE--QYARLKAEAATPYRGLRQFIYIGVGASGFIGAFVFFFRVLA-------GRELE 59
Query: 134 EILKGLSIDVSAVSLFAFLYSRENK 158
L L++ + V+L AFL+ E +
Sbjct: 60 STLPNLALQLGIVALMAFLWRWEKR 84
>gi|414877422|tpg|DAA54553.1| TPA: hypothetical protein ZEAMMB73_812331 [Zea mays]
Length = 113
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 35/63 (55%)
Query: 134 EILKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRG 193
E+L + + + + LY ENK + Q+ ++SR E+LS L +R+ +II + LR
Sbjct: 11 EMLPSILEYAAGIVVLVALYFWENKKEEKQITQISRNETLSRLPVRLSTNRIIELVQLRD 70
Query: 194 IAR 196
I R
Sbjct: 71 ITR 73
>gi|119511048|ref|ZP_01630168.1| hypothetical protein N9414_09896 [Nodularia spumigena CCY9414]
gi|119464299|gb|EAW45216.1| hypothetical protein N9414_09896 [Nodularia spumigena CCY9414]
Length = 89
Score = 39.7 bits (91), Expect = 1.5, Method: Composition-based stats.
Identities = 22/79 (27%), Positives = 39/79 (49%), Gaps = 8/79 (10%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
AR++ E+ +P+R R F Y+ F AS +G I Q++ D+ L++
Sbjct: 16 ARLKAEMATPYRGFRQFIYIGFGASALIGAFIFFFQVLA-------GRDIDNAAPNLALQ 68
Query: 143 VSAVSLFAFLYSRE-NKAK 160
+ ++L FL+ E N+ K
Sbjct: 69 IGVLALMIFLWRWEQNRQK 87
>gi|395848982|ref|XP_003797116.1| PREDICTED: protein FAM176C [Otolemur garnettii]
Length = 441
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 34/145 (23%), Positives = 62/145 (42%), Gaps = 9/145 (6%)
Query: 77 PEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAIT-QLIGALANPARAADVPEI 135
P+++ +R +V P + +R+F+ V S + L + L L N A +
Sbjct: 13 PQLAQHPGLRRQVEQPGQLLRLFYCTVLVCSKEIAALSDFSGYLTKLLQNHTAYACDGDH 72
Query: 136 LKGLSIDVSAVSLFAFLYSRENKAKNAQLARLSREESLSNLKLRVDEKKIISVNNLRGIA 195
L S +S+ + Y ++ + N+Q RE+SL+ + +K + N R
Sbjct: 73 LSLQCPRHSTISVQSAFYGQDYQMCNSQQPASQREDSLTCVAPTTFQKVLDECQNQRACH 132
Query: 196 RLV--------ICSGPASYIMESFK 212
LV +C G + Y++ SFK
Sbjct: 133 LLVNSRVFGPDLCPGSSKYLLVSFK 157
>gi|423063691|ref|ZP_17052481.1| multi-sensor signal transduction histidine kinase [Arthrospira
platensis C1]
gi|406715123|gb|EKD10281.1| multi-sensor signal transduction histidine kinase [Arthrospira
platensis C1]
Length = 1413
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 64/123 (52%), Gaps = 9/123 (7%)
Query: 106 ASGSLGGL-IAITQ---LIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKN 161
A G++GG I Q + G + + D E+L+ L+ V+ A LY +E+KAK
Sbjct: 1066 AGGTIGGFEIGRVQSDCIKGCECDRSWLDDEIELLQNLATQVAIAIYQAQLY-QESKAKG 1124
Query: 162 AQLARLSREESLSNLKLRVDEKKIISVNNL-RGIARLVICSGPASYIMESFKRSEPFTES 220
+L + RE + ++L + +K+ S+ L G+A + + P S+I + + +TES
Sbjct: 1125 KELQKAYRELQETQVQL-IQAEKMSSLGQLVAGVAHEI--NNPVSFIYGNLTHTSTYTES 1181
Query: 221 LLE 223
LLE
Sbjct: 1182 LLE 1184
>gi|376003846|ref|ZP_09781647.1| putative two-component sensor histidine kinase [Arthrospira sp. PCC
8005]
gi|375327788|emb|CCE17400.1| putative two-component sensor histidine kinase [Arthrospira sp. PCC
8005]
Length = 1385
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 64/123 (52%), Gaps = 9/123 (7%)
Query: 106 ASGSLGGL-IAITQ---LIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKN 161
A G++GG I Q + G + + D E+L+ L+ V+ A LY +E+KAK
Sbjct: 1038 AGGTIGGFEIGRVQSDCIKGCECDRSWLDDEIELLQNLATQVAIAIYQAQLY-QESKAKG 1096
Query: 162 AQLARLSREESLSNLKLRVDEKKIISVNNL-RGIARLVICSGPASYIMESFKRSEPFTES 220
+L + RE + ++L + +K+ S+ L G+A + + P S+I + + +TES
Sbjct: 1097 KELQKAYRELQETQVQL-IQAEKMSSLGQLVAGVAHEI--NNPVSFIYGNLTHTSTYTES 1153
Query: 221 LLE 223
LLE
Sbjct: 1154 LLE 1156
>gi|209523990|ref|ZP_03272542.1| multi-sensor signal transduction histidine kinase [Arthrospira maxima
CS-328]
gi|209495662|gb|EDZ95965.1| multi-sensor signal transduction histidine kinase [Arthrospira maxima
CS-328]
Length = 1411
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 64/123 (52%), Gaps = 9/123 (7%)
Query: 106 ASGSLGGL-IAITQ---LIGALANPARAADVPEILKGLSIDVSAVSLFAFLYSRENKAKN 161
A G++GG I Q + G + + D E+L+ L+ V+ A LY +E+KAK
Sbjct: 1064 AGGTIGGFEIGRVQSDCIKGCECDRSWLDDEIELLQNLATQVAIAIYQAQLY-QESKAKG 1122
Query: 162 AQLARLSREESLSNLKLRVDEKKIISVNNL-RGIARLVICSGPASYIMESFKRSEPFTES 220
+L + RE + ++L + +K+ S+ L G+A + + P S+I + + +TES
Sbjct: 1123 KELQKAYRELQETQVQL-IQAEKMSSLGQLVAGVAHEI--NNPVSFIYGNLTHTSTYTES 1179
Query: 221 LLE 223
LLE
Sbjct: 1180 LLE 1182
>gi|298490036|ref|YP_003720213.1| hypothetical protein Aazo_0602 ['Nostoc azollae' 0708]
gi|298231954|gb|ADI63090.1| conserved hypothetical protein ['Nostoc azollae' 0708]
Length = 89
Score = 38.9 bits (89), Expect = 2.3, Method: Composition-based stats.
Identities = 19/76 (25%), Positives = 37/76 (48%), Gaps = 7/76 (9%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
A+++ E+ +P+R +R F Y+ ASG +G + Q++ ++ L +++
Sbjct: 16 AQLKAEITAPYRGLRQFIYVGIGASGFIGAFVFSFQILA-------GRNIDSALPNFTLE 68
Query: 143 VSAVSLFAFLYSRENK 158
V L FLY E +
Sbjct: 69 AGIVGLMVFLYRWEQR 84
>gi|428218811|ref|YP_007103276.1| hypothetical protein Pse7367_2592 [Pseudanabaena sp. PCC 7367]
gi|427990593|gb|AFY70848.1| hypothetical protein Pse7367_2592 [Pseudanabaena sp. PCC 7367]
Length = 103
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 21/74 (28%), Positives = 39/74 (52%), Gaps = 6/74 (8%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
AR++ E +P+R++R F YL+ ASG +G + +++ N + E L L++
Sbjct: 30 ARLKAESKAPYRNLRKFVYLSLAASGLIGAFTFLMRILAGKGN------LTEDLGSLALQ 83
Query: 143 VSAVSLFAFLYSRE 156
V+ V L + R+
Sbjct: 84 VAIVVLMILAFRRD 97
>gi|443479266|ref|ZP_21068879.1| hypothetical protein Pse7429DRAFT_4585 [Pseudanabaena biceps PCC
7429]
gi|443015234|gb|ELS30286.1| hypothetical protein Pse7429DRAFT_4585 [Pseudanabaena biceps PCC
7429]
Length = 80
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 20/70 (28%), Positives = 38/70 (54%), Gaps = 6/70 (8%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
R++ E SP+R +R+F Y AF SG +G ++ + +LI A ++ L ++ +
Sbjct: 11 RLKLEAKSPYRGLRIFIYAAFAGSGFIGAVVFLARLI------AGKGELEANLGSFALQI 64
Query: 144 SAVSLFAFLY 153
++L +LY
Sbjct: 65 GVLALMVWLY 74
>gi|440682401|ref|YP_007157196.1| hypothetical protein Anacy_2855 [Anabaena cylindrica PCC 7122]
gi|428679520|gb|AFZ58286.1| hypothetical protein Anacy_2855 [Anabaena cylindrica PCC 7122]
Length = 89
Score = 38.5 bits (88), Expect = 3.5, Method: Composition-based stats.
Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 7/74 (9%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
A ++ E+ +P+R +R F Y+ ASG +G QL+ ++ L ++
Sbjct: 16 AHLKAEIAAPYRGLRQFIYIGVGASGFIGAFTFFFQLLA-------GKNIDSTLPNFALQ 68
Query: 143 VSAVSLFAFLYSRE 156
V V+L FL+ E
Sbjct: 69 VGIVALMIFLWKWE 82
>gi|226501606|ref|NP_001143133.1| uncharacterized protein LOC100275611 [Zea mays]
gi|195614826|gb|ACG29243.1| hypothetical protein [Zea mays]
Length = 273
Score = 38.1 bits (87), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 6/81 (7%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++ +EV +PFR VR FFY+A A+ + I +L+ A+ A D+ E +I++
Sbjct: 185 KLISEVQAPFRGVRRFFYVALTAAAGISTFFTIPRLVLAVRGGDDAPDLLETAGNAAINI 244
Query: 144 S------AVSLFAFLYSRENK 158
F FL +E +
Sbjct: 245 GVCRRYCCAGSFVFLGKQERR 265
>gi|359457400|ref|ZP_09245963.1| hypothetical protein ACCM5_01647 [Acaryochloris sp. CCMEE 5410]
Length = 92
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 25/39 (64%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGA 122
++R E+ SP+R +R FFY+AF S +G ++ QL+
Sbjct: 21 QLRAELKSPYRGLRQFFYIAFAGSAFVGAVVFFFQLLAG 59
>gi|427711370|ref|YP_007059994.1| hypothetical protein Syn6312_0210 [Synechococcus sp. PCC 6312]
gi|427375499|gb|AFY59451.1| Protein of unknown function (DUF3493) [Synechococcus sp. PCC 6312]
Length = 89
Score = 38.1 bits (87), Expect = 4.1, Method: Composition-based stats.
Identities = 24/89 (26%), Positives = 42/89 (47%), Gaps = 9/89 (10%)
Query: 70 SATDRSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARA 129
S R++ E R+ E +P+R +R YL F ASG +G +I + +LI A+
Sbjct: 5 SPQTRAEAEFRQ--RLLKEAQAPYRGLRQVMYLVFAASGLIGAMIFLLKLIAGQAD---- 58
Query: 130 ADVPEILKGLSIDVSAVSLFAFLYSRENK 158
E+ ++ + ++L L+ E K
Sbjct: 59 ---QELWGNFALQLGVIALMISLWRWEQK 84
>gi|428215965|ref|YP_007089109.1| hypothetical protein Oscil6304_5714 [Oscillatoria acuminata PCC
6304]
gi|428004346|gb|AFY85189.1| Protein of unknown function (DUF3493) [Oscillatoria acuminata PCC
6304]
Length = 101
Score = 37.7 bits (86), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 20/75 (26%), Positives = 37/75 (49%), Gaps = 7/75 (9%)
Query: 83 ARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSID 142
AR+R E +P++ +R F Y+ ASG +G + + QL+ +V L ++
Sbjct: 32 ARLRAEAAAPYKGLRKFVYVVCGASGGIGAFVFLAQLLA-------GREVGAALPNFALQ 84
Query: 143 VSAVSLFAFLYSREN 157
+ V+L +L+ E
Sbjct: 85 LGVVALMVWLFRLEK 99
>gi|17231101|ref|NP_487649.1| hypothetical protein asl3609 [Nostoc sp. PCC 7120]
gi|17132742|dbj|BAB75308.1| asl3609 [Nostoc sp. PCC 7120]
Length = 88
Score = 37.7 bits (86), Expect = 4.6, Method: Composition-based stats.
Identities = 28/83 (33%), Positives = 41/83 (49%), Gaps = 9/83 (10%)
Query: 74 RSDPEISSAARIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVP 133
R +PE AR++ E+ +P+R +R FY+ F ASG +G I QL+ DV
Sbjct: 9 RLNPE--QYARLKAEMATPYRGLRQLFYIGFGASGFIGAFIFFFQLLA-------GRDVE 59
Query: 134 EILKGLSIDVSAVSLFAFLYSRE 156
L ++ V V L FL+ E
Sbjct: 60 TALPNFALQVGVVGLMIFLWRWE 82
>gi|414877424|tpg|DAA54555.1| TPA: hypothetical protein ZEAMMB73_812331, partial [Zea mays]
Length = 197
Score = 37.7 bits (86), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 33/61 (54%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++ +EV +PFR VR FFY+A A+ + I +L+ A+ A D+ E +I++
Sbjct: 109 KLISEVQAPFRGVRRFFYVALTAAAGISTFFTIPRLVLAVRGGDGAPDLLETAGNAAINI 168
Query: 144 S 144
Sbjct: 169 G 169
>gi|194697836|gb|ACF83002.1| unknown [Zea mays]
Length = 273
Score = 37.7 bits (86), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 6/81 (7%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGALANPARAADVPEILKGLSIDV 143
++ +EV +PFR VR FFY+A A+ + I +L+ A+ A D+ E +I++
Sbjct: 185 KLISEVQAPFRGVRRFFYVALTAAAGISTFFTIPRLVLAVRGGDGAPDLLETAGNAAINI 244
Query: 144 SAV------SLFAFLYSRENK 158
F FL +E +
Sbjct: 245 GVCRRYCCSGSFVFLGKQERR 265
>gi|158334971|ref|YP_001516143.1| hypothetical protein AM1_1807 [Acaryochloris marina MBIC11017]
gi|158305212|gb|ABW26829.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 92
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 25/39 (64%)
Query: 84 RIRNEVLSPFRSVRMFFYLAFVASGSLGGLIAITQLIGA 122
++R E+ SP+R +R FFY+AF S +G ++ QL+
Sbjct: 21 QLRAELKSPYRGLRRFFYIAFAGSAFVGAVVFFFQLLAG 59
>gi|125974000|ref|YP_001037910.1| methyl-accepting chemotaxis sensory transducer [Clostridium
thermocellum ATCC 27405]
gi|125714225|gb|ABN52717.1| methyl-accepting chemotaxis sensory transducer [Clostridium
thermocellum ATCC 27405]
Length = 758
Score = 37.0 bits (84), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 23/88 (26%), Positives = 47/88 (53%), Gaps = 4/88 (4%)
Query: 183 KKIISVNNLRGIARLVICSGPASYIMESFKRSEPFTES---LLERGVLVVSFTTDGNIPS 239
++IISV+N + I +IC + YI++ ++ + ++ +L+ LV+S +D +
Sbjct: 255 RRIISVSNAKTIG-YIICVMKSDYILKQYEGIDLGEDTDIFILDSNGLVISSNSDEFVAG 313
Query: 240 FEFEESEEMKEFTRKRKRLWQLTPVYVS 267
EF + + +KE T K + P Y++
Sbjct: 314 LEFPDKDFVKELTLKNEEKINAFPAYIN 341
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.133 0.385
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,969,147,834
Number of Sequences: 23463169
Number of extensions: 148312490
Number of successful extensions: 402426
Number of sequences better than 100.0: 173
Number of HSP's better than 100.0 without gapping: 126
Number of HSP's successfully gapped in prelim test: 47
Number of HSP's that attempted gapping in prelim test: 402221
Number of HSP's gapped (non-prelim): 177
length of query: 271
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 131
effective length of database: 9,074,351,707
effective search space: 1188740073617
effective search space used: 1188740073617
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 75 (33.5 bits)