BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023868
(276 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449457015|ref|XP_004146244.1| PREDICTED: uncharacterized protein LOC101221005 [Cucumis sativus]
gi|449495512|ref|XP_004159863.1| PREDICTED: uncharacterized LOC101221005 [Cucumis sativus]
Length = 282
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 174/283 (61%), Positives = 205/283 (72%), Gaps = 15/283 (5%)
Query: 1 MLQTHHLLSLNFP-FTVSHHPQKLN---FLQKPTISLSAFPRR------RPLIEPYCLAQ 50
M T +LLS NFP FT+S P N FL PT +LS+ R PL C
Sbjct: 1 MWHTQNLLSSNFPLFTLS--PPTYNHKLFLSPPT-TLSSLHRPITFHSVSPLTNHRCFCL 57
Query: 51 AQEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
Q T A + GPVELP +IFATTD+PSSLQVATSVLLTGAISVFLFR+LRRRAK
Sbjct: 58 PQF-TDLADATFLDDNGPVELPPTIFATTDDPSSLQVATSVLLTGAISVFLFRSLRRRAK 116
Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKF 170
R KELKFRS G KKSLK+EA+D+LKA+ + I++K PSP+QA LG + AGVIA+ILYKF
Sbjct: 117 RVKELKFRSGGVKKSLKEEAMDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKF 176
Query: 171 TTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSF 230
TTTIEA+LNRQT+SDNFSVRQ+TITIRTIVNGLCYLATFVFGIN++GLFLYSGQLA+NS
Sbjct: 177 TTTIEASLNRQTVSDNFSVRQLTITIRTIVNGLCYLATFVFGINAIGLFLYSGQLAMNSV 236
Query: 231 TEDSSSSKETENIGEQQSGSLNSTAENATD-TELNSGKEDQSS 272
E+ S E + ++Q STAE D TE ++ K+DQSS
Sbjct: 237 MEEGSKDTEPKAKSDEQVSPPTSTAETTLDSTESSNSKDDQSS 279
>gi|225461564|ref|XP_002282834.1| PREDICTED: uncharacterized protein LOC100267434 [Vitis vinifera]
gi|302142945|emb|CBI20240.3| unnamed protein product [Vitis vinifera]
Length = 277
Score = 292 bits (747), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 174/275 (63%), Positives = 205/275 (74%), Gaps = 21/275 (7%)
Query: 6 HLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSE 65
H L+ PF P L+ +S R PL+ AQ P T TAP E
Sbjct: 20 HNPCLSPPFCSLSRPTSLHL-----VSAHLRTRAEPLL-------AQVPDTTTTAP---E 64
Query: 66 EGPVELP---QSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGA 122
EGP+ELP SIFAT D+P+ LQVATSVLLTGAISVFLFR++RRR KRAKEL+FRSSG
Sbjct: 65 EGPIELPPSSSSIFATNDDPTPLQVATSVLLTGAISVFLFRSIRRRVKRAKELRFRSSGV 124
Query: 123 KKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQT 182
KK+LK+EALD+LKA+GS S+ A PPSPVQALLGG+TAGVIA+ILYKFT TIEA+LNRQT
Sbjct: 125 KKTLKEEALDSLKAMGSGSVKA-APPSPVQALLGGITAGVIALILYKFTITIEASLNRQT 183
Query: 183 ISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETEN 242
+SDNFSVRQITITIRTI+NGLCYLATFVFGINSVGLFLYSGQLALNSF E S+S+ET+
Sbjct: 184 VSDNFSVRQITITIRTIINGLCYLATFVFGINSVGLFLYSGQLALNSFME-GSTSEETQI 242
Query: 243 IGEQQSGSLNSTAENATD-TELNSGKEDQSSDSRQ 276
GE+Q + N+ ++ D T+LNS + DQ + Q
Sbjct: 243 KGEEQLSTPNTLTDSPPDGTQLNSSEGDQRAQDEQ 277
>gi|255564541|ref|XP_002523266.1| conserved hypothetical protein [Ricinus communis]
gi|223537479|gb|EEF39105.1| conserved hypothetical protein [Ricinus communis]
Length = 285
Score = 286 bits (733), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 177/288 (61%), Positives = 211/288 (73%), Gaps = 15/288 (5%)
Query: 1 MLQTHHLLSLN--FPFTVSHHPQKLNF-----LQKPTISLSAFPRRRPLIEPYCLAQAQE 53
ML HH + FP HH KPT SLS F P +P ++
Sbjct: 1 MLHCHHYIPSPQFFPRNHHHHTSTPYLSLTSSFSKPT-SLSTF-SALPPSKPDIWLASEP 58
Query: 54 PTTNITAPTTSEEGPVELP----QSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRA 109
+P+ E+GP+ELP SIFATTD+PS +QVATSVLLTG+I VFLFRALRRRA
Sbjct: 59 TPVTPPSPSPEEDGPIELPFPSSPSIFATTDDPSPIQVATSVLLTGSIGVFLFRALRRRA 118
Query: 110 KRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYK 169
KRAKELKFRSSGAKK++K+EALD+LKA+GS+SIDA+ PPSP+Q LGG++A VIA+ILYK
Sbjct: 119 KRAKELKFRSSGAKKTMKEEALDSLKAMGSASIDAQKPPSPLQTFLGGVSAAVIALILYK 178
Query: 170 FTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNS 229
FTTTIEAALNRQT+SDNFSVRQITIT+RTI+NGLCYLATFVFG+NSVGLFLYSGQLA+NS
Sbjct: 179 FTTTIEAALNRQTVSDNFSVRQITITVRTIINGLCYLATFVFGLNSVGLFLYSGQLAINS 238
Query: 230 FTEDSSSSKETENIGEQQSGSLNSTAENATD-TELNSGKEDQSSDSRQ 276
F E S+S + G +QSGSLNS E+ D TE NS + DQ+ D+ Q
Sbjct: 239 FMEGSTSEESESK-GNEQSGSLNSAPESPIDSTESNSSQGDQNPDNSQ 285
>gi|356544064|ref|XP_003540475.1| PREDICTED: uncharacterized protein LOC100799393 [Glycine max]
Length = 277
Score = 285 bits (730), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 176/288 (61%), Positives = 209/288 (72%), Gaps = 34/288 (11%)
Query: 5 HHLLSLNFPFTVSHHPQK--LNFL----QKPTI----SLSAFPRRRPLIEPYCLAQAQEP 54
HHLL FP T++HH K L+ L KPTI S SA P P+ A
Sbjct: 8 HHLL---FPLTLAHHKVKPPLSILPPLHHKPTILHSVSASAIP-------PWAAQLADAA 57
Query: 55 TTNITAPTTSEEGPVELP----QSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
T EGPVELP SIFAT+D+PS +QVA+SVLLTGA+SVFLFR+LRRRA+
Sbjct: 58 ADIDT------EGPVELPFSSTPSIFATSDDPSPIQVASSVLLTGAVSVFLFRSLRRRAQ 111
Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAK-GPPSPVQALLGGLTAGVIAIILYK 169
R K+ +FRSSG +KS+K+EALD+LKA+GS+S+ AK SPVQALLGG++AGVIA+ILYK
Sbjct: 112 RVKQSQFRSSG-EKSIKEEALDSLKAMGSASVKAKEDSASPVQALLGGISAGVIALILYK 170
Query: 170 FTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNS 229
F TTIEAALNRQTISDNFSVRQITITIRTIVNGL YLATFVFG+NS+GLFLYSGQLAL S
Sbjct: 171 FATTIEAALNRQTISDNFSVRQITITIRTIVNGLTYLATFVFGLNSLGLFLYSGQLALKS 230
Query: 230 FTEDSSSSKETENIGEQQSGSLNSTAENATD-TELNSGKEDQSSDSRQ 276
D S+ KETE+ Q NS+ E++TD TEL+S KE+QSS+ Q
Sbjct: 231 IMGD-STEKETESKSTDQPNLSNSSNESSTDKTELSSRKEEQSSNDAQ 277
>gi|224116710|ref|XP_002317372.1| predicted protein [Populus trichocarpa]
gi|222860437|gb|EEE97984.1| predicted protein [Populus trichocarpa]
Length = 211
Score = 275 bits (704), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 148/197 (75%), Positives = 172/197 (87%), Gaps = 2/197 (1%)
Query: 74 SIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDN 133
SIFAT+D+PSS+QVATSVLLTGAISVFLFR+LRRRAKR+KELKFRSSGAKK+LK+EALD+
Sbjct: 16 SIFATSDDPSSIQVATSVLLTGAISVFLFRSLRRRAKRSKELKFRSSGAKKTLKEEALDS 75
Query: 134 LKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQIT 193
LK GS+ ID K PPSPVQA LG ++AGVIA+ILYKF+TTIEA+LNRQT+SDNFSVRQIT
Sbjct: 76 LKTFGSAPIDVKKPPSPVQAFLGAISAGVIALILYKFSTTIEASLNRQTVSDNFSVRQIT 135
Query: 194 ITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETENIGEQQSGSLNS 253
IT+RTIVNGLCYLATFVFGINSVGLFLYS QLA NS E + + + TE+ QSGS NS
Sbjct: 136 ITVRTIVNGLCYLATFVFGINSVGLFLYSAQLAFNSLME-AGTDEVTESTTSGQSGSSNS 194
Query: 254 TAENATD-TELNSGKED 269
T+++ATD EL S + D
Sbjct: 195 TSDSATDGAELTSSQGD 211
>gi|118488906|gb|ABK96262.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 238
Score = 261 bits (666), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 144/225 (64%), Positives = 168/225 (74%), Gaps = 13/225 (5%)
Query: 2 LQTHHLLSLN---FPFT---VSHHPQKLNFLQKPTI----SLSAFPRRRPLIEPYCLAQA 51
LQ H+ L+ FP + +HH L FL +P I +LSA PR RP P LAQ
Sbjct: 4 LQCHNFHPLSPQIFPLSHHHTNHHLYPLKFLYRPPIFTPFTLSALPRSRP-DPPLWLAQL 62
Query: 52 QEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKR 111
E + SIFAT+D+PSS+QVATSVLLTGAISVFLFR+LRRRAKR
Sbjct: 63 PEQAPEPEE--EGPIEILRSSPSIFATSDDPSSIQVATSVLLTGAISVFLFRSLRRRAKR 120
Query: 112 AKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFT 171
+KELKFRSSGAKK+LK+EALD+LK GS+ ID K PPSPVQA LG ++AGVIA+ILYKFT
Sbjct: 121 SKELKFRSSGAKKTLKEEALDSLKTFGSAPIDVKKPPSPVQAFLGAISAGVIALILYKFT 180
Query: 172 TTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSV 216
TTIEA+LNRQT+SDNFSVRQIT+T+RTIVNGLCYLATFVFGINS+
Sbjct: 181 TTIEASLNRQTVSDNFSVRQITVTVRTIVNGLCYLATFVFGINSL 225
>gi|21617902|gb|AAM66952.1| unknown [Arabidopsis thaliana]
Length = 266
Score = 258 bits (659), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 153/285 (53%), Positives = 192/285 (67%), Gaps = 36/285 (12%)
Query: 1 MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITA 60
+LQ+H L + P+ + P +L P SLS+F R RP I L+ +E
Sbjct: 3 VLQSHQCL-FSLPYRL--RPTRL---ISPIHSLSSFTRIRPGI--IRLSAVKEIAD---V 51
Query: 61 PTTSEEGPVELPQS----------IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
E+GP+ELP S IFAT+D+P+ LQ+ATSVLLTGAI+VFL R++RRRAK
Sbjct: 52 AEVEEDGPIELPTSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAK 111
Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYK 169
RAKEL+FRS+GAKKSLK+EA+DNLKAL S+ I+ PS QA LG + AGVIA+ILYK
Sbjct: 112 RAKELQFRSTGAKKSLKEEAMDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYK 171
Query: 170 FTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNS 229
FT T+E+ LNRQTISDNFSVRQIT+T+RTI+NG+CYLATFVFG+N+ GL LYSGQLA N
Sbjct: 172 FTVTVESGLNRQTISDNFSVRQITVTVRTIINGICYLATFVFGLNAFGLLLYSGQLAFN- 230
Query: 230 FTEDSSSS--KETENIGEQQSGSLNSTAENATDTELNSGKEDQSS 272
EDS+ K T G+ SG ++E+N EDQSS
Sbjct: 231 --EDSAEENMKATTQPGDSSSGD---------NSEVNKSNEDQSS 264
>gi|18400636|ref|NP_566500.1| uncharacterized protein [Arabidopsis thaliana]
gi|87116616|gb|ABD19672.1| At3g15110 [Arabidopsis thaliana]
gi|332642098|gb|AEE75619.1| uncharacterized protein [Arabidopsis thaliana]
Length = 266
Score = 257 bits (656), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 153/285 (53%), Positives = 191/285 (67%), Gaps = 36/285 (12%)
Query: 1 MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITA 60
+LQ+H L + P+ + P +L P SLS+F R RP I L+ +E
Sbjct: 3 VLQSHQCL-FSLPYRL--RPTRL---ISPIHSLSSFTRIRPGI--IRLSAVKEIAD---V 51
Query: 61 PTTSEEGPVELPQS----------IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
E+GP+ELP S IFAT+D+P+ LQ+ATSVLLTGAI+VFL R++RRRAK
Sbjct: 52 AEVEEDGPIELPTSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAK 111
Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYK 169
RAKEL FRS+GAKKSLK+EA+DNLKAL S+ I+ PS QA LG + AGVIA+ILYK
Sbjct: 112 RAKELTFRSTGAKKSLKEEAMDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYK 171
Query: 170 FTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNS 229
FT T+E+ LNRQTISDNFSVRQIT+T+RTI+NG+CYLATFVFG+N+ GL LYSGQLA N
Sbjct: 172 FTVTVESGLNRQTISDNFSVRQITVTVRTIINGICYLATFVFGLNAFGLLLYSGQLAFN- 230
Query: 230 FTEDSSSS--KETENIGEQQSGSLNSTAENATDTELNSGKEDQSS 272
EDS+ K T G+ SG ++E+N EDQSS
Sbjct: 231 --EDSAEENMKATTQPGDSSSGD---------NSEVNKSNEDQSS 264
>gi|297834372|ref|XP_002885068.1| hypothetical protein ARALYDRAFT_478942 [Arabidopsis lyrata subsp.
lyrata]
gi|297330908|gb|EFH61327.1| hypothetical protein ARALYDRAFT_478942 [Arabidopsis lyrata subsp.
lyrata]
Length = 261
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 147/282 (52%), Positives = 187/282 (66%), Gaps = 34/282 (12%)
Query: 1 MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITA 60
+LQ+H FT+ H + + P SLS+F R RP + ++ +E
Sbjct: 3 VLQSHQC-----RFTLPHRLRPTRLIS-PIHSLSSFTRIRP--DRIRVSVVKEIAD---V 51
Query: 61 PTTSEEGPVELPQS----------IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
E+GP+ELP S IFAT+D+P+ LQ+ATSVLLTGAI+VFL R++RRRAK
Sbjct: 52 AEVEEDGPIELPPSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAK 111
Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYK 169
RAKEL+FRS+GAKKSLK+EA+DNLKALGS+ I+ PS QA LG + AGVIA+ILYK
Sbjct: 112 RAKELRFRSTGAKKSLKEEAMDNLKALGSTPIEGGNSTPSAAQAFLGAIAAGVIALILYK 171
Query: 170 FTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNS 229
FT T+E+ LNRQTISDNFSVRQIT+T+RTI+NG+CYLATFVFG+N+ GL LY+GQLA N
Sbjct: 172 FTVTVESGLNRQTISDNFSVRQITVTVRTIINGICYLATFVFGLNAFGLLLYAGQLAFN- 230
Query: 230 FTEDSSSSKETENIGEQQSGSLNSTAENATDTELNSGKEDQS 271
EDS + E SG ++E+N EDQS
Sbjct: 231 --EDSDEDMKATTQPEDSSGD---------NSEVNKSNEDQS 261
>gi|217072908|gb|ACJ84814.1| unknown [Medicago truncatula]
Length = 271
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 132/217 (60%), Positives = 167/217 (76%), Gaps = 8/217 (3%)
Query: 66 EGPVELPQ----SIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSG 121
+GP+E+P S+ ++TD+PS +Q+A S+LLTGAISV LFR+ RRRAKR K+ +FRSSG
Sbjct: 57 DGPIEIPYDSTPSLLSSTDDPSFIQIAASLLLTGAISVLLFRSFRRRAKRLKQTQFRSSG 116
Query: 122 AKKSLKDEALDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
+KS+K+EAL+ LKA G++SI+ KGPPSPVQ LG ++AGVI++ILYKF T IEA L+R
Sbjct: 117 -EKSVKEEALETLKATGTASIETTKGPPSPVQTFLGAISAGVISLILYKFATIIEAGLSR 175
Query: 181 QTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKET 240
QTISD+FS RQITIT+RTI+NGL YLATF+FG+NS+GL LYSGQLAL SFT D S KET
Sbjct: 176 QTISDDFSARQITITVRTIINGLTYLATFIFGLNSLGLLLYSGQLALKSFTGD-SIEKET 234
Query: 241 ENIGEQQSGSLNSTAE-NATDTELNSGKEDQSSDSRQ 276
EN QS N + E D EL+ E+QSS+ Q
Sbjct: 235 ENKSADQSSLSNLSVETRINDAELSGRSEEQSSNDAQ 271
>gi|388511081|gb|AFK43606.1| unknown [Lotus japonicus]
Length = 251
Score = 239 bits (611), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 143/278 (51%), Positives = 177/278 (63%), Gaps = 52/278 (18%)
Query: 9 SLNFPFTVSHH---------PQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNIT 59
+L P T++HH L+FL +PTI P I P + E TT
Sbjct: 15 NLPVPLTLTHHHSSFSHPPPSHLLHFLHRPTI----IPHTISAIPPTLSSWLTELTTG-- 68
Query: 60 APTTSEEGPVELPQSIFATTDEPSS-LQVATSVLLTGAISVFLFRALRRRAKRAKELKFR 118
TD+PSS +QV +++L+TGAI++F FR L+RRAKRAKELKFR
Sbjct: 69 -------------------TDDPSSTIQVTSTILVTGAITLFFFRTLQRRAKRAKELKFR 109
Query: 119 SSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAAL 178
SSG KSL S+SI AK PSP QALLG + AGVIA+ILY+FTT++EA L
Sbjct: 110 SSGVNKSL-----------ASTSIKAKKRPSPDQALLGAIIAGVIAVILYRFTTSVEATL 158
Query: 179 NRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSK 238
RQTISDNFSVRQ+TITIRTI+NGLCYLATFV+GINSVGL LYSGQLA+N+F E S K
Sbjct: 159 YRQTISDNFSVRQVTITIRTIINGLCYLATFVYGINSVGLLLYSGQLAMNTF-EGGPSGK 217
Query: 239 ETENIGEQQSGSLNSTAENATDTELNSGKEDQSSDSRQ 276
ETE +QSGS NS+ + +LNS KED++S++ Q
Sbjct: 218 ETEKRILEQSGSSNSSTD-----KLNSTKEDENSNNSQ 250
>gi|357497633|ref|XP_003619105.1| hypothetical protein MTR_6g042030 [Medicago truncatula]
gi|355494120|gb|AES75323.1| hypothetical protein MTR_6g042030 [Medicago truncatula]
Length = 254
Score = 221 bits (562), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 120/204 (58%), Positives = 148/204 (72%), Gaps = 20/204 (9%)
Query: 75 IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNL 134
IF DE SS QV SV+L+ AI+VFLF ++RR KRAK+LK+RSSG KKS +L++
Sbjct: 69 IFDAIDESSSNQVTISVVLSVAIAVFLFPTIQRRIKRAKQLKYRSSGVKKS----SLNST 124
Query: 135 KAL-GSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQIT 193
K L GS+ +K PPSP +ALLG + AGVIA+ILY+FTTTIEA+L RQTISDNFSVRQIT
Sbjct: 125 KGLMGSTRKKSKNPPSPDKALLGAVMAGVIAVILYRFTTTIEASLYRQTISDNFSVRQIT 184
Query: 194 ITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETENIGEQQSGSLNS 253
ITIRT++NGLCYLATFV+GINS GL LYSGQLALN++ + SSS K+ E
Sbjct: 185 ITIRTVINGLCYLATFVYGINSFGLLLYSGQLALNTYEKGSSSGKKIE------------ 232
Query: 254 TAENATDTELNSGK-EDQSSDSRQ 276
+ +NS K EDQSS++ Q
Sbjct: 233 --SKIMENHINSNKNEDQSSNNSQ 254
>gi|356514641|ref|XP_003526013.1| PREDICTED: uncharacterized protein LOC100783357 [Glycine max]
Length = 276
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 146/273 (53%), Positives = 181/273 (66%), Gaps = 24/273 (8%)
Query: 10 LNFPFTVSHHPQKL---NFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEE 66
L T +HH L +FL +P I+ + I P A+ TT +
Sbjct: 18 LRLSLTTTHHNSSLPSLHFLHRPIITTTTTTTTVSAIPPTAAWLAKL--------TTQDF 69
Query: 67 GPVELPQS-IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKS 125
ELP S +F TD+PS++QVA SVLLTGAI+VF FR+++RRAKRAKELKFRSSG KS
Sbjct: 70 AHFELPSSHLF--TDDPSTIQVACSVLLTGAITVFFFRSVQRRAKRAKELKFRSSGVNKS 127
Query: 126 LKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISD 185
L DNLKA+ SSSI AK PPSP QALLG + AGVIA++LY+FTT+IEA L RQT+SD
Sbjct: 128 L-----DNLKAISSSSIKAKSPPSPDQALLGAIIAGVIAVVLYRFTTSIEATLGRQTLSD 182
Query: 186 NFSVRQ-ITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETENIG 244
NFSVRQ TI+NGLCYLATFV+GINSVGLFLYSGQLA+++ SS KETE+
Sbjct: 183 NFSVRQITITIRETIINGLCYLATFVYGINSVGLFLYSGQLAMDTM-GGGSSGKETESKI 241
Query: 245 EQQSG---SLNSTAENATDTELNSGKEDQSSDS 274
QQ G + + N + +L+S KEDQSS++
Sbjct: 242 TQQPGLSNASVENSNNNNNNKLSSTKEDQSSNN 274
>gi|357144153|ref|XP_003573191.1| PREDICTED: uncharacterized protein LOC100830225 [Brachypodium
distachyon]
Length = 284
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 135/221 (61%), Positives = 161/221 (72%), Gaps = 9/221 (4%)
Query: 35 AFPRRRPLI-----EPYCLAQAQEPTTNITAPTTSEEGPVEL-PQSIFATTDEPSSLQVA 88
A P RRP EP A + T EEGPVEL ++F+ D P+SLQVA
Sbjct: 28 ARPGRRPTALRAEPEPQSPPPASAAASPDTTDDGGEEGPVELRAPTLFSVDDNPTSLQVA 87
Query: 89 TSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK--SLKDEALDNLKALGSSSIDAKG 146
TSVLLTGAISVFLFR+LRRRA+RAKEL+ RS G +K +L +EAL+ L+ + +S ++ +
Sbjct: 88 TSVLLTGAISVFLFRSLRRRARRAKELRVRSGGLEKPKNLSEEALEALRMVSTSPVETEK 147
Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYL 206
PPSPVQALLGG+ AGVIA+ LYKF TTIEA+LNRQTISDNFSVRQIT+TIRTI+NG+CYL
Sbjct: 148 PPSPVQALLGGIAAGVIALFLYKFATTIEASLNRQTISDNFSVRQITVTIRTIINGICYL 207
Query: 207 ATFVFGINSVGLFLYSGQLALNSFTEDSSSSKE-TENIGEQ 246
ATFVFGIN+VGL LYS QL NS +D SS E I EQ
Sbjct: 208 ATFVFGINAVGLILYSLQLTFNSIMDDDDSSSSPVEKISEQ 248
>gi|326511271|dbj|BAJ87649.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326513094|dbj|BAK06787.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326529293|dbj|BAK01040.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 281
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 123/186 (66%), Positives = 149/186 (80%), Gaps = 3/186 (1%)
Query: 64 SEEGPVEL-PQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGA 122
S +GPVE+ ++FA D P+ LQVATSV+LTGAISVFLFR+LRRRA+RAKEL+ RS G
Sbjct: 63 SGDGPVEIRAPTLFAVDDNPTPLQVATSVMLTGAISVFLFRSLRRRARRAKELRVRSGGV 122
Query: 123 KK--SLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
+K +L +EAL+ L+ + +S ++ PPSPVQALLGG+ AGVIA+ LYKF +T+EA+LNR
Sbjct: 123 EKPRNLSEEALEALRMVSTSPVETNKPPSPVQALLGGIAAGVIALFLYKFASTVEASLNR 182
Query: 181 QTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKET 240
QTISDNFSVRQIT+TIRTI+NGLCYLATFVFGIN VGL LYS QL NS +D SSS
Sbjct: 183 QTISDNFSVRQITVTIRTIINGLCYLATFVFGINGVGLILYSLQLTFNSLMDDDSSSSSV 242
Query: 241 ENIGEQ 246
E I EQ
Sbjct: 243 EKISEQ 248
>gi|115445727|ref|NP_001046643.1| Os02g0307800 [Oryza sativa Japonica Group]
gi|46390708|dbj|BAD16208.1| unknown protein [Oryza sativa Japonica Group]
gi|113536174|dbj|BAF08557.1| Os02g0307800 [Oryza sativa Japonica Group]
gi|125581805|gb|EAZ22736.1| hypothetical protein OsJ_06408 [Oryza sativa Japonica Group]
gi|215692736|dbj|BAG88156.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215694440|dbj|BAG89457.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 273
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 132/224 (58%), Positives = 157/224 (70%), Gaps = 9/224 (4%)
Query: 29 PTISLSAFPRRRPLI---EPYCLAQAQEPTTNITAPTTSEEGPVEL-PQSIFATTDEPSS 84
P S SA RR + EP A EP +GPVEL ++F+ + P+
Sbjct: 26 PAASASALRLRRSVAVRAEPELSTSAAEPPPGDDG---EGDGPVELRTPTLFSIDENPTP 82
Query: 85 LQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK--SLKDEALDNLKALGSSSI 142
LQ ATSVLLTGAISVFLFR++RRR +RAKEL+ RS G +K +L EAL+ L+ + +S I
Sbjct: 83 LQTATSVLLTGAISVFLFRSIRRRVRRAKELRVRSGGVEKPNNLSKEALEGLRLVSASPI 142
Query: 143 DAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNG 202
+ PPSPVQALLGG+ AGVIA+ILYKFTTTIEAALNRQTISD+FSVRQITITIRTI+NG
Sbjct: 143 EVDKPPSPVQALLGGIAAGVIALILYKFTTTIEAALNRQTISDSFSVRQITITIRTIING 202
Query: 203 LCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETENIGEQ 246
+CYLATFVFGINS+GL LY QL S D +SS E I EQ
Sbjct: 203 ICYLATFVFGINSIGLVLYGLQLTFASIMGDDNSSSAAEKISEQ 246
>gi|388496044|gb|AFK36088.1| unknown [Medicago truncatula]
Length = 230
Score = 197 bits (500), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 97/152 (63%), Positives = 126/152 (82%), Gaps = 6/152 (3%)
Query: 66 EGPVELPQ----SIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSG 121
+GP+E+P S+ ++TD+PS +Q+A S+LLTGAISV LFR+ RRRAKR K+ +FRSSG
Sbjct: 57 DGPIEIPYDSTPSLLSSTDDPSFIQIAASLLLTGAISVLLFRSFRRRAKRLKQTQFRSSG 116
Query: 122 AKKSLKDEALDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
+KS+K+EAL+ LKA+G++SI+ KGPPSPVQ LG ++AGVI++ILYKF T IEA L+R
Sbjct: 117 -EKSVKEEALETLKAMGTASIETTKGPPSPVQTFLGAISAGVISLILYKFATIIEAGLSR 175
Query: 181 QTISDNFSVRQITITIRTIVNGLCYLATFVFG 212
QTISD+FS RQITIT+RTI+NGL YLATF F
Sbjct: 176 QTISDDFSARQITITVRTIINGLTYLATFYFW 207
>gi|218190566|gb|EEC72993.1| hypothetical protein OsI_06901 [Oryza sativa Indica Group]
Length = 325
Score = 187 bits (474), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 122/214 (57%), Positives = 145/214 (67%), Gaps = 33/214 (15%)
Query: 66 EGPVEL-PQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKF------- 117
+GPVEL ++F+ + P+ LQ ATSVLLTGAISVFLFR++RRRA+RAKEL
Sbjct: 85 DGPVELRTPTLFSIDENPTPLQTATSVLLTGAISVFLFRSIRRRARRAKELVIPSLLIHL 144
Query: 118 -----------------------RSSGAKK--SLKDEALDNLKALGSSSIDAKGPPSPVQ 152
RS G +K +L EAL+ L+ + +S I+ PPSPVQ
Sbjct: 145 RCAFSTGEHTWSTNPISFRDTEGRSGGVEKPNNLSKEALEGLRLVSASPIEVDKPPSPVQ 204
Query: 153 ALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFG 212
ALLGG+ AGVIA+ILYKFTTTIEAALNRQTISD+FSVRQITITIRTI+NG+CYLATFVFG
Sbjct: 205 ALLGGIAAGVIALILYKFTTTIEAALNRQTISDSFSVRQITITIRTIINGICYLATFVFG 264
Query: 213 INSVGLFLYSGQLALNSFTEDSSSSKETENIGEQ 246
INS+GL LY QL S D +SS E I EQ
Sbjct: 265 INSIGLVLYGLQLTFASIMGDDNSSSAAEKISEQ 298
>gi|242065424|ref|XP_002454001.1| hypothetical protein SORBIDRAFT_04g022880 [Sorghum bicolor]
gi|241933832|gb|EES06977.1| hypothetical protein SORBIDRAFT_04g022880 [Sorghum bicolor]
Length = 293
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 122/190 (64%), Positives = 147/190 (77%), Gaps = 5/190 (2%)
Query: 45 PYCLAQAQEPTTNITAPTTSEEGPVEL-PQSIFATTDEPSSLQVATSVLLTGAISVFLFR 103
P A+ + P A +GPVEL ++F+T D P+ LQ ATS+LLTGAISVFLFR
Sbjct: 55 PVSAAEPEPPDAGAVA-EAEGQGPVELRAPTLFSTDDNPTPLQTATSLLLTGAISVFLFR 113
Query: 104 ALRRRAKRAKELKFRSSGAKK---SLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTA 160
+LRRRA+RAKEL+ RSSG KK +L +EAL+ L+ + +S I+ + PPSPVQ LLGG+ A
Sbjct: 114 SLRRRARRAKELRVRSSGVKKKPNNLTEEALEGLRLVSASPIETEKPPSPVQTLLGGIAA 173
Query: 161 GVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFL 220
GVIA+ILYKF+TTIEAALNRQTISD+FSVRQITITIRTI+ GLCYLAT VFGIN+VGL L
Sbjct: 174 GVIAVILYKFSTTIEAALNRQTISDSFSVRQITITIRTIITGLCYLATSVFGINAVGLVL 233
Query: 221 YSGQLALNSF 230
YS QL L S
Sbjct: 234 YSLQLTLQSI 243
>gi|226494037|ref|NP_001143499.1| uncharacterized protein LOC100276178 [Zea mays]
gi|195621548|gb|ACG32604.1| hypothetical protein [Zea mays]
gi|224036011|gb|ACN37081.1| unknown [Zea mays]
gi|413936044|gb|AFW70595.1| hypothetical protein ZEAMMB73_826587 [Zea mays]
Length = 285
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 126/208 (60%), Positives = 157/208 (75%), Gaps = 7/208 (3%)
Query: 26 LQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVEL-PQSIFATTDEPSS 84
+++PT + R PL P A+++ P + A +GPVEL ++F+T D P+
Sbjct: 32 VRRPTAIRAEAEVRPPL--PVSAAESEPP--DAGAVEAEGQGPVELRAPTLFSTDDNPTP 87
Query: 85 LQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK--SLKDEALDNLKALGSSSI 142
LQ ATS+LLTGAISVFLFRALRRRA+RAKEL+ RSSG KK +L +EAL+ L+ + +S I
Sbjct: 88 LQTATSLLLTGAISVFLFRALRRRARRAKELRVRSSGLKKPNNLTEEALERLRLMSASPI 147
Query: 143 DAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNG 202
+ + SP+QALLGG+ AGVIA+ILYKF+TT+EAALNRQTISD+FSVRQITITIRTIV G
Sbjct: 148 ETEKATSPIQALLGGIAAGVIALILYKFSTTVEAALNRQTISDSFSVRQITITIRTIVTG 207
Query: 203 LCYLATFVFGINSVGLFLYSGQLALNSF 230
LCYLAT VFGIN+VGL LYS QL L S
Sbjct: 208 LCYLATSVFGINAVGLVLYSLQLTLQSI 235
>gi|11994500|dbj|BAB02565.1| unnamed protein product [Arabidopsis thaliana]
Length = 135
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 84/145 (57%), Positives = 103/145 (71%), Gaps = 15/145 (10%)
Query: 131 LDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV 189
+DNLKAL S+ I+ PS QA LG + AGVIA+ILYKFT T+E+ LNRQTISDNFSV
Sbjct: 1 MDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYKFTVTVESGLNRQTISDNFSV 60
Query: 190 RQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSS--SSKETENIGEQQ 247
RQIT+T+RTI+NG+CYLATFVFG+N+ GL LYSGQLA F EDS+ + K T G+
Sbjct: 61 RQITVTVRTIINGICYLATFVFGLNAFGLLLYSGQLA---FNEDSAEENMKATTQPGDSS 117
Query: 248 SGSLNSTAENATDTELNSGKEDQSS 272
SG ++E+N EDQSS
Sbjct: 118 SGD---------NSEVNKSNEDQSS 133
>gi|168035549|ref|XP_001770272.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678489|gb|EDQ64947.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 277
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 105/153 (68%), Gaps = 6/153 (3%)
Query: 66 EGPVELPQSIF--ATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAK 123
+GP+ELP + E S LQVA SV+LTG I+V L R+LRRR+K+AKE +FRS+G
Sbjct: 88 DGPIELPPELLDPLAIPEASPLQVAASVVLTGLITVLLIRSLRRRSKKAKETRFRSTG-- 145
Query: 124 KSLKDEALDNLKALGSSSIDAKGPP-SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQT 182
+K++A + AL + + + + PP S +Q G + AG IA++LYKFT T+E + +
Sbjct: 146 -EIKEDARKSAMALLNKAPEVETPPPSALQTFSGAVVAGFIALVLYKFTVTVEGSFTGKA 204
Query: 183 ISDNFSVRQITITIRTIVNGLCYLATFVFGINS 215
+S N+S+R +TIT+RTIV GLCYLATFVF NS
Sbjct: 205 VSMNYSIRNLTITVRTIVTGLCYLATFVFAANS 237
>gi|302785097|ref|XP_002974320.1| hypothetical protein SELMODRAFT_442467 [Selaginella moellendorffii]
gi|300157918|gb|EFJ24542.1| hypothetical protein SELMODRAFT_442467 [Selaginella moellendorffii]
Length = 1433
Score = 124 bits (311), Expect = 4e-26, Method: Composition-based stats.
Identities = 78/179 (43%), Positives = 112/179 (62%), Gaps = 13/179 (7%)
Query: 51 AQEPTTNITAPTTSEEGPVELPQSIFATTDEPSS-LQVATSVLLTGAISVFLFRALRRRA 109
AQ I ++S +GP+ELP + ++P+S LQ+A SV+LTGAISV L R+LRRR
Sbjct: 1244 AQVAEQAIADASSSLDGPIELPGDL----EQPTSPLQIAGSVVLTGAISVLLLRSLRRRY 1299
Query: 110 KRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYK 169
K AKE KK LK++A N+ A ++ + K PPS Q L G L AG IA +LY+
Sbjct: 1300 KEAKE--------KKPLKEQAKQNVLASLANQVVEKPPPSFEQTLYGALLAGAIAFVLYQ 1351
Query: 170 FTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALN 228
TT + + + + ++ R I +T+RTI++G+CYLATFVF +++GL LY QL +N
Sbjct: 1352 CTTAVTGSFASKPLPTEYATRNIAVTVRTIISGICYLATFVFAADAIGLALYGLQLLVN 1410
>gi|145346872|ref|XP_001417906.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578134|gb|ABO96199.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 181
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/152 (35%), Positives = 88/152 (57%), Gaps = 8/152 (5%)
Query: 79 TDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALG 138
T+E + LQ VL TG + RA+++R+ +A+E FR + + +D +A
Sbjct: 9 TEEATPLQNVFGVLFTGFSFWYFARAVQKRSGKARE--FRVANRLPEEERAKIDEERAKK 66
Query: 139 SSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRT 198
+ + A +Q+ GGLT I+++LY F + + A+ + + + ++ +VRQI+IT+RT
Sbjct: 67 TKELTA------MQSFTGGLTGIGISVVLYAFASKVSASFDGKALPESETVRQISITVRT 120
Query: 199 IVNGLCYLATFVFGINSVGLFLYSGQLALNSF 230
IV GL YLATFV+ N VGL +GQ L+ F
Sbjct: 121 IVEGLTYLATFVYAANGVGLIALAGQKTLDYF 152
>gi|159491108|ref|XP_001703515.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280439|gb|EDP06197.1| predicted protein [Chlamydomonas reinhardtii]
Length = 206
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 52/129 (40%), Positives = 77/129 (59%), Gaps = 10/129 (7%)
Query: 91 VLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK-SLKDEALDNLKALGSSSIDAKGPPS 149
+LL G +FLFR R+RA +A+ K A S+ DE + L A +G +
Sbjct: 86 ILLVG---IFLFRLFRKRAAKARSEKIAGQLATGPSIFDELREKLGA------PKRGKAT 136
Query: 150 PVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATF 209
P+ A LG + A +A L+ F+T ++A++ Q++ D ++ R I IT+RTI+ GLCYLATF
Sbjct: 137 PMNAFLGAVQALGLAAGLWFFSTKVQASIEAQSLPDGYTARNIAITVRTIILGLCYLATF 196
Query: 210 VFGINSVGL 218
+FG N VGL
Sbjct: 197 IFGANGVGL 205
>gi|303283914|ref|XP_003061248.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457599|gb|EEH54898.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 332
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 81/159 (50%), Gaps = 13/159 (8%)
Query: 80 DEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSS-------GAKKSLKDEALD 132
DE + LQ A T + + R +++R RAKE + ++ G + E D
Sbjct: 145 DEATPLQNAFGFAFTIFCAWYFVRVVKKRGNRAKEFRVANALPVRVRAGRPSKEERERRD 204
Query: 133 NLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQI 192
+ S + AK A +GG+T I+ LY+F I + + + +++ RQI
Sbjct: 205 AEQLAKSKKLSAK------DAFVGGVTGLAISFALYQFAVVIGESFEGRLMPESYQARQI 258
Query: 193 TITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFT 231
TIT+RTIVNGL YLATFV+ N GL L + + AL+S T
Sbjct: 259 TITVRTIVNGLAYLATFVYAANGTGLVLLAARKALDSLT 297
>gi|302848926|ref|XP_002955994.1| hypothetical protein VOLCADRAFT_121472 [Volvox carteri f.
nagariensis]
gi|300258720|gb|EFJ42954.1| hypothetical protein VOLCADRAFT_121472 [Volvox carteri f.
nagariensis]
Length = 325
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/138 (38%), Positives = 75/138 (54%), Gaps = 10/138 (7%)
Query: 91 VLLTGAISVFLFRALRRRAKRAKELKFRSSG-AKKSLKDEALDNLKALGSSSIDAKGPPS 149
+LL G +FLFR R+RA +AK + + + DE + A S +
Sbjct: 108 ILLVG---IFLFRLFRKRAAKAKTERIAGQAPSGPTFFDELRQKVGAPKQSKA------T 158
Query: 150 PVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATF 209
PV A +G A V+A L+ FTT ++ ++ Q + D ++ R I IT+RTI+ GLCYLATF
Sbjct: 159 PVNAFIGCGQALVLAYGLWFFTTKVQGSIEGQALPDGYTARNIAITVRTIILGLCYLATF 218
Query: 210 VFGINSVGLFLYSGQLAL 227
+F N VGL S QL L
Sbjct: 219 IFAANGVGLAGLSVQLLL 236
>gi|384247605|gb|EIE21091.1| hypothetical protein COCSUDRAFT_33788 [Coccomyxa subellipsoidea
C-169]
Length = 232
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 73/134 (54%), Gaps = 16/134 (11%)
Query: 97 ISVFLFRALRRRAKRAKELKFRSS--GAKKSLKDEALDNLKALGSSSIDAKGPPSPVQAL 154
++VF R L +RAK A E + S+ G + D ++ + +P+Q+L
Sbjct: 84 LAVFAVRLLTKRAKFATEERLASAAKGENERPPDPSIKRV--------------TPLQSL 129
Query: 155 LGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVFGIN 214
+ AGV A +LY F++ ++ RQ + D ++ R IT+ I TIV GL YL TF+FG N
Sbjct: 130 WAAVQAGVFAYLLYLFSSNVDGYFERQPLPDQYTARNITVAISTIVRGLSYLITFIFGAN 189
Query: 215 SVGLFLYSGQLALN 228
+VGL + QL +N
Sbjct: 190 AVGLSALTVQLFIN 203
>gi|255071307|ref|XP_002507735.1| predicted protein [Micromonas sp. RCC299]
gi|226523010|gb|ACO68993.1| predicted protein [Micromonas sp. RCC299]
Length = 292
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 91/162 (56%), Gaps = 8/162 (4%)
Query: 81 EPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALGSS 140
E + LQ A + T + + R +R+R AK+ + +S +K+ + L+ LK +
Sbjct: 121 EATPLQNAFGLTFTIFSAWYFLRVVRKRGNAAKKFRVANSLSKEERERRDLEELKRV--K 178
Query: 141 SIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIV 200
++A+ Q+ +GG+TA I+I+LY+F I+A+ + + + ++ +RQI+ T+RTIV
Sbjct: 179 KLNAQ------QSFVGGITAIGISIVLYEFAQNIQASFDSRPVPVSYQIRQISNTVRTIV 232
Query: 201 NGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETEN 242
GL YLATF++ N GL S Q L++ ++ + EN
Sbjct: 233 EGLAYLATFIYAANGTGLVALSMQKLLDTAVPRDTTKTDGEN 274
>gi|302818397|ref|XP_002990872.1| hypothetical protein SELMODRAFT_49523 [Selaginella moellendorffii]
gi|300141433|gb|EFJ08145.1| hypothetical protein SELMODRAFT_49523 [Selaginella moellendorffii]
Length = 81
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 53/81 (65%)
Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLA 207
PS Q L G L AG IA +LY+ TT + + + + ++ R I +T+RTI++G+CYLA
Sbjct: 1 PSFEQTLYGALLAGAIAFVLYQCTTAVTGSFASKPLPTEYATRNIAVTVRTIISGICYLA 60
Query: 208 TFVFGINSVGLFLYSGQLALN 228
TFVF +++GL LY QL +N
Sbjct: 61 TFVFAADAIGLALYGLQLLVN 81
>gi|412988915|emb|CCO15506.1| predicted protein [Bathycoccus prasinos]
Length = 203
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 72/138 (52%), Gaps = 8/138 (5%)
Query: 79 TDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALG 138
T+E + LQ L T + +R +++R KRA+ FR + + + + + L
Sbjct: 73 TEEATPLQNLFGFLFTAFCGWYFWRVVKKRGKRAQ--TFRVANQLSPEELKEREEERKLK 130
Query: 139 SSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRT 198
++A QA GG+T I+ LY F + I + ++ ++++ RQI+IT+RT
Sbjct: 131 DKKLNAS------QAFTGGITGIAISAGLYIFASKISTGFDNSSLPESYTARQISITVRT 184
Query: 199 IVNGLCYLATFVFGINSV 216
IV GL YLATFV+ N V
Sbjct: 185 IVEGLVYLATFVYAANGV 202
>gi|307106044|gb|EFN54291.1| expressed protein [Chlorella variabilis]
Length = 223
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 96/216 (44%), Gaps = 37/216 (17%)
Query: 79 TDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRS-------SGAKKSLKDEAL 131
T +L+ V G + F R +RRA++A + S G S +E
Sbjct: 8 TGGSDTLKTVAGVAYIGVVIFFFLRLFQRRAQKATSERIASVADVAAYGGGADSDDEEDE 67
Query: 132 DNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISD---NFS 188
+ KA A +P+Q+ +G AGVI +L++ +T ++A +RQ + D ++
Sbjct: 68 EEQKA---KQQQAATEVTPLQSFIGTAQAGVICYLLFQASTAVDAYFDRQNLPDITTQYT 124
Query: 189 VRQITITIRTIVNGLCYLATFVFGINSVG---------LFLYSG-----------QLALN 228
+ + ++T+ GL YL TF+FG N++G LF SG QL
Sbjct: 125 AHNVAVLVQTVCRGLVYLITFIFGANALGLAGLTIQLLLFPESGREEGGTGKRAPQLPKV 184
Query: 229 SFTED----SSSSKETENIGEQQSGSLNSTAENATD 260
S T+D + +E E +G+QQ+ A+ + D
Sbjct: 185 SVTDDVFALRRAFQEAERMGQQQAQKEIRKAQQSGD 220
>gi|443311517|ref|ZP_21041144.1| Protein of unknown function (DUF3082) [Synechocystis sp. PCC 7509]
gi|442778396|gb|ELR88662.1| Protein of unknown function (DUF3082) [Synechocystis sp. PCC 7509]
Length = 106
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 56/95 (58%), Gaps = 1/95 (1%)
Query: 143 DAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVN 201
D K P+P+++ LG + A +AI LY+ T+ I + ++ + SDN V +ITI +RT+V
Sbjct: 12 DLKPLPTPLRSSLGAVIASGLAIALYRVTSAISLSFAQKPLHSDNALVVKITIAVRTLVV 71
Query: 202 GLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSS 236
GL LATF+F I + LF + Q + + SS
Sbjct: 72 GLSTLATFIFAIVAFALFALAIQATIQKLKQSPSS 106
>gi|291567877|dbj|BAI90149.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 133
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 51/92 (55%), Gaps = 1/92 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLA 207
+P Q L+GGL AG AI +Y T I Q I SDN + Q++I IRT+V G L
Sbjct: 42 NPFQVLMGGLLAGGFAIAMYSLTAAIAQTFANQNIYSDNQTALQLSIAIRTLVIGGAALV 101
Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSSSSKE 239
T VF I S+GL + Q+ T +S+S++
Sbjct: 102 TGVFTIASLGLLALTVQVLFVRLTHKNSASQK 133
>gi|409994073|ref|ZP_11277194.1| hypothetical protein APPUASWS_23188 [Arthrospira platensis str.
Paraca]
gi|409935065|gb|EKN76608.1| hypothetical protein APPUASWS_23188 [Arthrospira platensis str.
Paraca]
Length = 122
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 50/91 (54%), Gaps = 1/91 (1%)
Query: 150 PVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLAT 208
P Q L+GGL AG AI +Y T I Q I SDN + Q++I IRT+V G L T
Sbjct: 32 PFQVLMGGLLAGGFAIAMYSLTAAIAQTFANQNIYSDNQTALQLSIAIRTLVIGGAALVT 91
Query: 209 FVFGINSVGLFLYSGQLALNSFTEDSSSSKE 239
VF I S+GL + Q+ T +S+S++
Sbjct: 92 GVFTIASLGLLALTVQVLFVRLTHKNSASQK 122
>gi|209526258|ref|ZP_03274788.1| hypothetical protein AmaxDRAFT_3612 [Arthrospira maxima CS-328]
gi|376007925|ref|ZP_09785107.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423062823|ref|ZP_17051613.1| hypothetical protein SPLC1_S051400 [Arthrospira platensis C1]
gi|209493355|gb|EDZ93680.1| hypothetical protein AmaxDRAFT_3612 [Arthrospira maxima CS-328]
gi|375323718|emb|CCE20860.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406715779|gb|EKD10932.1| hypothetical protein SPLC1_S051400 [Arthrospira platensis C1]
Length = 122
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/91 (39%), Positives = 51/91 (56%), Gaps = 1/91 (1%)
Query: 150 PVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLAT 208
P Q L+GGL +G AI +Y T I Q I SDN + QI+I IRT+V G L T
Sbjct: 32 PFQVLMGGLFSGGFAIAMYSLTAAIAHTFANQNIYSDNRTALQISIAIRTLVIGGAALVT 91
Query: 209 FVFGINSVGLFLYSGQLALNSFTEDSSSSKE 239
+F I S+GL + Q+ T+ +S+S++
Sbjct: 92 GIFTIASLGLLALTVQVLFVRLTQKNSASEK 122
>gi|428226471|ref|YP_007110568.1| hypothetical protein GEI7407_3046 [Geitlerinema sp. PCC 7407]
gi|427986372|gb|AFY67516.1| hypothetical protein GEI7407_3046 [Geitlerinema sp. PCC 7407]
Length = 108
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 45/77 (58%), Gaps = 1/77 (1%)
Query: 143 DAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVN 201
+ PP +Q L L AG IA+++YK T I A + SDN++ ++I +RT+V
Sbjct: 10 ETNAPPGVLQCLTSALIAGTIALLMYKVTGGIAANFASHPLHSDNYTAQRIASAVRTLVI 69
Query: 202 GLCYLATFVFGINSVGL 218
G+C L + +FG ++GL
Sbjct: 70 GMCALGSGIFGFAALGL 86
>gi|428208447|ref|YP_007092800.1| hypothetical protein Chro_3473 [Chroococcidiopsis thermalis PCC
7203]
gi|428010368|gb|AFY88931.1| hypothetical protein Chro_3473 [Chroococcidiopsis thermalis PCC
7203]
Length = 101
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 48/86 (55%), Gaps = 1/86 (1%)
Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYL 206
P+P++ L G + + IA Y T I R+ I SDNF+V I +RT+V G+ L
Sbjct: 12 PTPLRCLSGAVISAGIAFAAYSLTAAIAQTFARKPIHSDNFTVVNIAAAVRTLVVGIMAL 71
Query: 207 ATFVFGINSVGLFLYSGQLALNSFTE 232
T +FGI ++GL + QL++ T+
Sbjct: 72 GTGIFGIVAIGLVALAIQLSIQKLTK 97
>gi|443324810|ref|ZP_21053537.1| Protein of unknown function (DUF3082) [Xenococcus sp. PCC 7305]
gi|442795591|gb|ELS04951.1| Protein of unknown function (DUF3082) [Xenococcus sp. PCC 7305]
Length = 120
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
Query: 145 KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQ-TISDNFSVRQITITIRTIVNGL 203
K +P++ LL + AGV+A +Y + I + + +SDN V ++TI +RT+V G+
Sbjct: 28 KQEVTPLKCLLSSVFAGVMAFGVYHLMSAIVTTYSTKPIVSDNIFVLKLTIAVRTLVIGV 87
Query: 204 CYLATFVFGINSVGLFLYSGQLALNSFTEDSSS 236
L T VFG ++GLFL Q+ F++ S+
Sbjct: 88 AALGTGVFGFVALGLFLLGLQITFKDFSKKFSN 120
>gi|428203436|ref|YP_007082025.1| hypothetical protein Ple7327_3245 [Pleurocapsa sp. PCC 7327]
gi|427980868|gb|AFY78468.1| Protein of unknown function (DUF3082) [Pleurocapsa sp. PCC 7327]
Length = 114
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 60/103 (58%), Gaps = 4/103 (3%)
Query: 140 SSIDA-KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDN--FSVRQITITI 196
SS++A + +P++ LLG + +G +AI +Y T +I + I+ ++R I+ +
Sbjct: 12 SSVEANEQKVTPLRCLLGSVLSGSLAIAMYFLTYSIAYTFATKPITSTSQLAIR-ISAAV 70
Query: 197 RTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKE 239
RT+V G+ L TF+FG ++GL L + QL L F E SSS +
Sbjct: 71 RTLVVGVASLGTFIFGFVTLGLILLAIQLTLQIFKEKLSSSSD 113
>gi|427728680|ref|YP_007074917.1| hypothetical protein Nos7524_1442 [Nostoc sp. PCC 7524]
gi|427364599|gb|AFY47320.1| Protein of unknown function (DUF3082) [Nostoc sp. PCC 7524]
Length = 108
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 49/91 (53%), Gaps = 1/91 (1%)
Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCY 205
PP+P++ + G L +G + LY I ++ I SDN V +T +RT+V G+
Sbjct: 17 PPTPLRCVTGALISGGLGYALYSLMIAIATTFAKKPIHSDNQLVVNLTSAVRTLVVGIVA 76
Query: 206 LATFVFGINSVGLFLYSGQLALNSFTEDSSS 236
L T +FGI ++GLF + QL + T+ S
Sbjct: 77 LGTGIFGIVAIGLFALAIQLLFQNLTKSKSG 107
>gi|427737778|ref|YP_007057322.1| hypothetical protein Riv7116_4348 [Rivularia sp. PCC 7116]
gi|427372819|gb|AFY56775.1| Protein of unknown function (DUF3082) [Rivularia sp. PCC 7116]
Length = 109
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 44/85 (51%), Gaps = 1/85 (1%)
Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYL 206
P+P + + G L G + +YK +I N+ SDN V I+ +RT+V G+ L
Sbjct: 18 PTPTRCITGALMCGGLGFAMYKLMVSIATTFANKPVTSDNPMVVSISSAVRTLVVGVVAL 77
Query: 207 ATFVFGINSVGLFLYSGQLALNSFT 231
T +FGI VGLF + QL + T
Sbjct: 78 GTGIFGIVGVGLFALAIQLVIKQLT 102
>gi|428304799|ref|YP_007141624.1| hypothetical protein Cri9333_1213 [Crinalium epipsammum PCC 9333]
gi|428246334|gb|AFZ12114.1| hypothetical protein Cri9333_1213 [Crinalium epipsammum PCC 9333]
Length = 115
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 52/89 (58%), Gaps = 1/89 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV-RQITITIRTIVNGLCYLA 207
SP++ L G + +G +AI LY T++I + + ++ + +V I + +RT+V G LA
Sbjct: 23 SPLRCLTGAMISGGMAIALYSLTSSIAQSFAAKPVTSSSTVAHNIAVAVRTLVTGTAALA 82
Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSSS 236
T VF + +VGL + Q + FT+ SS+
Sbjct: 83 TGVFSLVTVGLVALAIQTVIQRFTKPSST 111
>gi|428315435|ref|YP_007113317.1| hypothetical protein Osc7112_0273 [Oscillatoria nigro-viridis PCC
7112]
gi|428239115|gb|AFZ04901.1| hypothetical protein Osc7112_0273 [Oscillatoria nigro-viridis PCC
7112]
Length = 115
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 55/97 (56%), Gaps = 3/97 (3%)
Query: 139 SSSIDAKGPPSP--VQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITIT 195
+S+ DA P SP ++ L G L A A+ LY T +I + + + SDN + +I++
Sbjct: 15 TSAEDALPPKSPTVLRCLTGSLMAASFAVPLYALTMSIAQSFASKPVHSDNITAIKISVA 74
Query: 196 IRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTE 232
+RT+V G+ L T +F + +VGL L + Q+ + + +
Sbjct: 75 VRTLVVGIAALGTGIFALAAVGLMLLAVQILIQNLVK 111
>gi|254422677|ref|ZP_05036395.1| hypothetical protein S7335_2829 [Synechococcus sp. PCC 7335]
gi|196190166|gb|EDX85130.1| hypothetical protein S7335_2829 [Synechococcus sp. PCC 7335]
Length = 118
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 52/96 (54%), Gaps = 4/96 (4%)
Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCY 205
P +P++ G L A ++++ Y+ T +I A ++ +SDN I+ +RT+V G+
Sbjct: 26 PATPLRCFTGALMAATLSLLAYRLTISIAATFASKPIVSDNPVAVNISAAVRTLVTGIAA 85
Query: 206 LATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETE 241
L VFG+ ++GL QL F D+ SS+ T+
Sbjct: 86 LGAGVFGVAALGLTALGIQLI---FKPDAQSSEATD 118
>gi|16331459|ref|NP_442187.1| hypothetical protein slr0438 [Synechocystis sp. PCC 6803]
gi|383323200|ref|YP_005384054.1| hypothetical protein SYNGTI_2292 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383326369|ref|YP_005387223.1| hypothetical protein SYNPCCP_2291 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383492253|ref|YP_005409930.1| hypothetical protein SYNPCCN_2291 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384437521|ref|YP_005652246.1| hypothetical protein SYNGTS_2293 [Synechocystis sp. PCC 6803]
gi|451815611|ref|YP_007452063.1| hypothetical protein MYO_123170 [Synechocystis sp. PCC 6803]
gi|1001118|dbj|BAA10257.1| slr0438 [Synechocystis sp. PCC 6803]
gi|339274554|dbj|BAK51041.1| hypothetical protein SYNGTS_2293 [Synechocystis sp. PCC 6803]
gi|359272520|dbj|BAL30039.1| hypothetical protein SYNGTI_2292 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359275690|dbj|BAL33208.1| hypothetical protein SYNPCCN_2291 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278860|dbj|BAL36377.1| hypothetical protein SYNPCCP_2291 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407961156|dbj|BAM54396.1| hypothetical protein BEST7613_5465 [Bacillus subtilis BEST7613]
gi|451781580|gb|AGF52549.1| hypothetical protein MYO_123170 [Synechocystis sp. PCC 6803]
Length = 107
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 47/83 (56%), Gaps = 1/83 (1%)
Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCY 205
PP+P++ L+G + +A LY T+ I + ++ +SD V++I +RT+V GL
Sbjct: 15 PPTPLRCLVGSGISATLAWGLYLLTSAIAVSFASKPVLSDKPIVQRIASAVRTLVLGLAS 74
Query: 206 LATFVFGINSVGLFLYSGQLALN 228
+ TF+F ++GL L QL L
Sbjct: 75 MGTFIFAFVAIGLILLMVQLILQ 97
>gi|22299383|ref|NP_682630.1| hypothetical protein tsr1840 [Thermosynechococcus elongatus BP-1]
gi|22295566|dbj|BAC09392.1| tsr1840 [Thermosynechococcus elongatus BP-1]
Length = 97
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 46/85 (54%), Gaps = 1/85 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFS-VRQITITIRTIVNGLCYLA 207
SP++ L G L AG + ++LY+ T I +S + V + + +RT+V GLC LA
Sbjct: 9 SPLRCLTGALIAGTLGLLLYRLTGAIAYLFATHAVSAHHQLVYSLAVAVRTLVVGLCTLA 68
Query: 208 TFVFGINSVGLFLYSGQLALNSFTE 232
T VF I ++GL + QL + +
Sbjct: 69 TGVFSIIALGLVALTLQLLWERWVQ 93
>gi|119485099|ref|ZP_01619484.1| hypothetical protein L8106_06614 [Lyngbya sp. PCC 8106]
gi|119457327|gb|EAW38452.1| hypothetical protein L8106_06614 [Lyngbya sp. PCC 8106]
Length = 113
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 49/93 (52%), Gaps = 1/93 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLA 207
+P + L G IA LY T ++ + I SDN +V Q+++T+RT+V GLC +
Sbjct: 21 TPFRCLSNAAVLGSIATALYFLTRSVAQTYAAKPIHSDNLAVVQMSVTVRTLVTGLCMMG 80
Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSSSSKET 240
T VF + GL + QL++ + + S ET
Sbjct: 81 TGVFAFLTCGLIFVAIQLSIQRIFKQDNPSSET 113
>gi|428297938|ref|YP_007136244.1| hypothetical protein Cal6303_1211 [Calothrix sp. PCC 6303]
gi|428234482|gb|AFZ00272.1| hypothetical protein Cal6303_1211 [Calothrix sp. PCC 6303]
Length = 101
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 47/89 (52%), Gaps = 1/89 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLA 207
SP++ G L +G + LY +I Q I SDN V +IT+ +RT++ G+ +
Sbjct: 11 SPLRCFTGSLISGGMGYALYNVMVSIATKFANQPIHSDNEIVVRITVAVRTLIVGVFGMG 70
Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSSS 236
+FG+ ++GLF QL + FT SS
Sbjct: 71 MGIFGLVALGLFALGIQLGIEKFTNKESS 99
>gi|427419539|ref|ZP_18909722.1| Protein of unknown function (DUF3082) [Leptolyngbya sp. PCC 7375]
gi|425762252|gb|EKV03105.1| Protein of unknown function (DUF3082) [Leptolyngbya sp. PCC 7375]
Length = 112
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 44/84 (52%), Gaps = 1/84 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQ-TISDNFSVRQITITIRTIVNGLCYLA 207
SP+ G AG ++++ Y+ I A + +SD V I++ +RT+V G+ L
Sbjct: 23 SPLSCFSGSFLAGTLSVLCYRMMVAIVTAFAAKPVLSDKTIVLNISVAVRTLVAGIVALG 82
Query: 208 TFVFGINSVGLFLYSGQLALNSFT 231
VFG+ +VGLFL QL + T
Sbjct: 83 AGVFGLTAVGLFLLGIQLIVRKMT 106
>gi|428219444|ref|YP_007103909.1| hypothetical protein Pse7367_3236 [Pseudanabaena sp. PCC 7367]
gi|427991226|gb|AFY71481.1| hypothetical protein Pse7367_3236 [Pseudanabaena sp. PCC 7367]
Length = 133
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 48/85 (56%), Gaps = 5/85 (5%)
Query: 145 KGPPSP----VQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTI 199
K PPS +Q L G L AG IA+ LY FT ++ L + I S N +++ T+RT
Sbjct: 38 KAPPSTMSMVIQNLSGVLMAGAIAVGLYFFTNSVAVKLAQNPITSSNTLAMRVSTTVRTF 97
Query: 200 VNGLCYLATFVFGINSVGLFLYSGQ 224
+ L AT +FG+ ++G+FL + Q
Sbjct: 98 LLALGTGATMIFGVVALGIFLLTIQ 122
>gi|170077404|ref|YP_001734042.1| hypothetical protein SYNPCC7002_A0781 [Synechococcus sp. PCC 7002]
gi|169885073|gb|ACA98786.1| conserved hypothetical membrane protein [Synechococcus sp. PCC
7002]
Length = 105
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 8/98 (8%)
Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLA 207
PSP++ G AG ++ Y T + A + + N QI IT+RT++ GL +A
Sbjct: 15 PSPLRCWYGTAVAGSMSFAAYLLTNKVIAGFASKPPTGNGLAIQIGITVRTLIMGLFTMA 74
Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSSSSKETENIGE 245
T VFG+ +VGLF+ L L S S K+T + G+
Sbjct: 75 TGVFGLIAVGLFI----LGLQSLL----SGKQTTSPGD 104
>gi|434398956|ref|YP_007132960.1| hypothetical protein Sta7437_2460 [Stanieria cyanosphaera PCC 7437]
gi|428270053|gb|AFZ35994.1| hypothetical protein Sta7437_2460 [Stanieria cyanosphaera PCC 7437]
Length = 110
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 48/88 (54%), Gaps = 1/88 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTI-EAALNRQTISDNFSVRQITITIRTIVNGLCYLA 207
+P++ G + +G +A LY ++I + + ++DN V +IT+ +RT+V G+ L
Sbjct: 23 TPIRCFFGTVVSGGLAFGLYSLMSSIVQTYATKPVVADNLLVLRITVAVRTLVIGVAALG 82
Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSS 235
VF ++GL L QL + S E SS
Sbjct: 83 AGVFAFVALGLMLLGIQLTIQSLKEVSS 110
>gi|119511390|ref|ZP_01630502.1| hypothetical protein N9414_19894 [Nodularia spumigena CCY9414]
gi|119463935|gb|EAW44860.1| hypothetical protein N9414_19894 [Nodularia spumigena CCY9414]
Length = 106
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 48/91 (52%), Gaps = 1/91 (1%)
Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCY 205
PP+P++ + G + AG + +Y +I + I SDN V I +RT+V G+
Sbjct: 16 PPTPLRCITGAVFAGGLGYAMYSLMISIATTFAAKPIHSDNPMVVNIGSAVRTLVVGVVA 75
Query: 206 LATFVFGINSVGLFLYSGQLALNSFTEDSSS 236
L T +FGI ++GL + QL + T+ +S
Sbjct: 76 LGTGIFGIVAIGLLALAIQLVVQQLTKPKNS 106
>gi|56751369|ref|YP_172070.1| hypothetical protein syc1360_d [Synechococcus elongatus PCC 6301]
gi|81298956|ref|YP_399164.1| hypothetical protein Synpcc7942_0145 [Synechococcus elongatus PCC
7942]
gi|56686328|dbj|BAD79550.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81167837|gb|ABB56177.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 107
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 1/90 (1%)
Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCY 205
PP+P+ G +G IA+ Y T +I ++ + + + N + I + +RT++ G+
Sbjct: 18 PPTPLSCWSGSAVSGAIAVGAYFLTVSIASSFAAKPLPTGNAFAQNIAVAVRTLIVGMGA 77
Query: 206 LATFVFGINSVGLFLYSGQLALNSFTEDSS 235
LAT +FGI ++GL L Q+ L + S
Sbjct: 78 LATGIFGIVALGLLLLGIQVGLQGLRKAQS 107
>gi|428310658|ref|YP_007121635.1| hypothetical protein Mic7113_2427 [Microcoleus sp. PCC 7113]
gi|428252270|gb|AFZ18229.1| Protein of unknown function (DUF3082) [Microcoleus sp. PCC 7113]
Length = 115
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/84 (40%), Positives = 48/84 (57%), Gaps = 6/84 (7%)
Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYL 206
PSP++ L G L +G AI LY T++I N+ S N + QI+I +RT+V G+ L
Sbjct: 22 PSPLKCLSGALISGGFAIALYFLTSSIGNTFANKPLPSSNQTAIQISIAVRTLVVGISTL 81
Query: 207 ATFVFGINSVGL-----FLYSGQL 225
AT +F I VGL +L+ QL
Sbjct: 82 ATALFTIIGVGLMGVTIYLFVQQL 105
>gi|332712237|ref|ZP_08432165.1| hypothetical protein LYNGBM3L_71070 [Moorea producens 3L]
gi|332349043|gb|EGJ28655.1| hypothetical protein LYNGBM3L_71070 [Moorea producens 3L]
Length = 108
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 43/72 (59%), Gaps = 1/72 (1%)
Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYL 206
PSP + L G L +G + +Y T +I N+ S++ + +I I +RT+V G+ L
Sbjct: 18 PSPWRCLTGALISGGLTSAIYFITASIATTFANKPLTSNSQTALKIAIAVRTLVVGVSTL 77
Query: 207 ATFVFGINSVGL 218
ATF+FGI ++GL
Sbjct: 78 ATFIFGITTIGL 89
>gi|359462355|ref|ZP_09250918.1| hypothetical protein ACCM5_26742 [Acaryochloris sp. CCMEE 5410]
Length = 115
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 54/100 (54%), Gaps = 5/100 (5%)
Query: 139 SSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIR 197
S S + PPS + L G + AG IA L K ++ I + SDN + +I I ++
Sbjct: 4 SPSTNKPTPPSIWRTLSGAVVAGTIAFPLAKLSSKIAQSFAEHPFTSDNQTAVKIAIALK 63
Query: 198 TIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSS 237
T+V GL LAT +F + ++GL G L++ F ++ +SS
Sbjct: 64 TLVVGLSTLATGIFCMAALGL----GALSIQIFWQNLTSS 99
>gi|158338025|ref|YP_001519201.1| hypothetical protein AM1_4912 [Acaryochloris marina MBIC11017]
gi|158308266|gb|ABW29883.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 115
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 54/100 (54%), Gaps = 5/100 (5%)
Query: 139 SSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIR 197
S S + PPS + L G + AG IA L K ++ I + SDN + +I I ++
Sbjct: 4 SPSTNKPTPPSIWRTLSGAVVAGTIAFPLAKLSSKIAQSFAEHPFTSDNQTAVKIAIALK 63
Query: 198 TIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSS 237
T+V GL LAT +F + ++GL G L++ F ++ +SS
Sbjct: 64 TLVVGLSTLATGIFCMAALGL----GALSIQIFWQNLTSS 99
>gi|428214090|ref|YP_007087234.1| hypothetical protein Oscil6304_3756 [Oscillatoria acuminata PCC
6304]
gi|428002471|gb|AFY83314.1| Protein of unknown function (DUF3082) [Oscillatoria acuminata PCC
6304]
Length = 109
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 1/88 (1%)
Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYL 206
P+ ++ +G L +G A T+ I + N+ SDN I+ +RT+V G+ L
Sbjct: 22 PTLMRCFMGSLISGGFAYACMMLTSAIASTFANKPIHSDNPIAVNISAAVRTLVTGVSTL 81
Query: 207 ATFVFGINSVGLFLYSGQLALNSFTEDS 234
ATFVF ++GLF QL FT+ S
Sbjct: 82 ATFVFAFAALGLFCLGLQLLFQKFTKKS 109
>gi|440680351|ref|YP_007155146.1| hypothetical protein Anacy_0644 [Anabaena cylindrica PCC 7122]
gi|428677470|gb|AFZ56236.1| hypothetical protein Anacy_0644 [Anabaena cylindrica PCC 7122]
Length = 109
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 7/105 (6%)
Query: 141 SIDAKG--PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIR 197
S D G PP+P++ + G + +G + +Y I + I SDN +I+ +R
Sbjct: 9 STDTSGQVPPTPLRCITGAIMSGGLGFAMYSLMIAIATTFAAKPIHSDNLLAIKISAAVR 68
Query: 198 TIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETEN 242
T+V G+ L +FGI +GL + QL T+ KET +
Sbjct: 69 TLVVGIVALGAGIFGIVGIGLLALAIQLVFQQLTKP----KETNS 109
>gi|434387822|ref|YP_007098433.1| Protein of unknown function (DUF3082) [Chamaesiphon minutus PCC
6605]
gi|428018812|gb|AFY94906.1| Protein of unknown function (DUF3082) [Chamaesiphon minutus PCC
6605]
Length = 98
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 45/83 (54%), Gaps = 3/83 (3%)
Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI--SDNFSVRQITITIRTIVNGLCY 205
PSP++ L G L AG ++I +Y T+ I + + N +V I +RT+V G
Sbjct: 10 PSPLRCLTGSLIAGAMSIAMYMMTSKIATTFATKPMVQKTNMAV-NIATALRTLVVGSTA 68
Query: 206 LATFVFGINSVGLFLYSGQLALN 228
L VFGI ++GL L + Q+A N
Sbjct: 69 LGAAVFGIIAIGLILLAIQVAFN 91
>gi|218441368|ref|YP_002379697.1| hypothetical protein PCC7424_4465 [Cyanothece sp. PCC 7424]
gi|218174096|gb|ACK72829.1| hypothetical protein PCC7424_4465 [Cyanothece sp. PCC 7424]
Length = 115
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 46/96 (47%), Gaps = 1/96 (1%)
Query: 145 KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGL 203
K +P + LLG AG++ +Y T I N+ S N V I+ +RT+V G+
Sbjct: 19 KNEVTPQKCLLGAAMAGILGTGMYYLTYAIGTTFANKPITSSNQLVINISSAVRTLVVGV 78
Query: 204 CYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKE 239
LATF+F S GL L + QL L SS +
Sbjct: 79 ASLATFIFAFVSFGLVLLAIQLTLQGIKGRLKSSSD 114
>gi|334120141|ref|ZP_08494223.1| hypothetical protein MicvaDRAFT_4862 [Microcoleus vaginatus FGP-2]
gi|333456929|gb|EGK85556.1| hypothetical protein MicvaDRAFT_4862 [Microcoleus vaginatus FGP-2]
Length = 117
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 47/93 (50%), Gaps = 1/93 (1%)
Query: 134 LKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQI 192
L + G ++ + PP+ ++ ++G L AG A LY T +I I SDN + I
Sbjct: 14 LSSAGDATSASSTPPTILRCVMGSLIAGGFATALYALTLSIVQTFANSPIHSDNVTAINI 73
Query: 193 TITIRTIVNGLCYLATFVFGINSVGLFLYSGQL 225
+RT+V G+ L VF S+GL L + QL
Sbjct: 74 GAAVRTLVMGIVALGAGVFAFASLGLMLLAVQL 106
>gi|428780609|ref|YP_007172395.1| hypothetical protein Dacsa_2441 [Dactylococcopsis salina PCC 8305]
gi|428694888|gb|AFZ51038.1| Protein of unknown function (DUF3082) [Dactylococcopsis salina PCC
8305]
Length = 95
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 46/85 (54%), Gaps = 1/85 (1%)
Query: 153 ALLGGLTAGVIAIILYKFTTTIEAALNRQTISD-NFSVRQITITIRTIVNGLCYLATFVF 211
ALLG + AG ++ LY+ T I A Q + D N + I + +RT+V G+ LAT +F
Sbjct: 11 ALLGSVIAGGLSFPLYRLTIAIIARYAEQPLPDTNQTAATIAVAVRTLVMGMGVLATAIF 70
Query: 212 GINSVGLFLYSGQLALNSFTEDSSS 236
+ +VGLFL + Q + + S
Sbjct: 71 ILVAVGLFLLAVQTGIQWLQDQQQS 95
>gi|254415378|ref|ZP_05029139.1| hypothetical protein MC7420_3302 [Coleofasciculus chthonoplastes
PCC 7420]
gi|196177853|gb|EDX72856.1| hypothetical protein MC7420_3302 [Coleofasciculus chthonoplastes
PCC 7420]
Length = 105
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 48/85 (56%), Gaps = 1/85 (1%)
Query: 151 VQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYLATF 209
++ L+ L +G +AI Y TT I +L N+ S N + I I +RT+V G+ LAT
Sbjct: 20 LRCLISALISGSLAIAGYWLTTAIAQSLANKPLPSGNVTAINIAIAVRTLVVGISTLATA 79
Query: 210 VFGINSVGLFLYSGQLALNSFTEDS 234
VF + +VGL + QL + FT+ +
Sbjct: 80 VFSLVAVGLVALAIQLTVQRFTQGT 104
>gi|307150620|ref|YP_003886004.1| hypothetical protein Cyan7822_0697 [Cyanothece sp. PCC 7822]
gi|306980848|gb|ADN12729.1| conserved hypothetical protein [Cyanothece sp. PCC 7822]
Length = 113
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 49/91 (53%), Gaps = 1/91 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLA 207
+P++ LLG + + +Y T +I + ++ I S N V I +RT+V G+ LA
Sbjct: 23 TPLRCLLGAAISATLGTGMYYLTYSIGSTFAQKPITSSNQLVINIGSAVRTLVVGVASLA 82
Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSSSSK 238
TF+F S GL L + QL L + + SSS+
Sbjct: 83 TFIFAFVSFGLVLLAIQLTLQAIKQRLSSSE 113
>gi|434393091|ref|YP_007128038.1| hypothetical protein Glo7428_2367 [Gloeocapsa sp. PCC 7428]
gi|428264932|gb|AFZ30878.1| hypothetical protein Glo7428_2367 [Gloeocapsa sp. PCC 7428]
Length = 104
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 49/88 (55%), Gaps = 1/88 (1%)
Query: 151 VQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLATF 209
++ L G L +G +AI LY T+ I + I SDN +V I +RT+V G+ L T
Sbjct: 17 LRCLTGSLISGGLAIALYSLTSAIAQTFAAKPIHSDNPAVINIASAVRTLVVGITALGTG 76
Query: 210 VFGINSVGLFLYSGQLALNSFTEDSSSS 237
+FG+ ++GL + QL + T+ SS S
Sbjct: 77 IFGLVALGLIGLAIQLLIQQATKKSSPS 104
>gi|428774211|ref|YP_007165999.1| hypothetical protein Cyast_2404 [Cyanobacterium stanieri PCC 7202]
gi|428688490|gb|AFZ48350.1| hypothetical protein Cyast_2404 [Cyanobacterium stanieri PCC 7202]
Length = 114
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 47/85 (55%), Gaps = 7/85 (8%)
Query: 138 GSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFS---VRQITI 194
S SID P +P + GG +G++AI Y T ++ L + NF+ +I
Sbjct: 17 ASESIDK--PITPTRCFTGGAISGLLAIAAYLLTKSV--VLTYADVPINFNNPMAARIAS 72
Query: 195 TIRTIVNGLCYLATFVFGINSVGLF 219
T+RT+V G+ +ATFVF + ++GLF
Sbjct: 73 TVRTLVMGITTMATFVFVMVTIGLF 97
>gi|411117762|ref|ZP_11390143.1| Protein of unknown function (DUF3082) [Oscillatoriales
cyanobacterium JSC-12]
gi|410711486|gb|EKQ68992.1| Protein of unknown function (DUF3082) [Oscillatoriales
cyanobacterium JSC-12]
Length = 127
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 143 DAKGPP-SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIV 200
D+ PP +P++ L+G + A V+A +Y T I + I S + V++I+ +RT+V
Sbjct: 24 DSDNPPITPLRCLIGSMLATVLAYAMYLMTHAIAQSFAAHKIQSTSLIVQRISSAVRTLV 83
Query: 201 NGLCYLATFVFGINSV 216
G+ L T VFGI ++
Sbjct: 84 VGMTTLGTGVFGIAAI 99
>gi|284929157|ref|YP_003421679.1| hypothetical protein UCYN_05970 [cyanobacterium UCYN-A]
gi|284809616|gb|ADB95321.1| hypothetical protein UCYN_05970 [cyanobacterium UCYN-A]
Length = 113
Score = 43.9 bits (102), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 45/81 (55%), Gaps = 1/81 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALN-RQTISDNFSVRQITITIRTIVNGLCYLA 207
+P L+G AG++ LY T++I + N + IS N + ++ ++T+V G+ LA
Sbjct: 28 NPFSCLIGSFLAGILGYGLYMLTSSIIHSFNAKPIISSNLLIIKLGSLVKTLVMGVASLA 87
Query: 208 TFVFGINSVGLFLYSGQLALN 228
TF+ S GL L + QL LN
Sbjct: 88 TFMCFFISFGLILLACQLFLN 108
>gi|218246460|ref|YP_002371831.1| hypothetical protein PCC8801_1624 [Cyanothece sp. PCC 8801]
gi|257059503|ref|YP_003137391.1| hypothetical protein Cyan8802_1651 [Cyanothece sp. PCC 8802]
gi|218166938|gb|ACK65675.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
gi|256589669|gb|ACV00556.1| conserved hypothetical protein [Cyanothece sp. PCC 8802]
Length = 104
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 47/83 (56%), Gaps = 1/83 (1%)
Query: 145 KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGL 203
K +P++ L+G +G +AI LY T +I + + + S+N V +I +RT+V G+
Sbjct: 16 KEKVTPLRCLIGSTISGTLAIALYSLTASIFHSFATKPLTSNNALVLRIGSLVRTLVMGV 75
Query: 204 CYLATFVFGINSVGLFLYSGQLA 226
L TF+F + GL L + QL+
Sbjct: 76 ASLGTFIFAFVAFGLILLAIQLS 98
>gi|443318284|ref|ZP_21047540.1| Protein of unknown function (DUF3082) [Leptolyngbya sp. PCC 6406]
gi|442782090|gb|ELR92174.1| Protein of unknown function (DUF3082) [Leptolyngbya sp. PCC 6406]
Length = 133
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 47/89 (52%), Gaps = 1/89 (1%)
Query: 152 QALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYLATFV 210
+A LG A +A++LY+ I N+ SDN +V ++ +RT+V G+ + T V
Sbjct: 42 RAFLGSGIATTLALLLYRMLAGIALTFANKPVHSDNVTVVNLSAAVRTLVLGVVAMGTGV 101
Query: 211 FGINSVGLFLYSGQLALNSFTEDSSSSKE 239
FGI ++GL L Q+ T S ++ E
Sbjct: 102 FGIAALGLLLLGLQMVWQRLTGKSPATPE 130
>gi|427725819|ref|YP_007073096.1| hypothetical protein Lepto7376_4133 [Leptolyngbya sp. PCC 7376]
gi|427357539|gb|AFY40262.1| hypothetical protein Lepto7376_4133 [Leptolyngbya sp. PCC 7376]
Length = 109
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 37/70 (52%)
Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLA 207
PSP++ G AG ++ Y T + + N +I IT+RT++ G+C +A
Sbjct: 18 PSPLRCWTGTAVAGSMSFAAYLVTRNVIINFANNPPTGNTVAMRIAITVRTLLMGICTMA 77
Query: 208 TFVFGINSVG 217
T VFG+ ++G
Sbjct: 78 TAVFGMIAIG 87
>gi|300864421|ref|ZP_07109292.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300337565|emb|CBN54440.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 113
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYLA 207
S ++ + G + AG A LY T +I N+ T SDN +V +I +RT+V G+ L
Sbjct: 21 SILRCITGSMIAGAFASGLYLLTASIAQTFANKPTHSDNITVIKIASAVRTLVVGMAALG 80
Query: 208 TFVFGINSVGLFLYSGQL 225
T +F + ++GL + QL
Sbjct: 81 TGIFALAALGLMGLAVQL 98
>gi|427714245|ref|YP_007062869.1| hypothetical protein Syn6312_3283 [Synechococcus sp. PCC 6312]
gi|427378374|gb|AFY62326.1| Protein of unknown function (DUF3082) [Synechococcus sp. PCC 6312]
Length = 100
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQ-ITITIRTIVNGLCYLA 207
+P++ L G L AG + I+LY+ T I ++ + + Q +++ IRT+V GL +A
Sbjct: 9 TPLRCLTGALVAGTLGILLYRLTQAIASSFATHPLHTQSQIAQSLSVAIRTLVVGLSTMA 68
Query: 208 TFVFGI 213
T +F +
Sbjct: 69 TGIFAL 74
>gi|116074646|ref|ZP_01471907.1| hypothetical protein RS9916_28969 [Synechococcus sp. RS9916]
gi|116067868|gb|EAU73621.1| hypothetical protein RS9916_28969 [Synechococcus sp. RS9916]
Length = 124
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 38/72 (52%), Gaps = 1/72 (1%)
Query: 150 PVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQ-ITITIRTIVNGLCYLAT 208
P+ L G T+ V+A + F+ + + + S+ Q I ++T++ G+C+LAT
Sbjct: 38 PLSFLSGSATSLVLAWFSWGFSKRMVLYFAEHSTHFSSSIAQSIATALKTLMTGMCFLAT 97
Query: 209 FVFGINSVGLFL 220
F F +GLFL
Sbjct: 98 FSFAFIGLGLFL 109
>gi|126660189|ref|ZP_01731307.1| hypothetical protein CY0110_08921 [Cyanothece sp. CCY0110]
gi|126618554|gb|EAZ89305.1| hypothetical protein CY0110_08921 [Cyanothece sp. CCY0110]
Length = 105
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 1/79 (1%)
Query: 148 PSPVQALLGGLTAGVIAIILYKFTTTI-EAALNRQTISDNFSVRQITITIRTIVNGLCYL 206
P+P L+G +G + LY T +I + + S N V +I +RT+V G+ L
Sbjct: 17 PTPANCLIGATISGALGYALYTLTASIIQTFATKPLTSSNTLVLRIGSLVRTLVMGVASL 76
Query: 207 ATFVFGINSVGLFLYSGQL 225
TF+F + GL L + QL
Sbjct: 77 GTFIFFFVAFGLILLAIQL 95
>gi|354565157|ref|ZP_08984332.1| hypothetical protein FJSC11DRAFT_0538 [Fischerella sp. JSC-11]
gi|353549116|gb|EHC18558.1| hypothetical protein FJSC11DRAFT_0538 [Fischerella sp. JSC-11]
Length = 101
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 46/82 (56%), Gaps = 1/82 (1%)
Query: 145 KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGL 203
+ +P+++L+G + +G +A Y + I + + + SDN V +I+ +RT+V G+
Sbjct: 10 QAKATPLRSLIGAVISGSLAFAAYFLMSAIATSFAAKPLHSDNLIVLRISSAVRTLVLGI 69
Query: 204 CYLATFVFGINSVGLFLYSGQL 225
L + VF I ++GL + QL
Sbjct: 70 AALGSGVFAIVAIGLVALAIQL 91
>gi|186685003|ref|YP_001868199.1| hypothetical protein Npun_R4910 [Nostoc punctiforme PCC 73102]
gi|186467455|gb|ACC83256.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 109
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 46/89 (51%), Gaps = 1/89 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLA 207
SP++ L+G + +G + LY I + + S N V +I+ +RT+V G+ L
Sbjct: 19 SPLRCLIGAVISGGMGYALYSLMIAIATNFAAKPLHSINPLVIKISSAVRTLVVGVVALG 78
Query: 208 TFVFGINSVGLFLYSGQLALNSFTEDSSS 236
+ +FGI ++GL QL + ++ SS
Sbjct: 79 SGIFGIVAIGLLALGVQLLMQQLSKQKSS 107
>gi|282899537|ref|ZP_06307501.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281195416|gb|EFA70349.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 100
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 1/76 (1%)
Query: 143 DAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVN 201
D PP+P++ + G +G +A +Y I + I SDN V I +RT+V
Sbjct: 10 DTNAPPTPLRCITGATISGGLAFAMYSLMIAIATTFASKPIHSDNQIVINIASAVRTLVV 69
Query: 202 GLCYLATFVFGINSVG 217
G+ L +F I ++G
Sbjct: 70 GVVALGMGIFSIVAIG 85
>gi|342318945|gb|EGU10900.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
Length = 1616
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 49/98 (50%), Gaps = 18/98 (18%)
Query: 17 SHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVELPQSIF 76
+ HP K+ + P +S S F RRRP P+C IT+PT S S+
Sbjct: 224 TRHPPKV---RPPRLSPSLFARRRPSFIPHCRC--------ITSPTPSHSHRTTPCNSLL 272
Query: 77 ATTDEPSSLQVATSVLLTGAISVFL-FRALRRRAKRAK 113
P L++A VL+ GA++V L + LRRR +RA+
Sbjct: 273 -----PMLLRLAV-VLVYGAVAVSLPLQPLRRRVRRAE 304
>gi|394988291|ref|ZP_10381129.1| hypothetical protein SCD_00693 [Sulfuricella denitrificans skB26]
gi|393792749|dbj|GAB70768.1| hypothetical protein SCD_00693 [Sulfuricella denitrificans skB26]
Length = 196
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/117 (26%), Positives = 56/117 (47%), Gaps = 7/117 (5%)
Query: 64 SEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAK 123
++E +E+ + IFA D L++A+ TG +V+ RAL + EL +G +
Sbjct: 11 NQEPILEVLKEIFA--DRQRVLEIASG---TGQHAVYFGRALPHLTWQTSELAQNHAGIQ 65
Query: 124 KSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
L D L N+ L +ID P P++++ A + I+ ++ T+ A + R
Sbjct: 66 AWLDDSLLPNV--LPPVAIDVSAPKWPIESVDAVFNANTVHIVAWQEVETMFAGIAR 120
>gi|331671448|ref|ZP_08372246.1| pentapeptide repeats (8 copies) (3 repeats) [Escherichia coli
TA280]
gi|331071293|gb|EGI42650.1| pentapeptide repeats (8 copies) (3 repeats) [Escherichia coli
TA280]
Length = 704
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
Query: 193 TITIRTIVNGLCYLATFVFGINSVGLFLYSGQLALNSFTEDSSSSKETENIGEQQSGSLN 252
TI I TI +C+ V ++ +LY+ + N DS+ + +++I +Q S S +
Sbjct: 571 TIFIDTIFKNVCFFNAKVNNVSLKNAYLYNDNI--NKKANDSAEKQASDSIEKQASDSTD 628
Query: 253 STAENATDTELNSGKEDQSSDS 274
A N+TD + N + Q++DS
Sbjct: 629 KQANNSTDKQANDSTDKQANDS 650
>gi|17230723|ref|NP_487271.1| hypothetical protein alr3231 [Nostoc sp. PCC 7120]
gi|17132326|dbj|BAB74930.1| alr3231 [Nostoc sp. PCC 7120]
Length = 105
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCY 205
PP+P++ + G +G + LY +I ++ I SDN V +T +RT+V G+
Sbjct: 15 PPTPLRCITGAGISGGMGYALYLLMISIATTFAKKPIHSDNQLVISLTSAVRTLVVGIVA 74
Query: 206 LATFVFGINSVG 217
L T +F I ++G
Sbjct: 75 LGTGIFAIVTIG 86
>gi|434403925|ref|YP_007146810.1| Protein of unknown function (DUF3082) [Cylindrospermum stagnale PCC
7417]
gi|428258180|gb|AFZ24130.1| Protein of unknown function (DUF3082) [Cylindrospermum stagnale PCC
7417]
Length = 109
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 37/72 (51%), Gaps = 1/72 (1%)
Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCY 205
PP+P++ + G +G + +Y I + I SDN V I +RT+V G+
Sbjct: 17 PPTPLRCVTGAAFSGGLGYAMYTLMIAIATTFATKPIRSDNPMVVNIGSAVRTLVVGVVA 76
Query: 206 LATFVFGINSVG 217
L T +FG+ +VG
Sbjct: 77 LGTGIFGLVAVG 88
>gi|75911084|ref|YP_325380.1| hypothetical protein Ava_4888 [Anabaena variabilis ATCC 29413]
gi|75704809|gb|ABA24485.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 105
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCY 205
PP+P++ + G +G + LY +I ++ I SDN V +T +RT+V G+
Sbjct: 15 PPTPLRCITGAGISGGMGYALYLLMISIATTFAKKPIHSDNQLVISLTSAVRTLVVGIVA 74
Query: 206 LATFVFGINSVG 217
L T +F I ++G
Sbjct: 75 LGTGIFAIVTIG 86
>gi|220906290|ref|YP_002481601.1| hypothetical protein Cyan7425_0854 [Cyanothece sp. PCC 7425]
gi|219862901|gb|ACL43240.1| hypothetical protein Cyan7425_0854 [Cyanothece sp. PCC 7425]
Length = 108
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 48/83 (57%), Gaps = 1/83 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV-RQITITIRTIVNGLCYLA 207
+P + L G + A ++A++LY+ TT I + IS + + +++ +RT+V GL +A
Sbjct: 19 TPWKCLRGSVLAAILALLLYRLTTAIAQSFAAHPISTHNQITYNLSVAVRTLVVGLSTMA 78
Query: 208 TFVFGINSVGLFLYSGQLALNSF 230
T +F I ++GL + QL + F
Sbjct: 79 TAIFAIAALGLVGLAIQLVVQKF 101
>gi|87302770|ref|ZP_01085581.1| hypothetical protein WH5701_13495 [Synechococcus sp. WH 5701]
gi|87282653|gb|EAQ74611.1| hypothetical protein WH5701_13495 [Synechococcus sp. WH 5701]
Length = 104
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 5/78 (6%)
Query: 147 PP--SPVQALLGGLTAGVIAIILYKFTTTIEA--ALNRQTISDNFSVRQITITIRTIVNG 202
PP P+ L G LT+ ++A + + + +L+ S F+ + I ++T+V G
Sbjct: 13 PPRKGPLSFLSGALTSALLAWVCLGLSQRVVGWYSLHPPHYSKAFA-QSIGTAMKTLVVG 71
Query: 203 LCYLATFVFGINSVGLFL 220
+ +LATF FG ++GLFL
Sbjct: 72 MSFLATFTFGFVALGLFL 89
>gi|427709929|ref|YP_007052306.1| hypothetical protein Nos7107_4628 [Nostoc sp. PCC 7107]
gi|427362434|gb|AFY45156.1| hypothetical protein Nos7107_4628 [Nostoc sp. PCC 7107]
Length = 105
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 44/85 (51%), Gaps = 1/85 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLA 207
SP++ + G + + +A LY I + + + S N V +IT +RT+V G+ L
Sbjct: 17 SPLRCVTGAIISSGMAYALYSLMIAIATSFASKPVHSHNELVIKITSAVRTLVVGIVALG 76
Query: 208 TFVFGINSVGLFLYSGQLALNSFTE 232
+ +FGI ++GL QL + ++
Sbjct: 77 SGIFGIVAIGLIALGVQLLIQQLSQ 101
>gi|113475976|ref|YP_722037.1| hypothetical protein Tery_2344 [Trichodesmium erythraeum IMS101]
gi|110167024|gb|ABG51564.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 109
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 23/86 (26%), Positives = 43/86 (50%), Gaps = 1/86 (1%)
Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCYLA 207
+P+Q L + +G +A + Y ++ + I SDNF V ++ +RT+V G+ L
Sbjct: 17 NPLQCLASAVISGSLATVTYSLMQSVAKTFASKPITSDNFLVVNLSGAVRTLVVGVIALV 76
Query: 208 TFVFGINSVGLFLYSGQLALNSFTED 233
+ I ++GL + Q ++ FT
Sbjct: 77 ACICAIVTLGLIALAIQTIIHRFTNK 102
>gi|116794213|gb|ABK27047.1| unknown [Picea sitchensis]
Length = 321
Score = 37.7 bits (86), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 21/75 (28%), Positives = 37/75 (49%)
Query: 152 QALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLATFVF 211
QA +G + A I+ +K +T +++ N + + ++R + LCY T +
Sbjct: 243 QAYIGSIIALAFGILSWKLSTGVQSVPKDSLQYANENALLLGNSLRGTLLALCYSCTLLS 302
Query: 212 GINSVGLFLYSGQLA 226
G+ VGLFL QL+
Sbjct: 303 GVTMVGLFLIGRQLS 317
>gi|428221170|ref|YP_007105340.1| hypothetical protein Syn7502_01096 [Synechococcus sp. PCC 7502]
gi|427994510|gb|AFY73205.1| Protein of unknown function (DUF3082) [Synechococcus sp. PCC 7502]
Length = 106
Score = 37.7 bits (86), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 45/92 (48%), Gaps = 19/92 (20%)
Query: 143 DAKGPP--------SPVQALLGGLTAGVIAIILYKFTTTIEAAL------NRQTISDNFS 188
+ K PP SP+ ++G + + ++A LY FT + L N QT++D
Sbjct: 5 NPKSPPNSQSQENISPISGIIGAVISAILAYGLYNFTLGVARKLALSPFQNAQTLAD--- 61
Query: 189 VRQITITIRTIVNGLCYLATFVFGINSVGLFL 220
+I IRTI+ + T +FG+ ++GL L
Sbjct: 62 --RIGAAIRTILLAMGSGITIIFGVIAIGLVL 91
>gi|282896424|ref|ZP_06304445.1| conserved hypothetical protein [Raphidiopsis brookii D9]
gi|281198712|gb|EFA73592.1| conserved hypothetical protein [Raphidiopsis brookii D9]
Length = 100
Score = 37.4 bits (85), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 1/72 (1%)
Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITIRTIVNGLCY 205
PP+P++ + G +G +A +Y I + I SDN V I +RT+V G+
Sbjct: 14 PPTPLRCITGATISGGLAFAMYSLMIAIATTFASKPIHSDNQIVINIASAVRTLVVGVVA 73
Query: 206 LATFVFGINSVG 217
L +F I ++G
Sbjct: 74 LGMGIFSIVTIG 85
>gi|113954978|ref|YP_730797.1| hypothetical protein sync_1592 [Synechococcus sp. CC9311]
gi|113882329|gb|ABI47287.1| Uncharacterized membrane protein [Synechococcus sp. CC9311]
Length = 105
Score = 37.4 bits (85), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 20/72 (27%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 150 PVQALLGGLTAGVIAIILYKFTTTIEAAL-NRQTISDNFSVRQITITIRTIVNGLCYLAT 208
P+ L G LT+ +++ + ++ +T + NR + + + + I ++T++ G+C+LAT
Sbjct: 19 PISFLSGSLTSLLMSWLSFRLSTGMVTYFANRPSTFSSATAQSIASALKTLLIGMCFLAT 78
Query: 209 FVFGINSVGLFL 220
F +GLFL
Sbjct: 79 FSCAFVGIGLFL 90
>gi|427701773|ref|YP_007044995.1| hypothetical protein Cyagr_0462 [Cyanobium gracile PCC 6307]
gi|427344941|gb|AFY27654.1| Protein of unknown function (DUF3082) [Cyanobium gracile PCC 6307]
Length = 100
Score = 37.4 bits (85), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 40/77 (51%), Gaps = 3/77 (3%)
Query: 147 PP--SPVQALLGGLTAGVIAIILYKFTT-TIEAALNRQTISDNFSVRQITITIRTIVNGL 203
PP P+ L G +T+G++A + + + + D+ + I ++T++ G+
Sbjct: 9 PPRKGPLSFLSGAITSGLLAWLSLGVSQRVVGWYVTHPPHFDSAFAQSIATAMKTLMVGM 68
Query: 204 CYLATFVFGINSVGLFL 220
C+LATF F +GLFL
Sbjct: 69 CFLATFTFAFIGLGLFL 85
>gi|428768855|ref|YP_007160645.1| hypothetical protein Cyan10605_0459 [Cyanobacterium aponinum PCC
10605]
gi|428683134|gb|AFZ52601.1| hypothetical protein Cyan10605_0459 [Cyanobacterium aponinum PCC
10605]
Length = 109
Score = 37.0 bits (84), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 149 SPVQALLGGLTAGVIAIILYKFT-TTIEAALNRQTISDNFSVRQITITIRTIVNGLCYLA 207
+P++ + G +G +A+ Y T + I +N +N I T+RT++ G+ +A
Sbjct: 22 TPLRCMTGSAISGGLAVAAYLLTKSVILTYVNMPIKFNNPLAANIASTVRTLIMGVTTMA 81
Query: 208 TFVFGINSVGLFLYSGQLALNSFTED 233
TF+F + +VGLF L + F E+
Sbjct: 82 TFLFLMVAVGLF----ALGVKRFIEE 103
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.314 0.128 0.349
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,684,161,189
Number of Sequences: 23463169
Number of extensions: 138269352
Number of successful extensions: 433430
Number of sequences better than 100.0: 145
Number of HSP's better than 100.0 without gapping: 63
Number of HSP's successfully gapped in prelim test: 82
Number of HSP's that attempted gapping in prelim test: 433167
Number of HSP's gapped (non-prelim): 249
length of query: 276
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 136
effective length of database: 9,074,351,707
effective search space: 1234111832152
effective search space used: 1234111832152
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 76 (33.9 bits)