BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 047194
(270 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255540025|ref|XP_002511077.1| conserved hypothetical protein [Ricinus communis]
gi|223550192|gb|EEF51679.1| conserved hypothetical protein [Ricinus communis]
Length = 265
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 224/270 (82%), Positives = 243/270 (90%), Gaps = 5/270 (1%)
Query: 1 MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
M S+S SN GW RA+G+ +K VLIGGAVL+KRLTKS TRWDHAR V+ SLSGEK
Sbjct: 1 MVTGSESESN---GWS-RARGIVIKTLVLIGGAVLVKRLTKSTTRWDHARFVSQSLSGEK 56
Query: 61 FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
FS+EQA+RDPDN+FNIRMLTCPAAEMVDGSK+LY EQAFWRTPQKPFRQRFYMVKPCPKE
Sbjct: 57 FSREQAARDPDNYFNIRMLTCPAAEMVDGSKVLYFEQAFWRTPQKPFRQRFYMVKPCPKE 116
Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
LKCDVEVSSYAIR+ EEYKNFCDRP+DQRPLPEEVIGDI EHLTTI+L RCDRGKRCLYE
Sbjct: 117 LKCDVEVSSYAIRDAEEYKNFCDRPKDQRPLPEEVIGDIAEHLTTIYLNRCDRGKRCLYE 176
Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
GSTPPGGFPNSW NGAT+ TSELA+LKNNE+HTWDRG+DDDGNQVWGVK GPYEFKPAP
Sbjct: 177 GSTPPGGFPNSW-NGATFCTSELAILKNNELHTWDRGYDDDGNQVWGVKEGPYEFKPAPG 235
Query: 241 SSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
SS +DMFSPLNFPP +EKRIEGSFVLQE
Sbjct: 236 SSVNDMFSPLNFPPPLSMEKRIEGSFVLQE 265
>gi|224122242|ref|XP_002318786.1| predicted protein [Populus trichocarpa]
gi|222859459|gb|EEE97006.1| predicted protein [Populus trichocarpa]
Length = 273
Score = 466 bits (1198), Expect = e-129, Method: Compositional matrix adjust.
Identities = 222/268 (82%), Positives = 241/268 (89%), Gaps = 3/268 (1%)
Query: 2 GPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKF 61
G SD S+ NG WG RA+GL +K+ VLIGG +L+KRLTKS TRWDHA+IV SL+GEKF
Sbjct: 8 GSGSDPTSDSNG-WG-RARGLALKSLVLIGGVLLVKRLTKSTTRWDHAKIVTQSLTGEKF 65
Query: 62 SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
SKEQASRDPDN+FNIRMLTCPAAEMVDGSK+LY EQAFWRTPQKPFRQRFYMVKPCPKEL
Sbjct: 66 SKEQASRDPDNYFNIRMLTCPAAEMVDGSKVLYFEQAFWRTPQKPFRQRFYMVKPCPKEL 125
Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
KCDVEV SYAIR+ EEYKNFCDR +DQRPLPEEVIGDI EHLTTIHL+RCDRGKRCLYEG
Sbjct: 126 KCDVEVGSYAIRDAEEYKNFCDRSKDQRPLPEEVIGDIAEHLTTIHLKRCDRGKRCLYEG 185
Query: 182 STPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSS 241
S PPGGFPNSW NGATY TSELA+LKNNEIHTWDRG+DD GNQVWGVK GPYEFKPAP+S
Sbjct: 186 SNPPGGFPNSW-NGATYCTSELAILKNNEIHTWDRGYDDGGNQVWGVKEGPYEFKPAPAS 244
Query: 242 SYSDMFSPLNFPPQQFLEKRIEGSFVLQ 269
S S++FSPLN PP Q +EKRIEGSFVLQ
Sbjct: 245 SVSELFSPLNLPPLQSMEKRIEGSFVLQ 272
>gi|317106609|dbj|BAJ53116.1| JHL07K02.6 [Jatropha curcas]
Length = 265
Score = 460 bits (1184), Expect = e-127, Method: Compositional matrix adjust.
Identities = 222/270 (82%), Positives = 240/270 (88%), Gaps = 5/270 (1%)
Query: 1 MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
MG SDS SN GW RA+GL VK VLIGGA+L+KRLTKS TRWDH RIV SLSGEK
Sbjct: 1 MGSGSDSESN---GWS-RARGLVVKTLVLIGGAILVKRLTKSTTRWDHTRIVTQSLSGEK 56
Query: 61 FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
FS+EQASRDPDN+FNIRMLTCPAA+MVDGSK+LY E+AF RTPQKPFRQRFYMVKPCPKE
Sbjct: 57 FSREQASRDPDNYFNIRMLTCPAADMVDGSKVLYFERAFCRTPQKPFRQRFYMVKPCPKE 116
Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
LKCDVEVSSYAIR+ EEYKNFCDRP+DQRPLPEEVIGDI EHLTTI+L+RCDRGKRCLYE
Sbjct: 117 LKCDVEVSSYAIRDAEEYKNFCDRPKDQRPLPEEVIGDIAEHLTTIYLKRCDRGKRCLYE 176
Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
GSTPPGGFPNSW NGAT+ TSELA+LKNNEIHTWDRG+DDDGNQVWGVK GPYEFKPAP+
Sbjct: 177 GSTPPGGFPNSW-NGATHCTSELAILKNNEIHTWDRGYDDDGNQVWGVKDGPYEFKPAPA 235
Query: 241 SSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
SS + M LNFPP +EKRIEGSFVLQE
Sbjct: 236 SSVNGMLPALNFPPPHSMEKRIEGSFVLQE 265
>gi|356516466|ref|XP_003526915.1| PREDICTED: chromophore lyase CpcT/CpeT 3-like [Glycine max]
Length = 268
Score = 460 bits (1184), Expect = e-127, Method: Compositional matrix adjust.
Identities = 215/270 (79%), Positives = 243/270 (90%), Gaps = 4/270 (1%)
Query: 1 MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
MG DSDS +N GW RA+GL +K +L GGA+L+KRL KS TRWDHA V+ SL+GEK
Sbjct: 1 MGTDSDSDAN---GWN-RARGLALKTLLLFGGALLVKRLRKSTTRWDHAHFVSKSLAGEK 56
Query: 61 FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
+SKEQASRDPDN+FNIRMLTCPAAE+VDGSK+LY EQAFWRTPQKPFRQR +MVKPCPKE
Sbjct: 57 YSKEQASRDPDNYFNIRMLTCPAAELVDGSKVLYFEQAFWRTPQKPFRQRLFMVKPCPKE 116
Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
LKCDVE+S+YAIR++EEYKNFCDRPRDQRP PEEVIGDI EHLTT+HL+RC RGKRCLY+
Sbjct: 117 LKCDVELSTYAIRDMEEYKNFCDRPRDQRPQPEEVIGDIAEHLTTVHLKRCPRGKRCLYK 176
Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
GSTPPGGFPNSWQNGAT+ TSELA+LKNN+IHTWDRG+DDDGNQVWG K GPYEFKPAP+
Sbjct: 177 GSTPPGGFPNSWQNGATHCTSELAILKNNDIHTWDRGYDDDGNQVWGQKEGPYEFKPAPT 236
Query: 241 SSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
SS+SDMFSPLNFPP +E+RIEGSFVLQE
Sbjct: 237 SSFSDMFSPLNFPPPPSMERRIEGSFVLQE 266
>gi|225456513|ref|XP_002284706.1| PREDICTED: chromophore lyase CpcT/CpeT 3 [Vitis vinifera]
gi|147860848|emb|CAN83158.1| hypothetical protein VITISV_022553 [Vitis vinifera]
gi|297734523|emb|CBI15770.3| unnamed protein product [Vitis vinifera]
Length = 267
Score = 456 bits (1172), Expect = e-126, Method: Compositional matrix adjust.
Identities = 219/270 (81%), Positives = 242/270 (89%), Gaps = 3/270 (1%)
Query: 1 MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
MG SDS S NG WG RA+G+ VKAAVLIGGA+LLKRLTKS TRWDHAR V+ SLSGEK
Sbjct: 1 MGTGSDSESEGNG-WG-RARGILVKAAVLIGGAILLKRLTKSTTRWDHARFVSHSLSGEK 58
Query: 61 FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
FS EQASRDPDN+FN RM+TCPAAE+VDGS++LY EQAFWRTP KPFRQRFYMVKPCPKE
Sbjct: 59 FSMEQASRDPDNYFNFRMVTCPAAELVDGSRVLYFEQAFWRTPSKPFRQRFYMVKPCPKE 118
Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
+KCDVE+SSYAIR+VEEYKNFCDR + QRPLPEEVIGDI EHLTTI+L+RC+RGKRCLYE
Sbjct: 119 MKCDVELSSYAIRDVEEYKNFCDRSKAQRPLPEEVIGDIAEHLTTIYLKRCERGKRCLYE 178
Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
GSTP GGFPNSW +GATY TSELAVLKNNEIH WDRG+DD+GNQVWGVK GPYEFKPAP+
Sbjct: 179 GSTPSGGFPNSW-SGATYCTSELAVLKNNEIHIWDRGYDDEGNQVWGVKEGPYEFKPAPA 237
Query: 241 SSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
SS +DMFSPLNF P +EKRI+GSFVLQE
Sbjct: 238 SSSNDMFSPLNFAPPLPMEKRIDGSFVLQE 267
>gi|356508878|ref|XP_003523180.1| PREDICTED: chromophore lyase CpcT/CpeT 3-like [Glycine max]
Length = 266
Score = 454 bits (1167), Expect = e-125, Method: Compositional matrix adjust.
Identities = 215/267 (80%), Positives = 241/267 (90%), Gaps = 5/267 (1%)
Query: 4 DSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSK 63
DSDS SN GW RA+GL +K +LIGGA+L+KRL KS TRWDHA V++SL+GEK+SK
Sbjct: 5 DSDSESN---GWN-RARGLALKTLLLIGGALLVKRLRKSTTRWDHAHFVSNSLTGEKYSK 60
Query: 64 EQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKC 123
EQASRDPDN+FNIRMLTCPAAE+VDGSK+LY EQAFWRTPQKPFRQR +MVKPCPKELKC
Sbjct: 61 EQASRDPDNYFNIRMLTCPAAELVDGSKVLYFEQAFWRTPQKPFRQRLFMVKPCPKELKC 120
Query: 124 DVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGST 183
DVE+S+YAIR++EEYKNFCDRPRDQRP PEEVIGDI EHLTT+HL+RC RGKRCLYEGST
Sbjct: 121 DVELSTYAIRDMEEYKNFCDRPRDQRPQPEEVIGDIAEHLTTVHLKRCPRGKRCLYEGST 180
Query: 184 PPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSY 243
PPGGFPNSW NGATY TSELA+LKNNEIHTWDRG+DD GNQVWG K GPYEFKPAP+SS+
Sbjct: 181 PPGGFPNSW-NGATYCTSELAILKNNEIHTWDRGYDDGGNQVWGQKEGPYEFKPAPTSSF 239
Query: 244 SDMFSPLNFPPQQFLEKRIEGSFVLQE 270
+DMFSPLNFPP +E+RIEGSFVLQE
Sbjct: 240 NDMFSPLNFPPPPSMERRIEGSFVLQE 266
>gi|388500552|gb|AFK38342.1| unknown [Lotus japonicus]
Length = 265
Score = 439 bits (1128), Expect = e-121, Method: Compositional matrix adjust.
Identities = 204/253 (80%), Positives = 232/253 (91%), Gaps = 1/253 (0%)
Query: 18 RAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIR 77
RA+GL VK +LIGGA+L+KRL KS TRWDHAR V++SL+GEK+SKEQASRDP+N+FNIR
Sbjct: 14 RARGLAVKTLLLIGGALLVKRLRKSTTRWDHARFVSNSLTGEKYSKEQASRDPENYFNIR 73
Query: 78 MLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEE 137
MLTCPAAE+VDGSK+LY EQAFWR+PQKPFRQRFY+VKP PKELKCDVE+S+YAIR++EE
Sbjct: 74 MLTCPAAELVDGSKVLYFEQAFWRSPQKPFRQRFYVVKPYPKELKCDVELSTYAIRDMEE 133
Query: 138 YKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGAT 197
YKNFCDRP+DQRP PEEVIGDI EHLTTIHL+RC RGKRC YEGSTPPGGFPNSW NGAT
Sbjct: 134 YKNFCDRPKDQRPQPEEVIGDIAEHLTTIHLKRCPRGKRCSYEGSTPPGGFPNSW-NGAT 192
Query: 198 YSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQF 257
Y TSELAVLKNNEIHTWDRGFDDDGNQVWG K GPYEFKPAP+S ++DM SPLNFPP
Sbjct: 193 YCTSELAVLKNNEIHTWDRGFDDDGNQVWGQKDGPYEFKPAPTSCFNDMLSPLNFPPPPS 252
Query: 258 LEKRIEGSFVLQE 270
+++RIEGSF+LQ+
Sbjct: 253 MDRRIEGSFILQD 265
>gi|15241343|ref|NP_199915.1| protein crumpled leaf [Arabidopsis thaliana]
gi|13877867|gb|AAK44011.1|AF370196_1 unknown protein [Arabidopsis thaliana]
gi|9758249|dbj|BAB08748.1| unnamed protein product [Arabidopsis thaliana]
gi|21281135|gb|AAM44980.1| unknown protein [Arabidopsis thaliana]
gi|45504116|dbj|BAD12566.1| CRUMPLED LEAF [Arabidopsis thaliana]
gi|332008639|gb|AED96022.1| protein crumpled leaf [Arabidopsis thaliana]
Length = 269
Score = 427 bits (1099), Expect = e-117, Method: Compositional matrix adjust.
Identities = 206/269 (76%), Positives = 233/269 (86%), Gaps = 5/269 (1%)
Query: 2 GPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKF 61
G D +S+SN GW RA+GL VK VLIGGA+L+KRLTKS TR DHAR+V+ SL+GEKF
Sbjct: 6 GSDPESSSN---GWS-RARGLVVKTLVLIGGALLIKRLTKSTTRRDHARVVSRSLTGEKF 61
Query: 62 SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
++EQASRDPDN+FNIRML+CPAAEMVDGS++LYLEQAFWRTPQKPFRQR YMVKPCPKEL
Sbjct: 62 TREQASRDPDNYFNIRMLSCPAAEMVDGSEVLYLEQAFWRTPQKPFRQRLYMVKPCPKEL 121
Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
KCDVEVSSYAIR+ EEYKNFCDRP+DQRPLPEEVIGDIGEHLTTIHL CDRGKRCLYEG
Sbjct: 122 KCDVEVSSYAIRDAEEYKNFCDRPKDQRPLPEEVIGDIGEHLTTIHLNCCDRGKRCLYEG 181
Query: 182 STPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSS 241
ST PGGFPNSW NGA+Y TS+LAVLKNNEIH WDRGFD++ NQVWG K GPYEFKPA SS
Sbjct: 182 STSPGGFPNSW-NGASYCTSDLAVLKNNEIHLWDRGFDENRNQVWGPKEGPYEFKPATSS 240
Query: 242 SYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
S ++ S LN Q ++K I+GS +LQ+
Sbjct: 241 SINENLSALNILYQSSIDKPIQGSLILQD 269
>gi|297795903|ref|XP_002865836.1| hypothetical protein ARALYDRAFT_918134 [Arabidopsis lyrata subsp.
lyrata]
gi|297311671|gb|EFH42095.1| hypothetical protein ARALYDRAFT_918134 [Arabidopsis lyrata subsp.
lyrata]
Length = 269
Score = 426 bits (1095), Expect = e-117, Method: Compositional matrix adjust.
Identities = 205/269 (76%), Positives = 233/269 (86%), Gaps = 5/269 (1%)
Query: 2 GPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKF 61
G D +S+SN GW RA+GL VK VLIGGA+L+KRLTKS TR DHAR+V+ SL+GEKF
Sbjct: 6 GSDPESSSN---GWS-RARGLVVKTLVLIGGALLIKRLTKSTTRRDHARVVSRSLTGEKF 61
Query: 62 SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
++EQASRDP+N+FNIRML+CPAAEMVDGS++LYLEQAFWRTPQKPFRQR YMVKPCPKEL
Sbjct: 62 TREQASRDPENYFNIRMLSCPAAEMVDGSEVLYLEQAFWRTPQKPFRQRLYMVKPCPKEL 121
Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
KCDVEVSSYAIR+ EEYKNFCDRP+DQRPLPEEVIGDIGEHLTTIHL CDRGKRCLYEG
Sbjct: 122 KCDVEVSSYAIRDAEEYKNFCDRPKDQRPLPEEVIGDIGEHLTTIHLNCCDRGKRCLYEG 181
Query: 182 STPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSS 241
ST PGGFPNSW NGA+Y TS+LAVLKNNEIH WDRGFD++ NQVWG K GPYEFKPA SS
Sbjct: 182 STSPGGFPNSW-NGASYCTSDLAVLKNNEIHLWDRGFDENRNQVWGPKEGPYEFKPATSS 240
Query: 242 SYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
S ++ S LN Q ++K I+GS +LQ+
Sbjct: 241 SINENLSALNILYQSSIDKPIQGSLILQD 269
>gi|21554618|gb|AAM63635.1| unknown [Arabidopsis thaliana]
Length = 269
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 205/269 (76%), Positives = 232/269 (86%), Gaps = 5/269 (1%)
Query: 2 GPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKF 61
G D +S+SN GW RA+GL VK VLIGGA+L+KRLTKS TR DHAR+V+ SL+GEKF
Sbjct: 6 GSDPESSSN---GWS-RARGLVVKTLVLIGGALLIKRLTKSTTRRDHARVVSRSLTGEKF 61
Query: 62 SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
++EQASRDPDN+FNIRML+CPAAEMVDGS++LYLEQAFWRTPQKPFRQR YMVKPCPKEL
Sbjct: 62 TREQASRDPDNYFNIRMLSCPAAEMVDGSEVLYLEQAFWRTPQKPFRQRLYMVKPCPKEL 121
Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
KCDVEVSSYAIR+ EEYK FCDRP+DQRPLPEEVIGDIGEHLTTIHL CDRGKRCLYEG
Sbjct: 122 KCDVEVSSYAIRDAEEYKXFCDRPKDQRPLPEEVIGDIGEHLTTIHLNCCDRGKRCLYEG 181
Query: 182 STPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSS 241
ST PGGFPNSW NGA+Y TS+LAVLKNNEIH WDRGFD++ NQVWG K GPYEFKPA SS
Sbjct: 182 STSPGGFPNSW-NGASYCTSDLAVLKNNEIHLWDRGFDENRNQVWGPKEGPYEFKPATSS 240
Query: 242 SYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
S ++ S LN Q ++K I+GS +LQ+
Sbjct: 241 SINENLSALNILYQSSIDKPIQGSLILQD 269
>gi|357464915|ref|XP_003602739.1| hypothetical protein MTR_3g098510 [Medicago truncatula]
gi|355491787|gb|AES72990.1| hypothetical protein MTR_3g098510 [Medicago truncatula]
Length = 267
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 208/270 (77%), Positives = 238/270 (88%), Gaps = 3/270 (1%)
Query: 1 MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
M +S+S S+ NG W RAQGL +KA +L+GGA+L+KRL KS TRWDH +V SL+GEK
Sbjct: 1 MCKESESESDSNG-WN-RAQGLALKALLLLGGALLVKRLRKSTTRWDHTHLVTQSLTGEK 58
Query: 61 FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
+SK+QASRDPDN+FNIRMLTCPAAE+VDGS +LY EQAFWR+PQKPFRQR M KPCPKE
Sbjct: 59 YSKDQASRDPDNYFNIRMLTCPAAELVDGSNVLYYEQAFWRSPQKPFRQRLLMTKPCPKE 118
Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
LKCDVE+S+YAIR++EEYKNFCDRP+DQRP PEEVIGDIGEHLTTIHL+RC RGKRCLYE
Sbjct: 119 LKCDVELSTYAIRDMEEYKNFCDRPKDQRPQPEEVIGDIGEHLTTIHLKRCSRGKRCLYE 178
Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
GSTPP GFPNSW NGATY TSELAV+KNNEIHTWDRG+DDDGNQVWG K GPYEFKPAP+
Sbjct: 179 GSTPPEGFPNSW-NGATYCTSELAVMKNNEIHTWDRGYDDDGNQVWGQKEGPYEFKPAPT 237
Query: 241 SSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
S ++D FSPLNFPP +++RIEGSFVLQE
Sbjct: 238 SCFNDTFSPLNFPPPPSMDRRIEGSFVLQE 267
>gi|388508268|gb|AFK42200.1| unknown [Medicago truncatula]
Length = 267
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 207/270 (76%), Positives = 237/270 (87%), Gaps = 3/270 (1%)
Query: 1 MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
M +S+S S+ NG W RAQGL +KA +L+GGA+L+KRL KS TRWDH +V SL+GEK
Sbjct: 1 MCKESESESDSNG-WN-RAQGLALKALLLLGGALLVKRLRKSTTRWDHTHLVTQSLTGEK 58
Query: 61 FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
+SK+QASRDPDN+FNIRMLTCPAAE+VDGS +LY EQAFWR+PQKPFRQR M KPCPKE
Sbjct: 59 YSKDQASRDPDNYFNIRMLTCPAAELVDGSNVLYYEQAFWRSPQKPFRQRLLMTKPCPKE 118
Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
LKCDVE+S+YAI ++EEYKNFCDRP+DQRP PEEVIGDIGEHLTTIHL+RC RGKRCLYE
Sbjct: 119 LKCDVELSTYAIGDMEEYKNFCDRPKDQRPQPEEVIGDIGEHLTTIHLKRCSRGKRCLYE 178
Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
GSTPP GFPNSW NGATY TSELAV+KNNEIHTWDRG+DDDGNQVWG K GPYEFKPAP+
Sbjct: 179 GSTPPEGFPNSW-NGATYCTSELAVMKNNEIHTWDRGYDDDGNQVWGQKEGPYEFKPAPT 237
Query: 241 SSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
S ++D FSPLNFPP +++RIEGSFVLQE
Sbjct: 238 SCFNDTFSPLNFPPPPSMDRRIEGSFVLQE 267
>gi|357156625|ref|XP_003577521.1| PREDICTED: chromophore lyase CpcT/CpeT 3-like [Brachypodium
distachyon]
Length = 269
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 192/270 (71%), Positives = 221/270 (81%), Gaps = 2/270 (0%)
Query: 1 MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
MG DS G GG +G +KA V+ GG +L++RL +S TRWDHAR VAD+LSGEK
Sbjct: 1 MGSGEDSPGGTGAGVGGIVRGAVLKALVVFGGVLLIRRLRRSTTRWDHARAVADALSGEK 60
Query: 61 FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
FS+EQA +DP+N+FN+RMLTCPA EMVDGS +LY EQAFWR P+KPFRQRFYMVKPCPKE
Sbjct: 61 FSREQARKDPENYFNLRMLTCPATEMVDGSGVLYFEQAFWRAPEKPFRQRFYMVKPCPKE 120
Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
+KCDVE+SSYAIR+VEEYKNFCDRP+DQRP PEEVI DI EHLTTIHL RC RGKRCLYE
Sbjct: 121 MKCDVELSSYAIRDVEEYKNFCDRPKDQRPQPEEVIADIAEHLTTIHLSRCGRGKRCLYE 180
Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
GSTP GGFPN+W +GA Y TS+L++ KN EIH WD+GFDD+GNQVWG KAGPYEFKPAP
Sbjct: 181 GSTPSGGFPNNW-SGAAYCTSDLSIHKNGEIHIWDKGFDDEGNQVWGTKAGPYEFKPAPK 239
Query: 241 SSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
S+Y DMFSPLNF LEK IE S+V +
Sbjct: 240 SNYDDMFSPLNFSAPLSLEK-IESSYVTDD 268
>gi|326514408|dbj|BAJ96191.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 269
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 182/247 (73%), Positives = 212/247 (85%), Gaps = 2/247 (0%)
Query: 24 VKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPA 83
+KA V+ GG +L++RL +S TRWDHAR VAD+LSGEKFS+EQA +DP N+FN+RMLTCPA
Sbjct: 24 LKALVVFGGVILIRRLRRSTTRWDHARTVADALSGEKFSREQARQDPGNYFNLRMLTCPA 83
Query: 84 AEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCD 143
EMVDGS +LY EQAFWR P+KPFRQRFYMVKPCPKE+KCDVE+SSYAIR+VEEYKNFCD
Sbjct: 84 TEMVDGSGVLYFEQAFWRAPEKPFRQRFYMVKPCPKEMKCDVELSSYAIRDVEEYKNFCD 143
Query: 144 RPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSEL 203
RP+DQRP PEEVI DI EHLTTIHL RC+RGKRCLYEGSTP GGFPNSW +GA Y TS+L
Sbjct: 144 RPKDQRPQPEEVIADIAEHLTTIHLSRCERGKRCLYEGSTPAGGFPNSW-SGAAYCTSDL 202
Query: 204 AVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQFLEKRIE 263
++ KN EIH WD+GFDD+G+QVWG KAGPYEFKPAP S+Y DMFSPLNF LEK +E
Sbjct: 203 SIHKNGEIHIWDKGFDDNGSQVWGTKAGPYEFKPAPKSNYDDMFSPLNFSAPLSLEK-ME 261
Query: 264 GSFVLQE 270
S+ + +
Sbjct: 262 SSYAIDD 268
>gi|449454794|ref|XP_004145139.1| PREDICTED: chromophore lyase CpcT/CpeT 3-like [Cucumis sativus]
gi|449473876|ref|XP_004154009.1| PREDICTED: chromophore lyase CpcT/CpeT 3-like [Cucumis sativus]
Length = 266
Score = 397 bits (1019), Expect = e-108, Method: Compositional matrix adjust.
Identities = 200/270 (74%), Positives = 231/270 (85%), Gaps = 5/270 (1%)
Query: 1 MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
MG +S+ +N + GW RA GL +K VL+GGA+LLKRLTKS TRWDHAR V+ SLSGEK
Sbjct: 1 MGTNSNDNNN-SVGWN-RAGGLIMKTLVLVGGALLLKRLTKSTTRWDHARFVSQSLSGEK 58
Query: 61 FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
FSKEQA+RDPDN+FNIR + P +VDGS +LY EQAF RTPQKPFRQRFY VKPC KE
Sbjct: 59 FSKEQAARDPDNYFNIRYVNPPPPPLVDGSTVLYFEQAFSRTPQKPFRQRFYTVKPCSKE 118
Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
+KCDVE+SSYAIRE+EEYKNFCDR +DQRPLPEE++GDI EHLTT+HL+RCDRGKRCLYE
Sbjct: 119 MKCDVELSSYAIREMEEYKNFCDRTKDQRPLPEEIVGDIAEHLTTLHLKRCDRGKRCLYE 178
Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
GSTPPGGFPNSW +GA+Y TSE+A+LKNNE+HTW+RG+D+DGNQVWG K GPYEFKP P+
Sbjct: 179 GSTPPGGFPNSW-SGASYCTSEIAILKNNEVHTWERGYDEDGNQVWGTKEGPYEFKPVPA 237
Query: 241 SSYSDMFSPLNFPPQQFLEKRI-EGSFVLQ 269
SS DMFSPLNF Q +EKRI EGSFVLQ
Sbjct: 238 SSLKDMFSPLNF-SQPSMEKRILEGSFVLQ 266
>gi|115485691|ref|NP_001067989.1| Os11g0524300 [Oryza sativa Japonica Group]
gi|77551200|gb|ABA93997.1| expressed protein [Oryza sativa Japonica Group]
gi|113645211|dbj|BAF28352.1| Os11g0524300 [Oryza sativa Japonica Group]
gi|215741068|dbj|BAG97563.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218185839|gb|EEC68266.1| hypothetical protein OsI_36301 [Oryza sativa Indica Group]
Length = 275
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 176/229 (76%), Positives = 202/229 (88%), Gaps = 1/229 (0%)
Query: 42 SKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWR 101
S TRWDHAR V D+LSGEKFS+EQA +DPDN+FN+RMLTCPA EMVDGS++LY EQAFWR
Sbjct: 47 STTRWDHARAVVDALSGEKFSREQARKDPDNYFNLRMLTCPATEMVDGSRVLYFEQAFWR 106
Query: 102 TPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGE 161
+P+KPFRQRFYMVKPCPK++KCDVE+SSYAIR+VEEYKNFCDRP+DQRP PEEVI DI E
Sbjct: 107 SPEKPFRQRFYMVKPCPKDMKCDVELSSYAIRDVEEYKNFCDRPKDQRPQPEEVIADIAE 166
Query: 162 HLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDD 221
HLTTIHL RC+RGKRCLY+GSTPP GFPNSW +GATY TS+L++ KN E+H WD+GFDDD
Sbjct: 167 HLTTIHLSRCERGKRCLYKGSTPPEGFPNSW-SGATYCTSDLSIHKNGEVHIWDKGFDDD 225
Query: 222 GNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
GNQVWG KAGPYEFKPAP S+Y DMFSPLNF LEK+IE SF + +
Sbjct: 226 GNQVWGTKAGPYEFKPAPKSNYDDMFSPLNFSAPLTLEKKIESSFAIDD 274
>gi|125577323|gb|EAZ18545.1| hypothetical protein OsJ_34073 [Oryza sativa Japonica Group]
Length = 275
Score = 389 bits (1000), Expect = e-106, Method: Compositional matrix adjust.
Identities = 176/227 (77%), Positives = 201/227 (88%), Gaps = 1/227 (0%)
Query: 42 SKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWR 101
S TRWDHAR V D+LSGEKFS+EQA +DPDN+FN+RMLTCPA EMVDGS++LY EQAFWR
Sbjct: 47 STTRWDHARAVVDALSGEKFSREQARKDPDNYFNLRMLTCPATEMVDGSRVLYFEQAFWR 106
Query: 102 TPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGE 161
+P+KPFRQRFYMVKPCPK++KCDVE+SSYAIR+VEEYKNFCDRP+DQRP PEEVI DI E
Sbjct: 107 SPEKPFRQRFYMVKPCPKDMKCDVELSSYAIRDVEEYKNFCDRPKDQRPQPEEVIADIAE 166
Query: 162 HLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDD 221
HLTTIHL RC+RGKRCLY+GSTPP GFPNSW +GATY TS+L++ KN E+H WD+GFDDD
Sbjct: 167 HLTTIHLSRCERGKRCLYKGSTPPEGFPNSW-SGATYCTSDLSIHKNGEVHIWDKGFDDD 225
Query: 222 GNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQFLEKRIEGSFVL 268
GNQVWG KAGPYEFKPAP S+Y DMFSPLNF LEK+IE SF +
Sbjct: 226 GNQVWGTKAGPYEFKPAPKSNYDDMFSPLNFSAPLTLEKKIESSFAI 272
>gi|293331841|ref|NP_001168901.1| uncharacterized protein LOC100382707 [Zea mays]
gi|223973587|gb|ACN30981.1| unknown [Zea mays]
Length = 266
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 171/229 (74%), Positives = 199/229 (86%), Gaps = 1/229 (0%)
Query: 42 SKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWR 101
S TRWDHAR VAD+LSGEKFS+EQA +DPDNFFN+RMLTCPA EMVDGS++LY EQAFWR
Sbjct: 38 STTRWDHARAVADALSGEKFSREQARKDPDNFFNLRMLTCPATEMVDGSRVLYFEQAFWR 97
Query: 102 TPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGE 161
+P+KPFRQRFYMVKPCPKE+KCDVE+SSYAIR+ EEYKNFCDR +DQRP EEVI DI E
Sbjct: 98 SPEKPFRQRFYMVKPCPKEMKCDVELSSYAIRDAEEYKNFCDRQKDQRPQAEEVIADIAE 157
Query: 162 HLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDD 221
HLTTIHL RC RGKRCLYEGSTPP GFPN+W +GA+Y TS+L++ KN E+H WD+GFDD+
Sbjct: 158 HLTTIHLSRCGRGKRCLYEGSTPPEGFPNNW-SGASYCTSDLSIHKNGEVHIWDKGFDDE 216
Query: 222 GNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
GNQVWG KAGPYEFKPAP S Y DMFSPLNF LEK+++ ++V+ +
Sbjct: 217 GNQVWGTKAGPYEFKPAPKSKYDDMFSPLNFSAPLSLEKKLDKAYVIDD 265
>gi|242065282|ref|XP_002453930.1| hypothetical protein SORBIDRAFT_04g021600 [Sorghum bicolor]
gi|241933761|gb|EES06906.1| hypothetical protein SORBIDRAFT_04g021600 [Sorghum bicolor]
Length = 266
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 170/229 (74%), Positives = 197/229 (86%), Gaps = 1/229 (0%)
Query: 42 SKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWR 101
S TRWDHAR VAD+LSGEKFS+EQA +DPDNFFN+RMLTCPA E VDGS++LY E AFWR
Sbjct: 38 STTRWDHARAVADALSGEKFSREQARKDPDNFFNLRMLTCPATETVDGSRVLYFEHAFWR 97
Query: 102 TPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGE 161
+P+KPFRQRFYMVKPCPKE+KCDVE+SSYAIR+ EEYKNFCDR +DQRP PEEVI DI E
Sbjct: 98 SPEKPFRQRFYMVKPCPKEMKCDVELSSYAIRDAEEYKNFCDRQKDQRPQPEEVIADIAE 157
Query: 162 HLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDD 221
HLTTIHL RC RGKRCLYEGSTPP GFPN+W NGA+Y TS+L++ KN E+H WD+GFDD+
Sbjct: 158 HLTTIHLSRCGRGKRCLYEGSTPPEGFPNNW-NGASYCTSDLSIHKNGEVHIWDKGFDDE 216
Query: 222 GNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
GNQVWG K GPYEFKPAP S Y DMFSPLNF LEK+++ ++V+ +
Sbjct: 217 GNQVWGTKVGPYEFKPAPKSKYDDMFSPLNFSAPLSLEKKLDKAYVIDD 265
>gi|242068607|ref|XP_002449580.1| hypothetical protein SORBIDRAFT_05g019560 [Sorghum bicolor]
gi|241935423|gb|EES08568.1| hypothetical protein SORBIDRAFT_05g019560 [Sorghum bicolor]
Length = 265
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 171/229 (74%), Positives = 198/229 (86%), Gaps = 2/229 (0%)
Query: 42 SKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWR 101
S TRWDHAR VAD+LSGEKFS+EQA +DPDNFFN+RMLTCPA EMVDGS++LY EQAFWR
Sbjct: 38 STTRWDHARAVADALSGEKFSREQARKDPDNFFNLRMLTCPATEMVDGSRVLYFEQAFWR 97
Query: 102 TPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGE 161
+P+KPFRQRFYMVKPCPKE+KCDVE+SSYAIR+ EEYKNFCDR +DQRP PEEVI DI E
Sbjct: 98 SPEKPFRQRFYMVKPCPKEMKCDVELSSYAIRDAEEYKNFCDRQKDQRPQPEEVIADIAE 157
Query: 162 HLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDD 221
HLTTIHL RC RGKRCLYEGSTPP GFPN+W +GA+Y TS+L++ KN E+H WD+GFDD+
Sbjct: 158 HLTTIHLSRCGRGKRCLYEGSTPPEGFPNNW-SGASYCTSDLSIHKNGEVHIWDKGFDDE 216
Query: 222 GNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
GNQVWG K GPYEFKPAP S Y DMFSPLNF LEK ++ ++V+ +
Sbjct: 217 GNQVWGTKVGPYEFKPAPKSKYDDMFSPLNFSAPLSLEK-LDKAYVIDD 264
>gi|294463077|gb|ADE77076.1| unknown [Picea sitchensis]
Length = 264
Score = 356 bits (913), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 163/258 (63%), Positives = 207/258 (80%), Gaps = 3/258 (1%)
Query: 11 PNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDP 70
P+ RA+ +F+KA ++GGA+L+++LT+ T DHARIVAD+L+GEKFS EQASRDP
Sbjct: 7 PSSSGRSRARNVFIKALCVVGGALLVRKLTQPTT--DHARIVADALTGEKFSSEQASRDP 64
Query: 71 DNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSY 130
N+FN+RML CPA M DGS++LY EQAFWRTP+KP+RQRFY+VKPC KE+KCDVEV+SY
Sbjct: 65 MNYFNLRMLVCPATVMADGSRVLYFEQAFWRTPEKPYRQRFYVVKPCHKEMKCDVEVNSY 124
Query: 131 AIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPN 190
A+R+VEEY+NFC+RP+ QRP EEV D+ +HLTT++L RC+RGKRCLYEGSTPPGGFPN
Sbjct: 125 AVRDVEEYRNFCERPKGQRPQAEEVERDVAKHLTTVYLSRCERGKRCLYEGSTPPGGFPN 184
Query: 191 SWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSYSDMFSPL 250
SW NGAT+ TS+L + KN E+H WDR +DD+GNQVWGV GPYEFK A +SS + F PL
Sbjct: 185 SW-NGATHCTSDLTIYKNGEVHCWDRAYDDEGNQVWGVTEGPYEFKSATTSSSNGSFFPL 243
Query: 251 NFPPQQFLEKRIEGSFVL 268
+F +K + SFV+
Sbjct: 244 DFSSISLGKKLDDKSFVV 261
>gi|302771463|ref|XP_002969150.1| hypothetical protein SELMODRAFT_170435 [Selaginella moellendorffii]
gi|300163655|gb|EFJ30266.1| hypothetical protein SELMODRAFT_170435 [Selaginella moellendorffii]
Length = 270
Score = 350 bits (898), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 153/225 (68%), Positives = 192/225 (85%), Gaps = 1/225 (0%)
Query: 20 QGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRML 79
+ +KAA L+GG +LL++LTK+KTRWDH R+VAD+L+GEKFS+EQA+RDP +FN+RML
Sbjct: 34 RSFVLKAACLLGGVLLLRKLTKAKTRWDHTRLVADALTGEKFSQEQAARDPMTYFNLRML 93
Query: 80 TCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYK 139
CPA + DG+K+LY EQAFWRTP KP+RQRFY+V+PCPKE+KCDVEV SYA+R++EEYK
Sbjct: 94 ACPATVLDDGAKVLYFEQAFWRTPDKPYRQRFYVVRPCPKEMKCDVEVGSYAVRDIEEYK 153
Query: 140 NFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYS 199
NFC+RP+DQRP PEE+ GDI EHLT+++L C RG+RCLYEGSTPPGGFPN+W NGA+
Sbjct: 154 NFCERPKDQRPQPEEIPGDISEHLTSVYLSACARGQRCLYEGSTPPGGFPNNW-NGASRC 212
Query: 200 TSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSYS 244
TSEL +LK+ EIH WDR +DD+GNQVWGV+ GPYEFKP S + S
Sbjct: 213 TSELTILKSGEIHCWDRAYDDEGNQVWGVRQGPYEFKPGTSKNRS 257
>gi|380356234|dbj|BAL72655.1| CRUMPLED LEAF1 [Physcomitrella patens subsp. patens]
Length = 314
Score = 349 bits (896), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 152/222 (68%), Positives = 188/222 (84%), Gaps = 1/222 (0%)
Query: 19 AQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRM 78
+GL +KAA LIGGA LL++LTKS TRWDHAR VA SLSGEKFS EQA+RDP +FN+R+
Sbjct: 87 VRGLVIKAACLIGGAFLLRKLTKSTTRWDHARKVAQSLSGEKFSTEQAARDPTTYFNLRL 146
Query: 79 LTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEY 138
LTCPA + DG++++Y EQAFWRTP++P+RQRFY +KPCPKE+KCDVEV SYA+R++EEY
Sbjct: 147 LTCPATVLADGARVMYFEQAFWRTPERPYRQRFYSIKPCPKEMKCDVEVGSYAVRDIEEY 206
Query: 139 KNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATY 198
KNFCDR +D+RP P+EV+ D+ EHL T++L C+RG+RCLYEGSTPPGGFPNSW NGA+
Sbjct: 207 KNFCDRSKDERPQPDEVLKDMAEHLNTVYLSVCERGRRCLYEGSTPPGGFPNSW-NGASR 265
Query: 199 STSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
TSEL + KN E+H WDR +DD+GNQVWGV+ GPYEFK A S
Sbjct: 266 CTSELTIYKNGEVHCWDRAYDDEGNQVWGVRQGPYEFKTATS 307
>gi|168022710|ref|XP_001763882.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162684887|gb|EDQ71286.1| predicted protein [Physcomitrella patens subsp. patens]
gi|380356236|dbj|BAL72656.1| CRUMPLED LEAF2 [Physcomitrella patens subsp. patens]
Length = 342
Score = 349 bits (896), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 151/224 (67%), Positives = 188/224 (83%), Gaps = 1/224 (0%)
Query: 19 AQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRM 78
+GL VKAA LIGGA LL++LTK+ TRWDHAR+V +LSGEKFS EQA+RDP +FN+R
Sbjct: 92 VRGLVVKAACLIGGAFLLRKLTKTTTRWDHARMVVQALSGEKFSTEQAARDPMTYFNLRF 151
Query: 79 LTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEY 138
LTCPA + DG+ ++Y EQAFWRTP++P+RQRFY VKPCPKE+KCDVEV SYA+R++E+Y
Sbjct: 152 LTCPATVLADGAHVMYFEQAFWRTPERPYRQRFYNVKPCPKEMKCDVEVGSYAVRDIEDY 211
Query: 139 KNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATY 198
+NFCDRP+DQRP P+EV+ D+ EHL T++L C+RG+RCLYEGSTPP GFPNSW NGA+
Sbjct: 212 RNFCDRPKDQRPQPDEVLKDVAEHLNTVYLSACERGRRCLYEGSTPPEGFPNSW-NGASR 270
Query: 199 STSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSS 242
TSEL + KN E+H WDR +D++GNQVWGV+ GPYEFKPA SS
Sbjct: 271 CTSELTIYKNGEVHCWDRAYDNEGNQVWGVRQGPYEFKPATQSS 314
>gi|168031997|ref|XP_001768506.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680219|gb|EDQ66657.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 277
Score = 347 bits (889), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 153/223 (68%), Positives = 189/223 (84%), Gaps = 2/223 (0%)
Query: 19 AQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNI-R 77
+GL +KAA LIGGA LL++LTKS TRWDHAR VA SLSGEKFS EQA+RDP +FN+ R
Sbjct: 49 VRGLVIKAACLIGGAFLLRKLTKSTTRWDHARKVAQSLSGEKFSTEQAARDPTTYFNLSR 108
Query: 78 MLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEE 137
+LTCPA + DG++++Y EQAFWRTP++P+RQRFY +KPCPKE+KCDVEVSSYA+R++EE
Sbjct: 109 LLTCPATVLADGARVMYFEQAFWRTPERPYRQRFYSIKPCPKEMKCDVEVSSYAVRDIEE 168
Query: 138 YKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGAT 197
YKNFCDR +D+RP P+EV+ D+ EHL T++L C+RG+RCLYEGSTPPGGFPNSW NGA+
Sbjct: 169 YKNFCDRSKDERPQPDEVLKDMAEHLNTVYLSVCERGRRCLYEGSTPPGGFPNSW-NGAS 227
Query: 198 YSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
TSEL + KN E+H WDR +DD+GNQVWGV+ GPYEFK A S
Sbjct: 228 RCTSELTIYKNGEVHCWDRAYDDEGNQVWGVRQGPYEFKTATS 270
>gi|449523435|ref|XP_004168729.1| PREDICTED: chromophore lyase CpcT/CpeT 3-like, partial [Cucumis
sativus]
Length = 191
Score = 332 bits (851), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 154/193 (79%), Positives = 174/193 (90%), Gaps = 3/193 (1%)
Query: 78 MLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEE 137
M+TCPAAEMVDGS +LY EQAF RTPQKPFRQRFY VKPC KE+KCDVE+SSYAIRE+EE
Sbjct: 1 MITCPAAEMVDGSTVLYFEQAFSRTPQKPFRQRFYTVKPCSKEMKCDVELSSYAIREMEE 60
Query: 138 YKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGAT 197
YKNFCDR +DQRPLPEE++GDI EHLTT+HL+RCDRGKRCLYEGSTPPGGFPNSW +GA+
Sbjct: 61 YKNFCDRTKDQRPLPEEIVGDIAEHLTTLHLKRCDRGKRCLYEGSTPPGGFPNSW-SGAS 119
Query: 198 YSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQF 257
Y TSE+A+LKNNE+HTW+RG+D+DGNQVWG K GPYEFKP P+SS DMFSPLNF Q
Sbjct: 120 YCTSEIAILKNNEVHTWERGYDEDGNQVWGTKEGPYEFKPVPASSLKDMFSPLNF-SQPS 178
Query: 258 LEKRI-EGSFVLQ 269
+EKRI EGSFVLQ
Sbjct: 179 MEKRILEGSFVLQ 191
>gi|302784280|ref|XP_002973912.1| hypothetical protein SELMODRAFT_100123 [Selaginella moellendorffii]
gi|300158244|gb|EFJ24867.1| hypothetical protein SELMODRAFT_100123 [Selaginella moellendorffii]
Length = 270
Score = 330 bits (845), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 152/225 (67%), Positives = 191/225 (84%), Gaps = 1/225 (0%)
Query: 20 QGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRML 79
+ +KA L+GG +LL++LTK+KTRWDH R+VAD+L+GEKFS+EQA+RDP +FN+RML
Sbjct: 34 RSFVLKAVCLLGGVLLLRKLTKAKTRWDHTRLVADALTGEKFSQEQAARDPMTYFNLRML 93
Query: 80 TCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYK 139
CPA + DG+K+LY EQAFWRTP KP+RQRFY+V+PCPKE+KCDVEV SYA+R++EEYK
Sbjct: 94 ACPATVLDDGAKVLYFEQAFWRTPDKPYRQRFYVVRPCPKEMKCDVEVGSYAVRDIEEYK 153
Query: 140 NFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYS 199
NFC+RP+DQRP PEE+ GDI EHLT+++L C RG+RCLYEGSTPPGGFPN+W NGA+
Sbjct: 154 NFCERPKDQRPQPEEIPGDISEHLTSVYLSACARGQRCLYEGSTPPGGFPNNW-NGASRC 212
Query: 200 TSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSYS 244
TSEL +LK+ EIH WDR +DD+GNQVWGV+ GPYEFKP S + S
Sbjct: 213 TSELTILKSGEIHCWDRAYDDEGNQVWGVRQGPYEFKPGTSKNRS 257
>gi|387169542|gb|AFJ66202.1| hypothetical protein 7G9.10 [Boechera stricta]
Length = 197
Score = 256 bits (653), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 142/269 (52%), Positives = 163/269 (60%), Gaps = 77/269 (28%)
Query: 2 GPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKF 61
G D +S+SN GW RA+GL VK VLIGGA+L+KRLTKS TR DHAR+V+ SL+GEK
Sbjct: 6 GSDPESSSN---GWS-RARGLVVKTLVLIGGALLIKRLTKSTTRRDHARVVSRSLTGEK- 60
Query: 62 SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
AFWRTPQKPFRQR YMVKPCPKEL
Sbjct: 61 ------------------------------------AFWRTPQKPFRQRLYMVKPCPKEL 84
Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
KCDVEVSSYAIR+ EEYKNFCDRP+DQRPLPEEVIG
Sbjct: 85 KCDVEVSSYAIRDAEEYKNFCDRPKDQRPLPEEVIG------------------------ 120
Query: 182 STPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSS 241
NGA+YS S+L VLKNNEIH WDRGFD++ NQVWG K GPYEFKPA SS
Sbjct: 121 ------------NGASYSISDLVVLKNNEIHLWDRGFDENRNQVWGPKQGPYEFKPATSS 168
Query: 242 SYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
S ++ S LN Q ++K I+GS +LQ+
Sbjct: 169 SVNENLSALNILYQSSIDKPIQGSLILQD 197
>gi|388506300|gb|AFK41216.1| unknown [Lotus japonicus]
Length = 136
Score = 247 bits (631), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 113/136 (83%), Positives = 124/136 (91%)
Query: 135 VEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQN 194
+EEYKNFCDRP+DQRP PEEVIGDI EHLTTIHL+RC RGKRCLYEGST PGGFPNSWQN
Sbjct: 1 MEEYKNFCDRPKDQRPQPEEVIGDIAEHLTTIHLKRCPRGKRCLYEGSTSPGGFPNSWQN 60
Query: 195 GATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPP 254
GATY TSELAVLKNNEIHTWDRGFDDDGNQVWG K GPYEFKPAP+S ++DM SPLNFPP
Sbjct: 61 GATYCTSELAVLKNNEIHTWDRGFDDDGNQVWGQKDGPYEFKPAPTSCFNDMLSPLNFPP 120
Query: 255 QQFLEKRIEGSFVLQE 270
+++RIEGSF+LQ+
Sbjct: 121 PPSMDRRIEGSFILQD 136
>gi|387169574|gb|AFJ66233.1| hypothetical protein 34G24.30 [Capsella rubella]
Length = 186
Score = 229 bits (585), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 130/269 (48%), Positives = 153/269 (56%), Gaps = 89/269 (33%)
Query: 2 GPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKF 61
G D +S+SN GW RA+GL +K VL+GGA+L+KRLTKS TR DHAR+V+ SL+GEK
Sbjct: 7 GSDPESSSN---GWS-RARGLVIKTLVLVGGALLIKRLTKSTTRRDHARVVSRSLTGEK- 61
Query: 62 SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
R YMVKPCPKEL
Sbjct: 62 ------------------------------------------------RLYMVKPCPKEL 73
Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
KCDVEVSSYAIR+ EEYKNFCDRP+DQRPLPEEVIG
Sbjct: 74 KCDVEVSSYAIRDAEEYKNFCDRPKDQRPLPEEVIG------------------------ 109
Query: 182 STPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSS 241
NGA+YSTS+LA+LKNNEIH WDRGFD++ NQVWG K GPYEFKPA SS
Sbjct: 110 ------------NGASYSTSDLAILKNNEIHLWDRGFDENRNQVWGPKQGPYEFKPATSS 157
Query: 242 SYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
S S+ S LN Q ++K I+GS +LQ+
Sbjct: 158 SVSENLSALNILYQSSIDKPIQGSLILQD 186
>gi|413920957|gb|AFW60889.1| hypothetical protein ZEAMMB73_192264 [Zea mays]
Length = 168
Score = 211 bits (537), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 94/116 (81%), Positives = 106/116 (91%)
Query: 42 SKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWR 101
S TRWDHAR VAD+LSGEKFS+EQA +DPDNFFN+RMLTCPA EMVDGS++LY EQAFWR
Sbjct: 38 STTRWDHARAVADALSGEKFSREQARKDPDNFFNLRMLTCPATEMVDGSRVLYFEQAFWR 97
Query: 102 TPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
+P+KPFRQRFYMVKPCPKE+KCDVE+SSYAIR+ EEYKNFCDR +DQRP EEVI
Sbjct: 98 SPEKPFRQRFYMVKPCPKEMKCDVELSSYAIRDAEEYKNFCDRQKDQRPQAEEVIA 153
>gi|387169526|gb|AFJ66187.1| hypothetical protein 11M19.7 [Arabidopsis halleri]
Length = 245
Score = 191 bits (485), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 118/259 (45%), Positives = 133/259 (51%), Gaps = 112/259 (43%)
Query: 2 GPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKF 61
G D +S+SN GW RA+GL VK VLIGGA+L+KRLTKS TR DHAR+V+ SL+GEK
Sbjct: 6 GSDPESSSN---GWS-RARGLVVKTLVLIGGALLIKRLTKSTTRRDHARVVSRSLTGEK- 60
Query: 62 SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFR------------- 108
AFWRTPQKPFR
Sbjct: 61 ------------------------------------AFWRTPQKPFRQVCNHSFPDYLLI 84
Query: 109 ----------------------QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPR 146
QR YMVKPCPKELKCDVEVSSYAIR+ EEYKNFCDRP+
Sbjct: 85 DSIREVYYCVAEVLCIVCLFWLQRLYMVKPCPKELKCDVEVSSYAIRDAEEYKNFCDRPK 144
Query: 147 DQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVL 206
DQRPLPEEVIG NGA+Y TS+LAVL
Sbjct: 145 DQRPLPEEVIG------------------------------------NGASYCTSDLAVL 168
Query: 207 KNNEIHTWDRGFDDDGNQV 225
KN+EIH WDRGFD++ NQV
Sbjct: 169 KNSEIHLWDRGFDENRNQV 187
>gi|428226391|ref|YP_007110488.1| hypothetical protein GEI7407_2966 [Geitlerinema sp. PCC 7407]
gi|427986292|gb|AFY67436.1| protein of unknown function DUF1001 [Geitlerinema sp. PCC 7407]
Length = 400
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 80/256 (31%), Positives = 125/256 (48%), Gaps = 30/256 (11%)
Query: 21 GLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLT 80
G A L GA+ + ++ + A VAD L G + QA ++RM+T
Sbjct: 7 GALGMAIALQMGAIAPPSIAQAAPLEEQATEVADRLVGTLSTVHQALTS-RRVADVRMVT 65
Query: 81 CPA----AEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVE 136
CP A G LY EQA P++P+RQRF + P P VE +++ E+
Sbjct: 66 CPVTVTDAAPAPGVSFLYQEQAIALNPEQPYRQRFLRIAPYPASQS--VESTAWKPPELA 123
Query: 137 EYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGA 196
+ C +P QR + + D+G L ++L+ G R + G+TPP G P++++ GA
Sbjct: 124 PWVGLCQKPLAQR---QVRLADLGSPLCQVYLKPA--GDR--FVGATPPEGCPSNFR-GA 175
Query: 197 TYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFK---PAPSSSYSDMFSPLNFP 253
T T+++ VL+ + + TWDRGFD G QVWG + Y+F+ PA + D +
Sbjct: 176 TRVTNQI-VLEADRMETWDRGFDAAGQQVWGAQDESYQFRRLDPASRDAEVDAIA----- 229
Query: 254 PQQFLEKRIEGSFVLQ 269
+R+ G+FV Q
Sbjct: 230 ------RRLHGAFVAQ 239
>gi|218441065|ref|YP_002379394.1| hypothetical protein PCC7424_4155 [Cyanothece sp. PCC 7424]
gi|218173793|gb|ACK72526.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
Length = 212
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 98/191 (51%), Gaps = 15/191 (7%)
Query: 48 HARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYL--EQAFWRTPQK 105
H V L G + Q +++P N N+RM TC + G+ +YL EQA +T K
Sbjct: 29 HINSVVSHLVGVMDTSAQVAKNP-NKANVRMTTCQVT-LTGGNDSIYLYQEQALTKTLDK 86
Query: 106 PFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTT 165
P+RQRF +KP + VE SY + E FC++P +R L + ++GE + +
Sbjct: 87 PYRQRFLEIKPTLEPET--VESKSYKPLQAERLIGFCNKPLSERVLN---VSELGEPVCS 141
Query: 166 IHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQV 225
+ L+ + G Y G T P G P + + T + + L L + + TWD+G+D GNQV
Sbjct: 142 VFLKPSNNG----YLGETQPEGCPANVRGAVTITNTIL--LHSEGMDTWDKGYDAQGNQV 195
Query: 226 WGVKAGPYEFK 236
WG P+EF+
Sbjct: 196 WGANDDPFEFR 206
>gi|411119057|ref|ZP_11391437.1| protein of unknown function DUF1001, CpeT/CpcT family
[Oscillatoriales cyanobacterium JSC-12]
gi|410710920|gb|EKQ68427.1| protein of unknown function DUF1001, CpeT/CpcT family
[Oscillatoriales cyanobacterium JSC-12]
Length = 223
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 100/194 (51%), Gaps = 18/194 (9%)
Query: 48 HARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSK-----LLYLEQAFWRT 102
A+ VA L G + QA +P +++M TC +++D + LY EQA
Sbjct: 35 QAKKVATLLEGVMDTTAQARVNPK-APSVQMTTC-RVQVIDTNNHTSAIFLYQEQALTSK 92
Query: 103 PQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEH 162
+P+RQRF + P P+ V S+ + + E + FC++P QR L D+G
Sbjct: 93 LSQPYRQRFLEISPHPETQT--VRSRSFRLTQPERWAGFCNQPDAQRTLQSR---DLGNP 147
Query: 163 LTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDG 222
+ ++ L++ G + G TPP G P + + GA T+ + L ++TWDRGFD DG
Sbjct: 148 VCSVFLKQTPTG----FLGETPPEGCPTNAR-GAVRITNTIE-LNTAGMNTWDRGFDADG 201
Query: 223 NQVWGVKAGPYEFK 236
NQVWG ++ Y+F+
Sbjct: 202 NQVWGAQSESYQFR 215
>gi|425465472|ref|ZP_18844781.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389832277|emb|CCI24241.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 202
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 97/199 (48%), Gaps = 13/199 (6%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
LT+S + VA+ L G ++EQA +P ++M TC S LY EQ
Sbjct: 12 LTQSLPLNPQVQGVANHLIGVMDTREQAQTNP-RIAKVQMTTCAVDFSPKQDSIYLYQEQ 70
Query: 98 AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
A +P+RQR +++P P VE +Y + + NFC++ +R L +
Sbjct: 71 AIIDRLNQPYRQRILVIQPSPDN--STVESKAYKLNNAPNFINFCNKDLTERKLN---VS 125
Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRG 217
D+ E + T+ L+ G Y G TPP G P + + GA T+ + +L + + T DRG
Sbjct: 126 DLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDTSDRG 179
Query: 218 FDDDGNQVWGVKAGPYEFK 236
+D G QVWG + Y+F+
Sbjct: 180 YDSLGQQVWGAQDNVYQFR 198
>gi|427419543|ref|ZP_18909726.1| protein of unknown function DUF1001, CpeT/CpcT family [Leptolyngbya
sp. PCC 7375]
gi|425762256|gb|EKV03109.1| protein of unknown function DUF1001, CpeT/CpcT family [Leptolyngbya
sp. PCC 7375]
Length = 214
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 65/194 (33%), Positives = 92/194 (47%), Gaps = 21/194 (10%)
Query: 48 HARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSK-----LLYLEQAFWRT 102
H V D L+ + EQA R+P F ++M TCP V GS LY EQA
Sbjct: 33 HVDAVVDHLTRPMDTTEQAERNP-RFVGVQMTTCPI--QVTGSADQYGIYLYQEQALTAQ 89
Query: 103 PQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEH 162
+ P+RQRF + + VE ++ E + C Q+ P +GE
Sbjct: 90 IESPYRQRFLYITLSKDATR--VESHTFKPPSPEAWTGLC-----QQATPSIDTHALGEL 142
Query: 163 LTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDG 222
+ + LR G Y GSTP G P + + GA T+ + VL + + TWDRGFD +G
Sbjct: 143 VCVVSLRPSTLG----YVGSTPAEGCPVNLR-GAVRLTNTI-VLHQDGMDTWDRGFDANG 196
Query: 223 NQVWGVKAGPYEFK 236
QVWG +A PY+++
Sbjct: 197 EQVWGAEADPYQYR 210
>gi|425470697|ref|ZP_18849557.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9701]
gi|389883514|emb|CCI36065.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
9701]
Length = 202
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 96/199 (48%), Gaps = 13/199 (6%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
LT+S + VA+ L G + +QA +P ++M TC S LY EQ
Sbjct: 12 LTQSLLLNPQVQGVANHLIGVMDTTQQAQTNP-RIAKVQMTTCAVDFSPKQDSIYLYQEQ 70
Query: 98 AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
A +P+RQR +++P P VE +Y + + NFC++ +R L +
Sbjct: 71 AIIDRLNQPYRQRILVIQPSPDN--STVESKAYKLNNAPNFINFCNKDLTERKLN---VS 125
Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRG 217
D+ E + T+ L+ G Y G TPP G P + + GA T+ + +L + + T DRG
Sbjct: 126 DLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDTSDRG 179
Query: 218 FDDDGNQVWGVKAGPYEFK 236
+D G QVWG + Y+F+
Sbjct: 180 YDSLGRQVWGAQDNVYQFR 198
>gi|425439758|ref|ZP_18820073.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389719933|emb|CCH96295.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 202
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 99/200 (49%), Gaps = 15/200 (7%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMV--DGSKLLYLE 96
LT+S + VA+ L G + +QA +P ++M+TC A + S LY E
Sbjct: 12 LTQSLPLNPQVQGVANHLIGVMDTTQQAQTNP-RIAKVQMITC-AVDFSPKQDSIYLYQE 69
Query: 97 QAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVI 156
QA +P+RQR +++P P VE +Y + + NFC++ +R L +
Sbjct: 70 QAIIDRLNQPYRQRILVIQPSPDN--STVESKAYKLNNAPNFINFCNKDLTERRLN---V 124
Query: 157 GDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDR 216
D+ E + T+ L+ G Y G TPP G P + + GA T+ + +L + + T DR
Sbjct: 125 SDLAESVCTVFLKPIAGG----YRGETPPQGCPTNVR-GAVKITNTI-ILHSQGMDTSDR 178
Query: 217 GFDDDGNQVWGVKAGPYEFK 236
G+D G QVWG + Y+F+
Sbjct: 179 GYDSLGQQVWGAQDNFYQFR 198
>gi|422304440|ref|ZP_16391785.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9806]
gi|389790410|emb|CCI13706.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9806]
Length = 202
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 98/200 (49%), Gaps = 15/200 (7%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMV--DGSKLLYLE 96
LT+S + VA+ L G + +QA +P ++M TC A + S LY E
Sbjct: 12 LTQSLPLNPQVQGVANHLIGVMDTTQQAQTNP-RIAKVQMTTC-AVDFSPKQDSIYLYQE 69
Query: 97 QAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVI 156
QA +P+RQR +++P P VE +Y + + NFC++ +R L +
Sbjct: 70 QAIIDRLNQPYRQRILVIQPSPDN--STVESKAYKLNNAANFINFCNKNLTERKLN---V 124
Query: 157 GDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDR 216
D+ E + T+ L+ G Y G TPP G P + + GA T+ + +L + + T DR
Sbjct: 125 SDLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDTSDR 178
Query: 217 GFDDDGNQVWGVKAGPYEFK 236
G+D G QVWG + Y+F+
Sbjct: 179 GYDSLGQQVWGARDNFYQFR 198
>gi|425435450|ref|ZP_18815901.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|389679974|emb|CCH91262.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
Length = 202
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 63/203 (31%), Positives = 99/203 (48%), Gaps = 21/203 (10%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSK-----LL 93
LT+S + VA+ L G ++EQA +P + ++M TC VD S L
Sbjct: 12 LTQSLPLNPQVQGVANHLIGVMDTREQAQTNP-SIAKVQMTTCA----VDFSPKQDNIYL 66
Query: 94 YLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPE 153
Y EQA +P+RQR +++P VE +Y + + NFC++ +R L
Sbjct: 67 YQEQAIIDRLNQPYRQRILVIQPSADN--STVESKAYKLNNAPNFINFCNKDLTERKLN- 123
Query: 154 EVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHT 213
+ D+ E + T+ L+ G Y G TPP G P + + GA T+ + +L + + T
Sbjct: 124 --VSDLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILNSQGMDT 175
Query: 214 WDRGFDDDGNQVWGVKAGPYEFK 236
DRG+D G QVWG + Y+F+
Sbjct: 176 SDRGYDSLGQQVWGAQDNVYQFR 198
>gi|390440170|ref|ZP_10228521.1| Genome sequencing data, contig C295 [Microcystis sp. T1-4]
gi|389836454|emb|CCI32647.1| Genome sequencing data, contig C295 [Microcystis sp. T1-4]
Length = 202
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 96/199 (48%), Gaps = 13/199 (6%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
LT+S + VA+ L G + +QA +P ++M TC S LY EQ
Sbjct: 12 LTQSLPLNPQVQGVANHLIGVMDTTQQAQTNP-RIAKVQMTTCAVDFSPKQDSIYLYQEQ 70
Query: 98 AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
A +P+RQR +++P P VE +Y + + NFC++ +R L +
Sbjct: 71 AIIDRLNQPYRQRILVIQPSPDN--STVESKAYKLNNAANFINFCNKDLTERKLN---VS 125
Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRG 217
D+GE + T+ L+ G Y G TPP G + + GA T+ + +L + + T DRG
Sbjct: 126 DLGESVCTVFLKPIAGG----YRGETPPQGCSTNAR-GAVKITNTI-ILHSQGMDTSDRG 179
Query: 218 FDDDGNQVWGVKAGPYEFK 236
+D G QVWG + Y+F+
Sbjct: 180 YDSLGRQVWGAQDNVYQFR 198
>gi|428217847|ref|YP_007102312.1| hypothetical protein Pse7367_1595 [Pseudanabaena sp. PCC 7367]
gi|427989629|gb|AFY69884.1| protein of unknown function DUF1001 [Pseudanabaena sp. PCC 7367]
Length = 427
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 98/194 (50%), Gaps = 18/194 (9%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVD-----GSKLLYLEQAFWRTPQKP 106
VAD L G + A + D ++RM TC E++D LY EQA ++P
Sbjct: 48 VADYLIGAMDTSAAAIANRD-VADVRMTTC-VVEVIDREPKESVIFLYQEQAISTKLEQP 105
Query: 107 FRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTI 166
+RQRF + P K+ K VE S+ + + C+R + R + + ++GE L ++
Sbjct: 106 YRQRFLRIAPS-KDGK-QVESRSFRPAVSDRWIGLCNREPEARQVSQ---AELGEALCSV 160
Query: 167 HL----RRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDG 222
L R G Y GSTPP G P +++ GA T+++ +L + E+ TWDRGFD G
Sbjct: 161 FLSLDAERDQYGNYYRYIGSTPPEGCPTNFR-GAVKITNKI-LLTDGEMETWDRGFDASG 218
Query: 223 NQVWGVKAGPYEFK 236
QVWG Y+FK
Sbjct: 219 KQVWGADDSSYKFK 232
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 89/192 (46%), Gaps = 17/192 (8%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTC-----PAAEMVDGSKLLYLEQAFWRTPQKP 106
+A LSG+ + EQ DP F +R C P G++++ LEQA K
Sbjct: 246 IAAMLSGKFNNAEQQQSDP-TFLPVRFNNCVVNIEPDELFPTGTQVMVLEQAANSPELKF 304
Query: 107 FRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTI 166
QR + P + +Y + + ++ +FCDRP + L IG +T
Sbjct: 305 ASQRVAHLYRLPAP-DNSFRMVTYKLVD-GDFADFCDRPASGKVLTATQIGQRECQIT-- 360
Query: 167 HLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVW 226
+ + D Y G+TP G P+ ++ G+TY T + + L ++++ W+R +D QV
Sbjct: 361 YQQESD-----YYVGTTPDSGCPSQFR-GSTYITID-SRLSDSQLEFWERWYDGRSRQVA 413
Query: 227 GVKAGPYEFKPA 238
G ++G Y +KP
Sbjct: 414 GSESGFYIYKPV 425
>gi|425450833|ref|ZP_18830656.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
gi|440753055|ref|ZP_20932258.1| chromophore lyase CpcT/CpeT 3 [Microcystis aeruginosa TAIHU98]
gi|389768139|emb|CCI06654.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
gi|440177548|gb|ELP56821.1| chromophore lyase CpcT/CpeT 3 [Microcystis aeruginosa TAIHU98]
Length = 202
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 96/199 (48%), Gaps = 13/199 (6%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
LT+S + VA+ L G ++EQA +P ++M TC S LY EQ
Sbjct: 12 LTQSLPLNPQVQGVANHLIGVMDTREQAQTNP-RIAKVQMTTCAVDFSPKQDSIYLYQEQ 70
Query: 98 AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
A +P+RQR +++P VE +Y + + NFC++ +R L +
Sbjct: 71 AIIDRLNQPYRQRILVIQPSADN--STVESKAYKLNNAPNFINFCNKDLTERKLN---VS 125
Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRG 217
D+ E + T+ L+ G Y G TPP G P + + GA T+ + +L + + T DRG
Sbjct: 126 DLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILNSQGMDTSDRG 179
Query: 218 FDDDGNQVWGVKAGPYEFK 236
+D G QVWG + Y+F+
Sbjct: 180 YDSLGRQVWGAQDNVYQFR 198
>gi|166365024|ref|YP_001657297.1| hypothetical protein MAE_22830 [Microcystis aeruginosa NIES-843]
gi|317374819|sp|B0JGA0.1|CPXT3_MICAN RecName: Full=Chromophore lyase CpcT/CpeT 3
gi|166087397|dbj|BAG02105.1| hypothetical protein MAE_22830 [Microcystis aeruginosa NIES-843]
Length = 202
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 96/199 (48%), Gaps = 13/199 (6%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
LT+S + VA+ L G ++EQA +P ++M TC S LY EQ
Sbjct: 12 LTQSLPLNPQVQGVANHLIGVMDTREQAQTNP-RIAKVQMTTCAVDFSPKQDSIYLYQEQ 70
Query: 98 AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
A +P+RQR +++P VE +Y + + NFC++ +R L +
Sbjct: 71 AIIDRLNQPYRQRILVIQPSAD--NSTVESKAYKLNNAPNFINFCNKDLTERELN---VS 125
Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRG 217
D+ E + T+ L+ G Y G TPP G P + + GA T+ + +L + + T DRG
Sbjct: 126 DLAESVCTVFLQPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDTSDRG 179
Query: 218 FDDDGNQVWGVKAGPYEFK 236
+D G QVWG + Y+F+
Sbjct: 180 YDSLGQQVWGAQDNVYQFR 198
>gi|443322051|ref|ZP_21051086.1| CpeT/CpcT family (DUF1001) [Gloeocapsa sp. PCC 73106]
gi|442788254|gb|ELR97952.1| CpeT/CpcT family (DUF1001) [Gloeocapsa sp. PCC 73106]
Length = 212
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 92/191 (48%), Gaps = 14/191 (7%)
Query: 48 HARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA--EMVDGSKLLYLEQAFWRTPQK 105
H V + L G + Q + + + +RM TC V GS LLY EQA K
Sbjct: 29 HRDTVVNYLVGTMDATTQPNPESEPV-QVRMTTCIVQLEPSVRGSVLLYQEQALNNELLK 87
Query: 106 PFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTT 165
P+RQRF + E K +E SY + + + FC +R + + D+GE + +
Sbjct: 88 PYRQRFLEITA--SEDKEIIESKSYRPNQPDSWSGFCSLHETERVITTD---DLGESVCS 142
Query: 166 IHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQV 225
+ LR +Y G TPP G P + + GA T+ + +L + + TWDRGFD NQV
Sbjct: 143 VFLRPL----ATIYLGKTPPEGCPANVR-GAVKITNTI-ILHSQGMETWDRGFDAQNNQV 196
Query: 226 WGVKAGPYEFK 236
WG K Y+F+
Sbjct: 197 WGAKDQSYQFR 207
>gi|425456903|ref|ZP_18836609.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
gi|389801879|emb|CCI18997.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
Length = 202
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 60/199 (30%), Positives = 96/199 (48%), Gaps = 13/199 (6%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
LT+S + VA+ L G + +QA +P ++M TC S LY EQ
Sbjct: 12 LTQSLPLNPQVQGVANHLIGVMDTTQQAQTNP-RVAKVQMTTCAVNFSPKQDSIYLYQEQ 70
Query: 98 AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
A +P+RQR +++P VE +Y + ++ NFC++ +R L +
Sbjct: 71 AIIDRLNQPYRQRILVIQPSADN--STVESKAYKLNNAADFINFCNKDLTERKLN---VS 125
Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRG 217
D+ E + T+ L+ G Y G TPP G P + + GA T+ + +L + + T DRG
Sbjct: 126 DLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDTSDRG 179
Query: 218 FDDDGNQVWGVKAGPYEFK 236
+D G QVWG + Y+F+
Sbjct: 180 YDSLGQQVWGAQDNFYQFR 198
>gi|425459384|ref|ZP_18838870.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9808]
gi|389822925|emb|CCI29288.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9808]
Length = 202
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 62/203 (30%), Positives = 98/203 (48%), Gaps = 21/203 (10%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSK-----LL 93
LT+S + + VA+ L G + +QA +P ++M TC VD S L
Sbjct: 12 LTQSLSLNPEVQGVANHLIGVMDTTQQAQTNP-RIAKVQMTTCA----VDFSPKQDNIYL 66
Query: 94 YLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPE 153
Y EQA +P+RQR +++P VE +Y + + NFC++ +R L
Sbjct: 67 YQEQAIIDRLNQPYRQRILVIQPSADN--STVESKAYKLNNAPNFINFCNKNLTERKLN- 123
Query: 154 EVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHT 213
+ D+ E + T+ L+ G Y G TPP G P + + GA T+ + +L + + T
Sbjct: 124 --VSDLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDT 175
Query: 214 WDRGFDDDGNQVWGVKAGPYEFK 236
DRG+D G QVWG + Y+F+
Sbjct: 176 SDRGYDSLGRQVWGAQDNFYQFR 198
>gi|449525842|ref|XP_004169925.1| PREDICTED: chromophore lyase CpcT/CpeT 3-like, partial [Cucumis
sativus]
Length = 74
Score = 83.2 bits (204), Expect = 1e-13, Method: Composition-based stats.
Identities = 52/76 (68%), Positives = 62/76 (81%), Gaps = 2/76 (2%)
Query: 1 MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
MG +S+ +N + GW RA GL +K VL+GGA+LLKRLTKS TRWDHAR V+ SLSGEK
Sbjct: 1 MGTNSNDNNN-SVGWN-RAGGLIMKTLVLVGGALLLKRLTKSTTRWDHARFVSQSLSGEK 58
Query: 61 FSKEQASRDPDNFFNI 76
FSKEQA+RDPDN+FNI
Sbjct: 59 FSKEQAARDPDNYFNI 74
>gi|443651763|ref|ZP_21130696.1| chromophore lyase CpcT/CpeT 3 [Microcystis aeruginosa DIANCHI905]
gi|159027459|emb|CAO89424.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443334404|gb|ELS48916.1| chromophore lyase CpcT/CpeT 3 [Microcystis aeruginosa DIANCHI905]
Length = 203
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/203 (31%), Positives = 97/203 (47%), Gaps = 20/203 (9%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSK-----LL 93
LT+S + VA+ L G + +QA +P ++M TC VD S L
Sbjct: 12 LTQSLPLNPQVQGVANHLIGVMDTTQQAQTNP-RIAKVQMTTCA----VDFSPKQDNIYL 66
Query: 94 YLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPE 153
Y EQA +P+RQR +++P P VE +Y + + NFC++ +R L
Sbjct: 67 YQEQAIIDRLNQPYRQRILVIQPSPDN--STVESKAYKLNNAANFINFCNKNLTERKLNV 124
Query: 154 EVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHT 213
+ D E + T+ L+ G Y G TPP G P + + GA T+ + +L + + T
Sbjct: 125 SNLAD--ESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDT 176
Query: 214 WDRGFDDDGNQVWGVKAGPYEFK 236
DRG+D G QVWG + Y+F+
Sbjct: 177 SDRGYDSLGRQVWGAQDNFYQFR 199
>gi|425446505|ref|ZP_18826508.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9443]
gi|389733245|emb|CCI02962.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
9443]
Length = 206
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 60/199 (30%), Positives = 95/199 (47%), Gaps = 13/199 (6%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
LT+S + VA+ L G + +QA +P ++M TC S LY EQ
Sbjct: 12 LTQSLPLNPQVQGVANHLIGVMDTTQQAQTNP-RVAKVQMTTCAVNFSPKQDSIYLYQEQ 70
Query: 98 AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
A +P+RQR +++P VE +Y + + NFC++ +R L +
Sbjct: 71 AIIDRLNQPYRQRILVIQPSADN--STVESKAYKLNNAANFINFCNKDLTERRLN---VS 125
Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRG 217
D+ E + T+ L+ G Y G TPP G P + + GA T+ + +L + + T DRG
Sbjct: 126 DLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDTSDRG 179
Query: 218 FDDDGNQVWGVKAGPYEFK 236
+D G QVWG + Y+F+
Sbjct: 180 YDSLGRQVWGAQDNFYQFR 198
>gi|119489665|ref|ZP_01622424.1| hypothetical protein L8106_13110 [Lyngbya sp. PCC 8106]
gi|119454402|gb|EAW35551.1| hypothetical protein L8106_13110 [Lyngbya sp. PCC 8106]
Length = 228
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 63/228 (27%), Positives = 105/228 (46%), Gaps = 27/228 (11%)
Query: 17 GRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNI 76
G A + A L G ++ + V L+G + QA PD ++
Sbjct: 12 GYAIATLITTASLFGCGLIFNSTPRQVLE------VVSYLTGVMETSAQAQAVPDAP-SV 64
Query: 77 RMLTCPAAEMVDGSKL--------LYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVS 128
RM TC ++ D ++ LY EQA P+KP+RQRF + P + VE +
Sbjct: 65 RMTTC-VVKVADADEITQRSPAIFLYQEQAMTSNPKKPYRQRFLQISPSADGQQ--VESA 121
Query: 129 SYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGF 188
++ NFC + +R + I DIG++ ++ L+ G + Y G T P G
Sbjct: 122 TFVPANPRALVNFCSQSESERLV---TIEDIGDYRCSVFLQPA--GSQ--YIGQTQPEGC 174
Query: 189 PNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFK 236
P +++ GA T+++ L + T+DRG+D GNQ+WG + Y+++
Sbjct: 175 PANYK-GAVTITNQI-TLDAESMETFDRGYDQKGNQIWGAEDQSYQYQ 220
>gi|326513334|dbj|BAK06907.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 77
Score = 81.6 bits (200), Expect = 3e-13, Method: Composition-based stats.
Identities = 35/54 (64%), Positives = 46/54 (85%)
Query: 24 VKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIR 77
+KA V+ GG +L++RL +S TRWDHAR VAD+LSGEKFS+EQA +DP N+FN+R
Sbjct: 24 LKALVVFGGVILIRRLRRSTTRWDHARTVADALSGEKFSREQARQDPGNYFNLR 77
>gi|307151475|ref|YP_003886859.1| hypothetical protein Cyan7822_1593 [Cyanothece sp. PCC 7822]
gi|306981703|gb|ADN13584.1| protein of unknown function DUF1001 [Cyanothece sp. PCC 7822]
Length = 220
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 60/194 (30%), Positives = 95/194 (48%), Gaps = 21/194 (10%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCP-----AAEMVDGS----KLLYLEQAFWRT 102
V L G + QA+ +P ++RM TC A + V S LY EQA +
Sbjct: 33 VVTHLVGIMDTSAQAAENPKKS-SVRMTTCQVTFRGANDSVQNSFGSPVYLYQEQALTQE 91
Query: 103 PQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEH 162
KP+RQRF ++P + VE SY + E C++P +R L + D+GE
Sbjct: 92 LNKPYRQRFLEIQPSVGQET--VESKSYKPAQPETLIGLCNKPESERVLQQ---SDLGEF 146
Query: 163 LTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDG 222
+ + L+ G + G TPP G P + + GA T+ + +L + + TWD+G+D G
Sbjct: 147 VCRVFLKPSPDG----FIGETPPEGCPANVR-GAVKITNTI-ILHSRGMDTWDKGYDAQG 200
Query: 223 NQVWGVKAGPYEFK 236
+QVWG + Y+++
Sbjct: 201 HQVWGAREDAYQYR 214
>gi|440751007|ref|ZP_20930245.1| hypothetical protein C943_2938 [Mariniradius saccharolyticus AK6]
gi|436480350|gb|ELP36588.1| hypothetical protein C943_2938 [Mariniradius saccharolyticus AK6]
Length = 211
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/186 (29%), Positives = 93/186 (50%), Gaps = 19/186 (10%)
Query: 54 DSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYM 113
D ++G FS E S++ +F +I + P + LY+EQA T KP+RQR Y
Sbjct: 34 DMMAGT-FSSEAQSKEDADFLHISLEMQPIWQKNKDGYWLYVEQAMATTKDKPYRQRIYH 92
Query: 114 VKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPL----PEEVIGDIGEHLTTIHLR 169
+ + D + E++E + F + D L P++++ G I LR
Sbjct: 93 L------YQMDDTTLVSQVYELKEPEKFAGKANDPTALGSLTPDQLVSKEG---CGIFLR 143
Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVK 229
+ G ++EGST P++ + GA ++TS++ + K+ + +WD+G+D +G QVWG
Sbjct: 144 KTKSG---IFEGSTQDKACPSNLR-GARFTTSKVTLTKDG-MESWDQGWDANGQQVWGAT 198
Query: 230 AGPYEF 235
G Y F
Sbjct: 199 KGGYRF 204
>gi|37520007|ref|NP_923384.1| hypothetical protein glr0438 [Gloeobacter violaceus PCC 7421]
gi|81711246|sp|Q7NNH3.1|CPXT1_GLOVI RecName: Full=Chromophore lyase CpcT/CpeT 1
gi|35210999|dbj|BAC88379.1| glr0438 [Gloeobacter violaceus PCC 7421]
Length = 229
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 87/191 (45%), Gaps = 21/191 (10%)
Query: 52 VADSLSGEKFSKEQASRD---PDNFFNIRMLTCPAAEMVDGSKLLYL--EQAFWRTPQKP 106
V L+G + +QA D PD +RM TC D SK +YL EQ P
Sbjct: 50 VVTRLTGIMTTTQQAQSDAARPD----VRMTTCSVQLEGDKSKAIYLYQEQTMSNNLGAP 105
Query: 107 FRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPL-PEEVIGDIGEHLTT 165
+RQR + VE + + E + C +P +R + P + GD T
Sbjct: 106 YRQRLLRIAASADGRA--VESAGFKFVEAKPLAGLCAKPAAERLIAPMALDGD-----PT 158
Query: 166 IHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQV 225
+R G + Y G+TP GG ++ + GA T+E+ + K + T DRGFD GNQV
Sbjct: 159 CTVRLVQAGDK--YMGTTPEGGCQSNVR-GAARITNEITLYKEG-MDTRDRGFDAQGNQV 214
Query: 226 WGVKAGPYEFK 236
WG K PY F+
Sbjct: 215 WGAKEEPYRFR 225
>gi|434392494|ref|YP_007127441.1| protein of unknown function DUF1001 [Gloeocapsa sp. PCC 7428]
gi|428264335|gb|AFZ30281.1| protein of unknown function DUF1001 [Gloeocapsa sp. PCC 7428]
Length = 240
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 88/191 (46%), Gaps = 18/191 (9%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGS------KLLYLEQAFWRTPQK 105
V L G + QA +P N ++R+ TC G+ +Y EQA + +
Sbjct: 40 VVSHLDGAMDTSAQARANP-NAPDVRITTCKVTVKNAGAIARPHAVFMYQEQALSQRLSQ 98
Query: 106 PFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTT 165
P+RQRF + P +E + + + + C++P QR + + DIG +
Sbjct: 99 PYRQRFLRIAPSVDH--NSIESAVFRPPTPQAWIGLCNKPEAQRIIN---VKDIGTSNCS 153
Query: 166 IHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQV 225
+ L R + Y G T G P++++ GA T+ + L + TWDRGFD GNQV
Sbjct: 154 VFLGRQQQN----YIGETSASGCPSNYK-GAVRITNRI-TLHQAGMDTWDRGFDATGNQV 207
Query: 226 WGVKAGPYEFK 236
WG + Y+F+
Sbjct: 208 WGAEGEAYQFR 218
>gi|443242425|ref|YP_007375650.1| DUF1001 domain containing protein [Nonlabens dokdonensis DSW-6]
gi|442799824|gb|AGC75629.1| DUF1001 domain containing protein [Nonlabens dokdonensis DSW-6]
Length = 219
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 103/215 (47%), Gaps = 19/215 (8%)
Query: 22 LFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTC 81
L + +LIG + +K T+ V + G S Q+ D ++++I +
Sbjct: 7 LLISFFILIGCSS-----SKHNTKDKELDKVYKMMQGSYDSSLQSKVDS-SYYDISLEMH 60
Query: 82 PAAEMVDGSKLLYLEQAFWRTPQKPFRQRFY-MVKPCPKELKCDVEVSSYAIREVEEYKN 140
P + G + LY+EQA +P+R R Y +++ E+ +V Y I ++Y
Sbjct: 61 PVWKN-SGERWLYVEQALTSQKDQPYRVRMYRLLRNDNNEIISEV----YTIPNEKQYYG 115
Query: 141 FCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYST 200
+P+ + L + + + +++ + G Y G T ++ + GA+Y+T
Sbjct: 116 KFKKPKAFKYLTPDYLEK--REGCEVIIQKEENGS---YTGKTGKSTCLSTMR-GASYAT 169
Query: 201 SELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEF 235
S + V+ +N+I +WDRGFD DGNQVWG + G Y F
Sbjct: 170 S-IVVINDNQIISWDRGFDKDGNQVWGAEKGGYIF 203
>gi|443323211|ref|ZP_21052220.1| CpeT/CpcT family (DUF1001) [Gloeocapsa sp. PCC 73106]
gi|442787121|gb|ELR96845.1| CpeT/CpcT family (DUF1001) [Gloeocapsa sp. PCC 73106]
Length = 191
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 83/186 (44%), Gaps = 12/186 (6%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A L+GE ++EQA +P + N+R+ P ++ S L+ EQA +P+R R
Sbjct: 10 LASYLAGEFENREQAIAEPAWYVNLRLWLRPVPDLWSDSLTLFAEQASTLKLDQPYRPRL 69
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
++ P ++ V Y +E++ + +P + L E I E L L+
Sbjct: 70 LRLRQAP-----ELVVEHYMFKELQAFIGSGRQPEKLKALTLEQI----EFLPGCTLKVQ 120
Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDD-DGNQVWGVKA 230
++ ++ PG G+TY S + E+ T+DRG D G +WG
Sbjct: 121 QHNQQ--FQALPQPGETCCFQYQGSTYQVSLGFEVNEAELSTYDRGIDSATGKAIWGAIL 178
Query: 231 GPYEFK 236
GPY FK
Sbjct: 179 GPYRFK 184
>gi|163756594|ref|ZP_02163706.1| hypothetical protein KAOT1_01325 [Kordia algicida OT-1]
gi|161323488|gb|EDP94825.1| hypothetical protein KAOT1_01325 [Kordia algicida OT-1]
Length = 209
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/220 (29%), Positives = 98/220 (44%), Gaps = 21/220 (9%)
Query: 24 VKAAVLIGGAVLLKRLT------KSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIR 77
+K L A++L L KS D + + ++G S EQAS D D+++NI
Sbjct: 1 MKKTFLYSFAIILCFLISSCNSYKSTNTQDELKALQTLMTGSFDSSEQASAD-DSYYNIS 59
Query: 78 MLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEE 137
+ P DG LY+EQA KP+RQR Y + E + + SS + +E
Sbjct: 60 LHMYPIWTSKDGY-WLYVEQALNSNQDKPYRQRVYQL-----EKLANGKFSS-KVYTLEN 112
Query: 138 YKNFCDRPRDQRPLPEEVIGDIGEHL-TTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGA 196
K + + + + + E + L + K Y GST ++ GA
Sbjct: 113 PKEAIGKWKTPAYFDQFDVSMLKEREGCAVILEK----KGAYYSGSTNEKDCKST-MRGA 167
Query: 197 TYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFK 236
+Y+TSE+ + K N I +WDRGF+ VWG + Y FK
Sbjct: 168 SYATSEVTI-KPNVIESWDRGFNAKDEHVWGAEKAGYVFK 206
>gi|428773606|ref|YP_007165394.1| hypothetical protein Cyast_1789 [Cyanobacterium stanieri PCC 7202]
gi|428687885|gb|AFZ47745.1| protein of unknown function DUF1001 [Cyanobacterium stanieri PCC
7202]
Length = 194
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 88/197 (44%), Gaps = 27/197 (13%)
Query: 49 ARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFR 108
AR++A S E QA +P F +IR+ P + G +LEQA+ +P+R
Sbjct: 11 ARLMAADFSNEP----QAIENPPFFAHIRVCMRPLPHSLLGEVSFFLEQAYDFLLSQPYR 66
Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLP----EEVIG-DIGEHL 163
R + +K L E+ Y ++E +E+ P + L E++ G D+ H
Sbjct: 67 LRVFTIKAVDDHL----ELEHYKLKEEKEFYGASRNPEKLKGLTLDHLEKMNGCDMIAHW 122
Query: 164 TTIHLR-RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDG 222
T H + GK C+ E NS++ + N++ ++DRG D +
Sbjct: 123 TGTHFKGHVKPGKACIVERKGKKSYLDNSFE------------IDENKLISFDRGRDLET 170
Query: 223 NQ-VWGVKAGPYEFKPA 238
++ +WG AGP+ F+P
Sbjct: 171 DELLWGSVAGPFHFQPT 187
>gi|148243132|ref|YP_001228289.1| hypothetical protein SynRCC307_2033 [Synechococcus sp. RCC307]
gi|147851442|emb|CAK28936.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
Length = 201
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 60/209 (28%), Positives = 95/209 (45%), Gaps = 25/209 (11%)
Query: 44 TRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTP 103
TRW L G +++QA +P + +I + P ++ S L+ EQA+ TP
Sbjct: 9 TRW---------LCGSFSNQQQAFENPPLYGHIHVRYRPLGQLAPRSMLI--EQAYAITP 57
Query: 104 QKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHL 163
++P+R R CP++ ++V +Y + E + + D L E+ D HL
Sbjct: 58 KEPYRIRVVRPLLCPEQ---GLKVMNYTLAEPQRFIGAI----DDADLRAEIREDDLTHL 110
Query: 164 TTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDG 222
+ D+G + GS PG + G +Y SE VL+N + T DRG D D
Sbjct: 111 EGCNYLVRDQGDH--FSGSVEPGCRCKVRRKGRDSYLVSEF-VLRNGTMETIDRGHDPDT 167
Query: 223 N-QVWGVKAGPYEFKPAPSSSYSDMFSPL 250
+ Q+WG GP+ F+ +SD PL
Sbjct: 168 HEQLWGSLPGPFLFE--RKEDWSDELLPL 194
>gi|17228143|ref|NP_484691.1| hypothetical protein alr0647 [Nostoc sp. PCC 7120]
gi|81772968|sp|Q8YZ40.1|CPCTL_NOSS1 RecName: Full=Phycocyanobilin lyase CpcT homolog
gi|17129993|dbj|BAB72605.1| alr0647 [Nostoc sp. PCC 7120]
Length = 198
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/203 (28%), Positives = 88/203 (43%), Gaps = 39/203 (19%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+ + L+GE ++EQA +P F ++R+ P D S L+ EQA +P+RQR
Sbjct: 10 LGNYLAGEFDNREQALGEPIWFVHLRLWQRPVDLFSDDSITLFAEQANIVNLDRPYRQRI 69
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEH---LTTIHL 168
+ P P D E Y V+ Y P++ P +IG G H L T+
Sbjct: 70 LRLMPAP-----DSETGLY----VQYYM-----PKN----PSALIG-AGRHPDLLKTLTP 110
Query: 169 RRCDRGKRCLYEGSTPPGGFPNSWQNGA--------TYSTSELAV-------LKNNEIHT 213
++ + C+ S PNS+Q A T+S E V + E+HT
Sbjct: 111 QQLELLPGCVLSVSQQTVA-PNSYQFTASPLPNTCCTFSYLENTVQVSLGFAVTETELHT 169
Query: 214 WDRGFDDD-GNQVWGVKAGPYEF 235
+D+G D + G WG GPY +
Sbjct: 170 YDKGIDQETGKATWGAIVGPYRY 192
>gi|86606684|ref|YP_475447.1| CpeT protein [Synechococcus sp. JA-3-3Ab]
gi|123505735|sp|Q2JT26.1|CPXT2_SYNJA RecName: Full=Chromophore lyase CpcT/CpeT 2
gi|86555226|gb|ABD00184.1| CpeT protein [Synechococcus sp. JA-3-3Ab]
Length = 199
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 88/189 (46%), Gaps = 16/189 (8%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A L+G+ ++EQA +P F +IR+ P V G Y+EQA+ ++P+RQ
Sbjct: 12 MARWLAGDFSNQEQAWENPPFFASIRVAYRPLPTAVLGGIGFYVEQAYSGHLEEPYRQAV 71
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
+ ++ + + +Y + + ++ C R R ++ L + D+ +L C
Sbjct: 72 VEL----TQVGDGIVIRNYRPLQPQRWRG-CARGRAEQ-LSQLSAADLA------YLPGC 119
Query: 172 D---RGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGN-QVWG 227
D + + L+ G T PG +NG T L NE + DRG D + QVWG
Sbjct: 120 DVQVKRQGSLFVGVTEPGCRCCVVRNGQTTYLQTTLHLSENEFCSHDRGMDPVTHRQVWG 179
Query: 228 VKAGPYEFK 236
AGP+ F+
Sbjct: 180 AVAGPFRFR 188
>gi|75910775|ref|YP_325071.1| hypothetical protein Ava_4579 [Anabaena variabilis ATCC 29413]
gi|75704500|gb|ABA24176.1| Protein of unknown function DUF1001 [Anabaena variabilis ATCC
29413]
Length = 198
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 85/203 (41%), Gaps = 39/203 (19%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+ + L+GE ++EQA +P F ++R+ P D S L+ EQA +P+RQR
Sbjct: 10 LGNYLAGEFDNREQALGEPIWFVHLRLWQRPVGLFSDDSITLFAEQANIVNLDRPYRQRI 69
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEH---LTTIHL 168
+ P P D E Y V+ Y P++ P ++G G H L T+
Sbjct: 70 LRLMPAP-----DSETGLY----VQYYM-----PKN----PSALVG-AGRHPELLKTLTP 110
Query: 169 RRCDRGKRCLYEGSTPPGGFPNSWQ---------------NGATYSTSELAVLKNNEIHT 213
++ + C+ S PNS+Q G T S + E+HT
Sbjct: 111 QQLELLPGCVLSVSQKTVA-PNSYQFTASPLPDTCCTFSYLGNTVQVSLGFAVTETELHT 169
Query: 214 WDRGFDDD-GNQVWGVKAGPYEF 235
+D+G D + G WG GPY +
Sbjct: 170 YDKGIDQETGKATWGAIVGPYRY 192
>gi|411116555|ref|ZP_11389042.1| protein of unknown function DUF1001, CpeT/CpcT family
[Oscillatoriales cyanobacterium JSC-12]
gi|410712658|gb|EKQ70159.1| protein of unknown function DUF1001, CpeT/CpcT family
[Oscillatoriales cyanobacterium JSC-12]
Length = 200
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/186 (22%), Positives = 83/186 (44%), Gaps = 4/186 (2%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A+ L+G +++QA+ +P + ++R+ P + S ++ EQA KP+RQR
Sbjct: 10 LAEYLTGVFENRKQAADEPIWYVHLRLWQHPVQLFAEDSVTIFAEQASVVNLDKPYRQRL 69
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
+ P + ++V Y ++ ++ P L + + + + I+ +
Sbjct: 70 LRLMRSPANPEA-LQVQYYGFKDPSAFRGAGQAPEKLEFLAIDQVDFLPGCILEINQQPT 128
Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVKA 230
G R + + PPG +G + + +E T+D+G D + GN +WG
Sbjct: 129 STGTRFI--ATAPPGACCRFQYDGKIGQVALGFEVSADEYLTYDKGIDPETGNALWGAIM 186
Query: 231 GPYEFK 236
GPY ++
Sbjct: 187 GPYRYQ 192
>gi|428201304|ref|YP_007079893.1| hypothetical protein Ple7327_0912 [Pleurocapsa sp. PCC 7327]
gi|427978736|gb|AFY76336.1| CpeT/CpcT family (DUF1001) [Pleurocapsa sp. PCC 7327]
Length = 204
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/187 (24%), Positives = 82/187 (43%), Gaps = 15/187 (8%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
L+GE +++QA +P + ++R+ P + + S L+ EQA ++P+R R ++
Sbjct: 14 LAGEFDNRQQALAEPIWYVHLRLWLRPTSLFREDSLTLFAEQASIVNLEQPYRPRLLRLR 73
Query: 116 -----PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR 170
P P + V Y ++++ + +P R L E + + + + R
Sbjct: 74 QSQTHPLP------LRVEHYMFKDLDAVRGASSKPELLRQLTPEQVEFLPSCTLNVEIER 127
Query: 171 CDRGKRCLYEGSTPPGGFPNSWQ-NGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGV 228
DR + ++P P + G TY S + +E+ T D+G D G +WG
Sbjct: 128 LDRDRFKF--KASPASEKPCRFTYGGQTYQVSLGFEVTPDELKTHDKGIDPTTGQAIWGA 185
Query: 229 KAGPYEF 235
GPY F
Sbjct: 186 LMGPYRF 192
>gi|434400021|ref|YP_007134025.1| protein of unknown function DUF1001 [Stanieria cyanosphaera PCC
7437]
gi|428271118|gb|AFZ37059.1| protein of unknown function DUF1001 [Stanieria cyanosphaera PCC
7437]
Length = 197
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/192 (26%), Positives = 85/192 (44%), Gaps = 21/192 (10%)
Query: 49 ARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFR 108
AR++A S + EQA +P F +IR+ P + + L+LEQA+ +P+R
Sbjct: 11 ARLMAADFSNQ----EQAFENPPFFAHIRVCMRPLPDSILDGTSLFLEQAYDYMLNRPYR 66
Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHL 168
R + + +E+ +Y ++E E++ R RD+ + + D+ E L
Sbjct: 67 LRVFKLSVVENR----IELENYKVKEEEKFYG-ASRDRDRL---KNLTPDLIEKLPG--- 115
Query: 169 RRCDRG---KRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ- 224
CD K + G PG + G T ++ + ++DRG D ++
Sbjct: 116 --CDMNVIWKNNSFHGEIKPGKACIVERQGKTTYLDNSFIIDQETLISYDRGRDPGTDEL 173
Query: 225 VWGVKAGPYEFK 236
VWG AGP+ FK
Sbjct: 174 VWGSLAGPFHFK 185
>gi|300867127|ref|ZP_07111793.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300334882|emb|CBN56959.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 196
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 89/206 (43%), Gaps = 31/206 (15%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQA 98
+T S AR +A S + QA +P F +IR+ P + G LYLEQA
Sbjct: 1 MTNSTDIATLARWMASDFSNQP----QAFENPPFFAHIRVCMRPLPVELLGGISLYLEQA 56
Query: 99 FWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGD 158
+ +P+R R + P + C +E+ +YAI E++ P + L E I
Sbjct: 57 YDYQINQPYRVRVLKLVPA---VNC-IEIENYAIENEEQFYGSSRDPVRLQALKAETIKK 112
Query: 159 ------IGEHLTTIHLRRCDRGKRC--LYEGSTPPGGFPNSWQNGATYSTSELAVLKNNE 210
I + L R + GK C + +G T TY SE + ++
Sbjct: 113 LPGCTFITQWTGNSFLGRVEPGKGCTVVRKGQT-------------TYLDSEFEIDEHKF 159
Query: 211 IHTWDRGFDDDGNQ-VWGVKAGPYEF 235
I + DRG D + ++ +WG AGP+EF
Sbjct: 160 I-SHDRGLDPETDEHIWGAVAGPFEF 184
>gi|88807813|ref|ZP_01123324.1| hypothetical protein WH7805_06621 [Synechococcus sp. WH 7805]
gi|88787852|gb|EAR19008.1| hypothetical protein WH7805_06621 [Synechococcus sp. WH 7805]
Length = 234
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 49/183 (26%), Positives = 83/183 (45%), Gaps = 12/183 (6%)
Query: 55 SLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMV 114
SLS ++EQA +P + NI + P ++ GS L LEQA+ +P+R R
Sbjct: 32 SLSASFSNQEQAFENPPLYGNILVRLRPLPQLDPGS--LLLEQAYAIASNEPYRVRVLQP 89
Query: 115 KPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRG 174
CP + + +YAIR+ + + + P+ + L + + + +R D G
Sbjct: 90 TVCPTR---GLLILNYAIRDDQRFWGSVEDPQRRSQLTAADLTLL--EGCSYQVRETDEG 144
Query: 175 KRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPY 233
+ G PG + G T L ++ + T DRG+D ++ +WG AGP+
Sbjct: 145 ----FRGEVEPGCRCLVTRRGKTSYLVSSFELTDDGMSTIDRGYDPATHEHLWGSIAGPF 200
Query: 234 EFK 236
+FK
Sbjct: 201 QFK 203
>gi|300866329|ref|ZP_07111032.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300335672|emb|CBN56192.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 206
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/188 (26%), Positives = 80/188 (42%), Gaps = 7/188 (3%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLT--CPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+A ++GE ++ QA DP + ++R+ P + + S LY+EQA P KP+R
Sbjct: 10 LARYMAGEFDNRTQAIADPVWYVHLRLWQRPFPISLFSEPSITLYVEQANILKPDKPYRP 69
Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPL-PEEVIGDIGEHLTTIHL 168
R ++ + +EV Y ++++ + P + L PE+V G LT
Sbjct: 70 RIVQLRHSQSAPEL-IEVQYYMLKDIAALQGAGQNPNLLKQLTPEQVELLPGCTLTVT-- 126
Query: 169 RRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWG 227
R+ R + S PP G TY E ++D+G D G +WG
Sbjct: 127 RQDLTSNRYRFRASLPPETQCCFTYGGQTYQIDLGFEASAEEFLSYDKGIDPKTGKSIWG 186
Query: 228 VKAGPYEF 235
GP+ F
Sbjct: 187 ALMGPFRF 194
>gi|443317243|ref|ZP_21046659.1| CpeT/CpcT family (DUF1001) [Leptolyngbya sp. PCC 6406]
gi|442783128|gb|ELR93052.1| CpeT/CpcT family (DUF1001) [Leptolyngbya sp. PCC 6406]
Length = 197
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 88/207 (42%), Gaps = 29/207 (14%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A ++ + ++ QA +P F +IR+ P + G LYLEQA+ +P+R R
Sbjct: 10 LARWMAADFSNQPQAFENPPLFAHIRVCMRPLPYTLMGGVSLYLEQAYDFMLTQPYRARV 69
Query: 112 YMVKPCPKELKCD--VEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
K + D +++ +Y I ++ +F RD+ L LT HL
Sbjct: 70 L------KLILVDGKIQIENYLI---QDEASFHGAARDRDRLSS---------LTAAHLE 111
Query: 170 RCDRGKRCL------YEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDG 222
R + + G PG +NG TY SE + +WDRG D D
Sbjct: 112 RMPGCNMTVDWTGHSFRGQVEPGKACMVVRNGKTTYLDSEFEI-DGQHFTSWDRGRDPDT 170
Query: 223 NQ-VWGVKAGPYEFKPAPSSSYSDMFS 248
++ +WG AG ++F P S ++ S
Sbjct: 171 DEHLWGSLAGAFQFSPRASYAHEVTLS 197
>gi|86605374|ref|YP_474137.1| hypothetical protein CYA_0661 [Synechococcus sp. JA-3-3Ab]
gi|123507394|sp|Q2JWI9.1|CPXT1_SYNJA RecName: Full=Chromophore lyase CpcT/CpeT 1
gi|86553916|gb|ABC98874.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length = 218
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/204 (25%), Positives = 81/204 (39%), Gaps = 23/204 (11%)
Query: 53 ADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFY 112
AD L G ++ QA +P F + + + + EQ TP + +R R
Sbjct: 36 ADWLEGYYSNRAQAMAEPVWFVPVSLWYVRLPHLFSEGIGFFTEQFNEHTPGRFYRSRVL 95
Query: 113 MVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCD 172
V P + + +Y +R+ + +D + L + + D+ +HL +
Sbjct: 96 QVLADP------LRIENYKLRDQAAWAGAS---QDLQRLAQLSLSDL-QHLPGCRILVEK 145
Query: 173 RGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVKAG 231
R Y G PGG +TY E L E TWDRGFD G Q WG +AG
Sbjct: 146 RAD--CYHGQMLPGGGCRLNPGDSTYIHIEFD-LTAQEFITWDRGFDATTGQQTWGSRAG 202
Query: 232 PYEFKPAPSSSYSDMFSPLNFPPQ 255
PY ++ P+ FPP+
Sbjct: 203 PYRYQKR---------IPVVFPPR 217
>gi|88807824|ref|ZP_01123335.1| CpeT-like protein [Synechococcus sp. WH 7805]
gi|88787863|gb|EAR19019.1| CpeT-like protein [Synechococcus sp. WH 7805]
Length = 203
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 51/196 (26%), Positives = 83/196 (42%), Gaps = 19/196 (9%)
Query: 51 IVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQR 110
+ A +L G ++EQA DP F +I + P V Y EQ++ P +P+RQ
Sbjct: 6 LFARTLCGHYSNREQAQNDPSKFAHINIFFMPLPWDVLKGPGFYSEQSYDHDPWRPYRQG 65
Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPL-PEEVIGDIGEHLTTIHLR 169
+ ++ + V ++ + + E P + P+ + G +H +
Sbjct: 66 VHRLREGDNGVHV---VENFGMAQPERVAGAPQHPALLAAIQPDTLQARCG---CAMHFQ 119
Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTW---DRGFDDDGNQ- 224
C YEG PG ++G TY SE+ V + H+W DRGFD D +
Sbjct: 120 SCGANA---YEGRVEPGKGCLVPRDGRLTYLVSEVQV----DAHSWSSRDRGFDPDSDDL 172
Query: 225 VWGVKAGPYEFKPAPS 240
VWG + G +F+ S
Sbjct: 173 VWGSEHGMLQFQRVAS 188
>gi|443328463|ref|ZP_21057060.1| CpeT/CpcT family (DUF1001) [Xenococcus sp. PCC 7305]
gi|442791917|gb|ELS01407.1| CpeT/CpcT family (DUF1001) [Xenococcus sp. PCC 7305]
Length = 194
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/196 (25%), Positives = 90/196 (45%), Gaps = 23/196 (11%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+ + + ++ + ++ QA +P F +IR+ P + + GS L+LEQA+ P+R
Sbjct: 8 KTLVNWMAADFSNQAQAYANPPFFASIRVCMRPLPDELLGSPSLFLEQAYDFMLTNPYRV 67
Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFC-DRPRDQRPLPEEVIGDIGEHLTT--- 165
R L +E+ +Y ++E E++ DR + Q+ PE + G +T
Sbjct: 68 RVLKFSI----LDDHIELENYKVQEQEQFFGASRDREQLQKLTPELLEKMPGCDMTVNWM 123
Query: 166 -IHLR-RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGN 223
H + + GK C+ E NS++ + ++ ++DRG D + +
Sbjct: 124 GNHFQGKIKPGKACIVERKGKITYLDNSFE------------IDQEKLISYDRGRDPETD 171
Query: 224 Q-VWGVKAGPYEFKPA 238
+ VWG AGP+EF P
Sbjct: 172 ELVWGSVAGPFEFTPV 187
>gi|86608101|ref|YP_476863.1| hypothetical protein CYB_0612 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86556643|gb|ABD01600.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 195
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 82/193 (42%), Gaps = 18/193 (9%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A+ L G ++ QA +P F + + + + EQ TP + +R R
Sbjct: 13 LANWLEGYYSNRTQAMAEPVWFVPVSLWYVRLPHLFSEGIGFFTEQFNQHTPGQFYRSRV 72
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
V P + + +Y +R+ + P QR L + + D+ EHL +
Sbjct: 73 LQVLGDP------LRIENYKLRDQAAWAGASQDP--QR-LAQLSLADL-EHLPGCRIL-V 121
Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDD-DGNQVWGVKA 230
++ C Y G PGG +TY E L +E TWDRGFD G Q WG +A
Sbjct: 122 EKRPDC-YHGEMLPGGECRLNPGDSTYIHIEFD-LTPSEFITWDRGFDAATGQQTWGSRA 179
Query: 231 GPYEFK----PAP 239
GPY ++ PAP
Sbjct: 180 GPYRYQKGIPPAP 192
>gi|86610167|ref|YP_478929.1| CpeT protein [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86558709|gb|ABD03666.1| CpeT protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 199
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 87/190 (45%), Gaps = 18/190 (9%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A L+G+ ++EQA +P F IR+ P V G Y+EQA+ + P+RQ
Sbjct: 12 MARWLAGDFSNQEQAWENPPFFAQIRVAYRPLPAHVLGGIGFYVEQAYSGHLEDPYRQAV 71
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
+ + + + +Y E + ++ C R R + L + D+ +L C
Sbjct: 72 VEL----TQASAGIVIRNYRPLEPQRWRG-CARERADQ-LSQLQAADLA------YLEGC 119
Query: 172 D---RGKRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDGN-QVW 226
D + + L+ G T PG +NG ATY + L L +E + DRG D + QVW
Sbjct: 120 DVHVKRQGSLFVGVTEPGCRCCVVRNGQATYLQTTLH-LSESEFCSHDRGMDPITHRQVW 178
Query: 227 GVKAGPYEFK 236
G AG + F+
Sbjct: 179 GALAGAFRFQ 188
>gi|428302139|ref|YP_007140445.1| hypothetical protein Cal6303_5596 [Calothrix sp. PCC 6303]
gi|428238683|gb|AFZ04473.1| protein of unknown function DUF1001 [Calothrix sp. PCC 6303]
Length = 200
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 42/195 (21%), Positives = 85/195 (43%), Gaps = 6/195 (3%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+ + L+GE + +Q+ +P F ++R+ P + S + EQA +P+RQR
Sbjct: 10 LGNYLAGEFDNSKQSLAEPAWFVHLRLWHIPVPLFPEDSITFFAEQANIVNLDQPYRQRI 69
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
++ ++ ++V YA+++ P + L + + D+ + + ++
Sbjct: 70 IRIRQV-GNVESSLQVQYYALKDYISLVGAGKEPGRLKNLTPDDLEDLPGCVLEVEIKET 128
Query: 172 DRGKRCLYEGSTPPGGFPNSWQ-NGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVK 229
+ + +TP S++ NG S + E+ T+D+G D + G WG
Sbjct: 129 SHHLQFI---ATPVANSICSFRYNGNIVRVSLGFEVTQEELKTYDKGIDPETGRATWGAI 185
Query: 230 AGPYEFKPAPSSSYS 244
GPY ++ SS S
Sbjct: 186 LGPYHYQKRKQSSLS 200
>gi|113478364|ref|YP_724425.1| hypothetical protein Tery_5043 [Trichodesmium erythraeum IMS101]
gi|123056021|sp|Q10UX2.1|CPXT2_TRIEI RecName: Full=Chromophore lyase CpcT/CpeT 2
gi|110169412|gb|ABG53952.1| protein of unknown function DUF1001 [Trichodesmium erythraeum
IMS101]
Length = 195
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 13/176 (7%)
Query: 62 SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
+++QA +P F +IR+ P + LYLEQA+ T +P+R R ++K P E
Sbjct: 20 NQQQAFDNPPLFAHIRVCMRPLPYQLLNGLSLYLEQAYDITLNQPYRVR--VLKLVPSE- 76
Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
+E+ +Y I + E+ P+ + L E I + L GK ++G
Sbjct: 77 -NHIEIENYIIDQEAEFYGASRDPQRLKNLKTEYI----KKLPGCTFITKWTGKS--FKG 129
Query: 182 STPPG-GFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVKAGPYEF 235
PG G + +TY S + ++ I + D G+D + QVWG AGP+EF
Sbjct: 130 EVEPGKGCTVVREGKSTYLKSYFEINEHKFI-SHDTGYDPETDQQVWGAIAGPFEF 184
>gi|428776095|ref|YP_007167882.1| hypothetical protein PCC7418_1479 [Halothece sp. PCC 7418]
gi|428690374|gb|AFZ43668.1| protein of unknown function DUF1001 [Halothece sp. PCC 7418]
Length = 197
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 88/204 (43%), Gaps = 27/204 (13%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A L+GE ++ QA P F N+++ P +DG+ L+ EQA ++ +RQR
Sbjct: 5 LAQWLAGEFSNESQALSQPAWFVNLKLWHRPLPVSIDGNYALFAEQAPALKLEQAYRQRI 64
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
++++P E K + V YA +E ++++ P + L ++ L+
Sbjct: 65 FVIQPA-TETK-PMTVQYYAFKEHQKWRGAGMNP---------------QILDSLRLQDI 107
Query: 172 DRGKRCLYEGSTPPGG-----FPNS----WQNGATYSTSELAVLKNNEIHTWDRGFDDDG 222
++ C E T FPNS + G + + E + D+G D +
Sbjct: 108 EKLPGCALEVITTENKFSAQPFPNSVCQFYIQGNLCQIELGFAVTSQEFFSSDKGIDPET 167
Query: 223 NQ-VWGVKAGPYEFKPAPSSSYSD 245
+ +WG PY+F+ S S+
Sbjct: 168 QKPIWGALIAPYQFQKMQGFSVSE 191
>gi|86607187|ref|YP_475950.1| hypothetical protein CYA_2568 [Synechococcus sp. JA-3-3Ab]
gi|123505022|sp|Q2JRQ9.1|CPXT3_SYNJA RecName: Full=Chromophore lyase CpcT/CpeT 3
gi|86555729|gb|ABD00687.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length = 198
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 79/189 (41%), Gaps = 19/189 (10%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A ++G + QA +P F NI + CP V Y+EQ + P +P+RQR
Sbjct: 10 LATWMAGHFNNLHQAIAEPVWFANIHVYQCPLPWSVFQGIGFYVEQLYDIYPDQPYRQRV 69
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDI----GEHLTTIH 167
+ P ++ + +YA+ E YK RD L ++ G +
Sbjct: 70 IHLFETPDGIR----IQNYALGSPEAYKC---AGRDLGKLASLAAAELELLPGCAVQVEW 122
Query: 168 LRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VW 226
C RG+ S P G + TY SE + + H+ D+G D + +Q VW
Sbjct: 123 TGSCYRGR------SVPGKGCIVERKGRTTYLYSEFEI-GADYFHSLDQGRDPETDQVVW 175
Query: 227 GVKAGPYEF 235
G +GP+ F
Sbjct: 176 GSLSGPFRF 184
>gi|254422202|ref|ZP_05035920.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
gi|196189691|gb|EDX84655.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
Length = 196
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/184 (25%), Positives = 82/184 (44%), Gaps = 17/184 (9%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
++ E ++EQA +P F +I++ P + LYLEQA+ KP+R R
Sbjct: 14 MAAEFSNQEQAFENPPFFAHIKVCMRPLPKSFQPGVSLYLEQAYSFQLDKPYRVRVLHF- 72
Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
+ + DV + +Y +++ E+ F RD L + D+ + CD
Sbjct: 73 ---IQREDDVLLENYKVKDEEK---FYGAARDLEKLATLSVDDLEPMMG------CDIFV 120
Query: 176 RCL---YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAG 231
+ G PG + + GAT ++ + + ++DRG D + ++ VWG AG
Sbjct: 121 EWTGDSFRGKVEPGKKCSVIRKGATTYLDNEFIVTESHMTSYDRGRDPETDELVWGSVAG 180
Query: 232 PYEF 235
P+EF
Sbjct: 181 PFEF 184
>gi|282898790|ref|ZP_06306777.1| protein of unknown function DUF1001 [Cylindrospermopsis raciborskii
CS-505]
gi|281196317|gb|EFA71227.1| protein of unknown function DUF1001 [Cylindrospermopsis raciborskii
CS-505]
Length = 197
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 89/201 (44%), Gaps = 21/201 (10%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQA 98
+T S+ AR +A S ++ QA +P + +IR+ P A + L++EQA
Sbjct: 1 MTHSQDVLTLARWMAADFS----NQAQAFENPPLYAHIRVCMRPLAYSLLSGVSLFVEQA 56
Query: 99 FWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIRE--VEEYKNFCDRPRDQRPLPEEVI 156
+ + P+R R LK VE I V++ K+F RD L + +
Sbjct: 57 YDYDLKDPYRVRV---------LKLLVEQGQIIIENYTVKDEKDFYGSSRDLGKL-QNLT 106
Query: 157 GDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWD 215
D E L ++R G + G+ PG ++ G TY S+ + + + +WD
Sbjct: 107 ADRLEKLCGCNMRVEWTGS--CFTGTVEPGKNCLVFRKGQKTYLDSKFEI-DDQKFLSWD 163
Query: 216 RGFD-DDGNQVWGVKAGPYEF 235
RG D + G VWG AGP+ F
Sbjct: 164 RGRDLETGQHVWGSLAGPFHF 184
>gi|56750773|ref|YP_171474.1| hypothetical protein syc0764_d [Synechococcus elongatus PCC 6301]
gi|81299583|ref|YP_399791.1| hypothetical protein Synpcc7942_0772 [Synechococcus elongatus PCC
7942]
gi|56685732|dbj|BAD78954.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81168464|gb|ABB56804.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 197
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/186 (26%), Positives = 75/186 (40%), Gaps = 15/186 (8%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
LSGE ++ QA +P +F +R+ P L+LEQA +P+RQR ++
Sbjct: 11 LSGEFSNQPQAIAEPVHFVQLRLWQRPLLLFGHDRPCLFLEQANALQLDRPYRQRLLCLE 70
Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPL-PEEVIGDIGEHLTTIHLRRCDRG 174
+ + Y + P R L PE+V + L + L ++
Sbjct: 71 ---ADFSGQLSAQFYQFKNAAAVLGAGQDPDRLRSLTPEDV-----QLLESCRL-AVEQT 121
Query: 175 KRCLYEGSTPPGGFPNSWQNGATYSTS-ELAVLKNNE---IHTWDRGFD-DDGNQVWGVK 229
R Y P G W +G S V ++ H++DRG D + G +WG
Sbjct: 122 SRGGYNAQLPAGCRCRFWTDGIERQVSLGFEVWSESDRVNFHSFDRGIDPETGRSLWGAL 181
Query: 230 AGPYEF 235
GPYEF
Sbjct: 182 MGPYEF 187
>gi|428770503|ref|YP_007162293.1| hypothetical protein Cyan10605_2163 [Cyanobacterium aponinum PCC
10605]
gi|428684782|gb|AFZ54249.1| protein of unknown function DUF1001 [Cyanobacterium aponinum PCC
10605]
Length = 194
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 91/212 (42%), Gaps = 27/212 (12%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQA 98
++KS AR++A S +++QA +P F +IR+ P E + G L+LEQA
Sbjct: 1 MSKSTDVKGLARLMAADFS----NQQQAWDNPPFFAHIRVCMRPLPESLLGGTSLFLEQA 56
Query: 99 FWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGD 158
+ +P+R R + ++ L E+ Y ++E E F R++ L E
Sbjct: 57 YDFLLNQPYRLRVFKIQAVDDHL----ELEHYKLKEEAE---FYGASRNREKLKE----- 104
Query: 159 IGEHLTTIHLRR---CDRGKR---CLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIH 212
LT HL + CD ++G PG + G + ++
Sbjct: 105 ----LTIDHLEKMNGCDMIAEWNGSHFKGYIKPGKACIVVRKGKESYLDNSFEIDERKLI 160
Query: 213 TWDRGFDDDGNQ-VWGVKAGPYEFKPAPSSSY 243
++DRG D ++ +WG AGP+ F S ++
Sbjct: 161 SFDRGRDLVTDELLWGSVAGPFHFTRIESFAH 192
>gi|385808825|ref|YP_005845221.1| hypothetical protein IALB_0241 [Ignavibacterium album JCM 16511]
gi|383800873|gb|AFH47953.1| Hypothetical protein IALB_0241 [Ignavibacterium album JCM 16511]
Length = 87
Score = 48.5 bits (114), Expect = 0.003, Method: Composition-based stats.
Identities = 21/41 (51%), Positives = 29/41 (70%), Gaps = 1/41 (2%)
Query: 195 GATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEF 235
GATY+T+E+ + K+ I +WDRGF+D QVWG + G Y F
Sbjct: 43 GATYATTEVVITKDKMI-SWDRGFNDKDEQVWGAEKGGYIF 82
>gi|428778548|ref|YP_007170334.1| hypothetical protein Dacsa_0162 [Dactylococcopsis salina PCC 8305]
gi|428692827|gb|AFZ48977.1| CpeT/CpcT family (DUF1001) [Dactylococcopsis salina PCC 8305]
Length = 193
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/195 (22%), Positives = 84/195 (43%), Gaps = 12/195 (6%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A L+GE +KEQA +P F ++++ P ++DG+ L+ EQA +P+RQR
Sbjct: 9 LAQWLAGEFENKEQAMSEPAWFVSLKLWHRPLPFLIDGNYALFAEQAPALKLDRPYRQRV 68
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
+++ + + YA ++ + ++ R+ + L + D+ E L L+
Sbjct: 69 FVITATDP-----IAIQYYAFKQPQLWQG---SGRNPQQLDSLRLDDL-EKLPDCVLQVT 119
Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKA 230
D K + P + G + E + D+G D + + +WG
Sbjct: 120 DTEKE--FSAKPLPNAVCQFYAQGRLCEVKLGFTVTAEEFFSDDKGIDPETKKPIWGALM 177
Query: 231 GPYEFKPAPSSSYSD 245
PY F+ S + S+
Sbjct: 178 SPYRFQKIQSFTVSE 192
>gi|16331059|ref|NP_441787.1| hypothetical protein slr1649 [Synechocystis sp. PCC 6803]
gi|383322801|ref|YP_005383654.1| hypothetical protein SYNGTI_1892 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325970|ref|YP_005386823.1| hypothetical protein SYNPCCP_1891 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491854|ref|YP_005409530.1| hypothetical protein SYNPCCN_1891 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384437122|ref|YP_005651846.1| hypothetical protein SYNGTS_1893 [Synechocystis sp. PCC 6803]
gi|451815216|ref|YP_007451668.1| hypothetical protein MYO_119100 [Synechocystis sp. PCC 6803]
gi|81671316|sp|P74371.1|CPXT_SYNY3 RecName: Full=Chromophore lyase CpcT/CpeT
gi|1653553|dbj|BAA18466.1| slr1649 [Synechocystis sp. PCC 6803]
gi|339274154|dbj|BAK50641.1| hypothetical protein SYNGTS_1893 [Synechocystis sp. PCC 6803]
gi|359272120|dbj|BAL29639.1| hypothetical protein SYNGTI_1892 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359275290|dbj|BAL32808.1| hypothetical protein SYNPCCN_1891 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278460|dbj|BAL35977.1| hypothetical protein SYNPCCP_1891 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407961573|dbj|BAM54813.1| hypothetical protein BEST7613_5882 [Bacillus subtilis BEST7613]
gi|451781185|gb|AGF52154.1| hypothetical protein MYO_119100 [Synechocystis sp. PCC 6803]
Length = 196
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 82/179 (45%), Gaps = 19/179 (10%)
Query: 62 SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
++ QA +P + +IR+ P + G +LL+LEQA+ Q+P+R R +K
Sbjct: 20 NQAQAFENPPFYAHIRVAIRPLDQAKFGDRLLFLEQAYDFMLQRPYRLRVLKLKVVEDH- 78
Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKR---CL 178
+E+ ++ +++ E+ F RD L + D L +H CD
Sbjct: 79 ---IEIENFKVKDEEK---FYGAARDLGKLAQLTPAD----LEPMH--GCDMIVEWTGTS 126
Query: 179 YEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYEF 235
++G PG ++G TY + V + I + DRG+D + N+ VWG AG + F
Sbjct: 127 FKGEVQPGRQCRVMRDGKETYLENSFEVSETGLI-SLDRGYDPETNERVWGSVAGAFHF 184
>gi|428216190|ref|YP_007089334.1| hypothetical protein Oscil6304_5951 [Oscillatoria acuminata PCC
6304]
gi|428004571|gb|AFY85414.1| CpeT/CpcT family (DUF1001) [Oscillatoria acuminata PCC 6304]
Length = 198
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 89/206 (43%), Gaps = 19/206 (9%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPA-AEMVDGSKLLYLEQ 97
+T S AR +A S + +QA +P F +IR+ P E +DG L YLEQ
Sbjct: 1 MTHSTDVLTLARWMASDFS----NFDQAIENPPFFAHIRVCMRPLPYEFLDGVSL-YLEQ 55
Query: 98 AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIR-EVEEYKNFCDRPRDQRPLPEEVI 156
A+ +P+R R ++K PK + +E+ +Y ++ E Y + DR Q P+++
Sbjct: 56 AYDYMLDQPYRVR--VLKLVPKGDR--IEIENYKVQDEASFYGSARDRHLLQSLTPDKLE 111
Query: 157 GDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDR 216
G T ++G PG + G T + N + DR
Sbjct: 112 KIPGSSFIT-------EWTGSSFKGEVEPGKNCIVVRKGRTTYLDSTFEIDENLFISHDR 164
Query: 217 GFDDDGNQ-VWGVKAGPYEFKPAPSS 241
G D D ++ VWG AGP+ F +S
Sbjct: 165 GLDPDTDEMVWGSVAGPFVFNKRVAS 190
>gi|428222477|ref|YP_007106647.1| hypothetical protein Syn7502_02541 [Synechococcus sp. PCC 7502]
gi|427995817|gb|AFY74512.1| CpeT/CpcT family (DUF1001) [Synechococcus sp. PCC 7502]
Length = 192
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 83/190 (43%), Gaps = 21/190 (11%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A ++GE ++EQA +P + N+R+ P + + + EQA +P+RQR
Sbjct: 8 LAQYMAGEFSNREQAIAEPVWYVNLRLWQIPLPQSLLDGIAFFAEQANIINLDQPYRQRI 67
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
+ +E +++ YA+ E ++ RP + L I D+ I L C
Sbjct: 68 INL----RETDGNIQADYYALIEPSAWRGAATRPEQLQQLS---INDL------IPLPDC 114
Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGATYS--TSELAV---LKNNEIHTWDRGFD-DDGNQV 225
+ YE T P + + TY T ++++ +E ++D+G D + G +
Sbjct: 115 TL--KVKYEQETFKAELPINAKCCFTYQGETRQVSLGFAASKSEFLSYDKGIDMNTGAGI 172
Query: 226 WGVKAGPYEF 235
WG G Y F
Sbjct: 173 WGAITGAYRF 182
>gi|428316405|ref|YP_007114287.1| protein of unknown function DUF1001 [Oscillatoria nigro-viridis PCC
7112]
gi|428240085|gb|AFZ05871.1| protein of unknown function DUF1001 [Oscillatoria nigro-viridis PCC
7112]
Length = 196
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 55/183 (30%), Positives = 88/183 (48%), Gaps = 15/183 (8%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPA-AEMVDGSKLLYLEQAFWRTPQKPFRQRFYMV 114
++ + ++ QA +P F +IR+ P E++DG L YLEQA+ P+R R ++
Sbjct: 14 MAADFSNQAQAFENPPFFAHIRVCMRPLPIEVLDGISL-YLEQAYDIELNVPYRVR--VL 70
Query: 115 KPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRG 174
K P + D+E +YAI E + F RD + L E + + + G
Sbjct: 71 KLVPAGDRIDIE--NYAI---ENEEQFYGASRDPQRLQEIKTAQLTLLPGCTFITQW-TG 124
Query: 175 KRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDGN-QVWGVKAGP 232
K ++G PG +NG TY SE + ++ + DRG D + + VWG +AGP
Sbjct: 125 KS--FKGFVEPGKGCAVVRNGKKTYLDSEFEI-DQDKFTSHDRGRDPETDAHVWGAQAGP 181
Query: 233 YEF 235
+EF
Sbjct: 182 FEF 184
>gi|186685249|ref|YP_001868445.1| hypothetical protein Npun_R5170 [Nostoc punctiforme PCC 73102]
gi|186467701|gb|ACC83502.1| protein of unknown function DUF1001 [Nostoc punctiforme PCC 73102]
Length = 201
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 77/186 (41%), Gaps = 5/186 (2%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+ + L+GE ++EQA +P + ++RM P + S L+ EQA KP+RQR
Sbjct: 10 LGEYLAGEFDNREQAIEEPVWYVHLRMWQRPVNLFTEDSITLFAEQASVINLDKPYRQRI 69
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDI-GEHLTTIHLRR 170
++ +EV Y ++ + + + P L + + + G LT H
Sbjct: 70 MRLRQGSNS-DTSLEVQYYIPQDPDALRGAGNNPALLNTLSAKQLDLLPGCVLTVTH--E 126
Query: 171 CDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVK 229
G R + + P + G + S E H++D+G D + G WG
Sbjct: 127 ALAGDRYKFTATPQPETRCSFTYLGNSIYVSLGFETTAAEFHSYDKGIDPETGKATWGAI 186
Query: 230 AGPYEF 235
GPY +
Sbjct: 187 MGPYRY 192
>gi|158340571|ref|YP_001521565.1| CpcT protein, putative [Acaryochloris marina MBIC11017]
gi|158310812|gb|ABW32426.1| CpcT protein, putative [Acaryochloris marina MBIC11017]
Length = 197
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 82/188 (43%), Gaps = 17/188 (9%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A L+ E ++ QA +P + +I++ P + LYLEQA+ +KP+R R
Sbjct: 10 LAKWLAAEFSNQAQAIENPPLYAHIQVSIRPLPRKLLTGYCLYLEQAYAFMLKKPYRTRV 69
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
+ E K + + +Y E+ + + F RD PL I EHL I C
Sbjct: 70 IHI----LEHKDSILIENY---ELHKKELFLGGTRD--PLILNKIST--EHL--IKKEGC 116
Query: 172 DRGKRCL---YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
D + G+ PG +NG T + N+ + DRG D + ++ +WG
Sbjct: 117 DMNVEWTGHSFVGTVVPGKSCKVVRNGKTTYLDNSFEVDQNKFISIDRGRDPETDELIWG 176
Query: 228 VKAGPYEF 235
AGP+ F
Sbjct: 177 SIAGPFHF 184
>gi|434397505|ref|YP_007131509.1| protein of unknown function DUF1001 [Stanieria cyanosphaera PCC
7437]
gi|428268602|gb|AFZ34543.1| protein of unknown function DUF1001 [Stanieria cyanosphaera PCC
7437]
Length = 208
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 53/221 (23%), Positives = 92/221 (41%), Gaps = 32/221 (14%)
Query: 35 LLKRLTKSKTRWDHARI-VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLL 93
+L + S+T+ + I +A ++G+ + +QA D + +IR+ P
Sbjct: 1 MLPTIDNSQTQETNLLIDLAKLMAGDFSNHKQALADSKTYAHIRIFFRPLPYEFFSGIGF 60
Query: 94 YLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPE 153
Y EQ + P+RQ + + C E + + +Y++++ Y
Sbjct: 61 YSEQTYDYDLWSPYRQGVHRLIDCGDE----IYIENYSLQDAYLYAG------------- 103
Query: 154 EVIGDIGEHLTTIHLRRCDRGKRC---------LYEGSTPPGGFPNSWQNGA-TYSTSEL 203
G + L TI+ +R C L+ GS PG +NG TY SE+
Sbjct: 104 --AGHNRDILLTINPNCLERRYHCSMIFKKDGDLFRGSVEPGNRCLINRNGTETYLVSEV 161
Query: 204 AVLKNNEIHTWDRGFD-DDGNQVWGVKAGPYEFKPAPSSSY 243
+ + I +WDRG D Q+WG AGP +F+ S ++
Sbjct: 162 ELTEATWI-SWDRGMDLVTHQQIWGSAAGPLKFEKRASFAH 201
>gi|260436513|ref|ZP_05790483.1| hypothetical protein SH8109_0091 [Synechococcus sp. WH 8109]
gi|260414387|gb|EEX07683.1| hypothetical protein SH8109_0091 [Synechococcus sp. WH 8109]
Length = 197
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 45/188 (23%), Positives = 84/188 (44%), Gaps = 12/188 (6%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
R + L GE +++QA +P + +I + P ++ GS LL EQ + P+ P+R
Sbjct: 7 RRFLNLLCGEFSNQQQAFDNPPLYAHIFLRYRPLIQLRPGSILL--EQTYAVDPKHPYRL 64
Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
R + ++ +++ ++ R+ E + P+ + + E + + + H +
Sbjct: 65 RMIRAE---EQASGAIKLWNHIFRDPERFAGATFDPQLRLAIQERDLISLNQ----CHYQ 117
Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGV 228
++ Y G+ PG +NG L+ + + T DRG D D N+ WG
Sbjct: 118 VLEQPDG--YHGAIEPGCKCIVQRNGEDTVLVSSFHLEGDSLSTLDRGHDPDTNERCWGS 175
Query: 229 KAGPYEFK 236
AGP+ FK
Sbjct: 176 VAGPFRFK 183
>gi|434393935|ref|YP_007128882.1| protein of unknown function DUF1001 [Gloeocapsa sp. PCC 7428]
gi|428265776|gb|AFZ31722.1| protein of unknown function DUF1001 [Gloeocapsa sp. PCC 7428]
Length = 195
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 78/188 (41%), Gaps = 5/188 (2%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+ +A L+GE ++EQA +P + ++R+ P A + S L+ EQA +P+RQ
Sbjct: 8 KALAQYLTGEFTNQEQAIAEPAWYVHLRLWHQPVALFTEDSFTLFAEQANIVKLDQPYRQ 67
Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
R ++ P ++ ++V Y + + P + L E + L + +
Sbjct: 68 RLIRLQQRPDDVA--LQVQYYMPKNPTALRGAGANPDILKTLTPEDFELLPGCLLDVTVN 125
Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDD-GNQVWGV 228
+ +R ++ + P G T S E ++D+G D G WG
Sbjct: 126 QLERDYH--FKATLPSDRRCCFAYAGNTVQVSIGFEASPEEFLSYDKGIDSTTGKATWGA 183
Query: 229 KAGPYEFK 236
GPY F+
Sbjct: 184 ILGPYRFR 191
>gi|425469168|ref|ZP_18848127.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9701]
gi|389883085|emb|CCI36480.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9701]
Length = 194
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 85/188 (45%), Gaps = 13/188 (6%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+ +A ++ + ++EQA +P F +IR+ P + + G L+LEQA+ P+R
Sbjct: 8 KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLGGTSLFLEQAYDFMLNTPYRL 67
Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
R + + +E+ ++ ++E + P+ + L E++ E + +
Sbjct: 68 RVFKLSVVDDH----IELENFKVKEEANFFGASREPQRLKNLSLELL----EPMAGCDMN 119
Query: 170 RCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
G ++G PG + ++ TY + + + I DRG D + +Q VWG
Sbjct: 120 VTWTGNS--FKGVVKPGKQCLVFRKDRMTYLDNSFEISERGLISV-DRGLDTETDQLVWG 176
Query: 228 VKAGPYEF 235
AGP+EF
Sbjct: 177 SIAGPFEF 184
>gi|307154999|ref|YP_003890383.1| hypothetical protein Cyan7822_5227 [Cyanothece sp. PCC 7822]
gi|306985227|gb|ADN17108.1| protein of unknown function DUF1001 [Cyanothece sp. PCC 7822]
Length = 207
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 37/183 (20%), Positives = 77/183 (42%), Gaps = 3/183 (1%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
L+GE ++ QA F ++R+ P + S L+ EQA +P+R R + ++
Sbjct: 16 LAGEFENRPQAMESAAWFVHLRLWLRPVPLFTEDSITLFAEQASIVNLDQPYRPRLWRLR 75
Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
C ++V Y +++ + +P + L E + + T+ +
Sbjct: 76 QVSTSPVC-LQVEHYMFKDISPVQGLGRKPELLQQLTLEQVECLTAPGCTLKVESNQLAG 134
Query: 176 RCLYEGSTPPGGFPNSWQ-NGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVKAGPY 233
+ + P ++ G ++ S + + E+ T+D+G D + G +WG GPY
Sbjct: 135 NHYHFKALAVSESPCAFSYQGQSFQVSLGFEVNSQELKTYDKGIDINTGRAIWGALMGPY 194
Query: 234 EFK 236
++
Sbjct: 195 CYQ 197
>gi|257059192|ref|YP_003137080.1| hypothetical protein Cyan8802_1322 [Cyanothece sp. PCC 8802]
gi|256589358|gb|ACV00245.1| hypothetical protein Cyan8802_1322 [Cyanothece sp. PCC 8802]
Length = 235
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 61/150 (40%), Gaps = 16/150 (10%)
Query: 93 LYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLP 152
++LEQ F TPQ P R R+Y P L V +S + + C+ P +R L
Sbjct: 83 IFLEQTFLTTPQPP-RLRYYTFSP----LTTGVLLSVRSFIDQTHLSGLCNLPESERTLE 137
Query: 153 EEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIH 212
+I I L R R Y G+ P G P + +G++ + + N+I
Sbjct: 138 ---FSNINPLSCDIELSRLVEPVR--YFGTNSPDGCPAA--SGSSVTVISTISIHPNQID 190
Query: 213 TWDRGFDDDGNQVWGVKAGPYEFKPAPSSS 242
++D+GF + P F P SS
Sbjct: 191 SFDQGFIGT----IPIFGTPITFNRVPESS 216
>gi|282895778|ref|ZP_06303865.1| Protein of unknown function DUF1001 [Raphidiopsis brookii D9]
gi|281199278|gb|EFA74144.1| Protein of unknown function DUF1001 [Raphidiopsis brookii D9]
Length = 197
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 89/201 (44%), Gaps = 21/201 (10%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQA 98
+T S+ AR +A S ++ QA +P + +IR+ P A + L++EQA
Sbjct: 1 MTHSQDLLTLARWMAADFS----NQAQAFENPPLYAHIRVCMRPLAYSLLSGVSLFVEQA 56
Query: 99 FWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIRE--VEEYKNFCDRPRDQRPLPEEVI 156
+ + P+R R LK VE I V++ K+F RD L + +
Sbjct: 57 YDYDLKDPYRVRV---------LKLLVEEGQIIIENYTVKDEKDFYGSSRDLGKL-QNLT 106
Query: 157 GDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWD 215
D E L ++R G + G+ G ++ G TY S+ + + + +WD
Sbjct: 107 ADRLEKLCGCNMRVEWTGN--CFTGTVESGKNCLVFRKGQKTYLDSKFQI-DDQKFLSWD 163
Query: 216 RGFDDDGNQ-VWGVKAGPYEF 235
RG D + +Q VWG AGP+ F
Sbjct: 164 RGRDLETDQHVWGSLAGPFHF 184
>gi|334119341|ref|ZP_08493427.1| protein of unknown function DUF1001 [Microcoleus vaginatus FGP-2]
gi|333458129|gb|EGK86748.1| protein of unknown function DUF1001 [Microcoleus vaginatus FGP-2]
Length = 196
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 61/203 (30%), Positives = 90/203 (44%), Gaps = 25/203 (12%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPA-AEMVDGSKLLYLEQ 97
+T S AR +A S + QA +P F +IR+ P E++DG L YLEQ
Sbjct: 1 MTHSTDIVTLARWMAADFSNQA----QAFENPPFFAHIRVCMRPLPVEVLDGLSL-YLEQ 55
Query: 98 AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
A+ P+R R + P +E+ +YAI E + F RD + L E
Sbjct: 56 AYDIELNVPYRVRVLKLVPVDDR----IEIENYAI---ENEEQFYGASRDPQRLQE---- 104
Query: 158 DIGEHLTTIHLRRCDRGKRCL---YEGSTPPG-GFPNSWQNGATYSTSELAVLKNNEIHT 213
I + T+ L C + ++G PG G + TY SE + ++ I +
Sbjct: 105 -IKKAQLTL-LPGCTFITQWTGNSFKGFVEPGKGCMVVRKGKKTYLDSEFEIDQDKFI-S 161
Query: 214 WDRGFDDDGN-QVWGVKAGPYEF 235
DRG D + + VWG AGP+EF
Sbjct: 162 HDRGRDPETDAHVWGALAGPFEF 184
>gi|428218917|ref|YP_007103382.1| hypothetical protein Pse7367_2699 [Pseudanabaena sp. PCC 7367]
gi|427990699|gb|AFY70954.1| protein of unknown function DUF1001 [Pseudanabaena sp. PCC 7367]
Length = 202
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 49/201 (24%), Positives = 89/201 (44%), Gaps = 25/201 (12%)
Query: 53 ADSLSGEKFSKEQASRDPDNFFNIRMLTCPA-AEMVDGSKLLYLEQAFWRTPQKPFR-QR 110
A ++G+ + +QA +P F +IR+ P ++ D LY EQA+ +P+R
Sbjct: 14 AQWMTGDYSNWDQAIANPPLFAHIRVCIRPLPNQLSDHGIWLYSEQAYDFQIDRPYRIAV 73
Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR 170
++V + +E+ ++ + + E + +P + + + L
Sbjct: 74 LHLVLAADR-----IEIENHKLNDEESFYGAARQPERLAAITAA---------SVVKLPG 119
Query: 171 C----DRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ- 224
C DR + G PG N +N TY E + + N + T DRG+D + ++
Sbjct: 120 CNILVDRTPTGSFVGKIEPGKKCNVHRKNQDTYLVHEFELNQTN-LTTLDRGYDPETDER 178
Query: 225 VWGVKAGPYEFKPAPSSSYSD 245
VWG AGP+EF +S++D
Sbjct: 179 VWGTIAGPFEF--VKQASFAD 197
>gi|427719033|ref|YP_007067027.1| hypothetical protein Cal7507_3805 [Calothrix sp. PCC 7507]
gi|427351469|gb|AFY34193.1| protein of unknown function DUF1001 [Calothrix sp. PCC 7507]
Length = 198
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 42/187 (22%), Positives = 75/187 (40%), Gaps = 5/187 (2%)
Query: 51 IVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQR 110
++ + LSGE ++ QA DP + ++R+ P ++ S L+ EQA P+RQR
Sbjct: 9 VLGEYLSGEFDNQAQAIADPVWYVHLRLWQRPVNLFLEDSITLFAEQANVINLDNPYRQR 68
Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDI-GEHLTTIHLR 169
+ P ++ ++V Y ++ + P L E + + G LT
Sbjct: 69 IIRLLP-RRDDDAQLQVQYYMPKDPSALRGAGKNPALLNTLTSEQLDLLPGCFLTVTKQT 127
Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDD-GNQVWGV 228
+ + + PP G + S E +++D+G D + G WG
Sbjct: 128 LATNDYK--FAATPPPNTCCQFTYLGNSIQVSLGFEATAKEFYSYDKGIDSETGKATWGA 185
Query: 229 KAGPYEF 235
GPY +
Sbjct: 186 TIGPYRY 192
>gi|148238834|ref|YP_001224221.1| phycoerythrin linker gene region [Synechococcus sp. WH 7803]
gi|147847373|emb|CAK22924.1| Conserved hypothetical protein, CpeT homolog, phycoerythrin linker
gene region [Synechococcus sp. WH 7803]
Length = 204
Score = 44.7 bits (104), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 47/186 (25%), Positives = 79/186 (42%), Gaps = 11/186 (5%)
Query: 53 ADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFY 112
A +L+G +++QA +P +F +I + P V + Y EQ++ P P+RQ +
Sbjct: 12 AKTLAGHYSNRKQAQNNPKDFAHINIFFRPLPWEVLNAPGFYSEQSYDHAPWSPYRQGVH 71
Query: 113 MVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCD 172
+ P + V ++ + RP L + + +H R D
Sbjct: 72 RLLPAEGD---HFIVENFGFGDPIRLAGAGQRPELLNSLNTDSLKPRCG--CGMHFREID 126
Query: 173 RGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGN-QVWGVKA 230
G Y G PG ++G TY SE+ + + I + DRGFD + + Q WG +
Sbjct: 127 AGH---YRGGVEPGKNCLVPRDGHLTYLVSEVEMNATDWI-SRDRGFDPETDEQRWGSEH 182
Query: 231 GPYEFK 236
GP F+
Sbjct: 183 GPLRFE 188
>gi|425433932|ref|ZP_18814405.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9432]
gi|389679000|emb|CCH92160.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9432]
Length = 194
Score = 44.3 bits (103), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 85/188 (45%), Gaps = 13/188 (6%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+ +A ++ + ++EQA +P F +IR+ P + + L+LEQA+ P+R
Sbjct: 8 KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLKGTSLFLEQAYDFMLNTPYRL 67
Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
R + + +E+ ++ ++E + P+ + L E++ E + +
Sbjct: 68 RVFKLSVVDDH----IELENFKVKEEANFFGASREPQRLKNLSLELL----EPMPGCDMN 119
Query: 170 RCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
GK ++G PG + ++ TY + + + I DRG D + +Q VWG
Sbjct: 120 VTWTGKS--FKGVVKPGKQCLVFRKDKMTYLDNSFEISEQGLISV-DRGLDPETDQLVWG 176
Query: 228 VKAGPYEF 235
AGP+EF
Sbjct: 177 SIAGPFEF 184
>gi|186681243|ref|YP_001864439.1| hypothetical protein Npun_F0744 [Nostoc punctiforme PCC 73102]
gi|186463695|gb|ACC79496.1| protein of unknown function DUF1001 [Nostoc punctiforme PCC 73102]
Length = 197
Score = 44.3 bits (103), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 44/186 (23%), Positives = 84/186 (45%), Gaps = 13/186 (6%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A +S + ++EQA +P + +IR+ P + L+LEQA+ +P+R R
Sbjct: 10 LARWMSADFSNQEQAFENPPFYAHIRVCIRPLPLELFSGVSLFLEQAYDFMLNQPYRMRV 69
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
+ P + + Y ++E +++ P + L + + E ++ ++
Sbjct: 70 MKLIPAENH----IAIEHYTVKEEQKFYGASREPERLKDLSVDQL----EKMSGCNMIVE 121
Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVK 229
GK ++G PG ++G TY +E + E + DRG D D ++ +WG
Sbjct: 122 WTGKS--FKGRVEPGKGCIVVRDGKNTYLDNEFEI-DAKEFFSLDRGRDLDTDERLWGSI 178
Query: 230 AGPYEF 235
AGP+ F
Sbjct: 179 AGPFHF 184
>gi|37521107|ref|NP_924484.1| hypothetical protein glr1538 [Gloeobacter violaceus PCC 7421]
gi|81710316|sp|Q7NKE0.1|CPXT4_GLOVI RecName: Full=Chromophore lyase CpcT/CpeT 4
gi|35212103|dbj|BAC89479.1| glr1538 [Gloeobacter violaceus PCC 7421]
Length = 183
Score = 44.3 bits (103), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 50/191 (26%), Positives = 76/191 (39%), Gaps = 22/191 (11%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+ V L+G ++ QA +P F + + A + + EQ TP +P+R
Sbjct: 5 QTVRTWLAGTYSNRAQAMAEPVWFIPVTLWYVEVAGLFGEGAGFFTEQVSEHTPNQPYRS 64
Query: 110 RFYMVKPCPKELKC----DVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTT 165
R + P L+ D +V + A ++ E R D LP T
Sbjct: 65 RVLQLLDNPLRLENYRLKDQKVWAGAAKDPERLGRL--RADDCEQLPG----------CT 112
Query: 166 IHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQ 224
++L R RG+ + G PGG + A+Y E L T DRGF+ G Q
Sbjct: 113 LYLER--RGE--TFTGKMQPGGGCRLFPGDASYVEIEFE-LGERLFFTLDRGFEATTGEQ 167
Query: 225 VWGVKAGPYEF 235
WG +AG Y +
Sbjct: 168 TWGSRAGAYRY 178
>gi|422303151|ref|ZP_16390505.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|389791909|emb|CCI12308.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
Length = 194
Score = 43.9 bits (102), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 91/189 (48%), Gaps = 15/189 (7%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFR 108
+ +A ++ + ++EQA +P F +IR+ P E+++G+ L +LEQA+ P+R
Sbjct: 8 KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLNGTSL-FLEQAYDFMLNTPYR 66
Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHL 168
R + + +E+ ++ +V+E NF R+ + L + + D+ E + +
Sbjct: 67 LRVFKLSVVDDH----IELENF---KVKEEANFFGASREPQRL-KNLTLDLLEPMLGCDM 118
Query: 169 RRCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VW 226
G ++G PG + ++ TY + + + I DRG D + +Q VW
Sbjct: 119 NVTWTGNS--FKGVVKPGKQCLVFRKDKMTYLDNSFEISERGLISV-DRGLDPETDQLVW 175
Query: 227 GVKAGPYEF 235
G AGP+EF
Sbjct: 176 GSIAGPFEF 184
>gi|56750747|ref|YP_171448.1| hypothetical protein syc0738_d [Synechococcus elongatus PCC 6301]
gi|56685706|dbj|BAD78928.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 197
Score = 43.9 bits (102), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 87/187 (46%), Gaps = 15/187 (8%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFRQR 110
+A L+ + ++ QA +P F +IR+ P A E++DG L Y+EQA+ +P+R R
Sbjct: 10 LARWLAADFSNQAQAFENPPFFAHIRVCMRPLALEVLDGLSL-YVEQAYDLDVNQPYRAR 68
Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR 170
+ P +E+ ++ + + E+ R RD+ ++ D E L L
Sbjct: 69 VLKLVPAGDH----IEIENFRVIDGEDLFG-ASRDRDRLA---KLTRDRLEKLPGCTLIT 120
Query: 171 CDRGKRCLYEGSTPPG-GFPNSWQNGATYSTSELAVLKNNEIHTWDRG-FDDDGNQVWGV 228
G ++G PG G + ATY SE + + ++ + DRG + QVWG
Sbjct: 121 HWTGNG--FKGVVEPGKGCIVERKGQATYLESEFEI-DDEKMISLDRGRHPETDEQVWGS 177
Query: 229 KAGPYEF 235
AGP+ F
Sbjct: 178 VAGPFHF 184
>gi|443657119|ref|ZP_21131838.1| phycocyanobilin lyase CpcT [Microcystis aeruginosa DIANCHI905]
gi|159029854|emb|CAO90908.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443333248|gb|ELS47816.1| phycocyanobilin lyase CpcT [Microcystis aeruginosa DIANCHI905]
Length = 194
Score = 43.9 bits (102), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 43/193 (22%), Positives = 83/193 (43%), Gaps = 23/193 (11%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+ +A ++ + ++EQA +P F +IR+ P + + L+LEQA+ P+R
Sbjct: 8 KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLKGTSLFLEQAYDFMLNTPYRL 67
Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG-----DIGEHLT 164
R + + +E+ ++ ++E + P+ + L E++ D+ T
Sbjct: 68 RVFKLSVVDDH----IELENFKVKEEANFFGASREPQRLKNLSLELLEPMPGCDMNVTWT 123
Query: 165 TIHLRRCDR-GKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGN 223
+ + GK+CL NS++ + + + DRG D + +
Sbjct: 124 GNSFKGVVKPGKQCLVFRKDKMTYLDNSFE------------ISEQGLISVDRGLDTETD 171
Query: 224 Q-VWGVKAGPYEF 235
Q VWG AGP+EF
Sbjct: 172 QLVWGSIAGPFEF 184
>gi|425464319|ref|ZP_18843641.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389833697|emb|CCI21546.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 194
Score = 43.9 bits (102), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 91/189 (48%), Gaps = 15/189 (7%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFR 108
+ +A ++ + ++EQA +P F +IR+ P E+++G+ L +LEQA+ P+R
Sbjct: 8 KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLNGTSL-FLEQAYDFMLNTPYR 66
Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHL 168
R + + +E+ ++ +V+E NF R+ + L + + D+ E + +
Sbjct: 67 LRVFKLSVVDDH----IELENF---KVKEEANFFGASREPQRL-KNLTLDLLEPMLGCDM 118
Query: 169 RRCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VW 226
G ++G PG + ++ TY + + + I DRG D + +Q VW
Sbjct: 119 NVTWTGNS--FKGVVKPGKQCLVFRKDRMTYLDNSFEISERGLISV-DRGLDPETDQLVW 175
Query: 227 GVKAGPYEF 235
G AGP+EF
Sbjct: 176 GSIAGPFEF 184
>gi|428313473|ref|YP_007124450.1| hypothetical protein Mic7113_5401 [Microcoleus sp. PCC 7113]
gi|428255085|gb|AFZ21044.1| CpeT/CpcT family (DUF1001) [Microcoleus sp. PCC 7113]
Length = 204
Score = 43.5 bits (101), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 76/191 (39%), Gaps = 15/191 (7%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A ++GE +++QA DP + ++ + P + S L+ EQA +P+R R
Sbjct: 10 LARYMAGEFDNRQQAIADPAWYVHLHLWLRPVPLFTEDSLTLFAEQANIVNLDQPYRPRI 69
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
+ P P E ++V Y ++ + + P + L I E L LR
Sbjct: 70 LRLYPNPTE-PGSLQVQFYMLKNPDAIRGAGRNPELLKQLTTAQI----EFLPGCTLRVK 124
Query: 172 DR------GKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQ 224
R + + + P F S+Q G Y S E T+D+G D G
Sbjct: 125 QRLVSPNAYEFSAFSATNTPCSF--SYQ-GKDYQISLGFEATPEEFRTYDKGIDPATGKA 181
Query: 225 VWGVKAGPYEF 235
WG GP+ F
Sbjct: 182 TWGALLGPFRF 192
>gi|425443208|ref|ZP_18823433.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389715535|emb|CCI00103.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 194
Score = 43.5 bits (101), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 91/189 (48%), Gaps = 15/189 (7%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFR 108
+ +A ++ + ++EQA +P F +IR+ P E+++G+ L +LEQA+ P+R
Sbjct: 8 KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLNGTSL-FLEQAYDFMLNTPYR 66
Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHL 168
R + + +E+ ++ +V+E NF R+ + L + + D+ E + +
Sbjct: 67 LRVFKLSVVDDH----IELENF---KVKEEANFFGASREPQRL-KNLTLDLLEPMLGCDM 118
Query: 169 RRCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VW 226
G ++G PG + ++ TY + + + I DRG D + +Q VW
Sbjct: 119 NVTWTGNS--FKGVVKPGKQCLVFRKDRMTYLDNSFEISERGLISV-DRGLDPETDQLVW 175
Query: 227 GVKAGPYEF 235
G AGP+EF
Sbjct: 176 GSIAGPFEF 184
>gi|87300967|ref|ZP_01083809.1| hypothetical protein WH5701_05945 [Synechococcus sp. WH 5701]
gi|87284838|gb|EAQ76790.1| hypothetical protein WH5701_05945 [Synechococcus sp. WH 5701]
Length = 201
Score = 43.5 bits (101), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 51/218 (23%), Positives = 88/218 (40%), Gaps = 36/218 (16%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQA 98
+T S TR + LS ++ QA +P + +I + P ++ GS L LEQ+
Sbjct: 1 MTSSLTR------LVRMLSAGFSNQHQAFENPPIYAHILVRFRPLPQLEPGS--LLLEQS 52
Query: 99 FWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGD 158
+ P +P+R R + P + + ++A+ + + + P + E +
Sbjct: 53 YAFAPGQPYRIRVLRAENAPD---GSLRIHNHALHDERRFWGAVEDPERMAQIQTEDLRL 109
Query: 159 IGEHLTTIH------LRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIH 212
+ +H + + G RCL E + GAT + +
Sbjct: 110 LEGCTYVVHGQGEGFVGEVEPGCRCLVE------------RKGATAYLVSSFEIDGESMR 157
Query: 213 TWDRGFD-DDGNQVWGVKAGPYEFKPAPSSSYSDMFSP 249
T DRG D +Q+WG AGP+EF+ +D FSP
Sbjct: 158 TIDRGHDPATHDQLWGSLAGPFEFQ------RTDDFSP 189
>gi|443320585|ref|ZP_21049676.1| CpeT/CpcT family (DUF1001) [Gloeocapsa sp. PCC 73106]
gi|442789691|gb|ELR99333.1| CpeT/CpcT family (DUF1001) [Gloeocapsa sp. PCC 73106]
Length = 195
Score = 43.5 bits (101), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 83/190 (43%), Gaps = 26/190 (13%)
Query: 55 SLSGEKFS-KEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYM 113
SL FS ++QA +P F +IR+ P L+LEQA+ P+R R
Sbjct: 12 SLMAADFSNQQQAYANPPFFAHIRVCMRPLPYEFLSETSLFLEQAYDCMLDVPYRLRVLK 71
Query: 114 VKPCPKELKCDVEVSSYAIREVEEY----KNFCDRPRDQRPLPEEVI--GDIGEHLTTIH 167
+E+ +Y +RE E++ +N + ++ +P E++ D+ T
Sbjct: 72 FTSVDGH----IELENYKVREQEKFFGASRNL-ELLKNLQPEQLELLEGCDMTVEWTGSS 126
Query: 168 LRRCDR-GKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-V 225
+ + GK C+ E NS++ + ++ ++DRGFD + N+ V
Sbjct: 127 FKGLIKPGKACIVERKGKKTYLDNSFE------------IDEQKLISFDRGFDPETNELV 174
Query: 226 WGVKAGPYEF 235
WG AGP+ F
Sbjct: 175 WGSIAGPFNF 184
>gi|166363473|ref|YP_001655746.1| hypothetical protein MAE_07320 [Microcystis aeruginosa NIES-843]
gi|317374817|sp|B0JQ95.1|CPXT1_MICAN RecName: Full=Chromophore lyase CpcT/CpeT 1
gi|166085846|dbj|BAG00554.1| hypothetical protein MAE_07320 [Microcystis aeruginosa NIES-843]
Length = 194
Score = 43.5 bits (101), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 91/189 (48%), Gaps = 15/189 (7%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFR 108
+ +A ++ + ++EQA +P F +IR+ P E+++G+ L +LEQA+ P+R
Sbjct: 8 KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLNGTSL-FLEQAYDFMLNTPYR 66
Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHL 168
R + + +E+ ++ +V+E NF R+ + L + + D+ E + +
Sbjct: 67 LRVFKLSLVDDH----IELENF---KVKEEANFFGASREPQRL-KNLTLDLLEPMLGCDM 118
Query: 169 RRCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VW 226
G ++G PG + ++ TY + + + I DRG D + +Q VW
Sbjct: 119 NVTWTGNS--FKGVVKPGKQCLVFRKDRMTYLDNSFEISERGLISV-DRGLDPETDQLVW 175
Query: 227 GVKAGPYEF 235
G AGP+EF
Sbjct: 176 GSIAGPFEF 184
>gi|443475946|ref|ZP_21065875.1| protein of unknown function DUF1001 [Pseudanabaena biceps PCC 7429]
gi|443019142|gb|ELS33279.1| protein of unknown function DUF1001 [Pseudanabaena biceps PCC 7429]
Length = 214
Score = 43.5 bits (101), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 96/213 (45%), Gaps = 20/213 (9%)
Query: 51 IVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAE-MVDGSKLLYLEQAFWRTPQKPFRQ 109
I+A L+G+ + EQA +P F +IR+ P + + L+LEQA+ P+R
Sbjct: 15 ILAQWLAGDHSNWEQAIDNPPFFAHIRVGIRPLPNPITENGVWLFLEQAYDFELNHPYRT 74
Query: 110 R-FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQ-RPLPEEVIGDIGEHLTTIH 167
+++ + +E+ +Y ++ E + N R RD+ L + I + ++
Sbjct: 75 AVLHLIYQNDR-----IEMINYRLKNAETFFN-ASRDRDRLAALNADAIAQLEGCTQWVY 128
Query: 168 LRRCDRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGNQ-V 225
R ++GS PG + G TY E V +N + DRG+D ++ V
Sbjct: 129 -----RNDAHTFKGSVEPGKKCCINRKGVDTYLAIEFEVTENT-YSSLDRGYDVVTDERV 182
Query: 226 WGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQFL 258
WG AG ++F +S+SD S L PQ +
Sbjct: 183 WGSIAGAFQF--VKKASFSDEVS-LGSTPQGII 212
>gi|427722227|ref|YP_007069504.1| hypothetical protein Lepto7376_0226 [Leptolyngbya sp. PCC 7376]
gi|427353947|gb|AFY36670.1| protein of unknown function DUF1001 [Leptolyngbya sp. PCC 7376]
Length = 197
Score = 43.5 bits (101), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 88/210 (41%), Gaps = 39/210 (18%)
Query: 49 ARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFR 108
A+ +A ++GE + +QA + F +I++ P + L+LEQA+ ++P+R
Sbjct: 7 AKALATWMAGEFSNFDQALANSALFAHIKVCMRPMPQEFFEGYGLFLEQAYSSGLEQPYR 66
Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG-----DIGEHL 163
R + +KP L E+ YK P+++ + E+ G DI L
Sbjct: 67 LRVFHIKPVGDHL------------ELIHYK-----PKEE--IKEQFFGASRNLDI---L 104
Query: 164 TTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGA----------TYSTSELAVLKNNEIHT 213
T+ + D C + G F Q G +Y + + NN I +
Sbjct: 105 KTVTMEALDPMPGCDMIVTFENGAFQGVVQEGKGCKVFRNNTESYLDNRFEITANNLI-S 163
Query: 214 WDRGFDDDGNQ-VWGVKAGPYEFKPAPSSS 242
DRG D ++ VWG AG +EFK + S
Sbjct: 164 IDRGRDPVTDEIVWGSLAGAFEFKKIKNFS 193
>gi|428223567|ref|YP_007107664.1| hypothetical protein GEI7407_0107 [Geitlerinema sp. PCC 7407]
gi|427983468|gb|AFY64612.1| protein of unknown function DUF1001 [Geitlerinema sp. PCC 7407]
Length = 205
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 43/186 (23%), Positives = 76/186 (40%), Gaps = 9/186 (4%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
L+G + +QA +P + +R+ P + S LYLEQ+ +P+R R +
Sbjct: 14 LAGAFENGDQAREEPAWYVPLRLWQRPIPLFREDSWTLYLEQSNLLKLDQPYRPRVLRIT 73
Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
P + + Y +R+ + P R L + + D+ TI + R +G
Sbjct: 74 EHPSQPDALLG-RYYQLRDAAAVRGGGQDPERLRSLCADDLVDLPGCGLTIAIARSPQGP 132
Query: 176 RCLYEGSTPPGGFPNSWQNGATYSTS---ELAVLKNNEIH--TWDRGFD-DDGNQVWGVK 229
++ S PP G G+ S E+ + ++D+G D + G +WG
Sbjct: 133 --YFQASLPPEGRCCFTVEGSLRQVSLGFEVGTTAEGSVEFLSYDKGIDPETGRAIWGAL 190
Query: 230 AGPYEF 235
GP+ F
Sbjct: 191 MGPFRF 196
>gi|390440361|ref|ZP_10228696.1| conserved hypothetical protein [Microcystis sp. T1-4]
gi|389836219|emb|CCI32822.1| conserved hypothetical protein [Microcystis sp. T1-4]
Length = 194
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 90/189 (47%), Gaps = 15/189 (7%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFR 108
+ +A ++ + ++EQA +P F +IR+ P E+++G+ L +LEQA+ P+R
Sbjct: 8 KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLEGTSL-FLEQAYDFMLNTPYR 66
Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHL 168
R + + +E+ ++ +V+E NF R+ L + + D+ E + +
Sbjct: 67 LRVFKLSVVDDH----IELENF---KVKEEANFFGASREPHRL-KNLTLDLLEPMPGCDM 118
Query: 169 RRCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VW 226
G ++G PG + ++ TY + + + I DRG D + +Q VW
Sbjct: 119 NVTWTGNS--FKGVVKPGKQCLVFRKDKMTYLDNSFEISERGLISV-DRGLDPETDQLVW 175
Query: 227 GVKAGPYEF 235
G AGP+EF
Sbjct: 176 GSIAGPFEF 184
>gi|81299611|ref|YP_399819.1| hypothetical protein Synpcc7942_0800 [Synechococcus elongatus PCC
7942]
gi|81168492|gb|ABB56832.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 197
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 49/196 (25%), Positives = 84/196 (42%), Gaps = 33/196 (16%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFRQR 110
+A L+ + ++ QA +P F +IR+ P A E++DG L Y+EQA+ +P+R R
Sbjct: 10 LARWLAADFSNQAQAFENPPFFAHIRVCMRPLALEVLDGLSL-YVEQAYDLDVNQPYRAR 68
Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR 170
+ P +E+ ++ + + E+ R RD+ L + R
Sbjct: 69 VLKLVPAGDH----IEIENFRVIDGEDLFG-ASRDRDR--------------LAKLTRDR 109
Query: 171 CDRGKRCLYEGSTPPGGFPNSWQNG----------ATYSTSELAVLKNNEIHTWDRG-FD 219
++ C + GF + G ATY SE + + ++ + DRG
Sbjct: 110 LEKLPGCTFITHWTGNGFKGVVEPGKGCIVERKGQATYLESEFEI-DDEKMISLDRGRHP 168
Query: 220 DDGNQVWGVKAGPYEF 235
+ QVWG AGP+ F
Sbjct: 169 ETDEQVWGSVAGPFHF 184
>gi|37520762|ref|NP_924139.1| hypothetical protein gvip163 [Gloeobacter violaceus PCC 7421]
gi|81710603|sp|Q7NLD3.1|CPXT3_GLOVI RecName: Full=Chromophore lyase CpcT/CpeT 3
gi|35211757|dbj|BAC89134.1| cpeT [Gloeobacter violaceus PCC 7421]
Length = 203
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 42/187 (22%), Positives = 82/187 (43%), Gaps = 13/187 (6%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A ++G+ +++QA P F +IR+ P G+ Y EQ + P+RQ
Sbjct: 12 LARWMAGDFSNQKQALAQPQTFAHIRVFFRPLPFAFFGTVGFYSEQTYDYDLWSPYRQGL 71
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
+ + + + + + +Y +++ Y P +P + + + R
Sbjct: 72 HRLL----DREGGIYIENYGLQDAGLYAGAGHDPEILATIPTDCL-QPRRGCAMVFRREG 126
Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGN-QVWGVK 229
D + GS PG ++G TY SE+ + ++ + + DRG D + + Q+WG +
Sbjct: 127 D-----CFRGSVEPGNHCLIPRDGYWTYLVSEVELTESTWV-SLDRGMDRETHRQIWGSQ 180
Query: 230 AGPYEFK 236
AGP F+
Sbjct: 181 AGPLHFE 187
>gi|425450956|ref|ZP_18830779.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 7941]
gi|440755306|ref|ZP_20934508.1| phycocyanobilin lyase CpcT [Microcystis aeruginosa TAIHU98]
gi|389767993|emb|CCI06782.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 7941]
gi|440175512|gb|ELP54881.1| phycocyanobilin lyase CpcT [Microcystis aeruginosa TAIHU98]
Length = 194
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 43/193 (22%), Positives = 83/193 (43%), Gaps = 23/193 (11%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+ +A ++ + ++EQA +P F +IR+ P + + L+LEQA+ P+R
Sbjct: 8 KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLKGTSLFLEQAYDFMLNTPYRL 67
Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG-----DIGEHLT 164
R + + +E+ ++ ++E + P+ + L E++ D+ T
Sbjct: 68 RVFKLSVVDDH----IELENFKVKEEANFFGASREPQRLKNLSLELLEPMPGCDMNVTWT 123
Query: 165 TIHLRRCDR-GKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGN 223
+ + GK+CL NS++ + + + DRG D + +
Sbjct: 124 GNSFKGVVKPGKQCLVFRKDKMTYLDNSFE------------ISEQGLISVDRGLDPETD 171
Query: 224 Q-VWGVKAGPYEF 235
Q VWG AGP+EF
Sbjct: 172 QLVWGSIAGPFEF 184
>gi|119513456|ref|ZP_01632483.1| hypothetical protein N9414_11904 [Nodularia spumigena CCY9414]
gi|119461888|gb|EAW42898.1| hypothetical protein N9414_11904 [Nodularia spumigena CCY9414]
Length = 196
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 79/176 (44%), Gaps = 13/176 (7%)
Query: 62 SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
++ QA +P F +IR+ P V L++EQA+ P+R R +
Sbjct: 16 NQAQAFENPPFFAHIRVCMRPLPVEVLSGISLFVEQAYDYMLNDPYRVRVLQLVNAGNR- 74
Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
+E+ +Y +++ E NF R+ L + + GD E L ++ G ++G
Sbjct: 75 ---IEIVNYTVKQEE---NFYGASRNLERL-KTLTGDRLEKLPGCNMTVEWAGNS--FKG 125
Query: 182 STPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYEF 235
PG ++ G TY SE + + I + DRG D + N+ +WG AGP+ F
Sbjct: 126 KVEPGKGCIVFRKGQKTYLDSEFEISEERFI-SLDRGRDLETNEHIWGSVAGPFHF 180
>gi|307154867|ref|YP_003890251.1| hypothetical protein Cyan7822_5091 [Cyanothece sp. PCC 7822]
gi|306985095|gb|ADN16976.1| protein of unknown function DUF1001 [Cyanothece sp. PCC 7822]
Length = 195
Score = 42.7 bits (99), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 52/204 (25%), Positives = 88/204 (43%), Gaps = 34/204 (16%)
Query: 41 KSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFW 100
K+ RW ++ + ++EQA +P + +IR+ P + G+ L+LEQA+
Sbjct: 8 KTLVRW---------MAADFSNQEQAWANPPLYAHIRVCMRPLPYELLGATSLFLEQAYD 58
Query: 101 RTPQKPFRQR---FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
P+R R F +V +E+ +Y ++E E+Y RD L +
Sbjct: 59 FMLNVPYRLRVLKFTVVDDV-------IELENYKVKEQEQYFG---AARDLERLKNLTVD 108
Query: 158 DIGEHLTTIHLRRCDRGKRCL---YEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHT 213
D+ + L CD ++G+ PG +N TY + V + I +
Sbjct: 109 DLEK------LPGCDMTVEWTGNSFKGTIKPGKQCIVVRNNKETYLDNSFEVDEQTLI-S 161
Query: 214 WDRGFDDDGNQ-VWGVKAGPYEFK 236
DRG+D ++ VWG AG + FK
Sbjct: 162 LDRGYDPQTDELVWGSVAGAFHFK 185
>gi|172036295|ref|YP_001802796.1| hypothetical protein cce_1380 [Cyanothece sp. ATCC 51142]
gi|354553088|ref|ZP_08972395.1| protein of unknown function DUF1001 [Cyanothece sp. ATCC 51472]
gi|171697749|gb|ACB50730.1| DUF1001-containing protein [Cyanothece sp. ATCC 51142]
gi|353554918|gb|EHC24307.1| protein of unknown function DUF1001 [Cyanothece sp. ATCC 51472]
Length = 196
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 44/183 (24%), Positives = 78/183 (42%), Gaps = 17/183 (9%)
Query: 62 SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
++EQA +P + +IR+ P L+LEQA+ +P+R R + +
Sbjct: 20 NQEQAFENPPFYAHIRVCIRPLPLSNFPEPSLFLEQAYDYALNQPYRIRVLKLNIVEER- 78
Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRG---KRCL 178
+E+ +Y +++ E F R+ L + DI ++ CD
Sbjct: 79 ---IELENYKLKDKE---TFLGASREPEKLKKLTPNDIE------LMQGCDMFVDWTGTS 126
Query: 179 YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYEFKP 237
++G PG +NG + ++++ + DRG+D N+ VWG AG + FKP
Sbjct: 127 FKGMVKPGRNCRIVRNGKETYLDNSFEINDHQLISLDRGYDPITNELVWGSVAGAFHFKP 186
Query: 238 APS 240
S
Sbjct: 187 RQS 189
>gi|428213027|ref|YP_007086171.1| hypothetical protein Oscil6304_2635 [Oscillatoria acuminata PCC
6304]
gi|428001408|gb|AFY82251.1| CpeT/CpcT family (DUF1001) [Oscillatoria acuminata PCC 6304]
Length = 211
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 79/192 (41%), Gaps = 13/192 (6%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLT--CPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+ L+GE ++EQA DP + ++ + P + S L+ EQA +P+R
Sbjct: 10 LGQYLAGEFDNREQAIADPAWYVHLHLWQRPVPVQLFSEPSLTLFAEQANILHLDRPYRP 69
Query: 110 RFYMVKPCPKELKCD----VEVSSYAIREVEEYKNFCDRPRDQRPL-PEEVIGDIGEHLT 164
R ++ + L D +EV + +++ +Y+ +P + L P +V G LT
Sbjct: 70 RIMQLR---QLLNGDSPPQLEVQYFMLKDPNQYRGAGRQPHLLQQLNPAQVELLPGCTLT 126
Query: 165 TIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGN 223
R D + S P G G T E+ ++D+G D + G
Sbjct: 127 VSQKRLTDGTYH--FAASLPSGARCCFLYQGETRQVDLGFEASPTELFSFDKGIDPNTGK 184
Query: 224 QVWGVKAGPYEF 235
+WG GP+ F
Sbjct: 185 ALWGAILGPFHF 196
>gi|148238814|ref|YP_001224201.1| hypothetical protein SynWH7803_0478 [Synechococcus sp. WH 7803]
gi|147847353|emb|CAK22904.1| Conserved hypothetical protein in phycobilisome rod gene region
[Synechococcus sp. WH 7803]
Length = 197
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 46/188 (24%), Positives = 76/188 (40%), Gaps = 12/188 (6%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
L GE +++QA +P + +I + P ++ GS LL EQ + P P+R R +
Sbjct: 13 LCGEYSNQQQAFDNPPLYAHIFLRYRPLPQLTPGSILL--EQTYAVDPNTPYRLRMIRAE 70
Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
P +++ ++ RE + P L + +L H + ++
Sbjct: 71 EQPSG---AIKLWNHTFREPSRFAGATFNPE----LRHSIQATDLINLEQCHYQVLEQED 123
Query: 176 RCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYE 234
Y G+ PG +NG L+ + T DRG D N+ WG AGP+
Sbjct: 124 G--YHGAMEPGCQCIVRRNGKDTVLVSSFHLQGESLQTLDRGHDPITNERCWGSVAGPFR 181
Query: 235 FKPAPSSS 242
F+ S S
Sbjct: 182 FQRTQSWS 189
>gi|257059281|ref|YP_003137169.1| hypothetical protein Cyan8802_1417 [Cyanothece sp. PCC 8802]
gi|256589447|gb|ACV00334.1| protein of unknown function DUF1001 [Cyanothece sp. PCC 8802]
Length = 196
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 85/203 (41%), Gaps = 23/203 (11%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A +S + ++EQA +P F +IR+ P V L+LEQA+ +P+R R
Sbjct: 10 LARWMSADFSNQEQAYENPPFFAHIRVCIRPLPLSVFSEPTLFLEQAYDFMLNQPYRLRV 69
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR- 170
+ + +E+ +Y +V+E +F RD L LT HL +
Sbjct: 70 FRFNI----IDGRIELENY---KVQEEASFFGASRDLEKL---------RKLTPDHLEKM 113
Query: 171 --CDRGKRCL---YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ- 224
CD ++G PG +N + +++ + DRG D ++
Sbjct: 114 PGCDMFVDWTGNSFKGVVKPGKNCIVVRNERETYLDNSFEIDEHKLISVDRGLDPKTDEL 173
Query: 225 VWGVKAGPYEFKPAPSSSYSDMF 247
VWG AGP+ F+ S ++ F
Sbjct: 174 VWGSVAGPFHFQRKTSFAHEVEF 196
>gi|425453939|ref|ZP_18833688.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9807]
gi|389799891|emb|CCI20607.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9807]
Length = 194
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 46/188 (24%), Positives = 87/188 (46%), Gaps = 13/188 (6%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+ +A ++ + ++EQA +P F +IR+ P + + L+LEQA+ P+R
Sbjct: 8 KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLKGTSLFLEQAYDFMLNTPYRL 67
Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
R + + +E+ ++ +V+E NF R+ + L + + D+ E + +
Sbjct: 68 RVFKLSVVDDH----IELENF---KVKEEANFFGASREPQRL-KNLTLDLLEPMPGCDMN 119
Query: 170 RCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
G ++G PG + ++ TY + + + I DRG D + +Q VWG
Sbjct: 120 VTWTGNS--FKGVVKPGKQCLIFRKDRMTYLDNSFEISERGLISV-DRGLDPETDQLVWG 176
Query: 228 VKAGPYEF 235
AGP+EF
Sbjct: 177 SIAGPFEF 184
>gi|425458822|ref|ZP_18838308.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9808]
gi|389823850|emb|CCI27632.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9808]
Length = 194
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 46/188 (24%), Positives = 87/188 (46%), Gaps = 13/188 (6%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+ +A ++ + ++EQA +P F +IR+ P + + L+LEQA+ P+R
Sbjct: 8 KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLKGTSLFLEQAYDFMLNTPYRL 67
Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
R + + +E+ ++ +V+E NF R+ + L + + D+ E + +
Sbjct: 68 RVFKLSVVDDH----IELENF---KVKEEANFFGASREPQRL-KNLSLDLLEPMPGCDMN 119
Query: 170 RCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
G ++G PG + ++ TY + + + I DRG D + +Q VWG
Sbjct: 120 VTWTGNS--FKGVVKPGKQCLVFRKDKMTYLDNSFEISEQGLISV-DRGLDPETDQLVWG 176
Query: 228 VKAGPYEF 235
AGP+EF
Sbjct: 177 SIAGPFEF 184
>gi|409994233|ref|ZP_11277350.1| hypothetical protein APPUASWS_23988 [Arthrospira platensis str.
Paraca]
gi|291566213|dbj|BAI88485.1| putative phycocyanobilin lyase [Arthrospira platensis NIES-39]
gi|409934895|gb|EKN76442.1| hypothetical protein APPUASWS_23988 [Arthrospira platensis str.
Paraca]
Length = 198
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 80/183 (43%), Gaps = 15/183 (8%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFRQRFYMV 114
L+ + ++ QA +P F +IR+ P E+++G L YLEQA+ ++P+R R +
Sbjct: 14 LASDFSNQPQAFENPPFFAHIRVCMRPLPWELLNGLSL-YLEQAYDINLKQPYRVRVLKL 72
Query: 115 KPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRG 174
P +E+ +Y I EE+ + L E + H G
Sbjct: 73 IPVDDH----IEIENYEIDGQEEFYGASREVDRLKALKREQLNKRCGCTFITHW----TG 124
Query: 175 KRCLYEGSTPPG-GFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGP 232
K ++G PG G + TY S + + + DRG D + ++ +WG AGP
Sbjct: 125 KG--FKGVVEPGKGCMVERKGKLTYLDSSFEI-DGDRFISHDRGMDPETDEHIWGAVAGP 181
Query: 233 YEF 235
+EF
Sbjct: 182 FEF 184
>gi|428224114|ref|YP_007108211.1| hypothetical protein GEI7407_0661 [Geitlerinema sp. PCC 7407]
gi|427984015|gb|AFY65159.1| protein of unknown function DUF1001 [Geitlerinema sp. PCC 7407]
Length = 197
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 89/200 (44%), Gaps = 19/200 (9%)
Query: 39 LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
+T S AR +A S +++QA +P F +IR+ P A +++DG Y+EQ
Sbjct: 1 MTHSTDILALARWMAADFS----NQQQAFDNPPFFAHIRVCMRPLALDLLDGVSF-YVEQ 55
Query: 98 AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
A+ P+R R + +E+ +Y +++ E+Y +P + L E +
Sbjct: 56 AYDYILNSPYRVRVLKLVVNGDR----IEIENYKVKQQEQYYGASRQPHRLQTLSAEGL- 110
Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDR 216
E L ++ G ++G PG + G TY SE + + + + DR
Sbjct: 111 ---EKLPGCNMVVEWTGSS--FKGVVEPGKACMVERKGNLTYLDSEFEI-DDEKFISLDR 164
Query: 217 GFDDDGNQ-VWGVKAGPYEF 235
G D ++ +WG AGP+ F
Sbjct: 165 GRDPQTDEHIWGSVAGPFHF 184
>gi|218440601|ref|YP_002378930.1| hypothetical protein PCC7424_3679 [Cyanothece sp. PCC 7424]
gi|218173329|gb|ACK72062.1| protein of unknown function DUF1001 [Cyanothece sp. PCC 7424]
Length = 195
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 51/196 (26%), Positives = 88/196 (44%), Gaps = 29/196 (14%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+ +A ++ + ++EQA +P + +IR+ P + V G L+LEQA+ P+R
Sbjct: 8 KTLARWMAADFSNQEQAWANPPLYAHIRVCMRPLSHDVLGGTSLFLEQAYDFMLNSPYRL 67
Query: 110 RFYMVKPCPKELKCD--VEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIH 167
R K + D +E+ +Y ++E E+ F RD + L + LT H
Sbjct: 68 RVL------KFIVVDDRIELENYKVKEQEQ---FFGAARDPKRL---------QTLTIDH 109
Query: 168 LRR---CDRGKRCL---YEGSTPPGG-FPNSWQNGATYSTSELAVLKNNEIHTWDRGFDD 220
L + CD ++G PG ++ TY + + ++ I + DRGFD
Sbjct: 110 LEKLPGCDMTVEWTGHSFKGQVKPGKQCIVIRKDKETYLDNSFEIDEHKLI-SIDRGFDP 168
Query: 221 DGNQ-VWGVKAGPYEF 235
++ VWG AG + F
Sbjct: 169 VTDELVWGSIAGAFHF 184
>gi|116072611|ref|ZP_01469877.1| hypothetical protein BL107_08871 [Synechococcus sp. BL107]
gi|116064498|gb|EAU70258.1| hypothetical protein BL107_08871 [Synechococcus sp. BL107]
Length = 192
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 43/183 (23%), Positives = 80/183 (43%), Gaps = 12/183 (6%)
Query: 54 DSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYM 113
D L GE +++QA +P F +I + + GS LL EQ++ P+KP+R R
Sbjct: 10 DLLCGEYSNQKQAIENPPFFAHIFLKYKAINHLQPGSILL--EQSYAVDPKKPYRLRMIR 67
Query: 114 VKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDR 173
+ +++ ++A ++ +++ + + EE + + H+ D
Sbjct: 68 AEETSSG---TIKLWNHAFKDPKQFTEAMGEQTQTKNITEEDLILLDN--CHYHVVEKDD 122
Query: 174 GKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGP 232
G Y G G ++G L+ N++ T DRG+D N+ +WG AG
Sbjct: 123 G----YHGGMEAGCRCIVCRDGKDSYLKSSFHLEGNKLSTLDRGYDLKTNERLWGSIAGE 178
Query: 233 YEF 235
++F
Sbjct: 179 FQF 181
>gi|124023924|ref|YP_001018231.1| CpeT [Prochlorococcus marinus str. MIT 9303]
gi|123964210|gb|ABM78966.1| CpeT [Prochlorococcus marinus str. MIT 9303]
Length = 204
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 82/195 (42%), Gaps = 30/195 (15%)
Query: 53 ADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFY 112
A +L+ +K+QA +DP F I + P V Y EQ++ P +P+RQ +
Sbjct: 10 AKTLAAHYSNKQQAQKDPIKFARINIYFRPLLWDVFQGPGFYSEQSYDYAPWRPYRQGVH 69
Query: 113 MVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHL---TTIHLR 169
+ KE+ + +Y+++E + + P D+ HL T + R
Sbjct: 70 KLL-ASKEVFI---LENYSLQEPKRFAGAGFYP------------DLLNHLCADTLLPRR 113
Query: 170 RC----DRGKRCLYEGSTPPGG---FPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDG 222
C K Y G PG P + N TY SE+ + I + DRGFD
Sbjct: 114 GCAMHFKEVKPNQYRGQVEPGNNCLVPRN--NTMTYLVSEVEFDEKIWI-SRDRGFDPKT 170
Query: 223 N-QVWGVKAGPYEFK 236
+ Q+WG K GP F+
Sbjct: 171 HEQLWGSKHGPLHFQ 185
>gi|425445152|ref|ZP_18825188.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9443]
gi|389734886|emb|CCI01493.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9443]
Length = 194
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 46/188 (24%), Positives = 87/188 (46%), Gaps = 13/188 (6%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+ +A ++ + ++EQA +P F +IR+ P + + L+LEQA+ P+R
Sbjct: 8 KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLKGTSLFLEQAYDFMLNTPYRL 67
Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
R + + +E+ ++ +V+E NF R+ + L + + D+ E + +
Sbjct: 68 RVFKLSVVDDH----IELENF---KVKEEANFFGASREPQRL-KNLSLDLLEPMPGCDMN 119
Query: 170 RCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
G ++G PG + ++ TY + + + I DRG D + +Q VWG
Sbjct: 120 VTWTGNS--FKGVVKPGKQCLIFRKDRMTYLDNSFEISEQGLISV-DRGLDPETDQLVWG 176
Query: 228 VKAGPYEF 235
AGP+EF
Sbjct: 177 SIAGPFEF 184
>gi|428304538|ref|YP_007141363.1| hypothetical protein Cri9333_0938 [Crinalium epipsammum PCC 9333]
gi|428246073|gb|AFZ11853.1| protein of unknown function DUF1001 [Crinalium epipsammum PCC 9333]
Length = 204
Score = 41.6 bits (96), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 39/182 (21%), Positives = 71/182 (39%), Gaps = 5/182 (2%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
L+GE ++ QA +P + ++ + P + S L+ EQA P+R R ++
Sbjct: 14 LAGEFDNRAQALAEPAWYVHLHLWQRPVPLFTEDSITLFAEQANIVNLDHPYRPRILRLR 73
Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
+ V Y + +E K +P + L I + TI+ ++
Sbjct: 74 DSNTP-SGSIHVQYYMFKNLEAIKGAGRKPELLKTLTTSDIELLPSCTLTINPQQITTDT 132
Query: 176 RCLYEGSTPPGGFPNSWQ-NGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVKAGPY 233
++P P + G TY + NE H++D+G + G +WG G Y
Sbjct: 133 WEF--ATSPTSDVPCCFNYQGQTYQVFLGFTARKNESHSYDKGIEPQTGKAIWGALMGSY 190
Query: 234 EF 235
+
Sbjct: 191 RY 192
>gi|126657830|ref|ZP_01728983.1| hypothetical protein CY0110_13236 [Cyanothece sp. CCY0110]
gi|126620770|gb|EAZ91486.1| hypothetical protein CY0110_13236 [Cyanothece sp. CCY0110]
Length = 196
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 41/176 (23%), Positives = 76/176 (43%), Gaps = 11/176 (6%)
Query: 62 SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
++EQA +P + +IR+ P L+LEQA+ +P+R R +K
Sbjct: 20 NQEQAFENPPFYAHIRVCIRPLPVDNFPEPSLFLEQAYDYALNQPYRVRVLQLKIVDNR- 78
Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
+E+ +Y ++E E + +P + E+ D E + + G ++G
Sbjct: 79 ---IELENYKLKEKEAFLGASRQPEKLK----EITPDDIELMQGCDMFVDWTGTS--FKG 129
Query: 182 STPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYEFK 236
+ PG + G + +++ + DRG+D N+ VWG AGP+ F+
Sbjct: 130 TVKPGRNCRIVRKGKETYLDNSFEINEHQLISLDRGYDPVTNELVWGSVAGPFHFE 185
>gi|218246228|ref|YP_002371599.1| hypothetical protein PCC8801_1383 [Cyanothece sp. PCC 8801]
gi|218166706|gb|ACK65443.1| protein of unknown function DUF1001 [Cyanothece sp. PCC 8801]
Length = 196
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 85/203 (41%), Gaps = 23/203 (11%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A +S + ++EQA +P F +IR+ P V L+LEQA+ +P+R R
Sbjct: 10 LARWMSADFSNQEQAYENPPFFAHIRVCIRPLPLSVFPEPTLFLEQAYDFMLNQPYRLRV 69
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR- 170
+ + +E+ +Y +V+E +F RD L LT HL +
Sbjct: 70 FRFNI----IDGRIELENY---KVQEEASFFGASRDLEKL---------RKLTPDHLEKM 113
Query: 171 --CDRGKRCL---YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ- 224
CD ++G PG +N + +++ + DRG D ++
Sbjct: 114 PGCDMFVDWTGNSFKGVVKPGKNCIVVRNERETYLDNSFEIDEHKLISVDRGLDPKTDEL 173
Query: 225 VWGVKAGPYEFKPAPSSSYSDMF 247
VWG AGP+ F+ S ++ F
Sbjct: 174 VWGSVAGPFHFQRKTSFAHEVEF 196
>gi|352096586|ref|ZP_08957413.1| protein of unknown function DUF1001 [Synechococcus sp. WH 8016]
gi|351676236|gb|EHA59390.1| protein of unknown function DUF1001 [Synechococcus sp. WH 8016]
Length = 201
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 80/191 (41%), Gaps = 14/191 (7%)
Query: 53 ADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFY 112
A +L+G + EQ+ +P +F +I + P Y EQ++ P P+RQ +
Sbjct: 11 AKTLAGHYSNFEQSQDNPKDFAHINIYFRPLPWETLKGPGFYSEQSYDHDPWSPYRQGIH 70
Query: 113 MVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLP-EEVIGDIGEHLTTIHLRRC 171
++ +++ V ++ + RP R L E + G +H
Sbjct: 71 RLQ----QIQDIFVVENFGFADALRLAGSGQRPELLRSLKVESLTPRCG---CDMHFHEV 123
Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGN-QVWGVK 229
+ G Y G PG ++G TY SE+ V + + DRGFD + + Q WG +
Sbjct: 124 ETG---CYRGEVEPGHKCLVPRDGQLTYLISEVDVDSTTWV-SRDRGFDPETHAQKWGSE 179
Query: 230 AGPYEFKPAPS 240
GP +FK S
Sbjct: 180 HGPLKFKRVTS 190
>gi|428777464|ref|YP_007169251.1| hypothetical protein PCC7418_2907 [Halothece sp. PCC 7418]
gi|428691743|gb|AFZ45037.1| protein of unknown function DUF1001 [Halothece sp. PCC 7418]
Length = 198
Score = 41.2 bits (95), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 79/183 (43%), Gaps = 15/183 (8%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
++ + ++EQA +P F +IR+ P + L++EQA+ P+R R
Sbjct: 14 MAADFSNQEQAFANPPFFAHIRVCMRPLPVALLSGISLFVEQAYDYMLNNPYRMRVL--- 70
Query: 116 PCPKELKCD--VEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDR 173
K L D +E+ +Y +++ F RD + L +E+ D E L ++
Sbjct: 71 ---KLLVVDDHIEIENYVLKDPSA---FPGASRDPQRL-QELTSDNLERLPGCNMVVNWT 123
Query: 174 GKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGP 232
G ++G PG + G + N+ + DRG D + ++ VWG AG
Sbjct: 124 GNS--FQGEVEPGKACKVVRKGQETYLDSTFEIDANQFTSLDRGRDPETDERVWGSVAGA 181
Query: 233 YEF 235
+EF
Sbjct: 182 FEF 184
>gi|119486368|ref|ZP_01620427.1| hypothetical protein L8106_16979 [Lyngbya sp. PCC 8106]
gi|119456581|gb|EAW37711.1| hypothetical protein L8106_16979 [Lyngbya sp. PCC 8106]
Length = 196
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 51/192 (26%), Positives = 80/192 (41%), Gaps = 23/192 (11%)
Query: 49 ARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFR 108
AR +A S + QA +P F +IR+ P G LYLEQA+ +P+R
Sbjct: 11 ARWMASDFSNQP----QAFENPPFFAHIRVCMRPLPTEFLGGVSLYLEQAYDYMINQPYR 66
Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHL 168
R P + +E+ +YA+ E++ +P + L E + L
Sbjct: 67 VRVLNFIP----RQDHIEIENYALENQEQFYGASRQPERLQNLKIEQLRK---------L 113
Query: 169 RRCDRGKRCL---YEGSTPPG-GFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ 224
C R ++G PG G + TY S V ++ + + DRG + ++
Sbjct: 114 PCCSFITRWTGHSFKGEVEPGKGCMVERKGQVTYLDSWFEVDEHRMV-SHDRGRHPETDE 172
Query: 225 -VWGVKAGPYEF 235
VWG AGP+EF
Sbjct: 173 HVWGSVAGPFEF 184
>gi|443317265|ref|ZP_21046681.1| CpeT/CpcT family (DUF1001) [Leptolyngbya sp. PCC 6406]
gi|442783150|gb|ELR93074.1| CpeT/CpcT family (DUF1001) [Leptolyngbya sp. PCC 6406]
Length = 208
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 39/197 (19%), Positives = 84/197 (42%), Gaps = 20/197 (10%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKL-LYLEQAFWRTPQKPFRQR 110
+A L+GE +K QA +P F +++ P ++ + +LEQA Q P+RQR
Sbjct: 8 LAQCLAGEFTNKPQALAEPIWFVQLKLWIRPVPQLSSTEHVTFFLEQASMAYDQPPYRQR 67
Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLP-EEVIGDIGEHLTTIH-- 167
+ + ++ + A+++ + ++ P L ++++ G L+ +H
Sbjct: 68 LLQL----ASQQGNLTATYSALKDPKSWQGAAQNPDRLGTLTDQDLMALTGSTLSVVHPE 123
Query: 168 -------LRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDD 220
+ R G+RC ++ G + G + ++ + +DRG+D
Sbjct: 124 NSHGETFIARHFPGERCQFQVD----GQAREVELGFDAISGTGTAQSPDQFYMYDRGYDA 179
Query: 221 DGNQ-VWGVKAGPYEFK 236
++ WG GP++ +
Sbjct: 180 TQDKYTWGALHGPFKLQ 196
>gi|434396856|ref|YP_007130860.1| protein of unknown function DUF1001 [Stanieria cyanosphaera PCC
7437]
gi|428267953|gb|AFZ33894.1| protein of unknown function DUF1001 [Stanieria cyanosphaera PCC
7437]
Length = 204
Score = 40.8 bits (94), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 42/194 (21%), Positives = 81/194 (41%), Gaps = 13/194 (6%)
Query: 49 ARIVA--DSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKP 106
A+++A L+GE + QA P + ++R+ P + S L+ EQA +P
Sbjct: 5 AQLIALGQYLAGEFDNHPQALAQPAWYVHLRLWIRPVPLFTEDSITLFAEQASVVNLNQP 64
Query: 107 FRQRFYMVK---PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHL 163
+R R ++ P++L+ V Y +++E K+ P+ + + E I +
Sbjct: 65 YRPRLLRLRENEASPQQLR----VEYYMFKDIESVKDAGTNPQLLQQITLEQIEFLPNCT 120
Query: 164 TTIHLRRCDRGKRCLYEGSTPPGGFPNSWQ-NGATYSTSELAVLKNNEIHTWDRGFD-DD 221
+ + K + ++P P S+ G + + + T+D+G D
Sbjct: 121 LNVEFQHNSANKY--HFKTSPQSDCPCSFNYQGKNFQVFLGFEVTPETLLTYDKGIDPQT 178
Query: 222 GNQVWGVKAGPYEF 235
G +WG G Y+F
Sbjct: 179 GKGIWGALLGAYQF 192
>gi|194477086|ref|YP_002049265.1| hypothetical protein PCC_0629 [Paulinella chromatophora]
gi|317374820|sp|B1X536.1|CPXT_PAUCH RecName: Full=Chromophore lyase CpcT/CpeT
gi|171192093|gb|ACB43055.1| hypothetical protein PCC_0629 [Paulinella chromatophora]
Length = 201
Score = 40.8 bits (94), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 80/186 (43%), Gaps = 14/186 (7%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+ LS +++QA +P + +I + P ++ GS L +EQ++ P KP+R R
Sbjct: 8 LVHQLSARFNNQQQAFDNPPLYAHIVVNCRPLVHLLPGS--LLIEQSYAMDPLKPYRIRV 65
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
+ ++L + SY++ + ++Y P + E+ + I E I
Sbjct: 66 LRAQTRDEKLI----IFSYSLSDEQKYWGSVYEPERMLKIEEKDLQAI-EGCNYI----- 115
Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDD-DGNQVWGVK 229
R K + G PG + G TY S+ + E+ T DRG + Q+WG
Sbjct: 116 VRKKNSNFIGEVEPGCRCLVDRKGVTTYIVSKFELTNKGEMRTLDRGHNPVTHEQLWGSL 175
Query: 230 AGPYEF 235
G +EF
Sbjct: 176 GGVFEF 181
>gi|209523368|ref|ZP_03271923.1| protein of unknown function DUF1001 [Arthrospira maxima CS-328]
gi|376006618|ref|ZP_09783855.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423064550|ref|ZP_17053340.1| hypothetical protein SPLC1_S203620 [Arthrospira platensis C1]
gi|209496110|gb|EDZ96410.1| protein of unknown function DUF1001 [Arthrospira maxima CS-328]
gi|375325005|emb|CCE19608.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406713793|gb|EKD08961.1| hypothetical protein SPLC1_S203620 [Arthrospira platensis C1]
Length = 198
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 82/184 (44%), Gaps = 17/184 (9%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFRQRFYMV 114
L+ + ++ QA +P F +IR+ P E+++G L YLEQA+ ++P+R R +
Sbjct: 14 LAADFSNQAQAFENPPFFAHIRVCMRPLPWELLNGLSL-YLEQAYDINLKQPYRVRVLKL 72
Query: 115 KPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPL-PEEVIGDIGEHLTTIHLRRCDR 173
+E+ +Y I EE+ + L PE++ G T +
Sbjct: 73 IAVDDH----IEIENYEIDGQEEFFGASREVSRLKALKPEQLNKRCG---CTFITHWTGK 125
Query: 174 GKRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAG 231
G ++G PG + G TY S + + + DRG D + ++ +WG AG
Sbjct: 126 G----FKGVVEPGKSCMVERKGKLTYLDSSFEI-DGDRFISHDRGMDPETDEHIWGAVAG 180
Query: 232 PYEF 235
P+EF
Sbjct: 181 PFEF 184
>gi|87124907|ref|ZP_01080754.1| hypothetical protein RS9917_02858 [Synechococcus sp. RS9917]
gi|86167227|gb|EAQ68487.1| hypothetical protein RS9917_02858 [Synechococcus sp. RS9917]
Length = 197
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 74/192 (38%), Gaps = 33/192 (17%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
L G ++ QA +P + +I + P + G +L LEQA+ P +P+R R +
Sbjct: 12 LCGSFTNEAQAFENPPLYAHINVTVRPLPHLEAG--MLLLEQAYALAPAEPYRIRVLEAR 69
Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
L V ++++ + + + P LT I +
Sbjct: 70 QVGDAL----HVRNHSLSDAQRFWGATLNPH---------------QLTAIGSDDLQLLE 110
Query: 176 RCLYEGSTPPGGFPNSWQNG----------ATYSTSELAVLKNNEIHTWDRGFDDDGN-Q 224
C Y S GF + G TY S+ + + T DRG D + + Q
Sbjct: 111 GCTYVVSPKGDGFSGEVEPGCRCMVERKGRTTYLVSQFEI-DAAGMSTIDRGHDPETHEQ 169
Query: 225 VWGVKAGPYEFK 236
VWG AGP++F+
Sbjct: 170 VWGSLAGPFQFQ 181
>gi|37520751|ref|NP_924128.1| hypothetical protein glr1182 [Gloeobacter violaceus PCC 7421]
gi|81710612|sp|Q7NLE2.1|CPXT2_GLOVI RecName: Full=Chromophore lyase CpcT/CpeT 2
gi|35211746|dbj|BAC89123.1| glr1182 [Gloeobacter violaceus PCC 7421]
Length = 202
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 76/184 (41%), Gaps = 14/184 (7%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
L+G+ ++EQA P F +IR+ P V LY EQA P+R +
Sbjct: 13 LAGDFSNREQAWDAPAFFSHIRLCMRPLPWQVFEGYGLYSEQADDYDWAHPYRIVVLNLV 72
Query: 116 PCPKELKCDVEVSSYAIREVEEYKNF---CDRPRDQRPLPEEVIGDIGEHLTTIHLRRCD 172
P + +E +YA+++ Y +R R R PE++ G T +R
Sbjct: 73 EQPDGI---IECRNYALKDTAPYLGAAREAERARLHRLTPEQLEPLPG---CTFLFKR-- 124
Query: 173 RGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVKAG 231
+ L+ G PG ++ G A + + + DRG D + QVWG +G
Sbjct: 125 --EGSLFRGRVRPGKGCRVFRKGRDTYLDGEATVSADYYKSIDRGRDLESDEQVWGSVSG 182
Query: 232 PYEF 235
P+ F
Sbjct: 183 PFHF 186
>gi|434405137|ref|YP_007148022.1| CpeT/CpcT family (DUF1001) [Cylindrospermum stagnale PCC 7417]
gi|428259392|gb|AFZ25342.1| CpeT/CpcT family (DUF1001) [Cylindrospermum stagnale PCC 7417]
Length = 201
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 39/185 (21%), Positives = 77/185 (41%), Gaps = 3/185 (1%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+++ L+GE ++EQA +P + ++R+ P + S L+ EQA +P+RQR
Sbjct: 10 LSEYLAGEFDNREQAIAEPVWYVHLRLWQRPVNLFPEDSLTLFAEQANIVNLDQPYRQRI 69
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
++ ++ + ++V Y ++ K P L + + + + ++ +
Sbjct: 70 MRLRQG-RDSEAPLDVQYYMPKDPTALKGAGSNPALLNALTIDQLDLLPGCILSVTQDKI 128
Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDG-NQVWGVKA 230
R + S PP + G T S + H++D+G D WG
Sbjct: 129 -APNRYKFAASAPPDTCCSFTYLGNTVQVSLGFEVTEATFHSYDKGIDSATLKATWGAII 187
Query: 231 GPYEF 235
GPY +
Sbjct: 188 GPYRY 192
>gi|428202218|ref|YP_007080807.1| hypothetical protein Ple7327_1905 [Pleurocapsa sp. PCC 7327]
gi|427979650|gb|AFY77250.1| CpeT/CpcT family (DUF1001) [Pleurocapsa sp. PCC 7327]
Length = 196
Score = 40.8 bits (94), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 43/187 (22%), Positives = 84/187 (44%), Gaps = 11/187 (5%)
Query: 50 RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+ +A ++ + ++EQA +P F +IR+ P + G L+LEQA+ P+R
Sbjct: 8 KTLARWMAADFSNQEQAWANPPLFAHIRVCMRPLPLALLGETSLFLEQAYDFMLDTPYRL 67
Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
R + + +E+ +Y +++ E +F R+ L +++ D E + +
Sbjct: 68 RVFKLAVVGDR----IELENYKVKDEE---DFFGASRNLERL-QKLTSDRLEKMEGCDMF 119
Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGV 228
G ++G PG +N T + +++ + DRG D + ++ VWG
Sbjct: 120 VDWTGSS--FKGYIKPGKNCIVVRNDKTTYLDNSFEVDEHKLFSIDRGLDPETDELVWGS 177
Query: 229 KAGPYEF 235
AG +EF
Sbjct: 178 IAGAFEF 184
>gi|334116991|ref|ZP_08491083.1| protein of unknown function DUF1001 [Microcoleus vaginatus FGP-2]
gi|333461811|gb|EGK90416.1| protein of unknown function DUF1001 [Microcoleus vaginatus FGP-2]
Length = 206
Score = 40.4 bits (93), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 76/191 (39%), Gaps = 13/191 (6%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLT--CPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+A ++GE ++EQA +P + ++R+ P + S L+ EQA KP+R
Sbjct: 10 LAQYMAGEFDNREQALAEPAWYVHLRLWQRPLPVPLFSEPSIALFAEQANILELDKPYRP 69
Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDI-GEHLTTIHL 168
R ++ + +E Y +++ K P L E I + G L+ +
Sbjct: 70 RILQLRHS-QIAPGLIEAQYYMFKDIAAVKGAGCNPDLLGKLTNEQIQLLPGCTLSVVVQ 128
Query: 169 RRCDRGKRCLYEGSTPPG---GFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQ 224
G R + S P G GF QN E A E ++D+G G
Sbjct: 129 NLASNGYR--FRASLPEGTRCGFAYGGQNYQVDLGFEAA---GEEFLSYDKGISPTTGKA 183
Query: 225 VWGVKAGPYEF 235
+WG GP+ F
Sbjct: 184 IWGALMGPFRF 194
>gi|254421325|ref|ZP_05035043.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
gi|196188814|gb|EDX83778.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
Length = 210
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 42/198 (21%), Positives = 83/198 (41%), Gaps = 12/198 (6%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYL-EQAFWRTPQKPFRQR 110
+A L+GE ++ QA+ +P + ++R+ P + K +L EQ + Q P+RQR
Sbjct: 9 LASYLAGEFENQRQAAAEPAWYVHLRLWQRPIQALSSEQKFTFLLEQQNVISGQPPYRQR 68
Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR 170
+ + +E +A+++ YK P+ + + ++ + + I +
Sbjct: 69 ILQITDSAE--VNQLEGQYFALKDPSRYKGAGQDPKILKGISQKDLVTLPNSGAVICYQA 126
Query: 171 CDRGKRCLYEGSTPP-----GGFPNSWQNGATYSTSELAVLKNNEI--HTWDRGFD-DDG 222
G R Y+ + F Q Y ++ + ++N I +D+G D D G
Sbjct: 127 LSNGGREGYQFESALRDNQLCSFSYGGQTKYVYLGFDV-IPQDNTILLKVYDKGIDPDTG 185
Query: 223 NQVWGVKAGPYEFKPAPS 240
+WG GP++ S
Sbjct: 186 RGLWGALMGPFQLTKQAS 203
>gi|428317171|ref|YP_007115053.1| protein of unknown function DUF1001 [Oscillatoria nigro-viridis PCC
7112]
gi|428240851|gb|AFZ06637.1| protein of unknown function DUF1001 [Oscillatoria nigro-viridis PCC
7112]
Length = 206
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 76/191 (39%), Gaps = 13/191 (6%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLT--CPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
+A ++GE ++EQA +P + ++RM P + S L+ EQA KP+R
Sbjct: 10 LAQYMAGEFDNREQALAEPTWYVHLRMWQRPLPVPLFSEPSIALFAEQANILELDKPYRP 69
Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDI-GEHLTTIHL 168
R ++ + +E Y ++ K P L +E I + G L+
Sbjct: 70 RIVQLRHS-QIAPGLIEAQYYMFHDITAVKGAGCNPDLLGKLTKEQIQLLPGCTLSVAVQ 128
Query: 169 RRCDRGKRCLYEGSTPPG---GFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQ 224
G R + S P G GF QN E A E ++D+G + G
Sbjct: 129 NLASNGYR--FRASLPEGTRCGFAYGGQNYQVDLGFEAA---GEEFLSYDKGINPTTGKA 183
Query: 225 VWGVKAGPYEF 235
+WG GP+ F
Sbjct: 184 IWGALMGPFRF 194
>gi|427417732|ref|ZP_18907915.1| protein of unknown function DUF1001, CpeT/CpcT family [Leptolyngbya
sp. PCC 7375]
gi|425760445|gb|EKV01298.1| protein of unknown function DUF1001, CpeT/CpcT family [Leptolyngbya
sp. PCC 7375]
Length = 196
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 86/193 (44%), Gaps = 33/193 (17%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
++ + ++ QA +P F +IR+ P LYLEQA+ +P+R R +++
Sbjct: 14 MAADFSNQAQAIENPPFFAHIRVCMRPLPSNFLSGNCLYLEQAYDFMLGQPYRVR--VLR 71
Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG---DIGE--HLTTIHLRR 170
P++ +E S+ +DQ E V+G D+ + LT H+ +
Sbjct: 72 FLPQDDHILLENYSF---------------KDQ----EAVVGAGRDLAKLATLTADHIEK 112
Query: 171 ---CDRGKRCL---YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ 224
CD R + G PG + G T + ++++ ++DRG D + ++
Sbjct: 113 MTGCDMTVRWTGHSFIGKVVPGKNCIVVRQGKTSYLDNEFEITDDKLISYDRGRDPETDE 172
Query: 225 -VWGVKAGPYEFK 236
+WG AGP+EF+
Sbjct: 173 LLWGSIAGPFEFE 185
>gi|428206109|ref|YP_007090462.1| hypothetical protein Chro_1061 [Chroococcidiopsis thermalis PCC
7203]
gi|428008030|gb|AFY86593.1| protein of unknown function DUF1001 [Chroococcidiopsis thermalis
PCC 7203]
Length = 197
Score = 40.0 bits (92), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 47/188 (25%), Positives = 85/188 (45%), Gaps = 17/188 (9%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A ++ + ++ QA +P F +IR+ P + Y+EQA+ P+R R
Sbjct: 10 LARWMAADFSNQAQAFENPPFFAHIRVCMRPLPLNLLSGVSFYVEQAYDYMLGNPYRVRV 69
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHL--TTIHLR 169
+ +E+ +Y I++ EE+ RD + L + + D E L + ++
Sbjct: 70 LKLLQAGDR----IEIENYTIKQEEEFHG---ASRDLKRL-QALSSDRLEKLPGCNMIVQ 121
Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
D G ++G+ PG + G TY SE + ++ + DRG D + ++ VWG
Sbjct: 122 WVDNG----FKGTVEPGKCCMVVRKGKTTYLDSEFEI-DGDKFISRDRGRDPETDEHVWG 176
Query: 228 VKAGPYEF 235
AGP+ F
Sbjct: 177 SVAGPFYF 184
>gi|427714031|ref|YP_007062655.1| hypothetical protein Syn6312_3060 [Synechococcus sp. PCC 6312]
gi|427378160|gb|AFY62112.1| CpeT/CpcT family (DUF1001) [Synechococcus sp. PCC 6312]
Length = 192
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 77/188 (40%), Gaps = 21/188 (11%)
Query: 54 DSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYM 113
+ L+GE ++ QA P F ++R+ + + G + ++ EQA KP+RQR +
Sbjct: 16 ECLTGEFQNQAQAQAQPTWFVHLRLWQRAIPQGIQGQRAIFAEQANALYLDKPYRQRILV 75
Query: 114 VKPCPKELK---CDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG-DIGEHLTTIHLR 169
++P P ++ C S +A + + R D PLP V+ + +H L
Sbjct: 76 IQPEPLQIHYWACK-SPSQWAGAGMAPSRLQGLRESDLDPLPGCVLNVEYADHCFQATL- 133
Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDD-DGNQVWGV 228
RC FP Q + K ++DRG D G +WG
Sbjct: 134 --PETSRCC---------FPYQGQERQVVLGFRVTAQK---FWSFDRGVDPVTGQGLWGA 179
Query: 229 KAGPYEFK 236
GPYEF+
Sbjct: 180 LIGPYEFE 187
>gi|443327631|ref|ZP_21056251.1| CpeT/CpcT family (DUF1001) [Xenococcus sp. PCC 7305]
gi|442792723|gb|ELS02190.1| CpeT/CpcT family (DUF1001) [Xenococcus sp. PCC 7305]
Length = 209
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 51/215 (23%), Positives = 87/215 (40%), Gaps = 37/215 (17%)
Query: 38 RLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQ 97
R T+S D A+++A S +K QA + ++ +IR+ P G+ Y EQ
Sbjct: 9 RETESTLLIDLAKVMAGDFSNQK----QAFANSKDYAHIRVFFRPLPWEFFGAIGFYSEQ 64
Query: 98 AFWRTPQKPFRQRFYMVKPCPKELKCD-VEVSSYAIREVEEYKNFCDRPRDQRPLPEEVI 156
+ P+RQ + + K D + + +Y++ Q P+
Sbjct: 65 VYDYDLWTPYRQGIHRLVD-----KGDHIYIENYSL---------------QDPVIYAGA 104
Query: 157 GDIGEHLTTIHLRRCDRGKRC---------LYEGSTPPGGFPNSWQNGA-TYSTSELAVL 206
G + + TI DR C ++ G+ PG + G TY S++ +
Sbjct: 105 GHNKDIIQTISPTSIDRRYNCSMVFKQDGEVFRGAVEPGNKCLIHRKGVETYLVSDVEIT 164
Query: 207 KNNEIHTWDRGFD-DDGNQVWGVKAGPYEFKPAPS 240
+ I + DRG D + G Q+WG GP +F+ S
Sbjct: 165 ETTWI-SLDRGMDLNTGEQIWGSAKGPLKFEKRAS 198
>gi|254412444|ref|ZP_05026218.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196180754|gb|EDX75744.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 196
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 82/177 (46%), Gaps = 15/177 (8%)
Query: 62 SKEQASRDPDNFFNIRMLTCPA-AEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
+++QA +P + +IR+ P A ++DG ++EQA+ P+R R +
Sbjct: 20 NQQQAFDNPPFYAHIRVFMRPLPASLLDGVS-FFVEQAYDYLLNAPYRLRVLKLMVVGDH 78
Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
+E+ +Y ++E E+ F RD L + + D + L ++ GK ++
Sbjct: 79 ----IEIENYTVKEPEK---FYGASRDLNRL-QTLTTDQLDKLPGCNMIVEWTGKS--FQ 128
Query: 181 GSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYEF 235
G PG ++ G TY S + +N I + DRG D ++ +WG AGP+ F
Sbjct: 129 GHVEPGKACKVFRKGKETYLDSTFEIDENQFI-SLDRGRDPVTDEHIWGSIAGPFYF 184
>gi|411118829|ref|ZP_11391209.1| protein of unknown function DUF1001, CpeT/CpcT family
[Oscillatoriales cyanobacterium JSC-12]
gi|410710692|gb|EKQ68199.1| protein of unknown function DUF1001, CpeT/CpcT family
[Oscillatoriales cyanobacterium JSC-12]
Length = 196
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 85/177 (48%), Gaps = 15/177 (8%)
Query: 62 SKEQASRDPDNFFNIRMLTCPA-AEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
++ QA +P F +IR+ P E++ G L +EQA+ P+R R ++K ++
Sbjct: 20 NQAQAFENPPFFAHIRVCMRPLPLELLSGVSFL-VEQAYDYNLNDPYRLR--VLKLITQD 76
Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
+E+ +Y ++ EE F RD + L I + E L +++ G + ++
Sbjct: 77 --GHIEIENYCVKGEEE---FFGASRDLKRLSALRIDQL-EMLPGCNMQVEWTGSQ--FK 128
Query: 181 GSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYEF 235
G PG + G TY SE + + I + DRG D + ++ VWG AGP+EF
Sbjct: 129 GRVQPGKACIVVRKGKTTYLDSEFEIDADKFI-SHDRGRDPETDEHVWGSVAGPFEF 184
>gi|307153848|ref|YP_003889232.1| hypothetical protein Cyan7822_4032 [Cyanothece sp. PCC 7822]
gi|306984076|gb|ADN15957.1| protein of unknown function DUF1001 [Cyanothece sp. PCC 7822]
Length = 206
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 46/194 (23%), Positives = 78/194 (40%), Gaps = 22/194 (11%)
Query: 53 ADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFY 112
A L+G+ + +QA +P NF +IR+ P G Y EQ + P+RQ +
Sbjct: 10 ARCLAGDFSNSQQAWENPQNFAHIRIFFRPLPFDFFGGIGFYSEQTYDYDLWSPYRQGIH 69
Query: 113 MVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCD 172
+ E K + + +Y +++ Y E++ + H RR +
Sbjct: 70 RL----VEQKEQIYIENYGLKDAYLYAG--------SGHNREILATLKPHCIE---RRYN 114
Query: 173 -----RGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGN-QVW 226
R + L+ G+ TY SE+ + +N + + DRG D + + QVW
Sbjct: 115 CSMVFRQQGNLWRGNVEGNQCLIERNGKQTYLVSEVELTENTWV-SLDRGMDINTHEQVW 173
Query: 227 GVKAGPYEFKPAPS 240
G GP F+ S
Sbjct: 174 GSVHGPLRFEKRTS 187
>gi|434387951|ref|YP_007098562.1| CpeT/CpcT family (DUF1001) [Chamaesiphon minutus PCC 6605]
gi|428018941|gb|AFY95035.1| CpeT/CpcT family (DUF1001) [Chamaesiphon minutus PCC 6605]
Length = 203
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 49/188 (26%), Positives = 81/188 (43%), Gaps = 17/188 (9%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPA-AEMVDGSKLLYLEQAFWRTPQKPFRQR 110
+A ++G+ ++ QA +P F +IR+ P E+ DG L +LEQA+ Q P+R R
Sbjct: 10 LARWMAGDFSNQAQAIENPPFFAHIRVCMRPLPYELFDGVSL-FLEQAYDFMLQNPYRLR 68
Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRD-QRPLPEEVIGDIGEHLTTIHLR 169
+ +E+ Y + E+ P Q+ + +I G + T
Sbjct: 69 VLNFVVVNER----IEIEHYTLDPEAEFFGAARNPEQLQQITRDNIIKMPGCTMIT---- 120
Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
++G PG +NG TY + + + I + DRG D ++ VWG
Sbjct: 121 ---EWTGSTFKGYVEPGKGCKVTRNGKDTYLDNSFEISPGHLI-SLDRGRDIATDEHVWG 176
Query: 228 VKAGPYEF 235
AGP+EF
Sbjct: 177 SIAGPFEF 184
>gi|186681237|ref|YP_001864433.1| CpeT protein [Nostoc punctiforme PCC 73102]
gi|186463689|gb|ACC79490.1| CpeT protein [Nostoc punctiforme PCC 73102]
Length = 209
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 42/190 (22%), Positives = 81/190 (42%), Gaps = 19/190 (10%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
+SG+ + +Q+ +P + +I + P + Y EQA+ P+RQ + +
Sbjct: 22 MSGDFSNYKQSFANPQLYAHIHIFFRPLPIEFFSAIGFYSEQAYDHDLWTPYRQGVHKLV 81
Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
C + + +Y++++ Y + + + + D E + G+
Sbjct: 82 DCSDR----IYIENYSLKDQMLYAG----AARELDILKTITPDCIERRYNCSMVFMREGE 133
Query: 176 RCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTW---DRGFDDDGN-QVWGVKA 230
++ GS PG +NG TY S++ + + HTW D+G D + + Q+WG A
Sbjct: 134 --MFRGSVEPGNQCLINRNGCQTYLVSDVEITE----HTWISLDKGMDIETHKQIWGSTA 187
Query: 231 GPYEFKPAPS 240
GP F+ S
Sbjct: 188 GPLRFEKRES 197
>gi|22299699|ref|NP_682946.1| hypothetical protein tlr2156 [Thermosynechococcus elongatus BP-1]
gi|81742471|sp|Q8DH04.1|CPXT_THEEB RecName: Full=Chromophore lyase CpcT/CpeT
gi|22295883|dbj|BAC09708.1| tlr2156 [Thermosynechococcus elongatus BP-1]
Length = 196
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 45/182 (24%), Positives = 78/182 (42%), Gaps = 13/182 (7%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
++ + ++ QA +P + +IR+ P V LY+EQA+ P+R R +
Sbjct: 14 MAADFSNQAQAFENPPFYAHIRVCMRPLPRGVLEGIALYVEQAYDYLLSVPYRTRVLELM 73
Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
P + + +Y ++ + K F RD++ L D+ E L ++ G
Sbjct: 74 PANDH----IVIKNYVLK---DEKRFFGAARDRQRLQAMTADDL-ELLCGCNMLTYWTGH 125
Query: 176 RCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPY 233
+ G PG + G TY S + + I + DRG D + ++ VWG AGP+
Sbjct: 126 S--FRGEVEPGKACKVVRKGRETYLDSTFEIDGDRFI-SHDRGRDPETDEHVWGSVAGPF 182
Query: 234 EF 235
F
Sbjct: 183 HF 184
>gi|427736066|ref|YP_007055610.1| hypothetical protein Riv7116_2555 [Rivularia sp. PCC 7116]
gi|427371107|gb|AFY55063.1| CpeT/CpcT family (DUF1001) [Rivularia sp. PCC 7116]
Length = 197
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 21/69 (30%), Positives = 37/69 (53%)
Query: 48 HARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPF 107
+ ++A+ L+GE +KEQA +P + N+ M + + S L+ EQA ++P+
Sbjct: 6 NLNVLANYLAGEFENKEQAMTEPAWYVNLLMWQRTVSLFQEDSLALFAEQANILNLEQPY 65
Query: 108 RQRFYMVKP 116
RQR + P
Sbjct: 66 RQRLLRITP 74
>gi|160331181|ref|XP_001712298.1| cpeT-like protein [Hemiselmis andersenii]
gi|159765745|gb|ABW97973.1| cpeT-like protein [Hemiselmis andersenii]
Length = 221
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 102/221 (46%), Gaps = 29/221 (13%)
Query: 37 KRLTKSKTRWD-HARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYL 95
+ +K+ T+ + A+ L+G ++EQA++ P ++ +I++ P + + Y
Sbjct: 18 RNFSKNSTKMSCEVELFANYLAGIWTNEEQATKFPTDWSHIQLGFYPLDQSLLNGYSFYT 77
Query: 96 EQAFWRTPQKPFRQRFYMVKPCPKELKCDV-EVSSYAIREVEEYKNFCDRPRDQRPLPEE 154
E A + +P++ ++ E K ++ EV S++I+ E++ P L ++
Sbjct: 78 ESANEFSLDEPYKSGVTLL-----EKKGEIIEVKSFSIKGPEDFWYGTYEPSLLANLTKD 132
Query: 155 -VIGDI-GEHL-------TTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAV 205
++ D G +L + L + GK+C+ P G P TY S AV
Sbjct: 133 RLVNDKDGCNLEFKYDAKKKLFLGKTKSGKQCI----IPREGNP-------TYLDST-AV 180
Query: 206 LKNNEIHTWDRGFD-DDGNQVWGVKAGPYEFKPAPSSSYSD 245
LK NE + D G + ++ +VWG AGP+ F S S S+
Sbjct: 181 LKENEYSSLDIGRNIENDEKVWGPSAGPFVFLKKKSFSISN 221
>gi|427722465|ref|YP_007069742.1| hypothetical protein Lepto7376_0476 [Leptolyngbya sp. PCC 7376]
gi|427354185|gb|AFY36908.1| protein of unknown function DUF1001 [Leptolyngbya sp. PCC 7376]
Length = 208
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 47/195 (24%), Positives = 74/195 (37%), Gaps = 22/195 (11%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A +++G +++Q+ D N+ +IR+ P Y EQ + P+RQ
Sbjct: 21 LARTMAGGFSNQKQSFADSKNYAHIRVFFRPLPWEFFNGIGFYSEQVYDYDLWLPYRQGI 80
Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
+ P E V + +Y++++ Y R + EVI RRC
Sbjct: 81 HRFVPHGDE----VYIENYSLKDALLYAGAGRNLDILRSIKPEVIE-----------RRC 125
Query: 172 D-----RGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQV 225
+ R + ++G + TY S+ A WDRG D D QV
Sbjct: 126 NCSMIFRPENNGFQGRVEGKCCVIEKRGKRTYLVSD-ARFTETSFIGWDRGLDVDTDEQV 184
Query: 226 WGVKAGPYEFKPAPS 240
WG GP EF S
Sbjct: 185 WGSAIGPLEFDKTDS 199
>gi|428304753|ref|YP_007141578.1| hypothetical protein Cri9333_1163 [Crinalium epipsammum PCC 9333]
gi|428246288|gb|AFZ12068.1| protein of unknown function DUF1001 [Crinalium epipsammum PCC 9333]
Length = 196
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 40/193 (20%), Positives = 80/193 (41%), Gaps = 27/193 (13%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A ++ + ++ QA +P F +IR+ P L++EQA+ T P+R R
Sbjct: 10 LARWMAADFSNQAQAFENPPFFAHIRVAMRPLPPEFLSGVSLFVEQAYDYTLNDPYRLR- 68
Query: 112 YMVKPCPKELKCD--VEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
K + D +E+ +Y++ + +++ +P ++ + LT HL
Sbjct: 69 -----VLKLIVVDDHIEIENYSVNQEQQFYGASRKP------------ELLKALTVEHLE 111
Query: 170 RCDRGKRCL------YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGN 223
+ + ++G PG + G T + ++ + DRG D D +
Sbjct: 112 KLPGCNMIVEWTGNSFKGYVQPGKACIVVRKGHTTYLDSTFEIDEHKFISLDRGRDPDTD 171
Query: 224 Q-VWGVKAGPYEF 235
+ +WG GP+ F
Sbjct: 172 EHIWGSFGGPFHF 184
>gi|326783641|ref|YP_004323856.1| antenna protein [Synechococcus phage Syn19]
gi|310005144|gb|ADO99534.1| antenna protein [Synechococcus phage Syn19]
Length = 153
Score = 38.1 bits (87), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 23/31 (74%), Gaps = 1/31 (3%)
Query: 206 LKNNEIHTWDRGFDDDGNQVWGVKAGPYEFK 236
L ++IHT D+G+D DGN +WG K PY+FK
Sbjct: 120 LFGSKIHTCDQGYDSDGNMIWGSK-NPYKFK 149
>gi|428298429|ref|YP_007136735.1| hypothetical protein Cal6303_1725 [Calothrix sp. PCC 6303]
gi|428234973|gb|AFZ00763.1| protein of unknown function DUF1001 [Calothrix sp. PCC 6303]
Length = 197
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 83/189 (43%), Gaps = 17/189 (8%)
Query: 52 VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
+A +S + ++EQA +P F +IR+ P + L+LEQA+ +P+R R
Sbjct: 10 LARWMSADFSNQEQAYENPPFFAHIRVCIRPLPFELFSEVGLFLEQAYDFALNQPYRMRV 69
Query: 112 YMVKPCPKELKCDVEVSSYAIRE--VEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
+K E AI + E +N+ RD L + + ++ + + ++
Sbjct: 70 ---------MKLTTENDHIAITHFTINEEQNYYGASRDLSRLAKLSVDEL-KPMPGCNMI 119
Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
GK ++G PG ++G TY +E + E + DRG D + N+ WG
Sbjct: 120 VQWTGKS--FKGHVEPGKGCRVVRDGKETYLDNEFEI-DEQEFFSLDRGRDLETNEHAWG 176
Query: 228 VKAGPYEFK 236
AG + F+
Sbjct: 177 SIAGAFHFR 185
>gi|427702904|ref|YP_007046126.1| hypothetical protein Cyagr_1634 [Cyanobium gracile PCC 6307]
gi|427346072|gb|AFY28785.1| CpeT/CpcT family (DUF1001) [Cyanobium gracile PCC 6307]
Length = 197
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 50/192 (26%), Positives = 82/192 (42%), Gaps = 33/192 (17%)
Query: 56 LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
LSG ++ QA +P + +I + P ++ GS L LEQ++ +P P+R R +
Sbjct: 12 LSGGFSNQSQAFDNPPLYAHILVKFRPLPQLAPGS--LLLEQSYAISPGTPYRIRVLRAE 69
Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCD---RPR---DQRPLPEE----VIGDIGEHLTT 165
EL + + A+ E + + + R R D LP E V+ ++G+
Sbjct: 70 RRDGELI----IHNQALHEEQRFWGAIEDEGRRRSIGDADLLPLEGCTYVVREVGDGFAG 125
Query: 166 IHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGN-Q 224
+ G RCL E + Y S + + + T DRG D + Q
Sbjct: 126 ----EVEPGCRCLVE-----------RKGSVAYLVSSFEI-DSRGMRTIDRGHDPATHEQ 169
Query: 225 VWGVKAGPYEFK 236
+WG AGP+EF+
Sbjct: 170 LWGSLAGPFEFE 181
>gi|434392355|ref|YP_007127302.1| protein of unknown function DUF1001 [Gloeocapsa sp. PCC 7428]
gi|428264196|gb|AFZ30142.1| protein of unknown function DUF1001 [Gloeocapsa sp. PCC 7428]
Length = 217
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 49/209 (23%), Positives = 82/209 (39%), Gaps = 35/209 (16%)
Query: 38 RLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQ 97
++T S AR +A S + QA +P F +IR+ P Y+EQ
Sbjct: 21 QMTHSTDIATLARWMAADFSNQA----QAFENPPFFAHIRVCMRPLPLETLSGVSFYVEQ 76
Query: 98 AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
A+ P+R R + + + +Y I++ +++ P
Sbjct: 77 AYDYMLNNPYRVRVLKLVNAGDR----ITIENYLIKDEQQFYGASREP------------ 120
Query: 158 DIGEHLTTIHLRRCDRGKRC---------LYEGSTPPG-GFPNSWQNGATYSTSELAVLK 207
+ L I L R ++ C ++GS PG G ++ TY SE + +
Sbjct: 121 ---QRLNAITLERLEKLPGCNMLVEWTGTSFKGSVEPGKGCIVVRKDKKTYLDSEFEIDQ 177
Query: 208 NNEIHTWDRGFDDDGNQ-VWGVKAGPYEF 235
I + DRG D + ++ VWG AGP+ F
Sbjct: 178 ERFI-SLDRGRDPETDEHVWGSVAGPFHF 205
>gi|332706197|ref|ZP_08426266.1| protein of unknown function, DUF1001 [Moorea producens 3L]
gi|332355034|gb|EGJ34505.1| protein of unknown function, DUF1001 [Moorea producens 3L]
Length = 204
Score = 37.7 bits (86), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 50/210 (23%), Positives = 89/210 (42%), Gaps = 38/210 (18%)
Query: 62 SKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
++EQA +P + +IR+ P ++DG L +LEQA+ P+R R + E
Sbjct: 20 NQEQAFENPPLYAHIRVCMRPLPWSVLDGCSL-FLEQAYDYMLNTPYRVRVLKI----VE 74
Query: 121 LKCDVEVSSYAI----------REVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR 170
L +++ +Y + R++E K+ + ++ P + G
Sbjct: 75 LDGQIKIENYKVQDEKLLYGASRDLERLKSLSAQQLEKLPGCTFNVAWTGHSFKA----E 130
Query: 171 CDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVK 229
+ GK CL ++ TY S V++ I + DRG D ++ +WG
Sbjct: 131 VEPGKACLVVRNS-----------KETYLDSSFEVMEEKLI-SLDRGRDPQTDELIWGSI 178
Query: 230 AGPYEFKPAPSSSYSDMFSPLNFPPQQFLE 259
AG + F AP +S++ S + P +F E
Sbjct: 179 AGAFHF--APRTSFA---SEVLIPTPEFTE 203
>gi|67922199|ref|ZP_00515713.1| Protein of unknown function DUF1001 [Crocosphaera watsonii WH 8501]
gi|416389332|ref|ZP_11685331.1| hypothetical protein CWATWH0003_2153 [Crocosphaera watsonii WH
0003]
gi|67855902|gb|EAM51147.1| Protein of unknown function DUF1001 [Crocosphaera watsonii WH 8501]
gi|357264241|gb|EHJ13154.1| hypothetical protein CWATWH0003_2153 [Crocosphaera watsonii WH
0003]
Length = 196
Score = 37.7 bits (86), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 42/179 (23%), Positives = 76/179 (42%), Gaps = 17/179 (9%)
Query: 62 SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
++EQA +P + +IR+ P L+LEQA+ +P+R R +K +
Sbjct: 20 NQEQAFENPPFYAHIRVCIRPLPITNFPEPSLFLEQAYDYALDQPYRVRVLRLKIVEGRM 79
Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCL--- 178
E+ +Y ++E E F RD+ L + D+ ++ CD
Sbjct: 80 ----ELENYKLKEKE---GFLGAARDREKLKKITPDDLE------LMQGCDMFIDWTGNS 126
Query: 179 YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYEFK 236
++G PG ++ G + + ++ + DRG+D ++ VWG AG + FK
Sbjct: 127 FKGVVKPGKNCLVFRKGKETYLDNSFEIDDQKLISLDRGYDPVTDELVWGSVAGAFNFK 185
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.136 0.429
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,913,092,488
Number of Sequences: 23463169
Number of extensions: 226888879
Number of successful extensions: 400820
Number of sequences better than 100.0: 190
Number of HSP's better than 100.0 without gapping: 55
Number of HSP's successfully gapped in prelim test: 135
Number of HSP's that attempted gapping in prelim test: 400580
Number of HSP's gapped (non-prelim): 203
length of query: 270
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 130
effective length of database: 9,074,351,707
effective search space: 1179665721910
effective search space used: 1179665721910
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 75 (33.5 bits)