BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 047194
         (270 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255540025|ref|XP_002511077.1| conserved hypothetical protein [Ricinus communis]
 gi|223550192|gb|EEF51679.1| conserved hypothetical protein [Ricinus communis]
          Length = 265

 Score =  467 bits (1202), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 224/270 (82%), Positives = 243/270 (90%), Gaps = 5/270 (1%)

Query: 1   MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
           M   S+S SN   GW  RA+G+ +K  VLIGGAVL+KRLTKS TRWDHAR V+ SLSGEK
Sbjct: 1   MVTGSESESN---GWS-RARGIVIKTLVLIGGAVLVKRLTKSTTRWDHARFVSQSLSGEK 56

Query: 61  FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
           FS+EQA+RDPDN+FNIRMLTCPAAEMVDGSK+LY EQAFWRTPQKPFRQRFYMVKPCPKE
Sbjct: 57  FSREQAARDPDNYFNIRMLTCPAAEMVDGSKVLYFEQAFWRTPQKPFRQRFYMVKPCPKE 116

Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
           LKCDVEVSSYAIR+ EEYKNFCDRP+DQRPLPEEVIGDI EHLTTI+L RCDRGKRCLYE
Sbjct: 117 LKCDVEVSSYAIRDAEEYKNFCDRPKDQRPLPEEVIGDIAEHLTTIYLNRCDRGKRCLYE 176

Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
           GSTPPGGFPNSW NGAT+ TSELA+LKNNE+HTWDRG+DDDGNQVWGVK GPYEFKPAP 
Sbjct: 177 GSTPPGGFPNSW-NGATFCTSELAILKNNELHTWDRGYDDDGNQVWGVKEGPYEFKPAPG 235

Query: 241 SSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           SS +DMFSPLNFPP   +EKRIEGSFVLQE
Sbjct: 236 SSVNDMFSPLNFPPPLSMEKRIEGSFVLQE 265


>gi|224122242|ref|XP_002318786.1| predicted protein [Populus trichocarpa]
 gi|222859459|gb|EEE97006.1| predicted protein [Populus trichocarpa]
          Length = 273

 Score =  466 bits (1198), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 222/268 (82%), Positives = 241/268 (89%), Gaps = 3/268 (1%)

Query: 2   GPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKF 61
           G  SD  S+ NG WG RA+GL +K+ VLIGG +L+KRLTKS TRWDHA+IV  SL+GEKF
Sbjct: 8   GSGSDPTSDSNG-WG-RARGLALKSLVLIGGVLLVKRLTKSTTRWDHAKIVTQSLTGEKF 65

Query: 62  SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
           SKEQASRDPDN+FNIRMLTCPAAEMVDGSK+LY EQAFWRTPQKPFRQRFYMVKPCPKEL
Sbjct: 66  SKEQASRDPDNYFNIRMLTCPAAEMVDGSKVLYFEQAFWRTPQKPFRQRFYMVKPCPKEL 125

Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
           KCDVEV SYAIR+ EEYKNFCDR +DQRPLPEEVIGDI EHLTTIHL+RCDRGKRCLYEG
Sbjct: 126 KCDVEVGSYAIRDAEEYKNFCDRSKDQRPLPEEVIGDIAEHLTTIHLKRCDRGKRCLYEG 185

Query: 182 STPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSS 241
           S PPGGFPNSW NGATY TSELA+LKNNEIHTWDRG+DD GNQVWGVK GPYEFKPAP+S
Sbjct: 186 SNPPGGFPNSW-NGATYCTSELAILKNNEIHTWDRGYDDGGNQVWGVKEGPYEFKPAPAS 244

Query: 242 SYSDMFSPLNFPPQQFLEKRIEGSFVLQ 269
           S S++FSPLN PP Q +EKRIEGSFVLQ
Sbjct: 245 SVSELFSPLNLPPLQSMEKRIEGSFVLQ 272


>gi|317106609|dbj|BAJ53116.1| JHL07K02.6 [Jatropha curcas]
          Length = 265

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 222/270 (82%), Positives = 240/270 (88%), Gaps = 5/270 (1%)

Query: 1   MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
           MG  SDS SN   GW  RA+GL VK  VLIGGA+L+KRLTKS TRWDH RIV  SLSGEK
Sbjct: 1   MGSGSDSESN---GWS-RARGLVVKTLVLIGGAILVKRLTKSTTRWDHTRIVTQSLSGEK 56

Query: 61  FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
           FS+EQASRDPDN+FNIRMLTCPAA+MVDGSK+LY E+AF RTPQKPFRQRFYMVKPCPKE
Sbjct: 57  FSREQASRDPDNYFNIRMLTCPAADMVDGSKVLYFERAFCRTPQKPFRQRFYMVKPCPKE 116

Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
           LKCDVEVSSYAIR+ EEYKNFCDRP+DQRPLPEEVIGDI EHLTTI+L+RCDRGKRCLYE
Sbjct: 117 LKCDVEVSSYAIRDAEEYKNFCDRPKDQRPLPEEVIGDIAEHLTTIYLKRCDRGKRCLYE 176

Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
           GSTPPGGFPNSW NGAT+ TSELA+LKNNEIHTWDRG+DDDGNQVWGVK GPYEFKPAP+
Sbjct: 177 GSTPPGGFPNSW-NGATHCTSELAILKNNEIHTWDRGYDDDGNQVWGVKDGPYEFKPAPA 235

Query: 241 SSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           SS + M   LNFPP   +EKRIEGSFVLQE
Sbjct: 236 SSVNGMLPALNFPPPHSMEKRIEGSFVLQE 265


>gi|356516466|ref|XP_003526915.1| PREDICTED: chromophore lyase CpcT/CpeT 3-like [Glycine max]
          Length = 268

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 215/270 (79%), Positives = 243/270 (90%), Gaps = 4/270 (1%)

Query: 1   MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
           MG DSDS +N   GW  RA+GL +K  +L GGA+L+KRL KS TRWDHA  V+ SL+GEK
Sbjct: 1   MGTDSDSDAN---GWN-RARGLALKTLLLFGGALLVKRLRKSTTRWDHAHFVSKSLAGEK 56

Query: 61  FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
           +SKEQASRDPDN+FNIRMLTCPAAE+VDGSK+LY EQAFWRTPQKPFRQR +MVKPCPKE
Sbjct: 57  YSKEQASRDPDNYFNIRMLTCPAAELVDGSKVLYFEQAFWRTPQKPFRQRLFMVKPCPKE 116

Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
           LKCDVE+S+YAIR++EEYKNFCDRPRDQRP PEEVIGDI EHLTT+HL+RC RGKRCLY+
Sbjct: 117 LKCDVELSTYAIRDMEEYKNFCDRPRDQRPQPEEVIGDIAEHLTTVHLKRCPRGKRCLYK 176

Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
           GSTPPGGFPNSWQNGAT+ TSELA+LKNN+IHTWDRG+DDDGNQVWG K GPYEFKPAP+
Sbjct: 177 GSTPPGGFPNSWQNGATHCTSELAILKNNDIHTWDRGYDDDGNQVWGQKEGPYEFKPAPT 236

Query: 241 SSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           SS+SDMFSPLNFPP   +E+RIEGSFVLQE
Sbjct: 237 SSFSDMFSPLNFPPPPSMERRIEGSFVLQE 266


>gi|225456513|ref|XP_002284706.1| PREDICTED: chromophore lyase CpcT/CpeT 3 [Vitis vinifera]
 gi|147860848|emb|CAN83158.1| hypothetical protein VITISV_022553 [Vitis vinifera]
 gi|297734523|emb|CBI15770.3| unnamed protein product [Vitis vinifera]
          Length = 267

 Score =  456 bits (1172), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 219/270 (81%), Positives = 242/270 (89%), Gaps = 3/270 (1%)

Query: 1   MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
           MG  SDS S  NG WG RA+G+ VKAAVLIGGA+LLKRLTKS TRWDHAR V+ SLSGEK
Sbjct: 1   MGTGSDSESEGNG-WG-RARGILVKAAVLIGGAILLKRLTKSTTRWDHARFVSHSLSGEK 58

Query: 61  FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
           FS EQASRDPDN+FN RM+TCPAAE+VDGS++LY EQAFWRTP KPFRQRFYMVKPCPKE
Sbjct: 59  FSMEQASRDPDNYFNFRMVTCPAAELVDGSRVLYFEQAFWRTPSKPFRQRFYMVKPCPKE 118

Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
           +KCDVE+SSYAIR+VEEYKNFCDR + QRPLPEEVIGDI EHLTTI+L+RC+RGKRCLYE
Sbjct: 119 MKCDVELSSYAIRDVEEYKNFCDRSKAQRPLPEEVIGDIAEHLTTIYLKRCERGKRCLYE 178

Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
           GSTP GGFPNSW +GATY TSELAVLKNNEIH WDRG+DD+GNQVWGVK GPYEFKPAP+
Sbjct: 179 GSTPSGGFPNSW-SGATYCTSELAVLKNNEIHIWDRGYDDEGNQVWGVKEGPYEFKPAPA 237

Query: 241 SSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           SS +DMFSPLNF P   +EKRI+GSFVLQE
Sbjct: 238 SSSNDMFSPLNFAPPLPMEKRIDGSFVLQE 267


>gi|356508878|ref|XP_003523180.1| PREDICTED: chromophore lyase CpcT/CpeT 3-like [Glycine max]
          Length = 266

 Score =  454 bits (1167), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 215/267 (80%), Positives = 241/267 (90%), Gaps = 5/267 (1%)

Query: 4   DSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSK 63
           DSDS SN   GW  RA+GL +K  +LIGGA+L+KRL KS TRWDHA  V++SL+GEK+SK
Sbjct: 5   DSDSESN---GWN-RARGLALKTLLLIGGALLVKRLRKSTTRWDHAHFVSNSLTGEKYSK 60

Query: 64  EQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKC 123
           EQASRDPDN+FNIRMLTCPAAE+VDGSK+LY EQAFWRTPQKPFRQR +MVKPCPKELKC
Sbjct: 61  EQASRDPDNYFNIRMLTCPAAELVDGSKVLYFEQAFWRTPQKPFRQRLFMVKPCPKELKC 120

Query: 124 DVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGST 183
           DVE+S+YAIR++EEYKNFCDRPRDQRP PEEVIGDI EHLTT+HL+RC RGKRCLYEGST
Sbjct: 121 DVELSTYAIRDMEEYKNFCDRPRDQRPQPEEVIGDIAEHLTTVHLKRCPRGKRCLYEGST 180

Query: 184 PPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSY 243
           PPGGFPNSW NGATY TSELA+LKNNEIHTWDRG+DD GNQVWG K GPYEFKPAP+SS+
Sbjct: 181 PPGGFPNSW-NGATYCTSELAILKNNEIHTWDRGYDDGGNQVWGQKEGPYEFKPAPTSSF 239

Query: 244 SDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           +DMFSPLNFPP   +E+RIEGSFVLQE
Sbjct: 240 NDMFSPLNFPPPPSMERRIEGSFVLQE 266


>gi|388500552|gb|AFK38342.1| unknown [Lotus japonicus]
          Length = 265

 Score =  439 bits (1128), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 204/253 (80%), Positives = 232/253 (91%), Gaps = 1/253 (0%)

Query: 18  RAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIR 77
           RA+GL VK  +LIGGA+L+KRL KS TRWDHAR V++SL+GEK+SKEQASRDP+N+FNIR
Sbjct: 14  RARGLAVKTLLLIGGALLVKRLRKSTTRWDHARFVSNSLTGEKYSKEQASRDPENYFNIR 73

Query: 78  MLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEE 137
           MLTCPAAE+VDGSK+LY EQAFWR+PQKPFRQRFY+VKP PKELKCDVE+S+YAIR++EE
Sbjct: 74  MLTCPAAELVDGSKVLYFEQAFWRSPQKPFRQRFYVVKPYPKELKCDVELSTYAIRDMEE 133

Query: 138 YKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGAT 197
           YKNFCDRP+DQRP PEEVIGDI EHLTTIHL+RC RGKRC YEGSTPPGGFPNSW NGAT
Sbjct: 134 YKNFCDRPKDQRPQPEEVIGDIAEHLTTIHLKRCPRGKRCSYEGSTPPGGFPNSW-NGAT 192

Query: 198 YSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQF 257
           Y TSELAVLKNNEIHTWDRGFDDDGNQVWG K GPYEFKPAP+S ++DM SPLNFPP   
Sbjct: 193 YCTSELAVLKNNEIHTWDRGFDDDGNQVWGQKDGPYEFKPAPTSCFNDMLSPLNFPPPPS 252

Query: 258 LEKRIEGSFVLQE 270
           +++RIEGSF+LQ+
Sbjct: 253 MDRRIEGSFILQD 265


>gi|15241343|ref|NP_199915.1| protein crumpled leaf [Arabidopsis thaliana]
 gi|13877867|gb|AAK44011.1|AF370196_1 unknown protein [Arabidopsis thaliana]
 gi|9758249|dbj|BAB08748.1| unnamed protein product [Arabidopsis thaliana]
 gi|21281135|gb|AAM44980.1| unknown protein [Arabidopsis thaliana]
 gi|45504116|dbj|BAD12566.1| CRUMPLED LEAF [Arabidopsis thaliana]
 gi|332008639|gb|AED96022.1| protein crumpled leaf [Arabidopsis thaliana]
          Length = 269

 Score =  427 bits (1099), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 206/269 (76%), Positives = 233/269 (86%), Gaps = 5/269 (1%)

Query: 2   GPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKF 61
           G D +S+SN   GW  RA+GL VK  VLIGGA+L+KRLTKS TR DHAR+V+ SL+GEKF
Sbjct: 6   GSDPESSSN---GWS-RARGLVVKTLVLIGGALLIKRLTKSTTRRDHARVVSRSLTGEKF 61

Query: 62  SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
           ++EQASRDPDN+FNIRML+CPAAEMVDGS++LYLEQAFWRTPQKPFRQR YMVKPCPKEL
Sbjct: 62  TREQASRDPDNYFNIRMLSCPAAEMVDGSEVLYLEQAFWRTPQKPFRQRLYMVKPCPKEL 121

Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
           KCDVEVSSYAIR+ EEYKNFCDRP+DQRPLPEEVIGDIGEHLTTIHL  CDRGKRCLYEG
Sbjct: 122 KCDVEVSSYAIRDAEEYKNFCDRPKDQRPLPEEVIGDIGEHLTTIHLNCCDRGKRCLYEG 181

Query: 182 STPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSS 241
           ST PGGFPNSW NGA+Y TS+LAVLKNNEIH WDRGFD++ NQVWG K GPYEFKPA SS
Sbjct: 182 STSPGGFPNSW-NGASYCTSDLAVLKNNEIHLWDRGFDENRNQVWGPKEGPYEFKPATSS 240

Query: 242 SYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           S ++  S LN   Q  ++K I+GS +LQ+
Sbjct: 241 SINENLSALNILYQSSIDKPIQGSLILQD 269


>gi|297795903|ref|XP_002865836.1| hypothetical protein ARALYDRAFT_918134 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311671|gb|EFH42095.1| hypothetical protein ARALYDRAFT_918134 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 269

 Score =  426 bits (1095), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 205/269 (76%), Positives = 233/269 (86%), Gaps = 5/269 (1%)

Query: 2   GPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKF 61
           G D +S+SN   GW  RA+GL VK  VLIGGA+L+KRLTKS TR DHAR+V+ SL+GEKF
Sbjct: 6   GSDPESSSN---GWS-RARGLVVKTLVLIGGALLIKRLTKSTTRRDHARVVSRSLTGEKF 61

Query: 62  SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
           ++EQASRDP+N+FNIRML+CPAAEMVDGS++LYLEQAFWRTPQKPFRQR YMVKPCPKEL
Sbjct: 62  TREQASRDPENYFNIRMLSCPAAEMVDGSEVLYLEQAFWRTPQKPFRQRLYMVKPCPKEL 121

Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
           KCDVEVSSYAIR+ EEYKNFCDRP+DQRPLPEEVIGDIGEHLTTIHL  CDRGKRCLYEG
Sbjct: 122 KCDVEVSSYAIRDAEEYKNFCDRPKDQRPLPEEVIGDIGEHLTTIHLNCCDRGKRCLYEG 181

Query: 182 STPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSS 241
           ST PGGFPNSW NGA+Y TS+LAVLKNNEIH WDRGFD++ NQVWG K GPYEFKPA SS
Sbjct: 182 STSPGGFPNSW-NGASYCTSDLAVLKNNEIHLWDRGFDENRNQVWGPKEGPYEFKPATSS 240

Query: 242 SYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           S ++  S LN   Q  ++K I+GS +LQ+
Sbjct: 241 SINENLSALNILYQSSIDKPIQGSLILQD 269


>gi|21554618|gb|AAM63635.1| unknown [Arabidopsis thaliana]
          Length = 269

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 205/269 (76%), Positives = 232/269 (86%), Gaps = 5/269 (1%)

Query: 2   GPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKF 61
           G D +S+SN   GW  RA+GL VK  VLIGGA+L+KRLTKS TR DHAR+V+ SL+GEKF
Sbjct: 6   GSDPESSSN---GWS-RARGLVVKTLVLIGGALLIKRLTKSTTRRDHARVVSRSLTGEKF 61

Query: 62  SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
           ++EQASRDPDN+FNIRML+CPAAEMVDGS++LYLEQAFWRTPQKPFRQR YMVKPCPKEL
Sbjct: 62  TREQASRDPDNYFNIRMLSCPAAEMVDGSEVLYLEQAFWRTPQKPFRQRLYMVKPCPKEL 121

Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
           KCDVEVSSYAIR+ EEYK FCDRP+DQRPLPEEVIGDIGEHLTTIHL  CDRGKRCLYEG
Sbjct: 122 KCDVEVSSYAIRDAEEYKXFCDRPKDQRPLPEEVIGDIGEHLTTIHLNCCDRGKRCLYEG 181

Query: 182 STPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSS 241
           ST PGGFPNSW NGA+Y TS+LAVLKNNEIH WDRGFD++ NQVWG K GPYEFKPA SS
Sbjct: 182 STSPGGFPNSW-NGASYCTSDLAVLKNNEIHLWDRGFDENRNQVWGPKEGPYEFKPATSS 240

Query: 242 SYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           S ++  S LN   Q  ++K I+GS +LQ+
Sbjct: 241 SINENLSALNILYQSSIDKPIQGSLILQD 269


>gi|357464915|ref|XP_003602739.1| hypothetical protein MTR_3g098510 [Medicago truncatula]
 gi|355491787|gb|AES72990.1| hypothetical protein MTR_3g098510 [Medicago truncatula]
          Length = 267

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 208/270 (77%), Positives = 238/270 (88%), Gaps = 3/270 (1%)

Query: 1   MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
           M  +S+S S+ NG W  RAQGL +KA +L+GGA+L+KRL KS TRWDH  +V  SL+GEK
Sbjct: 1   MCKESESESDSNG-WN-RAQGLALKALLLLGGALLVKRLRKSTTRWDHTHLVTQSLTGEK 58

Query: 61  FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
           +SK+QASRDPDN+FNIRMLTCPAAE+VDGS +LY EQAFWR+PQKPFRQR  M KPCPKE
Sbjct: 59  YSKDQASRDPDNYFNIRMLTCPAAELVDGSNVLYYEQAFWRSPQKPFRQRLLMTKPCPKE 118

Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
           LKCDVE+S+YAIR++EEYKNFCDRP+DQRP PEEVIGDIGEHLTTIHL+RC RGKRCLYE
Sbjct: 119 LKCDVELSTYAIRDMEEYKNFCDRPKDQRPQPEEVIGDIGEHLTTIHLKRCSRGKRCLYE 178

Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
           GSTPP GFPNSW NGATY TSELAV+KNNEIHTWDRG+DDDGNQVWG K GPYEFKPAP+
Sbjct: 179 GSTPPEGFPNSW-NGATYCTSELAVMKNNEIHTWDRGYDDDGNQVWGQKEGPYEFKPAPT 237

Query: 241 SSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           S ++D FSPLNFPP   +++RIEGSFVLQE
Sbjct: 238 SCFNDTFSPLNFPPPPSMDRRIEGSFVLQE 267


>gi|388508268|gb|AFK42200.1| unknown [Medicago truncatula]
          Length = 267

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 207/270 (76%), Positives = 237/270 (87%), Gaps = 3/270 (1%)

Query: 1   MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
           M  +S+S S+ NG W  RAQGL +KA +L+GGA+L+KRL KS TRWDH  +V  SL+GEK
Sbjct: 1   MCKESESESDSNG-WN-RAQGLALKALLLLGGALLVKRLRKSTTRWDHTHLVTQSLTGEK 58

Query: 61  FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
           +SK+QASRDPDN+FNIRMLTCPAAE+VDGS +LY EQAFWR+PQKPFRQR  M KPCPKE
Sbjct: 59  YSKDQASRDPDNYFNIRMLTCPAAELVDGSNVLYYEQAFWRSPQKPFRQRLLMTKPCPKE 118

Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
           LKCDVE+S+YAI ++EEYKNFCDRP+DQRP PEEVIGDIGEHLTTIHL+RC RGKRCLYE
Sbjct: 119 LKCDVELSTYAIGDMEEYKNFCDRPKDQRPQPEEVIGDIGEHLTTIHLKRCSRGKRCLYE 178

Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
           GSTPP GFPNSW NGATY TSELAV+KNNEIHTWDRG+DDDGNQVWG K GPYEFKPAP+
Sbjct: 179 GSTPPEGFPNSW-NGATYCTSELAVMKNNEIHTWDRGYDDDGNQVWGQKEGPYEFKPAPT 237

Query: 241 SSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           S ++D FSPLNFPP   +++RIEGSFVLQE
Sbjct: 238 SCFNDTFSPLNFPPPPSMDRRIEGSFVLQE 267


>gi|357156625|ref|XP_003577521.1| PREDICTED: chromophore lyase CpcT/CpeT 3-like [Brachypodium
           distachyon]
          Length = 269

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 192/270 (71%), Positives = 221/270 (81%), Gaps = 2/270 (0%)

Query: 1   MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
           MG   DS      G GG  +G  +KA V+ GG +L++RL +S TRWDHAR VAD+LSGEK
Sbjct: 1   MGSGEDSPGGTGAGVGGIVRGAVLKALVVFGGVLLIRRLRRSTTRWDHARAVADALSGEK 60

Query: 61  FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
           FS+EQA +DP+N+FN+RMLTCPA EMVDGS +LY EQAFWR P+KPFRQRFYMVKPCPKE
Sbjct: 61  FSREQARKDPENYFNLRMLTCPATEMVDGSGVLYFEQAFWRAPEKPFRQRFYMVKPCPKE 120

Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
           +KCDVE+SSYAIR+VEEYKNFCDRP+DQRP PEEVI DI EHLTTIHL RC RGKRCLYE
Sbjct: 121 MKCDVELSSYAIRDVEEYKNFCDRPKDQRPQPEEVIADIAEHLTTIHLSRCGRGKRCLYE 180

Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
           GSTP GGFPN+W +GA Y TS+L++ KN EIH WD+GFDD+GNQVWG KAGPYEFKPAP 
Sbjct: 181 GSTPSGGFPNNW-SGAAYCTSDLSIHKNGEIHIWDKGFDDEGNQVWGTKAGPYEFKPAPK 239

Query: 241 SSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           S+Y DMFSPLNF     LEK IE S+V  +
Sbjct: 240 SNYDDMFSPLNFSAPLSLEK-IESSYVTDD 268


>gi|326514408|dbj|BAJ96191.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 269

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 182/247 (73%), Positives = 212/247 (85%), Gaps = 2/247 (0%)

Query: 24  VKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPA 83
           +KA V+ GG +L++RL +S TRWDHAR VAD+LSGEKFS+EQA +DP N+FN+RMLTCPA
Sbjct: 24  LKALVVFGGVILIRRLRRSTTRWDHARTVADALSGEKFSREQARQDPGNYFNLRMLTCPA 83

Query: 84  AEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCD 143
            EMVDGS +LY EQAFWR P+KPFRQRFYMVKPCPKE+KCDVE+SSYAIR+VEEYKNFCD
Sbjct: 84  TEMVDGSGVLYFEQAFWRAPEKPFRQRFYMVKPCPKEMKCDVELSSYAIRDVEEYKNFCD 143

Query: 144 RPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSEL 203
           RP+DQRP PEEVI DI EHLTTIHL RC+RGKRCLYEGSTP GGFPNSW +GA Y TS+L
Sbjct: 144 RPKDQRPQPEEVIADIAEHLTTIHLSRCERGKRCLYEGSTPAGGFPNSW-SGAAYCTSDL 202

Query: 204 AVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQFLEKRIE 263
           ++ KN EIH WD+GFDD+G+QVWG KAGPYEFKPAP S+Y DMFSPLNF     LEK +E
Sbjct: 203 SIHKNGEIHIWDKGFDDNGSQVWGTKAGPYEFKPAPKSNYDDMFSPLNFSAPLSLEK-ME 261

Query: 264 GSFVLQE 270
            S+ + +
Sbjct: 262 SSYAIDD 268


>gi|449454794|ref|XP_004145139.1| PREDICTED: chromophore lyase CpcT/CpeT 3-like [Cucumis sativus]
 gi|449473876|ref|XP_004154009.1| PREDICTED: chromophore lyase CpcT/CpeT 3-like [Cucumis sativus]
          Length = 266

 Score =  397 bits (1019), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 200/270 (74%), Positives = 231/270 (85%), Gaps = 5/270 (1%)

Query: 1   MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
           MG +S+  +N + GW  RA GL +K  VL+GGA+LLKRLTKS TRWDHAR V+ SLSGEK
Sbjct: 1   MGTNSNDNNN-SVGWN-RAGGLIMKTLVLVGGALLLKRLTKSTTRWDHARFVSQSLSGEK 58

Query: 61  FSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
           FSKEQA+RDPDN+FNIR +  P   +VDGS +LY EQAF RTPQKPFRQRFY VKPC KE
Sbjct: 59  FSKEQAARDPDNYFNIRYVNPPPPPLVDGSTVLYFEQAFSRTPQKPFRQRFYTVKPCSKE 118

Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
           +KCDVE+SSYAIRE+EEYKNFCDR +DQRPLPEE++GDI EHLTT+HL+RCDRGKRCLYE
Sbjct: 119 MKCDVELSSYAIREMEEYKNFCDRTKDQRPLPEEIVGDIAEHLTTLHLKRCDRGKRCLYE 178

Query: 181 GSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
           GSTPPGGFPNSW +GA+Y TSE+A+LKNNE+HTW+RG+D+DGNQVWG K GPYEFKP P+
Sbjct: 179 GSTPPGGFPNSW-SGASYCTSEIAILKNNEVHTWERGYDEDGNQVWGTKEGPYEFKPVPA 237

Query: 241 SSYSDMFSPLNFPPQQFLEKRI-EGSFVLQ 269
           SS  DMFSPLNF  Q  +EKRI EGSFVLQ
Sbjct: 238 SSLKDMFSPLNF-SQPSMEKRILEGSFVLQ 266


>gi|115485691|ref|NP_001067989.1| Os11g0524300 [Oryza sativa Japonica Group]
 gi|77551200|gb|ABA93997.1| expressed protein [Oryza sativa Japonica Group]
 gi|113645211|dbj|BAF28352.1| Os11g0524300 [Oryza sativa Japonica Group]
 gi|215741068|dbj|BAG97563.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218185839|gb|EEC68266.1| hypothetical protein OsI_36301 [Oryza sativa Indica Group]
          Length = 275

 Score =  390 bits (1003), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 176/229 (76%), Positives = 202/229 (88%), Gaps = 1/229 (0%)

Query: 42  SKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWR 101
           S TRWDHAR V D+LSGEKFS+EQA +DPDN+FN+RMLTCPA EMVDGS++LY EQAFWR
Sbjct: 47  STTRWDHARAVVDALSGEKFSREQARKDPDNYFNLRMLTCPATEMVDGSRVLYFEQAFWR 106

Query: 102 TPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGE 161
           +P+KPFRQRFYMVKPCPK++KCDVE+SSYAIR+VEEYKNFCDRP+DQRP PEEVI DI E
Sbjct: 107 SPEKPFRQRFYMVKPCPKDMKCDVELSSYAIRDVEEYKNFCDRPKDQRPQPEEVIADIAE 166

Query: 162 HLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDD 221
           HLTTIHL RC+RGKRCLY+GSTPP GFPNSW +GATY TS+L++ KN E+H WD+GFDDD
Sbjct: 167 HLTTIHLSRCERGKRCLYKGSTPPEGFPNSW-SGATYCTSDLSIHKNGEVHIWDKGFDDD 225

Query: 222 GNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           GNQVWG KAGPYEFKPAP S+Y DMFSPLNF     LEK+IE SF + +
Sbjct: 226 GNQVWGTKAGPYEFKPAPKSNYDDMFSPLNFSAPLTLEKKIESSFAIDD 274


>gi|125577323|gb|EAZ18545.1| hypothetical protein OsJ_34073 [Oryza sativa Japonica Group]
          Length = 275

 Score =  389 bits (1000), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 176/227 (77%), Positives = 201/227 (88%), Gaps = 1/227 (0%)

Query: 42  SKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWR 101
           S TRWDHAR V D+LSGEKFS+EQA +DPDN+FN+RMLTCPA EMVDGS++LY EQAFWR
Sbjct: 47  STTRWDHARAVVDALSGEKFSREQARKDPDNYFNLRMLTCPATEMVDGSRVLYFEQAFWR 106

Query: 102 TPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGE 161
           +P+KPFRQRFYMVKPCPK++KCDVE+SSYAIR+VEEYKNFCDRP+DQRP PEEVI DI E
Sbjct: 107 SPEKPFRQRFYMVKPCPKDMKCDVELSSYAIRDVEEYKNFCDRPKDQRPQPEEVIADIAE 166

Query: 162 HLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDD 221
           HLTTIHL RC+RGKRCLY+GSTPP GFPNSW +GATY TS+L++ KN E+H WD+GFDDD
Sbjct: 167 HLTTIHLSRCERGKRCLYKGSTPPEGFPNSW-SGATYCTSDLSIHKNGEVHIWDKGFDDD 225

Query: 222 GNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQFLEKRIEGSFVL 268
           GNQVWG KAGPYEFKPAP S+Y DMFSPLNF     LEK+IE SF +
Sbjct: 226 GNQVWGTKAGPYEFKPAPKSNYDDMFSPLNFSAPLTLEKKIESSFAI 272


>gi|293331841|ref|NP_001168901.1| uncharacterized protein LOC100382707 [Zea mays]
 gi|223973587|gb|ACN30981.1| unknown [Zea mays]
          Length = 266

 Score =  380 bits (975), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 171/229 (74%), Positives = 199/229 (86%), Gaps = 1/229 (0%)

Query: 42  SKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWR 101
           S TRWDHAR VAD+LSGEKFS+EQA +DPDNFFN+RMLTCPA EMVDGS++LY EQAFWR
Sbjct: 38  STTRWDHARAVADALSGEKFSREQARKDPDNFFNLRMLTCPATEMVDGSRVLYFEQAFWR 97

Query: 102 TPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGE 161
           +P+KPFRQRFYMVKPCPKE+KCDVE+SSYAIR+ EEYKNFCDR +DQRP  EEVI DI E
Sbjct: 98  SPEKPFRQRFYMVKPCPKEMKCDVELSSYAIRDAEEYKNFCDRQKDQRPQAEEVIADIAE 157

Query: 162 HLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDD 221
           HLTTIHL RC RGKRCLYEGSTPP GFPN+W +GA+Y TS+L++ KN E+H WD+GFDD+
Sbjct: 158 HLTTIHLSRCGRGKRCLYEGSTPPEGFPNNW-SGASYCTSDLSIHKNGEVHIWDKGFDDE 216

Query: 222 GNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           GNQVWG KAGPYEFKPAP S Y DMFSPLNF     LEK+++ ++V+ +
Sbjct: 217 GNQVWGTKAGPYEFKPAPKSKYDDMFSPLNFSAPLSLEKKLDKAYVIDD 265


>gi|242065282|ref|XP_002453930.1| hypothetical protein SORBIDRAFT_04g021600 [Sorghum bicolor]
 gi|241933761|gb|EES06906.1| hypothetical protein SORBIDRAFT_04g021600 [Sorghum bicolor]
          Length = 266

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 170/229 (74%), Positives = 197/229 (86%), Gaps = 1/229 (0%)

Query: 42  SKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWR 101
           S TRWDHAR VAD+LSGEKFS+EQA +DPDNFFN+RMLTCPA E VDGS++LY E AFWR
Sbjct: 38  STTRWDHARAVADALSGEKFSREQARKDPDNFFNLRMLTCPATETVDGSRVLYFEHAFWR 97

Query: 102 TPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGE 161
           +P+KPFRQRFYMVKPCPKE+KCDVE+SSYAIR+ EEYKNFCDR +DQRP PEEVI DI E
Sbjct: 98  SPEKPFRQRFYMVKPCPKEMKCDVELSSYAIRDAEEYKNFCDRQKDQRPQPEEVIADIAE 157

Query: 162 HLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDD 221
           HLTTIHL RC RGKRCLYEGSTPP GFPN+W NGA+Y TS+L++ KN E+H WD+GFDD+
Sbjct: 158 HLTTIHLSRCGRGKRCLYEGSTPPEGFPNNW-NGASYCTSDLSIHKNGEVHIWDKGFDDE 216

Query: 222 GNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           GNQVWG K GPYEFKPAP S Y DMFSPLNF     LEK+++ ++V+ +
Sbjct: 217 GNQVWGTKVGPYEFKPAPKSKYDDMFSPLNFSAPLSLEKKLDKAYVIDD 265


>gi|242068607|ref|XP_002449580.1| hypothetical protein SORBIDRAFT_05g019560 [Sorghum bicolor]
 gi|241935423|gb|EES08568.1| hypothetical protein SORBIDRAFT_05g019560 [Sorghum bicolor]
          Length = 265

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 171/229 (74%), Positives = 198/229 (86%), Gaps = 2/229 (0%)

Query: 42  SKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWR 101
           S TRWDHAR VAD+LSGEKFS+EQA +DPDNFFN+RMLTCPA EMVDGS++LY EQAFWR
Sbjct: 38  STTRWDHARAVADALSGEKFSREQARKDPDNFFNLRMLTCPATEMVDGSRVLYFEQAFWR 97

Query: 102 TPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGE 161
           +P+KPFRQRFYMVKPCPKE+KCDVE+SSYAIR+ EEYKNFCDR +DQRP PEEVI DI E
Sbjct: 98  SPEKPFRQRFYMVKPCPKEMKCDVELSSYAIRDAEEYKNFCDRQKDQRPQPEEVIADIAE 157

Query: 162 HLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDD 221
           HLTTIHL RC RGKRCLYEGSTPP GFPN+W +GA+Y TS+L++ KN E+H WD+GFDD+
Sbjct: 158 HLTTIHLSRCGRGKRCLYEGSTPPEGFPNNW-SGASYCTSDLSIHKNGEVHIWDKGFDDE 216

Query: 222 GNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           GNQVWG K GPYEFKPAP S Y DMFSPLNF     LEK ++ ++V+ +
Sbjct: 217 GNQVWGTKVGPYEFKPAPKSKYDDMFSPLNFSAPLSLEK-LDKAYVIDD 264


>gi|294463077|gb|ADE77076.1| unknown [Picea sitchensis]
          Length = 264

 Score =  356 bits (913), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 163/258 (63%), Positives = 207/258 (80%), Gaps = 3/258 (1%)

Query: 11  PNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDP 70
           P+     RA+ +F+KA  ++GGA+L+++LT+  T  DHARIVAD+L+GEKFS EQASRDP
Sbjct: 7   PSSSGRSRARNVFIKALCVVGGALLVRKLTQPTT--DHARIVADALTGEKFSSEQASRDP 64

Query: 71  DNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSY 130
            N+FN+RML CPA  M DGS++LY EQAFWRTP+KP+RQRFY+VKPC KE+KCDVEV+SY
Sbjct: 65  MNYFNLRMLVCPATVMADGSRVLYFEQAFWRTPEKPYRQRFYVVKPCHKEMKCDVEVNSY 124

Query: 131 AIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPN 190
           A+R+VEEY+NFC+RP+ QRP  EEV  D+ +HLTT++L RC+RGKRCLYEGSTPPGGFPN
Sbjct: 125 AVRDVEEYRNFCERPKGQRPQAEEVERDVAKHLTTVYLSRCERGKRCLYEGSTPPGGFPN 184

Query: 191 SWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSYSDMFSPL 250
           SW NGAT+ TS+L + KN E+H WDR +DD+GNQVWGV  GPYEFK A +SS +  F PL
Sbjct: 185 SW-NGATHCTSDLTIYKNGEVHCWDRAYDDEGNQVWGVTEGPYEFKSATTSSSNGSFFPL 243

Query: 251 NFPPQQFLEKRIEGSFVL 268
           +F      +K  + SFV+
Sbjct: 244 DFSSISLGKKLDDKSFVV 261


>gi|302771463|ref|XP_002969150.1| hypothetical protein SELMODRAFT_170435 [Selaginella moellendorffii]
 gi|300163655|gb|EFJ30266.1| hypothetical protein SELMODRAFT_170435 [Selaginella moellendorffii]
          Length = 270

 Score =  350 bits (898), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 153/225 (68%), Positives = 192/225 (85%), Gaps = 1/225 (0%)

Query: 20  QGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRML 79
           +   +KAA L+GG +LL++LTK+KTRWDH R+VAD+L+GEKFS+EQA+RDP  +FN+RML
Sbjct: 34  RSFVLKAACLLGGVLLLRKLTKAKTRWDHTRLVADALTGEKFSQEQAARDPMTYFNLRML 93

Query: 80  TCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYK 139
            CPA  + DG+K+LY EQAFWRTP KP+RQRFY+V+PCPKE+KCDVEV SYA+R++EEYK
Sbjct: 94  ACPATVLDDGAKVLYFEQAFWRTPDKPYRQRFYVVRPCPKEMKCDVEVGSYAVRDIEEYK 153

Query: 140 NFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYS 199
           NFC+RP+DQRP PEE+ GDI EHLT+++L  C RG+RCLYEGSTPPGGFPN+W NGA+  
Sbjct: 154 NFCERPKDQRPQPEEIPGDISEHLTSVYLSACARGQRCLYEGSTPPGGFPNNW-NGASRC 212

Query: 200 TSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSYS 244
           TSEL +LK+ EIH WDR +DD+GNQVWGV+ GPYEFKP  S + S
Sbjct: 213 TSELTILKSGEIHCWDRAYDDEGNQVWGVRQGPYEFKPGTSKNRS 257


>gi|380356234|dbj|BAL72655.1| CRUMPLED LEAF1 [Physcomitrella patens subsp. patens]
          Length = 314

 Score =  349 bits (896), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 152/222 (68%), Positives = 188/222 (84%), Gaps = 1/222 (0%)

Query: 19  AQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRM 78
            +GL +KAA LIGGA LL++LTKS TRWDHAR VA SLSGEKFS EQA+RDP  +FN+R+
Sbjct: 87  VRGLVIKAACLIGGAFLLRKLTKSTTRWDHARKVAQSLSGEKFSTEQAARDPTTYFNLRL 146

Query: 79  LTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEY 138
           LTCPA  + DG++++Y EQAFWRTP++P+RQRFY +KPCPKE+KCDVEV SYA+R++EEY
Sbjct: 147 LTCPATVLADGARVMYFEQAFWRTPERPYRQRFYSIKPCPKEMKCDVEVGSYAVRDIEEY 206

Query: 139 KNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATY 198
           KNFCDR +D+RP P+EV+ D+ EHL T++L  C+RG+RCLYEGSTPPGGFPNSW NGA+ 
Sbjct: 207 KNFCDRSKDERPQPDEVLKDMAEHLNTVYLSVCERGRRCLYEGSTPPGGFPNSW-NGASR 265

Query: 199 STSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
            TSEL + KN E+H WDR +DD+GNQVWGV+ GPYEFK A S
Sbjct: 266 CTSELTIYKNGEVHCWDRAYDDEGNQVWGVRQGPYEFKTATS 307


>gi|168022710|ref|XP_001763882.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162684887|gb|EDQ71286.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|380356236|dbj|BAL72656.1| CRUMPLED LEAF2 [Physcomitrella patens subsp. patens]
          Length = 342

 Score =  349 bits (896), Expect = 7e-94,   Method: Compositional matrix adjust.
 Identities = 151/224 (67%), Positives = 188/224 (83%), Gaps = 1/224 (0%)

Query: 19  AQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRM 78
            +GL VKAA LIGGA LL++LTK+ TRWDHAR+V  +LSGEKFS EQA+RDP  +FN+R 
Sbjct: 92  VRGLVVKAACLIGGAFLLRKLTKTTTRWDHARMVVQALSGEKFSTEQAARDPMTYFNLRF 151

Query: 79  LTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEY 138
           LTCPA  + DG+ ++Y EQAFWRTP++P+RQRFY VKPCPKE+KCDVEV SYA+R++E+Y
Sbjct: 152 LTCPATVLADGAHVMYFEQAFWRTPERPYRQRFYNVKPCPKEMKCDVEVGSYAVRDIEDY 211

Query: 139 KNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATY 198
           +NFCDRP+DQRP P+EV+ D+ EHL T++L  C+RG+RCLYEGSTPP GFPNSW NGA+ 
Sbjct: 212 RNFCDRPKDQRPQPDEVLKDVAEHLNTVYLSACERGRRCLYEGSTPPEGFPNSW-NGASR 270

Query: 199 STSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSS 242
            TSEL + KN E+H WDR +D++GNQVWGV+ GPYEFKPA  SS
Sbjct: 271 CTSELTIYKNGEVHCWDRAYDNEGNQVWGVRQGPYEFKPATQSS 314


>gi|168031997|ref|XP_001768506.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680219|gb|EDQ66657.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 277

 Score =  347 bits (889), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 153/223 (68%), Positives = 189/223 (84%), Gaps = 2/223 (0%)

Query: 19  AQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNI-R 77
            +GL +KAA LIGGA LL++LTKS TRWDHAR VA SLSGEKFS EQA+RDP  +FN+ R
Sbjct: 49  VRGLVIKAACLIGGAFLLRKLTKSTTRWDHARKVAQSLSGEKFSTEQAARDPTTYFNLSR 108

Query: 78  MLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEE 137
           +LTCPA  + DG++++Y EQAFWRTP++P+RQRFY +KPCPKE+KCDVEVSSYA+R++EE
Sbjct: 109 LLTCPATVLADGARVMYFEQAFWRTPERPYRQRFYSIKPCPKEMKCDVEVSSYAVRDIEE 168

Query: 138 YKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGAT 197
           YKNFCDR +D+RP P+EV+ D+ EHL T++L  C+RG+RCLYEGSTPPGGFPNSW NGA+
Sbjct: 169 YKNFCDRSKDERPQPDEVLKDMAEHLNTVYLSVCERGRRCLYEGSTPPGGFPNSW-NGAS 227

Query: 198 YSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPS 240
             TSEL + KN E+H WDR +DD+GNQVWGV+ GPYEFK A S
Sbjct: 228 RCTSELTIYKNGEVHCWDRAYDDEGNQVWGVRQGPYEFKTATS 270


>gi|449523435|ref|XP_004168729.1| PREDICTED: chromophore lyase CpcT/CpeT 3-like, partial [Cucumis
           sativus]
          Length = 191

 Score =  332 bits (851), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 154/193 (79%), Positives = 174/193 (90%), Gaps = 3/193 (1%)

Query: 78  MLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEE 137
           M+TCPAAEMVDGS +LY EQAF RTPQKPFRQRFY VKPC KE+KCDVE+SSYAIRE+EE
Sbjct: 1   MITCPAAEMVDGSTVLYFEQAFSRTPQKPFRQRFYTVKPCSKEMKCDVELSSYAIREMEE 60

Query: 138 YKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGAT 197
           YKNFCDR +DQRPLPEE++GDI EHLTT+HL+RCDRGKRCLYEGSTPPGGFPNSW +GA+
Sbjct: 61  YKNFCDRTKDQRPLPEEIVGDIAEHLTTLHLKRCDRGKRCLYEGSTPPGGFPNSW-SGAS 119

Query: 198 YSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQF 257
           Y TSE+A+LKNNE+HTW+RG+D+DGNQVWG K GPYEFKP P+SS  DMFSPLNF  Q  
Sbjct: 120 YCTSEIAILKNNEVHTWERGYDEDGNQVWGTKEGPYEFKPVPASSLKDMFSPLNF-SQPS 178

Query: 258 LEKRI-EGSFVLQ 269
           +EKRI EGSFVLQ
Sbjct: 179 MEKRILEGSFVLQ 191


>gi|302784280|ref|XP_002973912.1| hypothetical protein SELMODRAFT_100123 [Selaginella moellendorffii]
 gi|300158244|gb|EFJ24867.1| hypothetical protein SELMODRAFT_100123 [Selaginella moellendorffii]
          Length = 270

 Score =  330 bits (845), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 152/225 (67%), Positives = 191/225 (84%), Gaps = 1/225 (0%)

Query: 20  QGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRML 79
           +   +KA  L+GG +LL++LTK+KTRWDH R+VAD+L+GEKFS+EQA+RDP  +FN+RML
Sbjct: 34  RSFVLKAVCLLGGVLLLRKLTKAKTRWDHTRLVADALTGEKFSQEQAARDPMTYFNLRML 93

Query: 80  TCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYK 139
            CPA  + DG+K+LY EQAFWRTP KP+RQRFY+V+PCPKE+KCDVEV SYA+R++EEYK
Sbjct: 94  ACPATVLDDGAKVLYFEQAFWRTPDKPYRQRFYVVRPCPKEMKCDVEVGSYAVRDIEEYK 153

Query: 140 NFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYS 199
           NFC+RP+DQRP PEE+ GDI EHLT+++L  C RG+RCLYEGSTPPGGFPN+W NGA+  
Sbjct: 154 NFCERPKDQRPQPEEIPGDISEHLTSVYLSACARGQRCLYEGSTPPGGFPNNW-NGASRC 212

Query: 200 TSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSYS 244
           TSEL +LK+ EIH WDR +DD+GNQVWGV+ GPYEFKP  S + S
Sbjct: 213 TSELTILKSGEIHCWDRAYDDEGNQVWGVRQGPYEFKPGTSKNRS 257


>gi|387169542|gb|AFJ66202.1| hypothetical protein 7G9.10 [Boechera stricta]
          Length = 197

 Score =  256 bits (653), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 142/269 (52%), Positives = 163/269 (60%), Gaps = 77/269 (28%)

Query: 2   GPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKF 61
           G D +S+SN   GW  RA+GL VK  VLIGGA+L+KRLTKS TR DHAR+V+ SL+GEK 
Sbjct: 6   GSDPESSSN---GWS-RARGLVVKTLVLIGGALLIKRLTKSTTRRDHARVVSRSLTGEK- 60

Query: 62  SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
                                               AFWRTPQKPFRQR YMVKPCPKEL
Sbjct: 61  ------------------------------------AFWRTPQKPFRQRLYMVKPCPKEL 84

Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
           KCDVEVSSYAIR+ EEYKNFCDRP+DQRPLPEEVIG                        
Sbjct: 85  KCDVEVSSYAIRDAEEYKNFCDRPKDQRPLPEEVIG------------------------ 120

Query: 182 STPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSS 241
                       NGA+YS S+L VLKNNEIH WDRGFD++ NQVWG K GPYEFKPA SS
Sbjct: 121 ------------NGASYSISDLVVLKNNEIHLWDRGFDENRNQVWGPKQGPYEFKPATSS 168

Query: 242 SYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           S ++  S LN   Q  ++K I+GS +LQ+
Sbjct: 169 SVNENLSALNILYQSSIDKPIQGSLILQD 197


>gi|388506300|gb|AFK41216.1| unknown [Lotus japonicus]
          Length = 136

 Score =  247 bits (631), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 113/136 (83%), Positives = 124/136 (91%)

Query: 135 VEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQN 194
           +EEYKNFCDRP+DQRP PEEVIGDI EHLTTIHL+RC RGKRCLYEGST PGGFPNSWQN
Sbjct: 1   MEEYKNFCDRPKDQRPQPEEVIGDIAEHLTTIHLKRCPRGKRCLYEGSTSPGGFPNSWQN 60

Query: 195 GATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSSSYSDMFSPLNFPP 254
           GATY TSELAVLKNNEIHTWDRGFDDDGNQVWG K GPYEFKPAP+S ++DM SPLNFPP
Sbjct: 61  GATYCTSELAVLKNNEIHTWDRGFDDDGNQVWGQKDGPYEFKPAPTSCFNDMLSPLNFPP 120

Query: 255 QQFLEKRIEGSFVLQE 270
              +++RIEGSF+LQ+
Sbjct: 121 PPSMDRRIEGSFILQD 136


>gi|387169574|gb|AFJ66233.1| hypothetical protein 34G24.30 [Capsella rubella]
          Length = 186

 Score =  229 bits (585), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 130/269 (48%), Positives = 153/269 (56%), Gaps = 89/269 (33%)

Query: 2   GPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKF 61
           G D +S+SN   GW  RA+GL +K  VL+GGA+L+KRLTKS TR DHAR+V+ SL+GEK 
Sbjct: 7   GSDPESSSN---GWS-RARGLVIKTLVLVGGALLIKRLTKSTTRRDHARVVSRSLTGEK- 61

Query: 62  SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
                                                           R YMVKPCPKEL
Sbjct: 62  ------------------------------------------------RLYMVKPCPKEL 73

Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
           KCDVEVSSYAIR+ EEYKNFCDRP+DQRPLPEEVIG                        
Sbjct: 74  KCDVEVSSYAIRDAEEYKNFCDRPKDQRPLPEEVIG------------------------ 109

Query: 182 STPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFKPAPSS 241
                       NGA+YSTS+LA+LKNNEIH WDRGFD++ NQVWG K GPYEFKPA SS
Sbjct: 110 ------------NGASYSTSDLAILKNNEIHLWDRGFDENRNQVWGPKQGPYEFKPATSS 157

Query: 242 SYSDMFSPLNFPPQQFLEKRIEGSFVLQE 270
           S S+  S LN   Q  ++K I+GS +LQ+
Sbjct: 158 SVSENLSALNILYQSSIDKPIQGSLILQD 186


>gi|413920957|gb|AFW60889.1| hypothetical protein ZEAMMB73_192264 [Zea mays]
          Length = 168

 Score =  211 bits (537), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 94/116 (81%), Positives = 106/116 (91%)

Query: 42  SKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWR 101
           S TRWDHAR VAD+LSGEKFS+EQA +DPDNFFN+RMLTCPA EMVDGS++LY EQAFWR
Sbjct: 38  STTRWDHARAVADALSGEKFSREQARKDPDNFFNLRMLTCPATEMVDGSRVLYFEQAFWR 97

Query: 102 TPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
           +P+KPFRQRFYMVKPCPKE+KCDVE+SSYAIR+ EEYKNFCDR +DQRP  EEVI 
Sbjct: 98  SPEKPFRQRFYMVKPCPKEMKCDVELSSYAIRDAEEYKNFCDRQKDQRPQAEEVIA 153


>gi|387169526|gb|AFJ66187.1| hypothetical protein 11M19.7 [Arabidopsis halleri]
          Length = 245

 Score =  191 bits (485), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 118/259 (45%), Positives = 133/259 (51%), Gaps = 112/259 (43%)

Query: 2   GPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKF 61
           G D +S+SN   GW  RA+GL VK  VLIGGA+L+KRLTKS TR DHAR+V+ SL+GEK 
Sbjct: 6   GSDPESSSN---GWS-RARGLVVKTLVLIGGALLIKRLTKSTTRRDHARVVSRSLTGEK- 60

Query: 62  SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFR------------- 108
                                               AFWRTPQKPFR             
Sbjct: 61  ------------------------------------AFWRTPQKPFRQVCNHSFPDYLLI 84

Query: 109 ----------------------QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPR 146
                                 QR YMVKPCPKELKCDVEVSSYAIR+ EEYKNFCDRP+
Sbjct: 85  DSIREVYYCVAEVLCIVCLFWLQRLYMVKPCPKELKCDVEVSSYAIRDAEEYKNFCDRPK 144

Query: 147 DQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVL 206
           DQRPLPEEVIG                                    NGA+Y TS+LAVL
Sbjct: 145 DQRPLPEEVIG------------------------------------NGASYCTSDLAVL 168

Query: 207 KNNEIHTWDRGFDDDGNQV 225
           KN+EIH WDRGFD++ NQV
Sbjct: 169 KNSEIHLWDRGFDENRNQV 187


>gi|428226391|ref|YP_007110488.1| hypothetical protein GEI7407_2966 [Geitlerinema sp. PCC 7407]
 gi|427986292|gb|AFY67436.1| protein of unknown function DUF1001 [Geitlerinema sp. PCC 7407]
          Length = 400

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 80/256 (31%), Positives = 125/256 (48%), Gaps = 30/256 (11%)

Query: 21  GLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLT 80
           G    A  L  GA+    + ++    + A  VAD L G   +  QA        ++RM+T
Sbjct: 7   GALGMAIALQMGAIAPPSIAQAAPLEEQATEVADRLVGTLSTVHQALTS-RRVADVRMVT 65

Query: 81  CPA----AEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVE 136
           CP     A    G   LY EQA    P++P+RQRF  + P P      VE +++   E+ 
Sbjct: 66  CPVTVTDAAPAPGVSFLYQEQAIALNPEQPYRQRFLRIAPYPASQS--VESTAWKPPELA 123

Query: 137 EYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGA 196
            +   C +P  QR   +  + D+G  L  ++L+    G R  + G+TPP G P++++ GA
Sbjct: 124 PWVGLCQKPLAQR---QVRLADLGSPLCQVYLKPA--GDR--FVGATPPEGCPSNFR-GA 175

Query: 197 TYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFK---PAPSSSYSDMFSPLNFP 253
           T  T+++ VL+ + + TWDRGFD  G QVWG +   Y+F+   PA   +  D  +     
Sbjct: 176 TRVTNQI-VLEADRMETWDRGFDAAGQQVWGAQDESYQFRRLDPASRDAEVDAIA----- 229

Query: 254 PQQFLEKRIEGSFVLQ 269
                 +R+ G+FV Q
Sbjct: 230 ------RRLHGAFVAQ 239


>gi|218441065|ref|YP_002379394.1| hypothetical protein PCC7424_4155 [Cyanothece sp. PCC 7424]
 gi|218173793|gb|ACK72526.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
          Length = 212

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 64/191 (33%), Positives = 98/191 (51%), Gaps = 15/191 (7%)

Query: 48  HARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYL--EQAFWRTPQK 105
           H   V   L G   +  Q +++P N  N+RM TC    +  G+  +YL  EQA  +T  K
Sbjct: 29  HINSVVSHLVGVMDTSAQVAKNP-NKANVRMTTCQVT-LTGGNDSIYLYQEQALTKTLDK 86

Query: 106 PFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTT 165
           P+RQRF  +KP  +     VE  SY   + E    FC++P  +R L    + ++GE + +
Sbjct: 87  PYRQRFLEIKPTLEPET--VESKSYKPLQAERLIGFCNKPLSERVLN---VSELGEPVCS 141

Query: 166 IHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQV 225
           + L+  + G    Y G T P G P + +   T + + L  L +  + TWD+G+D  GNQV
Sbjct: 142 VFLKPSNNG----YLGETQPEGCPANVRGAVTITNTIL--LHSEGMDTWDKGYDAQGNQV 195

Query: 226 WGVKAGPYEFK 236
           WG    P+EF+
Sbjct: 196 WGANDDPFEFR 206


>gi|411119057|ref|ZP_11391437.1| protein of unknown function DUF1001, CpeT/CpcT family
           [Oscillatoriales cyanobacterium JSC-12]
 gi|410710920|gb|EKQ68427.1| protein of unknown function DUF1001, CpeT/CpcT family
           [Oscillatoriales cyanobacterium JSC-12]
          Length = 223

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 100/194 (51%), Gaps = 18/194 (9%)

Query: 48  HARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSK-----LLYLEQAFWRT 102
            A+ VA  L G   +  QA  +P    +++M TC   +++D +       LY EQA    
Sbjct: 35  QAKKVATLLEGVMDTTAQARVNPK-APSVQMTTC-RVQVIDTNNHTSAIFLYQEQALTSK 92

Query: 103 PQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEH 162
             +P+RQRF  + P P+     V   S+ + + E +  FC++P  QR L      D+G  
Sbjct: 93  LSQPYRQRFLEISPHPETQT--VRSRSFRLTQPERWAGFCNQPDAQRTLQSR---DLGNP 147

Query: 163 LTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDG 222
           + ++ L++   G    + G TPP G P + + GA   T+ +  L    ++TWDRGFD DG
Sbjct: 148 VCSVFLKQTPTG----FLGETPPEGCPTNAR-GAVRITNTIE-LNTAGMNTWDRGFDADG 201

Query: 223 NQVWGVKAGPYEFK 236
           NQVWG ++  Y+F+
Sbjct: 202 NQVWGAQSESYQFR 215


>gi|425465472|ref|ZP_18844781.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
 gi|389832277|emb|CCI24241.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
          Length = 202

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 97/199 (48%), Gaps = 13/199 (6%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
           LT+S       + VA+ L G   ++EQA  +P     ++M TC         S  LY EQ
Sbjct: 12  LTQSLPLNPQVQGVANHLIGVMDTREQAQTNP-RIAKVQMTTCAVDFSPKQDSIYLYQEQ 70

Query: 98  AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
           A      +P+RQR  +++P P      VE  +Y +     + NFC++   +R L    + 
Sbjct: 71  AIIDRLNQPYRQRILVIQPSPDN--STVESKAYKLNNAPNFINFCNKDLTERKLN---VS 125

Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRG 217
           D+ E + T+ L+    G    Y G TPP G P + + GA   T+ + +L +  + T DRG
Sbjct: 126 DLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDTSDRG 179

Query: 218 FDDDGNQVWGVKAGPYEFK 236
           +D  G QVWG +   Y+F+
Sbjct: 180 YDSLGQQVWGAQDNVYQFR 198


>gi|427419543|ref|ZP_18909726.1| protein of unknown function DUF1001, CpeT/CpcT family [Leptolyngbya
           sp. PCC 7375]
 gi|425762256|gb|EKV03109.1| protein of unknown function DUF1001, CpeT/CpcT family [Leptolyngbya
           sp. PCC 7375]
          Length = 214

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 65/194 (33%), Positives = 92/194 (47%), Gaps = 21/194 (10%)

Query: 48  HARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSK-----LLYLEQAFWRT 102
           H   V D L+    + EQA R+P  F  ++M TCP    V GS       LY EQA    
Sbjct: 33  HVDAVVDHLTRPMDTTEQAERNP-RFVGVQMTTCPI--QVTGSADQYGIYLYQEQALTAQ 89

Query: 103 PQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEH 162
            + P+RQRF  +       +  VE  ++     E +   C     Q+  P      +GE 
Sbjct: 90  IESPYRQRFLYITLSKDATR--VESHTFKPPSPEAWTGLC-----QQATPSIDTHALGEL 142

Query: 163 LTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDG 222
           +  + LR    G    Y GSTP  G P + + GA   T+ + VL  + + TWDRGFD +G
Sbjct: 143 VCVVSLRPSTLG----YVGSTPAEGCPVNLR-GAVRLTNTI-VLHQDGMDTWDRGFDANG 196

Query: 223 NQVWGVKAGPYEFK 236
            QVWG +A PY+++
Sbjct: 197 EQVWGAEADPYQYR 210


>gi|425470697|ref|ZP_18849557.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9701]
 gi|389883514|emb|CCI36065.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9701]
          Length = 202

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 61/199 (30%), Positives = 96/199 (48%), Gaps = 13/199 (6%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
           LT+S       + VA+ L G   + +QA  +P     ++M TC         S  LY EQ
Sbjct: 12  LTQSLLLNPQVQGVANHLIGVMDTTQQAQTNP-RIAKVQMTTCAVDFSPKQDSIYLYQEQ 70

Query: 98  AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
           A      +P+RQR  +++P P      VE  +Y +     + NFC++   +R L    + 
Sbjct: 71  AIIDRLNQPYRQRILVIQPSPDN--STVESKAYKLNNAPNFINFCNKDLTERKLN---VS 125

Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRG 217
           D+ E + T+ L+    G    Y G TPP G P + + GA   T+ + +L +  + T DRG
Sbjct: 126 DLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDTSDRG 179

Query: 218 FDDDGNQVWGVKAGPYEFK 236
           +D  G QVWG +   Y+F+
Sbjct: 180 YDSLGRQVWGAQDNVYQFR 198


>gi|425439758|ref|ZP_18820073.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
 gi|389719933|emb|CCH96295.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
          Length = 202

 Score = 86.7 bits (213), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 62/200 (31%), Positives = 99/200 (49%), Gaps = 15/200 (7%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMV--DGSKLLYLE 96
           LT+S       + VA+ L G   + +QA  +P     ++M+TC A +      S  LY E
Sbjct: 12  LTQSLPLNPQVQGVANHLIGVMDTTQQAQTNP-RIAKVQMITC-AVDFSPKQDSIYLYQE 69

Query: 97  QAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVI 156
           QA      +P+RQR  +++P P      VE  +Y +     + NFC++   +R L    +
Sbjct: 70  QAIIDRLNQPYRQRILVIQPSPDN--STVESKAYKLNNAPNFINFCNKDLTERRLN---V 124

Query: 157 GDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDR 216
            D+ E + T+ L+    G    Y G TPP G P + + GA   T+ + +L +  + T DR
Sbjct: 125 SDLAESVCTVFLKPIAGG----YRGETPPQGCPTNVR-GAVKITNTI-ILHSQGMDTSDR 178

Query: 217 GFDDDGNQVWGVKAGPYEFK 236
           G+D  G QVWG +   Y+F+
Sbjct: 179 GYDSLGQQVWGAQDNFYQFR 198


>gi|422304440|ref|ZP_16391785.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
           9806]
 gi|389790410|emb|CCI13706.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
           9806]
          Length = 202

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 62/200 (31%), Positives = 98/200 (49%), Gaps = 15/200 (7%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMV--DGSKLLYLE 96
           LT+S       + VA+ L G   + +QA  +P     ++M TC A +      S  LY E
Sbjct: 12  LTQSLPLNPQVQGVANHLIGVMDTTQQAQTNP-RIAKVQMTTC-AVDFSPKQDSIYLYQE 69

Query: 97  QAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVI 156
           QA      +P+RQR  +++P P      VE  +Y +     + NFC++   +R L    +
Sbjct: 70  QAIIDRLNQPYRQRILVIQPSPDN--STVESKAYKLNNAANFINFCNKNLTERKLN---V 124

Query: 157 GDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDR 216
            D+ E + T+ L+    G    Y G TPP G P + + GA   T+ + +L +  + T DR
Sbjct: 125 SDLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDTSDR 178

Query: 217 GFDDDGNQVWGVKAGPYEFK 236
           G+D  G QVWG +   Y+F+
Sbjct: 179 GYDSLGQQVWGARDNFYQFR 198


>gi|425435450|ref|ZP_18815901.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
 gi|389679974|emb|CCH91262.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
          Length = 202

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 99/203 (48%), Gaps = 21/203 (10%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSK-----LL 93
           LT+S       + VA+ L G   ++EQA  +P +   ++M TC     VD S       L
Sbjct: 12  LTQSLPLNPQVQGVANHLIGVMDTREQAQTNP-SIAKVQMTTCA----VDFSPKQDNIYL 66

Query: 94  YLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPE 153
           Y EQA      +P+RQR  +++P        VE  +Y +     + NFC++   +R L  
Sbjct: 67  YQEQAIIDRLNQPYRQRILVIQPSADN--STVESKAYKLNNAPNFINFCNKDLTERKLN- 123

Query: 154 EVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHT 213
             + D+ E + T+ L+    G    Y G TPP G P + + GA   T+ + +L +  + T
Sbjct: 124 --VSDLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILNSQGMDT 175

Query: 214 WDRGFDDDGNQVWGVKAGPYEFK 236
            DRG+D  G QVWG +   Y+F+
Sbjct: 176 SDRGYDSLGQQVWGAQDNVYQFR 198


>gi|390440170|ref|ZP_10228521.1| Genome sequencing data, contig C295 [Microcystis sp. T1-4]
 gi|389836454|emb|CCI32647.1| Genome sequencing data, contig C295 [Microcystis sp. T1-4]
          Length = 202

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 61/199 (30%), Positives = 96/199 (48%), Gaps = 13/199 (6%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
           LT+S       + VA+ L G   + +QA  +P     ++M TC         S  LY EQ
Sbjct: 12  LTQSLPLNPQVQGVANHLIGVMDTTQQAQTNP-RIAKVQMTTCAVDFSPKQDSIYLYQEQ 70

Query: 98  AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
           A      +P+RQR  +++P P      VE  +Y +     + NFC++   +R L    + 
Sbjct: 71  AIIDRLNQPYRQRILVIQPSPDN--STVESKAYKLNNAANFINFCNKDLTERKLN---VS 125

Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRG 217
           D+GE + T+ L+    G    Y G TPP G   + + GA   T+ + +L +  + T DRG
Sbjct: 126 DLGESVCTVFLKPIAGG----YRGETPPQGCSTNAR-GAVKITNTI-ILHSQGMDTSDRG 179

Query: 218 FDDDGNQVWGVKAGPYEFK 236
           +D  G QVWG +   Y+F+
Sbjct: 180 YDSLGRQVWGAQDNVYQFR 198


>gi|428217847|ref|YP_007102312.1| hypothetical protein Pse7367_1595 [Pseudanabaena sp. PCC 7367]
 gi|427989629|gb|AFY69884.1| protein of unknown function DUF1001 [Pseudanabaena sp. PCC 7367]
          Length = 427

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 66/194 (34%), Positives = 98/194 (50%), Gaps = 18/194 (9%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVD-----GSKLLYLEQAFWRTPQKP 106
           VAD L G   +   A  + D   ++RM TC   E++D         LY EQA     ++P
Sbjct: 48  VADYLIGAMDTSAAAIANRD-VADVRMTTC-VVEVIDREPKESVIFLYQEQAISTKLEQP 105

Query: 107 FRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTI 166
           +RQRF  + P  K+ K  VE  S+     + +   C+R  + R + +    ++GE L ++
Sbjct: 106 YRQRFLRIAPS-KDGK-QVESRSFRPAVSDRWIGLCNREPEARQVSQ---AELGEALCSV 160

Query: 167 HL----RRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDG 222
            L     R   G    Y GSTPP G P +++ GA   T+++ +L + E+ TWDRGFD  G
Sbjct: 161 FLSLDAERDQYGNYYRYIGSTPPEGCPTNFR-GAVKITNKI-LLTDGEMETWDRGFDASG 218

Query: 223 NQVWGVKAGPYEFK 236
            QVWG     Y+FK
Sbjct: 219 KQVWGADDSSYKFK 232



 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 53/192 (27%), Positives = 89/192 (46%), Gaps = 17/192 (8%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTC-----PAAEMVDGSKLLYLEQAFWRTPQKP 106
           +A  LSG+  + EQ   DP  F  +R   C     P      G++++ LEQA      K 
Sbjct: 246 IAAMLSGKFNNAEQQQSDP-TFLPVRFNNCVVNIEPDELFPTGTQVMVLEQAANSPELKF 304

Query: 107 FRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTI 166
             QR   +   P        + +Y + +  ++ +FCDRP   + L    IG     +T  
Sbjct: 305 ASQRVAHLYRLPAP-DNSFRMVTYKLVD-GDFADFCDRPASGKVLTATQIGQRECQIT-- 360

Query: 167 HLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVW 226
           + +  D      Y G+TP  G P+ ++ G+TY T + + L ++++  W+R +D    QV 
Sbjct: 361 YQQESD-----YYVGTTPDSGCPSQFR-GSTYITID-SRLSDSQLEFWERWYDGRSRQVA 413

Query: 227 GVKAGPYEFKPA 238
           G ++G Y +KP 
Sbjct: 414 GSESGFYIYKPV 425


>gi|425450833|ref|ZP_18830656.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
 gi|440753055|ref|ZP_20932258.1| chromophore lyase CpcT/CpeT 3 [Microcystis aeruginosa TAIHU98]
 gi|389768139|emb|CCI06654.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
 gi|440177548|gb|ELP56821.1| chromophore lyase CpcT/CpeT 3 [Microcystis aeruginosa TAIHU98]
          Length = 202

 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 61/199 (30%), Positives = 96/199 (48%), Gaps = 13/199 (6%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
           LT+S       + VA+ L G   ++EQA  +P     ++M TC         S  LY EQ
Sbjct: 12  LTQSLPLNPQVQGVANHLIGVMDTREQAQTNP-RIAKVQMTTCAVDFSPKQDSIYLYQEQ 70

Query: 98  AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
           A      +P+RQR  +++P        VE  +Y +     + NFC++   +R L    + 
Sbjct: 71  AIIDRLNQPYRQRILVIQPSADN--STVESKAYKLNNAPNFINFCNKDLTERKLN---VS 125

Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRG 217
           D+ E + T+ L+    G    Y G TPP G P + + GA   T+ + +L +  + T DRG
Sbjct: 126 DLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILNSQGMDTSDRG 179

Query: 218 FDDDGNQVWGVKAGPYEFK 236
           +D  G QVWG +   Y+F+
Sbjct: 180 YDSLGRQVWGAQDNVYQFR 198


>gi|166365024|ref|YP_001657297.1| hypothetical protein MAE_22830 [Microcystis aeruginosa NIES-843]
 gi|317374819|sp|B0JGA0.1|CPXT3_MICAN RecName: Full=Chromophore lyase CpcT/CpeT 3
 gi|166087397|dbj|BAG02105.1| hypothetical protein MAE_22830 [Microcystis aeruginosa NIES-843]
          Length = 202

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/199 (30%), Positives = 96/199 (48%), Gaps = 13/199 (6%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
           LT+S       + VA+ L G   ++EQA  +P     ++M TC         S  LY EQ
Sbjct: 12  LTQSLPLNPQVQGVANHLIGVMDTREQAQTNP-RIAKVQMTTCAVDFSPKQDSIYLYQEQ 70

Query: 98  AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
           A      +P+RQR  +++P        VE  +Y +     + NFC++   +R L    + 
Sbjct: 71  AIIDRLNQPYRQRILVIQPSAD--NSTVESKAYKLNNAPNFINFCNKDLTERELN---VS 125

Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRG 217
           D+ E + T+ L+    G    Y G TPP G P + + GA   T+ + +L +  + T DRG
Sbjct: 126 DLAESVCTVFLQPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDTSDRG 179

Query: 218 FDDDGNQVWGVKAGPYEFK 236
           +D  G QVWG +   Y+F+
Sbjct: 180 YDSLGQQVWGAQDNVYQFR 198


>gi|443322051|ref|ZP_21051086.1| CpeT/CpcT family (DUF1001) [Gloeocapsa sp. PCC 73106]
 gi|442788254|gb|ELR97952.1| CpeT/CpcT family (DUF1001) [Gloeocapsa sp. PCC 73106]
          Length = 212

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 63/191 (32%), Positives = 92/191 (48%), Gaps = 14/191 (7%)

Query: 48  HARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA--EMVDGSKLLYLEQAFWRTPQK 105
           H   V + L G   +  Q + + +    +RM TC       V GS LLY EQA      K
Sbjct: 29  HRDTVVNYLVGTMDATTQPNPESEPV-QVRMTTCIVQLEPSVRGSVLLYQEQALNNELLK 87

Query: 106 PFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTT 165
           P+RQRF  +     E K  +E  SY   + + +  FC     +R +  +   D+GE + +
Sbjct: 88  PYRQRFLEITA--SEDKEIIESKSYRPNQPDSWSGFCSLHETERVITTD---DLGESVCS 142

Query: 166 IHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQV 225
           + LR        +Y G TPP G P + + GA   T+ + +L +  + TWDRGFD   NQV
Sbjct: 143 VFLRPL----ATIYLGKTPPEGCPANVR-GAVKITNTI-ILHSQGMETWDRGFDAQNNQV 196

Query: 226 WGVKAGPYEFK 236
           WG K   Y+F+
Sbjct: 197 WGAKDQSYQFR 207


>gi|425456903|ref|ZP_18836609.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
 gi|389801879|emb|CCI18997.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
          Length = 202

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 60/199 (30%), Positives = 96/199 (48%), Gaps = 13/199 (6%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
           LT+S       + VA+ L G   + +QA  +P     ++M TC         S  LY EQ
Sbjct: 12  LTQSLPLNPQVQGVANHLIGVMDTTQQAQTNP-RVAKVQMTTCAVNFSPKQDSIYLYQEQ 70

Query: 98  AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
           A      +P+RQR  +++P        VE  +Y +    ++ NFC++   +R L    + 
Sbjct: 71  AIIDRLNQPYRQRILVIQPSADN--STVESKAYKLNNAADFINFCNKDLTERKLN---VS 125

Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRG 217
           D+ E + T+ L+    G    Y G TPP G P + + GA   T+ + +L +  + T DRG
Sbjct: 126 DLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDTSDRG 179

Query: 218 FDDDGNQVWGVKAGPYEFK 236
           +D  G QVWG +   Y+F+
Sbjct: 180 YDSLGQQVWGAQDNFYQFR 198


>gi|425459384|ref|ZP_18838870.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
           9808]
 gi|389822925|emb|CCI29288.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
           9808]
          Length = 202

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 98/203 (48%), Gaps = 21/203 (10%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSK-----LL 93
           LT+S +     + VA+ L G   + +QA  +P     ++M TC     VD S       L
Sbjct: 12  LTQSLSLNPEVQGVANHLIGVMDTTQQAQTNP-RIAKVQMTTCA----VDFSPKQDNIYL 66

Query: 94  YLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPE 153
           Y EQA      +P+RQR  +++P        VE  +Y +     + NFC++   +R L  
Sbjct: 67  YQEQAIIDRLNQPYRQRILVIQPSADN--STVESKAYKLNNAPNFINFCNKNLTERKLN- 123

Query: 154 EVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHT 213
             + D+ E + T+ L+    G    Y G TPP G P + + GA   T+ + +L +  + T
Sbjct: 124 --VSDLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDT 175

Query: 214 WDRGFDDDGNQVWGVKAGPYEFK 236
            DRG+D  G QVWG +   Y+F+
Sbjct: 176 SDRGYDSLGRQVWGAQDNFYQFR 198


>gi|449525842|ref|XP_004169925.1| PREDICTED: chromophore lyase CpcT/CpeT 3-like, partial [Cucumis
          sativus]
          Length = 74

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 52/76 (68%), Positives = 62/76 (81%), Gaps = 2/76 (2%)

Query: 1  MGPDSDSASNPNGGWGGRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEK 60
          MG +S+  +N + GW  RA GL +K  VL+GGA+LLKRLTKS TRWDHAR V+ SLSGEK
Sbjct: 1  MGTNSNDNNN-SVGWN-RAGGLIMKTLVLVGGALLLKRLTKSTTRWDHARFVSQSLSGEK 58

Query: 61 FSKEQASRDPDNFFNI 76
          FSKEQA+RDPDN+FNI
Sbjct: 59 FSKEQAARDPDNYFNI 74


>gi|443651763|ref|ZP_21130696.1| chromophore lyase CpcT/CpeT 3 [Microcystis aeruginosa DIANCHI905]
 gi|159027459|emb|CAO89424.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443334404|gb|ELS48916.1| chromophore lyase CpcT/CpeT 3 [Microcystis aeruginosa DIANCHI905]
          Length = 203

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 97/203 (47%), Gaps = 20/203 (9%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSK-----LL 93
           LT+S       + VA+ L G   + +QA  +P     ++M TC     VD S       L
Sbjct: 12  LTQSLPLNPQVQGVANHLIGVMDTTQQAQTNP-RIAKVQMTTCA----VDFSPKQDNIYL 66

Query: 94  YLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPE 153
           Y EQA      +P+RQR  +++P P      VE  +Y +     + NFC++   +R L  
Sbjct: 67  YQEQAIIDRLNQPYRQRILVIQPSPDN--STVESKAYKLNNAANFINFCNKNLTERKLNV 124

Query: 154 EVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHT 213
             + D  E + T+ L+    G    Y G TPP G P + + GA   T+ + +L +  + T
Sbjct: 125 SNLAD--ESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDT 176

Query: 214 WDRGFDDDGNQVWGVKAGPYEFK 236
            DRG+D  G QVWG +   Y+F+
Sbjct: 177 SDRGYDSLGRQVWGAQDNFYQFR 199


>gi|425446505|ref|ZP_18826508.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
           9443]
 gi|389733245|emb|CCI02962.1| Genome sequencing data, contig C295 [Microcystis aeruginosa PCC
           9443]
          Length = 206

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 60/199 (30%), Positives = 95/199 (47%), Gaps = 13/199 (6%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
           LT+S       + VA+ L G   + +QA  +P     ++M TC         S  LY EQ
Sbjct: 12  LTQSLPLNPQVQGVANHLIGVMDTTQQAQTNP-RVAKVQMTTCAVNFSPKQDSIYLYQEQ 70

Query: 98  AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
           A      +P+RQR  +++P        VE  +Y +     + NFC++   +R L    + 
Sbjct: 71  AIIDRLNQPYRQRILVIQPSADN--STVESKAYKLNNAANFINFCNKDLTERRLN---VS 125

Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRG 217
           D+ E + T+ L+    G    Y G TPP G P + + GA   T+ + +L +  + T DRG
Sbjct: 126 DLAESVCTVFLKPIAGG----YRGETPPQGCPTNAR-GAVKITNTI-ILHSQGMDTSDRG 179

Query: 218 FDDDGNQVWGVKAGPYEFK 236
           +D  G QVWG +   Y+F+
Sbjct: 180 YDSLGRQVWGAQDNFYQFR 198


>gi|119489665|ref|ZP_01622424.1| hypothetical protein L8106_13110 [Lyngbya sp. PCC 8106]
 gi|119454402|gb|EAW35551.1| hypothetical protein L8106_13110 [Lyngbya sp. PCC 8106]
          Length = 228

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 63/228 (27%), Positives = 105/228 (46%), Gaps = 27/228 (11%)

Query: 17  GRAQGLFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNI 76
           G A    +  A L G  ++     +          V   L+G   +  QA   PD   ++
Sbjct: 12  GYAIATLITTASLFGCGLIFNSTPRQVLE------VVSYLTGVMETSAQAQAVPDAP-SV 64

Query: 77  RMLTCPAAEMVDGSKL--------LYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVS 128
           RM TC   ++ D  ++        LY EQA    P+KP+RQRF  + P     +  VE +
Sbjct: 65  RMTTC-VVKVADADEITQRSPAIFLYQEQAMTSNPKKPYRQRFLQISPSADGQQ--VESA 121

Query: 129 SYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGF 188
           ++         NFC +   +R +    I DIG++  ++ L+    G +  Y G T P G 
Sbjct: 122 TFVPANPRALVNFCSQSESERLV---TIEDIGDYRCSVFLQPA--GSQ--YIGQTQPEGC 174

Query: 189 PNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFK 236
           P +++ GA   T+++  L    + T+DRG+D  GNQ+WG +   Y+++
Sbjct: 175 PANYK-GAVTITNQI-TLDAESMETFDRGYDQKGNQIWGAEDQSYQYQ 220


>gi|326513334|dbj|BAK06907.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 77

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 35/54 (64%), Positives = 46/54 (85%)

Query: 24 VKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIR 77
          +KA V+ GG +L++RL +S TRWDHAR VAD+LSGEKFS+EQA +DP N+FN+R
Sbjct: 24 LKALVVFGGVILIRRLRRSTTRWDHARTVADALSGEKFSREQARQDPGNYFNLR 77


>gi|307151475|ref|YP_003886859.1| hypothetical protein Cyan7822_1593 [Cyanothece sp. PCC 7822]
 gi|306981703|gb|ADN13584.1| protein of unknown function DUF1001 [Cyanothece sp. PCC 7822]
          Length = 220

 Score = 80.9 bits (198), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 60/194 (30%), Positives = 95/194 (48%), Gaps = 21/194 (10%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCP-----AAEMVDGS----KLLYLEQAFWRT 102
           V   L G   +  QA+ +P    ++RM TC      A + V  S      LY EQA  + 
Sbjct: 33  VVTHLVGIMDTSAQAAENPKKS-SVRMTTCQVTFRGANDSVQNSFGSPVYLYQEQALTQE 91

Query: 103 PQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEH 162
             KP+RQRF  ++P   +    VE  SY   + E     C++P  +R L +    D+GE 
Sbjct: 92  LNKPYRQRFLEIQPSVGQET--VESKSYKPAQPETLIGLCNKPESERVLQQ---SDLGEF 146

Query: 163 LTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDG 222
           +  + L+    G    + G TPP G P + + GA   T+ + +L +  + TWD+G+D  G
Sbjct: 147 VCRVFLKPSPDG----FIGETPPEGCPANVR-GAVKITNTI-ILHSRGMDTWDKGYDAQG 200

Query: 223 NQVWGVKAGPYEFK 236
           +QVWG +   Y+++
Sbjct: 201 HQVWGAREDAYQYR 214


>gi|440751007|ref|ZP_20930245.1| hypothetical protein C943_2938 [Mariniradius saccharolyticus AK6]
 gi|436480350|gb|ELP36588.1| hypothetical protein C943_2938 [Mariniradius saccharolyticus AK6]
          Length = 211

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 55/186 (29%), Positives = 93/186 (50%), Gaps = 19/186 (10%)

Query: 54  DSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYM 113
           D ++G  FS E  S++  +F +I +   P  +       LY+EQA   T  KP+RQR Y 
Sbjct: 34  DMMAGT-FSSEAQSKEDADFLHISLEMQPIWQKNKDGYWLYVEQAMATTKDKPYRQRIYH 92

Query: 114 VKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPL----PEEVIGDIGEHLTTIHLR 169
           +       + D       + E++E + F  +  D   L    P++++   G     I LR
Sbjct: 93  L------YQMDDTTLVSQVYELKEPEKFAGKANDPTALGSLTPDQLVSKEG---CGIFLR 143

Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVK 229
           +   G   ++EGST     P++ + GA ++TS++ + K+  + +WD+G+D +G QVWG  
Sbjct: 144 KTKSG---IFEGSTQDKACPSNLR-GARFTTSKVTLTKDG-MESWDQGWDANGQQVWGAT 198

Query: 230 AGPYEF 235
            G Y F
Sbjct: 199 KGGYRF 204


>gi|37520007|ref|NP_923384.1| hypothetical protein glr0438 [Gloeobacter violaceus PCC 7421]
 gi|81711246|sp|Q7NNH3.1|CPXT1_GLOVI RecName: Full=Chromophore lyase CpcT/CpeT 1
 gi|35210999|dbj|BAC88379.1| glr0438 [Gloeobacter violaceus PCC 7421]
          Length = 229

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 87/191 (45%), Gaps = 21/191 (10%)

Query: 52  VADSLSGEKFSKEQASRD---PDNFFNIRMLTCPAAEMVDGSKLLYL--EQAFWRTPQKP 106
           V   L+G   + +QA  D   PD    +RM TC      D SK +YL  EQ        P
Sbjct: 50  VVTRLTGIMTTTQQAQSDAARPD----VRMTTCSVQLEGDKSKAIYLYQEQTMSNNLGAP 105

Query: 107 FRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPL-PEEVIGDIGEHLTT 165
           +RQR   +          VE + +   E +     C +P  +R + P  + GD      T
Sbjct: 106 YRQRLLRIAASADGRA--VESAGFKFVEAKPLAGLCAKPAAERLIAPMALDGD-----PT 158

Query: 166 IHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQV 225
             +R    G +  Y G+TP GG  ++ + GA   T+E+ + K   + T DRGFD  GNQV
Sbjct: 159 CTVRLVQAGDK--YMGTTPEGGCQSNVR-GAARITNEITLYKEG-MDTRDRGFDAQGNQV 214

Query: 226 WGVKAGPYEFK 236
           WG K  PY F+
Sbjct: 215 WGAKEEPYRFR 225


>gi|434392494|ref|YP_007127441.1| protein of unknown function DUF1001 [Gloeocapsa sp. PCC 7428]
 gi|428264335|gb|AFZ30281.1| protein of unknown function DUF1001 [Gloeocapsa sp. PCC 7428]
          Length = 240

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 55/191 (28%), Positives = 88/191 (46%), Gaps = 18/191 (9%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGS------KLLYLEQAFWRTPQK 105
           V   L G   +  QA  +P N  ++R+ TC       G+        +Y EQA  +   +
Sbjct: 40  VVSHLDGAMDTSAQARANP-NAPDVRITTCKVTVKNAGAIARPHAVFMYQEQALSQRLSQ 98

Query: 106 PFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTT 165
           P+RQRF  + P        +E + +     + +   C++P  QR +    + DIG    +
Sbjct: 99  PYRQRFLRIAPSVDH--NSIESAVFRPPTPQAWIGLCNKPEAQRIIN---VKDIGTSNCS 153

Query: 166 IHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQV 225
           + L R  +     Y G T   G P++++ GA   T+ +  L    + TWDRGFD  GNQV
Sbjct: 154 VFLGRQQQN----YIGETSASGCPSNYK-GAVRITNRI-TLHQAGMDTWDRGFDATGNQV 207

Query: 226 WGVKAGPYEFK 236
           WG +   Y+F+
Sbjct: 208 WGAEGEAYQFR 218


>gi|443242425|ref|YP_007375650.1| DUF1001 domain containing protein [Nonlabens dokdonensis DSW-6]
 gi|442799824|gb|AGC75629.1| DUF1001 domain containing protein [Nonlabens dokdonensis DSW-6]
          Length = 219

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 58/215 (26%), Positives = 103/215 (47%), Gaps = 19/215 (8%)

Query: 22  LFVKAAVLIGGAVLLKRLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTC 81
           L +   +LIG +      +K  T+      V   + G   S  Q+  D  ++++I +   
Sbjct: 7   LLISFFILIGCSS-----SKHNTKDKELDKVYKMMQGSYDSSLQSKVDS-SYYDISLEMH 60

Query: 82  PAAEMVDGSKLLYLEQAFWRTPQKPFRQRFY-MVKPCPKELKCDVEVSSYAIREVEEYKN 140
           P  +   G + LY+EQA      +P+R R Y +++    E+  +V    Y I   ++Y  
Sbjct: 61  PVWKN-SGERWLYVEQALTSQKDQPYRVRMYRLLRNDNNEIISEV----YTIPNEKQYYG 115

Query: 141 FCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYST 200
              +P+  + L  + +         + +++ + G    Y G T      ++ + GA+Y+T
Sbjct: 116 KFKKPKAFKYLTPDYLEK--REGCEVIIQKEENGS---YTGKTGKSTCLSTMR-GASYAT 169

Query: 201 SELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEF 235
           S + V+ +N+I +WDRGFD DGNQVWG + G Y F
Sbjct: 170 S-IVVINDNQIISWDRGFDKDGNQVWGAEKGGYIF 203


>gi|443323211|ref|ZP_21052220.1| CpeT/CpcT family (DUF1001) [Gloeocapsa sp. PCC 73106]
 gi|442787121|gb|ELR96845.1| CpeT/CpcT family (DUF1001) [Gloeocapsa sp. PCC 73106]
          Length = 191

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 83/186 (44%), Gaps = 12/186 (6%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  L+GE  ++EQA  +P  + N+R+   P  ++   S  L+ EQA      +P+R R 
Sbjct: 10  LASYLAGEFENREQAIAEPAWYVNLRLWLRPVPDLWSDSLTLFAEQASTLKLDQPYRPRL 69

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
             ++  P     ++ V  Y  +E++ +     +P   + L  E I    E L    L+  
Sbjct: 70  LRLRQAP-----ELVVEHYMFKELQAFIGSGRQPEKLKALTLEQI----EFLPGCTLKVQ 120

Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDD-DGNQVWGVKA 230
              ++  ++    PG        G+TY  S    +   E+ T+DRG D   G  +WG   
Sbjct: 121 QHNQQ--FQALPQPGETCCFQYQGSTYQVSLGFEVNEAELSTYDRGIDSATGKAIWGAIL 178

Query: 231 GPYEFK 236
           GPY FK
Sbjct: 179 GPYRFK 184


>gi|163756594|ref|ZP_02163706.1| hypothetical protein KAOT1_01325 [Kordia algicida OT-1]
 gi|161323488|gb|EDP94825.1| hypothetical protein KAOT1_01325 [Kordia algicida OT-1]
          Length = 209

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 64/220 (29%), Positives = 98/220 (44%), Gaps = 21/220 (9%)

Query: 24  VKAAVLIGGAVLLKRLT------KSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIR 77
           +K   L   A++L  L       KS    D  + +   ++G   S EQAS D D+++NI 
Sbjct: 1   MKKTFLYSFAIILCFLISSCNSYKSTNTQDELKALQTLMTGSFDSSEQASAD-DSYYNIS 59

Query: 78  MLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEE 137
           +   P     DG   LY+EQA      KP+RQR Y +     E   + + SS  +  +E 
Sbjct: 60  LHMYPIWTSKDGY-WLYVEQALNSNQDKPYRQRVYQL-----EKLANGKFSS-KVYTLEN 112

Query: 138 YKNFCDRPRDQRPLPEEVIGDIGEHL-TTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGA 196
            K    + +      +  +  + E     + L +    K   Y GST      ++   GA
Sbjct: 113 PKEAIGKWKTPAYFDQFDVSMLKEREGCAVILEK----KGAYYSGSTNEKDCKST-MRGA 167

Query: 197 TYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEFK 236
           +Y+TSE+ + K N I +WDRGF+     VWG +   Y FK
Sbjct: 168 SYATSEVTI-KPNVIESWDRGFNAKDEHVWGAEKAGYVFK 206


>gi|428773606|ref|YP_007165394.1| hypothetical protein Cyast_1789 [Cyanobacterium stanieri PCC 7202]
 gi|428687885|gb|AFZ47745.1| protein of unknown function DUF1001 [Cyanobacterium stanieri PCC
           7202]
          Length = 194

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 52/197 (26%), Positives = 88/197 (44%), Gaps = 27/197 (13%)

Query: 49  ARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFR 108
           AR++A   S E     QA  +P  F +IR+   P    + G    +LEQA+     +P+R
Sbjct: 11  ARLMAADFSNEP----QAIENPPFFAHIRVCMRPLPHSLLGEVSFFLEQAYDFLLSQPYR 66

Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLP----EEVIG-DIGEHL 163
            R + +K     L    E+  Y ++E +E+      P   + L     E++ G D+  H 
Sbjct: 67  LRVFTIKAVDDHL----ELEHYKLKEEKEFYGASRNPEKLKGLTLDHLEKMNGCDMIAHW 122

Query: 164 TTIHLR-RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDG 222
           T  H +     GK C+ E         NS++            +  N++ ++DRG D + 
Sbjct: 123 TGTHFKGHVKPGKACIVERKGKKSYLDNSFE------------IDENKLISFDRGRDLET 170

Query: 223 NQ-VWGVKAGPYEFKPA 238
           ++ +WG  AGP+ F+P 
Sbjct: 171 DELLWGSVAGPFHFQPT 187


>gi|148243132|ref|YP_001228289.1| hypothetical protein SynRCC307_2033 [Synechococcus sp. RCC307]
 gi|147851442|emb|CAK28936.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
          Length = 201

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 60/209 (28%), Positives = 95/209 (45%), Gaps = 25/209 (11%)

Query: 44  TRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTP 103
           TRW         L G   +++QA  +P  + +I +   P  ++   S L+  EQA+  TP
Sbjct: 9   TRW---------LCGSFSNQQQAFENPPLYGHIHVRYRPLGQLAPRSMLI--EQAYAITP 57

Query: 104 QKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHL 163
           ++P+R R      CP++    ++V +Y + E + +        D   L  E+  D   HL
Sbjct: 58  KEPYRIRVVRPLLCPEQ---GLKVMNYTLAEPQRFIGAI----DDADLRAEIREDDLTHL 110

Query: 164 TTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDG 222
              +    D+G    + GS  PG      + G  +Y  SE  VL+N  + T DRG D D 
Sbjct: 111 EGCNYLVRDQGDH--FSGSVEPGCRCKVRRKGRDSYLVSEF-VLRNGTMETIDRGHDPDT 167

Query: 223 N-QVWGVKAGPYEFKPAPSSSYSDMFSPL 250
           + Q+WG   GP+ F+      +SD   PL
Sbjct: 168 HEQLWGSLPGPFLFE--RKEDWSDELLPL 194


>gi|17228143|ref|NP_484691.1| hypothetical protein alr0647 [Nostoc sp. PCC 7120]
 gi|81772968|sp|Q8YZ40.1|CPCTL_NOSS1 RecName: Full=Phycocyanobilin lyase CpcT homolog
 gi|17129993|dbj|BAB72605.1| alr0647 [Nostoc sp. PCC 7120]
          Length = 198

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 58/203 (28%), Positives = 88/203 (43%), Gaps = 39/203 (19%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           + + L+GE  ++EQA  +P  F ++R+   P     D S  L+ EQA      +P+RQR 
Sbjct: 10  LGNYLAGEFDNREQALGEPIWFVHLRLWQRPVDLFSDDSITLFAEQANIVNLDRPYRQRI 69

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEH---LTTIHL 168
             + P P     D E   Y    V+ Y      P++    P  +IG  G H   L T+  
Sbjct: 70  LRLMPAP-----DSETGLY----VQYYM-----PKN----PSALIG-AGRHPDLLKTLTP 110

Query: 169 RRCDRGKRCLYEGSTPPGGFPNSWQNGA--------TYSTSELAV-------LKNNEIHT 213
           ++ +    C+   S      PNS+Q  A        T+S  E  V       +   E+HT
Sbjct: 111 QQLELLPGCVLSVSQQTVA-PNSYQFTASPLPNTCCTFSYLENTVQVSLGFAVTETELHT 169

Query: 214 WDRGFDDD-GNQVWGVKAGPYEF 235
           +D+G D + G   WG   GPY +
Sbjct: 170 YDKGIDQETGKATWGAIVGPYRY 192


>gi|86606684|ref|YP_475447.1| CpeT protein [Synechococcus sp. JA-3-3Ab]
 gi|123505735|sp|Q2JT26.1|CPXT2_SYNJA RecName: Full=Chromophore lyase CpcT/CpeT 2
 gi|86555226|gb|ABD00184.1| CpeT protein [Synechococcus sp. JA-3-3Ab]
          Length = 199

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/189 (27%), Positives = 88/189 (46%), Gaps = 16/189 (8%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  L+G+  ++EQA  +P  F +IR+   P    V G    Y+EQA+    ++P+RQ  
Sbjct: 12  MARWLAGDFSNQEQAWENPPFFASIRVAYRPLPTAVLGGIGFYVEQAYSGHLEEPYRQAV 71

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
             +     ++   + + +Y   + + ++  C R R ++ L +    D+       +L  C
Sbjct: 72  VEL----TQVGDGIVIRNYRPLQPQRWRG-CARGRAEQ-LSQLSAADLA------YLPGC 119

Query: 172 D---RGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGN-QVWG 227
           D   + +  L+ G T PG      +NG T        L  NE  + DRG D   + QVWG
Sbjct: 120 DVQVKRQGSLFVGVTEPGCRCCVVRNGQTTYLQTTLHLSENEFCSHDRGMDPVTHRQVWG 179

Query: 228 VKAGPYEFK 236
             AGP+ F+
Sbjct: 180 AVAGPFRFR 188


>gi|75910775|ref|YP_325071.1| hypothetical protein Ava_4579 [Anabaena variabilis ATCC 29413]
 gi|75704500|gb|ABA24176.1| Protein of unknown function DUF1001 [Anabaena variabilis ATCC
           29413]
          Length = 198

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 85/203 (41%), Gaps = 39/203 (19%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           + + L+GE  ++EQA  +P  F ++R+   P     D S  L+ EQA      +P+RQR 
Sbjct: 10  LGNYLAGEFDNREQALGEPIWFVHLRLWQRPVGLFSDDSITLFAEQANIVNLDRPYRQRI 69

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEH---LTTIHL 168
             + P P     D E   Y    V+ Y      P++    P  ++G  G H   L T+  
Sbjct: 70  LRLMPAP-----DSETGLY----VQYYM-----PKN----PSALVG-AGRHPELLKTLTP 110

Query: 169 RRCDRGKRCLYEGSTPPGGFPNSWQ---------------NGATYSTSELAVLKNNEIHT 213
           ++ +    C+   S      PNS+Q                G T   S    +   E+HT
Sbjct: 111 QQLELLPGCVLSVSQKTVA-PNSYQFTASPLPDTCCTFSYLGNTVQVSLGFAVTETELHT 169

Query: 214 WDRGFDDD-GNQVWGVKAGPYEF 235
           +D+G D + G   WG   GPY +
Sbjct: 170 YDKGIDQETGKATWGAIVGPYRY 192


>gi|411116555|ref|ZP_11389042.1| protein of unknown function DUF1001, CpeT/CpcT family
           [Oscillatoriales cyanobacterium JSC-12]
 gi|410712658|gb|EKQ70159.1| protein of unknown function DUF1001, CpeT/CpcT family
           [Oscillatoriales cyanobacterium JSC-12]
          Length = 200

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/186 (22%), Positives = 83/186 (44%), Gaps = 4/186 (2%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A+ L+G   +++QA+ +P  + ++R+   P     + S  ++ EQA      KP+RQR 
Sbjct: 10  LAEYLTGVFENRKQAADEPIWYVHLRLWQHPVQLFAEDSVTIFAEQASVVNLDKPYRQRL 69

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
             +   P   +  ++V  Y  ++   ++     P     L  + +  +   +  I+ +  
Sbjct: 70  LRLMRSPANPEA-LQVQYYGFKDPSAFRGAGQAPEKLEFLAIDQVDFLPGCILEINQQPT 128

Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVKA 230
             G R +   + PPG       +G     +    +  +E  T+D+G D + GN +WG   
Sbjct: 129 STGTRFI--ATAPPGACCRFQYDGKIGQVALGFEVSADEYLTYDKGIDPETGNALWGAIM 186

Query: 231 GPYEFK 236
           GPY ++
Sbjct: 187 GPYRYQ 192


>gi|428201304|ref|YP_007079893.1| hypothetical protein Ple7327_0912 [Pleurocapsa sp. PCC 7327]
 gi|427978736|gb|AFY76336.1| CpeT/CpcT family (DUF1001) [Pleurocapsa sp. PCC 7327]
          Length = 204

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 45/187 (24%), Positives = 82/187 (43%), Gaps = 15/187 (8%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
           L+GE  +++QA  +P  + ++R+   P +   + S  L+ EQA     ++P+R R   ++
Sbjct: 14  LAGEFDNRQQALAEPIWYVHLRLWLRPTSLFREDSLTLFAEQASIVNLEQPYRPRLLRLR 73

Query: 116 -----PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR 170
                P P      + V  Y  ++++  +    +P   R L  E +  +      + + R
Sbjct: 74  QSQTHPLP------LRVEHYMFKDLDAVRGASSKPELLRQLTPEQVEFLPSCTLNVEIER 127

Query: 171 CDRGKRCLYEGSTPPGGFPNSWQ-NGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGV 228
            DR +      ++P    P  +   G TY  S    +  +E+ T D+G D   G  +WG 
Sbjct: 128 LDRDRFKF--KASPASEKPCRFTYGGQTYQVSLGFEVTPDELKTHDKGIDPTTGQAIWGA 185

Query: 229 KAGPYEF 235
             GPY F
Sbjct: 186 LMGPYRF 192


>gi|434400021|ref|YP_007134025.1| protein of unknown function DUF1001 [Stanieria cyanosphaera PCC
           7437]
 gi|428271118|gb|AFZ37059.1| protein of unknown function DUF1001 [Stanieria cyanosphaera PCC
           7437]
          Length = 197

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 50/192 (26%), Positives = 85/192 (44%), Gaps = 21/192 (10%)

Query: 49  ARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFR 108
           AR++A   S +    EQA  +P  F +IR+   P  + +     L+LEQA+     +P+R
Sbjct: 11  ARLMAADFSNQ----EQAFENPPFFAHIRVCMRPLPDSILDGTSLFLEQAYDYMLNRPYR 66

Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHL 168
            R + +          +E+ +Y ++E E++     R RD+    + +  D+ E L     
Sbjct: 67  LRVFKLSVVENR----IELENYKVKEEEKFYG-ASRDRDRL---KNLTPDLIEKLPG--- 115

Query: 169 RRCDRG---KRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ- 224
             CD     K   + G   PG      + G T       ++    + ++DRG D   ++ 
Sbjct: 116 --CDMNVIWKNNSFHGEIKPGKACIVERQGKTTYLDNSFIIDQETLISYDRGRDPGTDEL 173

Query: 225 VWGVKAGPYEFK 236
           VWG  AGP+ FK
Sbjct: 174 VWGSLAGPFHFK 185


>gi|300867127|ref|ZP_07111793.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300334882|emb|CBN56959.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 196

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 58/206 (28%), Positives = 89/206 (43%), Gaps = 31/206 (15%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQA 98
           +T S      AR +A   S +     QA  +P  F +IR+   P    + G   LYLEQA
Sbjct: 1   MTNSTDIATLARWMASDFSNQP----QAFENPPFFAHIRVCMRPLPVELLGGISLYLEQA 56

Query: 99  FWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGD 158
           +     +P+R R   + P    + C +E+ +YAI   E++      P   + L  E I  
Sbjct: 57  YDYQINQPYRVRVLKLVPA---VNC-IEIENYAIENEEQFYGSSRDPVRLQALKAETIKK 112

Query: 159 ------IGEHLTTIHLRRCDRGKRC--LYEGSTPPGGFPNSWQNGATYSTSELAVLKNNE 210
                 I +      L R + GK C  + +G T             TY  SE  + ++  
Sbjct: 113 LPGCTFITQWTGNSFLGRVEPGKGCTVVRKGQT-------------TYLDSEFEIDEHKF 159

Query: 211 IHTWDRGFDDDGNQ-VWGVKAGPYEF 235
           I + DRG D + ++ +WG  AGP+EF
Sbjct: 160 I-SHDRGLDPETDEHIWGAVAGPFEF 184


>gi|88807813|ref|ZP_01123324.1| hypothetical protein WH7805_06621 [Synechococcus sp. WH 7805]
 gi|88787852|gb|EAR19008.1| hypothetical protein WH7805_06621 [Synechococcus sp. WH 7805]
          Length = 234

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 49/183 (26%), Positives = 83/183 (45%), Gaps = 12/183 (6%)

Query: 55  SLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMV 114
           SLS    ++EQA  +P  + NI +   P  ++  GS  L LEQA+     +P+R R    
Sbjct: 32  SLSASFSNQEQAFENPPLYGNILVRLRPLPQLDPGS--LLLEQAYAIASNEPYRVRVLQP 89

Query: 115 KPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRG 174
             CP      + + +YAIR+ + +    + P+ +  L    +  +     +  +R  D G
Sbjct: 90  TVCPTR---GLLILNYAIRDDQRFWGSVEDPQRRSQLTAADLTLL--EGCSYQVRETDEG 144

Query: 175 KRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPY 233
               + G   PG      + G T        L ++ + T DRG+D   ++ +WG  AGP+
Sbjct: 145 ----FRGEVEPGCRCLVTRRGKTSYLVSSFELTDDGMSTIDRGYDPATHEHLWGSIAGPF 200

Query: 234 EFK 236
           +FK
Sbjct: 201 QFK 203


>gi|300866329|ref|ZP_07111032.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300335672|emb|CBN56192.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 206

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 49/188 (26%), Positives = 80/188 (42%), Gaps = 7/188 (3%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLT--CPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           +A  ++GE  ++ QA  DP  + ++R+     P +   + S  LY+EQA    P KP+R 
Sbjct: 10  LARYMAGEFDNRTQAIADPVWYVHLRLWQRPFPISLFSEPSITLYVEQANILKPDKPYRP 69

Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPL-PEEVIGDIGEHLTTIHL 168
           R   ++      +  +EV  Y ++++   +     P   + L PE+V    G  LT    
Sbjct: 70  RIVQLRHSQSAPEL-IEVQYYMLKDIAALQGAGQNPNLLKQLTPEQVELLPGCTLTVT-- 126

Query: 169 RRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWG 227
           R+     R  +  S PP         G TY           E  ++D+G D   G  +WG
Sbjct: 127 RQDLTSNRYRFRASLPPETQCCFTYGGQTYQIDLGFEASAEEFLSYDKGIDPKTGKSIWG 186

Query: 228 VKAGPYEF 235
              GP+ F
Sbjct: 187 ALMGPFRF 194


>gi|443317243|ref|ZP_21046659.1| CpeT/CpcT family (DUF1001) [Leptolyngbya sp. PCC 6406]
 gi|442783128|gb|ELR93052.1| CpeT/CpcT family (DUF1001) [Leptolyngbya sp. PCC 6406]
          Length = 197

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 54/207 (26%), Positives = 88/207 (42%), Gaps = 29/207 (14%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  ++ +  ++ QA  +P  F +IR+   P    + G   LYLEQA+     +P+R R 
Sbjct: 10  LARWMAADFSNQPQAFENPPLFAHIRVCMRPLPYTLMGGVSLYLEQAYDFMLTQPYRARV 69

Query: 112 YMVKPCPKELKCD--VEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
                  K +  D  +++ +Y I   ++  +F    RD+  L           LT  HL 
Sbjct: 70  L------KLILVDGKIQIENYLI---QDEASFHGAARDRDRLSS---------LTAAHLE 111

Query: 170 RCDRGKRCL------YEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDG 222
           R       +      + G   PG      +NG  TY  SE  +       +WDRG D D 
Sbjct: 112 RMPGCNMTVDWTGHSFRGQVEPGKACMVVRNGKTTYLDSEFEI-DGQHFTSWDRGRDPDT 170

Query: 223 NQ-VWGVKAGPYEFKPAPSSSYSDMFS 248
           ++ +WG  AG ++F P  S ++    S
Sbjct: 171 DEHLWGSLAGAFQFSPRASYAHEVTLS 197


>gi|86605374|ref|YP_474137.1| hypothetical protein CYA_0661 [Synechococcus sp. JA-3-3Ab]
 gi|123507394|sp|Q2JWI9.1|CPXT1_SYNJA RecName: Full=Chromophore lyase CpcT/CpeT 1
 gi|86553916|gb|ABC98874.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
          Length = 218

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 53/204 (25%), Positives = 81/204 (39%), Gaps = 23/204 (11%)

Query: 53  ADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFY 112
           AD L G   ++ QA  +P  F  + +       +       + EQ    TP + +R R  
Sbjct: 36  ADWLEGYYSNRAQAMAEPVWFVPVSLWYVRLPHLFSEGIGFFTEQFNEHTPGRFYRSRVL 95

Query: 113 MVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCD 172
            V   P      + + +Y +R+   +       +D + L +  + D+ +HL    +    
Sbjct: 96  QVLADP------LRIENYKLRDQAAWAGAS---QDLQRLAQLSLSDL-QHLPGCRILVEK 145

Query: 173 RGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVKAG 231
           R     Y G   PGG        +TY   E   L   E  TWDRGFD   G Q WG +AG
Sbjct: 146 RAD--CYHGQMLPGGGCRLNPGDSTYIHIEFD-LTAQEFITWDRGFDATTGQQTWGSRAG 202

Query: 232 PYEFKPAPSSSYSDMFSPLNFPPQ 255
           PY ++            P+ FPP+
Sbjct: 203 PYRYQKR---------IPVVFPPR 217


>gi|88807824|ref|ZP_01123335.1| CpeT-like protein [Synechococcus sp. WH 7805]
 gi|88787863|gb|EAR19019.1| CpeT-like protein [Synechococcus sp. WH 7805]
          Length = 203

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 51/196 (26%), Positives = 83/196 (42%), Gaps = 19/196 (9%)

Query: 51  IVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQR 110
           + A +L G   ++EQA  DP  F +I +   P    V      Y EQ++   P +P+RQ 
Sbjct: 6   LFARTLCGHYSNREQAQNDPSKFAHINIFFMPLPWDVLKGPGFYSEQSYDHDPWRPYRQG 65

Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPL-PEEVIGDIGEHLTTIHLR 169
            + ++     +     V ++ + + E        P     + P+ +    G     +H +
Sbjct: 66  VHRLREGDNGVHV---VENFGMAQPERVAGAPQHPALLAAIQPDTLQARCG---CAMHFQ 119

Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTW---DRGFDDDGNQ- 224
            C       YEG   PG      ++G  TY  SE+ V    + H+W   DRGFD D +  
Sbjct: 120 SCGANA---YEGRVEPGKGCLVPRDGRLTYLVSEVQV----DAHSWSSRDRGFDPDSDDL 172

Query: 225 VWGVKAGPYEFKPAPS 240
           VWG + G  +F+   S
Sbjct: 173 VWGSEHGMLQFQRVAS 188


>gi|443328463|ref|ZP_21057060.1| CpeT/CpcT family (DUF1001) [Xenococcus sp. PCC 7305]
 gi|442791917|gb|ELS01407.1| CpeT/CpcT family (DUF1001) [Xenococcus sp. PCC 7305]
          Length = 194

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 49/196 (25%), Positives = 90/196 (45%), Gaps = 23/196 (11%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           + + + ++ +  ++ QA  +P  F +IR+   P  + + GS  L+LEQA+      P+R 
Sbjct: 8   KTLVNWMAADFSNQAQAYANPPFFASIRVCMRPLPDELLGSPSLFLEQAYDFMLTNPYRV 67

Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFC-DRPRDQRPLPEEVIGDIGEHLTT--- 165
           R          L   +E+ +Y ++E E++     DR + Q+  PE +    G  +T    
Sbjct: 68  RVLKFSI----LDDHIELENYKVQEQEQFFGASRDREQLQKLTPELLEKMPGCDMTVNWM 123

Query: 166 -IHLR-RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGN 223
             H + +   GK C+ E         NS++            +   ++ ++DRG D + +
Sbjct: 124 GNHFQGKIKPGKACIVERKGKITYLDNSFE------------IDQEKLISYDRGRDPETD 171

Query: 224 Q-VWGVKAGPYEFKPA 238
           + VWG  AGP+EF P 
Sbjct: 172 ELVWGSVAGPFEFTPV 187


>gi|86608101|ref|YP_476863.1| hypothetical protein CYB_0612 [Synechococcus sp. JA-2-3B'a(2-13)]
 gi|86556643|gb|ABD01600.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
          Length = 195

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 82/193 (42%), Gaps = 18/193 (9%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A+ L G   ++ QA  +P  F  + +       +       + EQ    TP + +R R 
Sbjct: 13  LANWLEGYYSNRTQAMAEPVWFVPVSLWYVRLPHLFSEGIGFFTEQFNQHTPGQFYRSRV 72

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
             V   P      + + +Y +R+   +      P  QR L +  + D+ EHL    +   
Sbjct: 73  LQVLGDP------LRIENYKLRDQAAWAGASQDP--QR-LAQLSLADL-EHLPGCRIL-V 121

Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDD-DGNQVWGVKA 230
           ++   C Y G   PGG        +TY   E   L  +E  TWDRGFD   G Q WG +A
Sbjct: 122 EKRPDC-YHGEMLPGGECRLNPGDSTYIHIEFD-LTPSEFITWDRGFDAATGQQTWGSRA 179

Query: 231 GPYEFK----PAP 239
           GPY ++    PAP
Sbjct: 180 GPYRYQKGIPPAP 192


>gi|86610167|ref|YP_478929.1| CpeT protein [Synechococcus sp. JA-2-3B'a(2-13)]
 gi|86558709|gb|ABD03666.1| CpeT protein [Synechococcus sp. JA-2-3B'a(2-13)]
          Length = 199

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 54/190 (28%), Positives = 87/190 (45%), Gaps = 18/190 (9%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  L+G+  ++EQA  +P  F  IR+   P    V G    Y+EQA+    + P+RQ  
Sbjct: 12  MARWLAGDFSNQEQAWENPPFFAQIRVAYRPLPAHVLGGIGFYVEQAYSGHLEDPYRQAV 71

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
             +     +    + + +Y   E + ++  C R R  + L +    D+       +L  C
Sbjct: 72  VEL----TQASAGIVIRNYRPLEPQRWRG-CARERADQ-LSQLQAADLA------YLEGC 119

Query: 172 D---RGKRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDGN-QVW 226
           D   + +  L+ G T PG      +NG ATY  + L  L  +E  + DRG D   + QVW
Sbjct: 120 DVHVKRQGSLFVGVTEPGCRCCVVRNGQATYLQTTLH-LSESEFCSHDRGMDPITHRQVW 178

Query: 227 GVKAGPYEFK 236
           G  AG + F+
Sbjct: 179 GALAGAFRFQ 188


>gi|428302139|ref|YP_007140445.1| hypothetical protein Cal6303_5596 [Calothrix sp. PCC 6303]
 gi|428238683|gb|AFZ04473.1| protein of unknown function DUF1001 [Calothrix sp. PCC 6303]
          Length = 200

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 42/195 (21%), Positives = 85/195 (43%), Gaps = 6/195 (3%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           + + L+GE  + +Q+  +P  F ++R+   P     + S   + EQA      +P+RQR 
Sbjct: 10  LGNYLAGEFDNSKQSLAEPAWFVHLRLWHIPVPLFPEDSITFFAEQANIVNLDQPYRQRI 69

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
             ++     ++  ++V  YA+++          P   + L  + + D+   +  + ++  
Sbjct: 70  IRIRQV-GNVESSLQVQYYALKDYISLVGAGKEPGRLKNLTPDDLEDLPGCVLEVEIKET 128

Query: 172 DRGKRCLYEGSTPPGGFPNSWQ-NGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVK 229
               + +   +TP      S++ NG     S    +   E+ T+D+G D + G   WG  
Sbjct: 129 SHHLQFI---ATPVANSICSFRYNGNIVRVSLGFEVTQEELKTYDKGIDPETGRATWGAI 185

Query: 230 AGPYEFKPAPSSSYS 244
            GPY ++    SS S
Sbjct: 186 LGPYHYQKRKQSSLS 200


>gi|113478364|ref|YP_724425.1| hypothetical protein Tery_5043 [Trichodesmium erythraeum IMS101]
 gi|123056021|sp|Q10UX2.1|CPXT2_TRIEI RecName: Full=Chromophore lyase CpcT/CpeT 2
 gi|110169412|gb|ABG53952.1| protein of unknown function DUF1001 [Trichodesmium erythraeum
           IMS101]
          Length = 195

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 13/176 (7%)

Query: 62  SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
           +++QA  +P  F +IR+   P    +     LYLEQA+  T  +P+R R  ++K  P E 
Sbjct: 20  NQQQAFDNPPLFAHIRVCMRPLPYQLLNGLSLYLEQAYDITLNQPYRVR--VLKLVPSE- 76

Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
              +E+ +Y I +  E+      P+  + L  E I    + L          GK   ++G
Sbjct: 77  -NHIEIENYIIDQEAEFYGASRDPQRLKNLKTEYI----KKLPGCTFITKWTGKS--FKG 129

Query: 182 STPPG-GFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVKAGPYEF 235
              PG G     +  +TY  S   + ++  I + D G+D +   QVWG  AGP+EF
Sbjct: 130 EVEPGKGCTVVREGKSTYLKSYFEINEHKFI-SHDTGYDPETDQQVWGAIAGPFEF 184


>gi|428776095|ref|YP_007167882.1| hypothetical protein PCC7418_1479 [Halothece sp. PCC 7418]
 gi|428690374|gb|AFZ43668.1| protein of unknown function DUF1001 [Halothece sp. PCC 7418]
          Length = 197

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 88/204 (43%), Gaps = 27/204 (13%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  L+GE  ++ QA   P  F N+++   P    +DG+  L+ EQA     ++ +RQR 
Sbjct: 5   LAQWLAGEFSNESQALSQPAWFVNLKLWHRPLPVSIDGNYALFAEQAPALKLEQAYRQRI 64

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
           ++++P   E K  + V  YA +E ++++     P               + L ++ L+  
Sbjct: 65  FVIQPA-TETK-PMTVQYYAFKEHQKWRGAGMNP---------------QILDSLRLQDI 107

Query: 172 DRGKRCLYEGSTPPGG-----FPNS----WQNGATYSTSELAVLKNNEIHTWDRGFDDDG 222
           ++   C  E  T         FPNS    +  G          + + E  + D+G D + 
Sbjct: 108 EKLPGCALEVITTENKFSAQPFPNSVCQFYIQGNLCQIELGFAVTSQEFFSSDKGIDPET 167

Query: 223 NQ-VWGVKAGPYEFKPAPSSSYSD 245
            + +WG    PY+F+     S S+
Sbjct: 168 QKPIWGALIAPYQFQKMQGFSVSE 191


>gi|86607187|ref|YP_475950.1| hypothetical protein CYA_2568 [Synechococcus sp. JA-3-3Ab]
 gi|123505022|sp|Q2JRQ9.1|CPXT3_SYNJA RecName: Full=Chromophore lyase CpcT/CpeT 3
 gi|86555729|gb|ABD00687.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
          Length = 198

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 79/189 (41%), Gaps = 19/189 (10%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  ++G   +  QA  +P  F NI +  CP    V      Y+EQ +   P +P+RQR 
Sbjct: 10  LATWMAGHFNNLHQAIAEPVWFANIHVYQCPLPWSVFQGIGFYVEQLYDIYPDQPYRQRV 69

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDI----GEHLTTIH 167
             +   P  ++    + +YA+   E YK      RD   L      ++    G  +    
Sbjct: 70  IHLFETPDGIR----IQNYALGSPEAYKC---AGRDLGKLASLAAAELELLPGCAVQVEW 122

Query: 168 LRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VW 226
              C RG+      S P  G     +   TY  SE  +   +  H+ D+G D + +Q VW
Sbjct: 123 TGSCYRGR------SVPGKGCIVERKGRTTYLYSEFEI-GADYFHSLDQGRDPETDQVVW 175

Query: 227 GVKAGPYEF 235
           G  +GP+ F
Sbjct: 176 GSLSGPFRF 184


>gi|254422202|ref|ZP_05035920.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
 gi|196189691|gb|EDX84655.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
          Length = 196

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 47/184 (25%), Positives = 82/184 (44%), Gaps = 17/184 (9%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
           ++ E  ++EQA  +P  F +I++   P  +       LYLEQA+     KP+R R     
Sbjct: 14  MAAEFSNQEQAFENPPFFAHIKVCMRPLPKSFQPGVSLYLEQAYSFQLDKPYRVRVLHF- 72

Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
               + + DV + +Y +++ E+   F    RD   L    + D+   +       CD   
Sbjct: 73  ---IQREDDVLLENYKVKDEEK---FYGAARDLEKLATLSVDDLEPMMG------CDIFV 120

Query: 176 RCL---YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAG 231
                 + G   PG   +  + GAT       ++  + + ++DRG D + ++ VWG  AG
Sbjct: 121 EWTGDSFRGKVEPGKKCSVIRKGATTYLDNEFIVTESHMTSYDRGRDPETDELVWGSVAG 180

Query: 232 PYEF 235
           P+EF
Sbjct: 181 PFEF 184


>gi|282898790|ref|ZP_06306777.1| protein of unknown function DUF1001 [Cylindrospermopsis raciborskii
           CS-505]
 gi|281196317|gb|EFA71227.1| protein of unknown function DUF1001 [Cylindrospermopsis raciborskii
           CS-505]
          Length = 197

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 56/201 (27%), Positives = 89/201 (44%), Gaps = 21/201 (10%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQA 98
           +T S+     AR +A   S    ++ QA  +P  + +IR+   P A  +     L++EQA
Sbjct: 1   MTHSQDVLTLARWMAADFS----NQAQAFENPPLYAHIRVCMRPLAYSLLSGVSLFVEQA 56

Query: 99  FWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIRE--VEEYKNFCDRPRDQRPLPEEVI 156
           +    + P+R R          LK  VE     I    V++ K+F    RD   L + + 
Sbjct: 57  YDYDLKDPYRVRV---------LKLLVEQGQIIIENYTVKDEKDFYGSSRDLGKL-QNLT 106

Query: 157 GDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWD 215
            D  E L   ++R    G    + G+  PG     ++ G  TY  S+  +  + +  +WD
Sbjct: 107 ADRLEKLCGCNMRVEWTGS--CFTGTVEPGKNCLVFRKGQKTYLDSKFEI-DDQKFLSWD 163

Query: 216 RGFD-DDGNQVWGVKAGPYEF 235
           RG D + G  VWG  AGP+ F
Sbjct: 164 RGRDLETGQHVWGSLAGPFHF 184


>gi|56750773|ref|YP_171474.1| hypothetical protein syc0764_d [Synechococcus elongatus PCC 6301]
 gi|81299583|ref|YP_399791.1| hypothetical protein Synpcc7942_0772 [Synechococcus elongatus PCC
           7942]
 gi|56685732|dbj|BAD78954.1| hypothetical protein [Synechococcus elongatus PCC 6301]
 gi|81168464|gb|ABB56804.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 197

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/186 (26%), Positives = 75/186 (40%), Gaps = 15/186 (8%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
           LSGE  ++ QA  +P +F  +R+   P          L+LEQA      +P+RQR   ++
Sbjct: 11  LSGEFSNQPQAIAEPVHFVQLRLWQRPLLLFGHDRPCLFLEQANALQLDRPYRQRLLCLE 70

Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPL-PEEVIGDIGEHLTTIHLRRCDRG 174
               +    +    Y  +           P   R L PE+V     + L +  L   ++ 
Sbjct: 71  ---ADFSGQLSAQFYQFKNAAAVLGAGQDPDRLRSLTPEDV-----QLLESCRL-AVEQT 121

Query: 175 KRCLYEGSTPPGGFPNSWQNGATYSTS-ELAVLKNNE---IHTWDRGFD-DDGNQVWGVK 229
            R  Y    P G     W +G     S    V   ++    H++DRG D + G  +WG  
Sbjct: 122 SRGGYNAQLPAGCRCRFWTDGIERQVSLGFEVWSESDRVNFHSFDRGIDPETGRSLWGAL 181

Query: 230 AGPYEF 235
            GPYEF
Sbjct: 182 MGPYEF 187


>gi|428770503|ref|YP_007162293.1| hypothetical protein Cyan10605_2163 [Cyanobacterium aponinum PCC
           10605]
 gi|428684782|gb|AFZ54249.1| protein of unknown function DUF1001 [Cyanobacterium aponinum PCC
           10605]
          Length = 194

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 53/212 (25%), Positives = 91/212 (42%), Gaps = 27/212 (12%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQA 98
           ++KS      AR++A   S    +++QA  +P  F +IR+   P  E + G   L+LEQA
Sbjct: 1   MSKSTDVKGLARLMAADFS----NQQQAWDNPPFFAHIRVCMRPLPESLLGGTSLFLEQA 56

Query: 99  FWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGD 158
           +     +P+R R + ++     L    E+  Y ++E  E   F    R++  L E     
Sbjct: 57  YDFLLNQPYRLRVFKIQAVDDHL----ELEHYKLKEEAE---FYGASRNREKLKE----- 104

Query: 159 IGEHLTTIHLRR---CDRGKR---CLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIH 212
               LT  HL +   CD         ++G   PG      + G          +   ++ 
Sbjct: 105 ----LTIDHLEKMNGCDMIAEWNGSHFKGYIKPGKACIVVRKGKESYLDNSFEIDERKLI 160

Query: 213 TWDRGFDDDGNQ-VWGVKAGPYEFKPAPSSSY 243
           ++DRG D   ++ +WG  AGP+ F    S ++
Sbjct: 161 SFDRGRDLVTDELLWGSVAGPFHFTRIESFAH 192


>gi|385808825|ref|YP_005845221.1| hypothetical protein IALB_0241 [Ignavibacterium album JCM 16511]
 gi|383800873|gb|AFH47953.1| Hypothetical protein IALB_0241 [Ignavibacterium album JCM 16511]
          Length = 87

 Score = 48.5 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 21/41 (51%), Positives = 29/41 (70%), Gaps = 1/41 (2%)

Query: 195 GATYSTSELAVLKNNEIHTWDRGFDDDGNQVWGVKAGPYEF 235
           GATY+T+E+ + K+  I +WDRGF+D   QVWG + G Y F
Sbjct: 43  GATYATTEVVITKDKMI-SWDRGFNDKDEQVWGAEKGGYIF 82


>gi|428778548|ref|YP_007170334.1| hypothetical protein Dacsa_0162 [Dactylococcopsis salina PCC 8305]
 gi|428692827|gb|AFZ48977.1| CpeT/CpcT family (DUF1001) [Dactylococcopsis salina PCC 8305]
          Length = 193

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 44/195 (22%), Positives = 84/195 (43%), Gaps = 12/195 (6%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  L+GE  +KEQA  +P  F ++++   P   ++DG+  L+ EQA      +P+RQR 
Sbjct: 9   LAQWLAGEFENKEQAMSEPAWFVSLKLWHRPLPFLIDGNYALFAEQAPALKLDRPYRQRV 68

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
           +++          + +  YA ++ + ++      R+ + L    + D+ E L    L+  
Sbjct: 69  FVITATDP-----IAIQYYAFKQPQLWQG---SGRNPQQLDSLRLDDL-EKLPDCVLQVT 119

Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKA 230
           D  K   +     P      +  G          +   E  + D+G D +  + +WG   
Sbjct: 120 DTEKE--FSAKPLPNAVCQFYAQGRLCEVKLGFTVTAEEFFSDDKGIDPETKKPIWGALM 177

Query: 231 GPYEFKPAPSSSYSD 245
            PY F+   S + S+
Sbjct: 178 SPYRFQKIQSFTVSE 192


>gi|16331059|ref|NP_441787.1| hypothetical protein slr1649 [Synechocystis sp. PCC 6803]
 gi|383322801|ref|YP_005383654.1| hypothetical protein SYNGTI_1892 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383325970|ref|YP_005386823.1| hypothetical protein SYNPCCP_1891 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383491854|ref|YP_005409530.1| hypothetical protein SYNPCCN_1891 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384437122|ref|YP_005651846.1| hypothetical protein SYNGTS_1893 [Synechocystis sp. PCC 6803]
 gi|451815216|ref|YP_007451668.1| hypothetical protein MYO_119100 [Synechocystis sp. PCC 6803]
 gi|81671316|sp|P74371.1|CPXT_SYNY3 RecName: Full=Chromophore lyase CpcT/CpeT
 gi|1653553|dbj|BAA18466.1| slr1649 [Synechocystis sp. PCC 6803]
 gi|339274154|dbj|BAK50641.1| hypothetical protein SYNGTS_1893 [Synechocystis sp. PCC 6803]
 gi|359272120|dbj|BAL29639.1| hypothetical protein SYNGTI_1892 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359275290|dbj|BAL32808.1| hypothetical protein SYNPCCN_1891 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359278460|dbj|BAL35977.1| hypothetical protein SYNPCCP_1891 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407961573|dbj|BAM54813.1| hypothetical protein BEST7613_5882 [Bacillus subtilis BEST7613]
 gi|451781185|gb|AGF52154.1| hypothetical protein MYO_119100 [Synechocystis sp. PCC 6803]
          Length = 196

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 48/179 (26%), Positives = 82/179 (45%), Gaps = 19/179 (10%)

Query: 62  SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
           ++ QA  +P  + +IR+   P  +   G +LL+LEQA+    Q+P+R R   +K      
Sbjct: 20  NQAQAFENPPFYAHIRVAIRPLDQAKFGDRLLFLEQAYDFMLQRPYRLRVLKLKVVEDH- 78

Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKR---CL 178
              +E+ ++ +++ E+   F    RD   L +    D    L  +H   CD         
Sbjct: 79  ---IEIENFKVKDEEK---FYGAARDLGKLAQLTPAD----LEPMH--GCDMIVEWTGTS 126

Query: 179 YEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYEF 235
           ++G   PG      ++G  TY  +   V +   I + DRG+D + N+ VWG  AG + F
Sbjct: 127 FKGEVQPGRQCRVMRDGKETYLENSFEVSETGLI-SLDRGYDPETNERVWGSVAGAFHF 184


>gi|428216190|ref|YP_007089334.1| hypothetical protein Oscil6304_5951 [Oscillatoria acuminata PCC
           6304]
 gi|428004571|gb|AFY85414.1| CpeT/CpcT family (DUF1001) [Oscillatoria acuminata PCC 6304]
          Length = 198

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 57/206 (27%), Positives = 89/206 (43%), Gaps = 19/206 (9%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPA-AEMVDGSKLLYLEQ 97
           +T S      AR +A   S    + +QA  +P  F +IR+   P   E +DG  L YLEQ
Sbjct: 1   MTHSTDVLTLARWMASDFS----NFDQAIENPPFFAHIRVCMRPLPYEFLDGVSL-YLEQ 55

Query: 98  AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIR-EVEEYKNFCDRPRDQRPLPEEVI 156
           A+     +P+R R  ++K  PK  +  +E+ +Y ++ E   Y +  DR   Q   P+++ 
Sbjct: 56  AYDYMLDQPYRVR--VLKLVPKGDR--IEIENYKVQDEASFYGSARDRHLLQSLTPDKLE 111

Query: 157 GDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDR 216
              G    T             ++G   PG      + G T        +  N   + DR
Sbjct: 112 KIPGSSFIT-------EWTGSSFKGEVEPGKNCIVVRKGRTTYLDSTFEIDENLFISHDR 164

Query: 217 GFDDDGNQ-VWGVKAGPYEFKPAPSS 241
           G D D ++ VWG  AGP+ F    +S
Sbjct: 165 GLDPDTDEMVWGSVAGPFVFNKRVAS 190


>gi|428222477|ref|YP_007106647.1| hypothetical protein Syn7502_02541 [Synechococcus sp. PCC 7502]
 gi|427995817|gb|AFY74512.1| CpeT/CpcT family (DUF1001) [Synechococcus sp. PCC 7502]
          Length = 192

 Score = 47.8 bits (112), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 83/190 (43%), Gaps = 21/190 (11%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  ++GE  ++EQA  +P  + N+R+   P  + +      + EQA      +P+RQR 
Sbjct: 8   LAQYMAGEFSNREQAIAEPVWYVNLRLWQIPLPQSLLDGIAFFAEQANIINLDQPYRQRI 67

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
             +    +E   +++   YA+ E   ++    RP   + L    I D+      I L  C
Sbjct: 68  INL----RETDGNIQADYYALIEPSAWRGAATRPEQLQQLS---INDL------IPLPDC 114

Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGATYS--TSELAV---LKNNEIHTWDRGFD-DDGNQV 225
               +  YE  T     P + +   TY   T ++++      +E  ++D+G D + G  +
Sbjct: 115 TL--KVKYEQETFKAELPINAKCCFTYQGETRQVSLGFAASKSEFLSYDKGIDMNTGAGI 172

Query: 226 WGVKAGPYEF 235
           WG   G Y F
Sbjct: 173 WGAITGAYRF 182


>gi|428316405|ref|YP_007114287.1| protein of unknown function DUF1001 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428240085|gb|AFZ05871.1| protein of unknown function DUF1001 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 196

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 55/183 (30%), Positives = 88/183 (48%), Gaps = 15/183 (8%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPA-AEMVDGSKLLYLEQAFWRTPQKPFRQRFYMV 114
           ++ +  ++ QA  +P  F +IR+   P   E++DG  L YLEQA+      P+R R  ++
Sbjct: 14  MAADFSNQAQAFENPPFFAHIRVCMRPLPIEVLDGISL-YLEQAYDIELNVPYRVR--VL 70

Query: 115 KPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRG 174
           K  P   + D+E  +YAI   E  + F    RD + L E     +        + +   G
Sbjct: 71  KLVPAGDRIDIE--NYAI---ENEEQFYGASRDPQRLQEIKTAQLTLLPGCTFITQW-TG 124

Query: 175 KRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDGN-QVWGVKAGP 232
           K   ++G   PG      +NG  TY  SE  +   ++  + DRG D + +  VWG +AGP
Sbjct: 125 KS--FKGFVEPGKGCAVVRNGKKTYLDSEFEI-DQDKFTSHDRGRDPETDAHVWGAQAGP 181

Query: 233 YEF 235
           +EF
Sbjct: 182 FEF 184


>gi|186685249|ref|YP_001868445.1| hypothetical protein Npun_R5170 [Nostoc punctiforme PCC 73102]
 gi|186467701|gb|ACC83502.1| protein of unknown function DUF1001 [Nostoc punctiforme PCC 73102]
          Length = 201

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 45/186 (24%), Positives = 77/186 (41%), Gaps = 5/186 (2%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           + + L+GE  ++EQA  +P  + ++RM   P     + S  L+ EQA      KP+RQR 
Sbjct: 10  LGEYLAGEFDNREQAIEEPVWYVHLRMWQRPVNLFTEDSITLFAEQASVINLDKPYRQRI 69

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDI-GEHLTTIHLRR 170
             ++         +EV  Y  ++ +  +   + P     L  + +  + G  LT  H   
Sbjct: 70  MRLRQGSNS-DTSLEVQYYIPQDPDALRGAGNNPALLNTLSAKQLDLLPGCVLTVTH--E 126

Query: 171 CDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVK 229
              G R  +  +  P    +    G +   S        E H++D+G D + G   WG  
Sbjct: 127 ALAGDRYKFTATPQPETRCSFTYLGNSIYVSLGFETTAAEFHSYDKGIDPETGKATWGAI 186

Query: 230 AGPYEF 235
            GPY +
Sbjct: 187 MGPYRY 192


>gi|158340571|ref|YP_001521565.1| CpcT protein, putative [Acaryochloris marina MBIC11017]
 gi|158310812|gb|ABW32426.1| CpcT protein, putative [Acaryochloris marina MBIC11017]
          Length = 197

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 51/188 (27%), Positives = 82/188 (43%), Gaps = 17/188 (9%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  L+ E  ++ QA  +P  + +I++   P    +     LYLEQA+    +KP+R R 
Sbjct: 10  LAKWLAAEFSNQAQAIENPPLYAHIQVSIRPLPRKLLTGYCLYLEQAYAFMLKKPYRTRV 69

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
             +     E K  + + +Y   E+ + + F    RD  PL    I    EHL  I    C
Sbjct: 70  IHI----LEHKDSILIENY---ELHKKELFLGGTRD--PLILNKIST--EHL--IKKEGC 116

Query: 172 DRGKRCL---YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
           D         + G+  PG      +NG T        +  N+  + DRG D + ++ +WG
Sbjct: 117 DMNVEWTGHSFVGTVVPGKSCKVVRNGKTTYLDNSFEVDQNKFISIDRGRDPETDELIWG 176

Query: 228 VKAGPYEF 235
             AGP+ F
Sbjct: 177 SIAGPFHF 184


>gi|434397505|ref|YP_007131509.1| protein of unknown function DUF1001 [Stanieria cyanosphaera PCC
           7437]
 gi|428268602|gb|AFZ34543.1| protein of unknown function DUF1001 [Stanieria cyanosphaera PCC
           7437]
          Length = 208

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 53/221 (23%), Positives = 92/221 (41%), Gaps = 32/221 (14%)

Query: 35  LLKRLTKSKTRWDHARI-VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLL 93
           +L  +  S+T+  +  I +A  ++G+  + +QA  D   + +IR+   P           
Sbjct: 1   MLPTIDNSQTQETNLLIDLAKLMAGDFSNHKQALADSKTYAHIRIFFRPLPYEFFSGIGF 60

Query: 94  YLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPE 153
           Y EQ +      P+RQ  + +  C  E    + + +Y++++   Y               
Sbjct: 61  YSEQTYDYDLWSPYRQGVHRLIDCGDE----IYIENYSLQDAYLYAG------------- 103

Query: 154 EVIGDIGEHLTTIHLRRCDRGKRC---------LYEGSTPPGGFPNSWQNGA-TYSTSEL 203
              G   + L TI+    +R   C         L+ GS  PG      +NG  TY  SE+
Sbjct: 104 --AGHNRDILLTINPNCLERRYHCSMIFKKDGDLFRGSVEPGNRCLINRNGTETYLVSEV 161

Query: 204 AVLKNNEIHTWDRGFD-DDGNQVWGVKAGPYEFKPAPSSSY 243
            + +   I +WDRG D     Q+WG  AGP +F+   S ++
Sbjct: 162 ELTEATWI-SWDRGMDLVTHQQIWGSAAGPLKFEKRASFAH 201


>gi|260436513|ref|ZP_05790483.1| hypothetical protein SH8109_0091 [Synechococcus sp. WH 8109]
 gi|260414387|gb|EEX07683.1| hypothetical protein SH8109_0091 [Synechococcus sp. WH 8109]
          Length = 197

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 45/188 (23%), Positives = 84/188 (44%), Gaps = 12/188 (6%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           R   + L GE  +++QA  +P  + +I +   P  ++  GS LL  EQ +   P+ P+R 
Sbjct: 7   RRFLNLLCGEFSNQQQAFDNPPLYAHIFLRYRPLIQLRPGSILL--EQTYAVDPKHPYRL 64

Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
           R    +   ++    +++ ++  R+ E +      P+ +  + E  +  + +     H +
Sbjct: 65  RMIRAE---EQASGAIKLWNHIFRDPERFAGATFDPQLRLAIQERDLISLNQ----CHYQ 117

Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGV 228
             ++     Y G+  PG      +NG          L+ + + T DRG D D N+  WG 
Sbjct: 118 VLEQPDG--YHGAIEPGCKCIVQRNGEDTVLVSSFHLEGDSLSTLDRGHDPDTNERCWGS 175

Query: 229 KAGPYEFK 236
            AGP+ FK
Sbjct: 176 VAGPFRFK 183


>gi|434393935|ref|YP_007128882.1| protein of unknown function DUF1001 [Gloeocapsa sp. PCC 7428]
 gi|428265776|gb|AFZ31722.1| protein of unknown function DUF1001 [Gloeocapsa sp. PCC 7428]
          Length = 195

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 43/188 (22%), Positives = 78/188 (41%), Gaps = 5/188 (2%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           + +A  L+GE  ++EQA  +P  + ++R+   P A   + S  L+ EQA      +P+RQ
Sbjct: 8   KALAQYLTGEFTNQEQAIAEPAWYVHLRLWHQPVALFTEDSFTLFAEQANIVKLDQPYRQ 67

Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
           R   ++  P ++   ++V  Y  +     +     P   + L  E    +   L  + + 
Sbjct: 68  RLIRLQQRPDDVA--LQVQYYMPKNPTALRGAGANPDILKTLTPEDFELLPGCLLDVTVN 125

Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDD-GNQVWGV 228
           + +R     ++ + P          G T   S        E  ++D+G D   G   WG 
Sbjct: 126 QLERDYH--FKATLPSDRRCCFAYAGNTVQVSIGFEASPEEFLSYDKGIDSTTGKATWGA 183

Query: 229 KAGPYEFK 236
             GPY F+
Sbjct: 184 ILGPYRFR 191


>gi|425469168|ref|ZP_18848127.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9701]
 gi|389883085|emb|CCI36480.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9701]
          Length = 194

 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 44/188 (23%), Positives = 85/188 (45%), Gaps = 13/188 (6%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           + +A  ++ +  ++EQA  +P  F +IR+   P  + + G   L+LEQA+      P+R 
Sbjct: 8   KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLGGTSLFLEQAYDFMLNTPYRL 67

Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
           R + +          +E+ ++ ++E   +      P+  + L  E++    E +    + 
Sbjct: 68  RVFKLSVVDDH----IELENFKVKEEANFFGASREPQRLKNLSLELL----EPMAGCDMN 119

Query: 170 RCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
               G    ++G   PG     + ++  TY  +   + +   I   DRG D + +Q VWG
Sbjct: 120 VTWTGNS--FKGVVKPGKQCLVFRKDRMTYLDNSFEISERGLISV-DRGLDTETDQLVWG 176

Query: 228 VKAGPYEF 235
             AGP+EF
Sbjct: 177 SIAGPFEF 184


>gi|307154999|ref|YP_003890383.1| hypothetical protein Cyan7822_5227 [Cyanothece sp. PCC 7822]
 gi|306985227|gb|ADN17108.1| protein of unknown function DUF1001 [Cyanothece sp. PCC 7822]
          Length = 207

 Score = 45.8 bits (107), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 37/183 (20%), Positives = 77/183 (42%), Gaps = 3/183 (1%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
           L+GE  ++ QA      F ++R+   P     + S  L+ EQA      +P+R R + ++
Sbjct: 16  LAGEFENRPQAMESAAWFVHLRLWLRPVPLFTEDSITLFAEQASIVNLDQPYRPRLWRLR 75

Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
                  C ++V  Y  +++   +    +P   + L  E +  +     T+ +       
Sbjct: 76  QVSTSPVC-LQVEHYMFKDISPVQGLGRKPELLQQLTLEQVECLTAPGCTLKVESNQLAG 134

Query: 176 RCLYEGSTPPGGFPNSWQ-NGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVKAGPY 233
              +  +      P ++   G ++  S    + + E+ T+D+G D + G  +WG   GPY
Sbjct: 135 NHYHFKALAVSESPCAFSYQGQSFQVSLGFEVNSQELKTYDKGIDINTGRAIWGALMGPY 194

Query: 234 EFK 236
            ++
Sbjct: 195 CYQ 197


>gi|257059192|ref|YP_003137080.1| hypothetical protein Cyan8802_1322 [Cyanothece sp. PCC 8802]
 gi|256589358|gb|ACV00245.1| hypothetical protein Cyan8802_1322 [Cyanothece sp. PCC 8802]
          Length = 235

 Score = 45.4 bits (106), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 61/150 (40%), Gaps = 16/150 (10%)

Query: 93  LYLEQAFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLP 152
           ++LEQ F  TPQ P R R+Y   P    L   V +S  +  +       C+ P  +R L 
Sbjct: 83  IFLEQTFLTTPQPP-RLRYYTFSP----LTTGVLLSVRSFIDQTHLSGLCNLPESERTLE 137

Query: 153 EEVIGDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIH 212
                +I      I L R     R  Y G+  P G P +  +G++ +      +  N+I 
Sbjct: 138 ---FSNINPLSCDIELSRLVEPVR--YFGTNSPDGCPAA--SGSSVTVISTISIHPNQID 190

Query: 213 TWDRGFDDDGNQVWGVKAGPYEFKPAPSSS 242
           ++D+GF         +   P  F   P SS
Sbjct: 191 SFDQGFIGT----IPIFGTPITFNRVPESS 216


>gi|282895778|ref|ZP_06303865.1| Protein of unknown function DUF1001 [Raphidiopsis brookii D9]
 gi|281199278|gb|EFA74144.1| Protein of unknown function DUF1001 [Raphidiopsis brookii D9]
          Length = 197

 Score = 45.1 bits (105), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 55/201 (27%), Positives = 89/201 (44%), Gaps = 21/201 (10%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQA 98
           +T S+     AR +A   S    ++ QA  +P  + +IR+   P A  +     L++EQA
Sbjct: 1   MTHSQDLLTLARWMAADFS----NQAQAFENPPLYAHIRVCMRPLAYSLLSGVSLFVEQA 56

Query: 99  FWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIRE--VEEYKNFCDRPRDQRPLPEEVI 156
           +    + P+R R          LK  VE     I    V++ K+F    RD   L + + 
Sbjct: 57  YDYDLKDPYRVRV---------LKLLVEEGQIIIENYTVKDEKDFYGSSRDLGKL-QNLT 106

Query: 157 GDIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWD 215
            D  E L   ++R    G    + G+   G     ++ G  TY  S+  +  + +  +WD
Sbjct: 107 ADRLEKLCGCNMRVEWTGN--CFTGTVESGKNCLVFRKGQKTYLDSKFQI-DDQKFLSWD 163

Query: 216 RGFDDDGNQ-VWGVKAGPYEF 235
           RG D + +Q VWG  AGP+ F
Sbjct: 164 RGRDLETDQHVWGSLAGPFHF 184


>gi|334119341|ref|ZP_08493427.1| protein of unknown function DUF1001 [Microcoleus vaginatus FGP-2]
 gi|333458129|gb|EGK86748.1| protein of unknown function DUF1001 [Microcoleus vaginatus FGP-2]
          Length = 196

 Score = 45.1 bits (105), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 61/203 (30%), Positives = 90/203 (44%), Gaps = 25/203 (12%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPA-AEMVDGSKLLYLEQ 97
           +T S      AR +A   S +     QA  +P  F +IR+   P   E++DG  L YLEQ
Sbjct: 1   MTHSTDIVTLARWMAADFSNQA----QAFENPPFFAHIRVCMRPLPVEVLDGLSL-YLEQ 55

Query: 98  AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
           A+      P+R R   + P        +E+ +YAI   E  + F    RD + L E    
Sbjct: 56  AYDIELNVPYRVRVLKLVPVDDR----IEIENYAI---ENEEQFYGASRDPQRLQE---- 104

Query: 158 DIGEHLTTIHLRRCDRGKRCL---YEGSTPPG-GFPNSWQNGATYSTSELAVLKNNEIHT 213
            I +   T+ L  C    +     ++G   PG G     +   TY  SE  + ++  I +
Sbjct: 105 -IKKAQLTL-LPGCTFITQWTGNSFKGFVEPGKGCMVVRKGKKTYLDSEFEIDQDKFI-S 161

Query: 214 WDRGFDDDGN-QVWGVKAGPYEF 235
            DRG D + +  VWG  AGP+EF
Sbjct: 162 HDRGRDPETDAHVWGALAGPFEF 184


>gi|428218917|ref|YP_007103382.1| hypothetical protein Pse7367_2699 [Pseudanabaena sp. PCC 7367]
 gi|427990699|gb|AFY70954.1| protein of unknown function DUF1001 [Pseudanabaena sp. PCC 7367]
          Length = 202

 Score = 45.1 bits (105), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 49/201 (24%), Positives = 89/201 (44%), Gaps = 25/201 (12%)

Query: 53  ADSLSGEKFSKEQASRDPDNFFNIRMLTCPA-AEMVDGSKLLYLEQAFWRTPQKPFR-QR 110
           A  ++G+  + +QA  +P  F +IR+   P   ++ D    LY EQA+     +P+R   
Sbjct: 14  AQWMTGDYSNWDQAIANPPLFAHIRVCIRPLPNQLSDHGIWLYSEQAYDFQIDRPYRIAV 73

Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR 170
            ++V    +     +E+ ++ + + E +     +P     +            + + L  
Sbjct: 74  LHLVLAADR-----IEIENHKLNDEESFYGAARQPERLAAITAA---------SVVKLPG 119

Query: 171 C----DRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ- 224
           C    DR     + G   PG   N   +N  TY   E  + + N + T DRG+D + ++ 
Sbjct: 120 CNILVDRTPTGSFVGKIEPGKKCNVHRKNQDTYLVHEFELNQTN-LTTLDRGYDPETDER 178

Query: 225 VWGVKAGPYEFKPAPSSSYSD 245
           VWG  AGP+EF     +S++D
Sbjct: 179 VWGTIAGPFEF--VKQASFAD 197


>gi|427719033|ref|YP_007067027.1| hypothetical protein Cal7507_3805 [Calothrix sp. PCC 7507]
 gi|427351469|gb|AFY34193.1| protein of unknown function DUF1001 [Calothrix sp. PCC 7507]
          Length = 198

 Score = 45.1 bits (105), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 42/187 (22%), Positives = 75/187 (40%), Gaps = 5/187 (2%)

Query: 51  IVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQR 110
           ++ + LSGE  ++ QA  DP  + ++R+   P    ++ S  L+ EQA       P+RQR
Sbjct: 9   VLGEYLSGEFDNQAQAIADPVWYVHLRLWQRPVNLFLEDSITLFAEQANVINLDNPYRQR 68

Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDI-GEHLTTIHLR 169
              + P  ++    ++V  Y  ++    +     P     L  E +  + G  LT     
Sbjct: 69  IIRLLP-RRDDDAQLQVQYYMPKDPSALRGAGKNPALLNTLTSEQLDLLPGCFLTVTKQT 127

Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDD-GNQVWGV 228
                 +  +  + PP         G +   S        E +++D+G D + G   WG 
Sbjct: 128 LATNDYK--FAATPPPNTCCQFTYLGNSIQVSLGFEATAKEFYSYDKGIDSETGKATWGA 185

Query: 229 KAGPYEF 235
             GPY +
Sbjct: 186 TIGPYRY 192


>gi|148238834|ref|YP_001224221.1| phycoerythrin linker gene region [Synechococcus sp. WH 7803]
 gi|147847373|emb|CAK22924.1| Conserved hypothetical protein, CpeT homolog, phycoerythrin linker
           gene region [Synechococcus sp. WH 7803]
          Length = 204

 Score = 44.7 bits (104), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 47/186 (25%), Positives = 79/186 (42%), Gaps = 11/186 (5%)

Query: 53  ADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFY 112
           A +L+G   +++QA  +P +F +I +   P    V  +   Y EQ++   P  P+RQ  +
Sbjct: 12  AKTLAGHYSNRKQAQNNPKDFAHINIFFRPLPWEVLNAPGFYSEQSYDHAPWSPYRQGVH 71

Query: 113 MVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCD 172
            + P   +      V ++   +         RP     L  + +         +H R  D
Sbjct: 72  RLLPAEGD---HFIVENFGFGDPIRLAGAGQRPELLNSLNTDSLKPRCG--CGMHFREID 126

Query: 173 RGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGN-QVWGVKA 230
            G    Y G   PG      ++G  TY  SE+ +   + I + DRGFD + + Q WG + 
Sbjct: 127 AGH---YRGGVEPGKNCLVPRDGHLTYLVSEVEMNATDWI-SRDRGFDPETDEQRWGSEH 182

Query: 231 GPYEFK 236
           GP  F+
Sbjct: 183 GPLRFE 188


>gi|425433932|ref|ZP_18814405.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9432]
 gi|389679000|emb|CCH92160.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9432]
          Length = 194

 Score = 44.3 bits (103), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 44/188 (23%), Positives = 85/188 (45%), Gaps = 13/188 (6%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           + +A  ++ +  ++EQA  +P  F +IR+   P  + +     L+LEQA+      P+R 
Sbjct: 8   KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLKGTSLFLEQAYDFMLNTPYRL 67

Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
           R + +          +E+ ++ ++E   +      P+  + L  E++    E +    + 
Sbjct: 68  RVFKLSVVDDH----IELENFKVKEEANFFGASREPQRLKNLSLELL----EPMPGCDMN 119

Query: 170 RCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
               GK   ++G   PG     + ++  TY  +   + +   I   DRG D + +Q VWG
Sbjct: 120 VTWTGKS--FKGVVKPGKQCLVFRKDKMTYLDNSFEISEQGLISV-DRGLDPETDQLVWG 176

Query: 228 VKAGPYEF 235
             AGP+EF
Sbjct: 177 SIAGPFEF 184


>gi|186681243|ref|YP_001864439.1| hypothetical protein Npun_F0744 [Nostoc punctiforme PCC 73102]
 gi|186463695|gb|ACC79496.1| protein of unknown function DUF1001 [Nostoc punctiforme PCC 73102]
          Length = 197

 Score = 44.3 bits (103), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 44/186 (23%), Positives = 84/186 (45%), Gaps = 13/186 (6%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  +S +  ++EQA  +P  + +IR+   P    +     L+LEQA+     +P+R R 
Sbjct: 10  LARWMSADFSNQEQAFENPPFYAHIRVCIRPLPLELFSGVSLFLEQAYDFMLNQPYRMRV 69

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
             + P        + +  Y ++E +++      P   + L  + +    E ++  ++   
Sbjct: 70  MKLIPAENH----IAIEHYTVKEEQKFYGASREPERLKDLSVDQL----EKMSGCNMIVE 121

Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVK 229
             GK   ++G   PG      ++G  TY  +E  +    E  + DRG D D ++ +WG  
Sbjct: 122 WTGKS--FKGRVEPGKGCIVVRDGKNTYLDNEFEI-DAKEFFSLDRGRDLDTDERLWGSI 178

Query: 230 AGPYEF 235
           AGP+ F
Sbjct: 179 AGPFHF 184


>gi|37521107|ref|NP_924484.1| hypothetical protein glr1538 [Gloeobacter violaceus PCC 7421]
 gi|81710316|sp|Q7NKE0.1|CPXT4_GLOVI RecName: Full=Chromophore lyase CpcT/CpeT 4
 gi|35212103|dbj|BAC89479.1| glr1538 [Gloeobacter violaceus PCC 7421]
          Length = 183

 Score = 44.3 bits (103), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 50/191 (26%), Positives = 76/191 (39%), Gaps = 22/191 (11%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           + V   L+G   ++ QA  +P  F  + +     A +       + EQ    TP +P+R 
Sbjct: 5   QTVRTWLAGTYSNRAQAMAEPVWFIPVTLWYVEVAGLFGEGAGFFTEQVSEHTPNQPYRS 64

Query: 110 RFYMVKPCPKELKC----DVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTT 165
           R   +   P  L+     D +V + A ++ E       R  D   LP            T
Sbjct: 65  RVLQLLDNPLRLENYRLKDQKVWAGAAKDPERLGRL--RADDCEQLPG----------CT 112

Query: 166 IHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQ 224
           ++L R  RG+   + G   PGG    +   A+Y   E   L      T DRGF+   G Q
Sbjct: 113 LYLER--RGE--TFTGKMQPGGGCRLFPGDASYVEIEFE-LGERLFFTLDRGFEATTGEQ 167

Query: 225 VWGVKAGPYEF 235
            WG +AG Y +
Sbjct: 168 TWGSRAGAYRY 178


>gi|422303151|ref|ZP_16390505.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
 gi|389791909|emb|CCI12308.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
          Length = 194

 Score = 43.9 bits (102), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 91/189 (48%), Gaps = 15/189 (7%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFR 108
           + +A  ++ +  ++EQA  +P  F +IR+   P   E+++G+ L +LEQA+      P+R
Sbjct: 8   KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLNGTSL-FLEQAYDFMLNTPYR 66

Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHL 168
            R + +          +E+ ++   +V+E  NF    R+ + L + +  D+ E +    +
Sbjct: 67  LRVFKLSVVDDH----IELENF---KVKEEANFFGASREPQRL-KNLTLDLLEPMLGCDM 118

Query: 169 RRCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VW 226
                G    ++G   PG     + ++  TY  +   + +   I   DRG D + +Q VW
Sbjct: 119 NVTWTGNS--FKGVVKPGKQCLVFRKDKMTYLDNSFEISERGLISV-DRGLDPETDQLVW 175

Query: 227 GVKAGPYEF 235
           G  AGP+EF
Sbjct: 176 GSIAGPFEF 184


>gi|56750747|ref|YP_171448.1| hypothetical protein syc0738_d [Synechococcus elongatus PCC 6301]
 gi|56685706|dbj|BAD78928.1| hypothetical protein [Synechococcus elongatus PCC 6301]
          Length = 197

 Score = 43.9 bits (102), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 52/187 (27%), Positives = 87/187 (46%), Gaps = 15/187 (8%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFRQR 110
           +A  L+ +  ++ QA  +P  F +IR+   P A E++DG  L Y+EQA+     +P+R R
Sbjct: 10  LARWLAADFSNQAQAFENPPFFAHIRVCMRPLALEVLDGLSL-YVEQAYDLDVNQPYRAR 68

Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR 170
              + P        +E+ ++ + + E+      R RD+     ++  D  E L    L  
Sbjct: 69  VLKLVPAGDH----IEIENFRVIDGEDLFG-ASRDRDRLA---KLTRDRLEKLPGCTLIT 120

Query: 171 CDRGKRCLYEGSTPPG-GFPNSWQNGATYSTSELAVLKNNEIHTWDRG-FDDDGNQVWGV 228
              G    ++G   PG G     +  ATY  SE  +  + ++ + DRG   +   QVWG 
Sbjct: 121 HWTGNG--FKGVVEPGKGCIVERKGQATYLESEFEI-DDEKMISLDRGRHPETDEQVWGS 177

Query: 229 KAGPYEF 235
            AGP+ F
Sbjct: 178 VAGPFHF 184


>gi|443657119|ref|ZP_21131838.1| phycocyanobilin lyase CpcT [Microcystis aeruginosa DIANCHI905]
 gi|159029854|emb|CAO90908.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443333248|gb|ELS47816.1| phycocyanobilin lyase CpcT [Microcystis aeruginosa DIANCHI905]
          Length = 194

 Score = 43.9 bits (102), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 43/193 (22%), Positives = 83/193 (43%), Gaps = 23/193 (11%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           + +A  ++ +  ++EQA  +P  F +IR+   P  + +     L+LEQA+      P+R 
Sbjct: 8   KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLKGTSLFLEQAYDFMLNTPYRL 67

Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG-----DIGEHLT 164
           R + +          +E+ ++ ++E   +      P+  + L  E++      D+    T
Sbjct: 68  RVFKLSVVDDH----IELENFKVKEEANFFGASREPQRLKNLSLELLEPMPGCDMNVTWT 123

Query: 165 TIHLRRCDR-GKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGN 223
               +   + GK+CL           NS++            +    + + DRG D + +
Sbjct: 124 GNSFKGVVKPGKQCLVFRKDKMTYLDNSFE------------ISEQGLISVDRGLDTETD 171

Query: 224 Q-VWGVKAGPYEF 235
           Q VWG  AGP+EF
Sbjct: 172 QLVWGSIAGPFEF 184


>gi|425464319|ref|ZP_18843641.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
 gi|389833697|emb|CCI21546.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
          Length = 194

 Score = 43.9 bits (102), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 91/189 (48%), Gaps = 15/189 (7%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFR 108
           + +A  ++ +  ++EQA  +P  F +IR+   P   E+++G+ L +LEQA+      P+R
Sbjct: 8   KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLNGTSL-FLEQAYDFMLNTPYR 66

Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHL 168
            R + +          +E+ ++   +V+E  NF    R+ + L + +  D+ E +    +
Sbjct: 67  LRVFKLSVVDDH----IELENF---KVKEEANFFGASREPQRL-KNLTLDLLEPMLGCDM 118

Query: 169 RRCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VW 226
                G    ++G   PG     + ++  TY  +   + +   I   DRG D + +Q VW
Sbjct: 119 NVTWTGNS--FKGVVKPGKQCLVFRKDRMTYLDNSFEISERGLISV-DRGLDPETDQLVW 175

Query: 227 GVKAGPYEF 235
           G  AGP+EF
Sbjct: 176 GSIAGPFEF 184


>gi|428313473|ref|YP_007124450.1| hypothetical protein Mic7113_5401 [Microcoleus sp. PCC 7113]
 gi|428255085|gb|AFZ21044.1| CpeT/CpcT family (DUF1001) [Microcoleus sp. PCC 7113]
          Length = 204

 Score = 43.5 bits (101), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 47/191 (24%), Positives = 76/191 (39%), Gaps = 15/191 (7%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  ++GE  +++QA  DP  + ++ +   P     + S  L+ EQA      +P+R R 
Sbjct: 10  LARYMAGEFDNRQQAIADPAWYVHLHLWLRPVPLFTEDSLTLFAEQANIVNLDQPYRPRI 69

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
             + P P E    ++V  Y ++  +  +     P   + L    I    E L    LR  
Sbjct: 70  LRLYPNPTE-PGSLQVQFYMLKNPDAIRGAGRNPELLKQLTTAQI----EFLPGCTLRVK 124

Query: 172 DR------GKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQ 224
            R       +   +  +  P  F  S+Q G  Y  S        E  T+D+G D   G  
Sbjct: 125 QRLVSPNAYEFSAFSATNTPCSF--SYQ-GKDYQISLGFEATPEEFRTYDKGIDPATGKA 181

Query: 225 VWGVKAGPYEF 235
            WG   GP+ F
Sbjct: 182 TWGALLGPFRF 192


>gi|425443208|ref|ZP_18823433.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
 gi|389715535|emb|CCI00103.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
          Length = 194

 Score = 43.5 bits (101), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 91/189 (48%), Gaps = 15/189 (7%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFR 108
           + +A  ++ +  ++EQA  +P  F +IR+   P   E+++G+ L +LEQA+      P+R
Sbjct: 8   KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLNGTSL-FLEQAYDFMLNTPYR 66

Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHL 168
            R + +          +E+ ++   +V+E  NF    R+ + L + +  D+ E +    +
Sbjct: 67  LRVFKLSVVDDH----IELENF---KVKEEANFFGASREPQRL-KNLTLDLLEPMLGCDM 118

Query: 169 RRCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VW 226
                G    ++G   PG     + ++  TY  +   + +   I   DRG D + +Q VW
Sbjct: 119 NVTWTGNS--FKGVVKPGKQCLVFRKDRMTYLDNSFEISERGLISV-DRGLDPETDQLVW 175

Query: 227 GVKAGPYEF 235
           G  AGP+EF
Sbjct: 176 GSIAGPFEF 184


>gi|87300967|ref|ZP_01083809.1| hypothetical protein WH5701_05945 [Synechococcus sp. WH 5701]
 gi|87284838|gb|EAQ76790.1| hypothetical protein WH5701_05945 [Synechococcus sp. WH 5701]
          Length = 201

 Score = 43.5 bits (101), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 51/218 (23%), Positives = 88/218 (40%), Gaps = 36/218 (16%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQA 98
           +T S TR      +   LS    ++ QA  +P  + +I +   P  ++  GS  L LEQ+
Sbjct: 1   MTSSLTR------LVRMLSAGFSNQHQAFENPPIYAHILVRFRPLPQLEPGS--LLLEQS 52

Query: 99  FWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGD 158
           +   P +P+R R    +  P      + + ++A+ +   +    + P     +  E +  
Sbjct: 53  YAFAPGQPYRIRVLRAENAPD---GSLRIHNHALHDERRFWGAVEDPERMAQIQTEDLRL 109

Query: 159 IGEHLTTIH------LRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIH 212
           +      +H      +   + G RCL E            + GAT        +    + 
Sbjct: 110 LEGCTYVVHGQGEGFVGEVEPGCRCLVE------------RKGATAYLVSSFEIDGESMR 157

Query: 213 TWDRGFD-DDGNQVWGVKAGPYEFKPAPSSSYSDMFSP 249
           T DRG D    +Q+WG  AGP+EF+       +D FSP
Sbjct: 158 TIDRGHDPATHDQLWGSLAGPFEFQ------RTDDFSP 189


>gi|443320585|ref|ZP_21049676.1| CpeT/CpcT family (DUF1001) [Gloeocapsa sp. PCC 73106]
 gi|442789691|gb|ELR99333.1| CpeT/CpcT family (DUF1001) [Gloeocapsa sp. PCC 73106]
          Length = 195

 Score = 43.5 bits (101), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 48/190 (25%), Positives = 83/190 (43%), Gaps = 26/190 (13%)

Query: 55  SLSGEKFS-KEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYM 113
           SL    FS ++QA  +P  F +IR+   P          L+LEQA+      P+R R   
Sbjct: 12  SLMAADFSNQQQAYANPPFFAHIRVCMRPLPYEFLSETSLFLEQAYDCMLDVPYRLRVLK 71

Query: 114 VKPCPKELKCDVEVSSYAIREVEEY----KNFCDRPRDQRPLPEEVI--GDIGEHLTTIH 167
                      +E+ +Y +RE E++    +N  +  ++ +P   E++   D+    T   
Sbjct: 72  FTSVDGH----IELENYKVREQEKFFGASRNL-ELLKNLQPEQLELLEGCDMTVEWTGSS 126

Query: 168 LRRCDR-GKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-V 225
            +   + GK C+ E         NS++            +   ++ ++DRGFD + N+ V
Sbjct: 127 FKGLIKPGKACIVERKGKKTYLDNSFE------------IDEQKLISFDRGFDPETNELV 174

Query: 226 WGVKAGPYEF 235
           WG  AGP+ F
Sbjct: 175 WGSIAGPFNF 184


>gi|166363473|ref|YP_001655746.1| hypothetical protein MAE_07320 [Microcystis aeruginosa NIES-843]
 gi|317374817|sp|B0JQ95.1|CPXT1_MICAN RecName: Full=Chromophore lyase CpcT/CpeT 1
 gi|166085846|dbj|BAG00554.1| hypothetical protein MAE_07320 [Microcystis aeruginosa NIES-843]
          Length = 194

 Score = 43.5 bits (101), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 91/189 (48%), Gaps = 15/189 (7%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFR 108
           + +A  ++ +  ++EQA  +P  F +IR+   P   E+++G+ L +LEQA+      P+R
Sbjct: 8   KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLNGTSL-FLEQAYDFMLNTPYR 66

Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHL 168
            R + +          +E+ ++   +V+E  NF    R+ + L + +  D+ E +    +
Sbjct: 67  LRVFKLSLVDDH----IELENF---KVKEEANFFGASREPQRL-KNLTLDLLEPMLGCDM 118

Query: 169 RRCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VW 226
                G    ++G   PG     + ++  TY  +   + +   I   DRG D + +Q VW
Sbjct: 119 NVTWTGNS--FKGVVKPGKQCLVFRKDRMTYLDNSFEISERGLISV-DRGLDPETDQLVW 175

Query: 227 GVKAGPYEF 235
           G  AGP+EF
Sbjct: 176 GSIAGPFEF 184


>gi|443475946|ref|ZP_21065875.1| protein of unknown function DUF1001 [Pseudanabaena biceps PCC 7429]
 gi|443019142|gb|ELS33279.1| protein of unknown function DUF1001 [Pseudanabaena biceps PCC 7429]
          Length = 214

 Score = 43.5 bits (101), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 56/213 (26%), Positives = 96/213 (45%), Gaps = 20/213 (9%)

Query: 51  IVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAE-MVDGSKLLYLEQAFWRTPQKPFRQ 109
           I+A  L+G+  + EQA  +P  F +IR+   P    + +    L+LEQA+      P+R 
Sbjct: 15  ILAQWLAGDHSNWEQAIDNPPFFAHIRVGIRPLPNPITENGVWLFLEQAYDFELNHPYRT 74

Query: 110 R-FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQ-RPLPEEVIGDIGEHLTTIH 167
              +++    +     +E+ +Y ++  E + N   R RD+   L  + I  +      ++
Sbjct: 75  AVLHLIYQNDR-----IEMINYRLKNAETFFN-ASRDRDRLAALNADAIAQLEGCTQWVY 128

Query: 168 LRRCDRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGNQ-V 225
                R     ++GS  PG      + G  TY   E  V +N    + DRG+D   ++ V
Sbjct: 129 -----RNDAHTFKGSVEPGKKCCINRKGVDTYLAIEFEVTENT-YSSLDRGYDVVTDERV 182

Query: 226 WGVKAGPYEFKPAPSSSYSDMFSPLNFPPQQFL 258
           WG  AG ++F     +S+SD  S L   PQ  +
Sbjct: 183 WGSIAGAFQF--VKKASFSDEVS-LGSTPQGII 212


>gi|427722227|ref|YP_007069504.1| hypothetical protein Lepto7376_0226 [Leptolyngbya sp. PCC 7376]
 gi|427353947|gb|AFY36670.1| protein of unknown function DUF1001 [Leptolyngbya sp. PCC 7376]
          Length = 197

 Score = 43.5 bits (101), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 53/210 (25%), Positives = 88/210 (41%), Gaps = 39/210 (18%)

Query: 49  ARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFR 108
           A+ +A  ++GE  + +QA  +   F +I++   P  +       L+LEQA+    ++P+R
Sbjct: 7   AKALATWMAGEFSNFDQALANSALFAHIKVCMRPMPQEFFEGYGLFLEQAYSSGLEQPYR 66

Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG-----DIGEHL 163
            R + +KP    L            E+  YK     P+++  + E+  G     DI   L
Sbjct: 67  LRVFHIKPVGDHL------------ELIHYK-----PKEE--IKEQFFGASRNLDI---L 104

Query: 164 TTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGA----------TYSTSELAVLKNNEIHT 213
            T+ +   D    C    +   G F    Q G           +Y  +   +  NN I +
Sbjct: 105 KTVTMEALDPMPGCDMIVTFENGAFQGVVQEGKGCKVFRNNTESYLDNRFEITANNLI-S 163

Query: 214 WDRGFDDDGNQ-VWGVKAGPYEFKPAPSSS 242
            DRG D   ++ VWG  AG +EFK   + S
Sbjct: 164 IDRGRDPVTDEIVWGSLAGAFEFKKIKNFS 193


>gi|428223567|ref|YP_007107664.1| hypothetical protein GEI7407_0107 [Geitlerinema sp. PCC 7407]
 gi|427983468|gb|AFY64612.1| protein of unknown function DUF1001 [Geitlerinema sp. PCC 7407]
          Length = 205

 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 43/186 (23%), Positives = 76/186 (40%), Gaps = 9/186 (4%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
           L+G   + +QA  +P  +  +R+   P     + S  LYLEQ+      +P+R R   + 
Sbjct: 14  LAGAFENGDQAREEPAWYVPLRLWQRPIPLFREDSWTLYLEQSNLLKLDQPYRPRVLRIT 73

Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
             P +    +    Y +R+    +     P   R L  + + D+     TI + R  +G 
Sbjct: 74  EHPSQPDALLG-RYYQLRDAAAVRGGGQDPERLRSLCADDLVDLPGCGLTIAIARSPQGP 132

Query: 176 RCLYEGSTPPGGFPNSWQNGATYSTS---ELAVLKNNEIH--TWDRGFD-DDGNQVWGVK 229
              ++ S PP G       G+    S   E+       +   ++D+G D + G  +WG  
Sbjct: 133 --YFQASLPPEGRCCFTVEGSLRQVSLGFEVGTTAEGSVEFLSYDKGIDPETGRAIWGAL 190

Query: 230 AGPYEF 235
            GP+ F
Sbjct: 191 MGPFRF 196


>gi|390440361|ref|ZP_10228696.1| conserved hypothetical protein [Microcystis sp. T1-4]
 gi|389836219|emb|CCI32822.1| conserved hypothetical protein [Microcystis sp. T1-4]
          Length = 194

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 90/189 (47%), Gaps = 15/189 (7%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFR 108
           + +A  ++ +  ++EQA  +P  F +IR+   P   E+++G+ L +LEQA+      P+R
Sbjct: 8   KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLEGTSL-FLEQAYDFMLNTPYR 66

Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHL 168
            R + +          +E+ ++   +V+E  NF    R+   L + +  D+ E +    +
Sbjct: 67  LRVFKLSVVDDH----IELENF---KVKEEANFFGASREPHRL-KNLTLDLLEPMPGCDM 118

Query: 169 RRCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VW 226
                G    ++G   PG     + ++  TY  +   + +   I   DRG D + +Q VW
Sbjct: 119 NVTWTGNS--FKGVVKPGKQCLVFRKDKMTYLDNSFEISERGLISV-DRGLDPETDQLVW 175

Query: 227 GVKAGPYEF 235
           G  AGP+EF
Sbjct: 176 GSIAGPFEF 184


>gi|81299611|ref|YP_399819.1| hypothetical protein Synpcc7942_0800 [Synechococcus elongatus PCC
           7942]
 gi|81168492|gb|ABB56832.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 197

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 49/196 (25%), Positives = 84/196 (42%), Gaps = 33/196 (16%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFRQR 110
           +A  L+ +  ++ QA  +P  F +IR+   P A E++DG  L Y+EQA+     +P+R R
Sbjct: 10  LARWLAADFSNQAQAFENPPFFAHIRVCMRPLALEVLDGLSL-YVEQAYDLDVNQPYRAR 68

Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR 170
              + P        +E+ ++ + + E+      R RD+              L  +   R
Sbjct: 69  VLKLVPAGDH----IEIENFRVIDGEDLFG-ASRDRDR--------------LAKLTRDR 109

Query: 171 CDRGKRCLYEGSTPPGGFPNSWQNG----------ATYSTSELAVLKNNEIHTWDRG-FD 219
            ++   C +       GF    + G          ATY  SE  +  + ++ + DRG   
Sbjct: 110 LEKLPGCTFITHWTGNGFKGVVEPGKGCIVERKGQATYLESEFEI-DDEKMISLDRGRHP 168

Query: 220 DDGNQVWGVKAGPYEF 235
           +   QVWG  AGP+ F
Sbjct: 169 ETDEQVWGSVAGPFHF 184


>gi|37520762|ref|NP_924139.1| hypothetical protein gvip163 [Gloeobacter violaceus PCC 7421]
 gi|81710603|sp|Q7NLD3.1|CPXT3_GLOVI RecName: Full=Chromophore lyase CpcT/CpeT 3
 gi|35211757|dbj|BAC89134.1| cpeT [Gloeobacter violaceus PCC 7421]
          Length = 203

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 42/187 (22%), Positives = 82/187 (43%), Gaps = 13/187 (6%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  ++G+  +++QA   P  F +IR+   P      G+   Y EQ +      P+RQ  
Sbjct: 12  LARWMAGDFSNQKQALAQPQTFAHIRVFFRPLPFAFFGTVGFYSEQTYDYDLWSPYRQGL 71

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
           + +     + +  + + +Y +++   Y      P     +P + +         +  R  
Sbjct: 72  HRLL----DREGGIYIENYGLQDAGLYAGAGHDPEILATIPTDCL-QPRRGCAMVFRREG 126

Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGN-QVWGVK 229
           D      + GS  PG      ++G  TY  SE+ + ++  + + DRG D + + Q+WG +
Sbjct: 127 D-----CFRGSVEPGNHCLIPRDGYWTYLVSEVELTESTWV-SLDRGMDRETHRQIWGSQ 180

Query: 230 AGPYEFK 236
           AGP  F+
Sbjct: 181 AGPLHFE 187


>gi|425450956|ref|ZP_18830779.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 7941]
 gi|440755306|ref|ZP_20934508.1| phycocyanobilin lyase CpcT [Microcystis aeruginosa TAIHU98]
 gi|389767993|emb|CCI06782.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 7941]
 gi|440175512|gb|ELP54881.1| phycocyanobilin lyase CpcT [Microcystis aeruginosa TAIHU98]
          Length = 194

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 43/193 (22%), Positives = 83/193 (43%), Gaps = 23/193 (11%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           + +A  ++ +  ++EQA  +P  F +IR+   P  + +     L+LEQA+      P+R 
Sbjct: 8   KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLKGTSLFLEQAYDFMLNTPYRL 67

Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG-----DIGEHLT 164
           R + +          +E+ ++ ++E   +      P+  + L  E++      D+    T
Sbjct: 68  RVFKLSVVDDH----IELENFKVKEEANFFGASREPQRLKNLSLELLEPMPGCDMNVTWT 123

Query: 165 TIHLRRCDR-GKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGN 223
               +   + GK+CL           NS++            +    + + DRG D + +
Sbjct: 124 GNSFKGVVKPGKQCLVFRKDKMTYLDNSFE------------ISEQGLISVDRGLDPETD 171

Query: 224 Q-VWGVKAGPYEF 235
           Q VWG  AGP+EF
Sbjct: 172 QLVWGSIAGPFEF 184


>gi|119513456|ref|ZP_01632483.1| hypothetical protein N9414_11904 [Nodularia spumigena CCY9414]
 gi|119461888|gb|EAW42898.1| hypothetical protein N9414_11904 [Nodularia spumigena CCY9414]
          Length = 196

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 47/176 (26%), Positives = 79/176 (44%), Gaps = 13/176 (7%)

Query: 62  SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
           ++ QA  +P  F +IR+   P    V     L++EQA+      P+R R   +       
Sbjct: 16  NQAQAFENPPFFAHIRVCMRPLPVEVLSGISLFVEQAYDYMLNDPYRVRVLQLVNAGNR- 74

Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
              +E+ +Y +++ E   NF    R+   L + + GD  E L   ++     G    ++G
Sbjct: 75  ---IEIVNYTVKQEE---NFYGASRNLERL-KTLTGDRLEKLPGCNMTVEWAGNS--FKG 125

Query: 182 STPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYEF 235
              PG     ++ G  TY  SE  + +   I + DRG D + N+ +WG  AGP+ F
Sbjct: 126 KVEPGKGCIVFRKGQKTYLDSEFEISEERFI-SLDRGRDLETNEHIWGSVAGPFHF 180


>gi|307154867|ref|YP_003890251.1| hypothetical protein Cyan7822_5091 [Cyanothece sp. PCC 7822]
 gi|306985095|gb|ADN16976.1| protein of unknown function DUF1001 [Cyanothece sp. PCC 7822]
          Length = 195

 Score = 42.7 bits (99), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 52/204 (25%), Positives = 88/204 (43%), Gaps = 34/204 (16%)

Query: 41  KSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFW 100
           K+  RW         ++ +  ++EQA  +P  + +IR+   P    + G+  L+LEQA+ 
Sbjct: 8   KTLVRW---------MAADFSNQEQAWANPPLYAHIRVCMRPLPYELLGATSLFLEQAYD 58

Query: 101 RTPQKPFRQR---FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
                P+R R   F +V          +E+ +Y ++E E+Y       RD   L    + 
Sbjct: 59  FMLNVPYRLRVLKFTVVDDV-------IELENYKVKEQEQYFG---AARDLERLKNLTVD 108

Query: 158 DIGEHLTTIHLRRCDRGKRCL---YEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHT 213
           D+ +      L  CD         ++G+  PG      +N   TY  +   V +   I +
Sbjct: 109 DLEK------LPGCDMTVEWTGNSFKGTIKPGKQCIVVRNNKETYLDNSFEVDEQTLI-S 161

Query: 214 WDRGFDDDGNQ-VWGVKAGPYEFK 236
            DRG+D   ++ VWG  AG + FK
Sbjct: 162 LDRGYDPQTDELVWGSVAGAFHFK 185


>gi|172036295|ref|YP_001802796.1| hypothetical protein cce_1380 [Cyanothece sp. ATCC 51142]
 gi|354553088|ref|ZP_08972395.1| protein of unknown function DUF1001 [Cyanothece sp. ATCC 51472]
 gi|171697749|gb|ACB50730.1| DUF1001-containing protein [Cyanothece sp. ATCC 51142]
 gi|353554918|gb|EHC24307.1| protein of unknown function DUF1001 [Cyanothece sp. ATCC 51472]
          Length = 196

 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 44/183 (24%), Positives = 78/183 (42%), Gaps = 17/183 (9%)

Query: 62  SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
           ++EQA  +P  + +IR+   P          L+LEQA+     +P+R R   +    +  
Sbjct: 20  NQEQAFENPPFYAHIRVCIRPLPLSNFPEPSLFLEQAYDYALNQPYRIRVLKLNIVEER- 78

Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRG---KRCL 178
              +E+ +Y +++ E    F    R+   L +    DI        ++ CD         
Sbjct: 79  ---IELENYKLKDKE---TFLGASREPEKLKKLTPNDIE------LMQGCDMFVDWTGTS 126

Query: 179 YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYEFKP 237
           ++G   PG      +NG          + ++++ + DRG+D   N+ VWG  AG + FKP
Sbjct: 127 FKGMVKPGRNCRIVRNGKETYLDNSFEINDHQLISLDRGYDPITNELVWGSVAGAFHFKP 186

Query: 238 APS 240
             S
Sbjct: 187 RQS 189


>gi|428213027|ref|YP_007086171.1| hypothetical protein Oscil6304_2635 [Oscillatoria acuminata PCC
           6304]
 gi|428001408|gb|AFY82251.1| CpeT/CpcT family (DUF1001) [Oscillatoria acuminata PCC 6304]
          Length = 211

 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 79/192 (41%), Gaps = 13/192 (6%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLT--CPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           +   L+GE  ++EQA  DP  + ++ +     P     + S  L+ EQA      +P+R 
Sbjct: 10  LGQYLAGEFDNREQAIADPAWYVHLHLWQRPVPVQLFSEPSLTLFAEQANILHLDRPYRP 69

Query: 110 RFYMVKPCPKELKCD----VEVSSYAIREVEEYKNFCDRPRDQRPL-PEEVIGDIGEHLT 164
           R   ++   + L  D    +EV  + +++  +Y+    +P   + L P +V    G  LT
Sbjct: 70  RIMQLR---QLLNGDSPPQLEVQYFMLKDPNQYRGAGRQPHLLQQLNPAQVELLPGCTLT 126

Query: 165 TIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGN 223
               R  D      +  S P G        G T            E+ ++D+G D + G 
Sbjct: 127 VSQKRLTDGTYH--FAASLPSGARCCFLYQGETRQVDLGFEASPTELFSFDKGIDPNTGK 184

Query: 224 QVWGVKAGPYEF 235
            +WG   GP+ F
Sbjct: 185 ALWGAILGPFHF 196


>gi|148238814|ref|YP_001224201.1| hypothetical protein SynWH7803_0478 [Synechococcus sp. WH 7803]
 gi|147847353|emb|CAK22904.1| Conserved hypothetical protein in phycobilisome rod gene region
           [Synechococcus sp. WH 7803]
          Length = 197

 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 46/188 (24%), Positives = 76/188 (40%), Gaps = 12/188 (6%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
           L GE  +++QA  +P  + +I +   P  ++  GS LL  EQ +   P  P+R R    +
Sbjct: 13  LCGEYSNQQQAFDNPPLYAHIFLRYRPLPQLTPGSILL--EQTYAVDPNTPYRLRMIRAE 70

Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
             P      +++ ++  RE   +      P     L   +      +L   H +  ++  
Sbjct: 71  EQPSG---AIKLWNHTFREPSRFAGATFNPE----LRHSIQATDLINLEQCHYQVLEQED 123

Query: 176 RCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYE 234
              Y G+  PG      +NG          L+   + T DRG D   N+  WG  AGP+ 
Sbjct: 124 G--YHGAMEPGCQCIVRRNGKDTVLVSSFHLQGESLQTLDRGHDPITNERCWGSVAGPFR 181

Query: 235 FKPAPSSS 242
           F+   S S
Sbjct: 182 FQRTQSWS 189


>gi|257059281|ref|YP_003137169.1| hypothetical protein Cyan8802_1417 [Cyanothece sp. PCC 8802]
 gi|256589447|gb|ACV00334.1| protein of unknown function DUF1001 [Cyanothece sp. PCC 8802]
          Length = 196

 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 50/203 (24%), Positives = 85/203 (41%), Gaps = 23/203 (11%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  +S +  ++EQA  +P  F +IR+   P    V     L+LEQA+     +P+R R 
Sbjct: 10  LARWMSADFSNQEQAYENPPFFAHIRVCIRPLPLSVFSEPTLFLEQAYDFMLNQPYRLRV 69

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR- 170
           +        +   +E+ +Y   +V+E  +F    RD   L           LT  HL + 
Sbjct: 70  FRFNI----IDGRIELENY---KVQEEASFFGASRDLEKL---------RKLTPDHLEKM 113

Query: 171 --CDRGKRCL---YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ- 224
             CD         ++G   PG      +N           +  +++ + DRG D   ++ 
Sbjct: 114 PGCDMFVDWTGNSFKGVVKPGKNCIVVRNERETYLDNSFEIDEHKLISVDRGLDPKTDEL 173

Query: 225 VWGVKAGPYEFKPAPSSSYSDMF 247
           VWG  AGP+ F+   S ++   F
Sbjct: 174 VWGSVAGPFHFQRKTSFAHEVEF 196


>gi|425453939|ref|ZP_18833688.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9807]
 gi|389799891|emb|CCI20607.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9807]
          Length = 194

 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 46/188 (24%), Positives = 87/188 (46%), Gaps = 13/188 (6%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           + +A  ++ +  ++EQA  +P  F +IR+   P  + +     L+LEQA+      P+R 
Sbjct: 8   KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLKGTSLFLEQAYDFMLNTPYRL 67

Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
           R + +          +E+ ++   +V+E  NF    R+ + L + +  D+ E +    + 
Sbjct: 68  RVFKLSVVDDH----IELENF---KVKEEANFFGASREPQRL-KNLTLDLLEPMPGCDMN 119

Query: 170 RCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
               G    ++G   PG     + ++  TY  +   + +   I   DRG D + +Q VWG
Sbjct: 120 VTWTGNS--FKGVVKPGKQCLIFRKDRMTYLDNSFEISERGLISV-DRGLDPETDQLVWG 176

Query: 228 VKAGPYEF 235
             AGP+EF
Sbjct: 177 SIAGPFEF 184


>gi|425458822|ref|ZP_18838308.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9808]
 gi|389823850|emb|CCI27632.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9808]
          Length = 194

 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 46/188 (24%), Positives = 87/188 (46%), Gaps = 13/188 (6%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           + +A  ++ +  ++EQA  +P  F +IR+   P  + +     L+LEQA+      P+R 
Sbjct: 8   KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLKGTSLFLEQAYDFMLNTPYRL 67

Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
           R + +          +E+ ++   +V+E  NF    R+ + L + +  D+ E +    + 
Sbjct: 68  RVFKLSVVDDH----IELENF---KVKEEANFFGASREPQRL-KNLSLDLLEPMPGCDMN 119

Query: 170 RCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
               G    ++G   PG     + ++  TY  +   + +   I   DRG D + +Q VWG
Sbjct: 120 VTWTGNS--FKGVVKPGKQCLVFRKDKMTYLDNSFEISEQGLISV-DRGLDPETDQLVWG 176

Query: 228 VKAGPYEF 235
             AGP+EF
Sbjct: 177 SIAGPFEF 184


>gi|409994233|ref|ZP_11277350.1| hypothetical protein APPUASWS_23988 [Arthrospira platensis str.
           Paraca]
 gi|291566213|dbj|BAI88485.1| putative phycocyanobilin lyase [Arthrospira platensis NIES-39]
 gi|409934895|gb|EKN76442.1| hypothetical protein APPUASWS_23988 [Arthrospira platensis str.
           Paraca]
          Length = 198

 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 48/183 (26%), Positives = 80/183 (43%), Gaps = 15/183 (8%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFRQRFYMV 114
           L+ +  ++ QA  +P  F +IR+   P   E+++G  L YLEQA+    ++P+R R   +
Sbjct: 14  LASDFSNQPQAFENPPFFAHIRVCMRPLPWELLNGLSL-YLEQAYDINLKQPYRVRVLKL 72

Query: 115 KPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRG 174
            P        +E+ +Y I   EE+          + L  E +          H      G
Sbjct: 73  IPVDDH----IEIENYEIDGQEEFYGASREVDRLKALKREQLNKRCGCTFITHW----TG 124

Query: 175 KRCLYEGSTPPG-GFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGP 232
           K   ++G   PG G     +   TY  S   +   +   + DRG D + ++ +WG  AGP
Sbjct: 125 KG--FKGVVEPGKGCMVERKGKLTYLDSSFEI-DGDRFISHDRGMDPETDEHIWGAVAGP 181

Query: 233 YEF 235
           +EF
Sbjct: 182 FEF 184


>gi|428224114|ref|YP_007108211.1| hypothetical protein GEI7407_0661 [Geitlerinema sp. PCC 7407]
 gi|427984015|gb|AFY65159.1| protein of unknown function DUF1001 [Geitlerinema sp. PCC 7407]
          Length = 197

 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 51/200 (25%), Positives = 89/200 (44%), Gaps = 19/200 (9%)

Query: 39  LTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQ 97
           +T S      AR +A   S    +++QA  +P  F +IR+   P A +++DG    Y+EQ
Sbjct: 1   MTHSTDILALARWMAADFS----NQQQAFDNPPFFAHIRVCMRPLALDLLDGVSF-YVEQ 55

Query: 98  AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
           A+      P+R R   +          +E+ +Y +++ E+Y     +P   + L  E + 
Sbjct: 56  AYDYILNSPYRVRVLKLVVNGDR----IEIENYKVKQQEQYYGASRQPHRLQTLSAEGL- 110

Query: 158 DIGEHLTTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDR 216
              E L   ++     G    ++G   PG      + G  TY  SE  +  + +  + DR
Sbjct: 111 ---EKLPGCNMVVEWTGSS--FKGVVEPGKACMVERKGNLTYLDSEFEI-DDEKFISLDR 164

Query: 217 GFDDDGNQ-VWGVKAGPYEF 235
           G D   ++ +WG  AGP+ F
Sbjct: 165 GRDPQTDEHIWGSVAGPFHF 184


>gi|218440601|ref|YP_002378930.1| hypothetical protein PCC7424_3679 [Cyanothece sp. PCC 7424]
 gi|218173329|gb|ACK72062.1| protein of unknown function DUF1001 [Cyanothece sp. PCC 7424]
          Length = 195

 Score = 42.4 bits (98), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 51/196 (26%), Positives = 88/196 (44%), Gaps = 29/196 (14%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           + +A  ++ +  ++EQA  +P  + +IR+   P +  V G   L+LEQA+      P+R 
Sbjct: 8   KTLARWMAADFSNQEQAWANPPLYAHIRVCMRPLSHDVLGGTSLFLEQAYDFMLNSPYRL 67

Query: 110 RFYMVKPCPKELKCD--VEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIH 167
           R        K +  D  +E+ +Y ++E E+   F    RD + L         + LT  H
Sbjct: 68  RVL------KFIVVDDRIELENYKVKEQEQ---FFGAARDPKRL---------QTLTIDH 109

Query: 168 LRR---CDRGKRCL---YEGSTPPGG-FPNSWQNGATYSTSELAVLKNNEIHTWDRGFDD 220
           L +   CD         ++G   PG       ++  TY  +   + ++  I + DRGFD 
Sbjct: 110 LEKLPGCDMTVEWTGHSFKGQVKPGKQCIVIRKDKETYLDNSFEIDEHKLI-SIDRGFDP 168

Query: 221 DGNQ-VWGVKAGPYEF 235
             ++ VWG  AG + F
Sbjct: 169 VTDELVWGSIAGAFHF 184


>gi|116072611|ref|ZP_01469877.1| hypothetical protein BL107_08871 [Synechococcus sp. BL107]
 gi|116064498|gb|EAU70258.1| hypothetical protein BL107_08871 [Synechococcus sp. BL107]
          Length = 192

 Score = 42.4 bits (98), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 43/183 (23%), Positives = 80/183 (43%), Gaps = 12/183 (6%)

Query: 54  DSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYM 113
           D L GE  +++QA  +P  F +I +       +  GS LL  EQ++   P+KP+R R   
Sbjct: 10  DLLCGEYSNQKQAIENPPFFAHIFLKYKAINHLQPGSILL--EQSYAVDPKKPYRLRMIR 67

Query: 114 VKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDR 173
            +         +++ ++A ++ +++          + + EE +  +       H+   D 
Sbjct: 68  AEETSSG---TIKLWNHAFKDPKQFTEAMGEQTQTKNITEEDLILLDN--CHYHVVEKDD 122

Query: 174 GKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGP 232
           G    Y G    G      ++G          L+ N++ T DRG+D   N+ +WG  AG 
Sbjct: 123 G----YHGGMEAGCRCIVCRDGKDSYLKSSFHLEGNKLSTLDRGYDLKTNERLWGSIAGE 178

Query: 233 YEF 235
           ++F
Sbjct: 179 FQF 181


>gi|124023924|ref|YP_001018231.1| CpeT [Prochlorococcus marinus str. MIT 9303]
 gi|123964210|gb|ABM78966.1| CpeT [Prochlorococcus marinus str. MIT 9303]
          Length = 204

 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 82/195 (42%), Gaps = 30/195 (15%)

Query: 53  ADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFY 112
           A +L+    +K+QA +DP  F  I +   P    V      Y EQ++   P +P+RQ  +
Sbjct: 10  AKTLAAHYSNKQQAQKDPIKFARINIYFRPLLWDVFQGPGFYSEQSYDYAPWRPYRQGVH 69

Query: 113 MVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHL---TTIHLR 169
            +    KE+     + +Y+++E + +      P            D+  HL   T +  R
Sbjct: 70  KLL-ASKEVFI---LENYSLQEPKRFAGAGFYP------------DLLNHLCADTLLPRR 113

Query: 170 RC----DRGKRCLYEGSTPPGG---FPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDG 222
            C       K   Y G   PG     P +  N  TY  SE+   +   I + DRGFD   
Sbjct: 114 GCAMHFKEVKPNQYRGQVEPGNNCLVPRN--NTMTYLVSEVEFDEKIWI-SRDRGFDPKT 170

Query: 223 N-QVWGVKAGPYEFK 236
           + Q+WG K GP  F+
Sbjct: 171 HEQLWGSKHGPLHFQ 185


>gi|425445152|ref|ZP_18825188.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9443]
 gi|389734886|emb|CCI01493.1| Similar to tr|Q8YLF9|Q8YLF9 [Microcystis aeruginosa PCC 9443]
          Length = 194

 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 46/188 (24%), Positives = 87/188 (46%), Gaps = 13/188 (6%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           + +A  ++ +  ++EQA  +P  F +IR+   P  + +     L+LEQA+      P+R 
Sbjct: 8   KTLARWMAADFSNQEQAFANPPFFAHIRVCMRPLPDELLKGTSLFLEQAYDFMLNTPYRL 67

Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
           R + +          +E+ ++   +V+E  NF    R+ + L + +  D+ E +    + 
Sbjct: 68  RVFKLSVVDDH----IELENF---KVKEEANFFGASREPQRL-KNLSLDLLEPMPGCDMN 119

Query: 170 RCDRGKRCLYEGSTPPGGFPNSW-QNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
               G    ++G   PG     + ++  TY  +   + +   I   DRG D + +Q VWG
Sbjct: 120 VTWTGNS--FKGVVKPGKQCLIFRKDRMTYLDNSFEISEQGLISV-DRGLDPETDQLVWG 176

Query: 228 VKAGPYEF 235
             AGP+EF
Sbjct: 177 SIAGPFEF 184


>gi|428304538|ref|YP_007141363.1| hypothetical protein Cri9333_0938 [Crinalium epipsammum PCC 9333]
 gi|428246073|gb|AFZ11853.1| protein of unknown function DUF1001 [Crinalium epipsammum PCC 9333]
          Length = 204

 Score = 41.6 bits (96), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 39/182 (21%), Positives = 71/182 (39%), Gaps = 5/182 (2%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
           L+GE  ++ QA  +P  + ++ +   P     + S  L+ EQA       P+R R   ++
Sbjct: 14  LAGEFDNRAQALAEPAWYVHLHLWQRPVPLFTEDSITLFAEQANIVNLDHPYRPRILRLR 73

Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
                    + V  Y  + +E  K    +P   + L    I  +     TI+ ++     
Sbjct: 74  DSNTP-SGSIHVQYYMFKNLEAIKGAGRKPELLKTLTTSDIELLPSCTLTINPQQITTDT 132

Query: 176 RCLYEGSTPPGGFPNSWQ-NGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVKAGPY 233
                 ++P    P  +   G TY        + NE H++D+G +   G  +WG   G Y
Sbjct: 133 WEF--ATSPTSDVPCCFNYQGQTYQVFLGFTARKNESHSYDKGIEPQTGKAIWGALMGSY 190

Query: 234 EF 235
            +
Sbjct: 191 RY 192


>gi|126657830|ref|ZP_01728983.1| hypothetical protein CY0110_13236 [Cyanothece sp. CCY0110]
 gi|126620770|gb|EAZ91486.1| hypothetical protein CY0110_13236 [Cyanothece sp. CCY0110]
          Length = 196

 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 41/176 (23%), Positives = 76/176 (43%), Gaps = 11/176 (6%)

Query: 62  SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
           ++EQA  +P  + +IR+   P          L+LEQA+     +P+R R   +K      
Sbjct: 20  NQEQAFENPPFYAHIRVCIRPLPVDNFPEPSLFLEQAYDYALNQPYRVRVLQLKIVDNR- 78

Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYEG 181
              +E+ +Y ++E E +     +P   +    E+  D  E +    +     G    ++G
Sbjct: 79  ---IELENYKLKEKEAFLGASRQPEKLK----EITPDDIELMQGCDMFVDWTGTS--FKG 129

Query: 182 STPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYEFK 236
           +  PG      + G          +  +++ + DRG+D   N+ VWG  AGP+ F+
Sbjct: 130 TVKPGRNCRIVRKGKETYLDNSFEINEHQLISLDRGYDPVTNELVWGSVAGPFHFE 185


>gi|218246228|ref|YP_002371599.1| hypothetical protein PCC8801_1383 [Cyanothece sp. PCC 8801]
 gi|218166706|gb|ACK65443.1| protein of unknown function DUF1001 [Cyanothece sp. PCC 8801]
          Length = 196

 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 50/203 (24%), Positives = 85/203 (41%), Gaps = 23/203 (11%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  +S +  ++EQA  +P  F +IR+   P    V     L+LEQA+     +P+R R 
Sbjct: 10  LARWMSADFSNQEQAYENPPFFAHIRVCIRPLPLSVFPEPTLFLEQAYDFMLNQPYRLRV 69

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR- 170
           +        +   +E+ +Y   +V+E  +F    RD   L           LT  HL + 
Sbjct: 70  FRFNI----IDGRIELENY---KVQEEASFFGASRDLEKL---------RKLTPDHLEKM 113

Query: 171 --CDRGKRCL---YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ- 224
             CD         ++G   PG      +N           +  +++ + DRG D   ++ 
Sbjct: 114 PGCDMFVDWTGNSFKGVVKPGKNCIVVRNERETYLDNSFEIDEHKLISVDRGLDPKTDEL 173

Query: 225 VWGVKAGPYEFKPAPSSSYSDMF 247
           VWG  AGP+ F+   S ++   F
Sbjct: 174 VWGSVAGPFHFQRKTSFAHEVEF 196


>gi|352096586|ref|ZP_08957413.1| protein of unknown function DUF1001 [Synechococcus sp. WH 8016]
 gi|351676236|gb|EHA59390.1| protein of unknown function DUF1001 [Synechococcus sp. WH 8016]
          Length = 201

 Score = 41.6 bits (96), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 80/191 (41%), Gaps = 14/191 (7%)

Query: 53  ADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFY 112
           A +L+G   + EQ+  +P +F +I +   P           Y EQ++   P  P+RQ  +
Sbjct: 11  AKTLAGHYSNFEQSQDNPKDFAHINIYFRPLPWETLKGPGFYSEQSYDHDPWSPYRQGIH 70

Query: 113 MVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLP-EEVIGDIGEHLTTIHLRRC 171
            ++    +++    V ++   +         RP   R L  E +    G     +H    
Sbjct: 71  RLQ----QIQDIFVVENFGFADALRLAGSGQRPELLRSLKVESLTPRCG---CDMHFHEV 123

Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGN-QVWGVK 229
           + G    Y G   PG      ++G  TY  SE+ V     + + DRGFD + + Q WG +
Sbjct: 124 ETG---CYRGEVEPGHKCLVPRDGQLTYLISEVDVDSTTWV-SRDRGFDPETHAQKWGSE 179

Query: 230 AGPYEFKPAPS 240
            GP +FK   S
Sbjct: 180 HGPLKFKRVTS 190


>gi|428777464|ref|YP_007169251.1| hypothetical protein PCC7418_2907 [Halothece sp. PCC 7418]
 gi|428691743|gb|AFZ45037.1| protein of unknown function DUF1001 [Halothece sp. PCC 7418]
          Length = 198

 Score = 41.2 bits (95), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 79/183 (43%), Gaps = 15/183 (8%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
           ++ +  ++EQA  +P  F +IR+   P    +     L++EQA+      P+R R     
Sbjct: 14  MAADFSNQEQAFANPPFFAHIRVCMRPLPVALLSGISLFVEQAYDYMLNNPYRMRVL--- 70

Query: 116 PCPKELKCD--VEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDR 173
              K L  D  +E+ +Y +++      F    RD + L +E+  D  E L   ++     
Sbjct: 71  ---KLLVVDDHIEIENYVLKDPSA---FPGASRDPQRL-QELTSDNLERLPGCNMVVNWT 123

Query: 174 GKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGP 232
           G    ++G   PG      + G          +  N+  + DRG D + ++ VWG  AG 
Sbjct: 124 GNS--FQGEVEPGKACKVVRKGQETYLDSTFEIDANQFTSLDRGRDPETDERVWGSVAGA 181

Query: 233 YEF 235
           +EF
Sbjct: 182 FEF 184


>gi|119486368|ref|ZP_01620427.1| hypothetical protein L8106_16979 [Lyngbya sp. PCC 8106]
 gi|119456581|gb|EAW37711.1| hypothetical protein L8106_16979 [Lyngbya sp. PCC 8106]
          Length = 196

 Score = 41.2 bits (95), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 51/192 (26%), Positives = 80/192 (41%), Gaps = 23/192 (11%)

Query: 49  ARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFR 108
           AR +A   S +     QA  +P  F +IR+   P      G   LYLEQA+     +P+R
Sbjct: 11  ARWMASDFSNQP----QAFENPPFFAHIRVCMRPLPTEFLGGVSLYLEQAYDYMINQPYR 66

Query: 109 QRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHL 168
            R     P     +  +E+ +YA+   E++     +P   + L  E +           L
Sbjct: 67  VRVLNFIP----RQDHIEIENYALENQEQFYGASRQPERLQNLKIEQLRK---------L 113

Query: 169 RRCDRGKRCL---YEGSTPPG-GFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ 224
             C    R     ++G   PG G     +   TY  S   V ++  + + DRG   + ++
Sbjct: 114 PCCSFITRWTGHSFKGEVEPGKGCMVERKGQVTYLDSWFEVDEHRMV-SHDRGRHPETDE 172

Query: 225 -VWGVKAGPYEF 235
            VWG  AGP+EF
Sbjct: 173 HVWGSVAGPFEF 184


>gi|443317265|ref|ZP_21046681.1| CpeT/CpcT family (DUF1001) [Leptolyngbya sp. PCC 6406]
 gi|442783150|gb|ELR93074.1| CpeT/CpcT family (DUF1001) [Leptolyngbya sp. PCC 6406]
          Length = 208

 Score = 41.2 bits (95), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 39/197 (19%), Positives = 84/197 (42%), Gaps = 20/197 (10%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKL-LYLEQAFWRTPQKPFRQR 110
           +A  L+GE  +K QA  +P  F  +++   P  ++     +  +LEQA     Q P+RQR
Sbjct: 8   LAQCLAGEFTNKPQALAEPIWFVQLKLWIRPVPQLSSTEHVTFFLEQASMAYDQPPYRQR 67

Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLP-EEVIGDIGEHLTTIH-- 167
              +       + ++  +  A+++ + ++     P     L  ++++   G  L+ +H  
Sbjct: 68  LLQL----ASQQGNLTATYSALKDPKSWQGAAQNPDRLGTLTDQDLMALTGSTLSVVHPE 123

Query: 168 -------LRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDD 220
                  + R   G+RC ++      G     + G    +        ++ + +DRG+D 
Sbjct: 124 NSHGETFIARHFPGERCQFQVD----GQAREVELGFDAISGTGTAQSPDQFYMYDRGYDA 179

Query: 221 DGNQ-VWGVKAGPYEFK 236
             ++  WG   GP++ +
Sbjct: 180 TQDKYTWGALHGPFKLQ 196


>gi|434396856|ref|YP_007130860.1| protein of unknown function DUF1001 [Stanieria cyanosphaera PCC
           7437]
 gi|428267953|gb|AFZ33894.1| protein of unknown function DUF1001 [Stanieria cyanosphaera PCC
           7437]
          Length = 204

 Score = 40.8 bits (94), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 42/194 (21%), Positives = 81/194 (41%), Gaps = 13/194 (6%)

Query: 49  ARIVA--DSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKP 106
           A+++A    L+GE  +  QA   P  + ++R+   P     + S  L+ EQA      +P
Sbjct: 5   AQLIALGQYLAGEFDNHPQALAQPAWYVHLRLWIRPVPLFTEDSITLFAEQASVVNLNQP 64

Query: 107 FRQRFYMVK---PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHL 163
           +R R   ++     P++L+    V  Y  +++E  K+    P+  + +  E I  +    
Sbjct: 65  YRPRLLRLRENEASPQQLR----VEYYMFKDIESVKDAGTNPQLLQQITLEQIEFLPNCT 120

Query: 164 TTIHLRRCDRGKRCLYEGSTPPGGFPNSWQ-NGATYSTSELAVLKNNEIHTWDRGFD-DD 221
             +  +     K   +  ++P    P S+   G  +       +    + T+D+G D   
Sbjct: 121 LNVEFQHNSANKY--HFKTSPQSDCPCSFNYQGKNFQVFLGFEVTPETLLTYDKGIDPQT 178

Query: 222 GNQVWGVKAGPYEF 235
           G  +WG   G Y+F
Sbjct: 179 GKGIWGALLGAYQF 192


>gi|194477086|ref|YP_002049265.1| hypothetical protein PCC_0629 [Paulinella chromatophora]
 gi|317374820|sp|B1X536.1|CPXT_PAUCH RecName: Full=Chromophore lyase CpcT/CpeT
 gi|171192093|gb|ACB43055.1| hypothetical protein PCC_0629 [Paulinella chromatophora]
          Length = 201

 Score = 40.8 bits (94), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 46/186 (24%), Positives = 80/186 (43%), Gaps = 14/186 (7%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +   LS    +++QA  +P  + +I +   P   ++ GS  L +EQ++   P KP+R R 
Sbjct: 8   LVHQLSARFNNQQQAFDNPPLYAHIVVNCRPLVHLLPGS--LLIEQSYAMDPLKPYRIRV 65

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
              +   ++L     + SY++ + ++Y      P     + E+ +  I E    I     
Sbjct: 66  LRAQTRDEKLI----IFSYSLSDEQKYWGSVYEPERMLKIEEKDLQAI-EGCNYI----- 115

Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDD-DGNQVWGVK 229
            R K   + G   PG      + G  TY  S+  +    E+ T DRG +     Q+WG  
Sbjct: 116 VRKKNSNFIGEVEPGCRCLVDRKGVTTYIVSKFELTNKGEMRTLDRGHNPVTHEQLWGSL 175

Query: 230 AGPYEF 235
            G +EF
Sbjct: 176 GGVFEF 181


>gi|209523368|ref|ZP_03271923.1| protein of unknown function DUF1001 [Arthrospira maxima CS-328]
 gi|376006618|ref|ZP_09783855.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423064550|ref|ZP_17053340.1| hypothetical protein SPLC1_S203620 [Arthrospira platensis C1]
 gi|209496110|gb|EDZ96410.1| protein of unknown function DUF1001 [Arthrospira maxima CS-328]
 gi|375325005|emb|CCE19608.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406713793|gb|EKD08961.1| hypothetical protein SPLC1_S203620 [Arthrospira platensis C1]
          Length = 198

 Score = 40.8 bits (94), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 48/184 (26%), Positives = 82/184 (44%), Gaps = 17/184 (9%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFRQRFYMV 114
           L+ +  ++ QA  +P  F +IR+   P   E+++G  L YLEQA+    ++P+R R   +
Sbjct: 14  LAADFSNQAQAFENPPFFAHIRVCMRPLPWELLNGLSL-YLEQAYDINLKQPYRVRVLKL 72

Query: 115 KPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPL-PEEVIGDIGEHLTTIHLRRCDR 173
                     +E+ +Y I   EE+          + L PE++    G    T       +
Sbjct: 73  IAVDDH----IEIENYEIDGQEEFFGASREVSRLKALKPEQLNKRCG---CTFITHWTGK 125

Query: 174 GKRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAG 231
           G    ++G   PG      + G  TY  S   +   +   + DRG D + ++ +WG  AG
Sbjct: 126 G----FKGVVEPGKSCMVERKGKLTYLDSSFEI-DGDRFISHDRGMDPETDEHIWGAVAG 180

Query: 232 PYEF 235
           P+EF
Sbjct: 181 PFEF 184


>gi|87124907|ref|ZP_01080754.1| hypothetical protein RS9917_02858 [Synechococcus sp. RS9917]
 gi|86167227|gb|EAQ68487.1| hypothetical protein RS9917_02858 [Synechococcus sp. RS9917]
          Length = 197

 Score = 40.8 bits (94), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 74/192 (38%), Gaps = 33/192 (17%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
           L G   ++ QA  +P  + +I +   P   +  G  +L LEQA+   P +P+R R    +
Sbjct: 12  LCGSFTNEAQAFENPPLYAHINVTVRPLPHLEAG--MLLLEQAYALAPAEPYRIRVLEAR 69

Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
                L     V ++++ + + +      P                 LT I        +
Sbjct: 70  QVGDAL----HVRNHSLSDAQRFWGATLNPH---------------QLTAIGSDDLQLLE 110

Query: 176 RCLYEGSTPPGGFPNSWQNG----------ATYSTSELAVLKNNEIHTWDRGFDDDGN-Q 224
            C Y  S    GF    + G           TY  S+  +     + T DRG D + + Q
Sbjct: 111 GCTYVVSPKGDGFSGEVEPGCRCMVERKGRTTYLVSQFEI-DAAGMSTIDRGHDPETHEQ 169

Query: 225 VWGVKAGPYEFK 236
           VWG  AGP++F+
Sbjct: 170 VWGSLAGPFQFQ 181


>gi|37520751|ref|NP_924128.1| hypothetical protein glr1182 [Gloeobacter violaceus PCC 7421]
 gi|81710612|sp|Q7NLE2.1|CPXT2_GLOVI RecName: Full=Chromophore lyase CpcT/CpeT 2
 gi|35211746|dbj|BAC89123.1| glr1182 [Gloeobacter violaceus PCC 7421]
          Length = 202

 Score = 40.8 bits (94), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 48/184 (26%), Positives = 76/184 (41%), Gaps = 14/184 (7%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
           L+G+  ++EQA   P  F +IR+   P    V     LY EQA       P+R     + 
Sbjct: 13  LAGDFSNREQAWDAPAFFSHIRLCMRPLPWQVFEGYGLYSEQADDYDWAHPYRIVVLNLV 72

Query: 116 PCPKELKCDVEVSSYAIREVEEYKNF---CDRPRDQRPLPEEVIGDIGEHLTTIHLRRCD 172
             P  +   +E  +YA+++   Y       +R R  R  PE++    G    T   +R  
Sbjct: 73  EQPDGI---IECRNYALKDTAPYLGAAREAERARLHRLTPEQLEPLPG---CTFLFKR-- 124

Query: 173 RGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQVWGVKAG 231
             +  L+ G   PG     ++ G        A +  +   + DRG D +   QVWG  +G
Sbjct: 125 --EGSLFRGRVRPGKGCRVFRKGRDTYLDGEATVSADYYKSIDRGRDLESDEQVWGSVSG 182

Query: 232 PYEF 235
           P+ F
Sbjct: 183 PFHF 186


>gi|434405137|ref|YP_007148022.1| CpeT/CpcT family (DUF1001) [Cylindrospermum stagnale PCC 7417]
 gi|428259392|gb|AFZ25342.1| CpeT/CpcT family (DUF1001) [Cylindrospermum stagnale PCC 7417]
          Length = 201

 Score = 40.8 bits (94), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 39/185 (21%), Positives = 77/185 (41%), Gaps = 3/185 (1%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +++ L+GE  ++EQA  +P  + ++R+   P     + S  L+ EQA      +P+RQR 
Sbjct: 10  LSEYLAGEFDNREQAIAEPVWYVHLRLWQRPVNLFPEDSLTLFAEQANIVNLDQPYRQRI 69

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
             ++   ++ +  ++V  Y  ++    K     P     L  + +  +   + ++   + 
Sbjct: 70  MRLRQG-RDSEAPLDVQYYMPKDPTALKGAGSNPALLNALTIDQLDLLPGCILSVTQDKI 128

Query: 172 DRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDG-NQVWGVKA 230
               R  +  S PP    +    G T   S    +     H++D+G D       WG   
Sbjct: 129 -APNRYKFAASAPPDTCCSFTYLGNTVQVSLGFEVTEATFHSYDKGIDSATLKATWGAII 187

Query: 231 GPYEF 235
           GPY +
Sbjct: 188 GPYRY 192


>gi|428202218|ref|YP_007080807.1| hypothetical protein Ple7327_1905 [Pleurocapsa sp. PCC 7327]
 gi|427979650|gb|AFY77250.1| CpeT/CpcT family (DUF1001) [Pleurocapsa sp. PCC 7327]
          Length = 196

 Score = 40.8 bits (94), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 43/187 (22%), Positives = 84/187 (44%), Gaps = 11/187 (5%)

Query: 50  RIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           + +A  ++ +  ++EQA  +P  F +IR+   P    + G   L+LEQA+      P+R 
Sbjct: 8   KTLARWMAADFSNQEQAWANPPLFAHIRVCMRPLPLALLGETSLFLEQAYDFMLDTPYRL 67

Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
           R + +          +E+ +Y +++ E   +F    R+   L +++  D  E +    + 
Sbjct: 68  RVFKLAVVGDR----IELENYKVKDEE---DFFGASRNLERL-QKLTSDRLEKMEGCDMF 119

Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGV 228
               G    ++G   PG      +N  T        +  +++ + DRG D + ++ VWG 
Sbjct: 120 VDWTGSS--FKGYIKPGKNCIVVRNDKTTYLDNSFEVDEHKLFSIDRGLDPETDELVWGS 177

Query: 229 KAGPYEF 235
            AG +EF
Sbjct: 178 IAGAFEF 184


>gi|334116991|ref|ZP_08491083.1| protein of unknown function DUF1001 [Microcoleus vaginatus FGP-2]
 gi|333461811|gb|EGK90416.1| protein of unknown function DUF1001 [Microcoleus vaginatus FGP-2]
          Length = 206

 Score = 40.4 bits (93), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 47/191 (24%), Positives = 76/191 (39%), Gaps = 13/191 (6%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLT--CPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           +A  ++GE  ++EQA  +P  + ++R+     P     + S  L+ EQA      KP+R 
Sbjct: 10  LAQYMAGEFDNREQALAEPAWYVHLRLWQRPLPVPLFSEPSIALFAEQANILELDKPYRP 69

Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDI-GEHLTTIHL 168
           R   ++   +     +E   Y  +++   K     P     L  E I  + G  L+ +  
Sbjct: 70  RILQLRHS-QIAPGLIEAQYYMFKDIAAVKGAGCNPDLLGKLTNEQIQLLPGCTLSVVVQ 128

Query: 169 RRCDRGKRCLYEGSTPPG---GFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQ 224
                G R  +  S P G   GF    QN       E A     E  ++D+G     G  
Sbjct: 129 NLASNGYR--FRASLPEGTRCGFAYGGQNYQVDLGFEAA---GEEFLSYDKGISPTTGKA 183

Query: 225 VWGVKAGPYEF 235
           +WG   GP+ F
Sbjct: 184 IWGALMGPFRF 194


>gi|254421325|ref|ZP_05035043.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
 gi|196188814|gb|EDX83778.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
          Length = 210

 Score = 40.4 bits (93), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 42/198 (21%), Positives = 83/198 (41%), Gaps = 12/198 (6%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYL-EQAFWRTPQKPFRQR 110
           +A  L+GE  ++ QA+ +P  + ++R+   P   +    K  +L EQ    + Q P+RQR
Sbjct: 9   LASYLAGEFENQRQAAAEPAWYVHLRLWQRPIQALSSEQKFTFLLEQQNVISGQPPYRQR 68

Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR 170
              +    +     +E   +A+++   YK     P+  + + ++ +  +      I  + 
Sbjct: 69  ILQITDSAE--VNQLEGQYFALKDPSRYKGAGQDPKILKGISQKDLVTLPNSGAVICYQA 126

Query: 171 CDRGKRCLYEGSTPP-----GGFPNSWQNGATYSTSELAVLKNNEI--HTWDRGFD-DDG 222
              G R  Y+  +         F    Q    Y   ++ + ++N I    +D+G D D G
Sbjct: 127 LSNGGREGYQFESALRDNQLCSFSYGGQTKYVYLGFDV-IPQDNTILLKVYDKGIDPDTG 185

Query: 223 NQVWGVKAGPYEFKPAPS 240
             +WG   GP++     S
Sbjct: 186 RGLWGALMGPFQLTKQAS 203


>gi|428317171|ref|YP_007115053.1| protein of unknown function DUF1001 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428240851|gb|AFZ06637.1| protein of unknown function DUF1001 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 206

 Score = 40.4 bits (93), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 76/191 (39%), Gaps = 13/191 (6%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLT--CPAAEMVDGSKLLYLEQAFWRTPQKPFRQ 109
           +A  ++GE  ++EQA  +P  + ++RM     P     + S  L+ EQA      KP+R 
Sbjct: 10  LAQYMAGEFDNREQALAEPTWYVHLRMWQRPLPVPLFSEPSIALFAEQANILELDKPYRP 69

Query: 110 RFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDI-GEHLTTIHL 168
           R   ++   +     +E   Y   ++   K     P     L +E I  + G  L+    
Sbjct: 70  RIVQLRHS-QIAPGLIEAQYYMFHDITAVKGAGCNPDLLGKLTKEQIQLLPGCTLSVAVQ 128

Query: 169 RRCDRGKRCLYEGSTPPG---GFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQ 224
                G R  +  S P G   GF    QN       E A     E  ++D+G +   G  
Sbjct: 129 NLASNGYR--FRASLPEGTRCGFAYGGQNYQVDLGFEAA---GEEFLSYDKGINPTTGKA 183

Query: 225 VWGVKAGPYEF 235
           +WG   GP+ F
Sbjct: 184 IWGALMGPFRF 194


>gi|427417732|ref|ZP_18907915.1| protein of unknown function DUF1001, CpeT/CpcT family [Leptolyngbya
           sp. PCC 7375]
 gi|425760445|gb|EKV01298.1| protein of unknown function DUF1001, CpeT/CpcT family [Leptolyngbya
           sp. PCC 7375]
          Length = 196

 Score = 40.0 bits (92), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 47/193 (24%), Positives = 86/193 (44%), Gaps = 33/193 (17%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
           ++ +  ++ QA  +P  F +IR+   P          LYLEQA+     +P+R R  +++
Sbjct: 14  MAADFSNQAQAIENPPFFAHIRVCMRPLPSNFLSGNCLYLEQAYDFMLGQPYRVR--VLR 71

Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG---DIGE--HLTTIHLRR 170
             P++    +E  S+               +DQ    E V+G   D+ +   LT  H+ +
Sbjct: 72  FLPQDDHILLENYSF---------------KDQ----EAVVGAGRDLAKLATLTADHIEK 112

Query: 171 ---CDRGKRCL---YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ 224
              CD   R     + G   PG      + G T        + ++++ ++DRG D + ++
Sbjct: 113 MTGCDMTVRWTGHSFIGKVVPGKNCIVVRQGKTSYLDNEFEITDDKLISYDRGRDPETDE 172

Query: 225 -VWGVKAGPYEFK 236
            +WG  AGP+EF+
Sbjct: 173 LLWGSIAGPFEFE 185


>gi|428206109|ref|YP_007090462.1| hypothetical protein Chro_1061 [Chroococcidiopsis thermalis PCC
           7203]
 gi|428008030|gb|AFY86593.1| protein of unknown function DUF1001 [Chroococcidiopsis thermalis
           PCC 7203]
          Length = 197

 Score = 40.0 bits (92), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 47/188 (25%), Positives = 85/188 (45%), Gaps = 17/188 (9%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  ++ +  ++ QA  +P  F +IR+   P    +      Y+EQA+      P+R R 
Sbjct: 10  LARWMAADFSNQAQAFENPPFFAHIRVCMRPLPLNLLSGVSFYVEQAYDYMLGNPYRVRV 69

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHL--TTIHLR 169
             +          +E+ +Y I++ EE+       RD + L + +  D  E L    + ++
Sbjct: 70  LKLLQAGDR----IEIENYTIKQEEEFHG---ASRDLKRL-QALSSDRLEKLPGCNMIVQ 121

Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
             D G    ++G+  PG      + G  TY  SE  +   ++  + DRG D + ++ VWG
Sbjct: 122 WVDNG----FKGTVEPGKCCMVVRKGKTTYLDSEFEI-DGDKFISRDRGRDPETDEHVWG 176

Query: 228 VKAGPYEF 235
             AGP+ F
Sbjct: 177 SVAGPFYF 184


>gi|427714031|ref|YP_007062655.1| hypothetical protein Syn6312_3060 [Synechococcus sp. PCC 6312]
 gi|427378160|gb|AFY62112.1| CpeT/CpcT family (DUF1001) [Synechococcus sp. PCC 6312]
          Length = 192

 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 77/188 (40%), Gaps = 21/188 (11%)

Query: 54  DSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYM 113
           + L+GE  ++ QA   P  F ++R+      + + G + ++ EQA      KP+RQR  +
Sbjct: 16  ECLTGEFQNQAQAQAQPTWFVHLRLWQRAIPQGIQGQRAIFAEQANALYLDKPYRQRILV 75

Query: 114 VKPCPKELK---CDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG-DIGEHLTTIHLR 169
           ++P P ++    C    S +A   +   +    R  D  PLP  V+  +  +H     L 
Sbjct: 76  IQPEPLQIHYWACK-SPSQWAGAGMAPSRLQGLRESDLDPLPGCVLNVEYADHCFQATL- 133

Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDD-DGNQVWGV 228
                 RC          FP   Q         +   K     ++DRG D   G  +WG 
Sbjct: 134 --PETSRCC---------FPYQGQERQVVLGFRVTAQK---FWSFDRGVDPVTGQGLWGA 179

Query: 229 KAGPYEFK 236
             GPYEF+
Sbjct: 180 LIGPYEFE 187


>gi|443327631|ref|ZP_21056251.1| CpeT/CpcT family (DUF1001) [Xenococcus sp. PCC 7305]
 gi|442792723|gb|ELS02190.1| CpeT/CpcT family (DUF1001) [Xenococcus sp. PCC 7305]
          Length = 209

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 51/215 (23%), Positives = 87/215 (40%), Gaps = 37/215 (17%)

Query: 38  RLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQ 97
           R T+S    D A+++A   S +K    QA  +  ++ +IR+   P      G+   Y EQ
Sbjct: 9   RETESTLLIDLAKVMAGDFSNQK----QAFANSKDYAHIRVFFRPLPWEFFGAIGFYSEQ 64

Query: 98  AFWRTPQKPFRQRFYMVKPCPKELKCD-VEVSSYAIREVEEYKNFCDRPRDQRPLPEEVI 156
            +      P+RQ  + +       K D + + +Y++               Q P+     
Sbjct: 65  VYDYDLWTPYRQGIHRLVD-----KGDHIYIENYSL---------------QDPVIYAGA 104

Query: 157 GDIGEHLTTIHLRRCDRGKRC---------LYEGSTPPGGFPNSWQNGA-TYSTSELAVL 206
           G   + + TI     DR   C         ++ G+  PG      + G  TY  S++ + 
Sbjct: 105 GHNKDIIQTISPTSIDRRYNCSMVFKQDGEVFRGAVEPGNKCLIHRKGVETYLVSDVEIT 164

Query: 207 KNNEIHTWDRGFD-DDGNQVWGVKAGPYEFKPAPS 240
           +   I + DRG D + G Q+WG   GP +F+   S
Sbjct: 165 ETTWI-SLDRGMDLNTGEQIWGSAKGPLKFEKRAS 198


>gi|254412444|ref|ZP_05026218.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196180754|gb|EDX75744.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 196

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 46/177 (25%), Positives = 82/177 (46%), Gaps = 15/177 (8%)

Query: 62  SKEQASRDPDNFFNIRMLTCPA-AEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
           +++QA  +P  + +IR+   P  A ++DG    ++EQA+      P+R R   +      
Sbjct: 20  NQQQAFDNPPFYAHIRVFMRPLPASLLDGVS-FFVEQAYDYLLNAPYRLRVLKLMVVGDH 78

Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
               +E+ +Y ++E E+   F    RD   L + +  D  + L   ++     GK   ++
Sbjct: 79  ----IEIENYTVKEPEK---FYGASRDLNRL-QTLTTDQLDKLPGCNMIVEWTGKS--FQ 128

Query: 181 GSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYEF 235
           G   PG     ++ G  TY  S   + +N  I + DRG D   ++ +WG  AGP+ F
Sbjct: 129 GHVEPGKACKVFRKGKETYLDSTFEIDENQFI-SLDRGRDPVTDEHIWGSIAGPFYF 184


>gi|411118829|ref|ZP_11391209.1| protein of unknown function DUF1001, CpeT/CpcT family
           [Oscillatoriales cyanobacterium JSC-12]
 gi|410710692|gb|EKQ68199.1| protein of unknown function DUF1001, CpeT/CpcT family
           [Oscillatoriales cyanobacterium JSC-12]
          Length = 196

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 50/177 (28%), Positives = 85/177 (48%), Gaps = 15/177 (8%)

Query: 62  SKEQASRDPDNFFNIRMLTCPA-AEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
           ++ QA  +P  F +IR+   P   E++ G   L +EQA+      P+R R  ++K   ++
Sbjct: 20  NQAQAFENPPFFAHIRVCMRPLPLELLSGVSFL-VEQAYDYNLNDPYRLR--VLKLITQD 76

Query: 121 LKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCLYE 180
               +E+ +Y ++  EE   F    RD + L    I  + E L   +++    G +  ++
Sbjct: 77  --GHIEIENYCVKGEEE---FFGASRDLKRLSALRIDQL-EMLPGCNMQVEWTGSQ--FK 128

Query: 181 GSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYEF 235
           G   PG      + G  TY  SE  +  +  I + DRG D + ++ VWG  AGP+EF
Sbjct: 129 GRVQPGKACIVVRKGKTTYLDSEFEIDADKFI-SHDRGRDPETDEHVWGSVAGPFEF 184


>gi|307153848|ref|YP_003889232.1| hypothetical protein Cyan7822_4032 [Cyanothece sp. PCC 7822]
 gi|306984076|gb|ADN15957.1| protein of unknown function DUF1001 [Cyanothece sp. PCC 7822]
          Length = 206

 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 46/194 (23%), Positives = 78/194 (40%), Gaps = 22/194 (11%)

Query: 53  ADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFY 112
           A  L+G+  + +QA  +P NF +IR+   P      G    Y EQ +      P+RQ  +
Sbjct: 10  ARCLAGDFSNSQQAWENPQNFAHIRIFFRPLPFDFFGGIGFYSEQTYDYDLWSPYRQGIH 69

Query: 113 MVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCD 172
            +     E K  + + +Y +++   Y               E++  +  H      RR +
Sbjct: 70  RL----VEQKEQIYIENYGLKDAYLYAG--------SGHNREILATLKPHCIE---RRYN 114

Query: 173 -----RGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGN-QVW 226
                R +  L+ G+              TY  SE+ + +N  + + DRG D + + QVW
Sbjct: 115 CSMVFRQQGNLWRGNVEGNQCLIERNGKQTYLVSEVELTENTWV-SLDRGMDINTHEQVW 173

Query: 227 GVKAGPYEFKPAPS 240
           G   GP  F+   S
Sbjct: 174 GSVHGPLRFEKRTS 187


>gi|434387951|ref|YP_007098562.1| CpeT/CpcT family (DUF1001) [Chamaesiphon minutus PCC 6605]
 gi|428018941|gb|AFY95035.1| CpeT/CpcT family (DUF1001) [Chamaesiphon minutus PCC 6605]
          Length = 203

 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 49/188 (26%), Positives = 81/188 (43%), Gaps = 17/188 (9%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPA-AEMVDGSKLLYLEQAFWRTPQKPFRQR 110
           +A  ++G+  ++ QA  +P  F +IR+   P   E+ DG  L +LEQA+    Q P+R R
Sbjct: 10  LARWMAGDFSNQAQAIENPPFFAHIRVCMRPLPYELFDGVSL-FLEQAYDFMLQNPYRLR 68

Query: 111 FYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRD-QRPLPEEVIGDIGEHLTTIHLR 169
                   +     +E+  Y +    E+      P   Q+   + +I   G  + T    
Sbjct: 69  VLNFVVVNER----IEIEHYTLDPEAEFFGAARNPEQLQQITRDNIIKMPGCTMIT---- 120

Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
                    ++G   PG      +NG  TY  +   +   + I + DRG D   ++ VWG
Sbjct: 121 ---EWTGSTFKGYVEPGKGCKVTRNGKDTYLDNSFEISPGHLI-SLDRGRDIATDEHVWG 176

Query: 228 VKAGPYEF 235
             AGP+EF
Sbjct: 177 SIAGPFEF 184


>gi|186681237|ref|YP_001864433.1| CpeT protein [Nostoc punctiforme PCC 73102]
 gi|186463689|gb|ACC79490.1| CpeT protein [Nostoc punctiforme PCC 73102]
          Length = 209

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 42/190 (22%), Positives = 81/190 (42%), Gaps = 19/190 (10%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
           +SG+  + +Q+  +P  + +I +   P       +   Y EQA+      P+RQ  + + 
Sbjct: 22  MSGDFSNYKQSFANPQLYAHIHIFFRPLPIEFFSAIGFYSEQAYDHDLWTPYRQGVHKLV 81

Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
            C       + + +Y++++   Y         +  + + +  D  E      +     G+
Sbjct: 82  DCSDR----IYIENYSLKDQMLYAG----AARELDILKTITPDCIERRYNCSMVFMREGE 133

Query: 176 RCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTW---DRGFDDDGN-QVWGVKA 230
             ++ GS  PG      +NG  TY  S++ + +    HTW   D+G D + + Q+WG  A
Sbjct: 134 --MFRGSVEPGNQCLINRNGCQTYLVSDVEITE----HTWISLDKGMDIETHKQIWGSTA 187

Query: 231 GPYEFKPAPS 240
           GP  F+   S
Sbjct: 188 GPLRFEKRES 197


>gi|22299699|ref|NP_682946.1| hypothetical protein tlr2156 [Thermosynechococcus elongatus BP-1]
 gi|81742471|sp|Q8DH04.1|CPXT_THEEB RecName: Full=Chromophore lyase CpcT/CpeT
 gi|22295883|dbj|BAC09708.1| tlr2156 [Thermosynechococcus elongatus BP-1]
          Length = 196

 Score = 38.9 bits (89), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 45/182 (24%), Positives = 78/182 (42%), Gaps = 13/182 (7%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
           ++ +  ++ QA  +P  + +IR+   P    V     LY+EQA+      P+R R   + 
Sbjct: 14  MAADFSNQAQAFENPPFYAHIRVCMRPLPRGVLEGIALYVEQAYDYLLSVPYRTRVLELM 73

Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGK 175
           P        + + +Y ++   + K F    RD++ L      D+ E L   ++     G 
Sbjct: 74  PANDH----IVIKNYVLK---DEKRFFGAARDRQRLQAMTADDL-ELLCGCNMLTYWTGH 125

Query: 176 RCLYEGSTPPGGFPNSWQNG-ATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPY 233
              + G   PG      + G  TY  S   +  +  I + DRG D + ++ VWG  AGP+
Sbjct: 126 S--FRGEVEPGKACKVVRKGRETYLDSTFEIDGDRFI-SHDRGRDPETDEHVWGSVAGPF 182

Query: 234 EF 235
            F
Sbjct: 183 HF 184


>gi|427736066|ref|YP_007055610.1| hypothetical protein Riv7116_2555 [Rivularia sp. PCC 7116]
 gi|427371107|gb|AFY55063.1| CpeT/CpcT family (DUF1001) [Rivularia sp. PCC 7116]
          Length = 197

 Score = 38.9 bits (89), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 37/69 (53%)

Query: 48  HARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPF 107
           +  ++A+ L+GE  +KEQA  +P  + N+ M     +   + S  L+ EQA     ++P+
Sbjct: 6   NLNVLANYLAGEFENKEQAMTEPAWYVNLLMWQRTVSLFQEDSLALFAEQANILNLEQPY 65

Query: 108 RQRFYMVKP 116
           RQR   + P
Sbjct: 66  RQRLLRITP 74


>gi|160331181|ref|XP_001712298.1| cpeT-like protein [Hemiselmis andersenii]
 gi|159765745|gb|ABW97973.1| cpeT-like protein [Hemiselmis andersenii]
          Length = 221

 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 55/221 (24%), Positives = 102/221 (46%), Gaps = 29/221 (13%)

Query: 37  KRLTKSKTRWD-HARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYL 95
           +  +K+ T+      + A+ L+G   ++EQA++ P ++ +I++   P  + +      Y 
Sbjct: 18  RNFSKNSTKMSCEVELFANYLAGIWTNEEQATKFPTDWSHIQLGFYPLDQSLLNGYSFYT 77

Query: 96  EQAFWRTPQKPFRQRFYMVKPCPKELKCDV-EVSSYAIREVEEYKNFCDRPRDQRPLPEE 154
           E A   +  +P++    ++     E K ++ EV S++I+  E++      P     L ++
Sbjct: 78  ESANEFSLDEPYKSGVTLL-----EKKGEIIEVKSFSIKGPEDFWYGTYEPSLLANLTKD 132

Query: 155 -VIGDI-GEHL-------TTIHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAV 205
            ++ D  G +L         + L +   GK+C+     P  G P       TY  S  AV
Sbjct: 133 RLVNDKDGCNLEFKYDAKKKLFLGKTKSGKQCI----IPREGNP-------TYLDST-AV 180

Query: 206 LKNNEIHTWDRGFD-DDGNQVWGVKAGPYEFKPAPSSSYSD 245
           LK NE  + D G + ++  +VWG  AGP+ F    S S S+
Sbjct: 181 LKENEYSSLDIGRNIENDEKVWGPSAGPFVFLKKKSFSISN 221


>gi|427722465|ref|YP_007069742.1| hypothetical protein Lepto7376_0476 [Leptolyngbya sp. PCC 7376]
 gi|427354185|gb|AFY36908.1| protein of unknown function DUF1001 [Leptolyngbya sp. PCC 7376]
          Length = 208

 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 47/195 (24%), Positives = 74/195 (37%), Gaps = 22/195 (11%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A +++G   +++Q+  D  N+ +IR+   P           Y EQ +      P+RQ  
Sbjct: 21  LARTMAGGFSNQKQSFADSKNYAHIRVFFRPLPWEFFNGIGFYSEQVYDYDLWLPYRQGI 80

Query: 112 YMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRC 171
           +   P   E    V + +Y++++   Y          R +  EVI            RRC
Sbjct: 81  HRFVPHGDE----VYIENYSLKDALLYAGAGRNLDILRSIKPEVIE-----------RRC 125

Query: 172 D-----RGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFD-DDGNQV 225
           +     R +   ++G           +   TY  S+ A         WDRG D D   QV
Sbjct: 126 NCSMIFRPENNGFQGRVEGKCCVIEKRGKRTYLVSD-ARFTETSFIGWDRGLDVDTDEQV 184

Query: 226 WGVKAGPYEFKPAPS 240
           WG   GP EF    S
Sbjct: 185 WGSAIGPLEFDKTDS 199


>gi|428304753|ref|YP_007141578.1| hypothetical protein Cri9333_1163 [Crinalium epipsammum PCC 9333]
 gi|428246288|gb|AFZ12068.1| protein of unknown function DUF1001 [Crinalium epipsammum PCC 9333]
          Length = 196

 Score = 38.5 bits (88), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 40/193 (20%), Positives = 80/193 (41%), Gaps = 27/193 (13%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  ++ +  ++ QA  +P  F +IR+   P          L++EQA+  T   P+R R 
Sbjct: 10  LARWMAADFSNQAQAFENPPFFAHIRVAMRPLPPEFLSGVSLFVEQAYDYTLNDPYRLR- 68

Query: 112 YMVKPCPKELKCD--VEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
                  K +  D  +E+ +Y++ + +++     +P            ++ + LT  HL 
Sbjct: 69  -----VLKLIVVDDHIEIENYSVNQEQQFYGASRKP------------ELLKALTVEHLE 111

Query: 170 RCDRGKRCL------YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGN 223
           +       +      ++G   PG      + G T        +  ++  + DRG D D +
Sbjct: 112 KLPGCNMIVEWTGNSFKGYVQPGKACIVVRKGHTTYLDSTFEIDEHKFISLDRGRDPDTD 171

Query: 224 Q-VWGVKAGPYEF 235
           + +WG   GP+ F
Sbjct: 172 EHIWGSFGGPFHF 184


>gi|326783641|ref|YP_004323856.1| antenna protein [Synechococcus phage Syn19]
 gi|310005144|gb|ADO99534.1| antenna protein [Synechococcus phage Syn19]
          Length = 153

 Score = 38.1 bits (87), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 23/31 (74%), Gaps = 1/31 (3%)

Query: 206 LKNNEIHTWDRGFDDDGNQVWGVKAGPYEFK 236
           L  ++IHT D+G+D DGN +WG K  PY+FK
Sbjct: 120 LFGSKIHTCDQGYDSDGNMIWGSK-NPYKFK 149


>gi|428298429|ref|YP_007136735.1| hypothetical protein Cal6303_1725 [Calothrix sp. PCC 6303]
 gi|428234973|gb|AFZ00763.1| protein of unknown function DUF1001 [Calothrix sp. PCC 6303]
          Length = 197

 Score = 38.1 bits (87), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 47/189 (24%), Positives = 83/189 (43%), Gaps = 17/189 (8%)

Query: 52  VADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRF 111
           +A  +S +  ++EQA  +P  F +IR+   P    +     L+LEQA+     +P+R R 
Sbjct: 10  LARWMSADFSNQEQAYENPPFFAHIRVCIRPLPFELFSEVGLFLEQAYDFALNQPYRMRV 69

Query: 112 YMVKPCPKELKCDVEVSSYAIRE--VEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLR 169
                    +K   E    AI    + E +N+    RD   L +  + ++ + +   ++ 
Sbjct: 70  ---------MKLTTENDHIAITHFTINEEQNYYGASRDLSRLAKLSVDEL-KPMPGCNMI 119

Query: 170 RCDRGKRCLYEGSTPPGGFPNSWQNGA-TYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWG 227
               GK   ++G   PG      ++G  TY  +E  +    E  + DRG D + N+  WG
Sbjct: 120 VQWTGKS--FKGHVEPGKGCRVVRDGKETYLDNEFEI-DEQEFFSLDRGRDLETNEHAWG 176

Query: 228 VKAGPYEFK 236
             AG + F+
Sbjct: 177 SIAGAFHFR 185


>gi|427702904|ref|YP_007046126.1| hypothetical protein Cyagr_1634 [Cyanobium gracile PCC 6307]
 gi|427346072|gb|AFY28785.1| CpeT/CpcT family (DUF1001) [Cyanobium gracile PCC 6307]
          Length = 197

 Score = 38.1 bits (87), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 50/192 (26%), Positives = 82/192 (42%), Gaps = 33/192 (17%)

Query: 56  LSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVK 115
           LSG   ++ QA  +P  + +I +   P  ++  GS  L LEQ++  +P  P+R R    +
Sbjct: 12  LSGGFSNQSQAFDNPPLYAHILVKFRPLPQLAPGS--LLLEQSYAISPGTPYRIRVLRAE 69

Query: 116 PCPKELKCDVEVSSYAIREVEEYKNFCD---RPR---DQRPLPEE----VIGDIGEHLTT 165
               EL     + + A+ E + +    +   R R   D   LP E    V+ ++G+    
Sbjct: 70  RRDGELI----IHNQALHEEQRFWGAIEDEGRRRSIGDADLLPLEGCTYVVREVGDGFAG 125

Query: 166 IHLRRCDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGN-Q 224
                 + G RCL E            +    Y  S   +  +  + T DRG D   + Q
Sbjct: 126 ----EVEPGCRCLVE-----------RKGSVAYLVSSFEI-DSRGMRTIDRGHDPATHEQ 169

Query: 225 VWGVKAGPYEFK 236
           +WG  AGP+EF+
Sbjct: 170 LWGSLAGPFEFE 181


>gi|434392355|ref|YP_007127302.1| protein of unknown function DUF1001 [Gloeocapsa sp. PCC 7428]
 gi|428264196|gb|AFZ30142.1| protein of unknown function DUF1001 [Gloeocapsa sp. PCC 7428]
          Length = 217

 Score = 38.1 bits (87), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 49/209 (23%), Positives = 82/209 (39%), Gaps = 35/209 (16%)

Query: 38  RLTKSKTRWDHARIVADSLSGEKFSKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQ 97
           ++T S      AR +A   S +     QA  +P  F +IR+   P           Y+EQ
Sbjct: 21  QMTHSTDIATLARWMAADFSNQA----QAFENPPFFAHIRVCMRPLPLETLSGVSFYVEQ 76

Query: 98  AFWRTPQKPFRQRFYMVKPCPKELKCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIG 157
           A+      P+R R   +          + + +Y I++ +++      P            
Sbjct: 77  AYDYMLNNPYRVRVLKLVNAGDR----ITIENYLIKDEQQFYGASREP------------ 120

Query: 158 DIGEHLTTIHLRRCDRGKRC---------LYEGSTPPG-GFPNSWQNGATYSTSELAVLK 207
              + L  I L R ++   C          ++GS  PG G     ++  TY  SE  + +
Sbjct: 121 ---QRLNAITLERLEKLPGCNMLVEWTGTSFKGSVEPGKGCIVVRKDKKTYLDSEFEIDQ 177

Query: 208 NNEIHTWDRGFDDDGNQ-VWGVKAGPYEF 235
              I + DRG D + ++ VWG  AGP+ F
Sbjct: 178 ERFI-SLDRGRDPETDEHVWGSVAGPFHF 205


>gi|332706197|ref|ZP_08426266.1| protein of unknown function, DUF1001 [Moorea producens 3L]
 gi|332355034|gb|EGJ34505.1| protein of unknown function, DUF1001 [Moorea producens 3L]
          Length = 204

 Score = 37.7 bits (86), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 50/210 (23%), Positives = 89/210 (42%), Gaps = 38/210 (18%)

Query: 62  SKEQASRDPDNFFNIRMLTCPAA-EMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKE 120
           ++EQA  +P  + +IR+   P    ++DG  L +LEQA+      P+R R   +     E
Sbjct: 20  NQEQAFENPPLYAHIRVCMRPLPWSVLDGCSL-FLEQAYDYMLNTPYRVRVLKI----VE 74

Query: 121 LKCDVEVSSYAI----------REVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRR 170
           L   +++ +Y +          R++E  K+   +  ++ P     +   G          
Sbjct: 75  LDGQIKIENYKVQDEKLLYGASRDLERLKSLSAQQLEKLPGCTFNVAWTGHSFKA----E 130

Query: 171 CDRGKRCLYEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVK 229
            + GK CL   ++             TY  S   V++   I + DRG D   ++ +WG  
Sbjct: 131 VEPGKACLVVRNS-----------KETYLDSSFEVMEEKLI-SLDRGRDPQTDELIWGSI 178

Query: 230 AGPYEFKPAPSSSYSDMFSPLNFPPQQFLE 259
           AG + F  AP +S++   S +  P  +F E
Sbjct: 179 AGAFHF--APRTSFA---SEVLIPTPEFTE 203


>gi|67922199|ref|ZP_00515713.1| Protein of unknown function DUF1001 [Crocosphaera watsonii WH 8501]
 gi|416389332|ref|ZP_11685331.1| hypothetical protein CWATWH0003_2153 [Crocosphaera watsonii WH
           0003]
 gi|67855902|gb|EAM51147.1| Protein of unknown function DUF1001 [Crocosphaera watsonii WH 8501]
 gi|357264241|gb|EHJ13154.1| hypothetical protein CWATWH0003_2153 [Crocosphaera watsonii WH
           0003]
          Length = 196

 Score = 37.7 bits (86), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 42/179 (23%), Positives = 76/179 (42%), Gaps = 17/179 (9%)

Query: 62  SKEQASRDPDNFFNIRMLTCPAAEMVDGSKLLYLEQAFWRTPQKPFRQRFYMVKPCPKEL 121
           ++EQA  +P  + +IR+   P          L+LEQA+     +P+R R   +K     +
Sbjct: 20  NQEQAFENPPFYAHIRVCIRPLPITNFPEPSLFLEQAYDYALDQPYRVRVLRLKIVEGRM 79

Query: 122 KCDVEVSSYAIREVEEYKNFCDRPRDQRPLPEEVIGDIGEHLTTIHLRRCDRGKRCL--- 178
               E+ +Y ++E E    F    RD+  L +    D+        ++ CD         
Sbjct: 80  ----ELENYKLKEKE---GFLGAARDREKLKKITPDDLE------LMQGCDMFIDWTGNS 126

Query: 179 YEGSTPPGGFPNSWQNGATYSTSELAVLKNNEIHTWDRGFDDDGNQ-VWGVKAGPYEFK 236
           ++G   PG     ++ G          + + ++ + DRG+D   ++ VWG  AG + FK
Sbjct: 127 FKGVVKPGKNCLVFRKGKETYLDNSFEIDDQKLISLDRGYDPVTDELVWGSVAGAFNFK 185


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.136    0.429 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,913,092,488
Number of Sequences: 23463169
Number of extensions: 226888879
Number of successful extensions: 400820
Number of sequences better than 100.0: 190
Number of HSP's better than 100.0 without gapping: 55
Number of HSP's successfully gapped in prelim test: 135
Number of HSP's that attempted gapping in prelim test: 400580
Number of HSP's gapped (non-prelim): 203
length of query: 270
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 130
effective length of database: 9,074,351,707
effective search space: 1179665721910
effective search space used: 1179665721910
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 75 (33.5 bits)