BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 024283
         (269 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|356543780|ref|XP_003540338.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
           [Glycine max]
          Length = 297

 Score =  436 bits (1121), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 209/269 (77%), Positives = 240/269 (89%), Gaps = 1/269 (0%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
           MA+LTS++F+++   S +RKV +SSTR L S+ E F FRT    H V  RAS+S S+M++
Sbjct: 1   MAALTSLSFSAVTHCS-ERKVTLSSTRFLASSSEIFGFRTDFSYHYVGVRASNSPSKMVV 59

Query: 61  QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
           QCMS+ATDVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFAL
Sbjct: 60  QCMSSATDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFAL 119

Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
           GFVT+YD+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q  +SLVEF 
Sbjct: 120 GFVTIYDKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQKPTSLVEFS 179

Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           SKEGEVEG+LKDIAERA GKG FSYSRFFAVGLFRLLELANATEPT+L+KLC  LN+NKR
Sbjct: 180 SKEGEVEGILKDIAERAGGKGEFSYSRFFAVGLFRLLELANATEPTILDKLCVALNINKR 239

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           SVDRDLDVYR LLSKL+QAKELLKEY+DR
Sbjct: 240 SVDRDLDVYRILLSKLVQAKELLKEYIDR 268


>gi|356549970|ref|XP_003543363.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
           [Glycine max]
          Length = 297

 Score =  434 bits (1116), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 207/269 (76%), Positives = 241/269 (89%), Gaps = 1/269 (0%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
           MA+LTS++F+++   S +RKV +SSTR L S+ E F FRT    H V  RAS+S+S+M++
Sbjct: 1   MAALTSLSFSAVTHCS-ERKVTLSSTRFLASSSELFGFRTDFSYHYVGVRASNSASKMVV 59

Query: 61  QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
           QCMS+ATDVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFAL
Sbjct: 60  QCMSSATDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFAL 119

Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
           GFVT+YD+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q+ +SLVEF 
Sbjct: 120 GFVTIYDKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQSPTSLVEFS 179

Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           SKEGE E +LKDIAERA GKG FSYSRFFAVGLFRL+ELANATEPT+L+KLCA LN+NKR
Sbjct: 180 SKEGEAERILKDIAERAGGKGEFSYSRFFAVGLFRLVELANATEPTILDKLCAALNINKR 239

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           SVDRDLDVYR LLSKL+QAKELLKEY+DR
Sbjct: 240 SVDRDLDVYRILLSKLVQAKELLKEYIDR 268


>gi|255553917|ref|XP_002517999.1| Protein THYLAKOID FORMATION1, chloroplast precursor, putative
           [Ricinus communis]
 gi|223542981|gb|EEF44517.1| Protein THYLAKOID FORMATION1, chloroplast precursor, putative
           [Ricinus communis]
          Length = 299

 Score =  434 bits (1115), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 210/271 (77%), Positives = 239/271 (88%), Gaps = 3/271 (1%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLF-CHCVRFRAS-SSSSRM 58
           MA++TSV+F++I Q+  +RK   S +RS  SNF+ FRF    F CH V  +AS SSSSRM
Sbjct: 1   MAAVTSVSFSAIAQSFTERKSVASWSRSFSSNFDSFRFPVGGFSCHYVGVKASNSSSSRM 60

Query: 59  IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVF 118
           +I CMSTATDVPPTV+ETK NFL  YK+PIPSIYNTVLQELIVQQHLMRYKR+Y+YDPVF
Sbjct: 61  LIHCMSTATDVPPTVSETKFNFLNSYKKPIPSIYNTVLQELIVQQHLMRYKRSYRYDPVF 120

Query: 119 ALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE 178
           ALGFVTVYD+LM+GYPS+EDREAIFQAYI AL E+PEQYRIDA+KLE+WAR QT SSLV+
Sbjct: 121 ALGFVTVYDQLMQGYPSDEDREAIFQAYINALNEEPEQYRIDAKKLEDWARSQTPSSLVD 180

Query: 179 FPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVN 238
           F SKEGEVEG+LKDIAERA G G+FSYSRFFA+GLFRLLEL+N+TEPTVLEKLCA LN+N
Sbjct: 181 FSSKEGEVEGILKDIAERA-GNGSFSYSRFFAIGLFRLLELSNSTEPTVLEKLCAALNIN 239

Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           KR VDRDLDVYRNLLSKL+QAKELLKEYVDR
Sbjct: 240 KRGVDRDLDVYRNLLSKLVQAKELLKEYVDR 270


>gi|449438054|ref|XP_004136805.1| PREDICTED: protein THYLAKOID FORMATION 1, chloroplastic-like
           [Cucumis sativus]
 gi|449493105|ref|XP_004159194.1| PREDICTED: protein THYLAKOID FORMATION 1, chloroplastic-like
           [Cucumis sativus]
          Length = 298

 Score =  433 bits (1114), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 208/269 (77%), Positives = 237/269 (88%), Gaps = 2/269 (0%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
           MA++ S++F+++ Q S  R++ + S+RS  SNF GF FRTS+F H  R RAS+ SSRM+I
Sbjct: 1   MAAVNSISFSTLNQCS-DRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVI 59

Query: 61  QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
            CMS  TDV  TVAETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTY+YDPVFAL
Sbjct: 60  HCMSAGTDVT-TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFAL 118

Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
           GFVTVYD+LMEGYPS+EDREAIFQAYI AL EDPEQYRIDA+K EEWAR QTA+SLVEF 
Sbjct: 119 GFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFA 178

Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           S+EGEVE +LKDIAERA  KGNFSYSRFFA+GLFRLLELANATEP++LEKLCA LN++K+
Sbjct: 179 SREGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKK 238

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
            VDRDLDVYRNLLSKL+QAKELLKEYVDR
Sbjct: 239 GVDRDLDVYRNLLSKLVQAKELLKEYVDR 267


>gi|255636566|gb|ACU18621.1| unknown [Glycine max]
          Length = 297

 Score =  433 bits (1113), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 207/269 (76%), Positives = 241/269 (89%), Gaps = 1/269 (0%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
           MA+LTS++F+++   S +RKV +SSTR L S+ E F FRT    H V  RAS+S+S+M++
Sbjct: 1   MAALTSLSFSAVIHCS-ERKVTLSSTRFLASSSELFGFRTDFSYHYVGVRASNSASKMVV 59

Query: 61  QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
           QCMS+ATDVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFAL
Sbjct: 60  QCMSSATDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFAL 119

Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
           GFVT+YD+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q+ +SLVEF 
Sbjct: 120 GFVTIYDKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQSPTSLVEFS 179

Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           SKEGE E +LKDIAERA GKG FSYSRFFAVGLFRL+ELANATEPT+L+KLCA LN+NKR
Sbjct: 180 SKEGEAERILKDIAERAGGKGEFSYSRFFAVGLFRLVELANATEPTILDKLCAALNINKR 239

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           SVDRDLDVYR LLSKL+QAKELLKEY+DR
Sbjct: 240 SVDRDLDVYRILLSKLVQAKELLKEYIDR 268


>gi|224124656|ref|XP_002319386.1| predicted protein [Populus trichocarpa]
 gi|222857762|gb|EEE95309.1| predicted protein [Populus trichocarpa]
          Length = 298

 Score =  429 bits (1102), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 205/269 (76%), Positives = 234/269 (86%), Gaps = 1/269 (0%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
           M ++TS++F+++ Q+S +R   VSS  S   +FEGFR R+   CH V  +AS+S+SR  I
Sbjct: 1   MTAITSLSFSAVNQSSSERNTFVSSLASRNLSFEGFRLRSVFSCHYVGVKASNSTSRTAI 60

Query: 61  QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
            CMST+TDVPPTV+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY YDPVF L
Sbjct: 61  HCMSTSTDVPPTVSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYLYDPVFGL 120

Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
           G VTVYD+LMEGYPS+EDREAIFQAYI ALKEDPEQYRIDA+KLEEWAR QT SSLV+F 
Sbjct: 121 GLVTVYDQLMEGYPSDEDREAIFQAYIKALKEDPEQYRIDAKKLEEWARAQTHSSLVDFS 180

Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           SKEGE+EG+LK IAERA+  GNFSYSRFFAVGLFRLLEL+NA+EPTVLEKLC+ LN+NKR
Sbjct: 181 SKEGEIEGILKGIAERAAS-GNFSYSRFFAVGLFRLLELSNASEPTVLEKLCSALNINKR 239

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           SVDRDLDVYR LLSKL+QAKELLKEYVDR
Sbjct: 240 SVDRDLDVYRGLLSKLVQAKELLKEYVDR 268


>gi|359485791|ref|XP_002275686.2| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Vitis
           vinifera]
          Length = 299

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 208/269 (77%), Positives = 241/269 (89%), Gaps = 2/269 (0%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
           MA++TS++F+++GQ+S +RKV V +TRS  S FE FRFR + +   VR  +SSSSSRM++
Sbjct: 1   MAAVTSLSFSALGQSS-ERKVPVPTTRSFASAFEAFRFRANFYAVGVRSSSSSSSSRMVV 59

Query: 61  QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
           QCMS+ TDVP TV+ETKMNFLK YKRPIPSIYNT+LQEL+VQQHLMRYKRTY+YD VFAL
Sbjct: 60  QCMSSVTDVP-TVSETKMNFLKNYKRPIPSIYNTLLQELMVQQHLMRYKRTYRYDAVFAL 118

Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
           GFVTVYD+LM+GYPS+EDR+ IFQ YI AL+EDPEQYR DAQ LEEWAR QTASSLVEF 
Sbjct: 119 GFVTVYDQLMDGYPSDEDRDIIFQVYIKALREDPEQYRKDAQMLEEWARSQTASSLVEFS 178

Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           SKEGEVEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANATEPT+LEKLCA  N++KR
Sbjct: 179 SKEGEVEGILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPTILEKLCAAFNISKR 238

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           SVDRDLDVYRNLL+KL+QAKELLKEYVDR
Sbjct: 239 SVDRDLDVYRNLLTKLVQAKELLKEYVDR 267


>gi|224146717|ref|XP_002326111.1| predicted protein [Populus trichocarpa]
 gi|222862986|gb|EEF00493.1| predicted protein [Populus trichocarpa]
          Length = 296

 Score =  419 bits (1076), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 205/269 (76%), Positives = 239/269 (88%), Gaps = 3/269 (1%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
           MA++T V+F++I Q+S  R+   +  R+L   FEGFRFR+S  CH V  RAS+S+SRM+I
Sbjct: 1   MAAVTPVSFSAISQSSSDRRAFCTVARNL--GFEGFRFRSSFSCHYVGVRASNSTSRMVI 58

Query: 61  QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
            CMST+TDVPPTVA+TK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YK+T++YDPVF L
Sbjct: 59  HCMSTSTDVPPTVADTKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKKTFRYDPVFGL 118

Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
           GFVTVYD+LMEGYPS+EDREAIFQAYI AL+EDPEQYRIDA+KLEEWAR QT SSLV+F 
Sbjct: 119 GFVTVYDQLMEGYPSDEDREAIFQAYIKALEEDPEQYRIDAKKLEEWARAQTPSSLVDFS 178

Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           S+EGE+EG LKDIAER +  GNFSYSRFFAVGLFRLLEL+NA+EPTVLEKLC+ LN+NKR
Sbjct: 179 SREGEIEGTLKDIAERVAS-GNFSYSRFFAVGLFRLLELSNASEPTVLEKLCSALNINKR 237

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           SVDRDLDVYR LLSKL+QA+ELLKEYVDR
Sbjct: 238 SVDRDLDVYRGLLSKLVQARELLKEYVDR 266


>gi|356542877|ref|XP_003539891.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
           [Glycine max]
          Length = 291

 Score =  411 bits (1056), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 196/266 (73%), Positives = 231/266 (86%), Gaps = 6/266 (2%)

Query: 4   LTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMIIQCM 63
           ++S +F+++ Q+S QR +      +L SN   FRFR    C  V  RAS+S+S+M+++C 
Sbjct: 5   ISSFSFSTLTQSSSQRNLT-----TLSSNSPTFRFRVGFSCLNVGVRASNSASKMVVRCS 59

Query: 64  STATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFV 123
           S+  + PPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKR+Y+YD VFALGFV
Sbjct: 60  SSVAE-PPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRSYRYDAVFALGFV 118

Query: 124 TVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE 183
           TVY++LMEGYPS+EDR+AIFQAYI ALKEDPEQYR+DA+KLEEWAR Q  +SL+EF S+E
Sbjct: 119 TVYEQLMEGYPSDEDRDAIFQAYIQALKEDPEQYRVDAKKLEEWARSQNPNSLLEFSSRE 178

Query: 184 GEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVD 243
           GEVEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANA EPT+LEKLCAVLNVNKRSVD
Sbjct: 179 GEVEGILKDIAERAGGKGDFSYSRFFAIGLFRLLELANAMEPTILEKLCAVLNVNKRSVD 238

Query: 244 RDLDVYRNLLSKLLQAKELLKEYVDR 269
           RDLDVYRNLLSKL+QAKELLKEYVDR
Sbjct: 239 RDLDVYRNLLSKLVQAKELLKEYVDR 264


>gi|356517586|ref|XP_003527468.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
           [Glycine max]
          Length = 291

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 197/270 (72%), Positives = 234/270 (86%), Gaps = 7/270 (2%)

Query: 1   MASLTS-VAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMI 59
           MA++TS  +F+++ Q+S  R ++     +L SN   FR R    CH V  RAS+S+S+++
Sbjct: 1   MATVTSSFSFSTLTQSSSPRNLS-----TLSSNSPTFRIRVGFSCHYVGVRASNSASKIV 55

Query: 60  IQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFA 119
           ++C S+A   PPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKR+Y+YD VFA
Sbjct: 56  VRC-SSAVAEPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRSYRYDAVFA 114

Query: 120 LGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
           LGFVTVY++LMEGYPS+EDR+AIFQAYI ALKEDPEQYR+DA+KLEEWAR Q  +SLV+F
Sbjct: 115 LGFVTVYEQLMEGYPSDEDRDAIFQAYIQALKEDPEQYRVDAKKLEEWARAQNPTSLVDF 174

Query: 180 PSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNK 239
            S+EGEVEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANA EPT+LEKLCAVLNV+K
Sbjct: 175 SSREGEVEGILKDIAERAGGKGDFSYSRFFAIGLFRLLELANAMEPTILEKLCAVLNVDK 234

Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           RSVDRDLDVYRNLLSKL+QAKELLKEYVDR
Sbjct: 235 RSVDRDLDVYRNLLSKLVQAKELLKEYVDR 264


>gi|388496070|gb|AFK36101.1| unknown [Lotus japonicus]
          Length = 298

 Score =  405 bits (1042), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 196/270 (72%), Positives = 234/270 (86%), Gaps = 3/270 (1%)

Query: 1   MASLTS-VAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMI 59
           MA++TS  +F+++ Q+S QRK  +SST  L SN +G RFR S  C  V  +AS+S+S+++
Sbjct: 1   MATVTSSFSFSALTQSS-QRKHTLSSTCVLGSNSDGIRFRKSFLCQQVGVQASNSASKLV 59

Query: 60  IQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFA 119
           ++C S+ +D PP V+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMR+KR+Y+YDPVFA
Sbjct: 60  VRCSSSVSD-PPPVSETKLNFLKEYKRPIPSIYNTVLQELIVQQHLMRFKRSYRYDPVFA 118

Query: 120 LGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
           LGFVTVY++LMEGYPS+EDR+AIFQ YI ALKEDP QYR DAQKLEEWAR Q+++SL+EF
Sbjct: 119 LGFVTVYEQLMEGYPSDEDRDAIFQTYIKALKEDPGQYREDAQKLEEWARTQSSTSLIEF 178

Query: 180 PSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNK 239
            S+EGEVEG LKDIAERA GKG+FSYSRFFA+GLFRLLEL NA EP +LEKLCA LNV+K
Sbjct: 179 SSREGEVEGALKDIAERAGGKGDFSYSRFFAIGLFRLLELGNAMEPAILEKLCAALNVDK 238

Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           RSVDRDLDVYRNLLSKL+QAKELLKEY DR
Sbjct: 239 RSVDRDLDVYRNLLSKLVQAKELLKEYADR 268


>gi|388506988|gb|AFK41560.1| unknown [Medicago truncatula]
          Length = 287

 Score =  400 bits (1028), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 193/269 (71%), Positives = 229/269 (85%), Gaps = 2/269 (0%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
           MA+LTS++F++  Q S QRK  +SSTR L S+ + F  RT    HCV  R  +S+S+M+I
Sbjct: 1   MAALTSLSFSATSQCS-QRKSTLSSTRFLASSSDMFGIRTDSSYHCVGVRVGNSASKMVI 59

Query: 61  QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
           QCMS+ TDVP +V+ETK+NFLK YKRPIPSIYN VLQELIVQ HLMRYK +YQYD VFAL
Sbjct: 60  QCMSSVTDVP-SVSETKLNFLKAYKRPIPSIYNNVLQELIVQHHLMRYKTSYQYDSVFAL 118

Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
           GFVTVYD+LMEGY SEE+R+ IF+AYI ALKEDPEQYRIDA+KLE+WA+ Q + SLVEF 
Sbjct: 119 GFVTVYDKLMEGYSSEEERDTIFKAYINALKEDPEQYRIDAKKLEDWAKAQNSISLVEFS 178

Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           S+EGEVEG+LKDIA+RA  KG FSYSRFFAVGLFRLLELANATEPT+L+KLCA LN++KR
Sbjct: 179 SREGEVEGVLKDIAKRAGEKGEFSYSRFFAVGLFRLLELANATEPTILDKLCAALNIDKR 238

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           SVDRDLDVYR LLSKL+QAKEL +E++DR
Sbjct: 239 SVDRDLDVYRMLLSKLVQAKELQREFIDR 267


>gi|388514959|gb|AFK45541.1| unknown [Medicago truncatula]
          Length = 303

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 191/271 (70%), Positives = 232/271 (85%), Gaps = 3/271 (1%)

Query: 1   MASLTS-VAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRF-RASSSSSRM 58
           MA++TS  +F+++ Q+S ++   +SSTR   S+ +  RFR S+    V   R S+  S++
Sbjct: 1   MAAVTSSFSFSTLTQSSQRKLTTLSSTRFFGSDSDAVRFRFSISLRYVGVVRTSNFVSKL 60

Query: 59  IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVF 118
           +++C S+ +D PPTV+ETK+NFLK YKRPIPSIYN+VLQELIVQQHLMRYK++Y+YDPVF
Sbjct: 61  VVRCSSSVSD-PPTVSETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKKSYRYDPVF 119

Query: 119 ALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE 178
           ALGFVTVYD+LMEGYPS+EDR+AIFQAYI ALKEDP QYR+DAQKLEEWAR Q A+SL+E
Sbjct: 120 ALGFVTVYDQLMEGYPSDEDRDAIFQAYINALKEDPAQYRVDAQKLEEWARAQNATSLIE 179

Query: 179 FPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVN 238
           F S+EGEVEG LKDIAERA G G+FSYSRFFAVGLFRLLELAN  EPT+LEKLC+ LN+N
Sbjct: 180 FSSREGEVEGTLKDIAERAGGNGDFSYSRFFAVGLFRLLELANTMEPTILEKLCSALNIN 239

Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           K+SVDRDLDVYRNLLSKL+QAKELLKEY+DR
Sbjct: 240 KKSVDRDLDVYRNLLSKLVQAKELLKEYIDR 270


>gi|217073200|gb|ACJ84959.1| unknown [Medicago truncatula]
          Length = 287

 Score =  397 bits (1019), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 192/269 (71%), Positives = 228/269 (84%), Gaps = 2/269 (0%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
           MA+LTS++F++  Q S QRK  +SSTR L S+ + F  RT    HCV  R  +S+S+M+I
Sbjct: 1   MAALTSLSFSATSQCS-QRKSTLSSTRFLASSSDMFGIRTDSSYHCVGVRVGNSASKMVI 59

Query: 61  QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
           QCMS+ TDVP +V+ETK+NFLK YKRPIPSIYN VLQELIVQ HLMRYK +YQYD VFAL
Sbjct: 60  QCMSSVTDVP-SVSETKLNFLKAYKRPIPSIYNNVLQELIVQHHLMRYKTSYQYDSVFAL 118

Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
           GFVTVYD+LMEGY SEE+R+ IF+AYI ALKEDPEQYRIDA+KLE+WA+ Q + SLVEF 
Sbjct: 119 GFVTVYDKLMEGYSSEEERDTIFKAYINALKEDPEQYRIDAKKLEDWAKAQNSISLVEFS 178

Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           S+E EVEG+LKDIA+RA  KG FSYSRFFAVGLFRLLELANATEPT+L+KLCA LN++KR
Sbjct: 179 SREREVEGVLKDIAKRAGEKGEFSYSRFFAVGLFRLLELANATEPTILDKLCAALNIDKR 238

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           SVDRDLDVYR LLSKL+QAKEL +E++DR
Sbjct: 239 SVDRDLDVYRMLLSKLVQAKELQREFIDR 267


>gi|75140959|sp|Q7XAB8.1|THF1_SOLTU RecName: Full=Protein THYLAKOID FORMATION1, chloroplastic; Flags:
           Precursor
 gi|33469614|gb|AAQ19850.1| light-regulated chloroplast-localized protein [Solanum tuberosum]
          Length = 293

 Score =  380 bits (976), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 188/269 (69%), Positives = 226/269 (84%), Gaps = 5/269 (1%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
           MA++TSV+F++I Q S +RK +VSS+RS+    + FRFR++     V  R+S+S+SR ++
Sbjct: 1   MAAVTSVSFSAITQ-SAERKSSVSSSRSI----DTFRFRSNFSFDSVNVRSSNSTSRFVV 55

Query: 61  QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
            C S++    PTVA+TK+ FL  YKRPIP++YNTVLQELIVQQHL RYK++YQYDPVFAL
Sbjct: 56  HCTSSSAADLPTVADTKLKFLTAYKRPIPTVYNTVLQELIVQQHLTRYKKSYQYDPVFAL 115

Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
           GFVTVYD+LMEGYPSEEDR AIF+AYI ALKEDPEQYR DAQKLEEWAR Q A++LV+F 
Sbjct: 116 GFVTVYDQLMEGYPSEEDRNAIFKAYIEALKEDPEQYRADAQKLEEWARTQNANTLVDFS 175

Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           SKEGE+E + KDIA+RA  K  F YSR FAVGLFRLLELAN T+PT+LEKLCA LNVNK+
Sbjct: 176 SKEGEIENIFKDIAQRAGTKDGFCYSRLFAVGLFRLLELANVTDPTILEKLCAALNVNKK 235

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           SVDRDLDVYRNLLSKL+QAKELLKEYV+R
Sbjct: 236 SVDRDLDVYRNLLSKLVQAKELLKEYVER 264


>gi|125558787|gb|EAZ04323.1| hypothetical protein OsI_26464 [Oryza sativa Indica Group]
          Length = 287

 Score =  379 bits (974), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 175/213 (82%), Positives = 197/213 (92%)

Query: 55  SSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
            SR +++C++TA DVPPTVAETKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQY
Sbjct: 43  GSRSVVRCVATAGDVPPTVAETKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKTTYQY 102

Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
           D VFALGFVTVYD+LMEGYPS EDR+AIF+AYITAL EDPEQYR DAQK+EEWAR Q  +
Sbjct: 103 DAVFALGFVTVYDQLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGN 162

Query: 175 SLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV 234
           SLVEF SK+GE+E +LKDI+ERA GKG+FSYSRFFAVGLFRLLELANATEPT+L+KLCA 
Sbjct: 163 SLVEFSSKDGEIEAILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKLCAA 222

Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
           LN+NKRSVDRDLDVYRN+LSKL+QAKELLKEYV
Sbjct: 223 LNINKRSVDRDLDVYRNILSKLVQAKELLKEYV 255


>gi|296084957|emb|CBI28372.3| unnamed protein product [Vitis vinifera]
          Length = 243

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 178/212 (83%), Positives = 198/212 (93%), Gaps = 1/212 (0%)

Query: 58  MIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPV 117
           M++QCMS+ TDVP TV+ETKMNFLK YKRPIPSIYNT+LQEL+VQQHLMRYKRTY+YD V
Sbjct: 1   MVVQCMSSVTDVP-TVSETKMNFLKNYKRPIPSIYNTLLQELMVQQHLMRYKRTYRYDAV 59

Query: 118 FALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV 177
           FALGFVTVYD+LM+GYPS+EDR+ IFQ YI AL+EDPEQYR DAQ LEEWAR QTASSLV
Sbjct: 60  FALGFVTVYDQLMDGYPSDEDRDIIFQVYIKALREDPEQYRKDAQMLEEWARSQTASSLV 119

Query: 178 EFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNV 237
           EF SKEGEVEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANATEPT+LEKLCA  N+
Sbjct: 120 EFSSKEGEVEGILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPTILEKLCAAFNI 179

Query: 238 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           +KRSVDRDLDVYRNLL+KL+QAKELLKEYVDR
Sbjct: 180 SKRSVDRDLDVYRNLLTKLVQAKELLKEYVDR 211


>gi|357122407|ref|XP_003562907.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
           [Brachypodium distachyon]
          Length = 286

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 172/215 (80%), Positives = 196/215 (91%)

Query: 55  SSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
            SR +++C++   D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQY
Sbjct: 42  GSRSVVRCVAATADIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQY 101

Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
           DPVFALGFVTVYD+LMEGYPS EDR+AIF++YITAL EDPEQYR DAQK+EEWAR Q  S
Sbjct: 102 DPVFALGFVTVYDQLMEGYPSNEDRDAIFKSYITALNEDPEQYRADAQKMEEWARAQNGS 161

Query: 175 SLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV 234
            LVEF S++GE+E +LKDI+ERA G GNFSYSRFFAVGLFRLLELANATEPTVL+KLCA 
Sbjct: 162 LLVEFSSRDGEIEAVLKDISERAQGNGNFSYSRFFAVGLFRLLELANATEPTVLDKLCAA 221

Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           LN+NKRSVDRDLD+YRNLLSKL+QAKELLKEY+DR
Sbjct: 222 LNINKRSVDRDLDIYRNLLSKLVQAKELLKEYIDR 256


>gi|326493802|dbj|BAJ85363.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 286

 Score =  375 bits (964), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 171/215 (79%), Positives = 198/215 (92%)

Query: 55  SSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
            SR +++C+ TA D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQY
Sbjct: 42  GSRSVVRCVITAGDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQY 101

Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
           DPVFALGFVTVYD+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q  +
Sbjct: 102 DPVFALGFVTVYDQLMEGYPSNEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGN 161

Query: 175 SLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV 234
            LVEF S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+KLCA 
Sbjct: 162 LLVEFSSRDGEIESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKLCAA 221

Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           LN+NK+SVDRDLDVYRNLLSKL+QAKELLKEYV+R
Sbjct: 222 LNINKKSVDRDLDVYRNLLSKLVQAKELLKEYVER 256


>gi|115472755|ref|NP_001059976.1| Os07g0558500 [Oryza sativa Japonica Group]
 gi|75147522|sp|Q84PB7.1|THF1_ORYSJ RecName: Full=Protein THYLAKOID FORMATION1, chloroplastic; Flags:
           Precursor
 gi|29367385|gb|AAO72565.1| inositol phosphatase-like protein [Oryza sativa Japonica Group]
 gi|34394010|dbj|BAC84034.1| inositol phosphatase-like protein [Oryza sativa Japonica Group]
 gi|113611512|dbj|BAF21890.1| Os07g0558500 [Oryza sativa Japonica Group]
 gi|125600704|gb|EAZ40280.1| hypothetical protein OsJ_24722 [Oryza sativa Japonica Group]
 gi|215694285|dbj|BAG89278.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 287

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 174/213 (81%), Positives = 196/213 (92%)

Query: 55  SSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
            SR +++C++TA DVPPTVAETKMNFLK YKRPI SIY+TVLQEL+VQQHLMRYK TYQY
Sbjct: 43  GSRSVVRCVATAGDVPPTVAETKMNFLKSYKRPILSIYSTVLQELLVQQHLMRYKTTYQY 102

Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
           D VFALGFVTVYD+LMEGYPS EDR+AIF+AYITAL EDPEQYR DAQK+EEWAR Q  +
Sbjct: 103 DAVFALGFVTVYDQLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGN 162

Query: 175 SLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV 234
           SLVEF SK+GE+E +LKDI+ERA GKG+FSYSRFFAVGLFRLLELANATEPT+L+KLCA 
Sbjct: 163 SLVEFSSKDGEIEAILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKLCAA 222

Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
           LN+NKRSVDRDLDVYRN+LSKL+QAKELLKEYV
Sbjct: 223 LNINKRSVDRDLDVYRNILSKLVQAKELLKEYV 255


>gi|52548246|gb|AAU82110.1| chloroplast inositol phosphatase-like protein [Triticum aestivum]
          Length = 286

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 169/213 (79%), Positives = 197/213 (92%)

Query: 55  SSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
            SR +++C++TA D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQY
Sbjct: 42  GSRSVVRCVATAGDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQY 101

Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
           DPVFALGFVTVYD+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q  +
Sbjct: 102 DPVFALGFVTVYDQLMEGYPSTEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGN 161

Query: 175 SLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV 234
            LVEF S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+KLCA 
Sbjct: 162 LLVEFSSRDGEIESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKLCAA 221

Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
           LN+NK+SVDRDLDVYRNLLSKL+QAKELLKEY+
Sbjct: 222 LNINKKSVDRDLDVYRNLLSKLVQAKELLKEYI 254


>gi|38570261|gb|AAR24582.1| chloroplast-localized Ptr ToxA-binding protein1 [Triticum aestivum]
 gi|81239115|gb|ABB60085.1| chloroplast-localized Ptr ToxA-binding protein1 [Triticum aestivum]
          Length = 286

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 169/213 (79%), Positives = 197/213 (92%)

Query: 55  SSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
            SR +++C++TA D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQY
Sbjct: 42  GSRSVVRCVATAGDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQY 101

Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
           DPVFALGFVTVYD+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q  +
Sbjct: 102 DPVFALGFVTVYDQLMEGYPSTEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGN 161

Query: 175 SLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV 234
            LVEF S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+KLCA 
Sbjct: 162 LLVEFSSRDGEIESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKLCAA 221

Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
           LN+NK+SVDRDLDVYRNLLSKL+QAKELLKEY+
Sbjct: 222 LNINKKSVDRDLDVYRNLLSKLVQAKELLKEYI 254


>gi|293333399|ref|NP_001168867.1| uncharacterized protein LOC100382672 [Zea mays]
 gi|223973419|gb|ACN30897.1| unknown [Zea mays]
          Length = 284

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 186/269 (69%), Positives = 215/269 (79%), Gaps = 15/269 (5%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
           MAS++S+ F  +   +  R    SST   VS                R R  S S   ++
Sbjct: 1   MASISSLPFAPLRGAAEWRP---SSTAPAVSG---------AVVLSARSRRGSGS---VV 45

Query: 61  QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
           +C++TA+DVPPTV ETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYD VFAL
Sbjct: 46  RCVATASDVPPTVGETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDAVFAL 105

Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
           GFVTVYD+LMEGYPS EDR++IF+AYITAL EDP QYR DA K+E WAR Q  SSLV+F 
Sbjct: 106 GFVTVYDQLMEGYPSIEDRDSIFKAYITALNEDPNQYRADALKMEGWARSQNGSSLVDFS 165

Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+KLCA LN+NKR
Sbjct: 166 SRDGEIESILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKLCAALNINKR 225

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           SVDRDLDVYRN+LSKL+QAKELLKEYVDR
Sbjct: 226 SVDRDLDVYRNILSKLVQAKELLKEYVDR 254


>gi|242050546|ref|XP_002463017.1| hypothetical protein SORBIDRAFT_02g036270 [Sorghum bicolor]
 gi|241926394|gb|EER99538.1| hypothetical protein SORBIDRAFT_02g036270 [Sorghum bicolor]
          Length = 284

 Score =  373 bits (958), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 173/208 (83%), Positives = 193/208 (92%)

Query: 62  CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
           C++TA DVPPTVAETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYDPVF LG
Sbjct: 47  CVATAGDVPPTVAETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDPVFGLG 106

Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
           FVTVYD+LMEGYPS EDR++IF+AYITAL EDP QYR DA K+EEWAR Q ASSLV+F S
Sbjct: 107 FVTVYDQLMEGYPSNEDRDSIFRAYITALNEDPTQYRADALKMEEWARSQNASSLVDFSS 166

Query: 182 KEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
           ++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+KLC  LNV+KRS
Sbjct: 167 RDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKLCTALNVSKRS 226

Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           VDRDLDVYRN+LSKL+QAKELLKEYVDR
Sbjct: 227 VDRDLDVYRNILSKLVQAKELLKEYVDR 254


>gi|157142955|gb|ABV24460.1| chloroplast-localized protein [Nicotiana benthamiana]
          Length = 295

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 186/274 (67%), Positives = 227/274 (82%), Gaps = 13/274 (4%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSR--- 57
           MA++TSV+F++I Q S +RK +VS +RS+    + FRFR+++   C   R+S+SS     
Sbjct: 1   MAAVTSVSFSAIAQ-SAERKSSVSPSRSV----DTFRFRSNVSFDCFNVRSSNSSFSRST 55

Query: 58  --MIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYD 115
              ++ CMST     PTVAETKMNFLK YKRPIP++YNTVLQELIVQQHL++YK++Y+YD
Sbjct: 56  SRFVVHCMSTDL---PTVAETKMNFLKAYKRPIPTVYNTVLQELIVQQHLIKYKKSYRYD 112

Query: 116 PVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASS 175
           PVFALGFVTVYD+LMEGYPSEEDR+AIF+AYI AL EDP QYR DAQK EEWAR Q A++
Sbjct: 113 PVFALGFVTVYDQLMEGYPSEEDRDAIFKAYIEALNEDPVQYRADAQKFEEWARTQNANT 172

Query: 176 LVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVL 235
           LV+F S++GEVE +LKDIA+RA  K +F YSR FAVGLFRLLELAN T+PT+LEKLCA L
Sbjct: 173 LVDFSSRDGEVENILKDIAQRAGTKDSFCYSRLFAVGLFRLLELANVTDPTILEKLCASL 232

Query: 236 NVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           N+NK+SVDRDLDVYRNLLSKL+QAKELLKEYV+R
Sbjct: 233 NINKKSVDRDLDVYRNLLSKLVQAKELLKEYVER 266


>gi|397702097|gb|AFO59570.1| chloroplast Ptr ToxA-binding protein [Saccharum hybrid cultivar
           GT28]
          Length = 284

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 170/208 (81%), Positives = 192/208 (92%)

Query: 62  CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
           C++TA DVPPTV+ETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYDPVF LG
Sbjct: 47  CVATAGDVPPTVSETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDPVFGLG 106

Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
           FVTVYD+LMEGYPS EDR++IF+ YITAL EDP+QYR DA K+EEWAR Q  SSLV+F S
Sbjct: 107 FVTVYDQLMEGYPSNEDRDSIFRTYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSS 166

Query: 182 KEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
           ++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+KLC  LNV+KRS
Sbjct: 167 RDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKLCTALNVSKRS 226

Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           VDRDLDVYRN+LSKL+QAKELLKEYVDR
Sbjct: 227 VDRDLDVYRNILSKLVQAKELLKEYVDR 254


>gi|18399513|ref|NP_565491.1| protein THYLAKOID FORMATION 1 [Arabidopsis thaliana]
 gi|75206547|sp|Q9SKT0.1|THF1_ARATH RecName: Full=Protein THYLAKOID FORMATION 1, chloroplastic; Flags:
           Precursor
 gi|4454459|gb|AAD20906.1| expressed protein [Arabidopsis thaliana]
 gi|17065446|gb|AAL32877.1| Unknown protein [Arabidopsis thaliana]
 gi|20148535|gb|AAM10158.1| unknown protein [Arabidopsis thaliana]
 gi|330251998|gb|AEC07092.1| protein THYLAKOID FORMATION 1 [Arabidopsis thaliana]
          Length = 300

 Score =  369 bits (948), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 183/272 (67%), Positives = 219/272 (80%), Gaps = 18/272 (6%)

Query: 3   SLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFR----ASSSSSRM 58
           +++S++F ++GQ+              +SNF   R   S    C +F      S S+S+ 
Sbjct: 5   AISSLSFPALGQSD------------KISNFASSRPLASAIRICTKFSRLSLNSRSTSKS 52

Query: 59  IIQCMSTAT-DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPV 117
           +I CMS  T DVPP V+ETK  FLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPV
Sbjct: 53  LIHCMSNVTADVPP-VSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPV 111

Query: 118 FALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV 177
           FALGFVTVYD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV
Sbjct: 112 FALGFVTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLV 171

Query: 178 EFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNV 237
           +F SKEG++E +LKDIA RA  K  FSYSRFFAVGLFRLLELA+AT+PTVL+KLCA LN+
Sbjct: 172 DFSSKEGDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNI 231

Query: 238 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           NK+SVDRDLDVYRNLLSKL+QAKELLKEYV+R
Sbjct: 232 NKKSVDRDLDVYRNLLSKLVQAKELLKEYVER 263


>gi|21592994|gb|AAM64943.1| unknown [Arabidopsis thaliana]
 gi|58761181|gb|AAW82331.1| chloroplast thylakoid formation 1 [Arabidopsis thaliana]
          Length = 300

 Score =  367 bits (943), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 182/272 (66%), Positives = 218/272 (80%), Gaps = 18/272 (6%)

Query: 3   SLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFR----ASSSSSRM 58
           +++S++F ++GQ+              +SNF   R   S    C +F      S S+S+ 
Sbjct: 5   AISSLSFPALGQSD------------KISNFASSRPLASAIRICTKFSRLSLNSRSTSKS 52

Query: 59  IIQCMSTAT-DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPV 117
           +I CMS  T DVPP V+ETK  FLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPV
Sbjct: 53  LIHCMSNVTADVPP-VSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPV 111

Query: 118 FALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV 177
           FALGFVTVYD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV
Sbjct: 112 FALGFVTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLV 171

Query: 178 EFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNV 237
           +F SKEG++E +LKDIA RA  K  FSYSRFFAVGLFRLLELA+AT+PTVL+KLCA LN+
Sbjct: 172 DFSSKEGDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNI 231

Query: 238 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           NK+SVDRDLDVYRNLLSKL+QA ELLKEYV+R
Sbjct: 232 NKKSVDRDLDVYRNLLSKLVQANELLKEYVER 263


>gi|212720892|ref|NP_001131923.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
 gi|194692932|gb|ACF80550.1| unknown [Zea mays]
 gi|195644742|gb|ACG41839.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
 gi|414887096|tpg|DAA63110.1| TPA: chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
          Length = 284

 Score =  367 bits (941), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 168/208 (80%), Positives = 192/208 (92%)

Query: 62  CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
           C++T  DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFALG
Sbjct: 47  CVATTGDVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFALG 106

Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
           FVTVYD+LMEGYPS EDR++IF+AYITAL EDP+QYR DA K+EEWAR Q  SSLV+F S
Sbjct: 107 FVTVYDQLMEGYPSNEDRDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSS 166

Query: 182 KEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
           ++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPT+L+KLCA LNV+KRS
Sbjct: 167 RDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLELSNATEPTILDKLCAALNVSKRS 226

Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           VDRDLDVYRN+LSKL+QAKELLKEYVDR
Sbjct: 227 VDRDLDVYRNILSKLVQAKELLKEYVDR 254


>gi|195653795|gb|ACG46365.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
          Length = 284

 Score =  365 bits (938), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 180/269 (66%), Positives = 216/269 (80%), Gaps = 15/269 (5%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
           MA+++S+ F ++   +  R    SST + VS                R R  S S   ++
Sbjct: 1   MAAISSLPFAALRGAAEWRP---SSTAAAVSG---------AVVLSARSRRGSCS---VV 45

Query: 61  QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
           +C++T  DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFAL
Sbjct: 46  RCVATTGDVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFAL 105

Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
           GFVTVYD+LME YPS ED+++IF+AYITAL EDP+QYR DA K+EEWAR Q  SSLV+F 
Sbjct: 106 GFVTVYDQLMERYPSNEDKDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFS 165

Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPT+L+KLCA LNV+KR
Sbjct: 166 SRDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLELSNATEPTILDKLCAALNVSKR 225

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           SVDRDLDVYRN+LSKL+QAKELLKEYVDR
Sbjct: 226 SVDRDLDVYRNILSKLVQAKELLKEYVDR 254


>gi|157849728|gb|ABV89647.1| chloroplast light-regulated protein [Brassica rapa]
          Length = 273

 Score =  363 bits (933), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 183/275 (66%), Positives = 221/275 (80%), Gaps = 17/275 (6%)

Query: 1   MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSS---- 56
           MA++ S++F ++GQ+   +  N + +R L S         S     +  R+ +SS     
Sbjct: 1   MAAIASLSFPALGQSG--KLSNPTPSRPLAS--------VSAISRRISRRSLTSSRPSTS 50

Query: 57  --RMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
             + +I CMS+ TDVPP V+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTY+Y
Sbjct: 51  SSKFVIHCMSSVTDVPP-VSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRY 109

Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
           DPVFALGFVTVYD+LM+GYPS++DR++IFQAY+ AL E P+QYRIDAQK+EEWAR QT++
Sbjct: 110 DPVFALGFVTVYDQLMDGYPSDQDRDSIFQAYVEALNEVPKQYRIDAQKMEEWARSQTSA 169

Query: 175 SLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV 234
           SLV+F  KEGEVE +LKDI+ERA  K  FSYSRFFAVGLFRLLELA AT+PTVL+KLCA 
Sbjct: 170 SLVDFSFKEGEVEAILKDISERAGSKEGFSYSRFFAVGLFRLLELAGATDPTVLDKLCAS 229

Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           LN+NK+SVDRDLDVYRNLLSKL+QAKELLKEYV+R
Sbjct: 230 LNINKKSVDRDLDVYRNLLSKLVQAKELLKEYVER 264


>gi|297832696|ref|XP_002884230.1| hypothetical protein ARALYDRAFT_900469 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330070|gb|EFH60489.1| hypothetical protein ARALYDRAFT_900469 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 298

 Score =  363 bits (933), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 170/210 (80%), Positives = 195/210 (92%), Gaps = 1/210 (0%)

Query: 60  IQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFA 119
           I CMST TDVPP V+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPVFA
Sbjct: 54  IHCMSTVTDVPP-VSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFA 112

Query: 120 LGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
           LGFVTVYD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV+F
Sbjct: 113 LGFVTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDF 172

Query: 180 PSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNK 239
            S++GE+E LLKDIA RA+ K  FSYSRFFAVGLFRLLELA+AT+PTVL+KLCA LN+NK
Sbjct: 173 SSRQGEIEALLKDIAGRAASKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINK 232

Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           +SVDRDLDVYRNLLSKL+QAKELL+EYV+R
Sbjct: 233 KSVDRDLDVYRNLLSKLVQAKELLREYVER 262


>gi|116782547|gb|ABK22548.1| unknown [Picea sitchensis]
          Length = 304

 Score =  337 bits (863), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 165/219 (75%), Positives = 190/219 (86%), Gaps = 3/219 (1%)

Query: 53  SSSSRMIIQC-MS-TATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKR 110
           +SSS  + +C MS TA+D+P TVAETK  FLK YKRPIPSIYN V+QELIVQQHLMRYKR
Sbjct: 61  ASSSLSVTRCSMSITASDIP-TVAETKSAFLKAYKRPIPSIYNNVIQELIVQQHLMRYKR 119

Query: 111 TYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG 170
           TYQYD VFALGFV+VYD+LM+GYPS+ D EAIF+AYI ALKEDPEQYR DA+KLEEWA  
Sbjct: 120 TYQYDAVFALGFVSVYDQLMDGYPSDGDSEAIFRAYINALKEDPEQYRSDAKKLEEWASS 179

Query: 171 QTASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK 230
           Q A S+VEF S++GEVEG+LKDIAERA  K  FSYSRFFA+GLFRLLE ANAT+P VLEK
Sbjct: 180 QDAKSIVEFQSRDGEVEGILKDIAERAREKKIFSYSRFFAIGLFRLLERANATDPVVLEK 239

Query: 231 LCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
           LC  LN++K SVDRDLD+YRN+LSKL+Q+KELLKEYV+R
Sbjct: 240 LCGALNISKPSVDRDLDIYRNILSKLVQSKELLKEYVER 278


>gi|302807588|ref|XP_002985488.1| hypothetical protein SELMODRAFT_122474 [Selaginella moellendorffii]
 gi|302810785|ref|XP_002987083.1| hypothetical protein SELMODRAFT_125247 [Selaginella moellendorffii]
 gi|300145248|gb|EFJ11926.1| hypothetical protein SELMODRAFT_125247 [Selaginella moellendorffii]
 gi|300146694|gb|EFJ13362.1| hypothetical protein SELMODRAFT_122474 [Selaginella moellendorffii]
          Length = 206

 Score =  311 bits (797), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 146/199 (73%), Positives = 172/199 (86%)

Query: 71  PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
           PTVA+TK  FLK +++PIPSIYN VLQEL+VQQHLMRY  TY+YD VFALGFVTVYD+LM
Sbjct: 3   PTVADTKSAFLKAFRKPIPSIYNNVLQELLVQQHLMRYNATYKYDAVFALGFVTVYDQLM 62

Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
           +GYP+ +D EAIF+AYI AL EDP+QYR DA+KLEEWA  QTASSL  F S +G+VE +L
Sbjct: 63  DGYPNAQDSEAIFKAYIEALGEDPDQYRKDAKKLEEWASSQTASSLASFNSGDGDVEEVL 122

Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYR 250
           KDIA+RA+GK +F YSRFFAVGLFRL+E ANA++P VLEKLC  LNV+K SVDRDLDVYR
Sbjct: 123 KDIAQRAAGKTSFHYSRFFAVGLFRLVERANASDPAVLEKLCNALNVSKMSVDRDLDVYR 182

Query: 251 NLLSKLLQAKELLKEYVDR 269
           NLL+KL QAK+LLKEY+DR
Sbjct: 183 NLLTKLSQAKDLLKEYIDR 201


>gi|168043272|ref|XP_001774109.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674516|gb|EDQ61023.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 215

 Score =  310 bits (794), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 142/199 (71%), Positives = 171/199 (85%)

Query: 71  PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
           PTVA+TK++F+K Y++PIPSIY+ V+QEL+VQQHLMRY  TY YDP+FALGFVTVYD+LM
Sbjct: 7   PTVADTKLSFIKSYRKPIPSIYSNVIQELLVQQHLMRYNSTYVYDPIFALGFVTVYDQLM 66

Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
           +GYP++EDR+AIF+AYI+AL EDPEQYR D++KLEEWA  Q+ S + +F  K+GEVE  L
Sbjct: 67  DGYPNDEDRDAIFKAYISALNEDPEQYRKDSKKLEEWAAAQSGSGIADFAGKDGEVEAAL 126

Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYR 250
           KDIAERA+GK  F YSRFFA+GLFRLLE A A++P VLE L   LNV+KRSVDRDLDVYR
Sbjct: 127 KDIAERAAGKEKFHYSRFFAIGLFRLLECAKASDPAVLETLSKALNVSKRSVDRDLDVYR 186

Query: 251 NLLSKLLQAKELLKEYVDR 269
           NLLSKL Q KEL+KEYVDR
Sbjct: 187 NLLSKLAQGKELIKEYVDR 205


>gi|217072610|gb|ACJ84665.1| unknown [Medicago truncatula]
 gi|388509564|gb|AFK42848.1| unknown [Medicago truncatula]
          Length = 219

 Score =  296 bits (759), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 144/219 (65%), Positives = 179/219 (81%), Gaps = 3/219 (1%)

Query: 1   MASLTS-VAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRF-RASSSSSRM 58
           MA++TS  +F+++ Q+S ++   +SSTR   S+ +  RFR S+    V   R S+  S++
Sbjct: 1   MAAVTSSFSFSTLTQSSQRKLTTLSSTRFFGSDSDAVRFRFSISLRYVGVVRTSNFVSKL 60

Query: 59  IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVF 118
           +++C S+ +D PPTV+ETK+NFLK YKRPIPSIYN+VLQELIVQQHLMRYK++Y+YDPVF
Sbjct: 61  VVRCSSSVSD-PPTVSETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKKSYRYDPVF 119

Query: 119 ALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE 178
           ALGFVTVYD+LMEGYPS+EDR+AIFQAYI ALKEDP QYR+DAQKLEEWAR Q A+SL+E
Sbjct: 120 ALGFVTVYDQLMEGYPSDEDRDAIFQAYINALKEDPAQYRVDAQKLEEWARAQNATSLIE 179

Query: 179 FPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL 217
           F S+E EVEG LKDIAERA G G+FSYSRFFAVG F  L
Sbjct: 180 FSSREREVEGTLKDIAERAGGNGDFSYSRFFAVGFFDFL 218


>gi|168037112|ref|XP_001771049.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677737|gb|EDQ64204.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 205

 Score =  296 bits (757), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 137/199 (68%), Positives = 162/199 (81%)

Query: 71  PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
           PTV+ETK +F+K Y++PIPSIY+ V+QEL+VQQHLMRY  TY YDP+FALGFVTVYD+LM
Sbjct: 7   PTVSETKASFIKSYRKPIPSIYSNVIQELLVQQHLMRYNSTYTYDPIFALGFVTVYDQLM 66

Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
           +GYP   DR++IF AYI AL EDP +YR DA+KLEEWA  Q+AS + +F S++GEVE  L
Sbjct: 67  DGYPDATDRDSIFTAYINALNEDPVKYREDAKKLEEWASAQSASGITDFTSRDGEVEATL 126

Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYR 250
           K IAERA  K  F YSRFFA+GLFRLLE A A++P VLE L   LNVNKRSVDRDLDVYR
Sbjct: 127 KSIAERAGSKDKFHYSRFFAIGLFRLLECAKASDPAVLESLSKALNVNKRSVDRDLDVYR 186

Query: 251 NLLSKLLQAKELLKEYVDR 269
           NLLSKL Q KEL+KEY +R
Sbjct: 187 NLLSKLAQGKELIKEYNER 205


>gi|414887097|tpg|DAA63111.1| TPA: hypothetical protein ZEAMMB73_220735 [Zea mays]
          Length = 207

 Score =  275 bits (704), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 124/159 (77%), Positives = 143/159 (89%)

Query: 62  CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
           C++T  DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFALG
Sbjct: 47  CVATTGDVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFALG 106

Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
           FVTVYD+LMEGYPS EDR++IF+AYITAL EDP+QYR DA K+EEWAR Q  SSLV+F S
Sbjct: 107 FVTVYDQLMEGYPSNEDRDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSS 166

Query: 182 KEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA 220
           ++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLL+ A
Sbjct: 167 RDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLDFA 205


>gi|326492686|dbj|BAJ90199.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 239

 Score =  218 bits (554), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 96/125 (76%), Positives = 112/125 (89%)

Query: 55  SSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
            SR +++C+ TA D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQY
Sbjct: 42  GSRSVVRCVITAGDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQY 101

Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
           DPVFALGFVTVYD+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q  +
Sbjct: 102 DPVFALGFVTVYDQLMEGYPSNEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGN 161

Query: 175 SLVEF 179
            LVEF
Sbjct: 162 LLVEF 166


>gi|384250113|gb|EIE23593.1| photosystem II biogenesis protein Psp29 [Coccomyxa subellipsoidea
           C-169]
          Length = 290

 Score =  204 bits (520), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 102/201 (50%), Positives = 143/201 (71%), Gaps = 4/201 (1%)

Query: 70  PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
           PPTVAETK NF + + RPIP IY+ V+QEL+VQ H+MRY ++Y YD VF LGFV+V+D++
Sbjct: 65  PPTVAETKRNFYEAFSRPIPGIYSNVIQELLVQHHIMRYNKSYSYDEVFGLGFVSVFDQV 124

Query: 130 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG-EVEG 188
           +EG P E D+ A+F AYI +L E+ +QYR DA+K+E  A+  +  + ++ P  EG E++ 
Sbjct: 125 LEGLP-EGDKGALFSAYIGSLGENGDQYRQDAEKVEALAKELSGPAELK-PDAEGSELQK 182

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDV 248
            L  IAER+S +GNF Y++FFA+GLFRLLEL  A +P  LE L + + + + SV RDL  
Sbjct: 183 KLASIAERSS-QGNFLYTKFFAIGLFRLLELTGAKDPKALEGLVSAMKIPQESVSRDLMT 241

Query: 249 YRNLLSKLLQAKELLKEYVDR 269
           Y+ +LSKL  AK+L+ E   R
Sbjct: 242 YKGVLSKLSAAKDLMNEMYAR 262


>gi|159471025|ref|XP_001693657.1| inositol phosphatase-like protein [Chlamydomonas reinhardtii]
 gi|158283160|gb|EDP08911.1| inositol phosphatase-like protein [Chlamydomonas reinhardtii]
          Length = 266

 Score =  203 bits (516), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 110/218 (50%), Positives = 149/218 (68%), Gaps = 3/218 (1%)

Query: 52  SSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRT 111
           SS + + +   +  AT  PPTVAETK  FL  Y +PI SIY+TVLQEL+VQQH MRY + 
Sbjct: 24  SSLARKAVATPVLAATFKPPTVAETKAKFLSGYNKPIASIYSTVLQELLVQQHFMRYSKN 83

Query: 112 YQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQ 171
           YQY+P+FALGFV+VY++++E   S E+R AIF+AY+ AL ED ++Y+ DA  LE+ A G 
Sbjct: 84  YQYNPIFALGFVSVYEQILESL-SAEERGAIFKAYVDALGEDADKYKRDASALEQAANGL 142

Query: 172 TASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKL 231
           T  SL   P+ +G            AS  G FSY++F A+GLFRLLEL+ A EP+ LEKL
Sbjct: 143 TPESLT--PNADGNEVQKALASISSASAAGAFSYNKFVAIGLFRLLELSGAKEPSALEKL 200

Query: 232 CAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDR 269
              + V   +V+RDL +Y+ +LSKL  AKEL++E+V+R
Sbjct: 201 VKAVGVKPEAVNRDLLMYKGVLSKLAAAKELMREFVER 238


>gi|356555139|ref|XP_003545894.1| PREDICTED: LOW QUALITY PROTEIN: protein THYLAKOID FORMATION1,
           chloroplastic-like [Glycine max]
          Length = 152

 Score =  199 bits (505), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 99/133 (74%), Positives = 111/133 (83%)

Query: 137 EDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAER 196
           E R+AIFQAYI AL EDP++YRIDA+KLEEWA  Q  +SLVEF SKEGE E  LKDIA R
Sbjct: 19  EGRDAIFQAYIKALVEDPDKYRIDARKLEEWAGVQNPTSLVEFSSKEGEAEKXLKDIAXR 78

Query: 197 ASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKL 256
           A GK  FSYSRFFAVGLFRL+EL NATEP +L+KLCA LN+NKRSVD DLDVY  LLS+L
Sbjct: 79  AGGKXEFSYSRFFAVGLFRLVELENATEPIILDKLCAALNINKRSVDWDLDVYCILLSEL 138

Query: 257 LQAKELLKEYVDR 269
           LQ KELLKEY+D+
Sbjct: 139 LQVKELLKEYIDK 151


>gi|307108772|gb|EFN57011.1| hypothetical protein CHLNCDRAFT_143677 [Chlorella variabilis]
          Length = 273

 Score =  181 bits (458), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 90/201 (44%), Positives = 133/201 (66%), Gaps = 7/201 (3%)

Query: 69  VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 128
            PPTVA+ K+ F   +K+P+P+IY+TV+QEL+VQQHL R+ + YQY+ V ALG V+++++
Sbjct: 49  APPTVADAKLKFNGAFKKPLPAIYSTVVQELLVQQHLFRWNKQYQYNEVTALGIVSIFEQ 108

Query: 129 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
           ++ G P  E REA+F A+I AL+EDP+QYR DA  +EE ARG++  +    P   G+   
Sbjct: 109 VLGGLPDAE-REAVFDAFINALQEDPKQYRKDAAAMEELARGKSEVA----PDASGDKVQ 163

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDV 248
                     GK  F Y++FFAVGLFRL+EL  + +P  L  L   L +++  V+ DL  
Sbjct: 164 QALAAVAAKEGK--FLYTKFFAVGLFRLVELTGSKDPKSLTTLVKALGLSQERVNADLMT 221

Query: 249 YRNLLSKLLQAKELLKEYVDR 269
           Y+ +LSKL  AKE++KE++ R
Sbjct: 222 YKGVLSKLEAAKEIMKEFMAR 242


>gi|302852549|ref|XP_002957794.1| hypothetical protein VOLCADRAFT_107813 [Volvox carteri f.
           nagariensis]
 gi|300256865|gb|EFJ41122.1| hypothetical protein VOLCADRAFT_107813 [Volvox carteri f.
           nagariensis]
          Length = 373

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 98/244 (40%), Positives = 144/244 (59%), Gaps = 43/244 (17%)

Query: 65  TATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVT 124
            A   PPTVAETK  F + Y +PI SIY+TVLQEL+VQQH MRY + Y Y+ +FALGFV+
Sbjct: 38  AANFTPPTVAETKAKFFEGYSKPIASIYSTVLQELLVQQHFMRYSKDYVYNEIFALGFVS 97

Query: 125 VYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG-------------- 170
           VY++++E  P  E R+AIF +Y+ AL EDPE Y+ D++++E+ A                
Sbjct: 98  VYEQILESLPQSE-RDAIFVSYVKALGEDPEAYKRDSERVEKAAGALSGPDALVPDAEGS 156

Query: 171 --QTASSLVEFPSKEGEVEGLLKDIAERASGKGN-----------------------FSY 205
             QT++ +  +  + GE+    +    R  G+G+                       FSY
Sbjct: 157 DVQTSAYIWAYHQRRGEMRMPWRT---RTWGQGSSSLGVCSYGKALDAIKAASAADAFSY 213

Query: 206 SRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 265
           ++F A+GLFRLLEL  A EP  LE+L   + +   +V+RDL +Y+ +LSKL  AKE+++E
Sbjct: 214 NKFVAIGLFRLLELTGAKEPAALERLVKSVGIKPEAVNRDLLMYKGVLSKLAAAKEMMRE 273

Query: 266 YVDR 269
           +V+R
Sbjct: 274 FVER 277


>gi|255075137|ref|XP_002501243.1| predicted protein [Micromonas sp. RCC299]
 gi|226516507|gb|ACO62501.1| predicted protein [Micromonas sp. RCC299]
          Length = 260

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 89/202 (44%), Positives = 128/202 (63%), Gaps = 16/202 (7%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T+A+TK  F++ Y  PIPSI++  + EL+  QH +RY   Y Y  + +LGFV+VYD+L E
Sbjct: 51  TLADTKRKFVESYPYPIPSIWSVAVNELLANQHFVRYSTRYSYSKLSSLGFVSVYDQLFE 110

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           G+PS+E++  IF  ++ AL EDPE+ R D            A+ L +F  + G V+ LL 
Sbjct: 111 GFPSDEEKAKIFDCFVEALGEDPEKCRKD------------AAELAKFAKEAGGVDALLA 158

Query: 192 D--IAE-RASGKGN-FSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 247
              +AE +++G+ N F+YSR+ A+GLFR+LEL  ATEP  LEKL     +  + V+ DL 
Sbjct: 159 SPVLAEIKSNGEANKFAYSRYDAIGLFRMLELGGATEPAALEKLADAAGLKLKKVNGDLG 218

Query: 248 VYRNLLSKLLQAKELLKEYVDR 269
           +Y+ LLSKL  AKEL KE  +R
Sbjct: 219 MYKGLLSKLAAAKELQKEIFER 240


>gi|427736065|ref|YP_007055609.1| photosystem II biogenesis protein Psp29 [Rivularia sp. PCC 7116]
 gi|427371106|gb|AFY55062.1| photosystem II biogenesis protein Psp29 [Rivularia sp. PCC 7116]
          Length = 233

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 84/203 (41%), Positives = 132/203 (65%), Gaps = 22/203 (10%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+ETK  F  L+ RPI +IY  V++EL+V+ HL+     ++YDP++ALG VT +DR M+
Sbjct: 6   TVSETKRTFYSLHTRPINTIYRRVVEELMVEMHLLGVNADFKYDPIYALGVVTTFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK-----EGEV 186
           GY  EED+E+I+ A I +++EDP++YR DA++LE+ A+  T   LV   S+     + E+
Sbjct: 66  GYNPEEDKESIYNALIKSVEEDPQKYRHDAKRLEDLAKSTTGKDLVSDLSQRRLANDSEL 125

Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTV----------LEKLCAVLN 236
           +GLL+ IA  +S    F YSR FA+GL+ LLE   +++P +          L+ + A LN
Sbjct: 126 QGLLEGIANNSS----FKYSRLFAIGLYTLLE---SSDPEMVKDEKLRNEALKTIAAGLN 178

Query: 237 VNKRSVDRDLDVYRNLLSKLLQA 259
           +++  + +DLD+YR+ L K+ QA
Sbjct: 179 LSEDKLSKDLDLYRSNLDKMAQA 201


>gi|428213026|ref|YP_007086170.1| photosystem II biogenesis protein Psp29 [Oscillatoria acuminata PCC
           6304]
 gi|428001407|gb|AFY82250.1| photosystem II biogenesis protein Psp29 [Oscillatoria acuminata PCC
           6304]
          Length = 235

 Score =  160 bits (405), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 84/207 (40%), Positives = 124/207 (59%), Gaps = 17/207 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  ++ RPI SIY  V++EL+V+ HL+     + YDP++ALG VT +DR M+
Sbjct: 6   TVSDTKRAFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDFNYDPIYALGVVTTFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF------PSKEGE 185
           GY  EED+ +IF      L+ DP++YR DAQ LEE A   +   +V        P  EG+
Sbjct: 66  GYRPEEDKISIFNGICKGLEADPQKYRQDAQWLEEIASRHSGEEMVALLSRSAGPEMEGD 125

Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVN 238
            +G+L  IA     K NF YSR FAVGLF LLE A+        +    ++K+C  LN+ 
Sbjct: 126 FQGILGAIA----AKPNFKYSRLFAVGLFTLLEQADLELVKNEKSRQEAVQKICTALNLP 181

Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKE 265
              + +DLD+YR  L K++QA+ ++++
Sbjct: 182 VDKLSKDLDLYRTNLEKMIQARSVMED 208


>gi|303286071|ref|XP_003062325.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455842|gb|EEH53144.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 222

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 80/171 (46%), Positives = 109/171 (63%), Gaps = 8/171 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TVA+TK  FLK Y  PIPSI++  LQEL+V QH +RY + Y Y  + +LGFV+VYD+L E
Sbjct: 45  TVADTKQKFLKSYPYPIPSIWSVALQELLVTQHFVRYSKKYSYSKLSSLGFVSVYDQLFE 104

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           G+PSEE++  IF+ ++ AL+EDP   R DA +L  +A G +    V       +++ L+ 
Sbjct: 105 GFPSEEEKNTIFECFVKALEEDPATVRKDAAELASFAEGASGVDGVLASPIFAQMKSLVA 164

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSV 242
           D        G F+YSR+ A+GLFRLLELA ATEP  LEKL     +  RS+
Sbjct: 165 D--------GKFAYSRYDAIGLFRLLELAKATEPAALEKLAESSGLQARSI 207


>gi|145344894|ref|XP_001416959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577185|gb|ABO95252.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 203

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 81/203 (39%), Positives = 125/203 (61%), Gaps = 18/203 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  FL+ Y  PIPS+++TV QEL+VQ H  +Y    +Y  + +LGFV+V+D+L E
Sbjct: 1   TVSDTKAKFLQAYPYPIPSVWSTVTQELLVQGHFAKYNAKSEYSELASLGFVSVFDQLYE 60

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
           G+PSE ++  IF A++ AL ED  + R DA+            +L  F +  G V+GL  
Sbjct: 61  GFPSETEKVKIFNAFLGALGEDAAKTRADAE------------ALGAFAASAGGVDGLSA 108

Query: 190 ---LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 246
                 +A + S +    Y+++ A+G+FR+LELA AT+P  LE L     ++ + V+ DL
Sbjct: 109 NPIFATMAAK-SAENKLMYTKYIAIGIFRMLELAKATDPKALEALAQAGGLSFKKVNGDL 167

Query: 247 DVYRNLLSKLLQAKELLKEYVDR 269
            +Y+ LLSKL  AKEL +E+++R
Sbjct: 168 AMYKGLLSKLASAKELQEEFLER 190


>gi|428775508|ref|YP_007167295.1| photosystem II biogenesis protein Psp29 [Halothece sp. PCC 7418]
 gi|428689787|gb|AFZ43081.1| photosystem II biogenesis protein Psp29 [Halothece sp. PCC 7418]
          Length = 243

 Score =  150 bits (380), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 80/205 (39%), Positives = 123/205 (60%), Gaps = 9/205 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T++ETK  F  L+ RP+ SIY  V++EL+V+ HL+     ++YDP +ALG VTV+D  M+
Sbjct: 6   TLSETKRTFYTLHTRPLNSIYRRVIEELLVEMHLLTVNIDFKYDPFYALGVVTVFDTFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 189
           GY  E+D+E+IF A   A++ DP+QYR DA+K++  A   +  ++  +   +K  +  G 
Sbjct: 66  GYQPEKDKESIFNAICKAVESDPQQYRQDAEKVKSIADQASGEAVTAWLCEAKPLDQAGD 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 242
           L DI +       F YSR F +G++ +LE AN            VL   C  LN+ K  V
Sbjct: 126 LNDILQGIRENPRFKYSRLFIIGIYTVLEKANPEIVNDDKKREEVLNNCCQALNLPKEKV 185

Query: 243 DRDLDVYRNLLSKLLQAKELLKEYV 267
           D+DLD+YR+ L K+ QA+ +L++ V
Sbjct: 186 DKDLDLYRSNLEKMEQARSVLEDVV 210


>gi|158338004|ref|YP_001519180.1| Thf1-like protein [Acaryochloris marina MBIC11017]
 gi|189030267|sp|B0C3M8.1|THF1_ACAM1 RecName: Full=Protein thf1
 gi|158308245|gb|ABW29862.1| photosystem II biogenesis protein Psb29 [Acaryochloris marina
           MBIC11017]
          Length = 247

 Score =  150 bits (378), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 76/209 (36%), Positives = 129/209 (61%), Gaps = 16/209 (7%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  ++ RP+ S+Y  V++EL+V+ HL+R    ++YDP+FALG  T +DR M+
Sbjct: 6   TVSDTKRAFYSIHTRPVNSVYRRVVEELMVEMHLLRVNEDFRYDPIFALGVTTSFDRFMD 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG-----EV 186
           GY  E D++AIF A   A + DP Q + D Q+L E A+ ++A  ++++ ++       E+
Sbjct: 66  GYQPENDKDAIFSAICKAQEADPVQMKKDGQRLTELAQSKSAQEMLDWITQAANSGGDEL 125

Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA--NATE-----PTVLEKLCAVLNVNK 239
           +  L++IA+       F YSR FA+GLF LLEL+  N T+        L  +C VLN+++
Sbjct: 126 QWQLRNIAQNPK----FKYSRLFAIGLFTLLELSEGNITQDEESLAEFLPNICTVLNISE 181

Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYVD 268
             + +DL++YR  L K+ Q ++ + + ++
Sbjct: 182 SKLQKDLEIYRGNLDKIAQVRQAMDDILE 210


>gi|411116557|ref|ZP_11389044.1| photosystem II biogenesis protein Psp29 [Oscillatoriales
           cyanobacterium JSC-12]
 gi|410712660|gb|EKQ70161.1| photosystem II biogenesis protein Psp29 [Oscillatoriales
           cyanobacterium JSC-12]
          Length = 246

 Score =  147 bits (371), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 78/204 (38%), Positives = 123/204 (60%), Gaps = 11/204 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  ++ RPI SIY  V++EL+V+ HL+     Y Y+P++ALG VT ++R M+
Sbjct: 6   TVSDTKRAFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDYSYNPIYALGVVTSFERFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E D+  IF A   AL++DP +YR DAQ+L ++A+ ++A  +V +  +     G   
Sbjct: 66  GYRPENDKAPIFDAICQALQDDPNRYRHDAQRLNDFAKQKSAKDIVTWLEQAATSYG-GD 124

Query: 192 DIAERASGKGN---FSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRS 241
           D+ E+     N   F YSR FA+GLF L E A+A           +L++ CA L ++   
Sbjct: 125 DLQEQVKAIANNPKFKYSRLFAIGLFTLFETADAEVVKKEGEREELLKQACAALRLSHDK 184

Query: 242 VDRDLDVYRNLLSKLLQAKELLKE 265
           V RDL++YR+ L K+ QA+ ++ +
Sbjct: 185 VQRDLELYRSNLEKVAQAQAVMAD 208


>gi|428778484|ref|YP_007170270.1| photosystem II biogenesis protein Psp29 [Dactylococcopsis salina
           PCC 8305]
 gi|428692763|gb|AFZ48913.1| photosystem II biogenesis protein Psp29 [Dactylococcopsis salina
           PCC 8305]
          Length = 240

 Score =  146 bits (369), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 77/209 (36%), Positives = 123/209 (58%), Gaps = 9/209 (4%)

Query: 68  DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 127
           D   T++ETK  F   + RP+ SIY  V++EL+V+ HL+     ++YDP++ALG  TV+D
Sbjct: 2   DTLRTLSETKRTFYTQHTRPLNSIYRRVIEELLVEMHLLSVNTDFKYDPIYALGVTTVFD 61

Query: 128 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 187
             M+GY  E+++E+IF A   A++ DP++YR DA+KL+  A   +   +    S+   ++
Sbjct: 62  TFMQGYQPEKEKESIFNAICQAVENDPQKYRQDAEKLKSIAANHSGEEVTACLSELKPLD 121

Query: 188 GL--LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLEKLCAVLNVN 238
           G   L  + +       F YSR F +GL+ +LE AN    T       VL+K C  L + 
Sbjct: 122 GAEELTKVLQEIKNNSRFKYSRLFIIGLYTILETANPDLVTDDKKREEVLQKCCQGLGLP 181

Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
           K  VD+DLD+YR+ L K+ QA+ +L++ +
Sbjct: 182 KEKVDKDLDLYRSNLEKMEQARSVLEDAI 210


>gi|334116992|ref|ZP_08491084.1| Protein thf1 [Microcoleus vaginatus FGP-2]
 gi|333461812|gb|EGK90417.1| Protein thf1 [Microcoleus vaginatus FGP-2]
          Length = 237

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 80/209 (38%), Positives = 124/209 (59%), Gaps = 17/209 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK +F  ++ RPI SIY  V++EL+V+ HL+     +QYDP++ALG VT +DR M 
Sbjct: 6   TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSANADFQYDPIYALGVVTAFDRFML 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E DR +IF A   +L++DP++Y+ DAQ+LE  A   +   L+ +  +    E    
Sbjct: 66  GYAPEADRVSIFNALCKSLEDDPDRYKQDAQRLESLADRLSGQELLSWLDRSTSFE---- 121

Query: 192 DIAERASGKG------NFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 238
           D A+  +  G       F YSR FA+GLF LLE A+        T    + K+ A L++ 
Sbjct: 122 DTADLQASLGAIASNPQFKYSRLFAIGLFSLLEKADPNLVKDQETRNDAIAKVSAALHLP 181

Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
           +  V +DLD+YR+ L K+ QA+ +L++ +
Sbjct: 182 EDKVSKDLDLYRSNLEKMAQARIVLQDVI 210


>gi|119488459|ref|ZP_01621632.1| hypothetical protein L8106_23815 [Lyngbya sp. PCC 8106]
 gi|119455270|gb|EAW36410.1| hypothetical protein L8106_23815 [Lyngbya sp. PCC 8106]
          Length = 241

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 77/204 (37%), Positives = 127/204 (62%), Gaps = 11/204 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F   + RPI S+Y  V++EL+V+ HL+     +QYDP++ALG V+ +DR M+
Sbjct: 6   TVSDTKRAFYNTHTRPINSVYRRVIEELMVEMHLLSVNVDFQYDPIYALGVVSAFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 189
           GY  E D+E+IF   I AL++DP++YR +AQ+L+E+A+  +   +V +   +   EV   
Sbjct: 66  GYLPESDKESIFHGLINALQDDPQRYRAEAQRLQEFAQTLSVQDIVSWVDVAANSEVHND 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN--------ATEPTVLEKLCAVLNVNKRS 241
           L+   ++ +    + YSR  A+GLF L+E A+        AT+ T L +L + LN+    
Sbjct: 126 LQSSFQKIATNPKYKYSRILAIGLFTLIEQADPQAMEDKEATQQT-LAQLASGLNLPLDK 184

Query: 242 VDRDLDVYRNLLSKLLQAKELLKE 265
           + +DL++YR+ L KL QA+ ++ E
Sbjct: 185 LQKDLELYRSNLEKLKQARIVMDE 208


>gi|113474941|ref|YP_721002.1| Thf1-like protein [Trichodesmium erythraeum IMS101]
 gi|123056927|sp|Q116P5.1|THF1_TRIEI RecName: Full=Protein thf1
 gi|110165989|gb|ABG50529.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 239

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 77/203 (37%), Positives = 120/203 (59%), Gaps = 9/203 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +DR M+
Sbjct: 6   TVSDTKKTFYHFHTRPINSIYNRVIEELLVEMHLISVNVDYSYNPFYALGVVTAFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 189
           GY  +ED+ +IF A I   +EDP +YR DA+ LE+ A   +AS ++ +   SK  +    
Sbjct: 66  GYSPQEDKTSIFNALIQGQEEDPNKYRSDAKGLEDLAGKISASDILSWICLSKNIDNTQY 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 242
           L+D     S    F YSR FA+GLF LLE+ +             L+K+C  LN+ +  +
Sbjct: 126 LQDDLRAISENSKFRYSRLFAIGLFTLLEIVDTELIKEQEKRTEALKKICQSLNLVEEKL 185

Query: 243 DRDLDVYRNLLSKLLQAKELLKE 265
            +D+D+Y + L ++ QA+  +++
Sbjct: 186 LKDIDLYLSNLERVAQARSAMED 208


>gi|428317172|ref|YP_007115054.1| Protein thf1 [Oscillatoria nigro-viridis PCC 7112]
 gi|428240852|gb|AFZ06638.1| Protein thf1 [Oscillatoria nigro-viridis PCC 7112]
          Length = 237

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 79/209 (37%), Positives = 124/209 (59%), Gaps = 17/209 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK +F  ++ RPI SIY  V++EL+V+ HL+     +QYDP++ALG VT +DR M 
Sbjct: 6   TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSANADFQYDPIYALGVVTAFDRFML 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E DR +IF A   ++++DP++Y+ DAQ+LE  A   +   L+ +  +    E    
Sbjct: 66  GYVPEADRVSIFNALCKSVEDDPDRYKQDAQRLESLADRLSGQELLSWLDRSTSFE---- 121

Query: 192 DIAERASGKG------NFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 238
           D A+  +  G       F YSR FA+GLF LLE A+        T    + K+ A L++ 
Sbjct: 122 DTADLQASLGAIASNPQFKYSRLFAIGLFSLLEKADPNLVKDQETRNDAIAKVSAGLHLP 181

Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
           +  V +DLD+YR+ L K+ QA+ +L++ +
Sbjct: 182 EDKVSKDLDLYRSNLEKMAQARIVLQDVI 210


>gi|428223566|ref|YP_007107663.1| photosystem II biogenesis protein Psp29 [Geitlerinema sp. PCC 7407]
 gi|427983467|gb|AFY64611.1| photosystem II biogenesis protein Psp29 [Geitlerinema sp. PCC 7407]
          Length = 239

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 77/204 (37%), Positives = 125/204 (61%), Gaps = 11/204 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  ++ RPI SIY  V++EL+V+ HL+     ++YDP +ALG VT Y+R M+
Sbjct: 6   TVSDTKRAFYSMHTRPINSIYRRVVEELMVEMHLLSVNVDFRYDPFYALGVVTSYERFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE---GEVEG 188
           GY  E+D+ +IF++   A + DP  YR DA++L E+ +  +A  L+ + S E   G+ +G
Sbjct: 66  GYRPEQDKTSIFESLCRANEGDPGHYRHDAERLAEFTKNLSAEELISWLSLETPRGDDQG 125

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRS 241
           L + + +  +    F YSR FA+GLF L+E AN       A      EK+ A L++    
Sbjct: 126 LGESL-QAIANHSQFKYSRLFAIGLFTLVEQANPDLVKDEAQRTATFEKVVAALHLPADK 184

Query: 242 VDRDLDVYRNLLSKLLQAKELLKE 265
           + +DL++YR+ L KL QA+ ++++
Sbjct: 185 LQKDLELYRSNLEKLTQARIVMED 208


>gi|354568723|ref|ZP_08987886.1| Protein thf1 [Fischerella sp. JSC-11]
 gi|353539977|gb|EHC09457.1| Protein thf1 [Fischerella sp. JSC-11]
          Length = 235

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 76/201 (37%), Positives = 121/201 (60%), Gaps = 17/201 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+P+FALG VT +DR M+
Sbjct: 6   TVSDTKRTFHTLHTRPINTIYRRVVEELMVEMHLLAVNVDFSYNPIFALGVVTSFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
           GY  E D+E+IF A + A++ DP+ YR DAQ+L+E A+      L+   S      ++ +
Sbjct: 66  GYQPESDKESIFNALLRAIEADPQIYRQDAQRLQELAKSLPPQDLIAALSLQTQLNRDTD 125

Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 238
           ++  L+ IA        F YSR FA+GLF LLEL++             L+ + A L+++
Sbjct: 126 LQSHLQAIA----SNPKFKYSRLFAIGLFSLLELSDPELVKDEKQRTEALKSIAAGLHIS 181

Query: 239 KRSVDRDLDVYRNLLSKLLQA 259
              +++DL++YR+ L K+ QA
Sbjct: 182 DDKLNKDLELYRSNLDKMAQA 202


>gi|424513129|emb|CCO66713.1| Thf1-like protein [Bathycoccus prasinos]
          Length = 222

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 76/203 (37%), Positives = 120/203 (59%), Gaps = 12/203 (5%)

Query: 70  PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
           P TVA+TK  F K Y  P+PSI+ TVLQEL+V  H       YQ++ + +LGFV+V+D+L
Sbjct: 4   PATVADTKAKFTKGYPYPLPSIWATVLQELLVGMHFTVTSSKYQHEEMRSLGFVSVFDQL 63

Query: 130 MEGYPSEED--REAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA-SSLVEFPSKEGEV 186
            EGYP+E+   +E IF  ++ AL ED +++R DA+KL  +A  QT+   ++  P      
Sbjct: 64  FEGYPTEDPNAKEKIFSTFMEALGEDSKKWRADAEKLSAFATEQTSIDGIIANP------ 117

Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 246
             +   +  +   K +  Y +F A+G FR LE++  T P  L+K+     V    ++ DL
Sbjct: 118 --MFASMKSKVESK-SLVYDKFIAIGFFRALEMSKQTSPENLKKISEASGVTLEKINGDL 174

Query: 247 DVYRNLLSKLLQAKELLKEYVDR 269
            +Y+++LS++  AKEL  E ++R
Sbjct: 175 GLYKSVLSRMNAAKELQAEVLER 197


>gi|220910509|ref|YP_002485820.1| Thf1-like protein [Cyanothece sp. PCC 7425]
 gi|254784141|sp|B8HQ62.1|THF1_CYAP4 RecName: Full=Protein thf1
 gi|219867120|gb|ACL47459.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7425]
          Length = 236

 Score =  140 bits (352), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 84/209 (40%), Positives = 125/209 (59%), Gaps = 14/209 (6%)

Query: 70  PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
           P TV++TK  F   + RPI SIY  V++EL+V+ HL+R  +T+ YDPVFALG VT ++R 
Sbjct: 4   PRTVSDTKRAFYHNHARPINSIYRRVVEELLVEIHLLRVNQTFVYDPVFALGVVTTFERF 63

Query: 130 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 189
           M+GY    D+ +IF A   A + DP+Q + DAQ+L    RGQ+  SL+++ S    + G 
Sbjct: 64  MQGYHPPADQTSIFNAICLAQELDPQQVQQDAQELLGRVRGQSLESLLDWISTAASLGGD 123

Query: 190 LKDIAERA-SGKGNFSYSRFFAVGLFRLLELANATEP----------TVLEKLCAVLNVN 238
            +    RA +    F YSR FAVGLF LLE A   EP           VL+++  V+++ 
Sbjct: 124 EQQNRLRAIASNPTFKYSRLFAVGLFTLLEQA---EPELGKDEARLLQVLQQVGEVMHLP 180

Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
              + +DL+ YR+ L K+ QA++ L++ V
Sbjct: 181 VEKMQKDLEQYRSNLEKMTQARKTLEDIV 209


>gi|434405136|ref|YP_007148021.1| photosystem II biogenesis protein Psp29 [Cylindrospermum stagnale
           PCC 7417]
 gi|428259391|gb|AFZ25341.1| photosystem II biogenesis protein Psp29 [Cylindrospermum stagnale
           PCC 7417]
          Length = 235

 Score =  140 bits (352), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 126/209 (60%), Gaps = 17/209 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYTLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNPIYALGVVTTFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
           GY  E D+E+IF A I A++++P++YR DA++L+  A+G     L+ + S      ++  
Sbjct: 66  GYQPERDQESIFNAIIQAVEQEPQRYRQDAERLQAVAQGLPEQDLIAWLSQTTHSDRDAN 125

Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA-------NATEPTVLEKLCAVLNVN 238
           ++  L+ IA       NF YSR FA+GLF LLE++       +      L+ +   L+++
Sbjct: 126 LQAQLQAIA----NNSNFKYSRLFAIGLFSLLEVSSPELVKDDKQRNEALKAIATGLHLS 181

Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
              + +DL++YR+ L K+ QA  ++ + V
Sbjct: 182 DDKLSKDLELYRSNLDKMAQALIVMADMV 210


>gi|440683252|ref|YP_007158047.1| Protein thf1 [Anabaena cylindrica PCC 7122]
 gi|428680371|gb|AFZ59137.1| Protein thf1 [Anabaena cylindrica PCC 7122]
          Length = 235

 Score =  139 bits (351), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 79/201 (39%), Positives = 122/201 (60%), Gaps = 17/201 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     Y Y+P++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNVDYSYNPIYALGVVTTFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
           GY  E D+E+IF A   A+++D ++YR DA +L+  A+      L+ + S      K+ +
Sbjct: 66  GYLPERDQESIFNALCQAVEQDQQRYRQDATRLQAIAQSLPVQDLIAWVSQTTHLDKDAD 125

Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA----TEPTVLEKLCAV---LNVN 238
           ++  L+ IA       NF YSR FA+GLF LLELA+      E    E L A+   L+++
Sbjct: 126 LQAQLQAIAH----NPNFKYSRLFAIGLFSLLELADPELVKDEKQRNEALKAIAQGLHLS 181

Query: 239 KRSVDRDLDVYRNLLSKLLQA 259
           +  + +DLD+YR+ L K+ QA
Sbjct: 182 EDKLSKDLDLYRSNLDKMAQA 202


>gi|17228142|ref|NP_484690.1| Thf1-like protein [Nostoc sp. PCC 7120]
 gi|81772969|sp|Q8YZ41.1|THF1_ANASP RecName: Full=Protein thf1
 gi|17129992|dbj|BAB72604.1| all0646 [Nostoc sp. PCC 7120]
          Length = 233

 Score =  139 bits (351), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 75/201 (37%), Positives = 121/201 (60%), Gaps = 17/201 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT +DR ME
Sbjct: 6   TVSDTKRTFYALHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFME 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
           GY  E D+E+IF A   A++++P++YR DA++L+  A+    + LV + S      ++ +
Sbjct: 66  GYQPERDKESIFSAICQAVEQEPQRYRQDAERLQAVAQSLPVNDLVAWLSQANHLQQDAD 125

Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVN 238
           ++  L+ IA       NF YSR FA+GLF LLE +N             L+ + A L+++
Sbjct: 126 LQAQLQAIA----NNSNFKYSRLFAIGLFTLLEQSNPDLVKDEKQRTEALKSIAAGLHLS 181

Query: 239 KRSVDRDLDVYRNLLSKLLQA 259
                +DL++YR+ L K+ QA
Sbjct: 182 DDKFSKDLELYRSNLDKMTQA 202


>gi|186685250|ref|YP_001868446.1| Thf1-like protein [Nostoc punctiforme PCC 73102]
 gi|254784144|sp|B2J353.1|THF1_NOSP7 RecName: Full=Protein thf1
 gi|186467702|gb|ACC83503.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
          Length = 235

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 74/201 (36%), Positives = 123/201 (61%), Gaps = 17/201 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNPIYALGVVTTFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV------EFPSKEGE 185
           GY  E D+E+IF A   A+++DP+ YR DA++L+  A+G     L+       +  ++ +
Sbjct: 66  GYEPERDQESIFNALCRAIEQDPQHYRQDAERLQAIAKGLPVKDLIGWLGQTTYLDRDAD 125

Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA----TEPTVLEKLCAV---LNVN 238
           ++  L+ IA       NF Y+R FA+G+F LLE ++      E  + E L A+   L+V+
Sbjct: 126 LQAQLQAIA----NNPNFKYNRLFAIGVFSLLEQSDPELVKDEKQLTEALKAIAAGLHVS 181

Query: 239 KRSVDRDLDVYRNLLSKLLQA 259
              +++DL++YR+ L K+ QA
Sbjct: 182 DDKLNKDLELYRSNLDKMAQA 202


>gi|427727466|ref|YP_007073703.1| photosystem II biogenesis protein Psp29 [Nostoc sp. PCC 7524]
 gi|427363385|gb|AFY46106.1| photosystem II biogenesis protein Psp29 [Nostoc sp. PCC 7524]
          Length = 235

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 124/209 (59%), Gaps = 17/209 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYSLHTRPINTIYRRVVEELMVEMHLLSVNIDFTYNPIYALGVVTTFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
           GY  E D+E+IF A   A++++P++YR DA++L+  A+    S LV + S      ++ +
Sbjct: 66  GYRPERDKESIFHAICQAVEQEPQRYRQDAERLQNLAKSLPISDLVAWLSQTTHFNQDPD 125

Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVN 238
           ++  L+ IA       NF YSR FA+GLF LLE ++             L+ +   L++ 
Sbjct: 126 LQAQLQAIA----NNPNFKYSRLFAIGLFSLLEYSDPDLVKDEKQRTEALKNIANGLHLA 181

Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
              + +DLD+YR+ L K+ QA  ++ + +
Sbjct: 182 DDKLSKDLDLYRSNLDKMTQALTVIADMI 210


>gi|428210102|ref|YP_007094455.1| photosystem II biogenesis protein Psp29 [Chroococcidiopsis
           thermalis PCC 7203]
 gi|428012023|gb|AFY90586.1| photosystem II biogenesis protein Psp29 [Chroococcidiopsis
           thermalis PCC 7203]
          Length = 250

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/197 (35%), Positives = 122/197 (61%), Gaps = 9/197 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK NF   + RPI +IY  V++EL+V+ HL+     ++YDP++ALG VT ++R M+
Sbjct: 6   TVSDTKRNFYNQHTRPINTIYRRVVEELMVEMHLLSVNADFRYDPIYALGVVTAFERFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
           GY  E D+E IF+A   +++++P++YR DA +L +  +  +A  L ++   +  ++G   
Sbjct: 66  GYQPERDKEPIFEALCQSIEDNPQRYRQDADRLRQLLQNVSAQQLFDWIDGKASLQGAED 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 242
           L+   +  +    F YSR FA+G+F LLELA+A            L+++   L+V +  +
Sbjct: 126 LQAQMQAIAQNSKFKYSRLFAIGVFTLLELADAELVKDEKQRVEALKQVATALHVPEDKL 185

Query: 243 DRDLDVYRNLLSKLLQA 259
           ++DL++YR+ L K+ QA
Sbjct: 186 NKDLELYRSNLDKIEQA 202


>gi|443311308|ref|ZP_21040938.1| photosystem II biogenesis protein Psp29 [Synechocystis sp. PCC
           7509]
 gi|442778631|gb|ELR88894.1| photosystem II biogenesis protein Psp29 [Synechocystis sp. PCC
           7509]
          Length = 241

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 69/197 (35%), Positives = 122/197 (61%), Gaps = 9/197 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F   + RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT Y+R M+
Sbjct: 6   TVSDTKRAFYSTHTRPINTIYRRVVEELMVEMHLLSVNADFSYNPIYALGVVTSYERFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
           GY  E D+++IFQA   A+  DP QYR DA++L  +A+  ++  L+++ S E  ++G   
Sbjct: 66  GYQPERDKDSIFQALCQAINTDPHQYRQDAERLGSFAKSLSSQDLMQWLSSEKPIDGYSD 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRSV 242
           L++  ++ +    F YSR FA+G+F LLEL++              +++ + L++ +  +
Sbjct: 126 LQEQIKQIATNQKFKYSRLFAIGVFSLLELSDPELVKDETKRVEAFKQISSSLHLPEDKL 185

Query: 243 DRDLDVYRNLLSKLLQA 259
           ++DL++YR  + K+ QA
Sbjct: 186 NKDLELYRANVEKMNQA 202


>gi|75910773|ref|YP_325069.1| Thf1-like protein [Anabaena variabilis ATCC 29413]
 gi|97202708|sp|Q3M4B2.1|THF1_ANAVT RecName: Full=Protein thf1
 gi|75704498|gb|ABA24174.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
          Length = 233

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 74/201 (36%), Positives = 122/201 (60%), Gaps = 17/201 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYALHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
           GY  E D+E+IF A   A++++P++YR DA++L+  A+    + LV + S      ++ +
Sbjct: 66  GYQPERDKESIFSAICQAVEQEPQRYRQDAERLKAVAQSLPVNDLVAWLSQANHLQQDAD 125

Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVN 238
           ++  L+ IA       NF YSR FA+GLF LLE +N             L+ + A L+++
Sbjct: 126 LQAQLQAIA----SNPNFKYSRLFAIGLFTLLEQSNPDLVKDEKQRTEALKTIAAGLHLS 181

Query: 239 KRSVDRDLDVYRNLLSKLLQA 259
              + +DL++YR+ L K+ QA
Sbjct: 182 DDKLSKDLELYRSNLDKMTQA 202


>gi|300866330|ref|ZP_07111033.1| Protein thf1 [Oscillatoria sp. PCC 6506]
 gi|300335673|emb|CBN56193.1| Protein thf1 [Oscillatoria sp. PCC 6506]
          Length = 267

 Score =  137 bits (344), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 75/203 (36%), Positives = 123/203 (60%), Gaps = 9/203 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK +F  ++ RPI SIY  V++EL+V+ HL+     ++Y+P++ALG VT ++R M+
Sbjct: 36  TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDFRYNPIYALGVVTAFERFMQ 95

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 189
           GY  E+D+ +IF     AL +DP++Y+ DA++LE  A   +   L+ +   S   E  G 
Sbjct: 96  GYLPEQDKVSIFNGLCQALGDDPQRYQQDARRLEGLASRVSILDLLSWLEGSTSFEDTGD 155

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN---ATEP----TVLEKLCAVLNVNKRSV 242
           L+      +    F YSR FA+GLF LLE+ +     +P      + K+CA L++ +  V
Sbjct: 156 LQASITAIATNSKFKYSRLFAIGLFALLEIVDPDLVKDPEARVQAIAKVCAALHLPEEKV 215

Query: 243 DRDLDVYRNLLSKLLQAKELLKE 265
            +DLD+YR+ L K+ QA+ +L +
Sbjct: 216 TKDLDLYRSNLEKIAQARIVLAD 238


>gi|452819272|gb|EME26335.1| thylakoid protein [Galdieria sulphuraria]
          Length = 316

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 82/233 (35%), Positives = 125/233 (53%), Gaps = 11/233 (4%)

Query: 36  FRFRTSLFCHCVRFRASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTV 95
           +R +  + C  +     SS+S      + T      TVAET  +FLK ++ PIPSIY T+
Sbjct: 54  YRTKRRMECRSIENLKESSAS----NSLPTQNGTVRTVAETISDFLKHFRHPIPSIYRTI 109

Query: 96  LQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPE 155
           +QEL+V  HL R    +QYDPVFALG+  V     + YP  E++E +F +   AL  D E
Sbjct: 110 VQELLVTTHLARVAVGFQYDPVFALGYQMVTQVFFKSYPKVEEKEKLFDSMCKALLLDYE 169

Query: 156 QYRIDAQKLEEWARGQTASSLVEFPSKEGE--VEGLLKDIAERASGKGNFSYSRFFAVGL 213
           + + DA  LEEW R +T   ++    + G+  +  LL  IA+       F YSR F +GL
Sbjct: 170 RMKKDASVLEEWTRSRTEREILLAIEEGGDDPLANLLHSIAQ----NDGFVYSRLFGLGL 225

Query: 214 FRLLEL-ANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 265
            R++EL          +K  + L+++   +++DLD Y+  L +L QA++L  E
Sbjct: 226 VRMMELCGEEANSERCQKWASALHISSLKLEQDLDTYQQSLERLKQAEQLFAE 278


>gi|376001810|ref|ZP_09779664.1| Putative thylakoid formation protein, Thf1-like [Arthrospira sp.
           PCC 8005]
 gi|375329721|emb|CCE15417.1| Putative thylakoid formation protein, Thf1-like [Arthrospira sp.
           PCC 8005]
          Length = 243

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 76/203 (37%), Positives = 116/203 (57%), Gaps = 9/203 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  ++ RPI SIY  V++EL+V+ HL+     ++YDP++ALG VT +DR M+
Sbjct: 6   TVSDTKRAFYNIHTRPINSIYRRVVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 189
           GY  E D+ +I+ A I A + DP QYR DA  LE  A   +   L E    ++E   +  
Sbjct: 66  GYIPEADKLSIWAALIMAQESDPNQYRADATALEAQAATLSVKDLTERAKIAQESSGDDP 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 242
           L+      +    F YSR FA+GL+ LLE ++ T     E L  +       L + K  +
Sbjct: 126 LQSCFHAIANNPKFKYSRLFAIGLYTLLEKSDVTAAQDSEGLKNIIIDFSEALRLPKDKL 185

Query: 243 DRDLDVYRNLLSKLLQAKELLKE 265
           ++DLD+YR  L K+ QA+ +++E
Sbjct: 186 EKDLDLYRTNLEKVAQARLMVEE 208


>gi|308801781|ref|XP_003078204.1| inositol phosphatase-like protein (ISS) [Ostreococcus tauri]
 gi|116056655|emb|CAL52944.1| inositol phosphatase-like protein (ISS) [Ostreococcus tauri]
          Length = 657

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 73/179 (40%), Positives = 110/179 (61%), Gaps = 16/179 (8%)

Query: 91  IYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITAL 150
           ++ TV+QEL+VQ H  +Y +  +Y+ + +LGFV+VYD+L EG+PSEE++  IF A++ AL
Sbjct: 79  VWATVVQELLVQGHFQKYNKKSEYNELASLGFVSVYDQLFEGFPSEEEKGKIFNAFLGAL 138

Query: 151 KEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKD--IAERA--SGKGNFSYS 206
            ED  + R DA+            +L  F +    VEGL ++   A+ A  S +G   Y+
Sbjct: 139 DEDAVRTRADAE------------TLGAFATSANGVEGLKENAIFAKLAAKSAEGTLLYT 186

Query: 207 RFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 265
           ++ A+G+FR+LELA AT+P  LE L     ++   V  DL +Y+ LLSKL  AKEL +E
Sbjct: 187 KYIAIGMFRMLELAKATDPAALEALVTAGGLSMSKVSGDLSMYKGLLSKLAAAKELQEE 245


>gi|434395245|ref|YP_007130192.1| Protein thf1 [Gloeocapsa sp. PCC 7428]
 gi|428267086|gb|AFZ33032.1| Protein thf1 [Gloeocapsa sp. PCC 7428]
          Length = 251

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 70/197 (35%), Positives = 119/197 (60%), Gaps = 9/197 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F   + RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT ++R M+
Sbjct: 6   TVSDTKRAFYTSHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTAFERFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE--GL 189
           GY  E D+E+IF A   A++ DP++YR DA++L  +A+  +   L+ +   E   E  G 
Sbjct: 66  GYQPERDKESIFNALCQAVESDPQRYRQDAERLGLFAKNTSTPELIAWLRGETHKEEVGD 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRSV 242
           L+   +  +   +F YSR FA+G+F LLEL++             L+ + A LN+++  +
Sbjct: 126 LQQQIQAIAHNPHFKYSRLFAIGVFGLLELSDPALVKDEKQRVDALKSIAATLNISEDKL 185

Query: 243 DRDLDVYRNLLSKLLQA 259
           ++DL++YR  + K+ QA
Sbjct: 186 NKDLELYRANVDKMEQA 202


>gi|298491449|ref|YP_003721626.1| photosystem II biogenesis protein Psp29 ['Nostoc azollae' 0708]
 gi|298233367|gb|ADI64503.1| photosystem II biogenesis protein Psp29 ['Nostoc azollae' 0708]
          Length = 235

 Score =  133 bits (334), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 72/201 (35%), Positives = 122/201 (60%), Gaps = 17/201 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     ++Y+ ++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNVDFRYNSIYALGVVTAFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
           GY  E+D+ +IF A I A+++DP++YR DA +L+  A+      L+ + S      ++ +
Sbjct: 66  GYQPEQDQASIFNAIIQAVEQDPQRYRQDAARLQVVAQSLLTKDLISWLSQTTYLDQDRD 125

Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 238
           ++  L+ IA  A     F YSR FA+GLF LLE+ ++            L+ +   L+++
Sbjct: 126 LQAQLQAIANNAE----FKYSRLFAIGLFSLLEMVDSELVKDEKQRNQALKAIAQGLHLS 181

Query: 239 KRSVDRDLDVYRNLLSKLLQA 259
           +  + +DL++YR+ L KL QA
Sbjct: 182 EEKLTKDLELYRSNLDKLAQA 202


>gi|427707894|ref|YP_007050271.1| Protein thf1 [Nostoc sp. PCC 7107]
 gi|427360399|gb|AFY43121.1| Protein thf1 [Nostoc sp. PCC 7107]
          Length = 235

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 121/197 (61%), Gaps = 9/197 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYSLHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV--EGL 189
           GY  E D+E+IFQA   A++++ ++YR DA++L+  A+   A+ L+ + S+   +  +  
Sbjct: 66  GYQPERDKESIFQAICQAVEQEVQRYRQDAERLQALAKSLAANDLIAWLSQTNHLNQDPD 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRSV 242
           L+   +  +    F Y+R FA+GLF LLE ++             ++ + A L++++  +
Sbjct: 126 LQSQLQAIANNSQFKYNRLFAIGLFSLLEQSDPDLVKDEKQRTDAIKTIAAGLHLSEDKL 185

Query: 243 DRDLDVYRNLLSKLLQA 259
            +DL++YR+ L K+ QA
Sbjct: 186 SKDLELYRSNLEKMSQA 202


>gi|291567260|dbj|BAI89532.1| hypothetical protein [Arthrospira platensis NIES-39]
          Length = 243

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 74/203 (36%), Positives = 112/203 (55%), Gaps = 9/203 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  ++ RPI SIY  V++EL+V+ HL+     ++YDP++ALG VT +DR M+
Sbjct: 6   TVSDTKRAFYHIHTRPINSIYRRVVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 189
           GY  E D+ +I+ A I A + DP QYR DA  LE          L +    ++E   +  
Sbjct: 66  GYIPEADKLSIWAALIGAQESDPNQYRADATALEAQVASLAVKDLTDKAKMAQESSGDDP 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEP-------TVLEKLCAVLNVNKRSV 242
           L+      +    F YSR  A+GL+ LLE ++AT         T+L      L + K  +
Sbjct: 126 LQSCFHAIANNPKFKYSRLLAIGLYTLLEKSDATAAQDSEGLKTILSDFSEALRLPKDKL 185

Query: 243 DRDLDVYRNLLSKLLQAKELLKE 265
            +DLD+YR  L K+ QA+ ++ E
Sbjct: 186 VKDLDLYRTNLEKVAQARLMVDE 208


>gi|427719034|ref|YP_007067028.1| Protein thf1 [Calothrix sp. PCC 7507]
 gi|427351470|gb|AFY34194.1| Protein thf1 [Calothrix sp. PCC 7507]
          Length = 235

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 74/201 (36%), Positives = 122/201 (60%), Gaps = 17/201 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+ ++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNSIYALGVVTTFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
           GY  E D+E+IF A   A++++P++YR DA++L   A+   A+ L+ + S      ++ +
Sbjct: 66  GYLPERDQESIFNALCHAVEQEPQRYRQDAERLRVLAKSLPANDLIAWLSQTTHLDQDAD 125

Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA----TEPTVLEKLCAV---LNVN 238
           ++  L+ IA       NF YSR  A+GLF LLEL++      E    E L A+   L ++
Sbjct: 126 LQAQLQAIA----NNPNFKYSRLLAIGLFTLLELSDPELVKDEKQRNEALKAIATGLQLS 181

Query: 239 KRSVDRDLDVYRNLLSKLLQA 259
              +++DL++YR+ L K+ QA
Sbjct: 182 DEKLNKDLELYRSNLDKIAQA 202


>gi|434388267|ref|YP_007098878.1| photosystem II biogenesis protein Psp29 [Chamaesiphon minutus PCC
           6605]
 gi|428019257|gb|AFY95351.1| photosystem II biogenesis protein Psp29 [Chamaesiphon minutus PCC
           6605]
          Length = 234

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 74/205 (36%), Positives = 116/205 (56%), Gaps = 13/205 (6%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK NF   + RPI SIY  V++EL+V+ HL+     + YDP++ALG V+ +DR M 
Sbjct: 6   TVSDTKRNFYSQHTRPINSIYRRVVEELMVEMHLLSTNVDFAYDPIYALGVVSSFDRFMT 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF---PSKEGEVEG 188
            Y  E D+++IF A   ++  + +QYR DA  +EE+AR    S ++++   P+ +G    
Sbjct: 66  SYRPEADKQSIFVALCESMGGNAQQYRTDATAVEEFARSMQGSDIIDWIAHPTADGMGAQ 125

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELA--------NATEPTVLEKLCAVLNVNKR 240
           L   +   AS    F YSR F +GLF +LE A           E  VL+ +   L++ K 
Sbjct: 126 LATTLQSIASNP-KFKYSRLFGIGLFTILEQAAPDLLKDEKKREAAVLQ-IAEALHLPKD 183

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKE 265
              +DLD YR+ L KL+Q + ++ +
Sbjct: 184 KAQKDLDTYRSNLDKLVQMEAVMAD 208


>gi|332705256|ref|ZP_08425337.1| photosystem II biogenesis protein Psp29 [Moorea producens 3L]
 gi|332355999|gb|EGJ35458.1| photosystem II biogenesis protein Psp29 [Moorea producens 3L]
          Length = 257

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 73/206 (35%), Positives = 117/206 (56%), Gaps = 11/206 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK +F   + RPI SIY  V++EL+V+ HL+     + YDP++ LG VT +DR M+
Sbjct: 6   TVSDTKRDFYTYHTRPINSIYRRVVEELMVEMHLLSVNVDFNYDPIYGLGVVTCFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLE---EWARGQTASSLVEFPSKEGEVEG 188
            Y  E D+E+IF A   A+  + +QY+ DAQ+L+   +   GQ   S +  P+ E     
Sbjct: 66  SYQPENDKESIFNALCQAVGGEAQQYQEDAQRLKTSVDSMSGQDLISWLSSPTSENGSGD 125

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLEKLCAVLNVNKRS 241
           L   IA  A     F YSR FA+GLF LLE  ++           V+  + + LN+    
Sbjct: 126 LATTIAAIAQ-NSQFKYSRLFAIGLFSLLEQTDSELAQDQKQLEEVINNISSGLNLPSEK 184

Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYV 267
           + +DL++YR+ L K+ QA+ ++++ +
Sbjct: 185 LQKDLELYRSNLEKMAQARVVIEDAI 210


>gi|414076688|ref|YP_006996006.1| photosystem II biogenesis protein Psp29 [Anabaena sp. 90]
 gi|413970104|gb|AFW94193.1| photosystem II biogenesis protein Psp29 [Anabaena sp. 90]
          Length = 223

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 73/197 (37%), Positives = 115/197 (58%), Gaps = 9/197 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + YD ++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYTLHTRPINTIYRRVVEELMVEMHLLSVNVDFSYDAIYALGVVTTFDRFMD 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV--EGL 189
           GY  E+D+E+IF+A   A+++DP+ YR DA +L+  A    A  L+   S+   +  +  
Sbjct: 66  GYQPEQDKESIFRAICQAVEQDPQSYRQDASRLQALAASLPAKDLIASLSQASPLNQDAD 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATE--PTVLEKLCAVLNVNKRSV 242
           L+   E  +   NF YSR F VGLF LL     EL    E     L+ +   L++++  +
Sbjct: 126 LQKQLEAVAANSNFKYSRLFGVGLFALLVQSDPELVKKDEQRAEALKAISNGLHISEDKL 185

Query: 243 DRDLDVYRNLLSKLLQA 259
            +DL++Y + L K+ QA
Sbjct: 186 IKDLELYSSNLEKMAQA 202


>gi|56750022|ref|YP_170723.1| Thf1-like protein [Synechococcus elongatus PCC 6301]
 gi|81300364|ref|YP_400572.1| Thf1-like protein [Synechococcus elongatus PCC 7942]
 gi|56684981|dbj|BAD78203.1| hypothetical protein [Synechococcus elongatus PCC 6301]
 gi|81169245|gb|ABB57585.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 280

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 78/240 (32%), Positives = 129/240 (53%), Gaps = 20/240 (8%)

Query: 43  FCHCVRFRASSSSSRMIIQCMSTATDVP-----PTVAETKMNFLKLYKRPIPSIYNTVLQ 97
           F  C+  RA ++ +     C S+   VP     PTV+++K  F   Y RPI  +Y  V++
Sbjct: 3   FGRCLAKRAIAAHN-----CSSSCASVPNVTSVPTVSDSKRAFYAAYPRPINPLYRRVVE 57

Query: 98  ELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQY 157
           EL+V+ HL+    ++ YDP+FALG VT +D  M  Y   E    +F A   A++++PEQY
Sbjct: 58  ELLVEIHLLSVNTSFVYDPLFALGVVTAFDSFMSSYRPIEAVGPLFTALTQAVRQNPEQY 117

Query: 158 RIDAQKLEEWARGQTASSLVEFPSKE---GEVEGLLKDIAERASGKGNFSYSRFFAVGLF 214
           R DA  + E  RG  + ++ ++ ++    G    L++   +  +G+  F YSR FA+GLF
Sbjct: 118 RHDANAIAEQVRGVGSDTIRQWLTEAEALGNAPELVRSSFQAIAGRSEFKYSRLFAIGLF 177

Query: 215 RLLELANAT---EPTVLEKLCAVL----NVNKRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
            LLE A      +P  L+     +    ++    + +DLD+YR+ L K+ QA+  ++E +
Sbjct: 178 SLLETAAPDLVQDPEALKTTVTAIAERFHLPSDKLQKDLDLYRSNLEKMEQARITMEEAI 237


>gi|428304539|ref|YP_007141364.1| Protein thf1 [Crinalium epipsammum PCC 9333]
 gi|428246074|gb|AFZ11854.1| Protein thf1 [Crinalium epipsammum PCC 9333]
          Length = 243

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 72/203 (35%), Positives = 120/203 (59%), Gaps = 9/203 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK +F   + RPI SIY  V++EL+V+ HL+     + Y P++ALG VT Y++ M+
Sbjct: 6   TVSDTKRDFYNNHTRPINSIYRRVVEELMVEMHLLSVNVDFAYHPIYALGVVTSYEKFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
           GY  E DR++IF A + A+ ED ++Y+ DA++L+  A   +   L+++      V+G   
Sbjct: 66  GYRPERDRDSIFDALVGAVGEDSQRYKQDAEQLKALAGRLSGKELIDWIVSPTAVDGAGS 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT----EPTVLEKLCAV---LNVNKRSV 242
           L D     +    F YSR FA+GL+ LLE+++ +    E   L+ L  V   L++    +
Sbjct: 126 LPDQMRAIANNPQFKYSRLFAIGLYTLLEVSDPSLVKDEKERLDALNQVGQSLHLPTEKL 185

Query: 243 DRDLDVYRNLLSKLLQAKELLKE 265
            +DLD+YR+ L K+ Q +  +K+
Sbjct: 186 HKDLDLYRSNLEKMAQVQIAMKD 208


>gi|428219024|ref|YP_007103489.1| Protein thf1 [Pseudanabaena sp. PCC 7367]
 gi|427990806|gb|AFY71061.1| Protein thf1 [Pseudanabaena sp. PCC 7367]
          Length = 260

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 116/206 (56%), Gaps = 10/206 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++ K +F + + RPI S+Y  V+ EL+V+ HL+   +T+ YDPVFALG +T YDR M 
Sbjct: 6   TVSDAKRDFFQAFPRPINSVYRRVVDELLVEMHLLTVNQTFAYDPVFALGAITAYDRFML 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
           GY  E +R+ I  A   A+  + EQ R DA  L E A  ++   + +F +     E L  
Sbjct: 66  GYEPESERDRILPAICGAVHLNAEQMRHDASSLAELAM-RSPIDVKQFLTSLETTENLEP 124

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELA-------NATEPTVLEKLCAVLNVNKRSV 242
           L       +    F YSR FA+GLF LLE A       N     +++++   LN+    +
Sbjct: 125 LTGTIRAIAANQKFKYSRLFAIGLFTLLETADPNTMSDNDKRQELIKQVGDALNLGSEKL 184

Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVD 268
            +DLD+YR+ L K+ QA++++K+ V+
Sbjct: 185 IKDLDLYRSNLEKVEQARQMMKDLVE 210


>gi|443317266|ref|ZP_21046682.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 6406]
 gi|442783151|gb|ELR93075.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 6406]
          Length = 251

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 74/203 (36%), Positives = 114/203 (56%), Gaps = 8/203 (3%)

Query: 71  PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
           PTV++TK  F   + RPI S+Y  V++EL+V+ HL+R    + YDPV+ALG VT +DR M
Sbjct: 5   PTVSDTKRAFYSYHNRPIASVYRRVIEELMVEMHLLRVNEDFVYDPVYALGIVTTFDRFM 64

Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE-FPSKEGEVEGL 189
            GY  E D  +IF A   A     +QYR DA+ +     G++  +L     S+  E   L
Sbjct: 65  AGYRPEADEASIFAALCQANAGTADQYRRDAEVMVAEVSGRSLDALKAILISRSAEGADL 124

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS-------V 242
           LK + +  + +  F YSR FA+GL+ L+E  +A      EKL  +L     S       +
Sbjct: 125 LKGVLQGIADRDRFKYSRAFAIGLYTLIETVDAEILKDKEKLMELLKAVAESLPLSFDKL 184

Query: 243 DRDLDVYRNLLSKLLQAKELLKE 265
            +D+++YR+ L+K+ QAK ++ +
Sbjct: 185 QKDVELYRSNLTKMEQAKIVMAD 207


>gi|254423933|ref|ZP_05037651.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
           7335]
 gi|196191422|gb|EDX86386.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
           7335]
          Length = 250

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 70/208 (33%), Positives = 120/208 (57%), Gaps = 14/208 (6%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F   + RPI +IY  V++EL+V+ HL+     + YD ++ALG V+ YDR M+
Sbjct: 6   TVSDTKRAFYSQHTRPINAIYRRVVEELMVEAHLLLVNADFNYDSIYALGVVSTYDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS--KEGEVEG- 188
           GY    DR+ I++A + A + DP+QYR DA++L      ++  S+  F S   E + E  
Sbjct: 66  GYEPAGDRDNIYRAILQANEADPDQYRRDAEEL--LGVAKSLPSIDAFKSILDEAKTESG 123

Query: 189 --LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNK 239
              LK    +A     F YSR FA+GL+ ++E  +A           ++ ++ + + +N+
Sbjct: 124 SDTLKANLHKAISNPKFKYSRLFAIGLYNVIESIDADMLNDKDKRDALMAEIASTIGLNE 183

Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYV 267
             + +D+D+YR  L K+ QA+E++K+ +
Sbjct: 184 DLLKKDIDLYRGNLEKMAQAQEVMKDMI 211


>gi|428313474|ref|YP_007124451.1| photosystem II biogenesis protein Psp29 [Microcoleus sp. PCC 7113]
 gi|428255086|gb|AFZ21045.1| photosystem II biogenesis protein Psp29 [Microcoleus sp. PCC 7113]
          Length = 241

 Score =  126 bits (317), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 71/206 (34%), Positives = 119/206 (57%), Gaps = 11/206 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK +F   + RP+ SI+  V++EL+V+ HL+     + Y+P++ALG VT ++R ME
Sbjct: 6   TVSDTKRDFYNHHTRPVNSIFRRVVEELMVEMHLLSVNVDFHYEPIYALGVVTSFNRFME 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF---PSKEGEVEG 188
           GY  E D+ +IF A   ++  +PEQY+ DAQ LE  A   T   LV +   P  +  +  
Sbjct: 66  GYRPERDKASIFDALCHSVGNNPEQYKQDAQWLESMAERVTGEELVSWLSAPRPQDTLGD 125

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 241
           L   +A  A     F YSR FA+GL+ LLE A++            L+K+   L++ +  
Sbjct: 126 LYAAVAAIAENP-KFKYSRLFAIGLYTLLEKADSELVQDEKRRTEALKKISDGLHLPEEK 184

Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYV 267
           + +DL++YR+ L K+ Q + ++++ +
Sbjct: 185 LQKDLELYRSNLQKMEQVRIVIEDAI 210


>gi|97202823|sp|Q5N664.2|THF1_SYNP6 RecName: Full=Protein thf1
 gi|97202830|sp|Q31MY4.2|THF1_SYNE7 RecName: Full=Protein thf1
          Length = 254

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 70/207 (33%), Positives = 116/207 (56%), Gaps = 10/207 (4%)

Query: 71  PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
           PTV+++K  F   Y RPI  +Y  V++EL+V+ HL+    ++ YDP+FALG VT +D  M
Sbjct: 5   PTVSDSKRAFYAAYPRPINPLYRRVVEELLVEIHLLSVNTSFVYDPLFALGVVTAFDSFM 64

Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE---GEVE 187
             Y   E    +F A   A++++PEQYR DA  + E  RG  + ++ ++ ++    G   
Sbjct: 65  SSYRPIEAVGPLFTALTQAVRQNPEQYRHDANAIAEQVRGVGSDTIRQWLTEAEALGNAP 124

Query: 188 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT---EPTVLEKLCAVL----NVNKR 240
            L++   +  +G+  F YSR FA+GLF LLE A      +P  L+     +    ++   
Sbjct: 125 ELVRSSFQAIAGRSEFKYSRLFAIGLFSLLETAAPDLVQDPEALKTTVTAIAERFHLPSD 184

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYV 267
            + +DLD+YR+ L K+ QA+  ++E +
Sbjct: 185 KLQKDLDLYRSNLEKMEQARITMEEAI 211


>gi|428223137|ref|YP_007107307.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
           7502]
 gi|427996477|gb|AFY75172.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
           7502]
          Length = 226

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 72/205 (35%), Positives = 115/205 (56%), Gaps = 10/205 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TVA+ K +F K + +P+ SIY  V+ EL+V+ HL+R  + + YD +FALG  T +DR M 
Sbjct: 6   TVADAKHDFYKAFSKPVNSIYRRVVDELLVEVHLLRVSQNFGYDSIFALGLATAFDRFMA 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE--GEVEGL 189
           GY  E D E IF+    AL  DP+Q R ++  L E ++   A     F + E   +++ L
Sbjct: 66  GYQPESDLEPIFKGLCQALLFDPDQIRQESAHLIELSKQFPAEVKSLFTTLEAGADLDTL 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 242
           +  I   A+    F YSR FAVG+F LLE A+            ++ ++   L +N   +
Sbjct: 126 MGQIRAIATNP-KFKYSRLFAVGVFILLETADPEAIADQDKRQALITQVGDTLKINSERL 184

Query: 243 DRDLDVYRNLLSKLLQAKELLKEYV 267
            +DLD+YR+ L K+ Q ++++++ V
Sbjct: 185 LKDLDLYRSNLEKIQQGRQMMEDMV 209


>gi|218442064|ref|YP_002380393.1| Thf1-like protein [Cyanothece sp. PCC 7424]
 gi|254784142|sp|B7KI38.1|THF1_CYAP7 RecName: Full=Protein thf1
 gi|218174792|gb|ACK73525.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7424]
          Length = 226

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 71/208 (34%), Positives = 123/208 (59%), Gaps = 18/208 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     +QYDPV+ALG VT + R M+
Sbjct: 6   TVSDSKRDFYTKHTRPINSVYRRVVEELMVEMHLLSVNSDFQYDPVYALGVVTSFQRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV-------EFPSKEG 184
           GY  + D+E+IF A   ++  DP+QYR DA+++ E A+  +A  L+       +  S E 
Sbjct: 66  GYRPDADKESIFNALCQSVGGDPQQYRQDAERMIESAKQLSAQQLLFNLESASDSSSGEN 125

Query: 185 EVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNV 237
           ++   L  IA        + Y+R FA+G++ +L     E+   TE    V++++  VL++
Sbjct: 126 QILQTLIGIA----NAPKYKYTRLFAIGIYTILAETDPEMLKNTEKREEVVKQIAKVLHL 181

Query: 238 NKRSVDRDLDVYRNLLSKLLQAKELLKE 265
            +  + +DLD+YR+ L K+ Q   +++E
Sbjct: 182 PEEKMQKDLDLYRSNLEKMDQLLTVIEE 209


>gi|257059049|ref|YP_003136937.1| Thf1-like protein [Cyanothece sp. PCC 8802]
 gi|256589215|gb|ACV00102.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 8802]
          Length = 235

 Score =  123 bits (309), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 112/197 (56%), Gaps = 10/197 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK +F   + RPI SIY   ++EL+V+ HL+     ++YDP++ALG V  + + M+
Sbjct: 6   TVSDTKRDFYTHHTRPINSIYRRFIEELLVEMHLLCVNIDFRYDPIYALGVVASFQQFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEV-EG 188
           GY  EED+ +IF A   A+  D E+YR +AQ L    +G + S L+     ++ GE  EG
Sbjct: 66  GYRPEEDKNSIFSALCQAVGGDGEKYRHEAQTLLNQVKGMSVSDLIAMGNSARTGEPGEG 125

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 241
           +L +  +  +    F YSR FA+GL+ ++   +A              +LC  LN++   
Sbjct: 126 MLFNTLQAIANNPQFKYSRLFAIGLYTMVMEIDADLLKEQDKRNETFSQLCNGLNLSSDK 185

Query: 242 VDRDLDVYRNLLSKLLQ 258
           + +DLD+YR+ + K+ Q
Sbjct: 186 LQKDLDLYRSNVDKMGQ 202


>gi|218245998|ref|YP_002371369.1| Thf1-like protein [Cyanothece sp. PCC 8801]
 gi|254784143|sp|B7K277.1|THF1_CYAP8 RecName: Full=Protein thf1
 gi|218166476|gb|ACK65213.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 8801]
          Length = 235

 Score =  123 bits (309), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 112/197 (56%), Gaps = 10/197 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK +F   + RPI SIY   ++EL+V+ HL+     ++YDP++ALG V  + + M+
Sbjct: 6   TVSDTKRDFYNHHTRPINSIYRRFIEELLVEMHLLCVNIDFRYDPIYALGVVASFQQFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEV-EG 188
           GY  EED+ +IF A   A+  D E+YR +AQ L    +G + S L+     ++ GE  EG
Sbjct: 66  GYRPEEDKNSIFSALCQAVGGDGEKYRHEAQTLLNQVKGMSVSDLIAMGNSARTGEPGEG 125

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 241
           +L +  +  +    F YSR FA+GL+ ++   +A              +LC  LN++   
Sbjct: 126 MLYNTLQAIAKNPQFKYSRLFAIGLYTMVMEIDADLLKEQDKRNETFSQLCNGLNLSSDK 185

Query: 242 VDRDLDVYRNLLSKLLQ 258
           + +DLD+YR+ + K+ Q
Sbjct: 186 LQKDLDLYRSNVDKMGQ 202


>gi|434398071|ref|YP_007132075.1| Protein thf1 [Stanieria cyanosphaera PCC 7437]
 gi|428269168|gb|AFZ35109.1| Protein thf1 [Stanieria cyanosphaera PCC 7437]
          Length = 238

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 72/209 (34%), Positives = 122/209 (58%), Gaps = 11/209 (5%)

Query: 60  IQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFA 119
           I   +  TD   TV++ K +F + + RPI S+Y  V++EL+V+ HL+     ++ DP++ 
Sbjct: 4   IAVGNKTTDNIRTVSDAKRDFYQHHTRPINSVYRRVVEELLVEMHLLSVNVDFKSDPIYY 63

Query: 120 LGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
           LG VT ++RLM+GY  E+D+E+IF A   A+ EDPE+ R  A  L   A+ ++   LV +
Sbjct: 64  LGVVTSFERLMQGYRPEQDKESIFNALCRAVGEDPERNRAQAGSLLNLAKNKSPQELVAW 123

Query: 180 PSKEGEVEG---LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLE 229
            S+   +E    +++ I   AS   +F YSR FA+GL+ LLE ++       +    +LE
Sbjct: 124 LSEPTPLENYHDIIEPIKAIASNP-HFKYSRLFAIGLYTLLEESDPEILKDVSKRNEILE 182

Query: 230 KLCAVLNVNKRSVDRDLDVYRNLLSKLLQ 258
            +   L++    +++DL++YR+ L K+ Q
Sbjct: 183 SIATQLHLPGEKMNKDLELYRSNLEKMEQ 211


>gi|443328840|ref|ZP_21057433.1| photosystem II biogenesis protein Psp29 [Xenococcus sp. PCC 7305]
 gi|442791576|gb|ELS01070.1| photosystem II biogenesis protein Psp29 [Xenococcus sp. PCC 7305]
          Length = 270

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 69/206 (33%), Positives = 118/206 (57%), Gaps = 12/206 (5%)

Query: 63  MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
           ++T T++  TV++TK +F   Y +PI S+Y  +++EL+V+ HL+     ++ DP+F LG 
Sbjct: 4   IATVTNLK-TVSDTKRSFYNNYNKPINSVYRRIVEELLVEMHLLSVNADFKSDPIFYLGV 62

Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
           V+ ++RLM+GY  ++D+ AIF A   A+  DPE YR  A  L   A+ ++   L+ +  +
Sbjct: 63  VSCFERLMQGYQPDQDKGAIFNALCRAVDGDPESYRAQAGNLLAIAKEKSGEELIAWLGE 122

Query: 183 EGEVEGLLKDIAE---RASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLC 232
              + G  ++IAE     +   NF YSR F +GL+ LLE A+A           + E + 
Sbjct: 123 PTAIAG-AENIAETIKSIAANANFKYSRPFGIGLYTLLEEADAKLLEDSDKRNEIFENIA 181

Query: 233 AVLNVNKRSVDRDLDVYRNLLSKLLQ 258
             L++    + +DL++YR+ L K+ Q
Sbjct: 182 KTLSLPGDKMKKDLELYRSNLEKMEQ 207


>gi|359462375|ref|ZP_09250938.1| Thf1-like protein [Acaryochloris sp. CCMEE 5410]
          Length = 214

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 65/181 (35%), Positives = 108/181 (59%), Gaps = 16/181 (8%)

Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
           +V+ HL+R    ++YDP+FALG  T +DR M+GY  E D++AIF A   A + DP Q + 
Sbjct: 1   MVEMHLLRVNEDFRYDPIFALGVTTSFDRFMDGYQPENDKDAIFSAICKAQEADPVQMQK 60

Query: 160 DAQKLEEWARGQTASSLVEF-----PSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLF 214
           D Q+L E A+ ++A  ++++      S   E++  L++IA+       F YSR FA+GLF
Sbjct: 61  DGQRLTELAQSKSAQEMLDWITQAANSGGDELQWQLRNIAQNPK----FKYSRLFAIGLF 116

Query: 215 RLLELA--NATE-----PTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
            LLEL+  N T+        L  +C VLN+++  + +DL++YR  L K+ Q ++ + + +
Sbjct: 117 TLLELSEGNITQDEESLAEFLPNICTVLNISESKLQKDLEIYRGNLDKIAQVRQAMDDIL 176

Query: 268 D 268
           +
Sbjct: 177 E 177


>gi|170077355|ref|YP_001733993.1| Thf1-like protein [Synechococcus sp. PCC 7002]
 gi|254784146|sp|B1XHY6.1|THF1_SYNP2 RecName: Full=Protein thf1
 gi|169885024|gb|ACA98737.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
          Length = 254

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 72/206 (34%), Positives = 119/206 (57%), Gaps = 15/206 (7%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK +F   + RPI SI+  V++EL+V+ HL+     ++YDP +ALG VT ++R M+
Sbjct: 6   TVSDTKRDFYTHHTRPINSIFRRVVEELLVEMHLLSVNADFRYDPFYALGVVTSFERFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
           GY  E D+ +IFQ+   A+  D  +Y+ DA  L E A+  + + L+E   ++   EG   
Sbjct: 66  GYRPEADKVSIFQSMCQAIGGDANRYKEDAMALVELAKRCSGTQLIECFRQDVPPEGAQE 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK----------LCAVLNVNK 239
           L +  E  +   +F YSR FA+G++  L     +EP +LE           + A LN+ +
Sbjct: 126 LWEKIEAIAKNDHFKYSRLFAIGVYTFL---GESEPQLLEDTEKRDEMLTTVTAGLNLPE 182

Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKE 265
             + +DLD+YR+ L K+ Q  E+L++
Sbjct: 183 EKMKKDLDLYRSNLEKMNQVLEVLED 208


>gi|428302138|ref|YP_007140444.1| Protein thf1 [Calothrix sp. PCC 6303]
 gi|428238682|gb|AFZ04472.1| Protein thf1 [Calothrix sp. PCC 6303]
          Length = 235

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 67/201 (33%), Positives = 112/201 (55%), Gaps = 17/201 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F  ++ RPI +IY  V++EL+V+ HL+     + Y+P++ALG  T ++R M+
Sbjct: 6   TVSDTKKTFYSIHTRPINTIYRRVVEELMVEMHLLSVNTDFTYNPIYALGVATAFERFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK------EGE 185
           GY  E+D+E +F A   +++ D ++ + +A  L++ A   +   L+   S+       GE
Sbjct: 66  GYDPEKDKEQLFHALCQSVEIDTQKIKQEAHSLKDVAASMSVGDLISCLSRAKRFDNAGE 125

Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVN 238
           ++  L  IA        F YSR FA+GLF LLE A+             L  +   LN++
Sbjct: 126 LQNQLDAIA----SNPKFKYSRLFAIGLFSLLEAASPETVKDEKQRNDALVSIAKGLNIS 181

Query: 239 KRSVDRDLDVYRNLLSKLLQA 259
           +  + +DLD+YR+ L K+ QA
Sbjct: 182 EDKLSKDLDLYRSNLDKMAQA 202


>gi|307155000|ref|YP_003890384.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7822]
 gi|306985228|gb|ADN17109.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7822]
          Length = 233

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 68/212 (32%), Positives = 121/212 (57%), Gaps = 25/212 (11%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ M+
Sbjct: 6   TVSDSKRDFYSKHTRPINSVYRRVVEELLVETHLLSVNSDFHYDPIYALGVVTSFEQFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA--------SSLVEFPSKE 183
           GY  E D+E+IF A   ++  DP+QYR DAQ +   A+  +A        SS + +P  +
Sbjct: 66  GYRPETDKESIFNALCQSVGGDPQQYRGDAQSILSTAKQLSAQDLLSKLQSSSIAYPQGD 125

Query: 184 GEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEP----------TVLEKLCA 233
            ++   L  IA        F Y+R FA+G++ +L     T+P           V++++  
Sbjct: 126 NKIIETLVAIA----NAPKFKYTRLFAIGIYTIL---AETDPELLKDQQKRHEVIKQIAE 178

Query: 234 VLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 265
           +L++ +  + +DLD+YR+ L K+ Q   +++E
Sbjct: 179 ILHLPEEKMQKDLDLYRSNLEKMEQLLTVIEE 210


>gi|427419843|ref|ZP_18910026.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 7375]
 gi|425762556|gb|EKV03409.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 7375]
          Length = 258

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 78/223 (34%), Positives = 125/223 (56%), Gaps = 21/223 (9%)

Query: 63  MSTATDVPP-----TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPV 117
           MS     PP     TV++TK  F   + RPI S+Y  V++EL+V+ HL+     + Y+P+
Sbjct: 1   MSNVLLPPPVTNLRTVSDTKRAFYNYHSRPINSLYRRVIEELMVEMHLLSVNVDFVYNPL 60

Query: 118 FALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL- 176
           +ALG +T +DR M GY  E+D+E+I  A   A++ DP+QYR DA+ L+      + S L 
Sbjct: 61  YALGVITSFDRFMVGYEPEQDKESILSAICQAVEGDPQQYRQDAEALKSDLANLSLSDLN 120

Query: 177 VEFPS-KEGEVEGL---LKDIAERASGKGNFSYSRFFAVGLFRLLELANATE-------P 225
            +  S K  +  GL   L  +A +AS K    Y+R  AVGL+ L E  + +         
Sbjct: 121 TQLASAKTTDGNGLQNKLHVVATQASAK----YTRLMAVGLYTLFETVDISSLEDKDSRE 176

Query: 226 TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVD 268
            +L+    +L +    VD+DL++YR+ L K+ QA+E++K+ ++
Sbjct: 177 EMLKTAAEMLALPAEKVDKDLELYRSNLDKMAQAQEVMKDILE 219


>gi|425459592|ref|ZP_18839078.1| Protein thf1 [Microcystis aeruginosa PCC 9808]
 gi|389822632|emb|CCI29709.1| Protein thf1 [Microcystis aeruginosa PCC 9808]
          Length = 233

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 109/198 (55%), Gaps = 14/198 (7%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRHDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
            L D          F YSR FA+GL+ +L  A           EP +L+K   +L+++  
Sbjct: 130 -LSDSLVSVINAAKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 187

Query: 241 SVDRDLDVYRNLLSKLLQ 258
            + +DLDVYR  L K+ Q
Sbjct: 188 KLQKDLDVYRGNLDKMDQ 205


>gi|22298677|ref|NP_681924.1| Thf1-like protein [Thermosynechococcus elongatus BP-1]
 gi|81743247|sp|Q8DJT8.1|THF1_THEEB RecName: Full=Protein thf1
 gi|22294857|dbj|BAC08686.1| tlr1134 [Thermosynechococcus elongatus BP-1]
          Length = 222

 Score =  120 bits (300), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 79/206 (38%), Positives = 118/206 (57%), Gaps = 16/206 (7%)

Query: 70  PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
           P TV++TK  F   + RPI SIY   ++EL+V+ HL+R    ++Y P+FALG VT +D+ 
Sbjct: 4   PRTVSDTKRAFYAAHTRPIHSIYRRFIEELLVEIHLLRVNVDFRYSPLFALGVVTAFDQF 63

Query: 130 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 189
           MEGY  E DR+ IF A   A + +P+Q + DA   +++     +  L E  S  G+    
Sbjct: 64  MEGYQPEGDRDRIFHALCVAEEMNPQQLKEDAASWQQYQGRPLSQILDELNS--GQPSAP 121

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLL-ELANATEPTV-----LEKLCAVLNVNKRSVD 243
           L  +    +GK    YSR  AVGL+  L ELA   E T+     L++L  V+ +    V 
Sbjct: 122 LNSLNH--TGK----YSRLHAVGLYAFLQELAG--EVTIHLNETLDQLAPVIPLPIEKVK 173

Query: 244 RDLDVYRNLLSKLLQAKELLKEYVDR 269
           RDL++YR+ L K+ QA+ L+KE V++
Sbjct: 174 RDLELYRSNLDKINQARSLMKELVEQ 199


>gi|440752363|ref|ZP_20931566.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
           TAIHU98]
 gi|440176856|gb|ELP56129.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
           TAIHU98]
          Length = 228

 Score =  120 bits (300), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 109/198 (55%), Gaps = 14/198 (7%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 6   TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 66  GYRPGEDKPNIFNALCQAVNGNPEVYRHDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 124

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
            L D          F YSR FA+GL+ +L  A           EP +L+K   +L+++  
Sbjct: 125 -LSDSLVSVINAAKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 182

Query: 241 SVDRDLDVYRNLLSKLLQ 258
            + +DLDVYR  L K+ Q
Sbjct: 183 KLQKDLDVYRGNLDKMDQ 200


>gi|37520969|ref|NP_924346.1| Thf1-like protein [Gloeobacter violaceus PCC 7421]
 gi|81710432|sp|Q7NKS7.1|THF1_GLOVI RecName: Full=Protein thf1
 gi|35211965|dbj|BAC89341.1| glr1400 [Gloeobacter violaceus PCC 7421]
          Length = 228

 Score =  120 bits (300), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 74/205 (36%), Positives = 116/205 (56%), Gaps = 11/205 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K  F   Y RP+ SIY  V+ EL+V+ HL+   + +++DP+FA G +T Y  LME
Sbjct: 6   TVSDSKRAFFAAYPRPVNSIYRRVIDELLVEVHLLITNQDFRHDPLFATGLLTAYQALME 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV-EGLL 190
           GY   E R+AI +A  TAL+   EQ   DA +    A    A  ++E  + + E  +G L
Sbjct: 66  GYTPVEQRDAILRALCTALELSYEQLHTDAAQWRAIAAELPAQEVLEVMAGKREAGDGRL 125

Query: 191 KDIAERASGKGN---FSYSRFFAVGLFRLLELAN----ATEPTVLEKL---CAVLNVNKR 240
           K + +  +G  N   F YSR F++GL  +LE A      +E   LE+L   C  L ++  
Sbjct: 126 KAMGDTLAGIANAERFKYSRLFSLGLANILEQAGRAAAMSEKDRLERLQQICTYLKLDYN 185

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKE 265
            V RDLD + ++L ++ ++KE++ E
Sbjct: 186 RVKRDLDFFHSVLERIKRSKEVVDE 210


>gi|425436789|ref|ZP_18817221.1| Protein thf1 [Microcystis aeruginosa PCC 9432]
 gi|425451594|ref|ZP_18831415.1| Protein thf1 [Microcystis aeruginosa PCC 7941]
 gi|389678450|emb|CCH92698.1| Protein thf1 [Microcystis aeruginosa PCC 9432]
 gi|389767069|emb|CCI07461.1| Protein thf1 [Microcystis aeruginosa PCC 7941]
          Length = 233

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 109/198 (55%), Gaps = 14/198 (7%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENIIAIAKETNIDSLLSQLQNPALGGNNQ- 129

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
            L D          F YSR FA+GL+ +L  A           EP +L+K   +L+++  
Sbjct: 130 -LSDSLVSVINAAKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 187

Query: 241 SVDRDLDVYRNLLSKLLQ 258
            + +DLDVYR  L K+ Q
Sbjct: 188 KLQKDLDVYRGNLDKMDQ 205


>gi|422302142|ref|ZP_16389506.1| Protein thf1 [Microcystis aeruginosa PCC 9806]
 gi|389788699|emb|CCI15466.1| Protein thf1 [Microcystis aeruginosa PCC 9806]
          Length = 233

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 110/198 (55%), Gaps = 14/198 (7%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
            L D          F YSR FA+GL+ +L  A           EP +L+K   +L+++  
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREP-ILQKFSEILHLSSE 187

Query: 241 SVDRDLDVYRNLLSKLLQ 258
            + +DLDVYR+ L K+ Q
Sbjct: 188 KLQKDLDVYRSNLDKMDQ 205


>gi|425453632|ref|ZP_18833389.1| Protein thf1 [Microcystis aeruginosa PCC 9807]
 gi|389800936|emb|CCI19831.1| Protein thf1 [Microcystis aeruginosa PCC 9807]
          Length = 233

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 109/198 (55%), Gaps = 14/198 (7%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
            L D          F YSR FA+GL+ +L  A           EP +L+K   +L+++  
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREP-ILQKFSEILHLSSE 187

Query: 241 SVDRDLDVYRNLLSKLLQ 258
            + +DLDVYR  L K+ Q
Sbjct: 188 KLQKDLDVYRGNLDKMDQ 205


>gi|425445848|ref|ZP_18825868.1| Protein thf1 [Microcystis aeruginosa PCC 9443]
 gi|389734049|emb|CCI02237.1| Protein thf1 [Microcystis aeruginosa PCC 9443]
          Length = 233

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 109/198 (55%), Gaps = 14/198 (7%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
            L D          F YSR FA+GL+ +L  A           EP +L+K   +L+++  
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 187

Query: 241 SVDRDLDVYRNLLSKLLQ 258
            + +DLDVYR  L K+ Q
Sbjct: 188 KLQKDLDVYRGNLDKMDQ 205


>gi|390439536|ref|ZP_10227927.1| Protein thf1 [Microcystis sp. T1-4]
 gi|389837025|emb|CCI32051.1| Protein thf1 [Microcystis sp. T1-4]
          Length = 233

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 71/209 (33%), Positives = 109/209 (52%), Gaps = 36/209 (17%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY   ED+  IF A   A+  +PE YR DA+ +   A             KE  ++ LL 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIA-------------KETNIDSLLS 117

Query: 192 DIAERASGKGN--------------FSYSRFFAVGLFRLLELANAT--------EPTVLE 229
            +  +A G  N              F YSR FA+GL+ +L  A           EP +L+
Sbjct: 118 QLQNQALGGDNQLSDSLVSLINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREP-ILQ 176

Query: 230 KLCAVLNVNKRSVDRDLDVYRNLLSKLLQ 258
           K   +L+++   + +DLDVYR  L K+ Q
Sbjct: 177 KFSEILHLSGEKLQKDLDVYRGNLDKMDQ 205


>gi|443669636|ref|ZP_21134837.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
           DIANCHI905]
 gi|159030831|emb|CAO88510.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443330085|gb|ELS44832.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
           DIANCHI905]
          Length = 228

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 109/198 (55%), Gaps = 14/198 (7%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 6   TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 66  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENIIAIAKETNIDSLLSQLQNPALGGNNQ- 124

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
            L D          F YSR FA+GL+ +L  A           EP +L+K   +L+++  
Sbjct: 125 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 182

Query: 241 SVDRDLDVYRNLLSKLLQ 258
            + +DLDVYR  L K+ Q
Sbjct: 183 KLQKDLDVYRGNLDKMDQ 200


>gi|425470743|ref|ZP_18849603.1| Protein thf1 [Microcystis aeruginosa PCC 9701]
 gi|389883502|emb|CCI36111.1| Protein thf1 [Microcystis aeruginosa PCC 9701]
          Length = 233

 Score =  117 bits (294), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 107/197 (54%), Gaps = 12/197 (6%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRS 241
            L D          F YSR FA+GL+ +L  A             +L+K   +L+++   
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREQILQKFSEILHLSSEK 188

Query: 242 VDRDLDVYRNLLSKLLQ 258
           + +DLDVYR  L K+ Q
Sbjct: 189 LQKDLDVYRGNLDKMDQ 205


>gi|427726046|ref|YP_007073323.1| Protein thf1 [Leptolyngbya sp. PCC 7376]
 gi|427357766|gb|AFY40489.1| Protein thf1 [Leptolyngbya sp. PCC 7376]
          Length = 246

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 72/203 (35%), Positives = 107/203 (52%), Gaps = 23/203 (11%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++ K +F   + RPI SI+  V++EL+V+ HL+     ++YDP +ALG VT Y+R M+
Sbjct: 6   TVSDAKRDFYGQHTRPINSIFRRVVEELLVEMHLVSVNVDFRYDPFYALGIVTSYERFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE------FPSKEGE 185
           GY  E D+ +IFQA   A+    E Y+ DA  L E A+  +   LV+       P   GE
Sbjct: 66  GYRPESDKISIFQAMCQAVGGSAEFYKNDATALVELAKRCSGQQLVDCFRQDNAPEGAGE 125

Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK----------LCAVL 235
           +   ++ IA        F YSR FA+GL+  L      EP +LE           L   +
Sbjct: 126 LWAKVEAIA----ANKKFKYSRLFAIGLYTFL---GEAEPALLEDADKRDEMLATLTEAM 178

Query: 236 NVNKRSVDRDLDVYRNLLSKLLQ 258
           N+    + +DLD+YR+ L K+ Q
Sbjct: 179 NLPGEKMKKDLDLYRSNLEKMTQ 201


>gi|126658461|ref|ZP_01729609.1| hypothetical protein CY0110_21090 [Cyanothece sp. CCY0110]
 gi|126620203|gb|EAZ90924.1| hypothetical protein CY0110_21090 [Cyanothece sp. CCY0110]
          Length = 246

 Score =  117 bits (292), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 70/196 (35%), Positives = 108/196 (55%), Gaps = 12/196 (6%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F   + RPI SIY   ++EL+V+ HL+     ++YDP++ALG VT ++R M+
Sbjct: 6   TVSDTKRKFYGYHTRPINSIYRRFVEELLVEMHLLSVNVDFKYDPIYALGVVTSFERFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE--VEGL 189
           GY  E D+ +IF A   A+  + EQY  +A+ L   A+G    S+ EF  K G+   +G+
Sbjct: 66  GYRPESDKASIFNALCQAVDGNSEQYHQEAEALINEAKG---LSMTEFKDKLGQEGGDGI 122

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNVNKRSV 242
           L       +    F YSR F VGL+ LL     EL    E     ++++   L  +   +
Sbjct: 123 LWGTCNAIAQNPKFKYSRLFGVGLYTLLMEIDPELVKEEEKRNQTIKEVSEALQFSSDKL 182

Query: 243 DRDLDVYRNLLSKLLQ 258
            +DLD+YR+ L K+ Q
Sbjct: 183 QKDLDLYRSNLDKMQQ 198


>gi|425441488|ref|ZP_18821762.1| Protein thf1 [Microcystis aeruginosa PCC 9717]
 gi|425463770|ref|ZP_18843100.1| Protein thf1 [Microcystis aeruginosa PCC 9809]
 gi|389717772|emb|CCH98181.1| Protein thf1 [Microcystis aeruginosa PCC 9717]
 gi|389829228|emb|CCI29632.1| Protein thf1 [Microcystis aeruginosa PCC 9809]
          Length = 233

 Score =  116 bits (291), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 106/197 (53%), Gaps = 12/197 (6%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRS 241
            L D          F YSR FA+GL+ +L  A             +L+K   +L ++   
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREQILQKFSEILRLSSEK 188

Query: 242 VDRDLDVYRNLLSKLLQ 258
           + +DLDVYR  L K+ Q
Sbjct: 189 LQKDLDVYRGNLDKMDQ 205


>gi|97202816|sp|P0C1D1.1|THF1_SYNJB RecName: Full=Protein thf1
          Length = 239

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 77/206 (37%), Positives = 112/206 (54%), Gaps = 11/206 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T++ TK  F   Y RPI ++Y  V++EL+V+ HL     T+ YDP FALG VT+YD LME
Sbjct: 6   TLSATKAAFFSAYPRPINAVYRRVVEELLVELHLTTVNSTFVYDPFFALGLVTLYDGLME 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE--GL 189
            Y   E REAIF A   AL   PE  R +A+ L E          ++    + E E  G 
Sbjct: 66  AYHPPEQREAIFNALCKALHLKPEVLRKNARDLLELMGSGDPRQRLDLLCLKPEAEDVGG 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT---EPT-----VLEKLCAVLNVNKRS 241
           LK I ER + +  ++YSR  AVGL+   E+   +   EP       LE + + L  +   
Sbjct: 126 LKAILERMT-QPPYAYSRVLAVGLYTAYEVVAKSLYEEPEERTRRFLENVVSKLPFSTER 184

Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYV 267
           V +DL++YR+ L ++ QA+ +++E V
Sbjct: 185 VRKDLELYRSSLDRMKQARAVVEEMV 210


>gi|166367182|ref|YP_001659455.1| Thf1-like protein [Microcystis aeruginosa NIES-843]
 gi|166089555|dbj|BAG04263.1| Psb29 Photosystem II sub-stoichiometric subunit [Microcystis
           aeruginosa NIES-843]
          Length = 233

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 69/208 (33%), Positives = 105/208 (50%), Gaps = 34/208 (16%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY   ED+  IF A   A+  +PE YR DA+ +   A             KE  ++ LL 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIA-------------KETNIDSLLS 117

Query: 192 DIAERASGKGN--------------FSYSRFFAVGLFRLLELANA-------TEPTVLEK 230
            +   A G  N              F YSR FA+GL+ +L  A             +L+K
Sbjct: 118 QLQNPALGANNQLSDSLVSLINAPKFKYSRLFAIGLYTILAEAQPDIIKEKEKREQILQK 177

Query: 231 LCAVLNVNKRSVDRDLDVYRNLLSKLLQ 258
              +L ++   + +DLDVYR  L K+ Q
Sbjct: 178 FSEILRLSSEKLQKDLDVYRGNLDKMDQ 205


>gi|423062334|ref|ZP_17051124.1| Thf1-like protein [Arthrospira platensis C1]
 gi|406716242|gb|EKD11393.1| Thf1-like protein [Arthrospira platensis C1]
          Length = 215

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 64/180 (35%), Positives = 101/180 (56%), Gaps = 9/180 (5%)

Query: 95  VLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDP 154
           +++EL+V+ HL+     ++YDP++ALG VT +DR M+GY  E D+ +I+ A I A + DP
Sbjct: 1   MVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYIPEADKLSIWAALIMAQESDP 60

Query: 155 EQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVG 212
            QYR DA  LE  A   +   L E    ++E   +  L+      +    F YSR FA+G
Sbjct: 61  NQYRADATALEAQAATLSVKDLTERAKIAQESSGDDPLQSCFHAIANNPKFKYSRLFAIG 120

Query: 213 LFRLLELANATEP-------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 265
           L+ LLE ++ T         T+L      L + K  +++DLD+YR  L K+ QA+ +++E
Sbjct: 121 LYTLLEKSDVTAAQDSEGLKTILSDFSEALRLPKDKLEKDLDLYRTNLEKVAQARLMVEE 180


>gi|172035357|ref|YP_001801858.1| Thf1-like protein [Cyanothece sp. ATCC 51142]
 gi|354555452|ref|ZP_08974753.1| Protein thf1 [Cyanothece sp. ATCC 51472]
 gi|254784140|sp|B1WNF0.1|THF1_CYAA5 RecName: Full=Protein thf1
 gi|171696811|gb|ACB49792.1| photosystem II 22 kD protein [Cyanothece sp. ATCC 51142]
 gi|353552511|gb|EHC21906.1| Protein thf1 [Cyanothece sp. ATCC 51472]
          Length = 242

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 66/194 (34%), Positives = 107/194 (55%), Gaps = 8/194 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F   + RPI SIY   ++EL+V+ HL+     ++YDP++ALG VT ++R M+
Sbjct: 6   TVSDTKRKFYGYHTRPINSIYRRFVEELLVEMHLLSVNVDFKYDPIYALGVVTSFERFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E D+ +IF A   A+  + EQY  +A+ L   A+G + +   E   +EG  +G+L 
Sbjct: 66  GYSPESDKTSIFNALCQAVDGNSEQYHQEAEALINEAKGLSITEFKEKLGQEGG-DGILW 124

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNVNKRSVDR 244
                 +    F YSR F VGL+ LL     +L    +     ++++   L  +   + +
Sbjct: 125 GTCGAIAQNPKFKYSRLFGVGLYTLLMEIDPDLVKEEDKRNQTIKEVSDALQFSSDKLQK 184

Query: 245 DLDVYRNLLSKLLQ 258
           DLD+YR+ L K+ Q
Sbjct: 185 DLDLYRSNLDKMQQ 198


>gi|443478915|ref|ZP_21068602.1| Protein thf1 [Pseudanabaena biceps PCC 7429]
 gi|443015728|gb|ELS30564.1| Protein thf1 [Pseudanabaena biceps PCC 7429]
          Length = 240

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 70/209 (33%), Positives = 114/209 (54%), Gaps = 16/209 (7%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK +F   + +P+  +Y  V+ EL+V+ HL++  +T+ YD +FALGFVT +DR   
Sbjct: 6   TVSDTKKDFYLAFPKPVNQVYRRVVDELLVEIHLLKVNQTFVYDAIFALGFVTTFDRFTA 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWAR---GQTASSLVEFPSKEG--EV 186
           GY  E DR A+F A   AL+ D ++ R DA  L + A        + L    S      +
Sbjct: 66  GYKPETDRFAVFHALCAALQFDSDRIRQDAATLSDLATRSPNDIKTLLTNLDSGISLEPL 125

Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATE-------PTVLEKLCAVLNVNK 239
            G L+ I    S K NF YSR   VGL+ LLE+++  E         +++ +   L    
Sbjct: 126 SGQLQII----STKENFKYSRLLGVGLYALLEISDPEEIADSAKREELIKLVGETLKFGS 181

Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYVD 268
             + +D+D+YR+ L K+ QA++++ + V+
Sbjct: 182 DRLLKDVDLYRSNLDKIEQARQMIADMVE 210


>gi|86606816|ref|YP_475579.1| Thf1-like protein [Synechococcus sp. JA-3-3Ab]
 gi|97202812|sp|Q2JSQ3.1|THF1_SYNJA RecName: Full=Protein thf1
 gi|86555358|gb|ABD00316.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
          Length = 239

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 83/209 (39%), Positives = 112/209 (53%), Gaps = 17/209 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T++ TK  F   Y RPI + Y  V++EL+V+ HL      + YDP FALG VT+YD LME
Sbjct: 6   TLSATKAAFFSAYPRPINAAYRRVVEELLVELHLTTVNSAFVYDPFFALGLVTLYDSLME 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG----QTASSLVEFPSKEGEVE 187
            Y   E REAIF A   AL   PE  R +A+ L E  R     Q  + L   P  E E  
Sbjct: 66  AYHPPEQREAIFNALCKALHLKPEVLRKNARDLLELMRSGDPVQRYNLLCLKP--EAEDV 123

Query: 188 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT----EPT-----VLEKLCAVLNVN 238
           G LK I +R + +  ++YSR  AVGL+   E A AT    EP       LE +   L  +
Sbjct: 124 GGLKAILQRMT-QPPYAYSRVLAVGLYTAYE-AVATSLYKEPEERTRHFLEDVIGNLPFS 181

Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
              V +DL++YR+ L +L QA+ +++E V
Sbjct: 182 PERVKKDLELYRSNLDRLKQARAIVEEMV 210


>gi|427711975|ref|YP_007060599.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
           6312]
 gi|427376104|gb|AFY60056.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
           6312]
          Length = 245

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 73/211 (34%), Positives = 114/211 (54%), Gaps = 21/211 (9%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F   + RPI SI+   ++EL+V+ HL+R    + Y P+ ALG VT Y+  M 
Sbjct: 6   TVSDTKKAFYAAHTRPIHSIFRRFVEELLVEVHLLRVNTNFVYSPLLALGIVTAYNHFMS 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA------SSLVEFPSKEGE 185
           GY  E DR +IF ++  A + DP+Q + DA +   W            + L  + S+ G+
Sbjct: 66  GYRPETDRNSIFTSFAIAEEFDPQQLQADAAR---WEELAGLELEELQTRLQAWISEGGD 122

Query: 186 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA--NATEPTV-----LEKLCAVLNV 237
                L+D       K    YSR  A+GL+ LLE A  N T+        LE+L  V+N+
Sbjct: 123 PWHNSLRDAVNNPQTK----YSRLQAIGLYHLLEQAAGNLTQELTTLEASLEQLSPVVNL 178

Query: 238 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVD 268
               V +DL++YR+ L K++QA++++ E V+
Sbjct: 179 PVDKVKKDLELYRSNLDKMIQAQKIMAELVE 209


>gi|428203624|ref|YP_007082213.1| photosystem II biogenesis protein Psp29 [Pleurocapsa sp. PCC 7327]
 gi|427981056|gb|AFY78656.1| photosystem II biogenesis protein Psp29 [Pleurocapsa sp. PCC 7327]
          Length = 241

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 67/206 (32%), Positives = 117/206 (56%), Gaps = 9/206 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++ K +F   + RPI SIY   ++ELIV+ HL+     ++YD ++ALG VT ++R M+
Sbjct: 11  TVSDAKRDFYTHHTRPINSIYRRFVEELIVEMHLLSVNTDFRYDAIYALGVVTAFERFMQ 70

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE--FPSKEGEVEGL 189
           GY  E+D+ +IF A   A   + EQYR +A ++   A+  +   L+     S     E  
Sbjct: 71  GYQPEQDKSSIFAALCQATGGNWEQYRQEAGEILAQAKQMSVQELIAKINSSTPTGGENR 130

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT---EPT----VLEKLCAVLNVNKRSV 242
           L +  +  + + N+ YSR FA+GL+ LL  A+     +P      L+++   L+++   +
Sbjct: 131 LVETLQAIANRSNYKYSRLFAIGLYTLLAEADPDILRDPEKRDRTLKEVTEALHLSPEKL 190

Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVD 268
            +DLD+YR+ L K+ Q  ++L+E ++
Sbjct: 191 QKDLDLYRSNLDKMDQLLKVLEEALE 216


>gi|209522934|ref|ZP_03271491.1| Thf1-like protein [Arthrospira maxima CS-328]
 gi|209496521|gb|EDZ96819.1| Thf1-like protein [Arthrospira maxima CS-328]
          Length = 210

 Score =  110 bits (275), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 62/175 (35%), Positives = 96/175 (54%), Gaps = 9/175 (5%)

Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
           +V+ HL+     ++YDP++ALG VT +DR M+GY  E D+ +I+ A I A + DP QYR 
Sbjct: 1   MVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYIPEADKLSIWAALIMAQESDPNQYRA 60

Query: 160 DAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL 217
           DA  LE  A   +   L E    ++E   +  L+      +    F YSR FA+GL+ LL
Sbjct: 61  DATALEAQAATLSVKDLTERAKIAQESSGDDPLQSCFHAIANNPKFKYSRLFAIGLYTLL 120

Query: 218 ELANATEP-------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 265
           E ++ T         T+L      L + K  +++DLD+YR  L K+ QA+ +++E
Sbjct: 121 EKSDVTAAQDSEGLKTILSDFSEALRLPKDKLEKDLDLYRTNLEKVAQARLMVEE 175


>gi|448931221|gb|AGE54783.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           MA-1D]
          Length = 248

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 63/202 (31%), Positives = 108/202 (53%), Gaps = 11/202 (5%)

Query: 62  CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
            M   T  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG
Sbjct: 41  LMKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALG 100

Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
            VT  D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+
Sbjct: 101 IVTALDSILNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PN 154

Query: 182 KEGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           K G  ++  L DIA        + YS F A+G+F+LL++        ++ L   +     
Sbjct: 155 KHGNTLQKSLYDIASN----DKYVYSSFAAIGIFKLLQMNGNYTGNSVKHLSESIGFKGE 210

Query: 241 SVDRDLDVYRNLLSKLLQAKEL 262
            V +D+ ++ +LL  +  +++L
Sbjct: 211 IVHKDIAMFFSLLKYIESSQKL 232


>gi|157952488|ref|YP_001497380.1| hypothetical protein NY2A_b184R [Paramecium bursaria Chlorella
           virus NY2A]
 gi|155122715|gb|ABT14583.1| hypothetical protein NY2A_b184R [Paramecium bursaria Chlorella
           virus NY2A]
          Length = 247

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 63/202 (31%), Positives = 107/202 (52%), Gaps = 11/202 (5%)

Query: 62  CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
            M   T  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG
Sbjct: 41  LMKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALG 100

Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
            VT  D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+
Sbjct: 101 IVTALDSILNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PN 154

Query: 182 KEGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           K G  ++  L DIA        + YS F A+G+F+LL++        ++ L   +     
Sbjct: 155 KHGNTLQKSLYDIASN----DKYVYSSFAAIGIFKLLQMNKNYTGNSVKHLSESVGFKGE 210

Query: 241 SVDRDLDVYRNLLSKLLQAKEL 262
            V +D+  + +LL  +  +++L
Sbjct: 211 IVHKDIATFFSLLKYIESSQKL 232


>gi|428769945|ref|YP_007161735.1| Protein thf1 [Cyanobacterium aponinum PCC 10605]
 gi|428684224|gb|AFZ53691.1| Protein thf1 [Cyanobacterium aponinum PCC 10605]
          Length = 234

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 60/197 (30%), Positives = 109/197 (55%), Gaps = 11/197 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK +F + ++RPI SIY  V++EL+V+ HL+     +  DP++ LG    + + M+
Sbjct: 18  TVSDTKRSFYQHHQRPINSIYRRVVEELMVEMHLLAVNVDFNPDPIYYLGVYQSFQQFMQ 77

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF---PSKEGEVEG 188
           GY  E D+E+IF A   +++ +P++Y   +Q L  +  G++A  ++++   PS EG++E 
Sbjct: 78  GYKPESDKESIFNALCQSIENNPQEYISKSQTLLNFVEGKSAQEILDWLLNPSGEGDLEA 137

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRS 241
           +             F YSR FA+G + L+E  +       +     ++ L   L +    
Sbjct: 138 VASHWRSNLENP-RFKYSRLFAIGFYTLIEKGDGEFIKDESKFTDFIQPLIDKLQLPVEK 196

Query: 242 VDRDLDVYRNLLSKLLQ 258
           + +DLD+YR+ L K+ Q
Sbjct: 197 LKKDLDLYRSNLEKMNQ 213


>gi|448930219|gb|AGE53784.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           IL-3A]
 gi|448933659|gb|AGE57214.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           NE-JV-4]
          Length = 207

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 107/201 (53%), Gaps = 11/201 (5%)

Query: 63  MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
           M   T  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG 
Sbjct: 1   MKAITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHICRYNKNYTYSDVSALGI 60

Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
           VT  D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+K
Sbjct: 61  VTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIESLKPYAKSSNLG-----PNK 114

Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
            G  ++  L DIA        + YS F AVG+F+LL++        ++ L   +      
Sbjct: 115 HGNTLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGEL 170

Query: 242 VDRDLDVYRNLLSKLLQAKEL 262
           V +D+  + +LL  +  +++L
Sbjct: 171 VHKDIATFFSLLKYIESSQKL 191


>gi|9631702|ref|NP_048481.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]
 gi|1131477|gb|AAC96501.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]
 gi|448924789|gb|AGE48370.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           AN69C]
          Length = 207

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 107/201 (53%), Gaps = 11/201 (5%)

Query: 63  MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
           M   T  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG 
Sbjct: 1   MKAITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHIHRYNKNYTYSDVSALGI 60

Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
           VT  D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+K
Sbjct: 61  VTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIESLKPYAKSSHLG-----PNK 114

Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
            G  ++  L DIA        + YS F AVG+F+LL++        ++ L   +      
Sbjct: 115 HGNTLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGEL 170

Query: 242 VDRDLDVYRNLLSKLLQAKEL 262
           V +D+  + +LL  +  +++L
Sbjct: 171 VHKDIATFFSLLKYIESSQKL 191


>gi|157953365|ref|YP_001498256.1| hypothetical protein AR158_C174R [Paramecium bursaria Chlorella
           virus AR158]
 gi|156068013|gb|ABU43720.1| hypothetical protein AR158_C174R [Paramecium bursaria Chlorella
           virus AR158]
 gi|448930527|gb|AGE54091.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           IL-5-2s1]
 gi|448934707|gb|AGE58259.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           NY-2B]
 gi|448935079|gb|AGE58630.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           NYs1]
          Length = 248

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 63/202 (31%), Positives = 109/202 (53%), Gaps = 11/202 (5%)

Query: 62  CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
            M   T  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG
Sbjct: 41  LMKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALG 100

Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
            VT  D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+
Sbjct: 101 IVTALDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PN 154

Query: 182 KEGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
           K G  ++  L DIA        + YS F A+G+F+LL++        +++L   +     
Sbjct: 155 KHGNTLQKSLYDIAIN----DKYVYSSFAAIGIFKLLQMNKNYTGNSVKQLSESIGFKGE 210

Query: 241 SVDRDLDVYRNLLSKLLQAKEL 262
            V +D+ ++ +LL  +  +++L
Sbjct: 211 IVHKDIAMFFSLLKYIESSQKL 232


>gi|448927841|gb|AGE51413.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           CviKI]
          Length = 232

 Score =  107 bits (266), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 107/201 (53%), Gaps = 11/201 (5%)

Query: 63  MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
           M   T  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG 
Sbjct: 26  MKAITSSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALGI 85

Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
           VT  D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+K
Sbjct: 86  VTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNK 139

Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
            G  ++  L DI    S    + YS F AVG+F+LL++        ++ L   +      
Sbjct: 140 HGNTLQKSLYDI----SINDKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGEL 195

Query: 242 VDRDLDVYRNLLSKLLQAKEL 262
           V +D+  + +LL  +  +++L
Sbjct: 196 VHKDIATFFSLLKYIESSQKL 216


>gi|148242504|ref|YP_001227661.1| Thf1-like protein [Synechococcus sp. RCC307]
 gi|147850814|emb|CAK28308.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
          Length = 237

 Score =  107 bits (266), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 65/206 (31%), Positives = 112/206 (54%), Gaps = 9/206 (4%)

Query: 70  PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
           P TVA++K  F   Y   IP +Y  V+ EL+V+ HL+  +  +Q D +FA+G   V+D L
Sbjct: 6   PQTVADSKRRFYAAYPHVIPGLYRRVVDELLVELHLLAGQAGFQADSLFAMGLTQVFDNL 65

Query: 130 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEG 188
           M+G+   E ++ +F A  +      +Q R DA++L E       + +  +  ++G+    
Sbjct: 66  MQGFKPAERQKELFAAICSGAGLKADQLRKDAKQLREHLVPHGEAEIKSWIEQQGQGAPD 125

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLE---LANATEPTVLE----KLCAVLNVNKRS 241
           +LK + ++A G+ +F YSR  AVGL  LL+     +  +P  L+    +L   + + K  
Sbjct: 126 VLKHVLQQA-GRSDFHYSRLHAVGLMGLLQDLSGGDDQDPQALQERAHQLGHSMGLQKDK 184

Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYV 267
           + +D+ +Y + L K+ QA ELL+E V
Sbjct: 185 LQKDMGLYASNLEKMSQAVELLEETV 210


>gi|409992261|ref|ZP_11275462.1| inositol phosphatase [Arthrospira platensis str. Paraca]
 gi|409936888|gb|EKN78351.1| inositol phosphatase [Arthrospira platensis str. Paraca]
          Length = 210

 Score =  107 bits (266), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 61/175 (34%), Positives = 93/175 (53%), Gaps = 9/175 (5%)

Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
           +V+ HL+     ++YDP++ALG VT +DR M+GY  E D+ +I+ A I A + DP QYR 
Sbjct: 1   MVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYTPETDKLSIWAALIGAQESDPNQYRA 60

Query: 160 DAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL 217
           DA  LE  A       L +    ++E   +  L+      +    F YSR  A+GL+ LL
Sbjct: 61  DATALEAQAASLAVKDLTDKAKIAQESSGDDPLQSCFHAIANNPKFKYSRLLAIGLYTLL 120

Query: 218 ELANATEP-------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 265
           E ++AT         T+L      L + K  + +DLD+YR  L K+ QA+ ++ E
Sbjct: 121 EKSDATAAQDSEGLKTILSDFSEALRLPKDKLVKDLDLYRTNLEKVAQARLMVDE 175


>gi|119510704|ref|ZP_01629832.1| hypothetical protein N9414_22068 [Nodularia spumigena CCY9414]
 gi|119464658|gb|EAW45567.1| hypothetical protein N9414_22068 [Nodularia spumigena CCY9414]
          Length = 200

 Score =  106 bits (265), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 58/173 (33%), Positives = 104/173 (60%), Gaps = 17/173 (9%)

Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
           +V+ HL+     + Y+P++ALG VT +DR M+GY  E+D+E+IFQA   A++++P++YR 
Sbjct: 1   MVEMHLLSVNSGFSYNPIYALGVVTSFDRFMQGYLPEQDQESIFQALCQAVEQEPQRYRE 60

Query: 160 DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 213
           DA++L+  A+    + L+ + S      ++ +++  L+ IA  +     F YSR FAVGL
Sbjct: 61  DAKRLQALAKDLPVNDLIAWLSQTTHLDRDPDLQAQLQAIAHNS----EFKYSRLFAVGL 116

Query: 214 FRLLELANA-------TEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQA 259
           F LLE ++             L+ + A L+++   +++DL++Y + L K+ QA
Sbjct: 117 FTLLEQSDPELVKDEKQRTEALKTIAAGLHLSDEKLNKDLELYSSNLEKMAQA 169


>gi|448928860|gb|AGE52429.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           CvsA1]
          Length = 207

 Score =  106 bits (265), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 107/201 (53%), Gaps = 11/201 (5%)

Query: 63  MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
           M   T  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG 
Sbjct: 1   MKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALGI 60

Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
           VT  D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+K
Sbjct: 61  VTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNK 114

Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
            G  ++  L DI    S    + YS F AVG+F+LL++        ++ L   +      
Sbjct: 115 HGNTLQKSLYDI----SINDKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGEL 170

Query: 242 VDRDLDVYRNLLSKLLQAKEL 262
           V +D+  + +LL  +  +++L
Sbjct: 171 VHKDIATFFSLLKYIESSQKL 191


>gi|448931622|gb|AGE55183.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           MA-1E]
          Length = 207

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 107/201 (53%), Gaps = 11/201 (5%)

Query: 63  MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
           M   T  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG 
Sbjct: 1   MKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALGI 60

Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
           VT  D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+K
Sbjct: 61  VTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYFSNIETLKPYAKSSHLG-----PNK 114

Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
            G  ++  L DI    S    + YS F AVG+F+LL++        ++ L   +      
Sbjct: 115 HGNTLQKSLYDI----SINDKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGEL 170

Query: 242 VDRDLDVYRNLLSKLLQAKEL 262
           V +D+  + +LL  +  +++L
Sbjct: 171 VHKDIATFFSLLKYIESSQKL 191


>gi|428773451|ref|YP_007165239.1| photosystem II biogenesis protein Psp29 [Cyanobacterium stanieri
           PCC 7202]
 gi|428687730|gb|AFZ47590.1| photosystem II biogenesis protein Psp29 [Cyanobacterium stanieri
           PCC 7202]
          Length = 233

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 69/214 (32%), Positives = 111/214 (51%), Gaps = 24/214 (11%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++T+  F + + RPI SIY  V+QEL+V+ HL+     +Q D V+A+G    +++ M 
Sbjct: 6   TVSDTRRAFYQYHTRPINSIYRQVVQELMVEMHLLSVNTDFQPDAVYAVGVCQSFEQFMT 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
           GY  EED+ +IF A   A++ +P+ YR  ++ L  +  G++A  LV +        GL  
Sbjct: 66  GYKPEEDKTSIFNALCKAIEANPDDYRHQSESLLNFVEGKSAEDLVNWLLNPVADNGLDE 125

Query: 190 -----LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL---EKLCAVLNVNKRS 241
                LK I ER      F YSR F +G + L+   N   P V    EKL  ++      
Sbjct: 126 NIVNSLKSILERER----FKYSRLFGIGFYTLI---NKVAPDVAKDEEKLAKLIAPYSEK 178

Query: 242 VD-------RDLDVYRNLLSKLLQAKELLKEYVD 268
           +D       +D+D+YR+ L K+ Q   ++ E ++
Sbjct: 179 LDLPVDKLKKDVDLYRSNLDKINQMLVVIAETIE 212


>gi|448930916|gb|AGE54479.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           KS1B]
          Length = 207

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 106/201 (52%), Gaps = 11/201 (5%)

Query: 63  MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
           M   T  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG 
Sbjct: 1   MKAITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHIHRYNKNYTYSDVSALGI 60

Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
           VT  D ++  +P +E +  I  A+I +L EDPE Y  + + L+ +A+          P+K
Sbjct: 61  VTTLDSVLNTFPDDE-KVCIKNAFIISLNEDPEMYYSNIEYLKPYAKSSNLG-----PNK 114

Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
            G  ++  L DIA        + YS F AVG+F+LL++        ++ L   +      
Sbjct: 115 HGNTLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQMNGNYTGKSVKHLSESIGFKGEL 170

Query: 242 VDRDLDVYRNLLSKLLQAKEL 262
           V +D+  + +LL  +  +++L
Sbjct: 171 VHKDIATFFSLLKYIESSQKL 191


>gi|67921410|ref|ZP_00514928.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
 gi|67856522|gb|EAM51763.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
          Length = 245

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/200 (33%), Positives = 105/200 (52%), Gaps = 16/200 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK  F   + +PI SIY   ++EL+V+ HL+     + YDP++ALG VT + R M+
Sbjct: 6   TVSDTKRKFYGYHTQPINSIYRRFVEELLVEMHLLSVNIDFSYDPIYALGVVTSFQRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV----- 186
           GY  E D+ +IF A   A+    E+Y  +A+ +   A+G    S+V+F  K   V     
Sbjct: 66  GYSPESDKPSIFNALCQAVDGSSEKYHQEAEAILNEAKGL---SIVDFKDKLTHVTDNQV 122

Query: 187 -EGLLKDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNVN 238
            EG+L       +    F YSR  A+GL+ LL     +L    E     ++++   L  +
Sbjct: 123 GEGVLWGTFGAIAANPKFKYSRLLAIGLYTLLMEIDSDLLKDEEKRTETIKEVSEALKFS 182

Query: 239 KRSVDRDLDVYRNLLSKLLQ 258
              + +DLD+YR+ L K+ Q
Sbjct: 183 PEKLRKDLDLYRSNLDKMQQ 202


>gi|16330615|ref|NP_441343.1| Thf1-like-protein [Synechocystis sp. PCC 6803]
 gi|383322356|ref|YP_005383209.1| hypothetical protein SYNGTI_1447 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383325525|ref|YP_005386378.1| hypothetical protein SYNPCCP_1446 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383491409|ref|YP_005409085.1| hypothetical protein SYNPCCN_1446 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384436676|ref|YP_005651400.1| hypothetical protein SYNGTS_1447 [Synechocystis sp. PCC 6803]
 gi|451814773|ref|YP_007451225.1| hypothetical protein MYO_114600 [Synechocystis sp. PCC 6803]
 gi|81671042|sp|P73956.1|THF1_SYNY3 RecName: Full=Protein thf1
 gi|1653107|dbj|BAA18023.1| sll1414 [Synechocystis sp. PCC 6803]
 gi|339273708|dbj|BAK50195.1| hypothetical protein SYNGTS_1447 [Synechocystis sp. PCC 6803]
 gi|359271675|dbj|BAL29194.1| hypothetical protein SYNGTI_1447 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359274845|dbj|BAL32363.1| hypothetical protein SYNPCCN_1446 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359278015|dbj|BAL35532.1| hypothetical protein SYNPCCP_1446 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407958541|dbj|BAM51781.1| Thf1-like-protein [Bacillus subtilis BEST7613]
 gi|451780742|gb|AGF51711.1| hypothetical protein MYO_114600 [Synechocystis sp. PCC 6803]
          Length = 240

 Score =  103 bits (258), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 68/202 (33%), Positives = 109/202 (53%), Gaps = 8/202 (3%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++ K  F   Y RPI SIY   ++EL+V+ HL+     + YDP+FALG VT ++  M+
Sbjct: 6   TVSDAKRKFFTHYSRPISSIYRRFVEELLVEMHLLSVNIDFTYDPIFALGIVTSFNSFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV-EFPSKEGEVEGLL 190
           GY   E   AIF A    + ++P+Q R DA+ +   A      + V    S++   + LL
Sbjct: 66  GYQPAEQLPAIFNALCHGVDQNPDQVRQDAKNVAASAHHIGLDAWVTAAASEQASGDNLL 125

Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNVNKRSVD 243
            +       +  F YSR FA+GL+ LL     E+ +  E     L +L  +L+++   V 
Sbjct: 126 LNTLTGIHQRHKFKYSRLFAIGLYTLLADQDPEVKDNDEKRQDYLTRLSELLDLSLDKVV 185

Query: 244 RDLDVYRNLLSKLLQAKELLKE 265
           +DLD+YR+ L K+ Q  ++L++
Sbjct: 186 KDLDLYRSNLEKVDQLLKVLED 207


>gi|282901466|ref|ZP_06309391.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
 gi|281193745|gb|EFA68717.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
          Length = 201

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 61/173 (35%), Positives = 99/173 (57%), Gaps = 17/173 (9%)

Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
           +V+ HL+     + Y+ ++ALG VT +DR M+GY   ED  +IF A I A+++DP+ YR 
Sbjct: 1   MVEMHLLSVNVDFSYNSIYALGVVTTFDRFMQGYQPSEDLVSIFNAIICAVEQDPQVYRQ 60

Query: 160 DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 213
           DA KL+  A   +   L+ + S      ++  ++  L+ IA+      NF YSR  A+GL
Sbjct: 61  DAAKLKAIANSFSVKDLIAWCSQTTPLDQDANLQAELQAIAQNP----NFKYSRLLAIGL 116

Query: 214 FRLLELAN---ATEPTVLEKLCAV----LNVNKRSVDRDLDVYRNLLSKLLQA 259
           F LLEL++     + T   +  AV    L +++  +++DLD+YR+ L K+ QA
Sbjct: 117 FSLLELSDPEFVKDETQRNQTIAVIAQGLKLSEDKLNKDLDLYRSNLDKMEQA 169


>gi|282898285|ref|ZP_06306276.1| Protein thf1 [Raphidiopsis brookii D9]
 gi|281196816|gb|EFA71721.1| Protein thf1 [Raphidiopsis brookii D9]
          Length = 202

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 61/173 (35%), Positives = 99/173 (57%), Gaps = 17/173 (9%)

Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
           +V+ HL+     + Y+ ++ALG VT +DR M+GY   ED  +IF A I A+++DP+ YR 
Sbjct: 1   MVEMHLLSVNVDFSYNSIYALGVVTTFDRFMQGYQPSEDLVSIFNAIICAVEQDPQVYRQ 60

Query: 160 DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 213
           DA KL+  A   +   L+ + S      ++  ++  L+ IA+      NF YSR  A+GL
Sbjct: 61  DAAKLKAIANSFSVKDLIAWCSQTTPLDQDANLQAELQAIAQNP----NFKYSRLLAIGL 116

Query: 214 FRLLELAN---ATEPTVLEKLCAV----LNVNKRSVDRDLDVYRNLLSKLLQA 259
           F LLEL++     + T   +  AV    L +++  +++DLD+YR+ L K+ QA
Sbjct: 117 FSLLELSDPEFVKDETERNQAIAVIAQGLKLSEDKLNKDLDLYRSNLDKMEQA 169


>gi|124023249|ref|YP_001017556.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9303]
 gi|123963535|gb|ABM78291.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9303]
          Length = 250

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 64/206 (31%), Positives = 105/206 (50%), Gaps = 11/206 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T+A++K  F   +   IPS+Y     EL+V+ HL+ +++ +  D +FA+G   V+D    
Sbjct: 14  TIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVFDVFTR 73

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E   + +F A   +   DP   R  AQK  E  RG     +  +  ++G  +G  +
Sbjct: 74  GYRPEAHVKTLFDALCRSCGFDPNALRKQAQKTLESVRGHDLEEVQGWIQQQG--KGAPE 131

Query: 192 DIAE--RASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 242
            +A+  R +G   F YSR  AVGL  LL  A   E +  EKL  +       +   K  V
Sbjct: 132 ALAQALRNTGSNTFHYSRLMAVGLLSLLASAQGDESSDPEKLSQIAHELSESVGFTKARV 191

Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVD 268
           ++DL++Y++ L K+ QA EL ++ ++
Sbjct: 192 EKDLNLYKSNLEKMAQAVELSEQILE 217


>gi|443323210|ref|ZP_21052219.1| photosystem II biogenesis protein Psp29 [Gloeocapsa sp. PCC 73106]
 gi|442787120|gb|ELR96844.1| photosystem II biogenesis protein Psp29 [Gloeocapsa sp. PCC 73106]
          Length = 231

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 65/198 (32%), Positives = 102/198 (51%), Gaps = 12/198 (6%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV++TK +F   + RPI SIY  V++ELIV+ HL+   + ++ DP++ LG VT +DR M+
Sbjct: 6   TVSDTKRDFYAHHTRPINSIYRRVVEELIVELHLLSVNQNFRVDPIYCLGVVTSFDRFMQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWA-RGQTASSLVEFPSKEGEVEGLL 190
           GY  EED+ +I  +   A+    EQYR  A ++   A R      L+ +      VEG  
Sbjct: 66  GYRPEEDKASILASLCQAVGGKLEQYRDHANQVLNLAKRLHGVDDLLAWFKHPQPVEGEF 125

Query: 191 KDIAERASG---KGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKR 240
             +AE  S      +F YSR F +GL+ +L   N            +  +   VL V+  
Sbjct: 126 A-LAEAVSAIALNQSFKYSRMFGIGLYTMLGEKNLELLQDKPARDKITAQFAEVLPVSSD 184

Query: 241 SVDRDLDVYRNLLSKLLQ 258
            + +D ++Y+  L K+ Q
Sbjct: 185 KLQKDFELYQANLEKMKQ 202


>gi|33862947|ref|NP_894507.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9313]
 gi|81577657|sp|Q7V7R3.1|THF1_PROMM RecName: Full=Protein thf1
 gi|33634864|emb|CAE20850.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9313]
          Length = 243

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 64/206 (31%), Positives = 104/206 (50%), Gaps = 10/206 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T+A++K  F   +   IPS+Y     EL+V+ HL+ +++ +  D +FA+G   V+D    
Sbjct: 6   TIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVFDVFTS 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE--VEGL 189
           GY  E   + +F A   +   DP   R  AQ+  E  RG     +  +  ++G+   E L
Sbjct: 66  GYRPEAHVKTLFDALCRSCGFDPNALRKQAQQTLESVRGHDLEEVQGWIQQQGKGAPEAL 125

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 242
            K +   A G   F YSR  AVGL  LL  A   E +  EKL  +       +  +K  V
Sbjct: 126 AKALRNTA-GSTTFHYSRLMAVGLLSLLASAQGDESSDPEKLSQIAHELSESVGFSKARV 184

Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVD 268
           ++DL++Y++ L K+ QA EL ++ ++
Sbjct: 185 EKDLNLYKSNLEKMAQAVELTEQILE 210


>gi|254413033|ref|ZP_05026805.1| photosystem II biogenesis protein Psp29 [Coleofasciculus
           chthonoplastes PCC 7420]
 gi|196180197|gb|EDX75189.1| photosystem II biogenesis protein Psp29 [Coleofasciculus
           chthonoplastes PCC 7420]
          Length = 208

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 58/174 (33%), Positives = 99/174 (56%), Gaps = 17/174 (9%)

Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
           +V+ HL+     ++YDP++ LG V  ++R M+GY  E D+E+IF A   A+  +P+QY+ 
Sbjct: 1   MVEMHLLAVNVDFKYDPIYVLGVVASFNRFMQGYRPERDKESIFNALCQAVGGNPQQYQD 60

Query: 160 DAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAERASGKGN---FSYSRFFAVGLFRL 216
           DA+KL+      +A  LV++      +EG  +DI    +   +   F YSR FA+GL+ L
Sbjct: 61  DAEKLKAAVGRLSAQELVDWFGSPTPLEG-AEDIHTTVAAIADNPKFKYSRLFAIGLYTL 119

Query: 217 LELANATEP----------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAK 260
           LE A   EP           +L+++   L++ +  + +DL++YR+ L K+ QA+
Sbjct: 120 LEQA---EPELVQDAKQSMEMLQRIGQTLHLPQEKLQKDLELYRSNLEKMAQAQ 170


>gi|352093979|ref|ZP_08955150.1| Protein thf1 [Synechococcus sp. WH 8016]
 gi|351680319|gb|EHA63451.1| Protein thf1 [Synechococcus sp. WH 8016]
          Length = 247

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 104/203 (51%), Gaps = 11/203 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T+A++K  F   +   IPS+Y     EL+V+ HL+ +++ ++ D +FA+G   V+    +
Sbjct: 6   TIADSKRAFHTAFPYVIPSLYRRTADELLVELHLLSHQQHFKSDALFAVGLRQVFQAFTQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E   + ++ A  ++   DPE  +  A+       G T S + E+ S  G   G  +
Sbjct: 66  GYKPEAHLDELYAAICSSNGFDPEALKQLAEGSTSAVSGHTISEVREWLSNRG--AGAPE 123

Query: 192 DIAERAS--GKGNFSYSRFFAVGLFRLLELANATEP-------TVLEKLCAVLNVNKRSV 242
            +A   S  G  +F YSR  AVGL  LL  A   EP       T+  ++   L ++K  +
Sbjct: 124 PLASGISSVGGDSFHYSRLMAVGLLSLLSSAQGGEPSNPDELKTLAHEIGEQLGLSKPRL 183

Query: 243 DRDLDVYRNLLSKLLQAKELLKE 265
           D+DL +Y + L K+ QA EL++E
Sbjct: 184 DKDLTLYTSNLEKMAQAVELIEE 206


>gi|116074797|ref|ZP_01472058.1| hypothetical protein RS9916_29724 [Synechococcus sp. RS9916]
 gi|116068019|gb|EAU73772.1| hypothetical protein RS9916_29724 [Synechococcus sp. RS9916]
          Length = 234

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 61/202 (30%), Positives = 103/202 (50%), Gaps = 9/202 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T+A++K  F   +   IPS+Y     EL+V+ HL+ +++ ++ D +FA+G   V+D    
Sbjct: 6   TIADSKRAFHSAFPHVIPSLYRRTADELLVELHLLSHQKQFKVDALFAVGLRQVFDAFTR 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEGLL 190
           GY  E   +++F A  +    DP   +  A   E   +G +   + ++   +GE     +
Sbjct: 66  GYRPEAHLDSLFAAICSCNGFDPAALKQLALDSEHAVQGHSFEDVQQWLRNKGEGAPAAI 125

Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELA---NATEPTVLEKLCA----VLNVNKRSVD 243
             + +RA    NF YSR  AVGL  LL  A   + ++P+ L KL       L + K  V+
Sbjct: 126 TKVLKRAD-HANFHYSRLMAVGLLTLLAKAQGDDGSDPSELAKLAHELSEPLGLTKERVE 184

Query: 244 RDLDVYRNLLSKLLQAKELLKE 265
           +DL +Y   L ++ QA EL++E
Sbjct: 185 KDLGIYTGNLERMAQAVELMEE 206


>gi|318041533|ref|ZP_07973489.1| Thf1-like protein [Synechococcus sp. CB0101]
          Length = 224

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 65/208 (31%), Positives = 107/208 (51%), Gaps = 11/208 (5%)

Query: 69  VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 128
           V  TVA++K  F   +   I  IY  ++ EL+V+ HL+ +++ ++ D +FA+G   V+D 
Sbjct: 3   VSLTVADSKRAFHSAFSYVIAPIYRRLVDELLVELHLLSHQKGFRADGLFAVGLTQVFDS 62

Query: 129 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
              GY  E  RE +FQA  +A   D    R  A++  +     +   +  + S +G  +G
Sbjct: 63  FSTGYRPEAQREPLFQALCSANGFDGAALRAQAEQARQQVGHHSLEEVKGWLSNQG--QG 120

Query: 189 LLKDIAERASG--KGNFSYSRFFAVGLFRLLEL---ANATEPTVL----EKLCAVLNVNK 239
             + IA    G  + +F YSR  AVGL  LL+    A+A +P  L     ++   + + K
Sbjct: 121 APELIASLLQGVQRDDFHYSRLVAVGLLSLLQSAQGADALDPQALRSAAHEIGESMGLIK 180

Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYV 267
             VD+DL +Y   + K+ QA EL++E V
Sbjct: 181 DRVDKDLSLYAGNIEKMSQAVELMEETV 208


>gi|317970011|ref|ZP_07971401.1| Thf1-like protein [Synechococcus sp. CB0205]
          Length = 228

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 100/208 (48%), Gaps = 17/208 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TVA++K  F + +   I  +Y  ++ EL+V+ HL+ +++ +  D +FA+G   V+D    
Sbjct: 8   TVADSKRAFHQAFPYVIAPLYRRLVDELLVELHLLSHQKGFHADGLFAVGLTQVFDSFSN 67

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-----V 186
           GY  E  RE +FQA  +A   D   +R  A          +   +  + S  GE     +
Sbjct: 68  GYKPEAQREPLFQALCSANGFDGGAFRQMASDAATQVGHHSLDEVKGWLSNRGEGAPAPI 127

Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATE---PTVL----EKLCAVLNVNK 239
            GLL  +        +F YSR  AVGL  LL+ A   E   P  L     ++   + + K
Sbjct: 128 AGLLHGVQRE-----DFHYSRLVAVGLLSLLQRAQGAEAMDPQALRSAAHEIGEAMGLIK 182

Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYV 267
             VD+DL +Y   + K+ QA EL++E V
Sbjct: 183 ARVDKDLSLYAGNIEKMTQAVELMEETV 210


>gi|284929212|ref|YP_003421734.1| photosystem II biogenesis protein Psp29 [cyanobacterium UCYN-A]
 gi|284809656|gb|ADB95353.1| photosystem II biogenesis protein Psp29 [cyanobacterium UCYN-A]
          Length = 237

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 59/197 (29%), Positives = 103/197 (52%), Gaps = 10/197 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+ETK  F   + +PI SIY   ++EL+V+ HL+     YQY P++ALG VT++++ M 
Sbjct: 6   TVSETKREFYNFFTKPISSIYRRFIEELLVEMHLLSVNADYQYSPIYALGVVTLFEKFMY 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV---EG 188
            Y  ++ ++ IF A   +   D +QYR ++  +   A   + S+  E  +K  +    + 
Sbjct: 66  RYQPDDHQDLIFDALCKSTGGDTKQYRQESNTILNEAETLSISNFKEDFTKSAQEKVNDK 125

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLE-----LANATEP--TVLEKLCAVLNVNKRS 241
           LL       +    F YSR  A+GL+ LLE     L  + E     +E++   L ++   
Sbjct: 126 LLWKSYYSIAQNPKFKYSRLLAIGLYSLLEKISSDLVESKEEYNKAIEQIANDLGLSSER 185

Query: 242 VDRDLDVYRNLLSKLLQ 258
           + +D+++Y + L K+ Q
Sbjct: 186 IQKDIELYCSNLEKMQQ 202


>gi|113955551|ref|YP_730625.1| Thf1-like protein [Synechococcus sp. CC9311]
 gi|113882902|gb|ABI47860.1| Uncharacterized protein [Synechococcus sp. CC9311]
          Length = 252

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 105/203 (51%), Gaps = 11/203 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T+A++K  F K +   IPS+Y     EL+V+ HL+ +++ ++ D +FA+G   V+    +
Sbjct: 11  TIADSKRAFHKSFPYVIPSLYRRTADELLVELHLLSHQQHFKSDALFAVGLRQVFMAFTQ 70

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E   + ++ A  T    +PE  +  A+       G T + + E+ S  G   G  +
Sbjct: 71  GYKPETHLDELYAAICTCNGFEPEALKQLAEGSTSAVSGHTINEVREWLSNRG--AGAPE 128

Query: 192 DIAERAS--GKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 242
            +A   S  G  +F YSR  AVGL  LL  A   EP+  ++L  +       L ++K  +
Sbjct: 129 PLASGISSVGGESFHYSRLMAVGLLSLLSSAQGGEPSNPDELKKLAHEIGEQLGLSKPRL 188

Query: 243 DRDLDVYRNLLSKLLQAKELLKE 265
           D+DL +Y + L K+ QA EL++E
Sbjct: 189 DKDLSLYTSNLEKMAQAVELIEE 211


>gi|87302741|ref|ZP_01085552.1| hypothetical protein WH5701_13350 [Synechococcus sp. WH 5701]
 gi|87282624|gb|EAQ74582.1| hypothetical protein WH5701_13350 [Synechococcus sp. WH 5701]
          Length = 257

 Score = 90.1 bits (222), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 104/215 (48%), Gaps = 35/215 (16%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TVA++K  F   +   I  +Y  ++ EL+V+ HL+  +  +  D +FA+G   V+D   +
Sbjct: 8   TVADSKRAFHAAFPYVIGPLYRRMVDELLVELHLLSRQSGFHSDGLFAVGLTQVFDGFAK 67

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYR---------IDAQKLEE---W--ARGQTASSLV 177
           GY  ++  E +F A   +   D +Q R         +    L+E   W   RGQ A    
Sbjct: 68  GYRPQQQSEPLFAALCASSGFDAQQIRAQHAAAVKAVGEHSLDEVKQWLAQRGQGAPE-- 125

Query: 178 EFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLEL---ANATEPTVL----EK 230
                   + G+L  I +RA    +F YSR FAVGL  LL+    A A EP  L     +
Sbjct: 126 -------PIAGVLAGI-DRA----DFHYSRLFAVGLLSLLQHARGAEAVEPQALRQAAHE 173

Query: 231 LCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 265
           +   + + K  VD+DL +Y + L K+ QA EL++E
Sbjct: 174 IGESMGLMKERVDKDLTLYASTLEKMAQAVELMEE 208


>gi|449015870|dbj|BAM79272.1| photosystem II biogenesis protein Psb29 [Cyanidioschyzon merolae
           strain 10D]
          Length = 327

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 54/200 (27%), Positives = 99/200 (49%), Gaps = 6/200 (3%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+ET   F +  KRP+   Y   + E++   HL      ++YD +FALGFV+VY     
Sbjct: 88  TVSETVTRFYRNLKRPVVFYYQQAVDEILTTAHLALVCAMFRYDVIFALGFVSVYRDFFR 147

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE-----GEV 186
            YP  ++RE++F+    AL  D  Q   +A     + +G+T + L+E   ++      E 
Sbjct: 148 SYPRPDERESLFRCICDALDLDVGQVTKEADDALAYVQGKTEAELIEEIERDTGEDSAEA 207

Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-TEPTVLEKLCAVLNVNKRSVDRD 245
           + ++  +       G + Y+R F +GL +++           ++K   +L ++   +D+D
Sbjct: 208 QPVIAALRACRRADGEYYYTRLFGIGLMKIMSSCGVEINLESVKKWANMLKISYARLDQD 267

Query: 246 LDVYRNLLSKLLQAKELLKE 265
           +  Y+  + KL QA+ + KE
Sbjct: 268 IGTYQMSMEKLTQAEVMFKE 287


>gi|88808604|ref|ZP_01124114.1| hypothetical protein WH7805_02902 [Synechococcus sp. WH 7805]
 gi|88787592|gb|EAR18749.1| hypothetical protein WH7805_02902 [Synechococcus sp. WH 7805]
          Length = 234

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 59/203 (29%), Positives = 100/203 (49%), Gaps = 11/203 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T+A++K  F   +   IPS+Y     EL+V+ HL+ ++  ++ + +FA+G   V+    +
Sbjct: 13  TIADSKRAFHAAFPYVIPSLYRRTADELLVELHLLSHQTQFKSNALFAVGLRQVFTAFTK 72

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 189
           GY   +    +F A  +    + +Q    A+  E+   G +   +  +     EG  E L
Sbjct: 73  GYRPADHLTELFDALCSCNGFNAQQLNSVAEGSEKAVAGHSMEEVQAWLQSKGEGAPEPL 132

Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 242
              +A+ A  +  F YSR  AVGLF LL  A   E    E LC         + +++  +
Sbjct: 133 ATGLADIAGEQ--FHYSRLMAVGLFSLLSSAQGVESQDPEDLCKTAHSIGEQIGLSRPRL 190

Query: 243 DRDLDVYRNLLSKLLQAKELLKE 265
           ++DL +YRN L K+ QA EL++E
Sbjct: 191 EKDLSLYRNNLEKMAQAVELMEE 213


>gi|375332109|gb|AFA52594.1| hypothetical protein [Vaucheria litorea]
          Length = 249

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/221 (26%), Positives = 109/221 (49%), Gaps = 21/221 (9%)

Query: 63  MSTATDVP----------------PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLM 106
           MST  DVP                 TV+ET  +F   Y++PI   Y T++ +++   HL 
Sbjct: 1   MSTVEDVPDTAMYKRFGHLKGKDVKTVSETIKSFCIQYQKPILPQYRTMINDVLQSTHLN 60

Query: 107 RYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEE 166
                + YD +F  GF +++ +LM+ YP   + + I+ A +T+L  +PE+ + D + + +
Sbjct: 61  VVNGCFIYDAMFGYGFYSLFYKLMKAYPGTGEADLIYAAMVTSLDMEPEKLKEDHETISK 120

Query: 167 WARGQTASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA--TE 224
                T + L    S +GE + LL +I+        + Y++ + +GL   ++      TE
Sbjct: 121 LIENMTRADLEN--SFKGENQNLLSEISSNIKADEFYLYTKTWGIGLIEAMDKVGIPLTE 178

Query: 225 PTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 265
             + E L  ++  +     +DL  Y+++L K+ QA++L KE
Sbjct: 179 ENI-ESLANMIGFSPIKARQDLVQYKDVLDKVAQAEQLFKE 218


>gi|299469582|emb|CBN76436.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 226

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 41/109 (37%), Positives = 61/109 (55%), Gaps = 2/109 (1%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+ET  +F   YK+ + + + T++ E +   HL  Y   ++YDP+F +GF T + R M 
Sbjct: 107 TVSETVADFYIYYKKVVLTQFRTIVTEYLQSTHLTVYDARFKYDPLFGVGFYTSFMRFMR 166

Query: 132 GYPSEEDREAIFQAYITALKE--DPEQYRIDAQKLEEWARGQTASSLVE 178
            YP     E IF A + A+    DP+Q R D   L+EWA G+T   +VE
Sbjct: 167 AYPVPGQAELIFDAVVKAIGNGLDPDQMRKDTTALKEWAEGKTEEDVVE 215


>gi|87124410|ref|ZP_01080259.1| hypothetical protein RS9917_12390 [Synechococcus sp. RS9917]
 gi|86167982|gb|EAQ69240.1| hypothetical protein RS9917_12390 [Synechococcus sp. RS9917]
          Length = 224

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 58/201 (28%), Positives = 96/201 (47%), Gaps = 8/201 (3%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T+A++K  F   +   IP +Y     EL+V+ HL+ +++ +Q D +FA+G   V+     
Sbjct: 6   TIADSKRAFHTAFPFVIPPLYRRTADELLVELHLLSHQQQFQVDALFAVGLRQVFRAFTR 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEGLL 190
           GY   +   ++F+A  ++      +    A + E   RG +   +  +    G+     L
Sbjct: 66  GYKPGQHLASLFEALCSSTGFHAGELESLADQSEAAVRGHSIEEVRHWLEHGGDGAPAPL 125

Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANA--TEPTVLEKLC----AVLNVNKRSVDR 244
             + +RA   G F YSR  AVGL  LL  A     +P  L KL       L   +  V++
Sbjct: 126 ASVLQRADSSG-FHYSRLMAVGLLSLLSEAQGDQADPEQLRKLAHELSGPLGFAQTRVEK 184

Query: 245 DLDVYRNLLSKLLQAKELLKE 265
           DL +Y + L K+ QA EL++E
Sbjct: 185 DLGLYASNLDKMAQAVELMEE 205


>gi|116070497|ref|ZP_01467766.1| hypothetical protein BL107_12665 [Synechococcus sp. BL107]
 gi|116065902|gb|EAU71659.1| hypothetical protein BL107_12665 [Synechococcus sp. BL107]
          Length = 215

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 52/204 (25%), Positives = 108/204 (52%), Gaps = 22/204 (10%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T+A++K  F + +   I  +Y  +  EL+V+ HL+ ++ +++  P+F++G  TV++   +
Sbjct: 6   TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSSFKTTPLFSVGLCTVFETFSQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E+    +F A  ++   +   +R ++++  + A+ ++             ++G+  
Sbjct: 66  GYRPEDHITGLFDALCSSNGYNATTFRKESKQCIDAAKSES-------------IDGMES 112

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELA--NATEP--TVLEKLC----AVLNVNKRSVD 243
            +A++  G+G+  YSR  A+G+FRL E A  +A +P  T L K C      LN     V+
Sbjct: 113 HLAKQKLGEGSH-YSRLMAIGVFRLFEEAKGDAEQPDETELRKRCKEVSTTLNFPAERVE 171

Query: 244 RDLDVYRNLLSKLLQAKELLKEYV 267
           +DL ++     ++  A EL++E +
Sbjct: 172 KDLSLFAANSERMSAAVELVQETI 195


>gi|161347491|ref|YP_001224936.2| Thf1-like protein [Synechococcus sp. WH 7803]
          Length = 226

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 57/204 (27%), Positives = 104/204 (50%), Gaps = 14/204 (6%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T+A++K  F   +   IPS+Y     EL+V+ HL+ ++  ++ + +FA+G   V+    +
Sbjct: 6   TIADSKRAFHAAFPYVIPSLYRRTADELLVELHLLSHQTQFKTNALFAVGLRQVFTAFTK 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEGLL 190
           GY   +    +F A  +    + E+ +  A+  E+   G +   +  +   +G+   G L
Sbjct: 66  GYRPADHLPQLFDALCSCNGFNAEELKSLAEGSEQAVSGHSVDEVQTWLQAKGDGAPGPL 125

Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELA--NATEPTVLEKLCAV-------LNVNKRS 241
                  +G+  F YSR  AVGLF LL  A  ++ +P   E+LC         + +++  
Sbjct: 126 ATGLADIAGE-QFHYSRLMAVGLFSLLSSAQGDSQDP---EELCKTAHTIGEQIGLSRPR 181

Query: 242 VDRDLDVYRNLLSKLLQAKELLKE 265
           +++DL +YRN L K+ QA EL++E
Sbjct: 182 LEKDLSLYRNNLEKMAQAVELMEE 205


>gi|416383906|ref|ZP_11684537.1| hypothetical protein CWATWH0003_1368 [Crocosphaera watsonii WH
           0003]
 gi|357265142|gb|EHJ13943.1| hypothetical protein CWATWH0003_1368 [Crocosphaera watsonii WH
           0003]
          Length = 209

 Score = 80.5 bits (197), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 53/168 (31%), Positives = 84/168 (50%), Gaps = 16/168 (9%)

Query: 104 HLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQK 163
           HL+     + YDP++ALG VT + R M+GY  E D+ +IF A   A+    E+Y  +A+ 
Sbjct: 2   HLLSVNIDFSYDPIYALGVVTSFQRFMQGYSPESDKPSIFNALCQAVDGSSEKYHQEAEA 61

Query: 164 LEEWARGQTASSLVEFPSKEGEV------EGLLKDIAERASGKGNFSYSRFFAVGLFRLL 217
           +   A+G    S+V+F  K   V      EG+L       +    F YSR  A+GL+ LL
Sbjct: 62  ILNEAKGL---SIVDFKDKLTHVTDNQVGEGVLWGTFGAIAANPKFKYSRLLAIGLYTLL 118

Query: 218 -----ELANATEP--TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQ 258
                +L    E     ++++   L  +   + +DLD+YR+ L K+ Q
Sbjct: 119 MEIDSDLLKDEEKRTETIKEVSEALKFSPEKLRKDLDLYRSNLDKMQQ 166


>gi|397644025|gb|EJK76212.1| hypothetical protein THAOC_02035 [Thalassiosira oceanica]
          Length = 293

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 41/141 (29%), Positives = 72/141 (51%), Gaps = 2/141 (1%)

Query: 79  NFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEED 138
            F      PI ++Y   + +L+   HL      +Q DPVF+LG VTV D L++ +P ++ 
Sbjct: 58  TFTDALGTPINALYKGTITDLVGSLHLTVVTARFQRDPVFSLGLVTVLDLLLKNFPEQDT 117

Query: 139 REAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAERAS 198
            + I  A I +      +   +A ++  WA+G+T   +    +  GE +  L  +A  A 
Sbjct: 118 AKRIKSAMIESAGMVESEVDAEAAEVATWAQGKTREDIA--SALRGEGDSTLAQVANGAK 175

Query: 199 GKGNFSYSRFFAVGLFRLLEL 219
           G   + YSRFF +GL +++++
Sbjct: 176 GDEYWMYSRFFGIGLVKMMDI 196


>gi|78184631|ref|YP_377066.1| Thf1-like protein [Synechococcus sp. CC9902]
 gi|97202850|sp|Q3AY05.1|THF1_SYNS9 RecName: Full=Protein thf1
 gi|78168925|gb|ABB26022.1| conserved hypothetical protein [Synechococcus sp. CC9902]
          Length = 215

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 53/204 (25%), Positives = 103/204 (50%), Gaps = 22/204 (10%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T+A++K  F + +   I  +Y  +  EL+V+ HL+ ++ +++  P+FA+G  TV+D    
Sbjct: 6   TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSSFKTTPLFAVGLCTVFDTFSA 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  EE    +  A  ++   D   +R ++++  + A+ ++             V+ +  
Sbjct: 66  GYRPEEHITGLLDALCSSNGYDANTFRKESKRCIDAAKTES-------------VDAMDS 112

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELA--NATEP--TVLEKLC----AVLNVNKRSVD 243
            +A +  G+G+  YSR  A+G+ RL E A  +A +P    L K C      LN     V+
Sbjct: 113 HLAGQKLGEGSH-YSRLMAIGVLRLFEEAKGDADQPDEADLRKRCKELSTALNFPAERVE 171

Query: 244 RDLDVYRNLLSKLLQAKELLKEYV 267
           +DL ++ +   ++  A EL++E +
Sbjct: 172 KDLSLFASNSERMSAAIELVQETI 195


>gi|427701945|ref|YP_007045167.1| photosystem II biogenesis protein Psp29 [Cyanobium gracile PCC
           6307]
 gi|427345113|gb|AFY27826.1| photosystem II biogenesis protein Psp29 [Cyanobium gracile PCC
           6307]
          Length = 231

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 60/205 (29%), Positives = 102/205 (49%), Gaps = 11/205 (5%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TVA++K  F   +   I  +Y  ++ EL+V+ HL+  ++ +Q D +FA+G + V+D    
Sbjct: 6   TVADSKRAFHGAFPHVISPLYRRMVDELLVELHLLSRQKGFQIDALFAVGLIQVFDGFAR 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E  +  +FQA   +   D    R   Q+        + + + ++   +G   G   
Sbjct: 66  GYRPEAQKGPLFQALCASSGFDGPDLRRQCQEALAAMGRHSQAEVRQWIESQG--AGAPA 123

Query: 192 DIAERASG--KGNFSYSRFFAVGLFRLLELA---NATEPTVLEKLCA----VLNVNKRSV 242
            +A   +G  + +F YSR  AVGL  LLE A   +A EP  L +L       + + +  +
Sbjct: 124 PVATALAGIRRPDFHYSRLMAVGLLALLEQALADDAMEPQALRQLAHEIGESMGLLRDRL 183

Query: 243 DRDLDVYRNLLSKLLQAKELLKEYV 267
           D+DL +Y + L K+  A EL++E V
Sbjct: 184 DKDLALYASNLEKMSMAVELMEETV 208


>gi|260436777|ref|ZP_05790747.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. WH 8109]
 gi|260414651|gb|EEX07947.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. WH 8109]
          Length = 215

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 55/204 (26%), Positives = 99/204 (48%), Gaps = 22/204 (10%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T+A++K  F + +   I  +Y  +  EL+V+ HL+ ++  ++ + +F++G  TV+D   +
Sbjct: 6   TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSRFEANGLFSVGLCTVFDTFTK 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E   +A+F A  ++   D  + R     L + A+G+   +L  + S     EG   
Sbjct: 66  GYRPEAQTDALFSALCSSNGFDAAKLRKTNASLVDQAKGKDHETLKSWLSSHSLKEG--- 122

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELANA------TEPTVLE--KLCAVLNVNKRSVD 243
                        YSR  AVGL  LL+ A A      TE  V +  +L   L +    V+
Sbjct: 123 -----------SHYSRLMAVGLMSLLKAATADATGSDTETIVKQSKELAEGLGLPTDRVE 171

Query: 244 RDLDVYRNLLSKLLQAKELLKEYV 267
           +DL ++ +   ++ QA EL++E +
Sbjct: 172 KDLTLFGSNSERMDQAVELVEETI 195


>gi|147848088|emb|CAK23639.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
          Length = 206

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 53/188 (28%), Positives = 96/188 (51%), Gaps = 14/188 (7%)

Query: 88  IPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYI 147
           IPS+Y     EL+V+ HL+ ++  ++ + +FA+G   V+    +GY   +    +F A  
Sbjct: 2   IPSLYRRTADELLVELHLLSHQTQFKTNALFAVGLRQVFTAFTKGYRPADHLPQLFDALC 61

Query: 148 TALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEGLLKDIAERASGKGNFSYS 206
           +    + E+ +  A+  E+   G +   +  +   +G+   G L       +G+  F YS
Sbjct: 62  SCNGFNAEELKSLAEGSEQAVSGHSVDEVQTWLQAKGDGAPGPLATGLADIAGE-QFHYS 120

Query: 207 RFFAVGLFRLLELA--NATEPTVLEKLCAV-------LNVNKRSVDRDLDVYRNLLSKLL 257
           R  AVGLF LL  A  ++ +P   E+LC         + +++  +++DL +YRN L K+ 
Sbjct: 121 RLMAVGLFSLLSSAQGDSQDP---EELCKTAHTIGEQIGLSRPRLEKDLSLYRNNLEKMA 177

Query: 258 QAKELLKE 265
           QA EL++E
Sbjct: 178 QAVELMEE 185


>gi|428183151|gb|EKX52010.1| hypothetical protein GUITHDRAFT_150871, partial [Guillardia theta
           CCMP2712]
          Length = 309

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 50/212 (23%), Positives = 96/212 (45%), Gaps = 17/212 (8%)

Query: 66  ATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 125
             D+ P  A  +  F KL+ RPIP ++     E++   HL      ++YD ++A G  + 
Sbjct: 72  GADIEPCGAAVE-RFYKLFARPIPFVFRAPTNEILYLSHLDLVNAMFRYDVIWAAGLYST 130

Query: 126 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 185
           +D        E+ R  +FQA +  LK D  + + DA  + +WA+G+T + +V   + +GE
Sbjct: 131 FDLFFSAL-DEDLRANLFQALMGGLKLDQSKIKSDADAVLQWAQGKTEADVVS--AIKGE 187

Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTV------------LEKLCA 233
               +  +        +F Y+R F  GL +++++    EP                   A
Sbjct: 188 DSSPVGQVLASLGKNEDFLYTRNFGAGLIKIMQVVG-VEPNAENAKRWAEVLGFTSNTSA 246

Query: 234 VLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 265
           +  ++    + D+ ++ + + K+ QA +L  E
Sbjct: 247 LSGLSASKFETDVGLFLSSVDKMQQAMQLFAE 278


>gi|33865836|ref|NP_897395.1| Thf1-like protein [Synechococcus sp. WH 8102]
 gi|81574513|sp|Q7U6N6.1|THF1_SYNPX RecName: Full=Protein thf1
 gi|33633006|emb|CAE07817.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
          Length = 212

 Score = 73.6 bits (179), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 54/199 (27%), Positives = 95/199 (47%), Gaps = 19/199 (9%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T+A++K  F + +   I  +Y  +  EL+V+ HL+ ++ T+Q + +FA+G  TV++R  +
Sbjct: 6   TIADSKRAFHQAFPHVIAPLYRRIADELLVELHLLSHQATFQANSLFAVGLKTVFERFTQ 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY   E   A+  A  ++   D EQ +  AQ   + A G +  +   +  ++   +G   
Sbjct: 66  GYRPMEHPAALLSALCSSNGFDDEQLKQAAQHCLQDAEGHSDDAFQSWLKEQSLSDGA-- 123

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL-----EKLCAVLNVNKRSVDRDL 246
                        YSR  AVGL  LLE ++             KL   L +    V++DL
Sbjct: 124 ------------HYSRLMAVGLLALLEASSDESDASSLRQRAVKLSVDLGLPAERVEKDL 171

Query: 247 DVYRNLLSKLLQAKELLKE 265
            V+ +   ++ QA EL++E
Sbjct: 172 TVFSSNSERMEQAVELMQE 190


>gi|72382131|ref|YP_291486.1| Thf1-like protein [Prochlorococcus marinus str. NATL2A]
 gi|97202784|sp|Q46L45.1|THF1_PROMT RecName: Full=Protein thf1
 gi|72001981|gb|AAZ57783.1| conserved hypothetical protein [Prochlorococcus marinus str.
           NATL2A]
          Length = 199

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/202 (24%), Positives = 95/202 (47%), Gaps = 19/202 (9%)

Query: 69  VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 128
           V  T++++K +F K +   IP+IY  +  EL+V+ HL+ +++ ++ D +F+ G   V+ +
Sbjct: 3   VRATISDSKSDFHKEFPYVIPAIYRKLADELLVELHLLSHQKNFKKDSIFSTGLKEVFSK 62

Query: 129 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
              GY   E    +F A       +P +    +++L   A+  T   L  F SK      
Sbjct: 63  FTSGYKPSEHATKLFDAICNCNGFNPTEINNSSEQLVSNAKSFTKEDLNSFLSKTNN--- 119

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS------- 241
                      KG   YSR  A+G+++L+      +    E L   +N   +S       
Sbjct: 120 ---------DNKGYDYYSRINAIGIYKLVSEMPLFKEVKEEDLNKEINDISKSLGYQYSR 170

Query: 242 VDRDLDVYRNLLSKLLQAKELL 263
           V++D+ +Y++ + K+ QA E++
Sbjct: 171 VEKDISMYKSNIEKMKQALEII 192


>gi|78212971|ref|YP_381750.1| Thf1-like protein [Synechococcus sp. CC9605]
 gi|97202855|sp|Q3AJN7.1|THF1_SYNSC RecName: Full=Protein thf1
 gi|78197430|gb|ABB35195.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 215

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 55/204 (26%), Positives = 101/204 (49%), Gaps = 22/204 (10%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T+A++K  F + +   I  +Y  +  EL+V+ HL+ ++  ++ + +F++G  TV+D  ++
Sbjct: 6   TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSRFEANELFSVGLCTVFDTFIK 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E   +A+F+A  ++   D  + R     L E A+G+   SL ++ S     EG   
Sbjct: 66  GYRPEAQTDALFRALCSSNGFDAAKLRKTYASLVEQAKGKDPESLKDWLSSHALKEG--- 122

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLE------LANATEPTVLE--KLCAVLNVNKRSVD 243
                        YSR  AVGL  LL+        + TE  V +  +L   L +    V+
Sbjct: 123 -----------SHYSRLMAVGLMSLLKAAAADATDSDTEAIVKQSKELAEGLGLPTDRVE 171

Query: 244 RDLDVYRNLLSKLLQAKELLKEYV 267
           +DL ++ +   ++ QA EL++E +
Sbjct: 172 KDLTLFGSNSERMDQAVELVEETI 195


>gi|159903384|ref|YP_001550728.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9211]
 gi|254784145|sp|A9BAB2.1|THF1_PROM4 RecName: Full=Protein thf1
 gi|159888560|gb|ABX08774.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9211]
          Length = 221

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 106/203 (52%), Gaps = 18/203 (8%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+E+K  F K +   +P++Y  ++ ELIV+ +L++ +  +  D VFA+G  +++    +
Sbjct: 6   TVSESKAIFHKEFPFVVPAVYRRLVDELIVELNLLKNQERFVADGVFAIGLTSIFLDFTK 65

Query: 132 GYPSEEDREAIFQAYITAL---KEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
           GY  E  +  + +A          + EQ  ++A+KL          SL+   +++ E E 
Sbjct: 66  GYKPENQKGILLEAICKCTGFSASNLEQIALEAKKLANGLNTNEIKSLITDNNRD-EKES 124

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRS 241
             K I +      N  YSR  A+G+++L+++ +       ATE + L+ L       K  
Sbjct: 125 TYKLINK------NNHYSRIIAIGIYKLVDMQSNGFNKEEATENSYLD-LVNNFGYTKER 177

Query: 242 VDRDLDVYRNLLSKLLQAKELLK 264
           V++D+++Y++ L K+ +A EL++
Sbjct: 178 VEKDVNLYKSSLDKIEKALELIE 200


>gi|124025670|ref|YP_001014786.1| Thf1-like protein [Prochlorococcus marinus str. NATL1A]
 gi|166987530|sp|A2C211.1|THF1_PROM1 RecName: Full=Protein thf1
 gi|123960738|gb|ABM75521.1| conserved hypothetical protein [Prochlorococcus marinus str.
           NATL1A]
          Length = 199

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 50/202 (24%), Positives = 95/202 (47%), Gaps = 19/202 (9%)

Query: 69  VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 128
           V  T++++K +F K +   IP+IY  +  EL+V+ HL+ +++ ++ D +F+ G   V+ +
Sbjct: 3   VRATISDSKSDFHKEFPYVIPAIYRKLADELLVELHLLSHQKNFKKDSIFSTGLKEVFCK 62

Query: 129 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
              GY   E    +F A       +P +    +++L   A+  T   L  F SK      
Sbjct: 63  FTSGYKPSEHVTKLFDAICNCNGFNPTEINNSSEQLVSNAKSFTKEDLNSFLSKTNN--- 119

Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS------- 241
                      KG   YSR  A+G+++L+      +    E L   +N   +S       
Sbjct: 120 ---------DNKGYDYYSRINAIGIYKLVSEMPLFKEVKEEDLNKEINDISKSLGYQYSR 170

Query: 242 VDRDLDVYRNLLSKLLQAKELL 263
           V++D+ +Y++ + K+ QA E++
Sbjct: 171 VEKDISMYKSNIEKMKQALEII 192


>gi|223995057|ref|XP_002287212.1| hypothetical protein THAPSDRAFT_261275 [Thalassiosira pseudonana
           CCMP1335]
 gi|220976328|gb|EED94655.1| hypothetical protein THAPSDRAFT_261275 [Thalassiosira pseudonana
           CCMP1335]
          Length = 212

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/150 (31%), Positives = 74/150 (49%), Gaps = 2/150 (1%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV E    F      P+ ++Y  +  +L+   HL+     +Q D VF+LG V+  D +++
Sbjct: 2   TVGEAFTQFTDKLGTPVNALYKGMCTDLVGSLHLVMVNARFQRDAVFSLGLVSALDLVLK 61

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
            YP  E    I  A + ++  D      +A  LE WA+G+T   +      EG+ +  L 
Sbjct: 62  NYPEAETGARIKSAMLESVGLDEAVVNAEAAALEAWAQGKTKEDIASALKGEGDSQ--LA 119

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELAN 221
            IA+ A G   + YSRFF VGL R++E+  
Sbjct: 120 AIAKAAKGDQWWMYSRFFGVGLVRIMEIVG 149


>gi|33240369|ref|NP_875311.1| Thf1-like protein [Prochlorococcus marinus subsp. marinus str.
           CCMP1375]
 gi|81664534|sp|Q7VC23.1|THF1_PROMA RecName: Full=Protein thf1
 gi|33237896|gb|AAP99963.1| Uncharacterized protein [Prochlorococcus marinus subsp. marinus
           str. CCMP1375]
          Length = 214

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/199 (21%), Positives = 97/199 (48%), Gaps = 9/199 (4%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           T++++K  F K +   IP +Y  VL E +V+ +L+  +  ++ D +F+ G +  ++R   
Sbjct: 6   TISDSKGLFHKEFPYVIPPVYRKVLDEYLVELNLLSNQSNFKIDTIFSYGLIISFERFTV 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  +     I ++   +   D +  +  +  +++    +    ++   +   E++  + 
Sbjct: 66  GYEPDSHISKILESLCNSCNIDIKAIKEYSNNIKKLINEKGIKEIINILT--AEIKKSVG 123

Query: 192 DIA-ERASGKGNFSYSRFFAVGLFRLLELAN-----ATEPTVLEKLCAVLNVNKRSVDRD 245
            IA    SGK  + YSR  A+G++ L+   N       +  ++ +    L  +K  V++D
Sbjct: 124 GIALSNQSGKDKY-YSRLHAIGIYELISNINEDKKEGDDKEIISECVEALGFSKDRVEKD 182

Query: 246 LDVYRNLLSKLLQAKELLK 264
           ++ Y+N + K+ +  EL+K
Sbjct: 183 INQYKNSMEKIKEMMELIK 201


>gi|219123541|ref|XP_002182081.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406682|gb|EEC46621.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 311

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/147 (24%), Positives = 73/147 (49%), Gaps = 2/147 (1%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV E   +F       +  +Y  ++ +++   HL+     +Q D +++LG +T  D L++
Sbjct: 67  TVGEAFADFSSELGVTVNPLYKNMVTDIVGTTHLVIVNARFQRDAIWSLGILTALDLLLK 126

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
            YP  E    I  A   ++  D ++ R +A+ + +WA G++ + +    + EG+    + 
Sbjct: 127 NYPEPEVGAKIVSALFKSVGLDEDEIRNEARTISDWAVGKSKADIETALTGEGDSP--VA 184

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLE 218
            IA        + YSR+F +GL +++E
Sbjct: 185 AIANSIKPNDYWMYSRYFGIGLIKIME 211


>gi|97202782|sp|Q7V1W1.2|THF1_PROMP RecName: Full=Protein thf1
          Length = 202

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 47/203 (23%), Positives = 102/203 (50%), Gaps = 26/203 (12%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K  F + +   IP +Y  ++ E++V+ +L+ ++  +  D +F +G    +  L +
Sbjct: 6   TVSDSKKLFHEQFPYVIPGLYKRIVDEMLVELNLLNHQNEFIQDDLFCVGLTETFKELTK 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  EE    +F++   +   +P++ +  ++K  E  + ++            E+  LLK
Sbjct: 66  GYKPEEHLRVLFESLCNSSNFEPKKIKEASKKTLEVYKDKSLK----------EISILLK 115

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELANATE---------PTVLEKLCAVLNVNKRSV 242
             ++      N   SR   +G++  L +ANAT+           ++  +   LN++    
Sbjct: 116 QKSD-----SNLYSSRILNLGIY--LIIANATDFKDIKDPEKNKIISDIINKLNLSFNKA 168

Query: 243 DRDLDVYRNLLSKLLQAKELLKE 265
           ++D+ +Y++ + K+ QAKELL+E
Sbjct: 169 EKDIGIYKSSILKMEQAKELLQE 191


>gi|254526529|ref|ZP_05138581.1| photosystem II biogenesis protein Psp29 [Prochlorococcus marinus
           str. MIT 9202]
 gi|221537953|gb|EEE40406.1| photosystem II biogenesis protein Psp29 [Prochlorococcus marinus
           str. MIT 9202]
          Length = 202

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 43/201 (21%), Positives = 106/201 (52%), Gaps = 22/201 (10%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K  F + +   IP +Y  +  E++V+ +L+ ++  +  D +F +G    +  LM+
Sbjct: 6   TVSDSKKLFHEKFPYVIPGLYKRIADEMLVELNLLNHQNEFTQDFLFCVGLTETFKELMK 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E+  + +F++  ++   + ++    +QK ++  + +T++ +V+             
Sbjct: 66  GYQPEKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKDKTSTDIVKL------------ 113

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL----ELANATEPTVLEKLCAV---LNVNKRSVDR 244
            + E+++ K     SR   +G++ L+    +L    E  + + +  +   LN++    ++
Sbjct: 114 -LIEKSNSK--LYPSRILNLGIYILISNAQDLKKKNESDINKMISDIFEQLNLSANKAEK 170

Query: 245 DLDVYRNLLSKLLQAKELLKE 265
           D+ +Y++ +SK+ QAKEL++E
Sbjct: 171 DIGIYKSSISKMEQAKELIEE 191


>gi|33861298|ref|NP_892859.1| Thf1-like protein [Prochlorococcus marinus subsp. pastoris str.
           CCMP1986]
 gi|33633875|emb|CAE19200.1| conserved hypothetical protein [Prochlorococcus marinus subsp.
           pastoris str. CCMP1986]
          Length = 217

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 47/203 (23%), Positives = 102/203 (50%), Gaps = 26/203 (12%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K  F + +   IP +Y  ++ E++V+ +L+ ++  +  D +F +G    +  L +
Sbjct: 21  TVSDSKKLFHEQFPYVIPGLYKRIVDEMLVELNLLNHQNEFIQDDLFCVGLTETFKELTK 80

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  EE    +F++   +   +P++ +  ++K  E  + ++            E+  LLK
Sbjct: 81  GYKPEEHLRVLFESLCNSSNFEPKKIKEASKKTLEVYKDKSLK----------EISILLK 130

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELANATE---------PTVLEKLCAVLNVNKRSV 242
             ++      N   SR   +G++  L +ANAT+           ++  +   LN++    
Sbjct: 131 QKSD-----SNLYSSRILNLGIY--LIIANATDFKDIKDPEKNKIISDIINKLNLSFNKA 183

Query: 243 DRDLDVYRNLLSKLLQAKELLKE 265
           ++D+ +Y++ + K+ QAKELL+E
Sbjct: 184 EKDIGIYKSSILKMEQAKELLQE 206


>gi|123968337|ref|YP_001009195.1| Thf1-like protein [Prochlorococcus marinus str. AS9601]
 gi|123198447|gb|ABM70088.1| conserved hypothetical protein [Prochlorococcus marinus str.
           AS9601]
          Length = 218

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 50/205 (24%), Positives = 108/205 (52%), Gaps = 30/205 (14%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K  F + +   IP +Y  ++ E++V+ +L+ ++  +  D +F +G    +  LM+
Sbjct: 21  TVSDSKKLFHEKFPYVIPGLYKRIVDEMLVELNLLNHQNEFTLDYLFCVGLTETFKELMK 80

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E+  + +F++  ++          +A+++ E ++ ++   LV+  SK+     +LK
Sbjct: 81  GYQPEKHLDLLFESLCSST-------NFEAKEINEISK-KSQKELVDKTSKD-----ILK 127

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELA-----------NATEPTVLEKLCAVLNVNKR 240
            + E+ + K     SR   +G++ L+  +           N     + EKL   L+ NK 
Sbjct: 128 LLVEKNNSK--LYPSRILNLGIYTLISNSQDFKEKNESDKNKMTSDIFEKLS--LSANK- 182

Query: 241 SVDRDLDVYRNLLSKLLQAKELLKE 265
             ++D+ +Y++ +SK+ QAKEL++E
Sbjct: 183 -AEKDIGIYKSSISKMEQAKELIEE 206


>gi|123200442|gb|ABM72050.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9515]
          Length = 198

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 53/204 (25%), Positives = 100/204 (49%), Gaps = 28/204 (13%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+E+K  F + +   IP +Y  ++ E++V+ +L+ ++  +  D +F +G    +  L +
Sbjct: 2   TVSESKKLFHEQFPFVIPGLYKRIVDEMLVELNLLNHQNEFIQDELFCVGLTETFKELTK 61

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYR-IDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
           GY  E   E +F++   +    P + + I  + LE++                 E+  LL
Sbjct: 62  GYKPESHLELLFESLCKSSNFIPSKIKEISLKTLEQYKDKSLK-----------EISILL 110

Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL---EKLCAV------LNVNKRS 241
           K+         N   SR   +G++  L +ANAT+   L   EK  A+      LN++   
Sbjct: 111 KE-----KSTSNLYSSRILNIGIY--LIIANATDFKGLKDSEKNKAITDNINNLNLSVNK 163

Query: 242 VDRDLDVYRNLLSKLLQAKELLKE 265
            ++D+ +Y++ + K+ QAKELL+E
Sbjct: 164 AEKDIGIYKSSIKKMEQAKELLEE 187


>gi|161407964|ref|YP_001011157.2| Thf1-like protein [Prochlorococcus marinus str. MIT 9515]
          Length = 217

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 53/204 (25%), Positives = 100/204 (49%), Gaps = 28/204 (13%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+E+K  F + +   IP +Y  ++ E++V+ +L+ ++  +  D +F +G    +  L +
Sbjct: 21  TVSESKKLFHEQFPFVIPGLYKRIVDEMLVELNLLNHQNEFIQDELFCVGLTETFKELTK 80

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYR-IDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
           GY  E   E +F++   +    P + + I  + LE++                 E+  LL
Sbjct: 81  GYKPESHLELLFESLCKSSNFIPSKIKEISLKTLEQYKDKSLK-----------EISILL 129

Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL---EKLCAV------LNVNKRS 241
           K+         N   SR   +G++  L +ANAT+   L   EK  A+      LN++   
Sbjct: 130 KE-----KSTSNLYSSRILNIGIY--LIIANATDFKGLKDSEKNKAITDNINNLNLSVNK 182

Query: 242 VDRDLDVYRNLLSKLLQAKELLKE 265
            ++D+ +Y++ + K+ QAKELL+E
Sbjct: 183 AEKDIGIYKSSIKKMEQAKELLEE 206


>gi|157413170|ref|YP_001484036.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9215]
 gi|157387745|gb|ABV50450.1| Conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9215]
          Length = 217

 Score = 63.5 bits (153), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 43/201 (21%), Positives = 105/201 (52%), Gaps = 22/201 (10%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K  F + +   IP +Y  +  E++V+ +L+ ++  +  D +F +G    +  LM+
Sbjct: 21  TVSDSKKLFHEKFPYVIPGLYKRIADEMLVELNLLNHQNEFTQDFLFCVGLTETFKELMK 80

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E+  + +F++  ++   + ++    +QK ++  + +T++ +V+             
Sbjct: 81  GYQPEKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKDKTSTDIVKL------------ 128

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL----ELANATEPTVLEKLCAV---LNVNKRSVDR 244
            + E+++ K     SR   +G++ L+    +L    E    + +  +   LN++    ++
Sbjct: 129 -LIEKSNSK--LYPSRILNLGIYILISNAQDLKKNNESDTNKMISDIFEKLNLSANKAEK 185

Query: 245 DLDVYRNLLSKLLQAKELLKE 265
           D+ +Y++ +SK+ QAKEL++E
Sbjct: 186 DIGIYKSSISKMEQAKELIEE 206


>gi|323450067|gb|EGB05951.1| hypothetical protein AURANDRAFT_66018 [Aureococcus anophagefferens]
          Length = 1032

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 69/155 (44%), Gaps = 5/155 (3%)

Query: 112 YQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQ 171
           + YD +F  GFVT+ D +M  YP   D E I  A I AL  DP   R D + + EW  G+
Sbjct: 69  FVYDELFGFGFVTLMDMIMSPYPVAGDGEKITDALIAALDMDPATLRGDHKAVTEWLAGK 128

Query: 172 T-ASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-TEPTVLE 229
           T A  L    S +G     +   A    G+  F ++R   VGL  +++      +   L 
Sbjct: 129 TEADVLAAVASNDGS---KVASAAATIKGQEEFHHTRPSNVGLVAVMDAVGCKPDDESLA 185

Query: 230 KLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLK 264
           +      +   +V R+  + +    K+  A +++K
Sbjct: 186 RWTEAFGMRAPAVQRNAGLLKEYQEKVANAMQMIK 220


>gi|194476659|ref|YP_002048838.1| hypothetical protein PCC_0178 [Paulinella chromatophora]
 gi|171191666|gb|ACB42628.1| hypothetical protein PCC_0178 [Paulinella chromatophora]
          Length = 213

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 53/207 (25%), Positives = 86/207 (41%), Gaps = 36/207 (17%)

Query: 71  PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
           PTVA+TK  F K +   I   + TVL EL+V+  L+  +     DP+FA+G +  +  L 
Sbjct: 8   PTVADTKRAFYKGFPYVIAPSHRTVLNELLVELFLLSPQTDIGSDPLFAVGLIQFFGVLT 67

Query: 131 EGYPSEEDREAIFQAYITALKEDP--------------EQYRIDAQKLEEWARGQTASSL 176
           + Y  +  R  +F+A   ++  D                QY I  ++L  W+     +S 
Sbjct: 68  KHYQPQNHRMLLFEALCNSIGFDSFNLRQIRKESLSELSQYNI--EELHSWSLTGADNSE 125

Query: 177 VEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLE 229
           + F      +             K  F YSR  A+GL  L++ A   E         +  
Sbjct: 126 ILFTKTFIPI-------------KRRFHYSRLMAIGLLCLIKRARGVETLEAKELYYLTH 172

Query: 230 KLCAVLNVNKRSVDRDLDVYRNLLSKL 256
            L   +   +  +DRDL VY + + K+
Sbjct: 173 NLAEKMGFIRERIDRDLSVYIDTIEKM 199


>gi|97202762|sp|Q31BD6.2|THF1_PROM9 RecName: Full=Protein thf1
          Length = 201

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 44/201 (21%), Positives = 107/201 (53%), Gaps = 22/201 (10%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K  F + +   IP +Y  ++ E++V+ +L+ ++  ++ D +F +G    +  L +
Sbjct: 6   TVSDSKKLFHEEFPYVIPGLYKRIVDEILVELNLLNHQNEFKQDYLFCIGLTETFKELTK 65

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E+  + +F++   +          +A++++E ++     S  EF  K  +   +LK
Sbjct: 66  GYKPEKHLDLLFESLCIST-------NFEAKEIKEISK----ISQKEFSDKSSK--DILK 112

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL-------ELANATEPTVLEKLCAVLNVNKRSVDR 244
            + E+++ K     SR   +G++ L+       E  +  +  ++  +   L++++   ++
Sbjct: 113 LLKEKSNSK--LYPSRILNLGIYILISNSQDFKENNDIEKNKMISDIFEKLSLSRNKAEK 170

Query: 245 DLDVYRNLLSKLLQAKELLKE 265
           D+ +Y++ +SK+ QAKEL++E
Sbjct: 171 DIGIYKSSISKMEQAKELIQE 191


>gi|78779133|ref|YP_397245.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9312]
 gi|78712632|gb|ABB49809.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9312]
          Length = 216

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 44/201 (21%), Positives = 107/201 (53%), Gaps = 22/201 (10%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K  F + +   IP +Y  ++ E++V+ +L+ ++  ++ D +F +G    +  L +
Sbjct: 21  TVSDSKKLFHEEFPYVIPGLYKRIVDEILVELNLLNHQNEFKQDYLFCIGLTETFKELTK 80

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY  E+  + +F++   +          +A++++E ++     S  EF  K  +   +LK
Sbjct: 81  GYKPEKHLDLLFESLCIST-------NFEAKEIKEISK----ISQKEFSDKSSK--DILK 127

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL-------ELANATEPTVLEKLCAVLNVNKRSVDR 244
            + E+++ K     SR   +G++ L+       E  +  +  ++  +   L++++   ++
Sbjct: 128 LLKEKSNSK--LYPSRILNLGIYILISNSQDFKENNDIEKNKMISDIFEKLSLSRNKAEK 185

Query: 245 DLDVYRNLLSKLLQAKELLKE 265
           D+ +Y++ +SK+ QAKEL++E
Sbjct: 186 DIGIYKSSISKMEQAKELIQE 206


>gi|126543182|gb|ABO17424.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9301]
          Length = 198

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/201 (19%), Positives = 103/201 (51%), Gaps = 22/201 (10%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K  F + +   IP +Y  ++ E++V+ +L+ ++  +  + +F +G    +  LM+
Sbjct: 2   TVSDSKRLFHEKFPYVIPGLYKRIVDEILVELNLLNHQNEFTQEYLFCIGLTETFKELMK 61

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY   +  + +F++  ++   + ++    +QK ++  + +T++              +LK
Sbjct: 62  GYQPNKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKNKTSND-------------ILK 108

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL-------ELANATEPTVLEKLCAVLNVNKRSVDR 244
            + E+++ K     SR   + ++ L+       E   +    ++  +   LN++    ++
Sbjct: 109 LLIEKSNSK--LYPSRILNLAIYILISSAQDLKEKEESGRNKIISDIFEKLNLSANKAEK 166

Query: 245 DLDVYRNLLSKLLQAKELLKE 265
           D+ +Y++ +SK+ QAKEL++E
Sbjct: 167 DIGIYKSSISKMEQAKELIEE 187


>gi|161407965|ref|YP_001091025.2| Thf1-like protein [Prochlorococcus marinus str. MIT 9301]
          Length = 217

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/201 (19%), Positives = 103/201 (51%), Gaps = 22/201 (10%)

Query: 72  TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
           TV+++K  F + +   IP +Y  ++ E++V+ +L+ ++  +  + +F +G    +  LM+
Sbjct: 21  TVSDSKRLFHEKFPYVIPGLYKRIVDEILVELNLLNHQNEFTQEYLFCIGLTETFKELMK 80

Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
           GY   +  + +F++  ++   + ++    +QK ++  + +T++              +LK
Sbjct: 81  GYQPNKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKNKTSN-------------DILK 127

Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL-------ELANATEPTVLEKLCAVLNVNKRSVDR 244
            + E+++ K     SR   + ++ L+       E   +    ++  +   LN++    ++
Sbjct: 128 LLIEKSNSK--LYPSRILNLAIYILISSAQDLKEKEESGRNKIISDIFEKLNLSANKAEK 185

Query: 245 DLDVYRNLLSKLLQAKELLKE 265
           D+ +Y++ +SK+ QAKEL++E
Sbjct: 186 DIGIYKSSISKMEQAKELIEE 206


>gi|256810247|ref|YP_003127616.1| CRISPR-associated protein, Csx11 family [Methanocaldococcus fervens
            AG86]
 gi|256793447|gb|ACV24116.1| CRISPR-associated protein, Csx11 family [Methanocaldococcus fervens
            AG86]
          Length = 1056

 Score = 41.6 bits (96), Expect = 0.39,   Method: Composition-based stats.
 Identities = 28/101 (27%), Positives = 50/101 (49%), Gaps = 3/101 (2%)

Query: 164  LEEWARGQTASSLVEF-PSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA 222
            LE+W +      + E  PSK  ++  ++    E    K + S S+F       +LEL   
Sbjct: 942  LEDWKKFIKFKEIFENKPSKLQKLVNIIYKCLEDWDNKYDDSISQFLDTSFINVLELNKK 1001

Query: 223  TEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELL 263
            +   V+EKLC + +++    D+DL+ +RN L   +  K++L
Sbjct: 1002 SNKEVIEKLCVIFDISLE--DKDLEKFRNELINKIDRKKML 1040


>gi|359416122|ref|ZP_09208488.1| phosphoglyceromutase [Candidatus Haloredivivus sp. G17]
 gi|358033523|gb|EHK02062.1| phosphoglyceromutase [Candidatus Haloredivivus sp. G17]
          Length = 477

 Score = 40.4 bits (93), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 27/96 (28%), Positives = 42/96 (43%), Gaps = 9/96 (9%)

Query: 51  ASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKR 110
           AS     + I C +   DV P  AE  ++ +K +++  P    TV+           ++R
Sbjct: 133 ASDYDVDVEIHCFADGRDVDPKSAEEYISQIKEWQKDYPGKIATVMGRFYSMDRDHNWER 192

Query: 111 TYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAY 146
           T+Q     ALG         EG+  E  REA+ +AY
Sbjct: 193 THQAYDAMALG---------EGFEFENPREAVKKAY 219


>gi|195028406|ref|XP_001987067.1| GH21711 [Drosophila grimshawi]
 gi|193903067|gb|EDW01934.1| GH21711 [Drosophila grimshawi]
          Length = 1053

 Score = 40.0 bits (92), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 53/118 (44%), Gaps = 22/118 (18%)

Query: 40  TSLFCHCVRFRASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQEL 99
           +S+   C R   S+  S + I+         P +AE   N   +YK           +  
Sbjct: 85  SSIHFQCRRLDKSAQFSTLAIK-------QNPVLAEAYSNLGNVYK-----------ERG 126

Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQY 157
           ++Q+ L  Y+R  +  P F  G++ +   L+    +  D EA  QAYITAL+ +P+ Y
Sbjct: 127 LLQEALDNYRRAVRLKPDFIDGYINLAAALV----AARDMEAAVQAYITALQYNPDLY 180


>gi|119331158|ref|NP_001073230.1| uncharacterized protein LOC772071 [Gallus gallus]
 gi|60098923|emb|CAH65292.1| hypothetical protein RCJMB04_15j12 [Gallus gallus]
          Length = 299

 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 57/224 (25%), Positives = 91/224 (40%), Gaps = 34/224 (15%)

Query: 59  IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRY--KRTYQY-- 114
           +IQC +      P +A+T      L++R   S +  V + LI   HLM Y  +R  QY  
Sbjct: 45  LIQCTNEMNVNIPQLADT------LFERTANSSWVVVFKALITTHHLMMYGNERFIQYLA 98

Query: 115 --DPVFALGFVTVYDR-LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG- 170
             + +F L      D+  M+GY    D     + Y   L E    YR+ A    +  RG 
Sbjct: 99  SRNTLFNLN--NYLDKSAMQGY----DMSTFIRRYSRYLNEKALSYRLVAVDFTKMKRGI 152

Query: 171 ------QTASSLVE-FPSKEGEVEGLLK---DIAERASGKGNFSYSRFFAVGLFRLLELA 220
                   A  L++  P  + + + LL    +  E  +G  N ++   F   + RL    
Sbjct: 153 DGVMRTMNAEKLLKTLPIIQNQHDALLDFEANPNELTNGVINAAFMLLFKDSI-RLFAAY 211

Query: 221 NATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLK 264
           N     +LEK     ++ K      LD+Y+  L ++ +  E LK
Sbjct: 212 NEGIINLLEK---YFDMKKNQCREGLDMYKKFLGRMTKLSEFLK 252


>gi|359483284|ref|XP_003632934.1| PREDICTED: mixed-amyrin synthase-like [Vitis vinifera]
          Length = 170

 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 6/76 (7%)

Query: 194 AERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLL 253
           AE  + + NF  +RF AV L   L L    EP  +E+L   +NV   ++ R  +V R   
Sbjct: 40  AEVEAARENFWKNRFLAVLLSSKLSLETVGEPLDMEQLFDAVNVMILNLKRQFEVLR--- 96

Query: 254 SKLLQAKELLKEYVDR 269
              ++  E +KEYVDR
Sbjct: 97  ---MKDNESIKEYVDR 109


>gi|345306750|ref|XP_001509380.2| PREDICTED: phosphatidylinositol-binding clathrin assembly
           protein-like [Ornithorhynchus anatinus]
          Length = 472

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 57/224 (25%), Positives = 91/224 (40%), Gaps = 34/224 (15%)

Query: 59  IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRY--KRTYQY-- 114
           +IQC +      P +A+T      L++R   S +  V + LI   HLM Y  +R  QY  
Sbjct: 18  LIQCTNEMNVNIPQLADT------LFERTANSSWVVVFKALITTHHLMMYGNERFIQYLA 71

Query: 115 --DPVFALGFVTVYDR-LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG- 170
             + +F L      D+  M+GY    D     + Y   L E    YR+ A    +  RG 
Sbjct: 72  SRNTLFNLN--NYLDKSAMQGY----DMSTFIRRYSRYLNEKALSYRLVAVDFTKMKRGI 125

Query: 171 -------QTASSLVEFPSKEGEVEGLLK---DIAERASGKGNFSYSRFFAVGLFRLLELA 220
                   T   L   P  + +++ LL    +  E  +G  N ++   F   + RL    
Sbjct: 126 DGVMRTMNTEKLLKTLPIIQNQLDALLDFDANPNELTNGVINAAFMLLFKDSI-RLFAAY 184

Query: 221 NATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLK 264
           N     +LEK     ++ K      LD+Y+  L+++ +  E LK
Sbjct: 185 NEGIINLLEK---YFDMKKNQCREGLDIYKKFLARMTKLSEFLK 225


>gi|387015170|gb|AFJ49704.1| Phosphatidylinositol-binding clathrin assembly protein-like
           [Crotalus adamanteus]
          Length = 654

 Score = 38.9 bits (89), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 57/224 (25%), Positives = 91/224 (40%), Gaps = 34/224 (15%)

Query: 59  IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRY--KRTYQY-- 114
           +IQC +      P +A+T      L++R   S +  V + LI   HLM Y  +R  QY  
Sbjct: 45  LIQCTNEMNVNIPQLADT------LFERTANSSWVVVFKALITTHHLMMYGNERFIQYLA 98

Query: 115 --DPVFALGFVTVYDR-LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG- 170
             + +F L      D+  M+GY    D     + Y   L E    YR+ A    +  RG 
Sbjct: 99  SRNTLFNLN--NYLDKSAMQGY----DMSTFIRRYSRYLNEKALSYRLVAVDFTKMKRGI 152

Query: 171 -------QTASSLVEFPSKEGEVEGLLK---DIAERASGKGNFSYSRFFAVGLFRLLELA 220
                   T   L   P  + +++ LL    +  E  +G  N ++   F   + RL    
Sbjct: 153 DGVMRTMNTEKLLKTLPIIQNQLDALLDFDANPNELTNGVINAAFMLLFKDSI-RLFAAY 211

Query: 221 NATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLK 264
           N     +LEK     ++ K      LD+Y+  L+++ +  E LK
Sbjct: 212 NEGIINLLEKY---FDMKKNQCKEGLDIYKKFLARMTKLSEFLK 252


>gi|194767414|ref|XP_001965811.1| GF13981 [Drosophila ananassae]
 gi|190625935|gb|EDV41459.1| GF13981 [Drosophila ananassae]
          Length = 396

 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 52/118 (44%), Gaps = 22/118 (18%)

Query: 40  TSLFCHCVRFRASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQEL 99
           +S+   C R   S+  S + I+         P +AE   N   +YK           +  
Sbjct: 92  SSIHFQCRRLDKSAQFSTLAIK-------QNPVLAEAYSNLGNVYK-----------ERG 133

Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQY 157
            +Q+ L  Y+R  +  P F  G++ +   L+    +  D E+  QAYITAL+ +PE Y
Sbjct: 134 QLQEALDNYRRAVRLKPDFIDGYINLAAALV----AARDMESAVQAYITALQYNPELY 187


>gi|195382543|ref|XP_002049989.1| GJ20442 [Drosophila virilis]
 gi|194144786|gb|EDW61182.1| GJ20442 [Drosophila virilis]
          Length = 1050

 Score = 38.1 bits (87), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 52/118 (44%), Gaps = 22/118 (18%)

Query: 40  TSLFCHCVRFRASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQEL 99
           +S+   C R   S+  S + I+         P +AE   N   +YK           +  
Sbjct: 82  SSIHFQCRRLDKSAQFSTLAIK-------QNPVLAEAYSNLGNVYK-----------ERG 123

Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQY 157
            +Q+ L  Y+R  +  P F  G++ +   L+    +  D E+  QAYITAL+ +PE Y
Sbjct: 124 QLQEALDNYRRAVRLKPDFIDGYINLAAALV----AARDMESAVQAYITALQYNPELY 177


>gi|226940415|ref|YP_002795489.1| peroxiredoxin/glutaredoxin family protein [Laribacter hongkongensis
           HLHK9]
 gi|226715342|gb|ACO74480.1| Probable peroxiredoxin/glutaredoxin family protein [Laribacter
           hongkongensis HLHK9]
          Length = 245

 Score = 38.1 bits (87), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 30/117 (25%), Positives = 56/117 (47%), Gaps = 12/117 (10%)

Query: 164 LEEWARGQTASSLVEFPSKEGEVE---GLLKDIAERASGKGNFSYSRFFAVGLFR--LLE 218
           + EWA+ Q ++++V  P   GE     G+L D A+   GK ++ YS     G+ +   +E
Sbjct: 86  MNEWAKDQESANIVMVPDGNGEFTEGMGMLVDKADLGFGKRSWRYSMLVKDGVVQKMFIE 145

Query: 219 LANATEP---TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLK----EYVD 268
                +P   +  + + A +N N +  D+ +   ++      +AKELL     +Y+D
Sbjct: 146 PQEPGDPFKVSDADTMLAYINPNAKKPDQVVVFSKDGCPFCAKAKELLSGKGYDYID 202


>gi|421145705|ref|ZP_15605552.1| methylaspartate mutase [Fusobacterium nucleatum subsp. fusiforme
           ATCC 51190]
 gi|395487895|gb|EJG08803.1| methylaspartate mutase [Fusobacterium nucleatum subsp. fusiforme
           ATCC 51190]
          Length = 493

 Score = 37.0 bits (84), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 26/125 (20%), Positives = 61/125 (48%), Gaps = 15/125 (12%)

Query: 65  TATDVPPTVAETK--MNFLKLYKRPIPSI------YNTVLQE----LIVQQHLMRYKRTY 112
           TAT VPP ++ +   +  L   ++ + +I      Y  + Q+    L +++H+  Y  T+
Sbjct: 220 TATLVPPAISNSIQILEILLAVEQGVKNISIGVAQYGNITQDIASLLALKEHIQSYLDTF 279

Query: 113 QYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQT 172
            +  +      TV+++ + G+P EE +     +Y T +    +  RI  + ++E+A+   
Sbjct: 280 SFKDI---NISTVFNQWIGGFPEEELKAYSLISYSTTIALFSKTNRIFVKNIDEYAKNSL 336

Query: 173 ASSLV 177
            ++++
Sbjct: 337 GNTMI 341


>gi|237741274|ref|ZP_04571755.1| methylaspartate mutase [Fusobacterium sp. 4_1_13]
 gi|229430806|gb|EEO41018.1| methylaspartate mutase [Fusobacterium sp. 4_1_13]
          Length = 485

 Score = 37.0 bits (84), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 26/125 (20%), Positives = 61/125 (48%), Gaps = 15/125 (12%)

Query: 65  TATDVPPTVAETK--MNFLKLYKRPIPSI------YNTVLQE----LIVQQHLMRYKRTY 112
           TAT VPP ++ +   +  L   ++ + +I      Y  + Q+    L +++H+  Y  T+
Sbjct: 220 TATLVPPAISNSIQILEILLAVEQGVKNISIGVAQYGNITQDIASLLALKEHIQSYLDTF 279

Query: 113 QYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQT 172
            +  +      TV+++ + G+P EE +     +Y T +    +  RI  + ++E+A+   
Sbjct: 280 SFKDI---NISTVFNQWIGGFPEEELKAYSLISYSTTIALFSKTNRIFVKNIDEYAKNSL 336

Query: 173 ASSLV 177
            ++++
Sbjct: 337 GNTMI 341


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.133    0.375 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,682,848,710
Number of Sequences: 23463169
Number of extensions: 142750233
Number of successful extensions: 390839
Number of sequences better than 100.0: 210
Number of HSP's better than 100.0 without gapping: 180
Number of HSP's successfully gapped in prelim test: 30
Number of HSP's that attempted gapping in prelim test: 390385
Number of HSP's gapped (non-prelim): 213
length of query: 269
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 129
effective length of database: 9,074,351,707
effective search space: 1170591370203
effective search space used: 1170591370203
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 75 (33.5 bits)