BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 026654
         (235 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|356543780|ref|XP_003540338.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
           [Glycine max]
          Length = 297

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 194/229 (84%), Positives = 214/229 (93%)

Query: 3   SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
           +DVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFALGFVT+Y
Sbjct: 66  TDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFALGFVTIY 125

Query: 63  DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
           D+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q  +SLVEF SKEGEV
Sbjct: 126 DKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQKPTSLVEFSSKEGEV 185

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
           EG+LKDIAERA GKG FSYSRFFAVGLFRLLELANATEPT+L+KLC  LN+NKRSVDRDL
Sbjct: 186 EGILKDIAERAGGKGEFSYSRFFAVGLFRLLELANATEPTILDKLCVALNINKRSVDRDL 245

Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYL 231
           DVYR LLSKL+QAKELLKEY+DREKKKR+ER EPQKANEAI  CLG+ L
Sbjct: 246 DVYRILLSKLVQAKELLKEYIDREKKKRDERAEPQKANEAITTCLGQQL 294


>gi|356549970|ref|XP_003543363.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
           [Glycine max]
          Length = 297

 Score =  409 bits (1052), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 192/230 (83%), Positives = 215/230 (93%)

Query: 3   SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
           +DVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFALGFVT+Y
Sbjct: 66  TDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFALGFVTIY 125

Query: 63  DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
           D+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q+ +SLVEF SKEGE 
Sbjct: 126 DKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQSPTSLVEFSSKEGEA 185

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
           E +LKDIAERA GKG FSYSRFFAVGLFRL+ELANATEPT+L+KLCA LN+NKRSVDRDL
Sbjct: 186 ERILKDIAERAGGKGEFSYSRFFAVGLFRLVELANATEPTILDKLCAALNINKRSVDRDL 245

Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLY 232
           DVYR LLSKL+QAKELLKEY+DREKKKR+ER EPQKANEAI  CLG+ L+
Sbjct: 246 DVYRILLSKLVQAKELLKEYIDREKKKRDERVEPQKANEAITTCLGQQLH 295


>gi|255636566|gb|ACU18621.1| unknown [Glycine max]
          Length = 297

 Score =  409 bits (1051), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 192/230 (83%), Positives = 215/230 (93%)

Query: 3   SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
           +DVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFALGFVT+Y
Sbjct: 66  TDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFALGFVTIY 125

Query: 63  DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
           D+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q+ +SLVEF SKEGE 
Sbjct: 126 DKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQSPTSLVEFSSKEGEA 185

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
           E +LKDIAERA GKG FSYSRFFAVGLFRL+ELANATEPT+L+KLCA LN+NKRSVDRDL
Sbjct: 186 ERILKDIAERAGGKGEFSYSRFFAVGLFRLVELANATEPTILDKLCAALNINKRSVDRDL 245

Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLY 232
           DVYR LLSKL+QAKELLKEY+DREKKKR+ER EPQKANEAI  CLG+ L+
Sbjct: 246 DVYRILLSKLVQAKELLKEYIDREKKKRDERVEPQKANEAITTCLGQQLH 295


>gi|388514959|gb|AFK45541.1| unknown [Medicago truncatula]
          Length = 303

 Score =  409 bits (1051), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 191/233 (81%), Positives = 215/233 (92%), Gaps = 1/233 (0%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           +SD PPTV+ETK+NFLK YKRPIPSIYN+VLQELIVQQHLMRYK++Y+YDPVFALGFVTV
Sbjct: 68  VSD-PPTVSETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKKSYRYDPVFALGFVTV 126

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
           YD+LMEGYPS+EDR+AIFQAYI ALKEDP QYR+DAQKLEEWAR Q A+SL+EF S+EGE
Sbjct: 127 YDQLMEGYPSDEDRDAIFQAYINALKEDPAQYRVDAQKLEEWARAQNATSLIEFSSREGE 186

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
           VEG LKDIAERA G G+FSYSRFFAVGLFRLLELAN  EPT+LEKLC+ LN+NK+SVDRD
Sbjct: 187 VEGTLKDIAERAGGNGDFSYSRFFAVGLFRLLELANTMEPTILEKLCSALNINKKSVDRD 246

Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYSH 234
           LDVYRNLLSKL+QAKELLKEY+DREKKK EER EPQKANEAI KCLG+  +S+
Sbjct: 247 LDVYRNLLSKLVQAKELLKEYIDREKKKIEERAEPQKANEAISKCLGQEQFSN 299


>gi|359485791|ref|XP_002275686.2| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Vitis
           vinifera]
          Length = 299

 Score =  407 bits (1046), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 195/232 (84%), Positives = 215/232 (92%), Gaps = 1/232 (0%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           ++DVP TV+ETKMNFLK YKRPIPSIYNT+LQEL+VQQHLMRYKRTY+YD VFALGFVTV
Sbjct: 65  VTDVP-TVSETKMNFLKNYKRPIPSIYNTLLQELMVQQHLMRYKRTYRYDAVFALGFVTV 123

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
           YD+LM+GYPS+EDR+ IFQ YI AL+EDPEQYR DAQ LEEWAR QTASSLVEF SKEGE
Sbjct: 124 YDQLMDGYPSDEDRDIIFQVYIKALREDPEQYRKDAQMLEEWARSQTASSLVEFSSKEGE 183

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
           VEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANATEPT+LEKLCA  N++KRSVDRD
Sbjct: 184 VEGILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPTILEKLCAAFNISKRSVDRD 243

Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
           LDVYRNLL+KL+QAKELLKEYVDREKKKREER E QKANEAI KCLGEY Y+
Sbjct: 244 LDVYRNLLTKLVQAKELLKEYVDREKKKREERVESQKANEAITKCLGEYEYT 295


>gi|296084957|emb|CBI28372.3| unnamed protein product [Vitis vinifera]
          Length = 243

 Score =  406 bits (1044), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 195/232 (84%), Positives = 215/232 (92%), Gaps = 1/232 (0%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           ++DVP TV+ETKMNFLK YKRPIPSIYNT+LQEL+VQQHLMRYKRTY+YD VFALGFVTV
Sbjct: 9   VTDVP-TVSETKMNFLKNYKRPIPSIYNTLLQELMVQQHLMRYKRTYRYDAVFALGFVTV 67

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
           YD+LM+GYPS+EDR+ IFQ YI AL+EDPEQYR DAQ LEEWAR QTASSLVEF SKEGE
Sbjct: 68  YDQLMDGYPSDEDRDIIFQVYIKALREDPEQYRKDAQMLEEWARSQTASSLVEFSSKEGE 127

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
           VEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANATEPT+LEKLCA  N++KRSVDRD
Sbjct: 128 VEGILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPTILEKLCAAFNISKRSVDRD 187

Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
           LDVYRNLL+KL+QAKELLKEYVDREKKKREER E QKANEAI KCLGEY Y+
Sbjct: 188 LDVYRNLLTKLVQAKELLKEYVDREKKKREERVESQKANEAITKCLGEYEYT 239


>gi|255553917|ref|XP_002517999.1| Protein THYLAKOID FORMATION1, chloroplast precursor, putative
           [Ricinus communis]
 gi|223542981|gb|EEF44517.1| Protein THYLAKOID FORMATION1, chloroplast precursor, putative
           [Ricinus communis]
          Length = 299

 Score =  405 bits (1042), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 191/230 (83%), Positives = 213/230 (92%), Gaps = 1/230 (0%)

Query: 3   SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
           +DVPPTV+ETK NFL  YK+PIPSIYNTVLQELIVQQHLMRYKR+Y+YDPVFALGFVTVY
Sbjct: 69  TDVPPTVSETKFNFLNSYKKPIPSIYNTVLQELIVQQHLMRYKRSYRYDPVFALGFVTVY 128

Query: 63  DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
           D+LM+GYPS+EDREAIFQAYI AL E+PEQYRIDA+KLE+WAR QT SSLV+F SKEGEV
Sbjct: 129 DQLMQGYPSDEDREAIFQAYINALNEEPEQYRIDAKKLEDWARSQTPSSLVDFSSKEGEV 188

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
           EG+LKDIAERA G G+FSYSRFFA+GLFRLLEL+N+TEPTVLEKLCA LN+NKR VDRDL
Sbjct: 189 EGILKDIAERA-GNGSFSYSRFFAIGLFRLLELSNSTEPTVLEKLCAALNINKRGVDRDL 247

Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLY 232
           DVYRNLLSKL+QAKELLKEYVDREKKK+EER   QKANEA+K CLGE L+
Sbjct: 248 DVYRNLLSKLVQAKELLKEYVDREKKKQEERASSQKANEAVKSCLGEALH 297


>gi|449438054|ref|XP_004136805.1| PREDICTED: protein THYLAKOID FORMATION 1, chloroplastic-like
           [Cucumis sativus]
 gi|449493105|ref|XP_004159194.1| PREDICTED: protein THYLAKOID FORMATION 1, chloroplastic-like
           [Cucumis sativus]
          Length = 298

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 191/223 (85%), Positives = 207/223 (92%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TVAETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTY+YDPVFALGFVTVYD+LME
Sbjct: 70  TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLME 129

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GYPS+EDREAIFQAYI AL EDPEQYRIDA+K EEWAR QTA+SLVEF S+EGEVE +LK
Sbjct: 130 GYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILK 189

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRN 187
           DIAERA  KGNFSYSRFFA+GLFRLLELANATEP++LEKLCA LN++K+ VDRDLDVYRN
Sbjct: 190 DIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRN 249

Query: 188 LLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEY 230
           LLSKL+QAKELLKEYVDREKKKR+ER   Q ANEAI KCLGEY
Sbjct: 250 LLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEY 292


>gi|356542877|ref|XP_003539891.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
           [Glycine max]
          Length = 291

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 189/220 (85%), Positives = 209/220 (95%)

Query: 6   PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
           PPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKR+Y+YD VFALGFVTVY++L
Sbjct: 65  PPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRSYRYDAVFALGFVTVYEQL 124

Query: 66  MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 125
           MEGYPS+EDR+AIFQAYI ALKEDPEQYR+DA+KLEEWAR Q  +SL+EF S+EGEVEG+
Sbjct: 125 MEGYPSDEDRDAIFQAYIQALKEDPEQYRVDAKKLEEWARSQNPNSLLEFSSREGEVEGI 184

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVY 185
           LKDIAERA GKG+FSYSRFFA+GLFRLLELANA EPT+LEKLCAVLNVNKRSVDRDLDVY
Sbjct: 185 LKDIAERAGGKGDFSYSRFFAIGLFRLLELANAMEPTILEKLCAVLNVNKRSVDRDLDVY 244

Query: 186 RNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKK 225
           RNLLSKL+QAKELLKEYVDREKKKREER EPQK+NEAI +
Sbjct: 245 RNLLSKLVQAKELLKEYVDREKKKREERAEPQKSNEAITQ 284


>gi|224124656|ref|XP_002319386.1| predicted protein [Populus trichocarpa]
 gi|222857762|gb|EEE95309.1| predicted protein [Populus trichocarpa]
          Length = 298

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 191/227 (84%), Positives = 209/227 (92%), Gaps = 1/227 (0%)

Query: 3   SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
           +DVPPTV+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY YDPVF LG VTVY
Sbjct: 67  TDVPPTVSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYLYDPVFGLGLVTVY 126

Query: 63  DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
           D+LMEGYPS+EDREAIFQAYI ALKEDPEQYRIDA+KLEEWAR QT SSLV+F SKEGE+
Sbjct: 127 DQLMEGYPSDEDREAIFQAYIKALKEDPEQYRIDAKKLEEWARAQTHSSLVDFSSKEGEI 186

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
           EG+LK IAERA+  GNFSYSRFFAVGLFRLLEL+NA+EPTVLEKLC+ LN+NKRSVDRDL
Sbjct: 187 EGILKGIAERAAS-GNFSYSRFFAVGLFRLLELSNASEPTVLEKLCSALNINKRSVDRDL 245

Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGE 229
           DVYR LLSKL+QAKELLKEYVDREKKK+EER E QKANE + KCLG+
Sbjct: 246 DVYRGLLSKLVQAKELLKEYVDREKKKQEERAESQKANEMVAKCLGD 292


>gi|356517586|ref|XP_003527468.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
           [Glycine max]
          Length = 291

 Score =  399 bits (1026), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 188/220 (85%), Positives = 209/220 (95%)

Query: 6   PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
           PPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKR+Y+YD VFALGFVTVY++L
Sbjct: 65  PPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRSYRYDAVFALGFVTVYEQL 124

Query: 66  MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 125
           MEGYPS+EDR+AIFQAYI ALKEDPEQYR+DA+KLEEWAR Q  +SLV+F S+EGEVEG+
Sbjct: 125 MEGYPSDEDRDAIFQAYIQALKEDPEQYRVDAKKLEEWARAQNPTSLVDFSSREGEVEGI 184

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVY 185
           LKDIAERA GKG+FSYSRFFA+GLFRLLELANA EPT+LEKLCAVLNV+KRSVDRDLDVY
Sbjct: 185 LKDIAERAGGKGDFSYSRFFAIGLFRLLELANAMEPTILEKLCAVLNVDKRSVDRDLDVY 244

Query: 186 RNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKK 225
           RNLLSKL+QAKELLKEYVDREKKKREER EPQK+NEAI +
Sbjct: 245 RNLLSKLVQAKELLKEYVDREKKKREERAEPQKSNEAITQ 284


>gi|388496070|gb|AFK36101.1| unknown [Lotus japonicus]
          Length = 298

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 189/232 (81%), Positives = 212/232 (91%), Gaps = 1/232 (0%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           +SD PP V+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMR+KR+Y+YDPVFALGFVTV
Sbjct: 66  VSD-PPPVSETKLNFLKEYKRPIPSIYNTVLQELIVQQHLMRFKRSYRYDPVFALGFVTV 124

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
           Y++LMEGYPS+EDR+AIFQ YI ALKEDP QYR DAQKLEEWAR Q+++SL+EF S+EGE
Sbjct: 125 YEQLMEGYPSDEDRDAIFQTYIKALKEDPGQYREDAQKLEEWARTQSSTSLIEFSSREGE 184

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
           VEG LKDIAERA GKG+FSYSRFFA+GLFRLLEL NA EP +LEKLCA LNV+KRSVDRD
Sbjct: 185 VEGALKDIAERAGGKGDFSYSRFFAIGLFRLLELGNAMEPAILEKLCAALNVDKRSVDRD 244

Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
           LDVYRNLLSKL+QAKELLKEY DREKKK+EER EPQKANEAI KCLG+  +S
Sbjct: 245 LDVYRNLLSKLVQAKELLKEYADREKKKQEERAEPQKANEAITKCLGQEQFS 296


>gi|224146717|ref|XP_002326111.1| predicted protein [Populus trichocarpa]
 gi|222862986|gb|EEF00493.1| predicted protein [Populus trichocarpa]
          Length = 296

 Score =  395 bits (1015), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 187/227 (82%), Positives = 211/227 (92%), Gaps = 1/227 (0%)

Query: 3   SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
           +DVPPTVA+TK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YK+T++YDPVF LGFVTVY
Sbjct: 65  TDVPPTVADTKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKKTFRYDPVFGLGFVTVY 124

Query: 63  DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
           D+LMEGYPS+EDREAIFQAYI AL+EDPEQYRIDA+KLEEWAR QT SSLV+F S+EGE+
Sbjct: 125 DQLMEGYPSDEDREAIFQAYIKALEEDPEQYRIDAKKLEEWARAQTPSSLVDFSSREGEI 184

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
           EG LKDIAER +  GNFSYSRFFAVGLFRLLEL+NA+EPTVLEKLC+ LN+NKRSVDRDL
Sbjct: 185 EGTLKDIAERVAS-GNFSYSRFFAVGLFRLLELSNASEPTVLEKLCSALNINKRSVDRDL 243

Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGE 229
           DVYR LLSKL+QA+ELLKEYVDREKKK+EER E QKA+E + KCLGE
Sbjct: 244 DVYRGLLSKLVQARELLKEYVDREKKKQEERAESQKASETVTKCLGE 290


>gi|242050546|ref|XP_002463017.1| hypothetical protein SORBIDRAFT_02g036270 [Sorghum bicolor]
 gi|241926394|gb|EER99538.1| hypothetical protein SORBIDRAFT_02g036270 [Sorghum bicolor]
          Length = 284

 Score =  390 bits (1003), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 188/230 (81%), Positives = 208/230 (90%), Gaps = 1/230 (0%)

Query: 4   DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
           DVPPTVAETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYDPVF LGFVTVYD
Sbjct: 53  DVPPTVAETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDPVFGLGFVTVYD 112

Query: 64  RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
           +LMEGYPS EDR++IF+AYITAL EDP QYR DA K+EEWAR Q ASSLV+F S++GE+E
Sbjct: 113 QLMEGYPSNEDRDSIFRAYITALNEDPTQYRADALKMEEWARSQNASSLVDFSSRDGEIE 172

Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
            +LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+KLC  LNV+KRSVDRDLD
Sbjct: 173 AILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKLCTALNVSKRSVDRDLD 232

Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
           VYRN+LSKL+QAKELLKEYVDREKKKREER+E  K NEA+ K  G  LYS
Sbjct: 233 VYRNILSKLVQAKELLKEYVDREKKKREERSETPKPNEAVTKFDGN-LYS 281


>gi|357122407|ref|XP_003562907.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
           [Brachypodium distachyon]
          Length = 286

 Score =  388 bits (997), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 183/226 (80%), Positives = 204/226 (90%)

Query: 3   SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
           +D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYDPVFALGFVTVY
Sbjct: 54  ADIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYDPVFALGFVTVY 113

Query: 63  DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
           D+LMEGYPS EDR+AIF++YITAL EDPEQYR DAQK+EEWAR Q  S LVEF S++GE+
Sbjct: 114 DQLMEGYPSNEDRDAIFKSYITALNEDPEQYRADAQKMEEWARAQNGSLLVEFSSRDGEI 173

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
           E +LKDI+ERA G GNFSYSRFFAVGLFRLLELANATEPTVL+KLCA LN+NKRSVDRDL
Sbjct: 174 EAVLKDISERAQGNGNFSYSRFFAVGLFRLLELANATEPTVLDKLCAALNINKRSVDRDL 233

Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLG 228
           D+YRNLLSKL+QAKELLKEY+DREKKKREER E  K NE + K  G
Sbjct: 234 DIYRNLLSKLVQAKELLKEYIDREKKKREERLETPKPNEPVAKFDG 279


>gi|297832696|ref|XP_002884230.1| hypothetical protein ARALYDRAFT_900469 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330070|gb|EFH60489.1| hypothetical protein ARALYDRAFT_900469 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 298

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 183/232 (78%), Positives = 213/232 (91%), Gaps = 1/232 (0%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           ++DVPP V+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPVFALGFVTV
Sbjct: 60  VTDVPP-VSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFVTV 118

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
           YD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV+F S++GE
Sbjct: 119 YDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSRQGE 178

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
           +E LLKDIA RA+ K  FSYSRFFAVGLFRLLELA+AT+PTVL+KLCA LN+NK+SVDRD
Sbjct: 179 IEALLKDIAGRAASKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSVDRD 238

Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
           LDVYRNLLSKL+QAKELL+EYV+REKKK+ ER E QKANE I KCLG+ LY+
Sbjct: 239 LDVYRNLLSKLVQAKELLREYVEREKKKQGERAESQKANETISKCLGDTLYN 290


>gi|397702097|gb|AFO59570.1| chloroplast Ptr ToxA-binding protein [Saccharum hybrid cultivar
           GT28]
          Length = 284

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 185/230 (80%), Positives = 207/230 (90%), Gaps = 1/230 (0%)

Query: 4   DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
           DVPPTV+ETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYDPVF LGFVTVYD
Sbjct: 53  DVPPTVSETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDPVFGLGFVTVYD 112

Query: 64  RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
           +LMEGYPS EDR++IF+ YITAL EDP+QYR DA K+EEWAR Q  SSLV+F S++GE+E
Sbjct: 113 QLMEGYPSNEDRDSIFRTYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSSRDGEIE 172

Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
            +LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+KLC  LNV+KRSVDRDLD
Sbjct: 173 AILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKLCTALNVSKRSVDRDLD 232

Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
           VYRN+LSKL+QAKELLKEYVDREKKKREER+E  K NEA+ K  G  LYS
Sbjct: 233 VYRNILSKLVQAKELLKEYVDREKKKREERSETPKPNEAVTKFDGN-LYS 281


>gi|293333399|ref|NP_001168867.1| uncharacterized protein LOC100382672 [Zea mays]
 gi|223973419|gb|ACN30897.1| unknown [Zea mays]
          Length = 284

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 187/231 (80%), Positives = 207/231 (89%), Gaps = 1/231 (0%)

Query: 3   SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
           SDVPPTV ETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYD VFALGFVTVY
Sbjct: 52  SDVPPTVGETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDAVFALGFVTVY 111

Query: 63  DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
           D+LMEGYPS EDR++IF+AYITAL EDP QYR DA K+E WAR Q  SSLV+F S++GE+
Sbjct: 112 DQLMEGYPSIEDRDSIFKAYITALNEDPNQYRADALKMEGWARSQNGSSLVDFSSRDGEI 171

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
           E +LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+KLCA LN+NKRSVDRDL
Sbjct: 172 ESILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKLCAALNINKRSVDRDL 231

Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
           DVYRN+LSKL+QAKELLKEYVDREKKKREER+E  K NEA+ K  G  LYS
Sbjct: 232 DVYRNILSKLVQAKELLKEYVDREKKKREERSETPKPNEAVTKFDGN-LYS 281


>gi|21592994|gb|AAM64943.1| unknown [Arabidopsis thaliana]
 gi|58761181|gb|AAW82331.1| chloroplast thylakoid formation 1 [Arabidopsis thaliana]
          Length = 300

 Score =  386 bits (991), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 182/233 (78%), Positives = 211/233 (90%), Gaps = 1/233 (0%)

Query: 1   MISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVT 60
           + +DVPP V+ETK  FLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPVFALGFVT
Sbjct: 60  VTADVPP-VSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFVT 118

Query: 61  VYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG 120
           VYD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV+F SKEG
Sbjct: 119 VYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSKEG 178

Query: 121 EVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
           ++E +LKDIA RA  K  FSYSRFFAVGLFRLLELA+AT+PTVL+KLCA LN+NK+SVDR
Sbjct: 179 DIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSVDR 238

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
           DLDVYRNLLSKL+QA ELLKEYV+REKKK+EER + QKANE I KCLG+ LY+
Sbjct: 239 DLDVYRNLLSKLVQANELLKEYVEREKKKQEERAQSQKANETISKCLGDTLYN 291


>gi|212720892|ref|NP_001131923.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
 gi|194692932|gb|ACF80550.1| unknown [Zea mays]
 gi|195644742|gb|ACG41839.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
 gi|414887096|tpg|DAA63110.1| TPA: chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
          Length = 284

 Score =  386 bits (991), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 184/230 (80%), Positives = 208/230 (90%), Gaps = 1/230 (0%)

Query: 4   DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
           DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFALGFVTVYD
Sbjct: 53  DVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFALGFVTVYD 112

Query: 64  RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
           +LMEGYPS EDR++IF+AYITAL EDP+QYR DA K+EEWAR Q  SSLV+F S++GE+E
Sbjct: 113 QLMEGYPSNEDRDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSSRDGEIE 172

Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
            +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPT+L+KLCA LNV+KRSVDRDLD
Sbjct: 173 AILKDISERAKGKGNFSYSRFFAVGLFRLLELSNATEPTILDKLCAALNVSKRSVDRDLD 232

Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
           VYRN+LSKL+QAKELLKEYVDREKKKREER+E  K NEA+ K  G  LYS
Sbjct: 233 VYRNILSKLVQAKELLKEYVDREKKKREERSEAPKPNEAVTKFDGN-LYS 281


>gi|18399513|ref|NP_565491.1| protein THYLAKOID FORMATION 1 [Arabidopsis thaliana]
 gi|75206547|sp|Q9SKT0.1|THF1_ARATH RecName: Full=Protein THYLAKOID FORMATION 1, chloroplastic; Flags:
           Precursor
 gi|4454459|gb|AAD20906.1| expressed protein [Arabidopsis thaliana]
 gi|17065446|gb|AAL32877.1| Unknown protein [Arabidopsis thaliana]
 gi|20148535|gb|AAM10158.1| unknown protein [Arabidopsis thaliana]
 gi|330251998|gb|AEC07092.1| protein THYLAKOID FORMATION 1 [Arabidopsis thaliana]
          Length = 300

 Score =  385 bits (988), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 182/233 (78%), Positives = 211/233 (90%), Gaps = 1/233 (0%)

Query: 1   MISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVT 60
           + +DVPP V+ETK  FLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPVFALGFVT
Sbjct: 60  VTADVPP-VSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFVT 118

Query: 61  VYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG 120
           VYD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV+F SKEG
Sbjct: 119 VYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSKEG 178

Query: 121 EVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
           ++E +LKDIA RA  K  FSYSRFFAVGLFRLLELA+AT+PTVL+KLCA LN+NK+SVDR
Sbjct: 179 DIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSVDR 238

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
           DLDVYRNLLSKL+QAKELLKEYV+REKKK+ ER + QKANE I KCLG+ LY+
Sbjct: 239 DLDVYRNLLSKLVQAKELLKEYVEREKKKQGERAQSQKANETISKCLGDTLYN 291


>gi|326493802|dbj|BAJ85363.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 286

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 180/226 (79%), Positives = 204/226 (90%)

Query: 3   SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
            D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYDPVFALGFVTVY
Sbjct: 54  GDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYDPVFALGFVTVY 113

Query: 63  DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
           D+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q  + LVEF S++GE+
Sbjct: 114 DQLMEGYPSNEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGNLLVEFSSRDGEI 173

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
           E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+KLCA LN+NK+SVDRDL
Sbjct: 174 ESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKLCAALNINKKSVDRDL 233

Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLG 228
           DVYRNLLSKL+QAKELLKEYV+REKKKR ER E  K NEA+ K  G
Sbjct: 234 DVYRNLLSKLVQAKELLKEYVEREKKKRAERLETPKPNEAVAKFDG 279


>gi|195653795|gb|ACG46365.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
          Length = 284

 Score =  381 bits (978), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 182/230 (79%), Positives = 207/230 (90%), Gaps = 1/230 (0%)

Query: 4   DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
           DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFALGFVTVYD
Sbjct: 53  DVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFALGFVTVYD 112

Query: 64  RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
           +LME YPS ED+++IF+AYITAL EDP+QYR DA K+EEWAR Q  SSLV+F S++GE+E
Sbjct: 113 QLMERYPSNEDKDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSSRDGEIE 172

Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
            +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPT+L+KLCA LNV+KRSVDRDLD
Sbjct: 173 AILKDISERAKGKGNFSYSRFFAVGLFRLLELSNATEPTILDKLCAALNVSKRSVDRDLD 232

Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
           VYRN+LSKL+QAKELLKEYVDREKKKREER+E  K NEA+ K  G  LYS
Sbjct: 233 VYRNILSKLVQAKELLKEYVDREKKKREERSEAPKPNEAVTKFDGN-LYS 281


>gi|75140959|sp|Q7XAB8.1|THF1_SOLTU RecName: Full=Protein THYLAKOID FORMATION1, chloroplastic; Flags:
           Precursor
 gi|33469614|gb|AAQ19850.1| light-regulated chloroplast-localized protein [Solanum tuberosum]
          Length = 293

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 179/227 (78%), Positives = 201/227 (88%), Gaps = 1/227 (0%)

Query: 7   PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
           PTVA+TK+ FL  YKRPIP++YNTVLQELIVQQHL RYK++YQYDPVFALGFVTVYD+LM
Sbjct: 66  PTVADTKLKFLTAYKRPIPTVYNTVLQELIVQQHLTRYKKSYQYDPVFALGFVTVYDQLM 125

Query: 67  EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 126
           EGYPSEEDR AIF+AYI ALKEDPEQYR DAQKLEEWAR Q A++LV+F SKEGE+E + 
Sbjct: 126 EGYPSEEDRNAIFKAYIEALKEDPEQYRADAQKLEEWARTQNANTLVDFSSKEGEIENIF 185

Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYR 186
           KDIA+RA  K  F YSR FAVGLFRLLELAN T+PT+LEKLCA LNVNK+SVDRDLDVYR
Sbjct: 186 KDIAQRAGTKDGFCYSRLFAVGLFRLLELANVTDPTILEKLCAALNVNKKSVDRDLDVYR 245

Query: 187 NLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
           NLLSKL+QAKELLKEYV+REKKKR ER E QKANE + KCLG+Y Y+
Sbjct: 246 NLLSKLVQAKELLKEYVEREKKKRGER-ETQKANETVTKCLGDYQYA 291


>gi|157142955|gb|ABV24460.1| chloroplast-localized protein [Nicotiana benthamiana]
          Length = 295

 Score =  377 bits (967), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 180/233 (77%), Positives = 209/233 (89%), Gaps = 2/233 (0%)

Query: 1   MISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVT 60
           M +D+P TVAETKMNFLK YKRPIP++YNTVLQELIVQQHL++YK++Y+YDPVFALGFVT
Sbjct: 63  MSTDLP-TVAETKMNFLKAYKRPIPTVYNTVLQELIVQQHLIKYKKSYRYDPVFALGFVT 121

Query: 61  VYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG 120
           VYD+LMEGYPSEEDR+AIF+AYI AL EDP QYR DAQK EEWAR Q A++LV+F S++G
Sbjct: 122 VYDQLMEGYPSEEDRDAIFKAYIEALNEDPVQYRADAQKFEEWARTQNANTLVDFSSRDG 181

Query: 121 EVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
           EVE +LKDIA+RA  K +F YSR FAVGLFRLLELAN T+PT+LEKLCA LN+NK+SVDR
Sbjct: 182 EVENILKDIAQRAGTKDSFCYSRLFAVGLFRLLELANVTDPTILEKLCASLNINKKSVDR 241

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
           DLDVYRNLLSKL+QAKELLKEYV+REKKKR ER E QKANEA+ KCLG+Y Y+
Sbjct: 242 DLDVYRNLLSKLVQAKELLKEYVEREKKKRGER-ESQKANEAVTKCLGDYQYA 293


>gi|125558787|gb|EAZ04323.1| hypothetical protein OsI_26464 [Oryza sativa Indica Group]
          Length = 287

 Score =  372 bits (956), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 186/225 (82%), Positives = 207/225 (92%)

Query: 4   DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
           DVPPTVAETKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYD VFALGFVTVYD
Sbjct: 56  DVPPTVAETKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKTTYQYDAVFALGFVTVYD 115

Query: 64  RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
           +LMEGYPS EDR+AIF+AYITAL EDPEQYR DAQK+EEWAR Q  +SLVEF SK+GE+E
Sbjct: 116 QLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGNSLVEFSSKDGEIE 175

Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
            +LKDI+ERA GKG+FSYSRFFAVGLFRLLELANATEPT+L+KLCA LN+NKRSVDRDLD
Sbjct: 176 AILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKLCAALNINKRSVDRDLD 235

Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLG 228
           VYRN+LSKL+QAKELLKEYV+REKKKREER+E  K+NEA+ K  G
Sbjct: 236 VYRNILSKLVQAKELLKEYVEREKKKREERSETPKSNEAVTKFDG 280


>gi|388506988|gb|AFK41560.1| unknown [Medicago truncatula]
          Length = 287

 Score =  372 bits (954), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 176/222 (79%), Positives = 201/222 (90%), Gaps = 1/222 (0%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           ++DVP +V+ETK+NFLK YKRPIPSIYN VLQELIVQ HLMRYK +YQYD VFALGFVTV
Sbjct: 65  VTDVP-SVSETKLNFLKAYKRPIPSIYNNVLQELIVQHHLMRYKTSYQYDSVFALGFVTV 123

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
           YD+LMEGY SEE+R+ IF+AYI ALKEDPEQYRIDA+KLE+WA+ Q + SLVEF S+EGE
Sbjct: 124 YDKLMEGYSSEEERDTIFKAYINALKEDPEQYRIDAKKLEDWAKAQNSISLVEFSSREGE 183

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
           VEG+LKDIA+RA  KG FSYSRFFAVGLFRLLELANATEPT+L+KLCA LN++KRSVDRD
Sbjct: 184 VEGVLKDIAKRAGEKGEFSYSRFFAVGLFRLLELANATEPTILDKLCAALNIDKRSVDRD 243

Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAI 223
           LDVYR LLSKL+QAKEL +E++DREKKKREER EPQKAN AI
Sbjct: 244 LDVYRMLLSKLVQAKELQREFIDREKKKREERVEPQKANGAI 285


>gi|115472755|ref|NP_001059976.1| Os07g0558500 [Oryza sativa Japonica Group]
 gi|75147522|sp|Q84PB7.1|THF1_ORYSJ RecName: Full=Protein THYLAKOID FORMATION1, chloroplastic; Flags:
           Precursor
 gi|29367385|gb|AAO72565.1| inositol phosphatase-like protein [Oryza sativa Japonica Group]
 gi|34394010|dbj|BAC84034.1| inositol phosphatase-like protein [Oryza sativa Japonica Group]
 gi|113611512|dbj|BAF21890.1| Os07g0558500 [Oryza sativa Japonica Group]
 gi|125600704|gb|EAZ40280.1| hypothetical protein OsJ_24722 [Oryza sativa Japonica Group]
 gi|215694285|dbj|BAG89278.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 287

 Score =  368 bits (944), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 185/225 (82%), Positives = 206/225 (91%)

Query: 4   DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
           DVPPTVAETKMNFLK YKRPI SIY+TVLQEL+VQQHLMRYK TYQYD VFALGFVTVYD
Sbjct: 56  DVPPTVAETKMNFLKSYKRPILSIYSTVLQELLVQQHLMRYKTTYQYDAVFALGFVTVYD 115

Query: 64  RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
           +LMEGYPS EDR+AIF+AYITAL EDPEQYR DAQK+EEWAR Q  +SLVEF SK+GE+E
Sbjct: 116 QLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGNSLVEFSSKDGEIE 175

Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
            +LKDI+ERA GKG+FSYSRFFAVGLFRLLELANATEPT+L+KLCA LN+NKRSVDRDLD
Sbjct: 176 AILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKLCAALNINKRSVDRDLD 235

Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLG 228
           VYRN+LSKL+QAKELLKEYV+REKKKREER+E  K+NEA+ K  G
Sbjct: 236 VYRNILSKLVQAKELLKEYVEREKKKREERSETPKSNEAVTKFDG 280


>gi|217073200|gb|ACJ84959.1| unknown [Medicago truncatula]
          Length = 287

 Score =  368 bits (944), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 175/222 (78%), Positives = 200/222 (90%), Gaps = 1/222 (0%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           ++DVP +V+ETK+NFLK YKRPIPSIYN VLQELIVQ HLMRYK +YQYD VFALGFVTV
Sbjct: 65  VTDVP-SVSETKLNFLKAYKRPIPSIYNNVLQELIVQHHLMRYKTSYQYDSVFALGFVTV 123

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
           YD+LMEGY SEE+R+ IF+AYI ALKEDPEQYRIDA+KLE+WA+ Q + SLVEF S+E E
Sbjct: 124 YDKLMEGYSSEEERDTIFKAYINALKEDPEQYRIDAKKLEDWAKAQNSISLVEFSSRERE 183

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
           VEG+LKDIA+RA  KG FSYSRFFAVGLFRLLELANATEPT+L+KLCA LN++KRSVDRD
Sbjct: 184 VEGVLKDIAKRAGEKGEFSYSRFFAVGLFRLLELANATEPTILDKLCAALNIDKRSVDRD 243

Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAI 223
           LDVYR LLSKL+QAKEL +E++DREKKKREER EPQKAN AI
Sbjct: 244 LDVYRMLLSKLVQAKELQREFIDREKKKREERVEPQKANGAI 285


>gi|52548246|gb|AAU82110.1| chloroplast inositol phosphatase-like protein [Triticum aestivum]
          Length = 286

 Score =  365 bits (936), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 180/227 (79%), Positives = 204/227 (89%)

Query: 3   SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
            D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYDPVFALGFVTVY
Sbjct: 54  GDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYDPVFALGFVTVY 113

Query: 63  DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
           D+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q  + LVEF S++GE+
Sbjct: 114 DQLMEGYPSTEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGNLLVEFSSRDGEI 173

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
           E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+KLCA LN+NK+SVDRDL
Sbjct: 174 ESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKLCAALNINKKSVDRDL 233

Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGE 229
           DVYRNLLSKL+QAKELLKEY+ REKKKREER E  K NEA+ K  G 
Sbjct: 234 DVYRNLLSKLVQAKELLKEYIKREKKKREERLETPKPNEAVAKFDGS 280


>gi|38570261|gb|AAR24582.1| chloroplast-localized Ptr ToxA-binding protein1 [Triticum aestivum]
 gi|81239115|gb|ABB60085.1| chloroplast-localized Ptr ToxA-binding protein1 [Triticum aestivum]
          Length = 286

 Score =  365 bits (936), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 180/227 (79%), Positives = 205/227 (90%)

Query: 3   SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
            D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYDPVFALGFVTVY
Sbjct: 54  GDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYDPVFALGFVTVY 113

Query: 63  DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
           D+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q  + LVEF S++GE+
Sbjct: 114 DQLMEGYPSTEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGNLLVEFSSRDGEI 173

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
           E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+KLCA LN+NK+SVDRDL
Sbjct: 174 ESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKLCAALNINKKSVDRDL 233

Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGE 229
           DVYRNLLSKL+QAKELLKEY++REKKKREER E  K NEA+ K  G 
Sbjct: 234 DVYRNLLSKLVQAKELLKEYIEREKKKREERLETPKPNEAVAKFDGS 280


>gi|157849728|gb|ABV89647.1| chloroplast light-regulated protein [Brassica rapa]
          Length = 273

 Score =  358 bits (919), Expect = 9e-97,   Method: Compositional matrix adjust.
 Identities = 171/213 (80%), Positives = 195/213 (91%), Gaps = 1/213 (0%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           ++DVPP V+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTY+YDPVFALGFVTV
Sbjct: 62  VTDVPP-VSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTV 120

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
           YD+LM+GYPS++DR++IFQAY+ AL E P+QYRIDAQK+EEWAR QT++SLV+F  KEGE
Sbjct: 121 YDQLMDGYPSDQDRDSIFQAYVEALNEVPKQYRIDAQKMEEWARSQTSASLVDFSFKEGE 180

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
           VE +LKDI+ERA  K  FSYSRFFAVGLFRLLELA AT+PTVL+KLCA LN+NK+SVDRD
Sbjct: 181 VEAILKDISERAGSKEGFSYSRFFAVGLFRLLELAGATDPTVLDKLCASLNINKKSVDRD 240

Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERT 214
           LDVYRNLLSKL+QAKELLKEYV+REKKKR ER 
Sbjct: 241 LDVYRNLLSKLVQAKELLKEYVEREKKKRGERA 273


>gi|116782547|gb|ABK22548.1| unknown [Picea sitchensis]
          Length = 304

 Score =  352 bits (904), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 172/229 (75%), Positives = 196/229 (85%), Gaps = 1/229 (0%)

Query: 3   SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
           SD+P TVAETK  FLK YKRPIPSIYN V+QELIVQQHLMRYKRTYQYD VFALGFV+VY
Sbjct: 77  SDIP-TVAETKSAFLKAYKRPIPSIYNNVIQELIVQQHLMRYKRTYQYDAVFALGFVSVY 135

Query: 63  DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
           D+LM+GYPS+ D EAIF+AYI ALKEDPEQYR DA+KLEEWA  Q A S+VEF S++GEV
Sbjct: 136 DQLMDGYPSDGDSEAIFRAYINALKEDPEQYRSDAKKLEEWASSQDAKSIVEFQSRDGEV 195

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
           EG+LKDIAERA  K  FSYSRFFA+GLFRLLE ANAT+P VLEKLC  LN++K SVDRDL
Sbjct: 196 EGILKDIAERAREKKIFSYSRFFAIGLFRLLERANATDPVVLEKLCGALNISKPSVDRDL 255

Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYL 231
           D+YRN+LSKL+Q+KELLKEYV+REKKKR ER   QK++EA+ K    YL
Sbjct: 256 DIYRNILSKLVQSKELLKEYVEREKKKRTERESNQKSSEAVAKIESTYL 304


>gi|302807588|ref|XP_002985488.1| hypothetical protein SELMODRAFT_122474 [Selaginella moellendorffii]
 gi|302810785|ref|XP_002987083.1| hypothetical protein SELMODRAFT_125247 [Selaginella moellendorffii]
 gi|300145248|gb|EFJ11926.1| hypothetical protein SELMODRAFT_125247 [Selaginella moellendorffii]
 gi|300146694|gb|EFJ13362.1| hypothetical protein SELMODRAFT_122474 [Selaginella moellendorffii]
          Length = 206

 Score =  310 bits (793), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 146/199 (73%), Positives = 172/199 (86%)

Query: 7   PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
           PTVA+TK  FLK +++PIPSIYN VLQEL+VQQHLMRY  TY+YD VFALGFVTVYD+LM
Sbjct: 3   PTVADTKSAFLKAFRKPIPSIYNNVLQELLVQQHLMRYNATYKYDAVFALGFVTVYDQLM 62

Query: 67  EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 126
           +GYP+ +D EAIF+AYI AL EDP+QYR DA+KLEEWA  QTASSL  F S +G+VE +L
Sbjct: 63  DGYPNAQDSEAIFKAYIEALGEDPDQYRKDAKKLEEWASSQTASSLASFNSGDGDVEEVL 122

Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYR 186
           KDIA+RA+GK +F YSRFFAVGLFRL+E ANA++P VLEKLC  LNV+K SVDRDLDVYR
Sbjct: 123 KDIAQRAAGKTSFHYSRFFAVGLFRLVERANASDPAVLEKLCNALNVSKMSVDRDLDVYR 182

Query: 187 NLLSKLLQAKELLKEYVDR 205
           NLL+KL QAK+LLKEY+DR
Sbjct: 183 NLLTKLSQAKDLLKEYIDR 201


>gi|168043272|ref|XP_001774109.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674516|gb|EDQ61023.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 215

 Score =  308 bits (789), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 144/210 (68%), Positives = 175/210 (83%)

Query: 1   MISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVT 60
           M+    PTVA+TK++F+K Y++PIPSIY+ V+QEL+VQQHLMRY  TY YDP+FALGFVT
Sbjct: 1   MVRADVPTVADTKLSFIKSYRKPIPSIYSNVIQELLVQQHLMRYNSTYVYDPIFALGFVT 60

Query: 61  VYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG 120
           VYD+LM+GYP++EDR+AIF+AYI+AL EDPEQYR D++KLEEWA  Q+ S + +F  K+G
Sbjct: 61  VYDQLMDGYPNDEDRDAIFKAYISALNEDPEQYRKDSKKLEEWAAAQSGSGIADFAGKDG 120

Query: 121 EVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
           EVE  LKDIAERA+GK  F YSRFFA+GLFRLLE A A++P VLE L   LNV+KRSVDR
Sbjct: 121 EVEAALKDIAERAAGKEKFHYSRFFAIGLFRLLECAKASDPAVLETLSKALNVSKRSVDR 180

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKR 210
           DLDVYRNLLSKL Q KEL+KEYVDR   +R
Sbjct: 181 DLDVYRNLLSKLAQGKELIKEYVDRWVIRR 210


>gi|168037112|ref|XP_001771049.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677737|gb|EDQ64204.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 205

 Score =  295 bits (754), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 137/199 (68%), Positives = 162/199 (81%)

Query: 7   PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
           PTV+ETK +F+K Y++PIPSIY+ V+QEL+VQQHLMRY  TY YDP+FALGFVTVYD+LM
Sbjct: 7   PTVSETKASFIKSYRKPIPSIYSNVIQELLVQQHLMRYNSTYTYDPIFALGFVTVYDQLM 66

Query: 67  EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 126
           +GYP   DR++IF AYI AL EDP +YR DA+KLEEWA  Q+AS + +F S++GEVE  L
Sbjct: 67  DGYPDATDRDSIFTAYINALNEDPVKYREDAKKLEEWASAQSASGITDFTSRDGEVEATL 126

Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYR 186
           K IAERA  K  F YSRFFA+GLFRLLE A A++P VLE L   LNVNKRSVDRDLDVYR
Sbjct: 127 KSIAERAGSKDKFHYSRFFAIGLFRLLECAKASDPAVLESLSKALNVNKRSVDRDLDVYR 186

Query: 187 NLLSKLLQAKELLKEYVDR 205
           NLLSKL Q KEL+KEY +R
Sbjct: 187 NLLSKLAQGKELIKEYNER 205


>gi|414887097|tpg|DAA63111.1| TPA: hypothetical protein ZEAMMB73_220735 [Zea mays]
          Length = 207

 Score =  268 bits (684), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 122/154 (79%), Positives = 139/154 (90%)

Query: 3   SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
            DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFALGFVTVY
Sbjct: 52  GDVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFALGFVTVY 111

Query: 63  DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
           D+LMEGYPS EDR++IF+AYITAL EDP+QYR DA K+EEWAR Q  SSLV+F S++GE+
Sbjct: 112 DQLMEGYPSNEDRDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSSRDGEI 171

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA 156
           E +LKDI+ERA GKGNFSYSRFFAVGLFRLL+ A
Sbjct: 172 EAILKDISERAKGKGNFSYSRFFAVGLFRLLDFA 205


>gi|217072610|gb|ACJ84665.1| unknown [Medicago truncatula]
 gi|388509564|gb|AFK42848.1| unknown [Medicago truncatula]
          Length = 219

 Score =  266 bits (681), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 123/153 (80%), Positives = 138/153 (90%), Gaps = 1/153 (0%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           +SD PPTV+ETK+NFLK YKRPIPSIYN+VLQELIVQQHLMRYK++Y+YDPVFALGFVTV
Sbjct: 68  VSD-PPTVSETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKKSYRYDPVFALGFVTV 126

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
           YD+LMEGYPS+EDR+AIFQAYI ALKEDP QYR+DAQKLEEWAR Q A+SL+EF S+E E
Sbjct: 127 YDQLMEGYPSDEDRDAIFQAYINALKEDPAQYRVDAQKLEEWARAQNATSLIEFSSRERE 186

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLE 154
           VEG LKDIAERA G G+FSYSRFFAVG F  L 
Sbjct: 187 VEGTLKDIAERAGGNGDFSYSRFFAVGFFDFLS 219


>gi|384250113|gb|EIE23593.1| photosystem II biogenesis protein Psp29 [Coccomyxa subellipsoidea
           C-169]
          Length = 290

 Score =  214 bits (545), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 110/215 (51%), Positives = 152/215 (70%), Gaps = 4/215 (1%)

Query: 6   PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
           PPTVAETK NF + + RPIP IY+ V+QEL+VQ H+MRY ++Y YD VF LGFV+V+D++
Sbjct: 65  PPTVAETKRNFYEAFSRPIPGIYSNVIQELLVQHHIMRYNKSYSYDEVFGLGFVSVFDQV 124

Query: 66  MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG-EVEG 124
           +EG P E D+ A+F AYI +L E+ +QYR DA+K+E  A+  +  + ++ P  EG E++ 
Sbjct: 125 LEGLP-EGDKGALFSAYIGSLGENGDQYRQDAEKVEALAKELSGPAELK-PDAEGSELQK 182

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDV 184
            L  IAER+S +GNF Y++FFA+GLFRLLEL  A +P  LE L + + + + SV RDL  
Sbjct: 183 KLASIAERSS-QGNFLYTKFFAIGLFRLLELTGAKDPKALEGLVSAMKIPQESVSRDLMT 241

Query: 185 YRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
           Y+ +LSKL  AK+L+ E   REKKK  ER   +KA
Sbjct: 242 YKGVLSKLSAAKDLMNEMYAREKKKAAEREAEKKA 276


>gi|159471025|ref|XP_001693657.1| inositol phosphatase-like protein [Chlamydomonas reinhardtii]
 gi|158283160|gb|EDP08911.1| inositol phosphatase-like protein [Chlamydomonas reinhardtii]
          Length = 266

 Score =  208 bits (530), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 111/208 (53%), Positives = 148/208 (71%), Gaps = 3/208 (1%)

Query: 6   PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
           PPTVAETK  FL  Y +PI SIY+TVLQEL+VQQH MRY + YQY+P+FALGFV+VY+++
Sbjct: 42  PPTVAETKAKFLSGYNKPIASIYSTVLQELLVQQHFMRYSKNYQYNPIFALGFVSVYEQI 101

Query: 66  MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 125
           +E   S E+R AIF+AY+ AL ED ++Y+ DA  LE+ A G T  SL   P+ +G     
Sbjct: 102 LESL-SAEERGAIFKAYVDALGEDADKYKRDASALEQAANGLTPESLT--PNADGNEVQK 158

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVY 185
                  AS  G FSY++F A+GLFRLLEL+ A EP+ LEKL   + V   +V+RDL +Y
Sbjct: 159 ALASISSASAAGAFSYNKFVAIGLFRLLELSGAKEPSALEKLVKAVGVKPEAVNRDLLMY 218

Query: 186 RNLLSKLLQAKELLKEYVDREKKKREER 213
           + +LSKL  AKEL++E+V+REK+K+ ER
Sbjct: 219 KGVLSKLAAAKELMREFVEREKRKQAER 246


>gi|326492686|dbj|BAJ90199.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 239

 Score =  205 bits (521), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 91/112 (81%), Positives = 103/112 (91%)

Query: 4   DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
           D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYDPVFALGFVTVYD
Sbjct: 55  DIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYDPVFALGFVTVYD 114

Query: 64  RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 115
           +LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q  + LVEF
Sbjct: 115 QLMEGYPSNEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGNLLVEF 166


>gi|356555139|ref|XP_003545894.1| PREDICTED: LOW QUALITY PROTEIN: protein THYLAKOID FORMATION1,
           chloroplastic-like [Glycine max]
          Length = 152

 Score =  198 bits (504), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 99/134 (73%), Positives = 112/134 (83%)

Query: 73  EDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAER 132
           E R+AIFQAYI AL EDP++YRIDA+KLEEWA  Q  +SLVEF SKEGE E  LKDIA R
Sbjct: 19  EGRDAIFQAYIKALVEDPDKYRIDARKLEEWAGVQNPTSLVEFSSKEGEAEKXLKDIAXR 78

Query: 133 ASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKL 192
           A GK  FSYSRFFAVGLFRL+EL NATEP +L+KLCA LN+NKRSVD DLDVY  LLS+L
Sbjct: 79  AGGKXEFSYSRFFAVGLFRLVELENATEPIILDKLCAALNINKRSVDWDLDVYCILLSEL 138

Query: 193 LQAKELLKEYVDRE 206
           LQ KELLKEY+D++
Sbjct: 139 LQVKELLKEYIDKD 152


>gi|307108772|gb|EFN57011.1| hypothetical protein CHLNCDRAFT_143677 [Chlorella variabilis]
          Length = 273

 Score =  183 bits (464), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 92/203 (45%), Positives = 135/203 (66%), Gaps = 7/203 (3%)

Query: 5   VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 64
            PPTVA+ K+ F   +K+P+P+IY+TV+QEL+VQQHL R+ + YQY+ V ALG V+++++
Sbjct: 49  APPTVADAKLKFNGAFKKPLPAIYSTVVQELLVQQHLFRWNKQYQYNEVTALGIVSIFEQ 108

Query: 65  LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 124
           ++ G P  E REA+F A+I AL+EDP+QYR DA  +EE ARG++  +    P   G+   
Sbjct: 109 VLGGLPDAE-REAVFDAFINALQEDPKQYRKDAAAMEELARGKSEVA----PDASGDKVQ 163

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDV 184
                     GK  F Y++FFAVGLFRL+EL  + +P  L  L   L +++  V+ DL  
Sbjct: 164 QALAAVAAKEGK--FLYTKFFAVGLFRLVELTGSKDPKSLTTLVKALGLSQERVNADLMT 221

Query: 185 YRNLLSKLLQAKELLKEYVDREK 207
           Y+ +LSKL  AKE++KE++ REK
Sbjct: 222 YKGVLSKLEAAKEIMKEFMAREK 244


>gi|302852549|ref|XP_002957794.1| hypothetical protein VOLCADRAFT_107813 [Volvox carteri f.
           nagariensis]
 gi|300256865|gb|EFJ41122.1| hypothetical protein VOLCADRAFT_107813 [Volvox carteri f.
           nagariensis]
          Length = 373

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 99/243 (40%), Positives = 147/243 (60%), Gaps = 43/243 (17%)

Query: 6   PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
           PPTVAETK  F + Y +PI SIY+TVLQEL+VQQH MRY + Y Y+ +FALGFV+VY+++
Sbjct: 43  PPTVAETKAKFFEGYSKPIASIYSTVLQELLVQQHFMRYSKDYVYNEIFALGFVSVYEQI 102

Query: 66  MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG----------------QTA 109
           +E  P  E R+AIF +Y+ AL EDPE Y+ D++++E+ A                  QT+
Sbjct: 103 LESLPQSE-RDAIFVSYVKALGEDPEAYKRDSERVEKAAGALSGPDALVPDAEGSDVQTS 161

Query: 110 SSLVEFPSKEGEVEGLLKDIAERASGKGN-----------------------FSYSRFFA 146
           + +  +  + GE+    +    R  G+G+                       FSY++F A
Sbjct: 162 AYIWAYHQRRGEMRMPWRT---RTWGQGSSSLGVCSYGKALDAIKAASAADAFSYNKFVA 218

Query: 147 VGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDRE 206
           +GLFRLLEL  A EP  LE+L   + +   +V+RDL +Y+ +LSKL  AKE+++E+V+RE
Sbjct: 219 IGLFRLLELTGAKEPAALERLVKSVGIKPEAVNRDLLMYKGVLSKLAAAKEMMREFVERE 278

Query: 207 KKK 209
           K++
Sbjct: 279 KRR 281


>gi|255075137|ref|XP_002501243.1| predicted protein [Micromonas sp. RCC299]
 gi|226516507|gb|ACO62501.1| predicted protein [Micromonas sp. RCC299]
          Length = 260

 Score =  177 bits (450), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 97/220 (44%), Positives = 142/220 (64%), Gaps = 17/220 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T+A+TK  F++ Y  PIPSI++  + EL+  QH +RY   Y Y  + +LGFV+VYD+L E
Sbjct: 51  TLADTKRKFVESYPYPIPSIWSVAVNELLANQHFVRYSTRYSYSKLSSLGFVSVYDQLFE 110

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           G+PS+E++  IF  ++ AL EDPE+ R DA +L ++A+            + G V+ LL 
Sbjct: 111 GFPSDEEKAKIFDCFVEALGEDPEKCRKDAAELAKFAK------------EAGGVDALLA 158

Query: 128 D--IAE-RASGKGN-FSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
              +AE +++G+ N F+YSR+ A+GLFR+LEL  ATEP  LEKL     +  + V+ DL 
Sbjct: 159 SPVLAEIKSNGEANKFAYSRYDAIGLFRMLELGGATEPAALEKLADAAGLKLKKVNGDLG 218

Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREER-TEPQKANEA 222
           +Y+ LLSKL  AKEL KE  +REK+K  ER  + + AN+A
Sbjct: 219 MYKGLLSKLAAAKELQKEIFEREKRKTAERLAKKEAANDA 258


>gi|428213026|ref|YP_007086170.1| photosystem II biogenesis protein Psp29 [Oscillatoria acuminata PCC
           6304]
 gi|428001407|gb|AFY82250.1| photosystem II biogenesis protein Psp29 [Oscillatoria acuminata PCC
           6304]
          Length = 235

 Score =  171 bits (433), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 90/225 (40%), Positives = 136/225 (60%), Gaps = 17/225 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  ++ RPI SIY  V++EL+V+ HL+     + YDP++ALG VT +DR M+
Sbjct: 6   TVSDTKRAFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDFNYDPIYALGVVTTFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF------PSKEGE 121
           GY  EED+ +IF      L+ DP++YR DAQ LEE A   +   +V        P  EG+
Sbjct: 66  GYRPEEDKISIFNGICKGLEADPQKYRQDAQWLEEIASRHSGEEMVALLSRSAGPEMEGD 125

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVN 174
            +G+L  IA     K NF YSR FAVGLF LLE A+        +    ++K+C  LN+ 
Sbjct: 126 FQGILGAIA----AKPNFKYSRLFAVGLFTLLEQADLELVKNEKSRQEAVQKICTALNLP 181

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
              + +DLD+YR  L K++QA+ ++++ +  ++KKRE+R + + A
Sbjct: 182 VDKLSKDLDLYRTNLEKMIQARSVMEDILAADRKKREDRAKQKGA 226


>gi|427736065|ref|YP_007055609.1| photosystem II biogenesis protein Psp29 [Rivularia sp. PCC 7116]
 gi|427371106|gb|AFY55062.1| photosystem II biogenesis protein Psp29 [Rivularia sp. PCC 7116]
          Length = 233

 Score =  169 bits (429), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 88/223 (39%), Positives = 145/223 (65%), Gaps = 22/223 (9%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+ETK  F  L+ RPI +IY  V++EL+V+ HL+     ++YDP++ALG VT +DR M+
Sbjct: 6   TVSETKRTFYSLHTRPINTIYRRVVEELMVEMHLLGVNADFKYDPIYALGVVTTFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK-----EGEV 122
           GY  EED+E+I+ A I +++EDP++YR DA++LE+ A+  T   LV   S+     + E+
Sbjct: 66  GYNPEEDKESIYNALIKSVEEDPQKYRHDAKRLEDLAKSTTGKDLVSDLSQRRLANDSEL 125

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTV----------LEKLCAVLN 172
           +GLL+ IA  +S    F YSR FA+GL+ LLE   +++P +          L+ + A LN
Sbjct: 126 QGLLEGIANNSS----FKYSRLFAIGLYTLLE---SSDPEMVKDEKLRNEALKTIAAGLN 178

Query: 173 VNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
           +++  + +DLD+YR+ L K+ QA  ++ + +  ++K+RE+R +
Sbjct: 179 LSEDKLSKDLDLYRSNLDKMAQAAIVMADMIAADRKRREQRAQ 221


>gi|145344894|ref|XP_001416959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577185|gb|ABO95252.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 203

 Score =  165 bits (418), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 87/210 (41%), Positives = 131/210 (62%), Gaps = 16/210 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  FL+ Y  PIPS+++TV QEL+VQ H  +Y    +Y  + +LGFV+V+D+L E
Sbjct: 1   TVSDTKAKFLQAYPYPIPSVWSTVTQELLVQGHFAKYNAKSEYSELASLGFVSVFDQLYE 60

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           G+PSE ++  IF A++ AL ED  + R DA+            +L  F +  G V+GL  
Sbjct: 61  GFPSETEKVKIFNAFLGALGEDAAKTRADAE------------ALGAFAASAGGVDGLSA 108

Query: 128 D--IAERA--SGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
           +   A  A  S +    Y+++ A+G+FR+LELA AT+P  LE L     ++ + V+ DL 
Sbjct: 109 NPIFATMAAKSAENKLMYTKYIAIGIFRMLELAKATDPKALEALAQAGGLSFKKVNGDLA 168

Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREER 213
           +Y+ LLSKL  AKEL +E+++REK+K  ER
Sbjct: 169 MYKGLLSKLASAKELQEEFLEREKRKTAER 198


>gi|428223566|ref|YP_007107663.1| photosystem II biogenesis protein Psp29 [Geitlerinema sp. PCC 7407]
 gi|427983467|gb|AFY64611.1| photosystem II biogenesis protein Psp29 [Geitlerinema sp. PCC 7407]
          Length = 239

 Score =  160 bits (406), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 87/224 (38%), Positives = 140/224 (62%), Gaps = 11/224 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  ++ RPI SIY  V++EL+V+ HL+     ++YDP +ALG VT Y+R M+
Sbjct: 6   TVSDTKRAFYSMHTRPINSIYRRVVEELMVEMHLLSVNVDFRYDPFYALGVVTSYERFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE---GEVEG 124
           GY  E+D+ +IF++   A + DP  YR DA++L E+ +  +A  L+ + S E   G+ +G
Sbjct: 66  GYRPEQDKTSIFESLCRANEGDPGHYRHDAERLAEFTKNLSAEELISWLSLETPRGDDQG 125

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRS 177
           L + + +  +    F YSR FA+GLF L+E AN       A      EK+ A L++    
Sbjct: 126 LGESL-QAIANHSQFKYSRLFAIGLFTLVEQANPDLVKDEAQRTATFEKVVAALHLPADK 184

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANE 221
           + +DL++YR+ L KL QA+ ++++ +  ++KKREER + QKA+E
Sbjct: 185 LQKDLELYRSNLEKLTQARIVMEDILKADRKKREEREQAQKASE 228


>gi|303286071|ref|XP_003062325.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455842|gb|EEH53144.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 222

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 80/171 (46%), Positives = 109/171 (63%), Gaps = 8/171 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TVA+TK  FLK Y  PIPSI++  LQEL+V QH +RY + Y Y  + +LGFV+VYD+L E
Sbjct: 45  TVADTKQKFLKSYPYPIPSIWSVALQELLVTQHFVRYSKKYSYSKLSSLGFVSVYDQLFE 104

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           G+PSEE++  IF+ ++ AL+EDP   R DA +L  +A G +    V       +++ L+ 
Sbjct: 105 GFPSEEEKNTIFECFVKALEEDPATVRKDAAELASFAEGASGVDGVLASPIFAQMKSLVA 164

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSV 178
           D        G F+YSR+ A+GLFRLLELA ATEP  LEKL     +  RS+
Sbjct: 165 D--------GKFAYSRYDAIGLFRLLELAKATEPAALEKLAESSGLQARSI 207


>gi|158338004|ref|YP_001519180.1| Thf1-like protein [Acaryochloris marina MBIC11017]
 gi|189030267|sp|B0C3M8.1|THF1_ACAM1 RecName: Full=Protein thf1
 gi|158308245|gb|ABW29862.1| photosystem II biogenesis protein Psb29 [Acaryochloris marina
           MBIC11017]
          Length = 247

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 80/216 (37%), Positives = 135/216 (62%), Gaps = 16/216 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  ++ RP+ S+Y  V++EL+V+ HL+R    ++YDP+FALG  T +DR M+
Sbjct: 6   TVSDTKRAFYSIHTRPVNSVYRRVVEELMVEMHLLRVNEDFRYDPIFALGVTTSFDRFMD 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG-----EV 122
           GY  E D++AIF A   A + DP Q + D Q+L E A+ ++A  ++++ ++       E+
Sbjct: 66  GYQPENDKDAIFSAICKAQEADPVQMKKDGQRLTELAQSKSAQEMLDWITQAANSGGDEL 125

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA--NATE-----PTVLEKLCAVLNVNK 175
           +  L++IA+       F YSR FA+GLF LLEL+  N T+        L  +C VLN+++
Sbjct: 126 QWQLRNIAQNPK----FKYSRLFAIGLFTLLELSEGNITQDEESLAEFLPNICTVLNISE 181

Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 211
             + +DL++YR  L K+ Q ++ + + ++ +KK+RE
Sbjct: 182 SKLQKDLEIYRGNLDKIAQVRQAMDDILEAQKKRRE 217


>gi|334116992|ref|ZP_08491084.1| Protein thf1 [Microcoleus vaginatus FGP-2]
 gi|333461812|gb|EGK90417.1| Protein thf1 [Microcoleus vaginatus FGP-2]
          Length = 237

 Score =  157 bits (398), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 85/215 (39%), Positives = 130/215 (60%), Gaps = 9/215 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK +F  ++ RPI SIY  V++EL+V+ HL+     +QYDP++ALG VT +DR M 
Sbjct: 6   TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSANADFQYDPIYALGVVTAFDRFML 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E DR +IF A   +L++DP++Y+ DAQ+LE  A   +   L+ +  +    E    
Sbjct: 66  GYAPEADRVSIFNALCKSLEDDPDRYKQDAQRLESLADRLSGQELLSWLDRSTSFEDTAD 125

Query: 128 DIAERASGKGN--FSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 178
             A   +   N  F YSR FA+GLF LLE A+        T    + K+ A L++ +  V
Sbjct: 126 LQASLGAIASNPQFKYSRLFAIGLFSLLEKADPNLVKDQETRNDAIAKVSAALHLPEDKV 185

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
            +DLD+YR+ L K+ QA+ +L++ +  E+KKRE+R
Sbjct: 186 SKDLDLYRSNLEKMAQARIVLQDVIQAERKKREKR 220


>gi|411116557|ref|ZP_11389044.1| photosystem II biogenesis protein Psp29 [Oscillatoriales
           cyanobacterium JSC-12]
 gi|410712660|gb|EKQ70161.1| photosystem II biogenesis protein Psp29 [Oscillatoriales
           cyanobacterium JSC-12]
          Length = 246

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 84/216 (38%), Positives = 131/216 (60%), Gaps = 11/216 (5%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  ++ RPI SIY  V++EL+V+ HL+     Y Y+P++ALG VT ++R M+
Sbjct: 6   TVSDTKRAFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDYSYNPIYALGVVTSFERFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E D+  IF A   AL++DP +YR DAQ+L ++A+ ++A  +V +  +     G   
Sbjct: 66  GYRPENDKAPIFDAICQALQDDPNRYRHDAQRLNDFAKQKSAKDIVTWLEQAATSYG-GD 124

Query: 128 DIAERASGKGN---FSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRS 177
           D+ E+     N   F YSR FA+GLF L E A+A           +L++ CA L ++   
Sbjct: 125 DLQEQVKAIANNPKFKYSRLFAIGLFTLFETADAEVVKKEGEREELLKQACAALRLSHDK 184

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
           V RDL++YR+ L K+ QA+ ++ + +  EKKKRE +
Sbjct: 185 VQRDLELYRSNLEKVAQAQAVMADMLAAEKKKREHK 220


>gi|119488459|ref|ZP_01621632.1| hypothetical protein L8106_23815 [Lyngbya sp. PCC 8106]
 gi|119455270|gb|EAW36410.1| hypothetical protein L8106_23815 [Lyngbya sp. PCC 8106]
          Length = 241

 Score =  155 bits (393), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 82/216 (37%), Positives = 135/216 (62%), Gaps = 11/216 (5%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F   + RPI S+Y  V++EL+V+ HL+     +QYDP++ALG V+ +DR M+
Sbjct: 6   TVSDTKRAFYNTHTRPINSVYRRVIEELMVEMHLLSVNVDFQYDPIYALGVVSAFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 125
           GY  E D+E+IF   I AL++DP++YR +AQ+L+E+A+  +   +V +   +   EV   
Sbjct: 66  GYLPESDKESIFHGLINALQDDPQRYRAEAQRLQEFAQTLSVQDIVSWVDVAANSEVHND 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN--------ATEPTVLEKLCAVLNVNKRS 177
           L+   ++ +    + YSR  A+GLF L+E A+        AT+ T L +L + LN+    
Sbjct: 126 LQSSFQKIATNPKYKYSRILAIGLFTLIEQADPQAMEDKEATQQT-LAQLASGLNLPLDK 184

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
           + +DL++YR+ L KL QA+ ++ E    E+K+RE+R
Sbjct: 185 LQKDLELYRSNLEKLKQARIVMDEMTQAERKRREQR 220


>gi|428317172|ref|YP_007115054.1| Protein thf1 [Oscillatoria nigro-viridis PCC 7112]
 gi|428240852|gb|AFZ06638.1| Protein thf1 [Oscillatoria nigro-viridis PCC 7112]
          Length = 237

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 84/215 (39%), Positives = 130/215 (60%), Gaps = 9/215 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK +F  ++ RPI SIY  V++EL+V+ HL+     +QYDP++ALG VT +DR M 
Sbjct: 6   TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSANADFQYDPIYALGVVTAFDRFML 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E DR +IF A   ++++DP++Y+ DAQ+LE  A   +   L+ +  +    E    
Sbjct: 66  GYVPEADRVSIFNALCKSVEDDPDRYKQDAQRLESLADRLSGQELLSWLDRSTSFEDTAD 125

Query: 128 DIAERASGKGN--FSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 178
             A   +   N  F YSR FA+GLF LLE A+        T    + K+ A L++ +  V
Sbjct: 126 LQASLGAIASNPQFKYSRLFAIGLFSLLEKADPNLVKDQETRNDAIAKVSAGLHLPEDKV 185

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
            +DLD+YR+ L K+ QA+ +L++ +  E+KKRE+R
Sbjct: 186 SKDLDLYRSNLEKMAQARIVLQDVIQAERKKREKR 220


>gi|354568723|ref|ZP_08987886.1| Protein thf1 [Fischerella sp. JSC-11]
 gi|353539977|gb|EHC09457.1| Protein thf1 [Fischerella sp. JSC-11]
          Length = 235

 Score =  153 bits (386), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 81/221 (36%), Positives = 135/221 (61%), Gaps = 17/221 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+P+FALG VT +DR M+
Sbjct: 6   TVSDTKRTFHTLHTRPINTIYRRVVEELMVEMHLLAVNVDFSYNPIFALGVVTSFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
           GY  E D+E+IF A + A++ DP+ YR DAQ+L+E A+      L+   S      ++ +
Sbjct: 66  GYQPESDKESIFNALLRAIEADPQIYRQDAQRLQELAKSLPPQDLIAALSLQTQLNRDTD 125

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 174
           ++  L+ IA        F YSR FA+GLF LLEL++             L+ + A L+++
Sbjct: 126 LQSHLQAIA----SNPKFKYSRLFAIGLFSLLELSDPELVKDEKQRTEALKSIAAGLHIS 181

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
              +++DL++YR+ L K+ QA  ++ + +  ++KKRE+R++
Sbjct: 182 DDKLNKDLELYRSNLDKMAQALVVMADMLSADRKKREQRSQ 222


>gi|113474941|ref|YP_721002.1| Thf1-like protein [Trichodesmium erythraeum IMS101]
 gi|123056927|sp|Q116P5.1|THF1_TRIEI RecName: Full=Protein thf1
 gi|110165989|gb|ABG50529.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 239

 Score =  152 bits (385), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 82/216 (37%), Positives = 129/216 (59%), Gaps = 9/216 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F   + RPI SIYN V++EL+V+ HL+     Y Y+P +ALG VT +DR M+
Sbjct: 6   TVSDTKKTFYHFHTRPINSIYNRVIEELLVEMHLISVNVDYSYNPFYALGVVTAFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 125
           GY  +ED+ +IF A I   +EDP +YR DA+ LE+ A   +AS ++ +   SK  +    
Sbjct: 66  GYSPQEDKTSIFNALIQGQEEDPNKYRSDAKGLEDLAGKISASDILSWICLSKNIDNTQY 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 178
           L+D     S    F YSR FA+GLF LLE+ +             L+K+C  LN+ +  +
Sbjct: 126 LQDDLRAISENSKFRYSRLFAIGLFTLLEIVDTELIKEQEKRTEALKKICQSLNLVEEKL 185

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 214
            +D+D+Y + L ++ QA+  +++ +   +KKRE+R+
Sbjct: 186 LKDIDLYLSNLERVAQARSAMEDTLAAMRKKREKRS 221


>gi|424513129|emb|CCO66713.1| Thf1-like protein [Bathycoccus prasinos]
          Length = 222

 Score =  152 bits (385), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 83/220 (37%), Positives = 131/220 (59%), Gaps = 12/220 (5%)

Query: 6   PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
           P TVA+TK  F K Y  P+PSI+ TVLQEL+V  H       YQ++ + +LGFV+V+D+L
Sbjct: 4   PATVADTKAKFTKGYPYPLPSIWATVLQELLVGMHFTVTSSKYQHEEMRSLGFVSVFDQL 63

Query: 66  MEGYPSEE--DREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA-SSLVEFPSKEGEV 122
            EGYP+E+   +E IF  ++ AL ED +++R DA+KL  +A  QT+   ++  P      
Sbjct: 64  FEGYPTEDPNAKEKIFSTFMEALGEDSKKWRADAEKLSAFATEQTSIDGIIANP------ 117

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
             +   +  +   K +  Y +F A+G FR LE++  T P  L+K+     V    ++ DL
Sbjct: 118 --MFASMKSKVESK-SLVYDKFIAIGFFRALEMSKQTSPENLKKISEASGVTLEKINGDL 174

Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
            +Y+++LS++  AKEL  E ++RE++K  ER E + A +A
Sbjct: 175 GLYKSVLSRMNAAKELQAEVLERERRKTAERMEKKAAKDA 214


>gi|434405136|ref|YP_007148021.1| photosystem II biogenesis protein Psp29 [Cylindrospermum stagnale
           PCC 7417]
 gi|428259391|gb|AFZ25341.1| photosystem II biogenesis protein Psp29 [Cylindrospermum stagnale
           PCC 7417]
          Length = 235

 Score =  152 bits (383), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 83/234 (35%), Positives = 139/234 (59%), Gaps = 24/234 (10%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYTLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNPIYALGVVTTFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
           GY  E D+E+IF A I A++++P++YR DA++L+  A+G     L+ + S      ++  
Sbjct: 66  GYQPERDQESIFNAIIQAVEQEPQRYRQDAERLQAVAQGLPEQDLIAWLSQTTHSDRDAN 125

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA-------NATEPTVLEKLCAVLNVN 174
           ++  L+ IA       NF YSR FA+GLF LLE++       +      L+ +   L+++
Sbjct: 126 LQAQLQAIA----NNSNFKYSRLFAIGLFSLLEVSSPELVKDDKQRNEALKAIATGLHLS 181

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE-------PQKANE 221
              + +DL++YR+ L K+ QA  ++ + V  ++KKRE+R +       P  ANE
Sbjct: 182 DDKLSKDLELYRSNLDKMAQALIVMADMVSADRKKREQRKQQASTPVAPPSANE 235


>gi|428775508|ref|YP_007167295.1| photosystem II biogenesis protein Psp29 [Halothece sp. PCC 7418]
 gi|428689787|gb|AFZ43081.1| photosystem II biogenesis protein Psp29 [Halothece sp. PCC 7418]
          Length = 243

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 82/214 (38%), Positives = 127/214 (59%), Gaps = 9/214 (4%)

Query: 4   DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
           D   T++ETK  F  L+ RP+ SIY  V++EL+V+ HL+     ++YDP +ALG VTV+D
Sbjct: 2   DTLRTLSETKRTFYTLHTRPLNSIYRRVIEELLVEMHLLTVNIDFKYDPFYALGVVTVFD 61

Query: 64  RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGE 121
             M+GY  E+D+E+IF A   A++ DP+QYR DA+K++  A   +  ++  +   +K  +
Sbjct: 62  TFMQGYQPEKDKESIFNAICKAVESDPQQYRQDAEKVKSIADQASGEAVTAWLCEAKPLD 121

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVN 174
             G L DI +       F YSR F +G++ +LE AN            VL   C  LN+ 
Sbjct: 122 QAGDLNDILQGIRENPRFKYSRLFIIGIYTVLEKANPEIVNDDKKREEVLNNCCQALNLP 181

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKK 208
           K  VD+DLD+YR+ L K+ QA+ +L++ V  ++K
Sbjct: 182 KEKVDKDLDLYRSNLEKMEQARSVLEDVVRADRK 215


>gi|220910509|ref|YP_002485820.1| Thf1-like protein [Cyanothece sp. PCC 7425]
 gi|254784141|sp|B8HQ62.1|THF1_CYAP4 RecName: Full=Protein thf1
 gi|219867120|gb|ACL47459.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7425]
          Length = 236

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 89/228 (39%), Positives = 137/228 (60%), Gaps = 14/228 (6%)

Query: 6   PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
           P TV++TK  F   + RPI SIY  V++EL+V+ HL+R  +T+ YDPVFALG VT ++R 
Sbjct: 4   PRTVSDTKRAFYHNHARPINSIYRRVVEELLVEIHLLRVNQTFVYDPVFALGVVTTFERF 63

Query: 66  MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 125
           M+GY    D+ +IF A   A + DP+Q + DAQ+L    RGQ+  SL+++ S    + G 
Sbjct: 64  MQGYHPPADQTSIFNAICLAQELDPQQVQQDAQELLGRVRGQSLESLLDWISTAASLGGD 123

Query: 126 LKDIAERA-SGKGNFSYSRFFAVGLFRLLELANATEP----------TVLEKLCAVLNVN 174
            +    RA +    F YSR FAVGLF LLE A   EP           VL+++  V+++ 
Sbjct: 124 EQQNRLRAIASNPTFKYSRLFAVGLFTLLEQA---EPELGKDEARLLQVLQQVGEVMHLP 180

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
              + +DL+ YR+ L K+ QA++ L++ V  E+K+R++   P ++ E+
Sbjct: 181 VEKMQKDLEQYRSNLEKMTQARKTLEDIVAAERKRRQQNAAPDRSPES 228


>gi|17228142|ref|NP_484690.1| Thf1-like protein [Nostoc sp. PCC 7120]
 gi|81772969|sp|Q8YZ41.1|THF1_ANASP RecName: Full=Protein thf1
 gi|17129992|dbj|BAB72604.1| all0646 [Nostoc sp. PCC 7120]
          Length = 233

 Score =  150 bits (380), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 80/221 (36%), Positives = 134/221 (60%), Gaps = 17/221 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT +DR ME
Sbjct: 6   TVSDTKRTFYALHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFME 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
           GY  E D+E+IF A   A++++P++YR DA++L+  A+    + LV + S      ++ +
Sbjct: 66  GYQPERDKESIFSAICQAVEQEPQRYRQDAERLQAVAQSLPVNDLVAWLSQANHLQQDAD 125

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 174
           ++  L+ IA       NF YSR FA+GLF LLE +N             L+ + A L+++
Sbjct: 126 LQAQLQAIA----NNSNFKYSRLFAIGLFTLLEQSNPDLVKDEKQRTEALKSIAAGLHLS 181

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
                +DL++YR+ L K+ QA  ++ + +  ++KKRE+R +
Sbjct: 182 DDKFSKDLELYRSNLDKMTQALAVMADMLTADRKKREQRQQ 222


>gi|443311308|ref|ZP_21040938.1| photosystem II biogenesis protein Psp29 [Synechocystis sp. PCC
           7509]
 gi|442778631|gb|ELR88894.1| photosystem II biogenesis protein Psp29 [Synechocystis sp. PCC
           7509]
          Length = 241

 Score =  150 bits (378), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 77/228 (33%), Positives = 141/228 (61%), Gaps = 9/228 (3%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F   + RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT Y+R M+
Sbjct: 6   TVSDTKRAFYSTHTRPINTIYRRVVEELMVEMHLLSVNADFSYNPIYALGVVTSYERFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 125
           GY  E D+++IFQA   A+  DP QYR DA++L  +A+  ++  L+++ S E  ++G   
Sbjct: 66  GYQPERDKDSIFQALCQAINTDPHQYRQDAERLGSFAKSLSSQDLMQWLSSEKPIDGYSD 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRSV 178
           L++  ++ +    F YSR FA+G+F LLEL++              +++ + L++ +  +
Sbjct: 126 LQEQIKQIATNQKFKYSRLFAIGVFSLLELSDPELVKDETKRVEAFKQISSSLHLPEDKL 185

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKC 226
           ++DL++YR  + K+ QA  ++++ +  E+KKR+++ + Q+A  A K  
Sbjct: 186 NKDLELYRANVEKMNQALIVMEDMLAAERKKRQKKADEQQAALAAKSS 233


>gi|427727466|ref|YP_007073703.1| photosystem II biogenesis protein Psp29 [Nostoc sp. PCC 7524]
 gi|427363385|gb|AFY46106.1| photosystem II biogenesis protein Psp29 [Nostoc sp. PCC 7524]
          Length = 235

 Score =  150 bits (378), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 83/234 (35%), Positives = 137/234 (58%), Gaps = 24/234 (10%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYSLHTRPINTIYRRVVEELMVEMHLLSVNIDFTYNPIYALGVVTTFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
           GY  E D+E+IF A   A++++P++YR DA++L+  A+    S LV + S      ++ +
Sbjct: 66  GYRPERDKESIFHAICQAVEQEPQRYRQDAERLQNLAKSLPISDLVAWLSQTTHFNQDPD 125

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVN 174
           ++  L+ IA       NF YSR FA+GLF LLE ++             L+ +   L++ 
Sbjct: 126 LQAQLQAIA----NNPNFKYSRLFAIGLFSLLEYSDPDLVKDEKQRTEALKNIANGLHLA 181

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE-------PQKANE 221
              + +DLD+YR+ L K+ QA  ++ + +  ++KKRE+R +       P  ANE
Sbjct: 182 DDKLSKDLDLYRSNLDKMTQALTVIADMISADRKKREQRQQQSSSVVAPPTANE 235


>gi|300866330|ref|ZP_07111033.1| Protein thf1 [Oscillatoria sp. PCC 6506]
 gi|300335673|emb|CBN56193.1| Protein thf1 [Oscillatoria sp. PCC 6506]
          Length = 267

 Score =  149 bits (377), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 81/217 (37%), Positives = 133/217 (61%), Gaps = 9/217 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK +F  ++ RPI SIY  V++EL+V+ HL+     ++Y+P++ALG VT ++R M+
Sbjct: 36  TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDFRYNPIYALGVVTAFERFMQ 95

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 125
           GY  E+D+ +IF     AL +DP++Y+ DA++LE  A   +   L+ +   S   E  G 
Sbjct: 96  GYLPEQDKVSIFNGLCQALGDDPQRYQQDARRLEGLASRVSILDLLSWLEGSTSFEDTGD 155

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN---ATEP----TVLEKLCAVLNVNKRSV 178
           L+      +    F YSR FA+GLF LLE+ +     +P      + K+CA L++ +  V
Sbjct: 156 LQASITAIATNSKFKYSRLFAIGLFALLEIVDPDLVKDPEARVQAIAKVCAALHLPEEKV 215

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
            +DLD+YR+ L K+ QA+ +L + +  ++KKRE+R E
Sbjct: 216 TKDLDLYRSNLEKIAQARIVLADVLQADRKKREKRAE 252


>gi|186685250|ref|YP_001868446.1| Thf1-like protein [Nostoc punctiforme PCC 73102]
 gi|254784144|sp|B2J353.1|THF1_NOSP7 RecName: Full=Protein thf1
 gi|186467702|gb|ACC83503.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
          Length = 235

 Score =  149 bits (376), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 80/227 (35%), Positives = 137/227 (60%), Gaps = 21/227 (9%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNPIYALGVVTTFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV------EFPSKEGE 121
           GY  E D+E+IF A   A+++DP+ YR DA++L+  A+G     L+       +  ++ +
Sbjct: 66  GYEPERDQESIFNALCRAIEQDPQHYRQDAERLQAIAKGLPVKDLIGWLGQTTYLDRDAD 125

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA---------TEPTVLEKLCAVLN 172
           ++  L+ IA       NF Y+R FA+G+F LLE ++          TE   L+ + A L+
Sbjct: 126 LQAQLQAIA----NNPNFKYNRLFAIGVFSLLEQSDPELVKDEKQLTE--ALKAIAAGLH 179

Query: 173 VNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
           V+   +++DL++YR+ L K+ QA  ++ + +  ++KKRE+R +   A
Sbjct: 180 VSDDKLNKDLELYRSNLDKMAQALVVMADMLSADRKKREQRKQQSTA 226


>gi|75910773|ref|YP_325069.1| Thf1-like protein [Anabaena variabilis ATCC 29413]
 gi|97202708|sp|Q3M4B2.1|THF1_ANAVT RecName: Full=Protein thf1
 gi|75704498|gb|ABA24174.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
          Length = 233

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 135/221 (61%), Gaps = 17/221 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYALHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
           GY  E D+E+IF A   A++++P++YR DA++L+  A+    + LV + S      ++ +
Sbjct: 66  GYQPERDKESIFSAICQAVEQEPQRYRQDAERLKAVAQSLPVNDLVAWLSQANHLQQDAD 125

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 174
           ++  L+ IA       NF YSR FA+GLF LLE +N             L+ + A L+++
Sbjct: 126 LQAQLQAIA----SNPNFKYSRLFAIGLFTLLEQSNPDLVKDEKQRTEALKTIAAGLHLS 181

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
              + +DL++YR+ L K+ QA  ++ + +  ++KKRE+R +
Sbjct: 182 DDKLSKDLELYRSNLDKMTQALAVMADMLTADRKKREQRQQ 222


>gi|440683252|ref|YP_007158047.1| Protein thf1 [Anabaena cylindrica PCC 7122]
 gi|428680371|gb|AFZ59137.1| Protein thf1 [Anabaena cylindrica PCC 7122]
          Length = 235

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 80/221 (36%), Positives = 133/221 (60%), Gaps = 17/221 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     Y Y+P++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNVDYSYNPIYALGVVTTFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
           GY  E D+E+IF A   A+++D ++YR DA +L+  A+      L+ + S      K+ +
Sbjct: 66  GYLPERDQESIFNALCQAVEQDQQRYRQDATRLQAIAQSLPVQDLIAWVSQTTHLDKDAD 125

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 174
           ++  L+ IA       NF YSR FA+GLF LLELA+             L+ +   L+++
Sbjct: 126 LQAQLQAIAH----NPNFKYSRLFAIGLFSLLELADPELVKDEKQRNEALKAIAQGLHLS 181

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
           +  + +DLD+YR+ L K+ QA  ++ + +  ++KKR++R +
Sbjct: 182 EDKLSKDLDLYRSNLDKMAQALIVMADILSADRKKRDQRQQ 222


>gi|428778484|ref|YP_007170270.1| photosystem II biogenesis protein Psp29 [Dactylococcopsis salina
           PCC 8305]
 gi|428692763|gb|AFZ48913.1| photosystem II biogenesis protein Psp29 [Dactylococcopsis salina
           PCC 8305]
          Length = 240

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/209 (36%), Positives = 123/209 (58%), Gaps = 9/209 (4%)

Query: 4   DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
           D   T++ETK  F   + RP+ SIY  V++EL+V+ HL+     ++YDP++ALG  TV+D
Sbjct: 2   DTLRTLSETKRTFYTQHTRPLNSIYRRVIEELLVEMHLLSVNTDFKYDPIYALGVTTVFD 61

Query: 64  RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
             M+GY  E+++E+IF A   A++ DP++YR DA+KL+  A   +   +    S+   ++
Sbjct: 62  TFMQGYQPEKEKESIFNAICQAVENDPQKYRQDAEKLKSIAANHSGEEVTACLSELKPLD 121

Query: 124 GL--LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLEKLCAVLNVN 174
           G   L  + +       F YSR F +GL+ +LE AN    T       VL+K C  L + 
Sbjct: 122 GAEELTKVLQEIKNNSRFKYSRLFIIGLYTILETANPDLVTDDKKREEVLQKCCQGLGLP 181

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYV 203
           K  VD+DLD+YR+ L K+ QA+ +L++ +
Sbjct: 182 KEKVDKDLDLYRSNLEKMEQARSVLEDAI 210


>gi|376001810|ref|ZP_09779664.1| Putative thylakoid formation protein, Thf1-like [Arthrospira sp.
           PCC 8005]
 gi|375329721|emb|CCE15417.1| Putative thylakoid formation protein, Thf1-like [Arthrospira sp.
           PCC 8005]
          Length = 243

 Score =  145 bits (367), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 80/215 (37%), Positives = 123/215 (57%), Gaps = 9/215 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  ++ RPI SIY  V++EL+V+ HL+     ++YDP++ALG VT +DR M+
Sbjct: 6   TVSDTKRAFYNIHTRPINSIYRRVVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 125
           GY  E D+ +I+ A I A + DP QYR DA  LE  A   +   L E    ++E   +  
Sbjct: 66  GYIPEADKLSIWAALIMAQESDPNQYRADATALEAQAATLSVKDLTERAKIAQESSGDDP 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEP-------TVLEKLCAVLNVNKRSV 178
           L+      +    F YSR FA+GL+ LLE ++ T          ++      L + K  +
Sbjct: 126 LQSCFHAIANNPKFKYSRLFAIGLYTLLEKSDVTAAQDSEGLKNIIIDFSEALRLPKDKL 185

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
           ++DLD+YR  L K+ QA+ +++E    E+KKRE+R
Sbjct: 186 EKDLDLYRTNLEKVAQARLMVEEMTQAERKKREQR 220


>gi|428210102|ref|YP_007094455.1| photosystem II biogenesis protein Psp29 [Chroococcidiopsis
           thermalis PCC 7203]
 gi|428012023|gb|AFY90586.1| photosystem II biogenesis protein Psp29 [Chroococcidiopsis
           thermalis PCC 7203]
          Length = 250

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 75/224 (33%), Positives = 138/224 (61%), Gaps = 9/224 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK NF   + RPI +IY  V++EL+V+ HL+     ++YDP++ALG VT ++R M+
Sbjct: 6   TVSDTKRNFYNQHTRPINTIYRRVVEELMVEMHLLSVNADFRYDPIYALGVVTAFERFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 125
           GY  E D+E IF+A   +++++P++YR DA +L +  +  +A  L ++   +  ++G   
Sbjct: 66  GYQPERDKEPIFEALCQSIEDNPQRYRQDADRLRQLLQNVSAQQLFDWIDGKASLQGAED 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 178
           L+   +  +    F YSR FA+G+F LLELA+A            L+++   L+V +  +
Sbjct: 126 LQAQMQAIAQNSKFKYSRLFAIGVFTLLELADAELVKDEKQRVEALKQVATALHVPEDKL 185

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
           ++DL++YR+ L K+ QA   + + +  +++KR++R + ++A  A
Sbjct: 186 NKDLELYRSNLDKIEQALITMADILSADRRKRQQRLQEKEAGVA 229


>gi|427707894|ref|YP_007050271.1| Protein thf1 [Nostoc sp. PCC 7107]
 gi|427360399|gb|AFY43121.1| Protein thf1 [Nostoc sp. PCC 7107]
          Length = 235

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 77/230 (33%), Positives = 138/230 (60%), Gaps = 16/230 (6%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYSLHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV--EGL 125
           GY  E D+E+IFQA   A++++ ++YR DA++L+  A+   A+ L+ + S+   +  +  
Sbjct: 66  GYQPERDKESIFQAICQAVEQEVQRYRQDAERLQALAKSLAANDLIAWLSQTNHLNQDPD 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRSV 178
           L+   +  +    F Y+R FA+GLF LLE ++             ++ + A L++++  +
Sbjct: 126 LQSQLQAIANNSQFKYNRLFAIGLFSLLEQSDPDLVKDEKQRTDAIKTIAAGLHLSEDKL 185

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE-------PQKANE 221
            +DL++YR+ L K+ QA  ++ + +  ++KKRE+R +       P  ANE
Sbjct: 186 SKDLELYRSNLEKMSQALVVMADMISADRKKREQRQQQSTMPVTPPTANE 235


>gi|434388267|ref|YP_007098878.1| photosystem II biogenesis protein Psp29 [Chamaesiphon minutus PCC
           6605]
 gi|428019257|gb|AFY95351.1| photosystem II biogenesis protein Psp29 [Chamaesiphon minutus PCC
           6605]
          Length = 234

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 80/221 (36%), Positives = 127/221 (57%), Gaps = 13/221 (5%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK NF   + RPI SIY  V++EL+V+ HL+     + YDP++ALG V+ +DR M 
Sbjct: 6   TVSDTKRNFYSQHTRPINSIYRRVVEELMVEMHLLSTNVDFAYDPIYALGVVSSFDRFMT 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF---PSKEGEVEG 124
            Y  E D+++IF A   ++  + +QYR DA  +EE+AR    S ++++   P+ +G    
Sbjct: 66  SYRPEADKQSIFVALCESMGGNAQQYRTDATAVEEFARSMQGSDIIDWIAHPTADGMGAQ 125

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELA--------NATEPTVLEKLCAVLNVNKR 176
           L   +   AS    F YSR F +GLF +LE A           E  VL+ +   L++ K 
Sbjct: 126 LATTLQSIASNP-KFKYSRLFGIGLFTILEQAAPDLLKDEKKREAAVLQ-IAEALHLPKD 183

Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 217
              +DLD YR+ L KL+Q + ++ +  + E+KKRE+R + +
Sbjct: 184 KAQKDLDTYRSNLDKLVQMEAVMADLAEAERKKREKRAQAK 224


>gi|428219024|ref|YP_007103489.1| Protein thf1 [Pseudanabaena sp. PCC 7367]
 gi|427990806|gb|AFY71061.1| Protein thf1 [Pseudanabaena sp. PCC 7367]
          Length = 260

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 83/224 (37%), Positives = 129/224 (57%), Gaps = 10/224 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++ K +F + + RPI S+Y  V+ EL+V+ HL+   +T+ YDPVFALG +T YDR M 
Sbjct: 6   TVSDAKRDFFQAFPRPINSVYRRVVDELLVEMHLLTVNQTFAYDPVFALGAITAYDRFML 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 125
           GY  E +R+ I  A   A+  + EQ R DA  L E A  ++   + +F +     E L  
Sbjct: 66  GYEPESERDRILPAICGAVHLNAEQMRHDASSLAELAM-RSPIDVKQFLTSLETTENLEP 124

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELA-------NATEPTVLEKLCAVLNVNKRSV 178
           L       +    F YSR FA+GLF LLE A       N     +++++   LN+    +
Sbjct: 125 LTGTIRAIAANQKFKYSRLFAIGLFTLLETADPNTMSDNDKRQELIKQVGDALNLGSEKL 184

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
            +DLD+YR+ L K+ QA++++K+ V+ E+KK+E+R  P K++ A
Sbjct: 185 IKDLDLYRSNLEKVEQARQMMKDLVEAERKKKEQRENPPKSDAA 228


>gi|291567260|dbj|BAI89532.1| hypothetical protein [Arthrospira platensis NIES-39]
          Length = 243

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 80/215 (37%), Positives = 120/215 (55%), Gaps = 9/215 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  ++ RPI SIY  V++EL+V+ HL+     ++YDP++ALG VT +DR M+
Sbjct: 6   TVSDTKRAFYHIHTRPINSIYRRVVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 125
           GY  E D+ +I+ A I A + DP QYR DA  LE          L +    ++E   +  
Sbjct: 66  GYIPEADKLSIWAALIGAQESDPNQYRADATALEAQVASLAVKDLTDKAKMAQESSGDDP 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEP-------TVLEKLCAVLNVNKRSV 178
           L+      +    F YSR  A+GL+ LLE ++AT         T+L      L + K  +
Sbjct: 126 LQSCFHAIANNPKFKYSRLLAIGLYTLLEKSDATAAQDSEGLKTILSDFSEALRLPKDKL 185

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
            +DLD+YR  L K+ QA+ ++ E    E+KKRE+R
Sbjct: 186 VKDLDLYRTNLEKVAQARLMVDEMTQAERKKREQR 220


>gi|434395245|ref|YP_007130192.1| Protein thf1 [Gloeocapsa sp. PCC 7428]
 gi|428267086|gb|AFZ33032.1| Protein thf1 [Gloeocapsa sp. PCC 7428]
          Length = 251

 Score =  143 bits (360), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 76/221 (34%), Positives = 133/221 (60%), Gaps = 10/221 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F   + RPI +IY  V++EL+V+ HL+     + Y+P++ALG VT ++R M+
Sbjct: 6   TVSDTKRAFYTSHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTAFERFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE--GL 125
           GY  E D+E+IF A   A++ DP++YR DA++L  +A+  +   L+ +   E   E  G 
Sbjct: 66  GYQPERDKESIFNALCQAVESDPQRYRQDAERLGLFAKNTSTPELIAWLRGETHKEEVGD 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRSV 178
           L+   +  +   +F YSR FA+G+F LLEL++             L+ + A LN+++  +
Sbjct: 126 LQQQIQAIAHNPHFKYSRLFAIGVFGLLELSDPALVKDEKQRVDALKSIAATLNISEDKL 185

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
           ++DL++YR  + K+ QA   + + +  ++KKR+++T P K 
Sbjct: 186 NKDLELYRANVDKMEQALATIADILSADRKKRQQQT-PDKG 225


>gi|298491449|ref|YP_003721626.1| photosystem II biogenesis protein Psp29 ['Nostoc azollae' 0708]
 gi|298233367|gb|ADI64503.1| photosystem II biogenesis protein Psp29 ['Nostoc azollae' 0708]
          Length = 235

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 77/225 (34%), Positives = 136/225 (60%), Gaps = 17/225 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     ++Y+ ++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNVDFRYNSIYALGVVTAFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
           GY  E+D+ +IF A I A+++DP++YR DA +L+  A+      L+ + S      ++ +
Sbjct: 66  GYQPEQDQASIFNAIIQAVEQDPQRYRQDAARLQVVAQSLLTKDLISWLSQTTYLDQDRD 125

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 174
           ++  L+ IA  A     F YSR FA+GLF LLE+ ++            L+ +   L+++
Sbjct: 126 LQAQLQAIANNAE----FKYSRLFAIGLFSLLEMVDSELVKDEKQRNQALKAIAQGLHLS 181

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
           +  + +DL++YR+ L KL QA  ++ + +  ++KKR++R +   A
Sbjct: 182 EEKLTKDLELYRSNLDKLAQALIVMADMLAADRKKRDQRQQKSTA 226


>gi|452819272|gb|EME26335.1| thylakoid protein [Galdieria sulphuraria]
          Length = 316

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 81/209 (38%), Positives = 120/209 (57%), Gaps = 7/209 (3%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TVAET  +FLK ++ PIPSIY T++QEL+V  HL R    +QYDPVFALG+  V     +
Sbjct: 86  TVAETISDFLKHFRHPIPSIYRTIVQELLVTTHLARVAVGFQYDPVFALGYQMVTQVFFK 145

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE--VEGL 125
            YP  E++E +F +   AL  D E+ + DA  LEEW R +T   ++    + G+  +  L
Sbjct: 146 SYPKVEEKEKLFDSMCKALLLDYERMKKDASVLEEWTRSRTEREILLAIEEGGDDPLANL 205

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN-ATEPTVLEKLCAVLNVNKRSVDRDLDV 184
           L  IA+       F YSR F +GL R++EL          +K  + L+++   +++DLD 
Sbjct: 206 LHSIAQ----NDGFVYSRLFGLGLVRMMELCGEEANSERCQKWASALHISSLKLEQDLDT 261

Query: 185 YRNLLSKLLQAKELLKEYVDREKKKREER 213
           Y+  L +L QA++L  E   R+KKK  E+
Sbjct: 262 YQQSLERLKQAEQLFAELEARQKKKLAEK 290


>gi|427719034|ref|YP_007067028.1| Protein thf1 [Calothrix sp. PCC 7507]
 gi|427351470|gb|AFY34194.1| Protein thf1 [Calothrix sp. PCC 7507]
          Length = 235

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 76/221 (34%), Positives = 133/221 (60%), Gaps = 17/221 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + Y+ ++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNSIYALGVVTTFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
           GY  E D+E+IF A   A++++P++YR DA++L   A+   A+ L+ + S      ++ +
Sbjct: 66  GYLPERDQESIFNALCHAVEQEPQRYRQDAERLRVLAKSLPANDLIAWLSQTTHLDQDAD 125

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 174
           ++  L+ IA       NF YSR  A+GLF LLEL++             L+ +   L ++
Sbjct: 126 LQAQLQAIA----NNPNFKYSRLLAIGLFTLLELSDPELVKDEKQRNEALKAIATGLQLS 181

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
              +++DL++YR+ L K+ QA  ++ + +  ++KKRE+R +
Sbjct: 182 DEKLNKDLELYRSNLDKIAQALIVMADVLSADRKKREQRKQ 222


>gi|443317266|ref|ZP_21046682.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 6406]
 gi|442783151|gb|ELR93075.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 6406]
          Length = 251

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 84/236 (35%), Positives = 131/236 (55%), Gaps = 13/236 (5%)

Query: 7   PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
           PTV++TK  F   + RPI S+Y  V++EL+V+ HL+R    + YDPV+ALG VT +DR M
Sbjct: 5   PTVSDTKRAFYSYHNRPIASVYRRVIEELMVEMHLLRVNEDFVYDPVYALGIVTTFDRFM 64

Query: 67  EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL-VEFPSKEGEVEGL 125
            GY  E D  +IF A   A     +QYR DA+ +     G++  +L     S+  E   L
Sbjct: 65  AGYRPEADEASIFAALCQANAGTADQYRRDAEVMVAEVSGRSLDALKAILISRSAEGADL 124

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS-------V 178
           LK + +  + +  F YSR FA+GL+ L+E  +A      EKL  +L     S       +
Sbjct: 125 LKGVLQGIADRDRFKYSRAFAIGLYTLIETVDAEILKDKEKLMELLKAVAESLPLSFDKL 184

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQK-----ANEAIKKCLGE 229
            +D+++YR+ L+K+ QAK ++ + +  ++KKREER + +       N+A+    GE
Sbjct: 185 QKDVELYRSNLTKMEQAKIVMADILAADRKKREERAKAKADAASLPNDAVVTPSGE 240


>gi|428304539|ref|YP_007141364.1| Protein thf1 [Crinalium epipsammum PCC 9333]
 gi|428246074|gb|AFZ11854.1| Protein thf1 [Crinalium epipsammum PCC 9333]
          Length = 243

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 77/217 (35%), Positives = 131/217 (60%), Gaps = 9/217 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK +F   + RPI SIY  V++EL+V+ HL+     + Y P++ALG VT Y++ M+
Sbjct: 6   TVSDTKRDFYNNHTRPINSIYRRVVEELMVEMHLLSVNVDFAYHPIYALGVVTSYEKFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 125
           GY  E DR++IF A + A+ ED ++Y+ DA++L+  A   +   L+++      V+G   
Sbjct: 66  GYRPERDRDSIFDALVGAVGEDSQRYKQDAEQLKALAGRLSGKELIDWIVSPTAVDGAGS 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT----EPTVLEKLCAV---LNVNKRSV 178
           L D     +    F YSR FA+GL+ LLE+++ +    E   L+ L  V   L++    +
Sbjct: 126 LPDQMRAIANNPQFKYSRLFAIGLYTLLEVSDPSLVKDEKERLDALNQVGQSLHLPTEKL 185

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
            +DLD+YR+ L K+ Q +  +K+ ++ ++KKRE+R +
Sbjct: 186 HKDLDLYRSNLEKMAQVQIAMKDALEADRKKREKRDQ 222


>gi|428313474|ref|YP_007124451.1| photosystem II biogenesis protein Psp29 [Microcoleus sp. PCC 7113]
 gi|428255086|gb|AFZ21045.1| photosystem II biogenesis protein Psp29 [Microcoleus sp. PCC 7113]
          Length = 241

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 78/223 (34%), Positives = 130/223 (58%), Gaps = 17/223 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK +F   + RP+ SI+  V++EL+V+ HL+     + Y+P++ALG VT ++R ME
Sbjct: 6   TVSDTKRDFYNHHTRPVNSIFRRVVEELMVEMHLLSVNVDFHYEPIYALGVVTSFNRFME 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE------GE 121
           GY  E D+ +IF A   ++  +PEQY+ DAQ LE  A   T   LV + S        G+
Sbjct: 66  GYRPERDKASIFDALCHSVGNNPEQYKQDAQWLESMAERVTGEELVSWLSAPRPQDTLGD 125

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVN 174
           +   +  IAE       F YSR FA+GL+ LLE A++            L+K+   L++ 
Sbjct: 126 LYAAVAAIAENP----KFKYSRLFAIGLYTLLEKADSELVQDEKRRTEALKKISDGLHLP 181

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 217
           +  + +DL++YR+ L K+ Q + ++++ +  ++KKRE+R + Q
Sbjct: 182 EEKLQKDLELYRSNLQKMEQVRIVIEDAIQADRKKREKRIQDQ 224


>gi|414076688|ref|YP_006996006.1| photosystem II biogenesis protein Psp29 [Anabaena sp. 90]
 gi|413970104|gb|AFW94193.1| photosystem II biogenesis protein Psp29 [Anabaena sp. 90]
          Length = 223

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 76/213 (35%), Positives = 125/213 (58%), Gaps = 9/213 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  L+ RPI +IY  V++EL+V+ HL+     + YD ++ALG VT +DR M+
Sbjct: 6   TVSDTKRTFYTLHTRPINTIYRRVVEELMVEMHLLSVNVDFSYDAIYALGVVTTFDRFMD 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV--EGL 125
           GY  E+D+E+IF+A   A+++DP+ YR DA +L+  A    A  L+   S+   +  +  
Sbjct: 66  GYQPEQDKESIFRAICQAVEQDPQSYRQDASRLQALAASLPAKDLIASLSQASPLNQDAD 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATE--PTVLEKLCAVLNVNKRSV 178
           L+   E  +   NF YSR F VGLF LL     EL    E     L+ +   L++++  +
Sbjct: 126 LQKQLEAVAANSNFKYSRLFGVGLFALLVQSDPELVKKDEQRAEALKAISNGLHISEDKL 185

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 211
            +DL++Y + L K+ QA  ++ + +  ++KKR+
Sbjct: 186 IKDLELYSSNLEKMAQALIVMADILTADRKKRD 218


>gi|56750022|ref|YP_170723.1| Thf1-like protein [Synechococcus elongatus PCC 6301]
 gi|81300364|ref|YP_400572.1| Thf1-like protein [Synechococcus elongatus PCC 7942]
 gi|56684981|dbj|BAD78203.1| hypothetical protein [Synechococcus elongatus PCC 6301]
 gi|81169245|gb|ABB57585.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 280

 Score =  135 bits (341), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 124/217 (57%), Gaps = 10/217 (4%)

Query: 7   PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
           PTV+++K  F   Y RPI  +Y  V++EL+V+ HL+    ++ YDP+FALG VT +D  M
Sbjct: 31  PTVSDSKRAFYAAYPRPINPLYRRVVEELLVEIHLLSVNTSFVYDPLFALGVVTAFDSFM 90

Query: 67  EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE---GEVE 123
             Y   E    +F A   A++++PEQYR DA  + E  RG  + ++ ++ ++    G   
Sbjct: 91  SSYRPIEAVGPLFTALTQAVRQNPEQYRHDANAIAEQVRGVGSDTIRQWLTEAEALGNAP 150

Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELAN---ATEPTVLEKLCAVL----NVNKR 176
            L++   +  +G+  F YSR FA+GLF LLE A      +P  L+     +    ++   
Sbjct: 151 ELVRSSFQAIAGRSEFKYSRLFAIGLFSLLETAAPDLVQDPEALKTTVTAIAERFHLPSD 210

Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
            + +DLD+YR+ L K+ QA+  ++E +  +++KRE+R
Sbjct: 211 KLQKDLDLYRSNLEKMEQARITMEEAIQADRRKREQR 247


>gi|97202823|sp|Q5N664.2|THF1_SYNP6 RecName: Full=Protein thf1
 gi|97202830|sp|Q31MY4.2|THF1_SYNE7 RecName: Full=Protein thf1
          Length = 254

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 124/217 (57%), Gaps = 10/217 (4%)

Query: 7   PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
           PTV+++K  F   Y RPI  +Y  V++EL+V+ HL+    ++ YDP+FALG VT +D  M
Sbjct: 5   PTVSDSKRAFYAAYPRPINPLYRRVVEELLVEIHLLSVNTSFVYDPLFALGVVTAFDSFM 64

Query: 67  EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE---GEVE 123
             Y   E    +F A   A++++PEQYR DA  + E  RG  + ++ ++ ++    G   
Sbjct: 65  SSYRPIEAVGPLFTALTQAVRQNPEQYRHDANAIAEQVRGVGSDTIRQWLTEAEALGNAP 124

Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELAN---ATEPTVLEKLCAVL----NVNKR 176
            L++   +  +G+  F YSR FA+GLF LLE A      +P  L+     +    ++   
Sbjct: 125 ELVRSSFQAIAGRSEFKYSRLFAIGLFSLLETAAPDLVQDPEALKTTVTAIAERFHLPSD 184

Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
            + +DLD+YR+ L K+ QA+  ++E +  +++KRE+R
Sbjct: 185 KLQKDLDLYRSNLEKMEQARITMEEAIQADRRKREQR 221


>gi|427419843|ref|ZP_18910026.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 7375]
 gi|425762556|gb|EKV03409.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 7375]
          Length = 258

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 80/220 (36%), Positives = 130/220 (59%), Gaps = 16/220 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F   + RPI S+Y  V++EL+V+ HL+     + Y+P++ALG +T +DR M 
Sbjct: 15  TVSDTKRAFYNYHSRPINSLYRRVIEELMVEMHLLSVNVDFVYNPLYALGVITSFDRFMV 74

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL-VEFPS-KEGEVEGL 125
           GY  E+D+E+I  A   A++ DP+QYR DA+ L+      + S L  +  S K  +  GL
Sbjct: 75  GYEPEQDKESILSAICQAVEGDPQQYRQDAEALKSDLANLSLSDLNTQLASAKTTDGNGL 134

Query: 126 ---LKDIAERASGKGNFSYSRFFAVGLFRLLELANATE-------PTVLEKLCAVLNVNK 175
              L  +A +AS K    Y+R  AVGL+ L E  + +          +L+    +L +  
Sbjct: 135 QNKLHVVATQASAK----YTRLMAVGLYTLFETVDISSLEDKDSREEMLKTAAEMLALPA 190

Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
             VD+DL++YR+ L K+ QA+E++K+ ++ E+KKRE+R +
Sbjct: 191 EKVDKDLELYRSNLDKMAQAQEVMKDILEAERKKREQRAQ 230


>gi|308801781|ref|XP_003078204.1| inositol phosphatase-like protein (ISS) [Ostreococcus tauri]
 gi|116056655|emb|CAL52944.1| inositol phosphatase-like protein (ISS) [Ostreococcus tauri]
          Length = 657

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 77/201 (38%), Positives = 118/201 (58%), Gaps = 16/201 (7%)

Query: 27  IYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITAL 86
           ++ TV+QEL+VQ H  +Y +  +Y+ + +LGFV+VYD+L EG+PSEE++  IF A++ AL
Sbjct: 79  VWATVVQELLVQGHFQKYNKKSEYNELASLGFVSVYDQLFEGFPSEEEKGKIFNAFLGAL 138

Query: 87  KEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKD--IAERA--SGKGNFSYS 142
            ED  + R DA+            +L  F +    VEGL ++   A+ A  S +G   Y+
Sbjct: 139 DEDAVRTRADAE------------TLGAFATSANGVEGLKENAIFAKLAAKSAEGTLLYT 186

Query: 143 RFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEY 202
           ++ A+G+FR+LELA AT+P  LE L     ++   V  DL +Y+ LLSKL  AKEL +E 
Sbjct: 187 KYIAIGMFRMLELAKATDPAALEALVTAGGLSMSKVSGDLSMYKGLLSKLAAAKELQEEL 246

Query: 203 VDREKKKREERTEPQKANEAI 223
            +  +     R    +A +AI
Sbjct: 247 CETFRSTPRARMSFTEAFKAI 267


>gi|257059049|ref|YP_003136937.1| Thf1-like protein [Cyanothece sp. PCC 8802]
 gi|256589215|gb|ACV00102.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 8802]
          Length = 235

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 73/215 (33%), Positives = 125/215 (58%), Gaps = 10/215 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK +F   + RPI SIY   ++EL+V+ HL+     ++YDP++ALG V  + + M+
Sbjct: 6   TVSDTKRDFYTHHTRPINSIYRRFIEELLVEMHLLCVNIDFRYDPIYALGVVASFQQFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGE-VEG 124
           GY  EED+ +IF A   A+  D E+YR +AQ L    +G + S L+     ++ GE  EG
Sbjct: 66  GYRPEEDKNSIFSALCQAVGGDGEKYRHEAQTLLNQVKGMSVSDLIAMGNSARTGEPGEG 125

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 177
           +L +  +  +    F YSR FA+GL+ ++   +A              +LC  LN++   
Sbjct: 126 MLFNTLQAIANNPQFKYSRLFAIGLYTMVMEIDADLLKEQDKRNETFSQLCNGLNLSSDK 185

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREE 212
           + +DLD+YR+ + K+ Q   ++++ ++ E+KKRE+
Sbjct: 186 LQKDLDLYRSNVDKMGQLLAVIEDALEAERKKREK 220


>gi|218245998|ref|YP_002371369.1| Thf1-like protein [Cyanothece sp. PCC 8801]
 gi|254784143|sp|B7K277.1|THF1_CYAP8 RecName: Full=Protein thf1
 gi|218166476|gb|ACK65213.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 8801]
          Length = 235

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 73/215 (33%), Positives = 125/215 (58%), Gaps = 10/215 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK +F   + RPI SIY   ++EL+V+ HL+     ++YDP++ALG V  + + M+
Sbjct: 6   TVSDTKRDFYNHHTRPINSIYRRFIEELLVEMHLLCVNIDFRYDPIYALGVVASFQQFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGE-VEG 124
           GY  EED+ +IF A   A+  D E+YR +AQ L    +G + S L+     ++ GE  EG
Sbjct: 66  GYRPEEDKNSIFSALCQAVGGDGEKYRHEAQTLLNQVKGMSVSDLIAMGNSARTGEPGEG 125

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 177
           +L +  +  +    F YSR FA+GL+ ++   +A              +LC  LN++   
Sbjct: 126 MLYNTLQAIAKNPQFKYSRLFAIGLYTMVMEIDADLLKEQDKRNETFSQLCNGLNLSSDK 185

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREE 212
           + +DLD+YR+ + K+ Q   ++++ ++ E+KKRE+
Sbjct: 186 LQKDLDLYRSNVDKMGQLLAVIEDALEAERKKREK 220


>gi|428302138|ref|YP_007140444.1| Protein thf1 [Calothrix sp. PCC 6303]
 gi|428238682|gb|AFZ04472.1| Protein thf1 [Calothrix sp. PCC 6303]
          Length = 235

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 75/228 (32%), Positives = 130/228 (57%), Gaps = 18/228 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F  ++ RPI +IY  V++EL+V+ HL+     + Y+P++ALG  T ++R M+
Sbjct: 6   TVSDTKKTFYSIHTRPINTIYRRVVEELMVEMHLLSVNTDFTYNPIYALGVATAFERFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK------EGE 121
           GY  E+D+E +F A   +++ D ++ + +A  L++ A   +   L+   S+       GE
Sbjct: 66  GYDPEKDKEQLFHALCQSVEIDTQKIKQEAHSLKDVAASMSVGDLISCLSRAKRFDNAGE 125

Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVN 174
           ++  L  IA        F YSR FA+GLF LLE A+             L  +   LN++
Sbjct: 126 LQNQLDAIA----SNPKFKYSRLFAIGLFSLLEAASPETVKDEKQRNDALVSIAKGLNIS 181

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
           +  + +DLD+YR+ L K+ QA  ++ + +  ++KKRE+R + QK++ A
Sbjct: 182 EDKLSKDLDLYRSNLDKMAQAMVVMADMLAADRKKREQRAQ-QKSSVA 228


>gi|307155000|ref|YP_003890384.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7822]
 gi|306985228|gb|ADN17109.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7822]
          Length = 233

 Score =  132 bits (333), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 74/233 (31%), Positives = 133/233 (57%), Gaps = 25/233 (10%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ M+
Sbjct: 6   TVSDSKRDFYSKHTRPINSVYRRVVEELLVETHLLSVNSDFHYDPIYALGVVTSFEQFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA--------SSLVEFPSKE 119
           GY  E D+E+IF A   ++  DP+QYR DAQ +   A+  +A        SS + +P  +
Sbjct: 66  GYRPETDKESIFNALCQSVGGDPQQYRGDAQSILSTAKQLSAQDLLSKLQSSSIAYPQGD 125

Query: 120 GEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEP----------TVLEKLCA 169
            ++   L  IA        F Y+R FA+G++ +L     T+P           V++++  
Sbjct: 126 NKIIETLVAIA----NAPKFKYTRLFAIGIYTILA---ETDPELLKDQQKRHEVIKQIAE 178

Query: 170 VLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
           +L++ +  + +DLD+YR+ L K+ Q   +++E +  ++KKRE+R + +   E 
Sbjct: 179 ILHLPEEKMQKDLDLYRSNLEKMEQLLTVIEEALQADRKKREQRDQAKTQAET 231


>gi|218442064|ref|YP_002380393.1| Thf1-like protein [Cyanothece sp. PCC 7424]
 gi|254784142|sp|B7KI38.1|THF1_CYAP7 RecName: Full=Protein thf1
 gi|218174792|gb|ACK73525.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7424]
          Length = 226

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 75/220 (34%), Positives = 132/220 (60%), Gaps = 18/220 (8%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     +QYDPV+ALG VT + R M+
Sbjct: 6   TVSDSKRDFYTKHTRPINSVYRRVVEELMVEMHLLSVNSDFQYDPVYALGVVTSFQRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV-------EFPSKEG 120
           GY  + D+E+IF A   ++  DP+QYR DA+++ E A+  +A  L+       +  S E 
Sbjct: 66  GYRPDADKESIFNALCQSVGGDPQQYRQDAERMIESAKQLSAQQLLFNLESASDSSSGEN 125

Query: 121 EVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNV 173
           ++   L  IA        + Y+R FA+G++ +L     E+   TE    V++++  VL++
Sbjct: 126 QILQTLIGIA----NAPKYKYTRLFAIGIYTILAETDPEMLKNTEKREEVVKQIAKVLHL 181

Query: 174 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
            +  + +DLD+YR+ L K+ Q   +++E +  ++KKRE++
Sbjct: 182 PEEKMQKDLDLYRSNLEKMDQLLTVIEEALQADRKKREQQ 221


>gi|332705256|ref|ZP_08425337.1| photosystem II biogenesis protein Psp29 [Moorea producens 3L]
 gi|332355999|gb|EGJ35458.1| photosystem II biogenesis protein Psp29 [Moorea producens 3L]
          Length = 257

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 73/206 (35%), Positives = 117/206 (56%), Gaps = 11/206 (5%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK +F   + RPI SIY  V++EL+V+ HL+     + YDP++ LG VT +DR M+
Sbjct: 6   TVSDTKRDFYTYHTRPINSIYRRVVEELMVEMHLLSVNVDFNYDPIYGLGVVTCFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLE---EWARGQTASSLVEFPSKEGEVEG 124
            Y  E D+E+IF A   A+  + +QY+ DAQ+L+   +   GQ   S +  P+ E     
Sbjct: 66  SYQPENDKESIFNALCQAVGGEAQQYQEDAQRLKTSVDSMSGQDLISWLSSPTSENGSGD 125

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLEKLCAVLNVNKRS 177
           L   IA  A     F YSR FA+GLF LLE  ++           V+  + + LN+    
Sbjct: 126 LATTIAAIAQ-NSQFKYSRLFAIGLFSLLEQTDSELAQDQKQLEEVINNISSGLNLPSEK 184

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYV 203
           + +DL++YR+ L K+ QA+ ++++ +
Sbjct: 185 LQKDLELYRSNLEKMAQARVVIEDAI 210


>gi|37520969|ref|NP_924346.1| Thf1-like protein [Gloeobacter violaceus PCC 7421]
 gi|81710432|sp|Q7NKS7.1|THF1_GLOVI RecName: Full=Protein thf1
 gi|35211965|dbj|BAC89341.1| glr1400 [Gloeobacter violaceus PCC 7421]
          Length = 228

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 81/221 (36%), Positives = 125/221 (56%), Gaps = 11/221 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K  F   Y RP+ SIY  V+ EL+V+ HL+   + +++DP+FA G +T Y  LME
Sbjct: 6   TVSDSKRAFFAAYPRPVNSIYRRVIDELLVEVHLLITNQDFRHDPLFATGLLTAYQALME 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV-EGLL 126
           GY   E R+AI +A  TAL+   EQ   DA +    A    A  ++E  + + E  +G L
Sbjct: 66  GYTPVEQRDAILRALCTALELSYEQLHTDAAQWRAIAAELPAQEVLEVMAGKREAGDGRL 125

Query: 127 KDIAERASGKGN---FSYSRFFAVGLFRLLELAN----ATEPTVLEKL---CAVLNVNKR 176
           K + +  +G  N   F YSR F++GL  +LE A      +E   LE+L   C  L ++  
Sbjct: 126 KAMGDTLAGIANAERFKYSRLFSLGLANILEQAGRAAAMSEKDRLERLQQICTYLKLDYN 185

Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 217
            V RDLD + ++L ++ ++KE++ E    E++KREER   Q
Sbjct: 186 RVKRDLDFFHSVLERIKRSKEVVDELSQTERRKREERAVSQ 226


>gi|359462375|ref|ZP_09250938.1| Thf1-like protein [Acaryochloris sp. CCMEE 5410]
          Length = 214

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 68/188 (36%), Positives = 115/188 (61%), Gaps = 16/188 (8%)

Query: 36  IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 95
           +V+ HL+R    ++YDP+FALG  T +DR M+GY  E D++AIF A   A + DP Q + 
Sbjct: 1   MVEMHLLRVNEDFRYDPIFALGVTTSFDRFMDGYQPENDKDAIFSAICKAQEADPVQMQK 60

Query: 96  DAQKLEEWARGQTASSLVEFPSKEG-----EVEGLLKDIAERASGKGNFSYSRFFAVGLF 150
           D Q+L E A+ ++A  ++++ ++       E++  L++IA+       F YSR FA+GLF
Sbjct: 61  DGQRLTELAQSKSAQEMLDWITQAANSGGDELQWQLRNIAQNP----KFKYSRLFAIGLF 116

Query: 151 RLLELA--NATE-----PTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYV 203
            LLEL+  N T+        L  +C VLN+++  + +DL++YR  L K+ Q ++ + + +
Sbjct: 117 TLLELSEGNITQDEESLAEFLPNICTVLNISESKLQKDLEIYRGNLDKIAQVRQAMDDIL 176

Query: 204 DREKKKRE 211
           + +KK+RE
Sbjct: 177 EAQKKRRE 184


>gi|425459592|ref|ZP_18839078.1| Protein thf1 [Microcystis aeruginosa PCC 9808]
 gi|389822632|emb|CCI29709.1| Protein thf1 [Microcystis aeruginosa PCC 9808]
          Length = 233

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 75/217 (34%), Positives = 124/217 (57%), Gaps = 14/217 (6%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRHDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 176
            L D          F YSR FA+GL+ +L  A           EP +L+K   +L+++  
Sbjct: 130 -LSDSLVSVINAAKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 187

Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
            + +DLDVYR  L K+ Q  +++++ ++ EKKKR+++
Sbjct: 188 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQK 224


>gi|440752363|ref|ZP_20931566.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
           TAIHU98]
 gi|440176856|gb|ELP56129.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
           TAIHU98]
          Length = 228

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 75/217 (34%), Positives = 124/217 (57%), Gaps = 14/217 (6%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 6   TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 66  GYRPGEDKPNIFNALCQAVNGNPEVYRHDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 124

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 176
            L D          F YSR FA+GL+ +L  A           EP +L+K   +L+++  
Sbjct: 125 -LSDSLVSVINAAKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 182

Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
            + +DLDVYR  L K+ Q  +++++ ++ EKKKR+++
Sbjct: 183 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQK 219


>gi|22298677|ref|NP_681924.1| Thf1-like protein [Thermosynechococcus elongatus BP-1]
 gi|81743247|sp|Q8DJT8.1|THF1_THEEB RecName: Full=Protein thf1
 gi|22294857|dbj|BAC08686.1| tlr1134 [Thermosynechococcus elongatus BP-1]
          Length = 222

 Score =  130 bits (327), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 85/223 (38%), Positives = 129/223 (57%), Gaps = 16/223 (7%)

Query: 6   PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
           P TV++TK  F   + RPI SIY   ++EL+V+ HL+R    ++Y P+FALG VT +D+ 
Sbjct: 4   PRTVSDTKRAFYAAHTRPIHSIYRRFIEELLVEIHLLRVNVDFRYSPLFALGVVTAFDQF 63

Query: 66  MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 125
           MEGY  E DR+ IF A   A + +P+Q + DA   +++     +  L E  S  G+    
Sbjct: 64  MEGYQPEGDRDRIFHALCVAEEMNPQQLKEDAASWQQYQGRPLSQILDELNS--GQPSAP 121

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLL-ELANATEPTV-----LEKLCAVLNVNKRSVD 179
           L  +    +GK    YSR  AVGL+  L ELA   E T+     L++L  V+ +    V 
Sbjct: 122 LNSLNH--TGK----YSRLHAVGLYAFLQELAG--EVTIHLNETLDQLAPVIPLPIEKVK 173

Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
           RDL++YR+ L K+ QA+ L+KE V++E+K+R ++T    A +A
Sbjct: 174 RDLELYRSNLDKINQARSLMKELVEQERKRRAQQTSAPPAVDA 216


>gi|425436789|ref|ZP_18817221.1| Protein thf1 [Microcystis aeruginosa PCC 9432]
 gi|425451594|ref|ZP_18831415.1| Protein thf1 [Microcystis aeruginosa PCC 7941]
 gi|389678450|emb|CCH92698.1| Protein thf1 [Microcystis aeruginosa PCC 9432]
 gi|389767069|emb|CCI07461.1| Protein thf1 [Microcystis aeruginosa PCC 7941]
          Length = 233

 Score =  130 bits (327), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 75/223 (33%), Positives = 127/223 (56%), Gaps = 14/223 (6%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENIIAIAKETNIDSLLSQLQNPALGGNNQ- 129

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 176
            L D          F YSR FA+GL+ +L  A           EP +L+K   +L+++  
Sbjct: 130 -LSDSLVSVINAAKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 187

Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
            + +DLDVYR  L K+ Q  +++++ ++ EKKKR+++ + ++ 
Sbjct: 188 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQT 230


>gi|422302142|ref|ZP_16389506.1| Protein thf1 [Microcystis aeruginosa PCC 9806]
 gi|389788699|emb|CCI15466.1| Protein thf1 [Microcystis aeruginosa PCC 9806]
          Length = 233

 Score =  130 bits (327), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 79/226 (34%), Positives = 128/226 (56%), Gaps = 19/226 (8%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 176
            L D          F YSR FA+GL+ +L  A           EP +L+K   +L+++  
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREP-ILQKFSEILHLSSE 187

Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR-----EERTEPQ 217
            + +DLDVYR+ L K+ Q  +++++ ++ EKKKR     E++T PQ
Sbjct: 188 KLQKDLDVYRSNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQTTPQ 233


>gi|425445848|ref|ZP_18825868.1| Protein thf1 [Microcystis aeruginosa PCC 9443]
 gi|389734049|emb|CCI02237.1| Protein thf1 [Microcystis aeruginosa PCC 9443]
          Length = 233

 Score =  130 bits (326), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 79/226 (34%), Positives = 127/226 (56%), Gaps = 19/226 (8%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 176
            L D          F YSR FA+GL+ +L  A           EP +L+K   +L+++  
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 187

Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR-----EERTEPQ 217
            + +DLDVYR  L K+ Q  +++++ ++ EKKKR     E++T PQ
Sbjct: 188 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQTTPQ 233


>gi|443328840|ref|ZP_21057433.1| photosystem II biogenesis protein Psp29 [Xenococcus sp. PCC 7305]
 gi|442791576|gb|ELS01070.1| photosystem II biogenesis protein Psp29 [Xenococcus sp. PCC 7305]
          Length = 270

 Score =  129 bits (325), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 71/218 (32%), Positives = 125/218 (57%), Gaps = 11/218 (5%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK +F   Y +PI S+Y  +++EL+V+ HL+     ++ DP+F LG V+ ++RLM+
Sbjct: 12  TVSDTKRSFYNNYNKPINSVYRRIVEELLVEMHLLSVNADFKSDPIFYLGVVSCFERLMQ 71

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  ++D+ AIF A   A+  DPE YR  A  L   A+ ++   L+ +  +   + G  +
Sbjct: 72  GYQPDQDKGAIFNALCRAVDGDPESYRAQAGNLLAIAKEKSGEELIAWLGEPTAIAG-AE 130

Query: 128 DIAE---RASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRS 177
           +IAE     +   NF YSR F +GL+ LLE A+A           + E +   L++    
Sbjct: 131 NIAETIKSIAANANFKYSRPFGIGLYTLLEEADAKLLEDSDKRNEIFENIAKTLSLPGDK 190

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
           + +DL++YR+ L K+ Q  + +++ +   +K+RE+R +
Sbjct: 191 MKKDLELYRSNLEKMEQVLKAIEDALQASRKQREKRAQ 228


>gi|425453632|ref|ZP_18833389.1| Protein thf1 [Microcystis aeruginosa PCC 9807]
 gi|389800936|emb|CCI19831.1| Protein thf1 [Microcystis aeruginosa PCC 9807]
          Length = 233

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 75/217 (34%), Positives = 124/217 (57%), Gaps = 14/217 (6%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 176
            L D          F YSR FA+GL+ +L  A           EP +L+K   +L+++  
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREP-ILQKFSEILHLSSE 187

Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
            + +DLDVYR  L K+ Q  +++++ ++ EKKKR+++
Sbjct: 188 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQK 224


>gi|443669636|ref|ZP_21134837.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
           DIANCHI905]
 gi|159030831|emb|CAO88510.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443330085|gb|ELS44832.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
           DIANCHI905]
          Length = 228

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 75/217 (34%), Positives = 124/217 (57%), Gaps = 14/217 (6%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 6   TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 66  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENIIAIAKETNIDSLLSQLQNPALGGNNQ- 124

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 176
            L D          F YSR FA+GL+ +L  A           EP +L+K   +L+++  
Sbjct: 125 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 182

Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
            + +DLDVYR  L K+ Q  +++++ ++ EKKKR+++
Sbjct: 183 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQK 219


>gi|390439536|ref|ZP_10227927.1| Protein thf1 [Microcystis sp. T1-4]
 gi|389837025|emb|CCI32051.1| Protein thf1 [Microcystis sp. T1-4]
          Length = 233

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/234 (32%), Positives = 127/234 (54%), Gaps = 36/234 (15%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY   ED+  IF A   A+  +PE YR DA+ +   A             KE  ++ LL 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIA-------------KETNIDSLLS 117

Query: 128 DIAERASGKGN--------------FSYSRFFAVGLFRLLELANAT--------EPTVLE 165
            +  +A G  N              F YSR FA+GL+ +L  A           EP +L+
Sbjct: 118 QLQNQALGGDNQLSDSLVSLINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREP-ILQ 176

Query: 166 KLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
           K   +L+++   + +DLDVYR  L K+ Q  +++++ ++ EKKKR+++ + ++ 
Sbjct: 177 KFSEILHLSGEKLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQT 230


>gi|425470743|ref|ZP_18849603.1| Protein thf1 [Microcystis aeruginosa PCC 9701]
 gi|389883502|emb|CCI36111.1| Protein thf1 [Microcystis aeruginosa PCC 9701]
          Length = 233

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 77/225 (34%), Positives = 125/225 (55%), Gaps = 17/225 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 177
            L D          F YSR FA+GL+ +L  A             +L+K   +L+++   
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREQILQKFSEILHLSSEK 188

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR-----EERTEPQ 217
           + +DLDVYR  L K+ Q  +++++ ++ EKKKR     E++T PQ
Sbjct: 189 LQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQTTPQ 233


>gi|425441488|ref|ZP_18821762.1| Protein thf1 [Microcystis aeruginosa PCC 9717]
 gi|425463770|ref|ZP_18843100.1| Protein thf1 [Microcystis aeruginosa PCC 9809]
 gi|389717772|emb|CCH98181.1| Protein thf1 [Microcystis aeruginosa PCC 9717]
 gi|389829228|emb|CCI29632.1| Protein thf1 [Microcystis aeruginosa PCC 9809]
          Length = 233

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 73/222 (32%), Positives = 124/222 (55%), Gaps = 12/222 (5%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
           GY   ED+  IF A   A+  +PE YR DA+ +   A+     SL   ++ P+  G  + 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 177
            L D          F YSR FA+GL+ +L  A             +L+K   +L ++   
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREQILQKFSEILRLSSEK 188

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
           + +DLDVYR  L K+ Q  +++++ ++ EKKKR+++ + ++ 
Sbjct: 189 LQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQT 230


>gi|423062334|ref|ZP_17051124.1| Thf1-like protein [Arthrospira platensis C1]
 gi|406716242|gb|EKD11393.1| Thf1-like protein [Arthrospira platensis C1]
          Length = 215

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 70/192 (36%), Positives = 109/192 (56%), Gaps = 9/192 (4%)

Query: 31  VLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDP 90
           +++EL+V+ HL+     ++YDP++ALG VT +DR M+GY  E D+ +I+ A I A + DP
Sbjct: 1   MVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYIPEADKLSIWAALIMAQESDP 60

Query: 91  EQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVG 148
            QYR DA  LE  A   +   L E    ++E   +  L+      +    F YSR FA+G
Sbjct: 61  NQYRADATALEAQAATLSVKDLTERAKIAQESSGDDPLQSCFHAIANNPKFKYSRLFAIG 120

Query: 149 LFRLLELANATEP-------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 201
           L+ LLE ++ T         T+L      L + K  +++DLD+YR  L K+ QA+ +++E
Sbjct: 121 LYTLLEKSDVTAAQDSEGLKTILSDFSEALRLPKDKLEKDLDLYRTNLEKVAQARLMVEE 180

Query: 202 YVDREKKKREER 213
               E+KKRE+R
Sbjct: 181 MTQAERKKREQR 192


>gi|170077355|ref|YP_001733993.1| Thf1-like protein [Synechococcus sp. PCC 7002]
 gi|254784146|sp|B1XHY6.1|THF1_SYNP2 RecName: Full=Protein thf1
 gi|169885024|gb|ACA98737.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
          Length = 254

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 74/215 (34%), Positives = 125/215 (58%), Gaps = 15/215 (6%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK +F   + RPI SI+  V++EL+V+ HL+     ++YDP +ALG VT ++R M+
Sbjct: 6   TVSDTKRDFYTHHTRPINSIFRRVVEELLVEMHLLSVNADFRYDPFYALGVVTSFERFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 125
           GY  E D+ +IFQ+   A+  D  +Y+ DA  L E A+  + + L+E   ++   EG   
Sbjct: 66  GYRPEADKVSIFQSMCQAIGGDANRYKEDAMALVELAKRCSGTQLIECFRQDVPPEGAQE 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK----------LCAVLNVNK 175
           L +  E  +   +F YSR FA+G++  L     +EP +LE           + A LN+ +
Sbjct: 126 LWEKIEAIAKNDHFKYSRLFAIGVYTFL---GESEPQLLEDTEKRDEMLTTVTAGLNLPE 182

Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR 210
             + +DLD+YR+ L K+ Q  E+L++ +  E+++R
Sbjct: 183 EKMKKDLDLYRSNLEKMNQVLEVLEDALAVERQRR 217


>gi|434398071|ref|YP_007132075.1| Protein thf1 [Stanieria cyanosphaera PCC 7437]
 gi|428269168|gb|AFZ35109.1| Protein thf1 [Stanieria cyanosphaera PCC 7437]
          Length = 238

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 74/223 (33%), Positives = 132/223 (59%), Gaps = 12/223 (5%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++ K +F + + RPI S+Y  V++EL+V+ HL+     ++ DP++ LG VT ++RLM+
Sbjct: 16  TVSDAKRDFYQHHTRPINSVYRRVVEELLVEMHLLSVNVDFKSDPIYYLGVVTSFERLMQ 75

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG--- 124
           GY  E+D+E+IF A   A+ EDPE+ R  A  L   A+ ++   LV + S+   +E    
Sbjct: 76  GYRPEQDKESIFNALCRAVGEDPERNRAQAGSLLNLAKNKSPQELVAWLSEPTPLENYHD 135

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRS 177
           +++ I   AS   +F YSR FA+GL+ LLE ++       +    +LE +   L++    
Sbjct: 136 IIEPIKAIASNP-HFKYSRLFAIGLYTLLEESDPEILKDVSKRNEILESIATQLHLPGEK 194

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDR-EKKKREERTEPQKA 219
           +++DL++YR+ L K+ Q   ++++ +    K+K + + EP+ A
Sbjct: 195 MNKDLELYRSNLEKMEQLLSVIEDVLQAGRKQKNQPKPEPETA 237


>gi|254423933|ref|ZP_05037651.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
           7335]
 gi|196191422|gb|EDX86386.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
           7335]
          Length = 250

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 70/208 (33%), Positives = 120/208 (57%), Gaps = 14/208 (6%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F   + RPI +IY  V++EL+V+ HL+     + YD ++ALG V+ YDR M+
Sbjct: 6   TVSDTKRAFYSQHTRPINAIYRRVVEELMVEAHLLLVNADFNYDSIYALGVVSTYDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS--KEGEVEG- 124
           GY    DR+ I++A + A + DP+QYR DA++L      ++  S+  F S   E + E  
Sbjct: 66  GYEPAGDRDNIYRAILQANEADPDQYRRDAEEL--LGVAKSLPSIDAFKSILDEAKTESG 123

Query: 125 --LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNK 175
              LK    +A     F YSR FA+GL+ ++E  +A           ++ ++ + + +N+
Sbjct: 124 SDTLKANLHKAISNPKFKYSRLFAIGLYNVIESIDADMLNDKDKRDALMAEIASTIGLNE 183

Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYV 203
             + +D+D+YR  L K+ QA+E++K+ +
Sbjct: 184 DLLKKDIDLYRGNLEKMAQAQEVMKDMI 211


>gi|166367182|ref|YP_001659455.1| Thf1-like protein [Microcystis aeruginosa NIES-843]
 gi|166089555|dbj|BAG04263.1| Psb29 Photosystem II sub-stoichiometric subunit [Microcystis
           aeruginosa NIES-843]
          Length = 233

 Score =  126 bits (316), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 74/233 (31%), Positives = 123/233 (52%), Gaps = 34/233 (14%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K +F   + RPI S+Y  V++EL+V+ HL+     + YDP++ALG VT +++ ME
Sbjct: 11  TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY   ED+  IF A   A+  +PE YR DA+ +   A             KE  ++ LL 
Sbjct: 71  GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIA-------------KETNIDSLLS 117

Query: 128 DIAERASGKGN--------------FSYSRFFAVGLFRLLELANAT-------EPTVLEK 166
            +   A G  N              F YSR FA+GL+ +L  A             +L+K
Sbjct: 118 QLQNPALGANNQLSDSLVSLINAPKFKYSRLFAIGLYTILAEAQPDIIKEKEKREQILQK 177

Query: 167 LCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
              +L ++   + +DLDVYR  L K+ Q  +++++ ++ EKKKR+++ + ++ 
Sbjct: 178 FSEILRLSSEKLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQT 230


>gi|427711975|ref|YP_007060599.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
           6312]
 gi|427376104|gb|AFY60056.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
           6312]
          Length = 245

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 81/229 (35%), Positives = 123/229 (53%), Gaps = 26/229 (11%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F   + RPI SI+   ++EL+V+ HL+R    + Y P+ ALG VT Y+  M 
Sbjct: 6   TVSDTKKAFYAAHTRPIHSIFRRFVEELLVEVHLLRVNTNFVYSPLLALGIVTAYNHFMS 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA------SSLVEFPSKEGE 121
           GY  E DR +IF ++  A + DP+Q + DA +   W            + L  + S+ G+
Sbjct: 66  GYRPETDRNSIFTSFAIAEEFDPQQLQADAAR---WEELAGLELEELQTRLQAWISEGGD 122

Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNV 173
                L+D       K    YSR  A+GL+ LLE A         T    LE+L  V+N+
Sbjct: 123 PWHNSLRDAVNNPQTK----YSRLQAIGLYHLLEQAAGNLTQELTTLEASLEQLSPVVNL 178

Query: 174 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
               V +DL++YR+ L K++QA++++ E V+ E+K+RE     Q ANEA
Sbjct: 179 PVDKVKKDLELYRSNLDKMIQAQKIMAELVEVERKRRE-----QAANEA 222


>gi|428223137|ref|YP_007107307.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
           7502]
 gi|427996477|gb|AFY75172.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
           7502]
          Length = 226

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 72/205 (35%), Positives = 115/205 (56%), Gaps = 10/205 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TVA+ K +F K + +P+ SIY  V+ EL+V+ HL+R  + + YD +FALG  T +DR M 
Sbjct: 6   TVADAKHDFYKAFSKPVNSIYRRVVDELLVEVHLLRVSQNFGYDSIFALGLATAFDRFMA 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE--GEVEGL 125
           GY  E D E IF+    AL  DP+Q R ++  L E ++   A     F + E   +++ L
Sbjct: 66  GYQPESDLEPIFKGLCQALLFDPDQIRQESAHLIELSKQFPAEVKSLFTTLEAGADLDTL 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 178
           +  I   A+    F YSR FAVG+F LLE A+            ++ ++   L +N   +
Sbjct: 126 MGQIRAIATNP-KFKYSRLFAVGVFILLETADPEAIADQDKRQALITQVGDTLKINSERL 184

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYV 203
            +DLD+YR+ L K+ Q ++++++ V
Sbjct: 185 LKDLDLYRSNLEKIQQGRQMMEDMV 209


>gi|443478915|ref|ZP_21068602.1| Protein thf1 [Pseudanabaena biceps PCC 7429]
 gi|443015728|gb|ELS30564.1| Protein thf1 [Pseudanabaena biceps PCC 7429]
          Length = 240

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 74/218 (33%), Positives = 121/218 (55%), Gaps = 16/218 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK +F   + +P+  +Y  V+ EL+V+ HL++  +T+ YD +FALGFVT +DR   
Sbjct: 6   TVSDTKKDFYLAFPKPVNQVYRRVVDELLVEIHLLKVNQTFVYDAIFALGFVTTFDRFTA 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWAR---GQTASSLVEFPSKEG--EV 122
           GY  E DR A+F A   AL+ D ++ R DA  L + A        + L    S      +
Sbjct: 66  GYKPETDRFAVFHALCAALQFDSDRIRQDAATLSDLATRSPNDIKTLLTNLDSGISLEPL 125

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLEKLCAVLNVNK 175
            G L+ I    S K NF YSR   VGL+ LLE+++  E         +++ +   L    
Sbjct: 126 SGQLQII----STKENFKYSRLLGVGLYALLEISDPEEIADSAKREELIKLVGETLKFGS 181

Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
             + +D+D+YR+ L K+ QA++++ + V+ E+KKR ++
Sbjct: 182 DRLLKDVDLYRSNLDKIEQARQMIADMVEAERKKRSQK 219


>gi|427726046|ref|YP_007073323.1| Protein thf1 [Leptolyngbya sp. PCC 7376]
 gi|427357766|gb|AFY40489.1| Protein thf1 [Leptolyngbya sp. PCC 7376]
          Length = 246

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 75/216 (34%), Positives = 117/216 (54%), Gaps = 15/216 (6%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++ K +F   + RPI SI+  V++EL+V+ HL+     ++YDP +ALG VT Y+R M+
Sbjct: 6   TVSDAKRDFYGQHTRPINSIFRRVVEELLVEMHLVSVNVDFRYDPFYALGIVTSYERFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 125
           GY  E D+ +IFQA   A+    E Y+ DA  L E A+  +   LV+   ++   EG   
Sbjct: 66  GYRPESDKISIFQAMCQAVGGSAEFYKNDATALVELAKRCSGQQLVDCFRQDNAPEGAGE 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLE----------KLCAVLNVNK 175
           L    E  +    F YSR FA+GL+  L      EP +LE           L   +N+  
Sbjct: 126 LWAKVEAIAANKKFKYSRLFAIGLYTFL---GEAEPALLEDADKRDEMLATLTEAMNLPG 182

Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 211
             + +DLD+YR+ L K+ Q   ++++ +  E+K+RE
Sbjct: 183 EKMKKDLDLYRSNLEKMTQVLAVIEDALVAERKRRE 218


>gi|126658461|ref|ZP_01729609.1| hypothetical protein CY0110_21090 [Cyanothece sp. CCY0110]
 gi|126620203|gb|EAZ90924.1| hypothetical protein CY0110_21090 [Cyanothece sp. CCY0110]
          Length = 246

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 73/212 (34%), Positives = 119/212 (56%), Gaps = 12/212 (5%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F   + RPI SIY   ++EL+V+ HL+     ++YDP++ALG VT ++R M+
Sbjct: 6   TVSDTKRKFYGYHTRPINSIYRRFVEELLVEMHLLSVNVDFKYDPIYALGVVTSFERFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE--VEGL 125
           GY  E D+ +IF A   A+  + EQY  +A+ L   A+G    S+ EF  K G+   +G+
Sbjct: 66  GYRPESDKASIFNALCQAVDGNSEQYHQEAEALINEAKG---LSMTEFKDKLGQEGGDGI 122

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNVNKRSV 178
           L       +    F YSR F VGL+ LL     EL    E     ++++   L  +   +
Sbjct: 123 LWGTCNAIAQNPKFKYSRLFGVGLYTLLMEIDPELVKEEEKRNQTIKEVSEALQFSSDKL 182

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKR 210
            +DLD+YR+ L K+ Q   ++++ ++ ++KKR
Sbjct: 183 QKDLDLYRSNLDKMQQLLTVIEDTLEADRKKR 214


>gi|428203624|ref|YP_007082213.1| photosystem II biogenesis protein Psp29 [Pleurocapsa sp. PCC 7327]
 gi|427981056|gb|AFY78656.1| photosystem II biogenesis protein Psp29 [Pleurocapsa sp. PCC 7327]
          Length = 241

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 73/225 (32%), Positives = 130/225 (57%), Gaps = 10/225 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++ K +F   + RPI SIY   ++ELIV+ HL+     ++YD ++ALG VT ++R M+
Sbjct: 11  TVSDAKRDFYTHHTRPINSIYRRFVEELIVEMHLLSVNTDFRYDAIYALGVVTAFERFMQ 70

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE--FPSKEGEVEGL 125
           GY  E+D+ +IF A   A   + EQYR +A ++   A+  +   L+     S     E  
Sbjct: 71  GYQPEQDKSSIFAALCQATGGNWEQYRQEAGEILAQAKQMSVQELIAKINSSTPTGGENR 130

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT---EPT----VLEKLCAVLNVNKRSV 178
           L +  +  + + N+ YSR FA+GL+ LL  A+     +P      L+++   L+++   +
Sbjct: 131 LVETLQAIANRSNYKYSRLFAIGLYTLLAEADPDILRDPEKRDRTLKEVTEALHLSPEKL 190

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAI 223
            +DLD+YR+ L K+ Q  ++L+E ++ E+KKR+++ +P++    I
Sbjct: 191 QKDLDLYRSNLDKMDQLLKVLEEALEAERKKRQQQ-KPEQGTAQI 234


>gi|209522934|ref|ZP_03271491.1| Thf1-like protein [Arthrospira maxima CS-328]
 gi|209496521|gb|EDZ96819.1| Thf1-like protein [Arthrospira maxima CS-328]
          Length = 210

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 68/187 (36%), Positives = 104/187 (55%), Gaps = 9/187 (4%)

Query: 36  IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 95
           +V+ HL+     ++YDP++ALG VT +DR M+GY  E D+ +I+ A I A + DP QYR 
Sbjct: 1   MVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYIPEADKLSIWAALIMAQESDPNQYRA 60

Query: 96  DAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL 153
           DA  LE  A   +   L E    ++E   +  L+      +    F YSR FA+GL+ LL
Sbjct: 61  DATALEAQAATLSVKDLTERAKIAQESSGDDPLQSCFHAIANNPKFKYSRLFAIGLYTLL 120

Query: 154 ELANATEP-------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDRE 206
           E ++ T         T+L      L + K  +++DLD+YR  L K+ QA+ +++E    E
Sbjct: 121 EKSDVTAAQDSEGLKTILSDFSEALRLPKDKLEKDLDLYRTNLEKVAQARLMVEEMTQAE 180

Query: 207 KKKREER 213
           +KKRE+R
Sbjct: 181 RKKREQR 187


>gi|97202816|sp|P0C1D1.1|THF1_SYNJB RecName: Full=Protein thf1
          Length = 239

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 124/232 (53%), Gaps = 14/232 (6%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T++ TK  F   Y RPI ++Y  V++EL+V+ HL     T+ YDP FALG VT+YD LME
Sbjct: 6   TLSATKAAFFSAYPRPINAVYRRVVEELLVELHLTTVNSTFVYDPFFALGLVTLYDGLME 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE--GL 125
            Y   E REAIF A   AL   PE  R +A+ L E          ++    + E E  G 
Sbjct: 66  AYHPPEQREAIFNALCKALHLKPEVLRKNARDLLELMGSGDPRQRLDLLCLKPEAEDVGG 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT---EP-----TVLEKLCAVLNVNKRS 177
           LK I ER + +  ++YSR  AVGL+   E+   +   EP       LE + + L  +   
Sbjct: 126 LKAILERMT-QPPYAYSRVLAVGLYTAYEVVAKSLYEEPEERTRRFLENVVSKLPFSTER 184

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGE 229
           V +DL++YR+ L ++ QA+ +++E V   ++++E R   Q A    +  LG+
Sbjct: 185 VRKDLELYRSSLDRMKQARAVVEEMVKAARRQQERR---QSAASLPETSLGD 233


>gi|172035357|ref|YP_001801858.1| Thf1-like protein [Cyanothece sp. ATCC 51142]
 gi|354555452|ref|ZP_08974753.1| Protein thf1 [Cyanothece sp. ATCC 51472]
 gi|254784140|sp|B1WNF0.1|THF1_CYAA5 RecName: Full=Protein thf1
 gi|171696811|gb|ACB49792.1| photosystem II 22 kD protein [Cyanothece sp. ATCC 51142]
 gi|353552511|gb|EHC21906.1| Protein thf1 [Cyanothece sp. ATCC 51472]
          Length = 242

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 69/210 (32%), Positives = 118/210 (56%), Gaps = 8/210 (3%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F   + RPI SIY   ++EL+V+ HL+     ++YDP++ALG VT ++R M+
Sbjct: 6   TVSDTKRKFYGYHTRPINSIYRRFVEELLVEMHLLSVNVDFKYDPIYALGVVTSFERFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E D+ +IF A   A+  + EQY  +A+ L   A+G + +   E   +EG  +G+L 
Sbjct: 66  GYSPESDKTSIFNALCQAVDGNSEQYHQEAEALINEAKGLSITEFKEKLGQEGG-DGILW 124

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNVNKRSVDR 180
                 +    F YSR F VGL+ LL     +L    +     ++++   L  +   + +
Sbjct: 125 GTCGAIAQNPKFKYSRLFGVGLYTLLMEIDPDLVKEEDKRNQTIKEVSDALQFSSDKLQK 184

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKR 210
           DLD+YR+ L K+ Q   ++++ ++ ++KKR
Sbjct: 185 DLDLYRSNLDKMQQLLTVIEDTLEADRKKR 214


>gi|409992261|ref|ZP_11275462.1| inositol phosphatase [Arthrospira platensis str. Paraca]
 gi|409936888|gb|EKN78351.1| inositol phosphatase [Arthrospira platensis str. Paraca]
          Length = 210

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 67/187 (35%), Positives = 101/187 (54%), Gaps = 9/187 (4%)

Query: 36  IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 95
           +V+ HL+     ++YDP++ALG VT +DR M+GY  E D+ +I+ A I A + DP QYR 
Sbjct: 1   MVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYTPETDKLSIWAALIGAQESDPNQYRA 60

Query: 96  DAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL 153
           DA  LE  A       L +    ++E   +  L+      +    F YSR  A+GL+ LL
Sbjct: 61  DATALEAQAASLAVKDLTDKAKIAQESSGDDPLQSCFHAIANNPKFKYSRLLAIGLYTLL 120

Query: 154 ELANATEP-------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDRE 206
           E ++AT         T+L      L + K  + +DLD+YR  L K+ QA+ ++ E    E
Sbjct: 121 EKSDATAAQDSEGLKTILSDFSEALRLPKDKLVKDLDLYRTNLEKVAQARLMVDEMTQAE 180

Query: 207 KKKREER 213
           +KKRE+R
Sbjct: 181 RKKREQR 187


>gi|148242504|ref|YP_001227661.1| Thf1-like protein [Synechococcus sp. RCC307]
 gi|147850814|emb|CAK28308.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
          Length = 237

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 74/228 (32%), Positives = 125/228 (54%), Gaps = 10/228 (4%)

Query: 1   MISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVT 60
           M+   P TVA++K  F   Y   IP +Y  V+ EL+V+ HL+  +  +Q D +FA+G   
Sbjct: 1   MVLSNPQTVADSKRRFYAAYPHVIPGLYRRVVDELLVELHLLAGQAGFQADSLFAMGLTQ 60

Query: 61  VYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG 120
           V+D LM+G+   E ++ +F A  +      +Q R DA++L E       + +  +  ++G
Sbjct: 61  VFDNLMQGFKPAERQKELFAAICSGAGLKADQLRKDAKQLREHLVPHGEAEIKSWIEQQG 120

Query: 121 E-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLE---LANATEPTVLE----KLCAVLN 172
           +    +LK + ++A G+ +F YSR  AVGL  LL+     +  +P  L+    +L   + 
Sbjct: 121 QGAPDVLKHVLQQA-GRSDFHYSRLHAVGLMGLLQDLSGGDDQDPQALQERAHQLGHSMG 179

Query: 173 VNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER-TEPQKA 219
           + K  + +D+ +Y + L K+ QA ELL+E V  E++KRE+R  EP  A
Sbjct: 180 LQKDKLQKDMGLYASNLEKMSQAVELLEETVAAERRKREQRQGEPASA 227


>gi|86606816|ref|YP_475579.1| Thf1-like protein [Synechococcus sp. JA-3-3Ab]
 gi|97202812|sp|Q2JSQ3.1|THF1_SYNJA RecName: Full=Protein thf1
 gi|86555358|gb|ABD00316.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
          Length = 239

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 82/218 (37%), Positives = 116/218 (53%), Gaps = 15/218 (6%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T++ TK  F   Y RPI + Y  V++EL+V+ HL      + YDP FALG VT+YD LME
Sbjct: 6   TLSATKAAFFSAYPRPINAAYRRVVEELLVELHLTTVNSAFVYDPFFALGLVTLYDSLME 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG----QTASSLVEFPSKEGEVE 123
            Y   E REAIF A   AL   PE  R +A+ L E  R     Q  + L   P  E E  
Sbjct: 66  AYHPPEQREAIFNALCKALHLKPEVLRKNARDLLELMRSGDPVQRYNLLCLKP--EAEDV 123

Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT---EP-----TVLEKLCAVLNVNK 175
           G LK I +R + +  ++YSR  AVGL+   E    +   EP       LE +   L  + 
Sbjct: 124 GGLKAILQRMT-QPPYAYSRVLAVGLYTAYEAVATSLYKEPEERTRHFLEDVIGNLPFSP 182

Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
             V +DL++YR+ L +L QA+ +++E V   ++++E R
Sbjct: 183 ERVKKDLELYRSNLDRLKQARAIVEEMVKAARRQQERR 220


>gi|428773451|ref|YP_007165239.1| photosystem II biogenesis protein Psp29 [Cyanobacterium stanieri
           PCC 7202]
 gi|428687730|gb|AFZ47590.1| photosystem II biogenesis protein Psp29 [Cyanobacterium stanieri
           PCC 7202]
          Length = 233

 Score =  116 bits (291), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 78/234 (33%), Positives = 123/234 (52%), Gaps = 27/234 (11%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++T+  F + + RPI SIY  V+QEL+V+ HL+     +Q D V+A+G    +++ M 
Sbjct: 6   TVSDTRRAFYQYHTRPINSIYRQVVQELMVEMHLLSVNTDFQPDAVYAVGVCQSFEQFMT 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 125
           GY  EED+ +IF A   A++ +P+ YR  ++ L  +  G++A  LV +        GL  
Sbjct: 66  GYKPEEDKTSIFNALCKAIEANPDDYRHQSESLLNFVEGKSAEDLVNWLLNPVADNGLDE 125

Query: 126 -----LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL---EKLCAVLNVNKRS 177
                LK I ER      F YSR F +G + L+   N   P V    EKL  ++      
Sbjct: 126 NIVNSLKSILERER----FKYSRLFGIGFYTLI---NKVAPDVAKDEEKLAKLIAPYSEK 178

Query: 178 VD-------RDLDVYRNLLSKLLQAKELLKEYVDREKKKR---EERTEPQKANE 221
           +D       +D+D+YR+ L K+ Q   ++ E ++  KKKR   E+  E ++ANE
Sbjct: 179 LDLPVDKLKKDVDLYRSNLDKINQMLVVIAETIEASKKKRINIEKTEEKEEANE 232


>gi|119510704|ref|ZP_01629832.1| hypothetical protein N9414_22068 [Nodularia spumigena CCY9414]
 gi|119464658|gb|EAW45567.1| hypothetical protein N9414_22068 [Nodularia spumigena CCY9414]
          Length = 200

 Score =  116 bits (291), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 63/193 (32%), Positives = 117/193 (60%), Gaps = 17/193 (8%)

Query: 36  IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 95
           +V+ HL+     + Y+P++ALG VT +DR M+GY  E+D+E+IFQA   A++++P++YR 
Sbjct: 1   MVEMHLLSVNSGFSYNPIYALGVVTSFDRFMQGYLPEQDQESIFQALCQAVEQEPQRYRE 60

Query: 96  DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 149
           DA++L+  A+    + L+ + S      ++ +++  L+ IA  +     F YSR FAVGL
Sbjct: 61  DAKRLQALAKDLPVNDLIAWLSQTTHLDRDPDLQAQLQAIAHNS----EFKYSRLFAVGL 116

Query: 150 FRLLELANA-------TEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEY 202
           F LLE ++             L+ + A L+++   +++DL++Y + L K+ QA  ++ + 
Sbjct: 117 FTLLEQSDPELVKDEKQRTEALKTIAAGLHLSDEKLNKDLELYSSNLEKMAQALVVMADM 176

Query: 203 VDREKKKREERTE 215
           +  ++KKRE+R +
Sbjct: 177 LSADRKKREQRQQ 189


>gi|428769945|ref|YP_007161735.1| Protein thf1 [Cyanobacterium aponinum PCC 10605]
 gi|428684224|gb|AFZ53691.1| Protein thf1 [Cyanobacterium aponinum PCC 10605]
          Length = 234

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 63/214 (29%), Positives = 120/214 (56%), Gaps = 11/214 (5%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK +F + ++RPI SIY  V++EL+V+ HL+     +  DP++ LG    + + M+
Sbjct: 18  TVSDTKRSFYQHHQRPINSIYRRVVEELMVEMHLLAVNVDFNPDPIYYLGVYQSFQQFMQ 77

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF---PSKEGEVEG 124
           GY  E D+E+IF A   +++ +P++Y   +Q L  +  G++A  ++++   PS EG++E 
Sbjct: 78  GYKPESDKESIFNALCQSIENNPQEYISKSQTLLNFVEGKSAQEILDWLLNPSGEGDLEA 137

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRS 177
           +             F YSR FA+G + L+E  +       +     ++ L   L +    
Sbjct: 138 VASHWRSNLENP-RFKYSRLFAIGFYTLIEKGDGEFIKDESKFTDFIQPLIDKLQLPVEK 196

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 211
           + +DLD+YR+ L K+ Q   ++ + ++ E+KK++
Sbjct: 197 LKKDLDLYRSNLEKMNQMLSVMADVLEAERKKKQ 230


>gi|16330615|ref|NP_441343.1| Thf1-like-protein [Synechocystis sp. PCC 6803]
 gi|383322356|ref|YP_005383209.1| hypothetical protein SYNGTI_1447 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383325525|ref|YP_005386378.1| hypothetical protein SYNPCCP_1446 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383491409|ref|YP_005409085.1| hypothetical protein SYNPCCN_1446 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384436676|ref|YP_005651400.1| hypothetical protein SYNGTS_1447 [Synechocystis sp. PCC 6803]
 gi|451814773|ref|YP_007451225.1| hypothetical protein MYO_114600 [Synechocystis sp. PCC 6803]
 gi|81671042|sp|P73956.1|THF1_SYNY3 RecName: Full=Protein thf1
 gi|1653107|dbj|BAA18023.1| sll1414 [Synechocystis sp. PCC 6803]
 gi|339273708|dbj|BAK50195.1| hypothetical protein SYNGTS_1447 [Synechocystis sp. PCC 6803]
 gi|359271675|dbj|BAL29194.1| hypothetical protein SYNGTI_1447 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359274845|dbj|BAL32363.1| hypothetical protein SYNPCCN_1446 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359278015|dbj|BAL35532.1| hypothetical protein SYNPCCP_1446 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407958541|dbj|BAM51781.1| Thf1-like-protein [Bacillus subtilis BEST7613]
 gi|451780742|gb|AGF51711.1| hypothetical protein MYO_114600 [Synechocystis sp. PCC 6803]
          Length = 240

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 72/214 (33%), Positives = 118/214 (55%), Gaps = 8/214 (3%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++ K  F   Y RPI SIY   ++EL+V+ HL+     + YDP+FALG VT ++  M+
Sbjct: 6   TVSDAKRKFFTHYSRPISSIYRRFVEELLVEMHLLSVNIDFTYDPIFALGIVTSFNSFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE-FPSKEGEVEGLL 126
           GY   E   AIF A    + ++P+Q R DA+ +   A      + V    S++   + LL
Sbjct: 66  GYQPAEQLPAIFNALCHGVDQNPDQVRQDAKNVAASAHHIGLDAWVTAAASEQASGDNLL 125

Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNVNKRSVD 179
            +       +  F YSR FA+GL+ LL     E+ +  E     L +L  +L+++   V 
Sbjct: 126 LNTLTGIHQRHKFKYSRLFAIGLYTLLADQDPEVKDNDEKRQDYLTRLSELLDLSLDKVV 185

Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
           +DLD+YR+ L K+ Q  ++L++  + E+KK+E++
Sbjct: 186 KDLDLYRSNLEKVDQLLKVLEDAAEAERKKKEKQ 219


>gi|67921410|ref|ZP_00514928.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
 gi|67856522|gb|EAM51763.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
          Length = 245

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 70/232 (30%), Positives = 123/232 (53%), Gaps = 17/232 (7%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK  F   + +PI SIY   ++EL+V+ HL+     + YDP++ALG VT + R M+
Sbjct: 6   TVSDTKRKFYGYHTQPINSIYRRFVEELLVEMHLLSVNIDFSYDPIYALGVVTSFQRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV----- 122
           GY  E D+ +IF A   A+    E+Y  +A+ +   A+G    S+V+F  K   V     
Sbjct: 66  GYSPESDKPSIFNALCQAVDGSSEKYHQEAEAILNEAKGL---SIVDFKDKLTHVTDNQV 122

Query: 123 -EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVN 174
            EG+L       +    F YSR  A+GL+ LL   ++            ++++   L  +
Sbjct: 123 GEGVLWGTFGAIAANPKFKYSRLLAIGLYTLLMEIDSDLLKDEEKRTETIKEVSEALKFS 182

Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKC 226
              + +DLD+YR+ L K+ Q   ++++ ++ ++KKR   TE + + E +++ 
Sbjct: 183 PEKLRKDLDLYRSNLDKMQQLLTVIEDSLEADRKKRAS-TEGKTSAEVVEQT 233


>gi|282901466|ref|ZP_06309391.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
 gi|281193745|gb|EFA68717.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
          Length = 201

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 112/193 (58%), Gaps = 17/193 (8%)

Query: 36  IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 95
           +V+ HL+     + Y+ ++ALG VT +DR M+GY   ED  +IF A I A+++DP+ YR 
Sbjct: 1   MVEMHLLSVNVDFSYNSIYALGVVTTFDRFMQGYQPSEDLVSIFNAIICAVEQDPQVYRQ 60

Query: 96  DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 149
           DA KL+  A   +   L+ + S      ++  ++  L+ IA+      NF YSR  A+GL
Sbjct: 61  DAAKLKAIANSFSVKDLIAWCSQTTPLDQDANLQAELQAIAQNP----NFKYSRLLAIGL 116

Query: 150 FRLLELAN---ATEPTVLEKLCAV----LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEY 202
           F LLEL++     + T   +  AV    L +++  +++DLD+YR+ L K+ QA  ++ + 
Sbjct: 117 FSLLELSDPEFVKDETQRNQTIAVIAQGLKLSEDKLNKDLDLYRSNLDKMEQALIVMADM 176

Query: 203 VDREKKKREERTE 215
           +  ++KKR++R +
Sbjct: 177 LAADRKKRDQRQQ 189


>gi|282898285|ref|ZP_06306276.1| Protein thf1 [Raphidiopsis brookii D9]
 gi|281196816|gb|EFA71721.1| Protein thf1 [Raphidiopsis brookii D9]
          Length = 202

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 112/193 (58%), Gaps = 17/193 (8%)

Query: 36  IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 95
           +V+ HL+     + Y+ ++ALG VT +DR M+GY   ED  +IF A I A+++DP+ YR 
Sbjct: 1   MVEMHLLSVNVDFSYNSIYALGVVTTFDRFMQGYQPSEDLVSIFNAIICAVEQDPQVYRQ 60

Query: 96  DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 149
           DA KL+  A   +   L+ + S      ++  ++  L+ IA+      NF YSR  A+GL
Sbjct: 61  DAAKLKAIANSFSVKDLIAWCSQTTPLDQDANLQAELQAIAQNP----NFKYSRLLAIGL 116

Query: 150 FRLLELAN---ATEPTVLEKLCAV----LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEY 202
           F LLEL++     + T   +  AV    L +++  +++DLD+YR+ L K+ QA  ++ + 
Sbjct: 117 FSLLELSDPEFVKDETERNQAIAVIAQGLKLSEDKLNKDLDLYRSNLDKMEQALIVMADM 176

Query: 203 VDREKKKREERTE 215
           +  ++KKR++R +
Sbjct: 177 LAADRKKRDQRQQ 189


>gi|124023249|ref|YP_001017556.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9303]
 gi|123963535|gb|ABM78291.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9303]
          Length = 250

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 71/222 (31%), Positives = 117/222 (52%), Gaps = 12/222 (5%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T+A++K  F   +   IPS+Y     EL+V+ HL+ +++ +  D +FA+G   V+D    
Sbjct: 14  TIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVFDVFTR 73

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E   + +F A   +   DP   R  AQK  E  RG     +  +  ++G  +G  +
Sbjct: 74  GYRPEAHVKTLFDALCRSCGFDPNALRKQAQKTLESVRGHDLEEVQGWIQQQG--KGAPE 131

Query: 128 DIAE--RASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 178
            +A+  R +G   F YSR  AVGL  LL  A   E +  EKL  +       +   K  V
Sbjct: 132 ALAQALRNTGSNTFHYSRLMAVGLLSLLASAQGDESSDPEKLSQIAHELSESVGFTKARV 191

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKAN 220
           ++DL++Y++ L K+ QA EL ++ ++ E++KRE++ E +K N
Sbjct: 192 EKDLNLYKSNLEKMAQAVELSEQILESERRKREQK-ESEKLN 232


>gi|9631702|ref|NP_048481.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]
 gi|1131477|gb|AAC96501.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]
 gi|448924789|gb|AGE48370.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           AN69C]
          Length = 207

 Score =  110 bits (275), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 116/215 (53%), Gaps = 12/215 (5%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           I+  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG VT 
Sbjct: 4   ITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHIHRYNKNYTYSDVSALGIVTT 63

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
            D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+K G 
Sbjct: 64  LDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIESLKPYAKSSHLG-----PNKHGN 117

Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
            ++  L DIA        + YS F AVG+F+LL++        ++ L   +      V +
Sbjct: 118 TLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGELVHK 173

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
           D+  + +LL K +++ + L + +  E  KR+ ++ 
Sbjct: 174 DIATFFSLL-KYIESSQKLADDIREESLKRKSKSS 207


>gi|448930219|gb|AGE53784.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           IL-3A]
 gi|448933659|gb|AGE57214.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           NE-JV-4]
          Length = 207

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 66/215 (30%), Positives = 116/215 (53%), Gaps = 12/215 (5%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           I+  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG VT 
Sbjct: 4   ITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHICRYNKNYTYSDVSALGIVTT 63

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
            D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+K G 
Sbjct: 64  LDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIESLKPYAKSSNLG-----PNKHGN 117

Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
            ++  L DIA        + YS F AVG+F+LL++        ++ L   +      V +
Sbjct: 118 TLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGELVHK 173

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
           D+  + +LL K +++ + L + +  E  +R+ ++ 
Sbjct: 174 DIATFFSLL-KYIESSQKLADDIREESLRRKSKSS 207


>gi|448927841|gb|AGE51413.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           CviKI]
          Length = 232

 Score =  109 bits (273), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 116/215 (53%), Gaps = 12/215 (5%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           I+  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG VT 
Sbjct: 29  ITSSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALGIVTT 88

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
            D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+K G 
Sbjct: 89  LDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNKHGN 142

Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
            ++  L DI    S    + YS F AVG+F+LL++        ++ L   +      V +
Sbjct: 143 TLQKSLYDI----SINDKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGELVHK 198

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
           D+  + +LL K +++ + L + +  E  KR+ ++ 
Sbjct: 199 DIATFFSLL-KYIESSQKLADDIREESLKRKSKSS 232


>gi|254413033|ref|ZP_05026805.1| photosystem II biogenesis protein Psp29 [Coleofasciculus
           chthonoplastes PCC 7420]
 gi|196180197|gb|EDX75189.1| photosystem II biogenesis protein Psp29 [Coleofasciculus
           chthonoplastes PCC 7420]
          Length = 208

 Score =  109 bits (273), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 114/196 (58%), Gaps = 17/196 (8%)

Query: 36  IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 95
           +V+ HL+     ++YDP++ LG V  ++R M+GY  E D+E+IF A   A+  +P+QY+ 
Sbjct: 1   MVEMHLLAVNVDFKYDPIYVLGVVASFNRFMQGYRPERDKESIFNALCQAVGGNPQQYQD 60

Query: 96  DAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAERASGKGN---FSYSRFFAVGLFRL 152
           DA+KL+      +A  LV++      +EG  +DI    +   +   F YSR FA+GL+ L
Sbjct: 61  DAEKLKAAVGRLSAQELVDWFGSPTPLEG-AEDIHTTVAAIADNPKFKYSRLFAIGLYTL 119

Query: 153 LELANATEP----------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEY 202
           LE A   EP           +L+++   L++ +  + +DL++YR+ L K+ QA+  +++ 
Sbjct: 120 LEQA---EPELVQDAKQSMEMLQRIGQTLHLPQEKLQKDLELYRSNLEKMAQAQIAIEDA 176

Query: 203 VDREKKKREERTEPQK 218
           +  ++KKRE+R + +K
Sbjct: 177 IKADRKKREQREQEKK 192


>gi|448928860|gb|AGE52429.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           CvsA1]
          Length = 207

 Score =  109 bits (273), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 116/215 (53%), Gaps = 12/215 (5%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           I+  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG VT 
Sbjct: 4   ITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALGIVTT 63

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
            D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+K G 
Sbjct: 64  LDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNKHGN 117

Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
            ++  L DI    S    + YS F AVG+F+LL++        ++ L   +      V +
Sbjct: 118 TLQKSLYDI----SINDKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGELVHK 173

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
           D+  + +LL K +++ + L + +  E  KR+ ++ 
Sbjct: 174 DIATFFSLL-KYIESSQKLADDIREESLKRKSKSS 207


>gi|448931622|gb|AGE55183.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           MA-1E]
          Length = 207

 Score =  109 bits (273), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 116/215 (53%), Gaps = 12/215 (5%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           I+  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG VT 
Sbjct: 4   ITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALGIVTT 63

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
            D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+K G 
Sbjct: 64  LDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYFSNIETLKPYAKSSHLG-----PNKHGN 117

Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
            ++  L DI    S    + YS F AVG+F+LL++        ++ L   +      V +
Sbjct: 118 TLQKSLYDI----SINDKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGELVHK 173

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
           D+  + +LL K +++ + L + +  E  KR+ ++ 
Sbjct: 174 DIATFFSLL-KYIESSQKLADDIREESLKRKSKSS 207


>gi|443323210|ref|ZP_21052219.1| photosystem II biogenesis protein Psp29 [Gloeocapsa sp. PCC 73106]
 gi|442787120|gb|ELR96844.1| photosystem II biogenesis protein Psp29 [Gloeocapsa sp. PCC 73106]
          Length = 231

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 70/218 (32%), Positives = 116/218 (53%), Gaps = 12/218 (5%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV++TK +F   + RPI SIY  V++ELIV+ HL+   + ++ DP++ LG VT +DR M+
Sbjct: 6   TVSDTKRDFYAHHTRPINSIYRRVVEELIVELHLLSVNQNFRVDPIYCLGVVTSFDRFMQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWA-RGQTASSLVEFPSKEGEVEGLL 126
           GY  EED+ +I  +   A+    EQYR  A ++   A R      L+ +      VEG  
Sbjct: 66  GYRPEEDKASILASLCQAVGGKLEQYRDHANQVLNLAKRLHGVDDLLAWFKHPQPVEGEF 125

Query: 127 KDIAERASG---KGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKR 176
             +AE  S      +F YSR F +GL+ +L   N            +  +   VL V+  
Sbjct: 126 A-LAEAVSAIALNQSFKYSRMFGIGLYTMLGEKNLELLQDKPARDKITAQFAEVLPVSSD 184

Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 214
            + +D ++Y+  L K+ Q   +++E ++ E+KKR +++
Sbjct: 185 KLQKDFELYQANLEKMKQMIIVVEEALEAERKKRAKKS 222


>gi|33862947|ref|NP_894507.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9313]
 gi|81577657|sp|Q7V7R3.1|THF1_PROMM RecName: Full=Protein thf1
 gi|33634864|emb|CAE20850.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9313]
          Length = 243

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 68/214 (31%), Positives = 111/214 (51%), Gaps = 10/214 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T+A++K  F   +   IPS+Y     EL+V+ HL+ +++ +  D +FA+G   V+D    
Sbjct: 6   TIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVFDVFTS 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV--EGL 125
           GY  E   + +F A   +   DP   R  AQ+  E  RG     +  +  ++G+   E L
Sbjct: 66  GYRPEAHVKTLFDALCRSCGFDPNALRKQAQQTLESVRGHDLEEVQGWIQQQGKGAPEAL 125

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 178
            K +   A G   F YSR  AVGL  LL  A   E +  EKL  +       +  +K  V
Sbjct: 126 AKALRNTA-GSTTFHYSRLMAVGLLSLLASAQGDESSDPEKLSQIAHELSESVGFSKARV 184

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREE 212
           ++DL++Y++ L K+ QA EL ++ ++ E++KRE+
Sbjct: 185 EKDLNLYKSNLEKMAQAVELTEQILESERRKREQ 218


>gi|448930916|gb|AGE54479.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           KS1B]
          Length = 207

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 115/215 (53%), Gaps = 12/215 (5%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           I+  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG VT 
Sbjct: 4   ITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHIHRYNKNYTYSDVSALGIVTT 63

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
            D ++  +P +E +  I  A+I +L EDPE Y  + + L+ +A+          P+K G 
Sbjct: 64  LDSVLNTFPDDE-KVCIKNAFIISLNEDPEMYYSNIEYLKPYAKSSNLG-----PNKHGN 117

Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
            ++  L DIA        + YS F AVG+F+LL++        ++ L   +      V +
Sbjct: 118 TLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQMNGNYTGKSVKHLSESIGFKGELVHK 173

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
           D+  + +LL K +++ + L + +  E  KR+ ++ 
Sbjct: 174 DIATFFSLL-KYIESSQKLADDIREESLKRKSKSS 207


>gi|157952488|ref|YP_001497380.1| hypothetical protein NY2A_b184R [Paramecium bursaria Chlorella
           virus NY2A]
 gi|155122715|gb|ABT14583.1| hypothetical protein NY2A_b184R [Paramecium bursaria Chlorella
           virus NY2A]
          Length = 247

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/214 (30%), Positives = 115/214 (53%), Gaps = 12/214 (5%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           I+  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG VT 
Sbjct: 45  ITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALGIVTA 104

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
            D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+K G 
Sbjct: 105 LDSILNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNKHGN 158

Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
            ++  L DIA        + YS F A+G+F+LL++        ++ L   +      V +
Sbjct: 159 TLQKSLYDIASN----DKYVYSSFAAIGIFKLLQMNKNYTGNSVKHLSESVGFKGEIVHK 214

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 214
           D+  + +LL K +++ + L + +  E  K+ + +
Sbjct: 215 DIATFFSLL-KYIESSQKLADDIREESLKKSKSS 247


>gi|448931221|gb|AGE54783.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           MA-1D]
          Length = 248

 Score =  107 bits (266), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 62/198 (31%), Positives = 108/198 (54%), Gaps = 11/198 (5%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           I+  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG VT 
Sbjct: 45  ITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALGIVTA 104

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
            D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+K G 
Sbjct: 105 LDSILNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNKHGN 158

Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
            ++  L DIA        + YS F A+G+F+LL++        ++ L   +      V +
Sbjct: 159 TLQKSLYDIASN----DKYVYSSFAAIGIFKLLQMNGNYTGNSVKHLSESIGFKGEIVHK 214

Query: 181 DLDVYRNLLSKLLQAKEL 198
           D+ ++ +LL  +  +++L
Sbjct: 215 DIAMFFSLLKYIESSQKL 232


>gi|116074797|ref|ZP_01472058.1| hypothetical protein RS9916_29724 [Synechococcus sp. RS9916]
 gi|116068019|gb|EAU73772.1| hypothetical protein RS9916_29724 [Synechococcus sp. RS9916]
          Length = 234

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 66/216 (30%), Positives = 112/216 (51%), Gaps = 9/216 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T+A++K  F   +   IPS+Y     EL+V+ HL+ +++ ++ D +FA+G   V+D    
Sbjct: 6   TIADSKRAFHSAFPHVIPSLYRRTADELLVELHLLSHQKQFKVDALFAVGLRQVFDAFTR 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEGLL 126
           GY  E   +++F A  +    DP   +  A   E   +G +   + ++   +GE     +
Sbjct: 66  GYRPEAHLDSLFAAICSCNGFDPAALKQLALDSEHAVQGHSFEDVQQWLRNKGEGAPAAI 125

Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLLELA---NATEPTVLEKLCA----VLNVNKRSVD 179
             + +RA    NF YSR  AVGL  LL  A   + ++P+ L KL       L + K  V+
Sbjct: 126 TKVLKRAD-HANFHYSRLMAVGLLTLLAKAQGDDGSDPSELAKLAHELSEPLGLTKERVE 184

Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
           +DL +Y   L ++ QA EL++E +  E++KRE + E
Sbjct: 185 KDLGIYTGNLERMAQAVELMEETLAAERRKRERQNE 220


>gi|157953365|ref|YP_001498256.1| hypothetical protein AR158_C174R [Paramecium bursaria Chlorella
           virus AR158]
 gi|156068013|gb|ABU43720.1| hypothetical protein AR158_C174R [Paramecium bursaria Chlorella
           virus AR158]
 gi|448930527|gb|AGE54091.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           IL-5-2s1]
 gi|448934707|gb|AGE58259.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           NY-2B]
 gi|448935079|gb|AGE58630.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
           NYs1]
          Length = 248

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 62/198 (31%), Positives = 109/198 (55%), Gaps = 11/198 (5%)

Query: 2   ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
           I+  PPTV++TK  F   YK+ +  +YNT +Q ++V+QH+ RY + Y Y  V ALG VT 
Sbjct: 45  ITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALGIVTA 104

Query: 62  YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
            D ++  +P +E + +I  A+I +L EDPE Y  + + L+ +A+          P+K G 
Sbjct: 105 LDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNKHGN 158

Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
            ++  L DIA        + YS F A+G+F+LL++        +++L   +      V +
Sbjct: 159 TLQKSLYDIAIN----DKYVYSSFAAIGIFKLLQMNKNYTGNSVKQLSESIGFKGEIVHK 214

Query: 181 DLDVYRNLLSKLLQAKEL 198
           D+ ++ +LL  +  +++L
Sbjct: 215 DIAMFFSLLKYIESSQKL 232


>gi|352093979|ref|ZP_08955150.1| Protein thf1 [Synechococcus sp. WH 8016]
 gi|351680319|gb|EHA63451.1| Protein thf1 [Synechococcus sp. WH 8016]
          Length = 247

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 67/221 (30%), Positives = 114/221 (51%), Gaps = 11/221 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T+A++K  F   +   IPS+Y     EL+V+ HL+ +++ ++ D +FA+G   V+    +
Sbjct: 6   TIADSKRAFHTAFPYVIPSLYRRTADELLVELHLLSHQQHFKSDALFAVGLRQVFQAFTQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E   + ++ A  ++   DPE  +  A+       G T S + E+ S  G   G  +
Sbjct: 66  GYKPEAHLDELYAAICSSNGFDPEALKQLAEGSTSAVSGHTISEVREWLSNRG--AGAPE 123

Query: 128 DIAERAS--GKGNFSYSRFFAVGLFRLLELANATEP-------TVLEKLCAVLNVNKRSV 178
            +A   S  G  +F YSR  AVGL  LL  A   EP       T+  ++   L ++K  +
Sbjct: 124 PLASGISSVGGDSFHYSRLMAVGLLSLLSSAQGGEPSNPDELKTLAHEIGEQLGLSKPRL 183

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
           D+DL +Y + L K+ QA EL++E +  E++KR+ +    +A
Sbjct: 184 DKDLTLYTSNLEKMAQAVELIEETLAAERRKRDRQAADSQA 224


>gi|113955551|ref|YP_730625.1| Thf1-like protein [Synechococcus sp. CC9311]
 gi|113882902|gb|ABI47860.1| Uncharacterized protein [Synechococcus sp. CC9311]
          Length = 252

 Score =  100 bits (249), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 65/213 (30%), Positives = 111/213 (52%), Gaps = 11/213 (5%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T+A++K  F K +   IPS+Y     EL+V+ HL+ +++ ++ D +FA+G   V+    +
Sbjct: 11  TIADSKRAFHKSFPYVIPSLYRRTADELLVELHLLSHQQHFKSDALFAVGLRQVFMAFTQ 70

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E   + ++ A  T    +PE  +  A+       G T + + E+ S  G   G  +
Sbjct: 71  GYKPETHLDELYAAICTCNGFEPEALKQLAEGSTSAVSGHTINEVREWLSNRG--AGAPE 128

Query: 128 DIAERASGKG--NFSYSRFFAVGLFRLLELANATEPT-------VLEKLCAVLNVNKRSV 178
            +A   S  G  +F YSR  AVGL  LL  A   EP+       +  ++   L ++K  +
Sbjct: 129 PLASGISSVGGESFHYSRLMAVGLLSLLSSAQGGEPSNPDELKKLAHEIGEQLGLSKPRL 188

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 211
           D+DL +Y + L K+ QA EL++E +  E++KR+
Sbjct: 189 DKDLSLYTSNLEKMAQAVELIEETLAAERRKRD 221


>gi|375332109|gb|AFA52594.1| hypothetical protein [Vaucheria litorea]
          Length = 249

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 66/223 (29%), Positives = 118/223 (52%), Gaps = 7/223 (3%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+ET  +F   Y++PI   Y T++ +++   HL      + YD +F  GF +++ +LM+
Sbjct: 26  TVSETIKSFCIQYQKPILPQYRTMINDVLQSTHLNVVNGCFIYDAMFGYGFYSLFYKLMK 85

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
            YP   + + I+ A +T+L  +PE+ + D + + +     T + L    S +GE + LL 
Sbjct: 86  AYPGTGEADLIYAAMVTSLDMEPEKLKEDHETISKLIENMTRADLEN--SFKGENQNLLS 143

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANA--TEPTVLEKLCAVLNVNKRSVDRDLDVY 185
           +I+        + Y++ + +GL   ++      TE  + E L  ++  +     +DL  Y
Sbjct: 144 EISSNIKADEFYLYTKTWGIGLIEAMDKVGIPLTEENI-ESLANMIGFSPIKARQDLVQY 202

Query: 186 RNLLSKLLQAKELLKEYVDREKKKREERTE--PQKANEAIKKC 226
           +++L K+ QA++L KE   REKKK  ER E   ++A EA KK 
Sbjct: 203 KDVLDKVAQAEQLFKEIEIREKKKMAERLEEKAKRALEAAKKA 245


>gi|88808604|ref|ZP_01124114.1| hypothetical protein WH7805_02902 [Synechococcus sp. WH 7805]
 gi|88787592|gb|EAR18749.1| hypothetical protein WH7805_02902 [Synechococcus sp. WH 7805]
          Length = 234

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 63/216 (29%), Positives = 109/216 (50%), Gaps = 11/216 (5%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T+A++K  F   +   IPS+Y     EL+V+ HL+ ++  ++ + +FA+G   V+    +
Sbjct: 13  TIADSKRAFHAAFPYVIPSLYRRTADELLVELHLLSHQTQFKSNALFAVGLRQVFTAFTK 72

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 125
           GY   +    +F A  +    + +Q    A+  E+   G +   +  +     EG  E L
Sbjct: 73  GYRPADHLTELFDALCSCNGFNAQQLNSVAEGSEKAVAGHSMEEVQAWLQSKGEGAPEPL 132

Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 178
              +A+ A  +  F YSR  AVGLF LL  A   E    E LC         + +++  +
Sbjct: 133 ATGLADIAGEQ--FHYSRLMAVGLFSLLSSAQGVESQDPEDLCKTAHSIGEQIGLSRPRL 190

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 214
           ++DL +YRN L K+ QA EL++E +  E++KRE ++
Sbjct: 191 EKDLSLYRNNLEKMAQAVELMEETLASERRKRERQS 226


>gi|284929212|ref|YP_003421734.1| photosystem II biogenesis protein Psp29 [cyanobacterium UCYN-A]
 gi|284809656|gb|ADB95353.1| photosystem II biogenesis protein Psp29 [cyanobacterium UCYN-A]
          Length = 237

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 68/232 (29%), Positives = 118/232 (50%), Gaps = 14/232 (6%)

Query: 4   DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
           D   TV+ETK  F   + +PI SIY   ++EL+V+ HL+     YQY P++ALG VT+++
Sbjct: 2   DNIRTVSETKREFYNFFTKPISSIYRRFIEELLVEMHLLSVNADYQYSPIYALGVVTLFE 61

Query: 64  RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-- 121
           + M  Y  ++ ++ IF A   +   D +QYR ++  +   A   + S+  E  +K  +  
Sbjct: 62  KFMYRYQPDDHQDLIFDALCKSTGGDTKQYRQESNTILNEAETLSISNFKEDFTKSAQEK 121

Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLE-----LANATEP--TVLEKLCAVLNV 173
             + LL       +    F YSR  A+GL+ LLE     L  + E     +E++   L +
Sbjct: 122 VNDKLLWKSYYSIAQNPKFKYSRLLAIGLYSLLEKISSDLVESKEEYNKAIEQIANDLGL 181

Query: 174 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR----EERTEPQKANE 221
           +   + +D+++Y + L K+ Q    +++ ++  +KKR    EE T     NE
Sbjct: 182 SSERIQKDIELYCSNLEKMQQLLIAIEDSLEFGRKKRISQQEEDTLKTNDNE 233


>gi|318041533|ref|ZP_07973489.1| Thf1-like protein [Synechococcus sp. CB0101]
          Length = 224

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 65/208 (31%), Positives = 107/208 (51%), Gaps = 11/208 (5%)

Query: 5   VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 64
           V  TVA++K  F   +   I  IY  ++ EL+V+ HL+ +++ ++ D +FA+G   V+D 
Sbjct: 3   VSLTVADSKRAFHSAFSYVIAPIYRRLVDELLVELHLLSHQKGFRADGLFAVGLTQVFDS 62

Query: 65  LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 124
              GY  E  RE +FQA  +A   D    R  A++  +     +   +  + S +G  +G
Sbjct: 63  FSTGYRPEAQREPLFQALCSANGFDGAALRAQAEQARQQVGHHSLEEVKGWLSNQG--QG 120

Query: 125 LLKDIAERASG--KGNFSYSRFFAVGLFRLLEL---ANATEPTVL----EKLCAVLNVNK 175
             + IA    G  + +F YSR  AVGL  LL+    A+A +P  L     ++   + + K
Sbjct: 121 APELIASLLQGVQRDDFHYSRLVAVGLLSLLQSAQGADALDPQALRSAAHEIGESMGLIK 180

Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYV 203
             VD+DL +Y   + K+ QA EL++E V
Sbjct: 181 DRVDKDLSLYAGNIEKMSQAVELMEETV 208


>gi|317970011|ref|ZP_07971401.1| Thf1-like protein [Synechococcus sp. CB0205]
          Length = 228

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 100/208 (48%), Gaps = 17/208 (8%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TVA++K  F + +   I  +Y  ++ EL+V+ HL+ +++ +  D +FA+G   V+D    
Sbjct: 8   TVADSKRAFHQAFPYVIAPLYRRLVDELLVELHLLSHQKGFHADGLFAVGLTQVFDSFSN 67

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-----V 122
           GY  E  RE +FQA  +A   D   +R  A          +   +  + S  GE     +
Sbjct: 68  GYKPEAQREPLFQALCSANGFDGGAFRQMASDAATQVGHHSLDEVKGWLSNRGEGAPAPI 127

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATE---PTVL----EKLCAVLNVNK 175
            GLL  +        +F YSR  AVGL  LL+ A   E   P  L     ++   + + K
Sbjct: 128 AGLLHGVQRE-----DFHYSRLVAVGLLSLLQRAQGAEAMDPQALRSAAHEIGEAMGLIK 182

Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYV 203
             VD+DL +Y   + K+ QA EL++E V
Sbjct: 183 ARVDKDLSLYAGNIEKMTQAVELMEETV 210


>gi|449015870|dbj|BAM79272.1| photosystem II biogenesis protein Psb29 [Cyanidioschyzon merolae
           strain 10D]
          Length = 327

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 61/226 (26%), Positives = 112/226 (49%), Gaps = 8/226 (3%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+ET   F +  KRP+   Y   + E++   HL      ++YD +FALGFV+VY     
Sbjct: 88  TVSETVTRFYRNLKRPVVFYYQQAVDEILTTAHLALVCAMFRYDVIFALGFVSVYRDFFR 147

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE-----GEV 122
            YP  ++RE++F+    AL  D  Q   +A     + +G+T + L+E   ++      E 
Sbjct: 148 SYPRPDERESLFRCICDALDLDVGQVTKEADDALAYVQGKTEAELIEEIERDTGEDSAEA 207

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-TEPTVLEKLCAVLNVNKRSVDRD 181
           + ++  +       G + Y+R F +GL +++           ++K   +L ++   +D+D
Sbjct: 208 QPVIAALRACRRADGEYYYTRLFGIGLMKIMSSCGVEINLESVKKWANMLKISYARLDQD 267

Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKK--REERTEPQKANEAIKK 225
           +  Y+  + KL QA+ + KE   RE+ +   E   + Q+A E ++K
Sbjct: 268 IGTYQMSMEKLTQAEVMFKELEARERARIADELARKAQEAEEELRK 313


>gi|116070497|ref|ZP_01467766.1| hypothetical protein BL107_12665 [Synechococcus sp. BL107]
 gi|116065902|gb|EAU71659.1| hypothetical protein BL107_12665 [Synechococcus sp. BL107]
          Length = 215

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 57/218 (26%), Positives = 116/218 (53%), Gaps = 22/218 (10%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T+A++K  F + +   I  +Y  +  EL+V+ HL+ ++ +++  P+F++G  TV++   +
Sbjct: 6   TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSSFKTTPLFSVGLCTVFETFSQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E+    +F A  ++   +   +R ++++  + A+ ++             ++G+  
Sbjct: 66  GYRPEDHITGLFDALCSSNGYNATTFRKESKQCIDAAKSES-------------IDGMES 112

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELA--NATEP--TVLEKLC----AVLNVNKRSVD 179
            +A++  G+G+  YSR  A+G+FRL E A  +A +P  T L K C      LN     V+
Sbjct: 113 HLAKQKLGEGSH-YSRLMAIGVFRLFEEAKGDAEQPDETELRKRCKEVSTTLNFPAERVE 171

Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 217
           +DL ++     ++  A EL++E +  E++K+E R   Q
Sbjct: 172 KDLSLFAANSERMSAAVELVQETIAAERRKKERRQAEQ 209


>gi|161347491|ref|YP_001224936.2| Thf1-like protein [Synechococcus sp. WH 7803]
          Length = 226

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 65/228 (28%), Positives = 115/228 (50%), Gaps = 26/228 (11%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T+A++K  F   +   IPS+Y     EL+V+ HL+ ++  ++ + +FA+G   V+    +
Sbjct: 6   TIADSKRAFHAAFPYVIPSLYRRTADELLVELHLLSHQTQFKTNALFAVGLRQVFTAFTK 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-----V 122
           GY   +    +F A  +    + E+ +  A+  E+   G +   +  +   +G+     +
Sbjct: 66  GYRPADHLPQLFDALCSCNGFNAEELKSLAEGSEQAVSGHSVDEVQTWLQAKGDGAPGPL 125

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA--NATEPTVLEKLCAV-------LNV 173
              L DIA        F YSR  AVGLF LL  A  ++ +P   E+LC         + +
Sbjct: 126 ATGLADIAGE-----QFHYSRLMAVGLFSLLSSAQGDSQDP---EELCKTAHTIGEQIGL 177

Query: 174 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE----ERTEPQ 217
           ++  +++DL +YRN L K+ QA EL++E +  E++KRE    E  +PQ
Sbjct: 178 SRPRLEKDLSLYRNNLEKMAQAVELMEETLASERRKRERQASENKQPQ 225


>gi|87124410|ref|ZP_01080259.1| hypothetical protein RS9917_12390 [Synechococcus sp. RS9917]
 gi|86167982|gb|EAQ69240.1| hypothetical protein RS9917_12390 [Synechococcus sp. RS9917]
          Length = 224

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 62/215 (28%), Positives = 105/215 (48%), Gaps = 8/215 (3%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T+A++K  F   +   IP +Y     EL+V+ HL+ +++ +Q D +FA+G   V+     
Sbjct: 6   TIADSKRAFHTAFPFVIPPLYRRTADELLVELHLLSHQQQFQVDALFAVGLRQVFRAFTR 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEGLL 126
           GY   +   ++F+A  ++      +    A + E   RG +   +  +    G+     L
Sbjct: 66  GYKPGQHLASLFEALCSSTGFHAGELESLADQSEAAVRGHSIEEVRHWLEHGGDGAPAPL 125

Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLLELANA--TEPTVLEKLC----AVLNVNKRSVDR 180
             + +RA   G F YSR  AVGL  LL  A     +P  L KL       L   +  V++
Sbjct: 126 ASVLQRADSSG-FHYSRLMAVGLLSLLSEAQGDQADPEQLRKLAHELSGPLGFAQTRVEK 184

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
           DL +Y + L K+ QA EL++E +  E++KRE + +
Sbjct: 185 DLGLYASNLDKMAQAVELMEETLAAERRKRERQQQ 219


>gi|87302741|ref|ZP_01085552.1| hypothetical protein WH5701_13350 [Synechococcus sp. WH 5701]
 gi|87282624|gb|EAQ74582.1| hypothetical protein WH5701_13350 [Synechococcus sp. WH 5701]
          Length = 257

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 61/206 (29%), Positives = 104/206 (50%), Gaps = 17/206 (8%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TVA++K  F   +   I  +Y  ++ EL+V+ HL+  +  +  D +FA+G   V+D   +
Sbjct: 8   TVADSKRAFHAAFPYVIGPLYRRMVDELLVELHLLSRQSGFHSDGLFAVGLTQVFDGFAK 67

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-----V 122
           GY  ++  E +F A   +   D +Q R       +     +   + ++ ++ G+     +
Sbjct: 68  GYRPQQQSEPLFAALCASSGFDAQQIRAQHAAAVKAVGEHSLDEVKQWLAQRGQGAPEPI 127

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLEL---ANATEPTVL----EKLCAVLNVNK 175
            G+L  I +RA    +F YSR FAVGL  LL+    A A EP  L     ++   + + K
Sbjct: 128 AGVLAGI-DRA----DFHYSRLFAVGLLSLLQHARGAEAVEPQALRQAAHEIGESMGLMK 182

Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKE 201
             VD+DL +Y + L K+ QA EL++E
Sbjct: 183 ERVDKDLTLYASTLEKMAQAVELMEE 208


>gi|78184631|ref|YP_377066.1| Thf1-like protein [Synechococcus sp. CC9902]
 gi|97202850|sp|Q3AY05.1|THF1_SYNS9 RecName: Full=Protein thf1
 gi|78168925|gb|ABB26022.1| conserved hypothetical protein [Synechococcus sp. CC9902]
          Length = 215

 Score = 87.8 bits (216), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/218 (26%), Positives = 111/218 (50%), Gaps = 22/218 (10%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T+A++K  F + +   I  +Y  +  EL+V+ HL+ ++ +++  P+FA+G  TV+D    
Sbjct: 6   TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSSFKTTPLFAVGLCTVFDTFSA 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  EE    +  A  ++   D   +R ++++  + A+ ++             V+ +  
Sbjct: 66  GYRPEEHITGLLDALCSSNGYDANTFRKESKRCIDAAKTES-------------VDAMDS 112

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELA--NATEP--TVLEKLC----AVLNVNKRSVD 179
            +A +  G+G+  YSR  A+G+ RL E A  +A +P    L K C      LN     V+
Sbjct: 113 HLAGQKLGEGSH-YSRLMAIGVLRLFEEAKGDADQPDEADLRKRCKELSTALNFPAERVE 171

Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 217
           +DL ++ +   ++  A EL++E +  E++K+E R   Q
Sbjct: 172 KDLSLFASNSERMSAAIELVQETIAAERRKKERRQAEQ 209


>gi|416383906|ref|ZP_11684537.1| hypothetical protein CWATWH0003_1368 [Crocosphaera watsonii WH
           0003]
 gi|357265142|gb|EHJ13943.1| hypothetical protein CWATWH0003_1368 [Crocosphaera watsonii WH
           0003]
          Length = 209

 Score = 87.4 bits (215), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 59/200 (29%), Positives = 103/200 (51%), Gaps = 17/200 (8%)

Query: 40  HLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQK 99
           HL+     + YDP++ALG VT + R M+GY  E D+ +IF A   A+    E+Y  +A+ 
Sbjct: 2   HLLSVNIDFSYDPIYALGVVTSFQRFMQGYSPESDKPSIFNALCQAVDGSSEKYHQEAEA 61

Query: 100 LEEWARGQTASSLVEFPSKEGEV------EGLLKDIAERASGKGNFSYSRFFAVGLFRLL 153
           +   A+G    S+V+F  K   V      EG+L       +    F YSR  A+GL+ LL
Sbjct: 62  ILNEAKGL---SIVDFKDKLTHVTDNQVGEGVLWGTFGAIAANPKFKYSRLLAIGLYTLL 118

Query: 154 -----ELANATE--PTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDRE 206
                +L    E     ++++   L  +   + +DLD+YR+ L K+ Q   ++++ ++ +
Sbjct: 119 MEIDSDLLKDEEKRTETIKEVSEALKFSPEKLRKDLDLYRSNLDKMQQLLTVIEDSLEAD 178

Query: 207 KKKREERTEPQKANEAIKKC 226
           +KKR   TE + + E +++ 
Sbjct: 179 RKKR-ASTEGKTSAEVVEQT 197


>gi|260436777|ref|ZP_05790747.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. WH 8109]
 gi|260414651|gb|EEX07947.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. WH 8109]
          Length = 215

 Score = 87.4 bits (215), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 62/218 (28%), Positives = 108/218 (49%), Gaps = 22/218 (10%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T+A++K  F + +   I  +Y  +  EL+V+ HL+ ++  ++ + +F++G  TV+D   +
Sbjct: 6   TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSRFEANGLFSVGLCTVFDTFTK 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E   +A+F A  ++   D  + R     L + A+G+   +L  + S     EG   
Sbjct: 66  GYRPEAQTDALFSALCSSNGFDAAKLRKTNASLVDQAKGKDHETLKSWLSSHSLKEG--- 122

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANA------TEPTVLE--KLCAVLNVNKRSVD 179
                        YSR  AVGL  LL+ A A      TE  V +  +L   L +    V+
Sbjct: 123 -----------SHYSRLMAVGLMSLLKAATADATGSDTETIVKQSKELAEGLGLPTDRVE 171

Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 217
           +DL ++ +   ++ QA EL++E +  EK+K+E R E Q
Sbjct: 172 KDLTLFGSNSERMDQAVELVEETIAAEKRKKERRLEEQ 209


>gi|397644025|gb|EJK76212.1| hypothetical protein THAOC_02035 [Thalassiosira oceanica]
          Length = 293

 Score = 86.7 bits (213), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 57/209 (27%), Positives = 98/209 (46%), Gaps = 8/209 (3%)

Query: 15  NFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEED 74
            F      PI ++Y   + +L+   HL      +Q DPVF+LG VTV D L++ +P ++ 
Sbjct: 58  TFTDALGTPINALYKGTITDLVGSLHLTVVTARFQRDPVFSLGLVTVLDLLLKNFPEQDT 117

Query: 75  REAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAERAS 134
            + I  A I +      +   +A ++  WA+G+T   +    +  GE +  L  +A  A 
Sbjct: 118 AKRIKSAMIESAGMVESEVDAEAAEVATWAQGKTREDIA--SALRGEGDSTLAQVANGAK 175

Query: 135 GKGNFSYSRFFAVGLFRLLELANATEPT-----VLEKLCAV-LNVNKRSVDRDLDVYRNL 188
           G   + YSRFF +GL +++++    +       V+E      L     +   D D+Y   
Sbjct: 176 GDEYWMYSRFFGIGLVKMMDIVGIEQDMSVAYDVMEDWVGTCLGKPHYTACADSDLYFKQ 235

Query: 189 LSKLLQAKELLKEYVDREKKKREERTEPQ 217
             KL   + ++KE   REKK+  +R E +
Sbjct: 236 KGKLDMMETMMKEIEIREKKRMADRLEAK 264


>gi|147848088|emb|CAK23639.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
          Length = 206

 Score = 85.1 bits (209), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/212 (28%), Positives = 107/212 (50%), Gaps = 26/212 (12%)

Query: 24  IPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYI 83
           IPS+Y     EL+V+ HL+ ++  ++ + +FA+G   V+    +GY   +    +F A  
Sbjct: 2   IPSLYRRTADELLVELHLLSHQTQFKTNALFAVGLRQVFTAFTKGYRPADHLPQLFDALC 61

Query: 84  TALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-----VEGLLKDIAERASGKGN 138
           +    + E+ +  A+  E+   G +   +  +   +G+     +   L DIA        
Sbjct: 62  SCNGFNAEELKSLAEGSEQAVSGHSVDEVQTWLQAKGDGAPGPLATGLADIAGE-----Q 116

Query: 139 FSYSRFFAVGLFRLLELA--NATEPTVLEKLCAV-------LNVNKRSVDRDLDVYRNLL 189
           F YSR  AVGLF LL  A  ++ +P   E+LC         + +++  +++DL +YRN L
Sbjct: 117 FHYSRLMAVGLFSLLSSAQGDSQDP---EELCKTAHTIGEQIGLSRPRLEKDLSLYRNNL 173

Query: 190 SKLLQAKELLKEYVDREKKKRE----ERTEPQ 217
            K+ QA EL++E +  E++KRE    E  +PQ
Sbjct: 174 EKMAQAVELMEETLASERRKRERQASENKQPQ 205


>gi|78212971|ref|YP_381750.1| Thf1-like protein [Synechococcus sp. CC9605]
 gi|97202855|sp|Q3AJN7.1|THF1_SYNSC RecName: Full=Protein thf1
 gi|78197430|gb|ABB35195.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 215

 Score = 83.6 bits (205), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 62/218 (28%), Positives = 110/218 (50%), Gaps = 22/218 (10%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T+A++K  F + +   I  +Y  +  EL+V+ HL+ ++  ++ + +F++G  TV+D  ++
Sbjct: 6   TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSRFEANELFSVGLCTVFDTFIK 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E   +A+F+A  ++   D  + R     L E A+G+   SL ++ S     EG   
Sbjct: 66  GYRPEAQTDALFRALCSSNGFDAAKLRKTYASLVEQAKGKDPESLKDWLSSHALKEG--- 122

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLE------LANATEPTVLE--KLCAVLNVNKRSVD 179
                        YSR  AVGL  LL+        + TE  V +  +L   L +    V+
Sbjct: 123 -----------SHYSRLMAVGLMSLLKAAAADATDSDTEAIVKQSKELAEGLGLPTDRVE 171

Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 217
           +DL ++ +   ++ QA EL++E +  EK+K+E R E Q
Sbjct: 172 KDLTLFGSNSERMDQAVELVEETIAAEKRKKERRLEEQ 209


>gi|428183151|gb|EKX52010.1| hypothetical protein GUITHDRAFT_150871, partial [Guillardia theta
           CCMP2712]
          Length = 309

 Score = 83.2 bits (204), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 56/223 (25%), Positives = 104/223 (46%), Gaps = 17/223 (7%)

Query: 3   SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
           +D+ P  A  +  F KL+ RPIP ++     E++   HL      ++YD ++A G  + +
Sbjct: 73  ADIEPCGAAVE-RFYKLFARPIPFVFRAPTNEILYLSHLDLVNAMFRYDVIWAAGLYSTF 131

Query: 63  DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
           D        E+ R  +FQA +  LK D  + + DA  + +WA+G+T + +V   + +GE 
Sbjct: 132 DLFFSAL-DEDLRANLFQALMGGLKLDQSKIKSDADAVLQWAQGKTEADVVS--AIKGED 188

Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTV------------LEKLCAV 170
              +  +        +F Y+R F  GL +++++    EP                   A+
Sbjct: 189 SSPVGQVLASLGKNEDFLYTRNFGAGLIKIMQVVG-VEPNAENAKRWAEVLGFTSNTSAL 247

Query: 171 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
             ++    + D+ ++ + + K+ QA +L  E   REKKK  E+
Sbjct: 248 SGLSASKFETDVGLFLSSVDKMQQAMQLFAEVEAREKKKIAEK 290


>gi|299469582|emb|CBN76436.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 226

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 41/109 (37%), Positives = 61/109 (55%), Gaps = 2/109 (1%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+ET  +F   YK+ + + + T++ E +   HL  Y   ++YDP+F +GF T + R M 
Sbjct: 107 TVSETVADFYIYYKKVVLTQFRTIVTEYLQSTHLTVYDARFKYDPLFGVGFYTSFMRFMR 166

Query: 68  GYPSEEDREAIFQAYITALKE--DPEQYRIDAQKLEEWARGQTASSLVE 114
            YP     E IF A + A+    DP+Q R D   L+EWA G+T   +VE
Sbjct: 167 AYPVPGQAELIFDAVVKAIGNGLDPDQMRKDTTALKEWAEGKTEEDVVE 215


>gi|33865836|ref|NP_897395.1| Thf1-like protein [Synechococcus sp. WH 8102]
 gi|81574513|sp|Q7U6N6.1|THF1_SYNPX RecName: Full=Protein thf1
 gi|33633006|emb|CAE07817.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
          Length = 212

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 104/211 (49%), Gaps = 19/211 (9%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T+A++K  F + +   I  +Y  +  EL+V+ HL+ ++ T+Q + +FA+G  TV++R  +
Sbjct: 6   TIADSKRAFHQAFPHVIAPLYRRIADELLVELHLLSHQATFQANSLFAVGLKTVFERFTQ 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY   E   A+  A  ++   D EQ +  AQ   + A G +  +   +  ++   +G   
Sbjct: 66  GYRPMEHPAALLSALCSSNGFDDEQLKQAAQHCLQDAEGHSDDAFQSWLKEQSLSDGA-- 123

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL-----EKLCAVLNVNKRSVDRDL 182
                        YSR  AVGL  LLE ++             KL   L +    V++DL
Sbjct: 124 ------------HYSRLMAVGLLALLEASSDESDASSLRQRAVKLSVDLGLPAERVEKDL 171

Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREER 213
            V+ +   ++ QA EL++E +  +++K+E+R
Sbjct: 172 TVFSSNSERMEQAVELMQETLAADRRKKEKR 202


>gi|159903384|ref|YP_001550728.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9211]
 gi|254784145|sp|A9BAB2.1|THF1_PROM4 RecName: Full=Protein thf1
 gi|159888560|gb|ABX08774.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9211]
          Length = 221

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 59/221 (26%), Positives = 115/221 (52%), Gaps = 18/221 (8%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+E+K  F K +   +P++Y  ++ ELIV+ +L++ +  +  D VFA+G  +++    +
Sbjct: 6   TVSESKAIFHKEFPFVVPAVYRRLVDELIVELNLLKNQERFVADGVFAIGLTSIFLDFTK 65

Query: 68  GYPSEEDREAIFQAYITAL---KEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 124
           GY  E  +  + +A          + EQ  ++A+KL          SL+   +++ E E 
Sbjct: 66  GYKPENQKGILLEAICKCTGFSASNLEQIALEAKKLANGLNTNEIKSLITDNNRD-EKES 124

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRS 177
             K I +      N  YSR  A+G+++L+++ +       ATE + L+ L       K  
Sbjct: 125 TYKLINK------NNHYSRIIAIGIYKLVDMQSNGFNKEEATENSYLD-LVNNFGYTKER 177

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQK 218
           V++D+++Y++ L K+ +A EL++  +  EK++ +ER    K
Sbjct: 178 VEKDVNLYKSSLDKIEKALELIEMNIKDEKRRNKERVSRTK 218


>gi|219123541|ref|XP_002182081.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406682|gb|EEC46621.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 311

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 55/225 (24%), Positives = 104/225 (46%), Gaps = 9/225 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV E   +F       +  +Y  ++ +++   HL+     +Q D +++LG +T  D L++
Sbjct: 67  TVGEAFADFSSELGVTVNPLYKNMVTDIVGTTHLVIVNARFQRDAIWSLGILTALDLLLK 126

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
            YP  E    I  A   ++  D ++ R +A+ + +WA G++ + +    + EG+    + 
Sbjct: 127 NYPEPEVGAKIVSALFKSVGLDEDEIRNEARTISDWAVGKSKADIETALTGEGDSP--VA 184

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANA------TEPTVLEKLCAVLNVNKRSVDRD 181
            IA        + YSR+F +GL +++E            P +   +   L  +  +   D
Sbjct: 185 AIANSIKPNDYWMYSRYFGIGLIKIMESTGVEMDKDEVYPVMESWMQEKLGRSSLTACAD 244

Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKC 226
            D+Y  +  KL   + ++KE   REKK+  ER E  KA  A++  
Sbjct: 245 SDLYFKIKDKLDMMETMMKEIEIREKKRMAERLE-DKAEAALRAA 288


>gi|427701945|ref|YP_007045167.1| photosystem II biogenesis protein Psp29 [Cyanobium gracile PCC
           6307]
 gi|427345113|gb|AFY27826.1| photosystem II biogenesis protein Psp29 [Cyanobium gracile PCC
           6307]
          Length = 231

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 60/205 (29%), Positives = 102/205 (49%), Gaps = 11/205 (5%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TVA++K  F   +   I  +Y  ++ EL+V+ HL+  ++ +Q D +FA+G + V+D    
Sbjct: 6   TVADSKRAFHGAFPHVISPLYRRMVDELLVELHLLSRQKGFQIDALFAVGLIQVFDGFAR 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E  +  +FQA   +   D    R   Q+        + + + ++   +G   G   
Sbjct: 66  GYRPEAQKGPLFQALCASSGFDGPDLRRQCQEALAAMGRHSQAEVRQWIESQG--AGAPA 123

Query: 128 DIAERASG--KGNFSYSRFFAVGLFRLLELA---NATEPTVLEKLCA----VLNVNKRSV 178
            +A   +G  + +F YSR  AVGL  LLE A   +A EP  L +L       + + +  +
Sbjct: 124 PVATALAGIRRPDFHYSRLMAVGLLALLEQALADDAMEPQALRQLAHEIGESMGLLRDRL 183

Query: 179 DRDLDVYRNLLSKLLQAKELLKEYV 203
           D+DL +Y + L K+  A EL++E V
Sbjct: 184 DKDLALYASNLEKMSMAVELMEETV 208


>gi|223995057|ref|XP_002287212.1| hypothetical protein THAPSDRAFT_261275 [Thalassiosira pseudonana
           CCMP1335]
 gi|220976328|gb|EED94655.1| hypothetical protein THAPSDRAFT_261275 [Thalassiosira pseudonana
           CCMP1335]
          Length = 212

 Score = 74.3 bits (181), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 61/212 (28%), Positives = 99/212 (46%), Gaps = 8/212 (3%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV E    F      P+ ++Y  +  +L+   HL+     +Q D VF+LG V+  D +++
Sbjct: 2   TVGEAFTQFTDKLGTPVNALYKGMCTDLVGSLHLVMVNARFQRDAVFSLGLVSALDLVLK 61

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
            YP  E    I  A + ++  D      +A  LE WA+G+T   +      EG+ +  L 
Sbjct: 62  NYPEAETGARIKSAMLESVGLDEAVVNAEAAALEAWAQGKTKEDIASALKGEGDSQ--LA 119

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELAN-----ATEPTVLEK-LCAVLNVNKRSVDRD 181
            IA+ A G   + YSRFF VGL R++E+       +    V+E  +   +     +   D
Sbjct: 120 AIAKAAKGDQWWMYSRFFGVGLVRIMEIVGVEMDMSVAYDVMENWMGKCMEKPYYTACSD 179

Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
            D+Y     KL   + ++KE   REKK+  +R
Sbjct: 180 SDLYFKTKGKLDMMETMMKEIEIREKKRMADR 211


>gi|72382131|ref|YP_291486.1| Thf1-like protein [Prochlorococcus marinus str. NATL2A]
 gi|97202784|sp|Q46L45.1|THF1_PROMT RecName: Full=Protein thf1
 gi|72001981|gb|AAZ57783.1| conserved hypothetical protein [Prochlorococcus marinus str.
           NATL2A]
          Length = 199

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 50/202 (24%), Positives = 95/202 (47%), Gaps = 19/202 (9%)

Query: 5   VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 64
           V  T++++K +F K +   IP+IY  +  EL+V+ HL+ +++ ++ D +F+ G   V+ +
Sbjct: 3   VRATISDSKSDFHKEFPYVIPAIYRKLADELLVELHLLSHQKNFKKDSIFSTGLKEVFSK 62

Query: 65  LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 124
              GY   E    +F A       +P +    +++L   A+  T   L  F SK      
Sbjct: 63  FTSGYKPSEHATKLFDAICNCNGFNPTEINNSSEQLVSNAKSFTKEDLNSFLSKTNN--- 119

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS------- 177
                      KG   YSR  A+G+++L+      +    E L   +N   +S       
Sbjct: 120 ---------DNKGYDYYSRINAIGIYKLVSEMPLFKEVKEEDLNKEINDISKSLGYQYSR 170

Query: 178 VDRDLDVYRNLLSKLLQAKELL 199
           V++D+ +Y++ + K+ QA E++
Sbjct: 171 VEKDISMYKSNIEKMKQALEII 192


>gi|124025670|ref|YP_001014786.1| Thf1-like protein [Prochlorococcus marinus str. NATL1A]
 gi|166987530|sp|A2C211.1|THF1_PROM1 RecName: Full=Protein thf1
 gi|123960738|gb|ABM75521.1| conserved hypothetical protein [Prochlorococcus marinus str.
           NATL1A]
          Length = 199

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/202 (24%), Positives = 95/202 (47%), Gaps = 19/202 (9%)

Query: 5   VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 64
           V  T++++K +F K +   IP+IY  +  EL+V+ HL+ +++ ++ D +F+ G   V+ +
Sbjct: 3   VRATISDSKSDFHKEFPYVIPAIYRKLADELLVELHLLSHQKNFKKDSIFSTGLKEVFCK 62

Query: 65  LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 124
              GY   E    +F A       +P +    +++L   A+  T   L  F SK      
Sbjct: 63  FTSGYKPSEHVTKLFDAICNCNGFNPTEINNSSEQLVSNAKSFTKEDLNSFLSKTNN--- 119

Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS------- 177
                      KG   YSR  A+G+++L+      +    E L   +N   +S       
Sbjct: 120 ---------DNKGYDYYSRINAIGIYKLVSEMPLFKEVKEEDLNKEINDISKSLGYQYSR 170

Query: 178 VDRDLDVYRNLLSKLLQAKELL 199
           V++D+ +Y++ + K+ QA E++
Sbjct: 171 VEKDISMYKSNIEKMKQALEII 192


>gi|33240369|ref|NP_875311.1| Thf1-like protein [Prochlorococcus marinus subsp. marinus str.
           CCMP1375]
 gi|81664534|sp|Q7VC23.1|THF1_PROMA RecName: Full=Protein thf1
 gi|33237896|gb|AAP99963.1| Uncharacterized protein [Prochlorococcus marinus subsp. marinus
           str. CCMP1375]
          Length = 214

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 46/208 (22%), Positives = 102/208 (49%), Gaps = 9/208 (4%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           T++++K  F K +   IP +Y  VL E +V+ +L+  +  ++ D +F+ G +  ++R   
Sbjct: 6   TISDSKGLFHKEFPYVIPPVYRKVLDEYLVELNLLSNQSNFKIDTIFSYGLIISFERFTV 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  +     I ++   +   D +  +  +  +++    +    ++   +   E++  + 
Sbjct: 66  GYEPDSHISKILESLCNSCNIDIKAIKEYSNNIKKLINEKGIKEIINILT--AEIKKSVG 123

Query: 128 DIA-ERASGKGNFSYSRFFAVGLFRLLELAN-----ATEPTVLEKLCAVLNVNKRSVDRD 181
            IA    SGK  + YSR  A+G++ L+   N       +  ++ +    L  +K  V++D
Sbjct: 124 GIALSNQSGKDKY-YSRLHAIGIYELISNINEDKKEGDDKEIISECVEALGFSKDRVEKD 182

Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKK 209
           ++ Y+N + K+ +  EL+K  V+  K+K
Sbjct: 183 INQYKNSMEKIKEMMELIKLTVEETKRK 210


>gi|254526529|ref|ZP_05138581.1| photosystem II biogenesis protein Psp29 [Prochlorococcus marinus
           str. MIT 9202]
 gi|221537953|gb|EEE40406.1| photosystem II biogenesis protein Psp29 [Prochlorococcus marinus
           str. MIT 9202]
          Length = 202

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/211 (22%), Positives = 111/211 (52%), Gaps = 22/211 (10%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K  F + +   IP +Y  +  E++V+ +L+ ++  +  D +F +G    +  LM+
Sbjct: 6   TVSDSKKLFHEKFPYVIPGLYKRIADEMLVELNLLNHQNEFTQDFLFCVGLTETFKELMK 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E+  + +F++  ++   + ++    +QK ++  + +T++ +V+             
Sbjct: 66  GYQPEKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKDKTSTDIVKL------------ 113

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLL----ELANATEPTVLEKLCAV---LNVNKRSVDR 180
            + E+++ K     SR   +G++ L+    +L    E  + + +  +   LN++    ++
Sbjct: 114 -LIEKSNSK--LYPSRILNLGIYILISNAQDLKKKNESDINKMISDIFEQLNLSANKAEK 170

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKRE 211
           D+ +Y++ +SK+ QAKEL++E   + KKK E
Sbjct: 171 DIGIYKSSISKMEQAKELIEELRIKNKKKDE 201


>gi|157413170|ref|YP_001484036.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9215]
 gi|157387745|gb|ABV50450.1| Conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9215]
          Length = 217

 Score = 68.2 bits (165), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 47/211 (22%), Positives = 111/211 (52%), Gaps = 22/211 (10%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K  F + +   IP +Y  +  E++V+ +L+ ++  +  D +F +G    +  LM+
Sbjct: 21  TVSDSKKLFHEKFPYVIPGLYKRIADEMLVELNLLNHQNEFTQDFLFCVGLTETFKELMK 80

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E+  + +F++  ++   + ++    +QK ++  + +T++ +V+             
Sbjct: 81  GYQPEKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKDKTSTDIVKL------------ 128

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELA------NATEPT-VLEKLCAVLNVNKRSVDR 180
            + E+++ K     SR   +G++ L+  A      N ++   ++  +   LN++    ++
Sbjct: 129 -LIEKSNSK--LYPSRILNLGIYILISNAQDLKKNNESDTNKMISDIFEKLNLSANKAEK 185

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKRE 211
           D+ +Y++ +SK+ QAKEL++E   + KKK E
Sbjct: 186 DIGIYKSSISKMEQAKELIEELRIKNKKKDE 216


>gi|123968337|ref|YP_001009195.1| Thf1-like protein [Prochlorococcus marinus str. AS9601]
 gi|123198447|gb|ABM70088.1| conserved hypothetical protein [Prochlorococcus marinus str.
           AS9601]
          Length = 218

 Score = 67.8 bits (164), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 115/217 (52%), Gaps = 30/217 (13%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K  F + +   IP +Y  ++ E++V+ +L+ ++  +  D +F +G    +  LM+
Sbjct: 21  TVSDSKKLFHEKFPYVIPGLYKRIVDEMLVELNLLNHQNEFTLDYLFCVGLTETFKELMK 80

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E+  + +F++  ++          +A+++ E ++ ++   LV+  SK+     +LK
Sbjct: 81  GYQPEKHLDLLFESLCSST-------NFEAKEINEISK-KSQKELVDKTSKD-----ILK 127

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELA-----------NATEPTVLEKLCAVLNVNKR 176
            + E+ + K     SR   +G++ L+  +           N     + EKL   L+ NK 
Sbjct: 128 LLVEKNNSK--LYPSRILNLGIYTLISNSQDFKEKNESDKNKMTSDIFEKLS--LSANK- 182

Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
             ++D+ +Y++ +SK+ QAKEL++E   ++K K +++
Sbjct: 183 -AEKDIGIYKSSISKMEQAKELIEELRIKDKNKNQKK 218


>gi|123200442|gb|ABM72050.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9515]
          Length = 198

 Score = 67.0 bits (162), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 56/212 (26%), Positives = 104/212 (49%), Gaps = 28/212 (13%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+E+K  F + +   IP +Y  ++ E++V+ +L+ ++  +  D +F +G    +  L +
Sbjct: 2   TVSESKKLFHEQFPFVIPGLYKRIVDEMLVELNLLNHQNEFIQDELFCVGLTETFKELTK 61

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYR-IDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 126
           GY  E   E +F++   +    P + + I  + LE++                 E+  LL
Sbjct: 62  GYKPESHLELLFESLCKSSNFIPSKIKEISLKTLEQYKDKSLK-----------EISILL 110

Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL---EKLCAV------LNVNKRS 177
           K+         N   SR   +G++  L +ANAT+   L   EK  A+      LN++   
Sbjct: 111 KE-----KSTSNLYSSRILNIGIY--LIIANATDFKGLKDSEKNKAITDNINNLNLSVNK 163

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKK 209
            ++D+ +Y++ + K+ QAKELL+E   + KKK
Sbjct: 164 AEKDIGIYKSSIKKMEQAKELLEEAKIQNKKK 195


>gi|161407964|ref|YP_001011157.2| Thf1-like protein [Prochlorococcus marinus str. MIT 9515]
          Length = 217

 Score = 66.6 bits (161), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 56/212 (26%), Positives = 104/212 (49%), Gaps = 28/212 (13%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+E+K  F + +   IP +Y  ++ E++V+ +L+ ++  +  D +F +G    +  L +
Sbjct: 21  TVSESKKLFHEQFPFVIPGLYKRIVDEMLVELNLLNHQNEFIQDELFCVGLTETFKELTK 80

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYR-IDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 126
           GY  E   E +F++   +    P + + I  + LE++                 E+  LL
Sbjct: 81  GYKPESHLELLFESLCKSSNFIPSKIKEISLKTLEQYKDKSLK-----------EISILL 129

Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL---EKLCAV------LNVNKRS 177
           K+         N   SR   +G++  L +ANAT+   L   EK  A+      LN++   
Sbjct: 130 KE-----KSTSNLYSSRILNIGIY--LIIANATDFKGLKDSEKNKAITDNINNLNLSVNK 182

Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKK 209
            ++D+ +Y++ + K+ QAKELL+E   + KKK
Sbjct: 183 AEKDIGIYKSSIKKMEQAKELLEEAKIQNKKK 214


>gi|97202782|sp|Q7V1W1.2|THF1_PROMP RecName: Full=Protein thf1
          Length = 202

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 47/203 (23%), Positives = 102/203 (50%), Gaps = 26/203 (12%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K  F + +   IP +Y  ++ E++V+ +L+ ++  +  D +F +G    +  L +
Sbjct: 6   TVSDSKKLFHEQFPYVIPGLYKRIVDEMLVELNLLNHQNEFIQDDLFCVGLTETFKELTK 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  EE    +F++   +   +P++ +  ++K  E  + ++            E+  LLK
Sbjct: 66  GYKPEEHLRVLFESLCNSSNFEPKKIKEASKKTLEVYKDKSLK----------EISILLK 115

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANATE---------PTVLEKLCAVLNVNKRSV 178
             ++      N   SR   +G++  L +ANAT+           ++  +   LN++    
Sbjct: 116 QKSD-----SNLYSSRILNLGIY--LIIANATDFKDIKDPEKNKIISDIINKLNLSFNKA 168

Query: 179 DRDLDVYRNLLSKLLQAKELLKE 201
           ++D+ +Y++ + K+ QAKELL+E
Sbjct: 169 EKDIGIYKSSILKMEQAKELLQE 191


>gi|33861298|ref|NP_892859.1| Thf1-like protein [Prochlorococcus marinus subsp. pastoris str.
           CCMP1986]
 gi|33633875|emb|CAE19200.1| conserved hypothetical protein [Prochlorococcus marinus subsp.
           pastoris str. CCMP1986]
          Length = 217

 Score = 65.9 bits (159), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 47/203 (23%), Positives = 102/203 (50%), Gaps = 26/203 (12%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K  F + +   IP +Y  ++ E++V+ +L+ ++  +  D +F +G    +  L +
Sbjct: 21  TVSDSKKLFHEQFPYVIPGLYKRIVDEMLVELNLLNHQNEFIQDDLFCVGLTETFKELTK 80

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  EE    +F++   +   +P++ +  ++K  E  + ++            E+  LLK
Sbjct: 81  GYKPEEHLRVLFESLCNSSNFEPKKIKEASKKTLEVYKDKSLK----------EISILLK 130

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANATE---------PTVLEKLCAVLNVNKRSV 178
             ++      N   SR   +G++  L +ANAT+           ++  +   LN++    
Sbjct: 131 QKSD-----SNLYSSRILNLGIY--LIIANATDFKDIKDPEKNKIISDIINKLNLSFNKA 183

Query: 179 DRDLDVYRNLLSKLLQAKELLKE 201
           ++D+ +Y++ + K+ QAKELL+E
Sbjct: 184 EKDIGIYKSSILKMEQAKELLQE 206


>gi|97202762|sp|Q31BD6.2|THF1_PROM9 RecName: Full=Protein thf1
          Length = 201

 Score = 63.5 bits (153), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 47/209 (22%), Positives = 112/209 (53%), Gaps = 22/209 (10%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K  F + +   IP +Y  ++ E++V+ +L+ ++  ++ D +F +G    +  L +
Sbjct: 6   TVSDSKKLFHEEFPYVIPGLYKRIVDEILVELNLLNHQNEFKQDYLFCIGLTETFKELTK 65

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E+  + +F++   +          +A++++E ++     S  EF  K  +   +LK
Sbjct: 66  GYKPEKHLDLLFESLCIST-------NFEAKEIKEISK----ISQKEFSDKSSK--DILK 112

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLL-------ELANATEPTVLEKLCAVLNVNKRSVDR 180
            + E+++ K     SR   +G++ L+       E  +  +  ++  +   L++++   ++
Sbjct: 113 LLKEKSNSK--LYPSRILNLGIYILISNSQDFKENNDIEKNKMISDIFEKLSLSRNKAEK 170

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKK 209
           D+ +Y++ +SK+ QAKEL++E   ++KKK
Sbjct: 171 DIGIYKSSISKMEQAKELIQEQRIKDKKK 199


>gi|194476659|ref|YP_002048838.1| hypothetical protein PCC_0178 [Paulinella chromatophora]
 gi|171191666|gb|ACB42628.1| hypothetical protein PCC_0178 [Paulinella chromatophora]
          Length = 213

 Score = 63.5 bits (153), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 53/207 (25%), Positives = 86/207 (41%), Gaps = 36/207 (17%)

Query: 7   PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
           PTVA+TK  F K +   I   + TVL EL+V+  L+  +     DP+FA+G +  +  L 
Sbjct: 8   PTVADTKRAFYKGFPYVIAPSHRTVLNELLVELFLLSPQTDIGSDPLFAVGLIQFFGVLT 67

Query: 67  EGYPSEEDREAIFQAYITALKEDP--------------EQYRIDAQKLEEWARGQTASSL 112
           + Y  +  R  +F+A   ++  D                QY I  ++L  W+     +S 
Sbjct: 68  KHYQPQNHRMLLFEALCNSIGFDSFNLRQIRKESLSELSQYNI--EELHSWSLTGADNSE 125

Query: 113 VEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLE 165
           + F      +             K  F YSR  A+GL  L++ A   E         +  
Sbjct: 126 ILFTKTFIPI-------------KRRFHYSRLMAIGLLCLIKRARGVETLEAKELYYLTH 172

Query: 166 KLCAVLNVNKRSVDRDLDVYRNLLSKL 192
            L   +   +  +DRDL VY + + K+
Sbjct: 173 NLAEKMGFIRERIDRDLSVYIDTIEKM 199


>gi|78779133|ref|YP_397245.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9312]
 gi|78712632|gb|ABB49809.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9312]
          Length = 216

 Score = 63.2 bits (152), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 47/209 (22%), Positives = 112/209 (53%), Gaps = 22/209 (10%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K  F + +   IP +Y  ++ E++V+ +L+ ++  ++ D +F +G    +  L +
Sbjct: 21  TVSDSKKLFHEEFPYVIPGLYKRIVDEILVELNLLNHQNEFKQDYLFCIGLTETFKELTK 80

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY  E+  + +F++   +          +A++++E ++     S  EF  K  +   +LK
Sbjct: 81  GYKPEKHLDLLFESLCIST-------NFEAKEIKEISK----ISQKEFSDKSSK--DILK 127

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLL-------ELANATEPTVLEKLCAVLNVNKRSVDR 180
            + E+++ K     SR   +G++ L+       E  +  +  ++  +   L++++   ++
Sbjct: 128 LLKEKSNSK--LYPSRILNLGIYILISNSQDFKENNDIEKNKMISDIFEKLSLSRNKAEK 185

Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKK 209
           D+ +Y++ +SK+ QAKEL++E   ++KKK
Sbjct: 186 DIGIYKSSISKMEQAKELIQEQRIKDKKK 214


>gi|323450067|gb|EGB05951.1| hypothetical protein AURANDRAFT_66018 [Aureococcus anophagefferens]
          Length = 1032

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 46/164 (28%), Positives = 73/164 (44%), Gaps = 5/164 (3%)

Query: 48  YQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQ 107
           + YD +F  GFVT+ D +M  YP   D E I  A I AL  DP   R D + + EW  G+
Sbjct: 69  FVYDELFGFGFVTLMDMIMSPYPVAGDGEKITDALIAALDMDPATLRGDHKAVTEWLAGK 128

Query: 108 T-ASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-EPTVLE 165
           T A  L    S +G     +   A    G+  F ++R   VGL  +++      +   L 
Sbjct: 129 TEADVLAAVASNDGSK---VASAAATIKGQEEFHHTRPSNVGLVAVMDAVGCKPDDESLA 185

Query: 166 KLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKK 209
           +      +   +V R+  + +    K+  A +++K     EKK+
Sbjct: 186 RWTEAFGMRAPAVQRNAGLLKEYQEKVANAMQMIKSAEIMEKKR 229


>gi|126543182|gb|ABO17424.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9301]
          Length = 198

 Score = 60.1 bits (144), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 40/201 (19%), Positives = 103/201 (51%), Gaps = 22/201 (10%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K  F + +   IP +Y  ++ E++V+ +L+ ++  +  + +F +G    +  LM+
Sbjct: 2   TVSDSKRLFHEKFPYVIPGLYKRIVDEILVELNLLNHQNEFTQEYLFCIGLTETFKELMK 61

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY   +  + +F++  ++   + ++    +QK ++  + +T++              +LK
Sbjct: 62  GYQPNKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKNKTSN-------------DILK 108

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRSVDR 180
            + E+++ K     SR   + ++ L+  A        +    ++  +   LN++    ++
Sbjct: 109 LLIEKSNSK--LYPSRILNLAIYILISSAQDLKEKEESGRNKIISDIFEKLNLSANKAEK 166

Query: 181 DLDVYRNLLSKLLQAKELLKE 201
           D+ +Y++ +SK+ QAKEL++E
Sbjct: 167 DIGIYKSSISKMEQAKELIEE 187


>gi|161407965|ref|YP_001091025.2| Thf1-like protein [Prochlorococcus marinus str. MIT 9301]
          Length = 217

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/201 (19%), Positives = 103/201 (51%), Gaps = 22/201 (10%)

Query: 8   TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
           TV+++K  F + +   IP +Y  ++ E++V+ +L+ ++  +  + +F +G    +  LM+
Sbjct: 21  TVSDSKRLFHEKFPYVIPGLYKRIVDEILVELNLLNHQNEFTQEYLFCIGLTETFKELMK 80

Query: 68  GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
           GY   +  + +F++  ++   + ++    +QK ++  + +T++              +LK
Sbjct: 81  GYQPNKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKNKTSN-------------DILK 127

Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRSVDR 180
            + E+++ K     SR   + ++ L+  A        +    ++  +   LN++    ++
Sbjct: 128 LLIEKSNSK--LYPSRILNLAIYILISSAQDLKEKEESGRNKIISDIFEKLNLSANKAEK 185

Query: 181 DLDVYRNLLSKLLQAKELLKE 201
           D+ +Y++ +SK+ QAKEL++E
Sbjct: 186 DIGIYKSSISKMEQAKELIEE 206


>gi|256810247|ref|YP_003127616.1| CRISPR-associated protein, Csx11 family [Methanocaldococcus fervens
            AG86]
 gi|256793447|gb|ACV24116.1| CRISPR-associated protein, Csx11 family [Methanocaldococcus fervens
            AG86]
          Length = 1056

 Score = 41.6 bits (96), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 50/101 (49%), Gaps = 3/101 (2%)

Query: 100  LEEWARGQTASSLVEF-PSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA 158
            LE+W +      + E  PSK  ++  ++    E    K + S S+F       +LEL   
Sbjct: 942  LEDWKKFIKFKEIFENKPSKLQKLVNIIYKCLEDWDNKYDDSISQFLDTSFINVLELNKK 1001

Query: 159  TEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELL 199
            +   V+EKLC + +++    D+DL+ +RN L   +  K++L
Sbjct: 1002 SNKEVIEKLCVIFDISLE--DKDLEKFRNELINKIDRKKML 1040


>gi|359483284|ref|XP_003632934.1| PREDICTED: mixed-amyrin synthase-like [Vitis vinifera]
          Length = 170

 Score = 39.7 bits (91), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 6/76 (7%)

Query: 130 AERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLL 189
           AE  + + NF  +RF AV L   L L    EP  +E+L   +NV   ++ R  +V R   
Sbjct: 40  AEVEAARENFWKNRFLAVLLSSKLSLETVGEPLDMEQLFDAVNVMILNLKRQFEVLR--- 96

Query: 190 SKLLQAKELLKEYVDR 205
              ++  E +KEYVDR
Sbjct: 97  ---MKDNESIKEYVDR 109


>gi|195028406|ref|XP_001987067.1| GH21711 [Drosophila grimshawi]
 gi|193903067|gb|EDW01934.1| GH21711 [Drosophila grimshawi]
          Length = 1053

 Score = 39.3 bits (90), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 43/89 (48%), Gaps = 15/89 (16%)

Query: 7   PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
           P +AE   N   +YK           +  ++Q+ L  Y+R  +  P F  G++ +   L+
Sbjct: 109 PVLAEAYSNLGNVYK-----------ERGLLQEALDNYRRAVRLKPDFIDGYINLAAALV 157

Query: 67  EGYPSEEDREAIFQAYITALKEDPEQYRI 95
               +  D EA  QAYITAL+ +P+ Y +
Sbjct: 158 ----AARDMEAAVQAYITALQYNPDLYCV 182


>gi|194767414|ref|XP_001965811.1| GF13981 [Drosophila ananassae]
 gi|190625935|gb|EDV41459.1| GF13981 [Drosophila ananassae]
          Length = 396

 Score = 37.4 bits (85), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 42/89 (47%), Gaps = 15/89 (16%)

Query: 7   PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
           P +AE   N   +YK           +   +Q+ L  Y+R  +  P F  G++ +   L+
Sbjct: 116 PVLAEAYSNLGNVYK-----------ERGQLQEALDNYRRAVRLKPDFIDGYINLAAALV 164

Query: 67  EGYPSEEDREAIFQAYITALKEDPEQYRI 95
               +  D E+  QAYITAL+ +PE Y +
Sbjct: 165 ----AARDMESAVQAYITALQYNPELYCV 189


>gi|195382543|ref|XP_002049989.1| GJ20442 [Drosophila virilis]
 gi|194144786|gb|EDW61182.1| GJ20442 [Drosophila virilis]
          Length = 1050

 Score = 37.0 bits (84), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 42/89 (47%), Gaps = 15/89 (16%)

Query: 7   PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
           P +AE   N   +YK           +   +Q+ L  Y+R  +  P F  G++ +   L+
Sbjct: 106 PVLAEAYSNLGNVYK-----------ERGQLQEALDNYRRAVRLKPDFIDGYINLAAALV 154

Query: 67  EGYPSEEDREAIFQAYITALKEDPEQYRI 95
               +  D E+  QAYITAL+ +PE Y +
Sbjct: 155 ----AARDMESAVQAYITALQYNPELYCV 179


>gi|194880104|ref|XP_001974366.1| GG21695 [Drosophila erecta]
 gi|190657553|gb|EDV54766.1| GG21695 [Drosophila erecta]
          Length = 428

 Score = 37.0 bits (84), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 54/128 (42%), Gaps = 10/128 (7%)

Query: 103 WARGQTASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATE-- 160
           W R  T   L     +E E+   L D+ E+A  + +  +S      L  + EL N+    
Sbjct: 291 WIRSCTDQRLCRLNGREDEIRKELHDLEEQALQEESVQHSSQLMYSL-EVEELRNSIRNW 349

Query: 161 ----PTVLEK---LCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
                T LE    +C V  +  + V  DL  Y    +  L   + ++  +D+EK  REER
Sbjct: 350 QERLDTDLENADVMCTVSRLALQKVKDDLKFYMEQKAMYLSRIDEVQAIIDQEKMTREER 409

Query: 214 TEPQKANE 221
             P + NE
Sbjct: 410 VSPCRENE 417


>gi|226940415|ref|YP_002795489.1| peroxiredoxin/glutaredoxin family protein [Laribacter hongkongensis
           HLHK9]
 gi|226715342|gb|ACO74480.1| Probable peroxiredoxin/glutaredoxin family protein [Laribacter
           hongkongensis HLHK9]
          Length = 245

 Score = 36.6 bits (83), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 57/121 (47%), Gaps = 12/121 (9%)

Query: 96  DAQKLEEWARGQTASSLVEFPSKEGEVE---GLLKDIAERASGKGNFSYSRFFAVGLFR- 151
           D   + EWA+ Q ++++V  P   GE     G+L D A+   GK ++ YS     G+ + 
Sbjct: 82  DTFVMNEWAKDQESANIVMVPDGNGEFTEGMGMLVDKADLGFGKRSWRYSMLVKDGVVQK 141

Query: 152 -LLELANATEP---TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLK----EYV 203
             +E     +P   +  + + A +N N +  D+ +   ++      +AKELL     +Y+
Sbjct: 142 MFIEPQEPGDPFKVSDADTMLAYINPNAKKPDQVVVFSKDGCPFCAKAKELLSGKGYDYI 201

Query: 204 D 204
           D
Sbjct: 202 D 202


>gi|444321392|ref|XP_004181352.1| hypothetical protein TBLA_0F02940 [Tetrapisispora blattae CBS 6284]
 gi|387514396|emb|CCH61833.1| hypothetical protein TBLA_0F02940 [Tetrapisispora blattae CBS 6284]
          Length = 2621

 Score = 36.6 bits (83), Expect = 8.6,   Method: Composition-based stats.
 Identities = 18/58 (31%), Positives = 32/58 (55%), Gaps = 5/58 (8%)

Query: 180  RDLDVYRNLLSKLLQAKELLKEYVDREKKKRE-----ERTEPQKANEAIKKCLGEYLY 232
            + L ++ +++ +  Q  E+ K+Y   +K+K       ERT P + NEA+KK   E +Y
Sbjct: 1996 KSLKIFDDMIKQFTQTSEISKKYSASDKEKSSSDILYERTSPPEMNEALKKIFEEGIY 2053


>gi|227496429|ref|ZP_03926715.1| recombination factor protein RarA [Actinomyces urogenitalis DSM
           15434]
 gi|226834048|gb|EEH66431.1| recombination factor protein RarA [Actinomyces urogenitalis DSM
           15434]
          Length = 455

 Score = 36.6 bits (83), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 1/39 (2%)

Query: 42  MRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQ 80
           + Y   YQYDP  A GF    D + EGYP  E+REA ++
Sbjct: 388 LGYGEGYQYDPDTAEGFSGA-DYMPEGYPPREEREAFYE 425


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.134    0.375 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,506,343,023
Number of Sequences: 23463169
Number of extensions: 141391676
Number of successful extensions: 556381
Number of sequences better than 100.0: 269
Number of HSP's better than 100.0 without gapping: 202
Number of HSP's successfully gapped in prelim test: 67
Number of HSP's that attempted gapping in prelim test: 555841
Number of HSP's gapped (non-prelim): 328
length of query: 235
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 97
effective length of database: 9,121,278,045
effective search space: 884763970365
effective search space used: 884763970365
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 74 (33.1 bits)