BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 025486
(252 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|356543780|ref|XP_003540338.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Glycine max]
Length = 297
Score = 367 bits (943), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 176/231 (76%), Positives = 205/231 (88%), Gaps = 1/231 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA+LTS++F+++ S +RKV +SSTR L S+ E F FRT H V RAS+S S+M++
Sbjct: 1 MAALTSLSFSAVTHCS-ERKVTLSSTRFLASSSEIFGFRTDFSYHYVGVRASNSPSKMVV 59
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
QCMS+ATDVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFAL
Sbjct: 60 QCMSSATDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFAL 119
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVT+YD+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q +SLVEF
Sbjct: 120 GFVTIYDKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQKPTSLVEFS 179
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
SKEGEVEG+LKDIAERA GKG FSYSRFFAVGLFRLLELANATEPT+L+K+
Sbjct: 180 SKEGEVEGILKDIAERAGGKGEFSYSRFFAVGLFRLLELANATEPTILDKL 230
>gi|356549970|ref|XP_003543363.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Glycine max]
Length = 297
Score = 363 bits (932), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 173/231 (74%), Positives = 205/231 (88%), Gaps = 1/231 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA+LTS++F+++ S +RKV +SSTR L S+ E F FRT H V RAS+S+S+M++
Sbjct: 1 MAALTSLSFSAVTHCS-ERKVTLSSTRFLASSSELFGFRTDFSYHYVGVRASNSASKMVV 59
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
QCMS+ATDVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFAL
Sbjct: 60 QCMSSATDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFAL 119
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVT+YD+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q+ +SLVEF
Sbjct: 120 GFVTIYDKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQSPTSLVEFS 179
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
SKEGE E +LKDIAERA GKG FSYSRFFAVGLFRL+ELANATEPT+L+K+
Sbjct: 180 SKEGEAERILKDIAERAGGKGEFSYSRFFAVGLFRLVELANATEPTILDKL 230
>gi|449438054|ref|XP_004136805.1| PREDICTED: protein THYLAKOID FORMATION 1, chloroplastic-like
[Cucumis sativus]
gi|449493105|ref|XP_004159194.1| PREDICTED: protein THYLAKOID FORMATION 1, chloroplastic-like
[Cucumis sativus]
Length = 298
Score = 363 bits (932), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 175/231 (75%), Positives = 201/231 (87%), Gaps = 2/231 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA++ S++F+++ Q S R++ + S+RS SNF GF FRTS+F H R RAS+ SSRM+I
Sbjct: 1 MAAVNSISFSTLNQCS-DRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVI 59
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
CMS TDV TVAETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTY+YDPVFAL
Sbjct: 60 HCMSAGTDVT-TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFAL 118
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LMEGYPS+EDREAIFQAYI AL EDPEQYRIDA+K EEWAR QTA+SLVEF
Sbjct: 119 GFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFA 178
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
S+EGEVE +LKDIAERA KGNFSYSRFFA+GLFRLLELANATEP++LEK+
Sbjct: 179 SREGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKL 229
>gi|255636566|gb|ACU18621.1| unknown [Glycine max]
Length = 297
Score = 362 bits (930), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 173/231 (74%), Positives = 205/231 (88%), Gaps = 1/231 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA+LTS++F+++ S +RKV +SSTR L S+ E F FRT H V RAS+S+S+M++
Sbjct: 1 MAALTSLSFSAVIHCS-ERKVTLSSTRFLASSSELFGFRTDFSYHYVGVRASNSASKMVV 59
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
QCMS+ATDVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFAL
Sbjct: 60 QCMSSATDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFAL 119
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVT+YD+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q+ +SLVEF
Sbjct: 120 GFVTIYDKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQSPTSLVEFS 179
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
SKEGE E +LKDIAERA GKG FSYSRFFAVGLFRL+ELANATEPT+L+K+
Sbjct: 180 SKEGEAERILKDIAERAGGKGEFSYSRFFAVGLFRLVELANATEPTILDKL 230
>gi|255553917|ref|XP_002517999.1| Protein THYLAKOID FORMATION1, chloroplast precursor, putative
[Ricinus communis]
gi|223542981|gb|EEF44517.1| Protein THYLAKOID FORMATION1, chloroplast precursor, putative
[Ricinus communis]
Length = 299
Score = 358 bits (920), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 175/233 (75%), Positives = 203/233 (87%), Gaps = 3/233 (1%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLF-CHCVRFRAS-SSSSRM 58
MA++TSV+F++I Q+ +RK S +RS SNF+ FRF F CH V +AS SSSSRM
Sbjct: 1 MAAVTSVSFSAIAQSFTERKSVASWSRSFSSNFDSFRFPVGGFSCHYVGVKASNSSSSRM 60
Query: 59 IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVF 118
+I CMSTATDVPPTV+ETK NFL YK+PIPSIYNTVLQELIVQQHLMRYKR+Y+YDPVF
Sbjct: 61 LIHCMSTATDVPPTVSETKFNFLNSYKKPIPSIYNTVLQELIVQQHLMRYKRSYRYDPVF 120
Query: 119 ALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE 178
ALGFVTVYD+LM+GYPS+EDREAIFQAYI AL E+PEQYRIDA+KLE+WAR QT SSLV+
Sbjct: 121 ALGFVTVYDQLMQGYPSDEDREAIFQAYINALNEEPEQYRIDAKKLEDWARSQTPSSLVD 180
Query: 179 FPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
F SKEGEVEG+LKDIAERA G G+FSYSRFFA+GLFRLLEL+N+TEPTVLEK+
Sbjct: 181 FSSKEGEVEGILKDIAERA-GNGSFSYSRFFAIGLFRLLELSNSTEPTVLEKL 232
>gi|224124656|ref|XP_002319386.1| predicted protein [Populus trichocarpa]
gi|222857762|gb|EEE95309.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 358 bits (919), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 173/238 (72%), Positives = 200/238 (84%), Gaps = 1/238 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
M ++TS++F+++ Q+S +R VSS S +FEGFR R+ CH V +AS+S+SR I
Sbjct: 1 MTAITSLSFSAVNQSSSERNTFVSSLASRNLSFEGFRLRSVFSCHYVGVKASNSTSRTAI 60
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
CMST+TDVPPTV+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY YDPVF L
Sbjct: 61 HCMSTSTDVPPTVSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYLYDPVFGL 120
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
G VTVYD+LMEGYPS+EDREAIFQAYI ALKEDPEQYRIDA+KLEEWAR QT SSLV+F
Sbjct: 121 GLVTVYDQLMEGYPSDEDREAIFQAYIKALKEDPEQYRIDAKKLEEWARAQTHSSLVDFS 180
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDN 238
SKEGE+EG+LK IAERA+ GNFSYSRFFAVGLFRLLEL+NA+EPTVLEK+ S N
Sbjct: 181 SKEGEIEGILKGIAERAAS-GNFSYSRFFAVGLFRLLELSNASEPTVLEKLCSALNIN 237
>gi|359485791|ref|XP_002275686.2| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Vitis
vinifera]
Length = 299
Score = 356 bits (913), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 176/235 (74%), Positives = 207/235 (88%), Gaps = 2/235 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA++TS++F+++GQ+S +RKV V +TRS S FE FRFR + + VR +SSSSSRM++
Sbjct: 1 MAAVTSLSFSALGQSS-ERKVPVPTTRSFASAFEAFRFRANFYAVGVRSSSSSSSSRMVV 59
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
QCMS+ TDVP TV+ETKMNFLK YKRPIPSIYNT+LQEL+VQQHLMRYKRTY+YD VFAL
Sbjct: 60 QCMSSVTDVP-TVSETKMNFLKNYKRPIPSIYNTLLQELMVQQHLMRYKRTYRYDAVFAL 118
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LM+GYPS+EDR+ IFQ YI AL+EDPEQYR DAQ LEEWAR QTASSLVEF
Sbjct: 119 GFVTVYDQLMDGYPSDEDRDIIFQVYIKALREDPEQYRKDAQMLEEWARSQTASSLVEFS 178
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTF 235
SKEGEVEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANATEPT+LEK+ + F
Sbjct: 179 SKEGEVEGILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPTILEKLCAAF 233
>gi|224146717|ref|XP_002326111.1| predicted protein [Populus trichocarpa]
gi|222862986|gb|EEF00493.1| predicted protein [Populus trichocarpa]
Length = 296
Score = 349 bits (895), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 174/238 (73%), Positives = 205/238 (86%), Gaps = 3/238 (1%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA++T V+F++I Q+S R+ + R+L FEGFRFR+S CH V RAS+S+SRM+I
Sbjct: 1 MAAVTPVSFSAISQSSSDRRAFCTVARNL--GFEGFRFRSSFSCHYVGVRASNSTSRMVI 58
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
CMST+TDVPPTVA+TK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YK+T++YDPVF L
Sbjct: 59 HCMSTSTDVPPTVADTKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKKTFRYDPVFGL 118
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LMEGYPS+EDREAIFQAYI AL+EDPEQYRIDA+KLEEWAR QT SSLV+F
Sbjct: 119 GFVTVYDQLMEGYPSDEDREAIFQAYIKALEEDPEQYRIDAKKLEEWARAQTPSSLVDFS 178
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDN 238
S+EGE+EG LKDIAER + GNFSYSRFFAVGLFRLLEL+NA+EPTVLEK+ S N
Sbjct: 179 SREGEIEGTLKDIAERVAS-GNFSYSRFFAVGLFRLLELSNASEPTVLEKLCSALNIN 235
>gi|388506988|gb|AFK41560.1| unknown [Medicago truncatula]
Length = 287
Score = 337 bits (865), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 163/231 (70%), Positives = 194/231 (83%), Gaps = 2/231 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA+LTS++F++ Q S QRK +SSTR L S+ + F RT HCV R +S+S+M+I
Sbjct: 1 MAALTSLSFSATSQCS-QRKSTLSSTRFLASSSDMFGIRTDSSYHCVGVRVGNSASKMVI 59
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
QCMS+ TDVP +V+ETK+NFLK YKRPIPSIYN VLQELIVQ HLMRYK +YQYD VFAL
Sbjct: 60 QCMSSVTDVP-SVSETKLNFLKAYKRPIPSIYNNVLQELIVQHHLMRYKTSYQYDSVFAL 118
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LMEGY SEE+R+ IF+AYI ALKEDPEQYRIDA+KLE+WA+ Q + SLVEF
Sbjct: 119 GFVTVYDKLMEGYSSEEERDTIFKAYINALKEDPEQYRIDAKKLEDWAKAQNSISLVEFS 178
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
S+EGEVEG+LKDIA+RA KG FSYSRFFAVGLFRLLELANATEPT+L+K+
Sbjct: 179 SREGEVEGVLKDIAKRAGEKGEFSYSRFFAVGLFRLLELANATEPTILDKL 229
>gi|356542877|ref|XP_003539891.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Glycine max]
Length = 291
Score = 334 bits (857), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 158/228 (69%), Positives = 193/228 (84%), Gaps = 6/228 (2%)
Query: 4 LTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMIIQCM 63
++S +F+++ Q+S QR + +L SN FRFR C V RAS+S+S+M+++C
Sbjct: 5 ISSFSFSTLTQSSSQRNLT-----TLSSNSPTFRFRVGFSCLNVGVRASNSASKMVVRCS 59
Query: 64 STATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFV 123
S+ + PPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKR+Y+YD VFALGFV
Sbjct: 60 SSVAE-PPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRSYRYDAVFALGFV 118
Query: 124 TVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE 183
TVY++LMEGYPS+EDR+AIFQAYI ALKEDPEQYR+DA+KLEEWAR Q +SL+EF S+E
Sbjct: 119 TVYEQLMEGYPSDEDRDAIFQAYIQALKEDPEQYRVDAKKLEEWARSQNPNSLLEFSSRE 178
Query: 184 GEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
GEVEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANA EPT+LEK+
Sbjct: 179 GEVEGILKDIAERAGGKGDFSYSRFFAIGLFRLLELANAMEPTILEKL 226
>gi|217073200|gb|ACJ84959.1| unknown [Medicago truncatula]
Length = 287
Score = 334 bits (856), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 162/231 (70%), Positives = 193/231 (83%), Gaps = 2/231 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA+LTS++F++ Q S QRK +SSTR L S+ + F RT HCV R +S+S+M+I
Sbjct: 1 MAALTSLSFSATSQCS-QRKSTLSSTRFLASSSDMFGIRTDSSYHCVGVRVGNSASKMVI 59
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
QCMS+ TDVP +V+ETK+NFLK YKRPIPSIYN VLQELIVQ HLMRYK +YQYD VFAL
Sbjct: 60 QCMSSVTDVP-SVSETKLNFLKAYKRPIPSIYNNVLQELIVQHHLMRYKTSYQYDSVFAL 118
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LMEGY SEE+R+ IF+AYI ALKEDPEQYRIDA+KLE+WA+ Q + SLVEF
Sbjct: 119 GFVTVYDKLMEGYSSEEERDTIFKAYINALKEDPEQYRIDAKKLEDWAKAQNSISLVEFS 178
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
S+E EVEG+LKDIA+RA KG FSYSRFFAVGLFRLLELANATEPT+L+K+
Sbjct: 179 SREREVEGVLKDIAKRAGEKGEFSYSRFFAVGLFRLLELANATEPTILDKL 229
>gi|388496070|gb|AFK36101.1| unknown [Lotus japonicus]
Length = 298
Score = 333 bits (855), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 161/232 (69%), Positives = 198/232 (85%), Gaps = 3/232 (1%)
Query: 1 MASLTS-VAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMI 59
MA++TS +F+++ Q+S QRK +SST L SN +G RFR S C V +AS+S+S+++
Sbjct: 1 MATVTSSFSFSALTQSS-QRKHTLSSTCVLGSNSDGIRFRKSFLCQQVGVQASNSASKLV 59
Query: 60 IQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFA 119
++C S+ +D PP V+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMR+KR+Y+YDPVFA
Sbjct: 60 VRCSSSVSD-PPPVSETKLNFLKEYKRPIPSIYNTVLQELIVQQHLMRFKRSYRYDPVFA 118
Query: 120 LGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
LGFVTVY++LMEGYPS+EDR+AIFQ YI ALKEDP QYR DAQKLEEWAR Q+++SL+EF
Sbjct: 119 LGFVTVYEQLMEGYPSDEDRDAIFQTYIKALKEDPGQYREDAQKLEEWARTQSSTSLIEF 178
Query: 180 PSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
S+EGEVEG LKDIAERA GKG+FSYSRFFA+GLFRLLEL NA EP +LEK+
Sbjct: 179 SSREGEVEGALKDIAERAGGKGDFSYSRFFAIGLFRLLELGNAMEPAILEKL 230
>gi|356517586|ref|XP_003527468.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Glycine max]
Length = 291
Score = 333 bits (853), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 160/232 (68%), Positives = 196/232 (84%), Gaps = 7/232 (3%)
Query: 1 MASLTS-VAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMI 59
MA++TS +F+++ Q+S R ++ +L SN FR R CH V RAS+S+S+++
Sbjct: 1 MATVTSSFSFSTLTQSSSPRNLS-----TLSSNSPTFRIRVGFSCHYVGVRASNSASKIV 55
Query: 60 IQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFA 119
++C S+A PPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKR+Y+YD VFA
Sbjct: 56 VRC-SSAVAEPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRSYRYDAVFA 114
Query: 120 LGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
LGFVTVY++LMEGYPS+EDR+AIFQAYI ALKEDPEQYR+DA+KLEEWAR Q +SLV+F
Sbjct: 115 LGFVTVYEQLMEGYPSDEDRDAIFQAYIQALKEDPEQYRVDAKKLEEWARAQNPTSLVDF 174
Query: 180 PSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
S+EGEVEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANA EPT+LEK+
Sbjct: 175 SSREGEVEGILKDIAERAGGKGDFSYSRFFAIGLFRLLELANAMEPTILEKL 226
>gi|388514959|gb|AFK45541.1| unknown [Medicago truncatula]
Length = 303
Score = 327 bits (837), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 163/254 (64%), Positives = 203/254 (79%), Gaps = 4/254 (1%)
Query: 1 MASLTS-VAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRF-RASSSSSRM 58
MA++TS +F+++ Q+S ++ +SSTR S+ + RFR S+ V R S+ S++
Sbjct: 1 MAAVTSSFSFSTLTQSSQRKLTTLSSTRFFGSDSDAVRFRFSISLRYVGVVRTSNFVSKL 60
Query: 59 IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVF 118
+++C S+ +D PPTV+ETK+NFLK YKRPIPSIYN+VLQELIVQQHLMRYK++Y+YDPVF
Sbjct: 61 VVRCSSSVSD-PPTVSETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKKSYRYDPVF 119
Query: 119 ALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE 178
ALGFVTVYD+LMEGYPS+EDR+AIFQAYI ALKEDP QYR+DAQKLEEWAR Q A+SL+E
Sbjct: 120 ALGFVTVYDQLMEGYPSDEDRDAIFQAYINALKEDPAQYRVDAQKLEEWARAQNATSLIE 179
Query: 179 FPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDN 238
F S+EGEVEG LKDIAERA G G+FSYSRFFAVGLFRLLELAN EPT+LEK+ S N
Sbjct: 180 FSSREGEVEGTLKDIAERAGGNGDFSYSRFFAVGLFRLLELANTMEPTILEKLCSALNIN 239
Query: 239 IGTSIIHGEDPFRS 252
S+ D +R+
Sbjct: 240 -KKSVDRDLDVYRN 252
>gi|296084957|emb|CBI28372.3| unnamed protein product [Vitis vinifera]
Length = 243
Score = 311 bits (797), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 151/196 (77%), Positives = 172/196 (87%), Gaps = 4/196 (2%)
Query: 58 MIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPV 117
M++QCMS+ TDVP TV+ETKMNFLK YKRPIPSIYNT+LQEL+VQQHLMRYKRTY+YD V
Sbjct: 1 MVVQCMSSVTDVP-TVSETKMNFLKNYKRPIPSIYNTLLQELMVQQHLMRYKRTYRYDAV 59
Query: 118 FALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV 177
FALGFVTVYD+LM+GYPS+EDR+ IFQ YI AL+EDPEQYR DAQ LEEWAR QTASSLV
Sbjct: 60 FALGFVTVYDQLMDGYPSDEDRDIIFQVYIKALREDPEQYRKDAQMLEEWARSQTASSLV 119
Query: 178 EFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFD 237
EF SKEGEVEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANATEPT+LEK+ + F
Sbjct: 120 EFSSKEGEVEGILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPTILEKLCAAF-- 177
Query: 238 NIGT-SIIHGEDPFRS 252
NI S+ D +R+
Sbjct: 178 NISKRSVDRDLDVYRN 193
>gi|125558787|gb|EAZ04323.1| hypothetical protein OsI_26464 [Oryza sativa Indica Group]
Length = 287
Score = 310 bits (795), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 142/176 (80%), Positives = 162/176 (92%)
Query: 56 SRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYD 115
SR +++C++TA DVPPTVAETKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYD
Sbjct: 44 SRSVVRCVATAGDVPPTVAETKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKTTYQYD 103
Query: 116 PVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASS 175
VFALGFVTVYD+LMEGYPS EDR+AIF+AYITAL EDPEQYR DAQK+EEWAR Q +S
Sbjct: 104 AVFALGFVTVYDQLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGNS 163
Query: 176 LVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
LVEF SK+GE+E +LKDI+ERA GKG+FSYSRFFAVGLFRLLELANATEPT+L+K+
Sbjct: 164 LVEFSSKDGEIEAILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKL 219
>gi|75140959|sp|Q7XAB8.1|THF1_SOLTU RecName: Full=Protein THYLAKOID FORMATION1, chloroplastic; Flags:
Precursor
gi|33469614|gb|AAQ19850.1| light-regulated chloroplast-localized protein [Solanum tuberosum]
Length = 293
Score = 309 bits (791), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 153/231 (66%), Positives = 189/231 (81%), Gaps = 5/231 (2%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA++TSV+F++I Q S +RK +VSS+RS+ + FRFR++ V R+S+S+SR ++
Sbjct: 1 MAAVTSVSFSAITQ-SAERKSSVSSSRSI----DTFRFRSNFSFDSVNVRSSNSTSRFVV 55
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
C S++ PTVA+TK+ FL YKRPIP++YNTVLQELIVQQHL RYK++YQYDPVFAL
Sbjct: 56 HCTSSSAADLPTVADTKLKFLTAYKRPIPTVYNTVLQELIVQQHLTRYKKSYQYDPVFAL 115
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LMEGYPSEEDR AIF+AYI ALKEDPEQYR DAQKLEEWAR Q A++LV+F
Sbjct: 116 GFVTVYDQLMEGYPSEEDRNAIFKAYIEALKEDPEQYRADAQKLEEWARTQNANTLVDFS 175
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
SKEGE+E + KDIA+RA K F YSR FAVGLFRLLELAN T+PT+LEK+
Sbjct: 176 SKEGEIENIFKDIAQRAGTKDGFCYSRLFAVGLFRLLELANVTDPTILEKL 226
>gi|115472755|ref|NP_001059976.1| Os07g0558500 [Oryza sativa Japonica Group]
gi|75147522|sp|Q84PB7.1|THF1_ORYSJ RecName: Full=Protein THYLAKOID FORMATION1, chloroplastic; Flags:
Precursor
gi|29367385|gb|AAO72565.1| inositol phosphatase-like protein [Oryza sativa Japonica Group]
gi|34394010|dbj|BAC84034.1| inositol phosphatase-like protein [Oryza sativa Japonica Group]
gi|113611512|dbj|BAF21890.1| Os07g0558500 [Oryza sativa Japonica Group]
gi|125600704|gb|EAZ40280.1| hypothetical protein OsJ_24722 [Oryza sativa Japonica Group]
gi|215694285|dbj|BAG89278.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 287
Score = 306 bits (784), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 141/176 (80%), Positives = 161/176 (91%)
Query: 56 SRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYD 115
SR +++C++TA DVPPTVAETKMNFLK YKRPI SIY+TVLQEL+VQQHLMRYK TYQYD
Sbjct: 44 SRSVVRCVATAGDVPPTVAETKMNFLKSYKRPILSIYSTVLQELLVQQHLMRYKTTYQYD 103
Query: 116 PVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASS 175
VFALGFVTVYD+LMEGYPS EDR+AIF+AYITAL EDPEQYR DAQK+EEWAR Q +S
Sbjct: 104 AVFALGFVTVYDQLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGNS 163
Query: 176 LVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
LVEF SK+GE+E +LKDI+ERA GKG+FSYSRFFAVGLFRLLELANATEPT+L+K+
Sbjct: 164 LVEFSSKDGEIEAILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKL 219
>gi|52548246|gb|AAU82110.1| chloroplast inositol phosphatase-like protein [Triticum aestivum]
Length = 286
Score = 305 bits (782), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 137/176 (77%), Positives = 162/176 (92%)
Query: 56 SRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYD 115
SR +++C++TA D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYD
Sbjct: 43 SRSVVRCVATAGDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYD 102
Query: 116 PVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASS 175
PVFALGFVTVYD+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q +
Sbjct: 103 PVFALGFVTVYDQLMEGYPSTEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGNL 162
Query: 176 LVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
LVEF S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+K+
Sbjct: 163 LVEFSSRDGEIESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKL 218
>gi|38570261|gb|AAR24582.1| chloroplast-localized Ptr ToxA-binding protein1 [Triticum aestivum]
gi|81239115|gb|ABB60085.1| chloroplast-localized Ptr ToxA-binding protein1 [Triticum aestivum]
Length = 286
Score = 305 bits (782), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 137/176 (77%), Positives = 162/176 (92%)
Query: 56 SRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYD 115
SR +++C++TA D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYD
Sbjct: 43 SRSVVRCVATAGDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYD 102
Query: 116 PVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASS 175
PVFALGFVTVYD+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q +
Sbjct: 103 PVFALGFVTVYDQLMEGYPSTEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGNL 162
Query: 176 LVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
LVEF S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+K+
Sbjct: 163 LVEFSSRDGEIESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKL 218
>gi|357122407|ref|XP_003562907.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Brachypodium distachyon]
Length = 286
Score = 305 bits (781), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 138/176 (78%), Positives = 159/176 (90%)
Query: 56 SRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYD 115
SR +++C++ D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYD
Sbjct: 43 SRSVVRCVAATADIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYD 102
Query: 116 PVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASS 175
PVFALGFVTVYD+LMEGYPS EDR+AIF++YITAL EDPEQYR DAQK+EEWAR Q S
Sbjct: 103 PVFALGFVTVYDQLMEGYPSNEDRDAIFKSYITALNEDPEQYRADAQKMEEWARAQNGSL 162
Query: 176 LVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
LVEF S++GE+E +LKDI+ERA G GNFSYSRFFAVGLFRLLELANATEPTVL+K+
Sbjct: 163 LVEFSSRDGEIEAVLKDISERAQGNGNFSYSRFFAVGLFRLLELANATEPTVLDKL 218
>gi|326493802|dbj|BAJ85363.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 286
Score = 305 bits (780), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 137/176 (77%), Positives = 161/176 (91%)
Query: 56 SRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYD 115
SR +++C+ TA D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYD
Sbjct: 43 SRSVVRCVITAGDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYD 102
Query: 116 PVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASS 175
PVFALGFVTVYD+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q +
Sbjct: 103 PVFALGFVTVYDQLMEGYPSNEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGNL 162
Query: 176 LVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
LVEF S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+K+
Sbjct: 163 LVEFSSRDGEIESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKL 218
>gi|242050546|ref|XP_002463017.1| hypothetical protein SORBIDRAFT_02g036270 [Sorghum bicolor]
gi|241926394|gb|EER99538.1| hypothetical protein SORBIDRAFT_02g036270 [Sorghum bicolor]
Length = 284
Score = 303 bits (777), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 139/170 (81%), Positives = 157/170 (92%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
C++TA DVPPTVAETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYDPVF LG
Sbjct: 47 CVATAGDVPPTVAETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDPVFGLG 106
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
FVTVYD+LMEGYPS EDR++IF+AYITAL EDP QYR DA K+EEWAR Q ASSLV+F S
Sbjct: 107 FVTVYDQLMEGYPSNEDRDSIFRAYITALNEDPTQYRADALKMEEWARSQNASSLVDFSS 166
Query: 182 KEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+K+
Sbjct: 167 RDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKL 216
>gi|157142955|gb|ABV24460.1| chloroplast-localized protein [Nicotiana benthamiana]
Length = 295
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 152/236 (64%), Positives = 190/236 (80%), Gaps = 13/236 (5%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSR--- 57
MA++TSV+F++I Q S +RK +VS +RS+ + FRFR+++ C R+S+SS
Sbjct: 1 MAAVTSVSFSAIAQ-SAERKSSVSPSRSV----DTFRFRSNVSFDCFNVRSSNSSFSRST 55
Query: 58 --MIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYD 115
++ CMST PTVAETKMNFLK YKRPIP++YNTVLQELIVQQHL++YK++Y+YD
Sbjct: 56 SRFVVHCMSTDL---PTVAETKMNFLKAYKRPIPTVYNTVLQELIVQQHLIKYKKSYRYD 112
Query: 116 PVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASS 175
PVFALGFVTVYD+LMEGYPSEEDR+AIF+AYI AL EDP QYR DAQK EEWAR Q A++
Sbjct: 113 PVFALGFVTVYDQLMEGYPSEEDRDAIFKAYIEALNEDPVQYRADAQKFEEWARTQNANT 172
Query: 176 LVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
LV+F S++GEVE +LKDIA+RA K +F YSR FAVGLFRLLELAN T+PT+LEK+
Sbjct: 173 LVDFSSRDGEVENILKDIAQRAGTKDSFCYSRLFAVGLFRLLELANVTDPTILEKL 228
>gi|397702097|gb|AFO59570.1| chloroplast Ptr ToxA-binding protein [Saccharum hybrid cultivar
GT28]
Length = 284
Score = 300 bits (769), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 136/170 (80%), Positives = 156/170 (91%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
C++TA DVPPTV+ETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYDPVF LG
Sbjct: 47 CVATAGDVPPTVSETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDPVFGLG 106
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
FVTVYD+LMEGYPS EDR++IF+ YITAL EDP+QYR DA K+EEWAR Q SSLV+F S
Sbjct: 107 FVTVYDQLMEGYPSNEDRDSIFRTYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSS 166
Query: 182 KEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+K+
Sbjct: 167 RDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKL 216
>gi|293333399|ref|NP_001168867.1| uncharacterized protein LOC100382672 [Zea mays]
gi|223973419|gb|ACN30897.1| unknown [Zea mays]
Length = 284
Score = 300 bits (767), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 151/231 (65%), Positives = 178/231 (77%), Gaps = 15/231 (6%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MAS++S+ F + + R SST VS R R S S ++
Sbjct: 1 MASISSLPFAPLRGAAEWRP---SSTAPAVSG---------AVVLSARSRRGSGS---VV 45
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
+C++TA+DVPPTV ETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYD VFAL
Sbjct: 46 RCVATASDVPPTVGETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDAVFAL 105
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LMEGYPS EDR++IF+AYITAL EDP QYR DA K+E WAR Q SSLV+F
Sbjct: 106 GFVTVYDQLMEGYPSIEDRDSIFKAYITALNEDPNQYRADALKMEGWARSQNGSSLVDFS 165
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+K+
Sbjct: 166 SRDGEIESILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKL 216
>gi|21592994|gb|AAM64943.1| unknown [Arabidopsis thaliana]
gi|58761181|gb|AAW82331.1| chloroplast thylakoid formation 1 [Arabidopsis thaliana]
Length = 300
Score = 299 bits (766), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 149/234 (63%), Positives = 182/234 (77%), Gaps = 18/234 (7%)
Query: 3 SLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFR----ASSSSSRM 58
+++S++F ++GQ+ +SNF R S C +F S S+S+
Sbjct: 5 AISSLSFPALGQSD------------KISNFASSRPLASAIRICTKFSRLSLNSRSTSKS 52
Query: 59 IIQCMSTAT-DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPV 117
+I CMS T DVPP V+ETK FLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPV
Sbjct: 53 LIHCMSNVTADVPP-VSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPV 111
Query: 118 FALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV 177
FALGFVTVYD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV
Sbjct: 112 FALGFVTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLV 171
Query: 178 EFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
+F SKEG++E +LKDIA RA K FSYSRFFAVGLFRLLELA+AT+PTVL+K+
Sbjct: 172 DFSSKEGDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKL 225
>gi|18399513|ref|NP_565491.1| protein THYLAKOID FORMATION 1 [Arabidopsis thaliana]
gi|75206547|sp|Q9SKT0.1|THF1_ARATH RecName: Full=Protein THYLAKOID FORMATION 1, chloroplastic; Flags:
Precursor
gi|4454459|gb|AAD20906.1| expressed protein [Arabidopsis thaliana]
gi|17065446|gb|AAL32877.1| Unknown protein [Arabidopsis thaliana]
gi|20148535|gb|AAM10158.1| unknown protein [Arabidopsis thaliana]
gi|330251998|gb|AEC07092.1| protein THYLAKOID FORMATION 1 [Arabidopsis thaliana]
Length = 300
Score = 299 bits (765), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 149/234 (63%), Positives = 182/234 (77%), Gaps = 18/234 (7%)
Query: 3 SLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFR----ASSSSSRM 58
+++S++F ++GQ+ +SNF R S C +F S S+S+
Sbjct: 5 AISSLSFPALGQSD------------KISNFASSRPLASAIRICTKFSRLSLNSRSTSKS 52
Query: 59 IIQCMSTAT-DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPV 117
+I CMS T DVPP V+ETK FLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPV
Sbjct: 53 LIHCMSNVTADVPP-VSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPV 111
Query: 118 FALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV 177
FALGFVTVYD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV
Sbjct: 112 FALGFVTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLV 171
Query: 178 EFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
+F SKEG++E +LKDIA RA K FSYSRFFAVGLFRLLELA+AT+PTVL+K+
Sbjct: 172 DFSSKEGDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKL 225
>gi|212720892|ref|NP_001131923.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
gi|194692932|gb|ACF80550.1| unknown [Zea mays]
gi|195644742|gb|ACG41839.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
gi|414887096|tpg|DAA63110.1| TPA: chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
Length = 284
Score = 295 bits (756), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 133/170 (78%), Positives = 155/170 (91%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
C++T DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFALG
Sbjct: 47 CVATTGDVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFALG 106
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
FVTVYD+LMEGYPS EDR++IF+AYITAL EDP+QYR DA K+EEWAR Q SSLV+F S
Sbjct: 107 FVTVYDQLMEGYPSNEDRDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSS 166
Query: 182 KEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPT+L+K+
Sbjct: 167 RDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLELSNATEPTILDKL 216
>gi|217072610|gb|ACJ84665.1| unknown [Medicago truncatula]
gi|388509564|gb|AFK42848.1| unknown [Medicago truncatula]
Length = 219
Score = 295 bits (756), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 144/219 (65%), Positives = 179/219 (81%), Gaps = 3/219 (1%)
Query: 1 MASLTS-VAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRF-RASSSSSRM 58
MA++TS +F+++ Q+S ++ +SSTR S+ + RFR S+ V R S+ S++
Sbjct: 1 MAAVTSSFSFSTLTQSSQRKLTTLSSTRFFGSDSDAVRFRFSISLRYVGVVRTSNFVSKL 60
Query: 59 IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVF 118
+++C S+ +D PPTV+ETK+NFLK YKRPIPSIYN+VLQELIVQQHLMRYK++Y+YDPVF
Sbjct: 61 VVRCSSSVSD-PPTVSETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKKSYRYDPVF 119
Query: 119 ALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE 178
ALGFVTVYD+LMEGYPS+EDR+AIFQAYI ALKEDP QYR+DAQKLEEWAR Q A+SL+E
Sbjct: 120 ALGFVTVYDQLMEGYPSDEDRDAIFQAYINALKEDPAQYRVDAQKLEEWARAQNATSLIE 179
Query: 179 FPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL 217
F S+E EVEG LKDIAERA G G+FSYSRFFAVG F L
Sbjct: 180 FSSREREVEGTLKDIAERAGGNGDFSYSRFFAVGFFDFL 218
>gi|297832696|ref|XP_002884230.1| hypothetical protein ARALYDRAFT_900469 [Arabidopsis lyrata subsp.
lyrata]
gi|297330070|gb|EFH60489.1| hypothetical protein ARALYDRAFT_900469 [Arabidopsis lyrata subsp.
lyrata]
Length = 298
Score = 295 bits (755), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 137/172 (79%), Positives = 158/172 (91%), Gaps = 1/172 (0%)
Query: 60 IQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFA 119
I CMST TDVPP V+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPVFA
Sbjct: 54 IHCMSTVTDVPP-VSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFA 112
Query: 120 LGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
LGFVTVYD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV+F
Sbjct: 113 LGFVTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDF 172
Query: 180 PSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
S++GE+E LLKDIA RA+ K FSYSRFFAVGLFRLLELA+AT+PTVL+K+
Sbjct: 173 SSRQGEIEALLKDIAGRAASKEGFSYSRFFAVGLFRLLELASATDPTVLDKL 224
>gi|195653795|gb|ACG46365.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
Length = 284
Score = 294 bits (753), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 145/231 (62%), Positives = 179/231 (77%), Gaps = 15/231 (6%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA+++S+ F ++ + R SST + VS R R S S ++
Sbjct: 1 MAAISSLPFAALRGAAEWRP---SSTAAAVSG---------AVVLSARSRRGSCS---VV 45
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
+C++T DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFAL
Sbjct: 46 RCVATTGDVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFAL 105
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LME YPS ED+++IF+AYITAL EDP+QYR DA K+EEWAR Q SSLV+F
Sbjct: 106 GFVTVYDQLMERYPSNEDKDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFS 165
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPT+L+K+
Sbjct: 166 SRDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLELSNATEPTILDKL 216
>gi|157849728|gb|ABV89647.1| chloroplast light-regulated protein [Brassica rapa]
Length = 273
Score = 293 bits (750), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 152/231 (65%), Positives = 185/231 (80%), Gaps = 5/231 (2%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA++ S++F ++GQ+ + N + +R L S R S R S+SSS+ +I
Sbjct: 1 MAAIASLSFPALGQSG--KLSNPTPSRPLAS-VSAISRRISRRSLTSS-RPSTSSSKFVI 56
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
CMS+ TDVPP V+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTY+YDPVFAL
Sbjct: 57 HCMSSVTDVPP-VSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFAL 115
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LM+GYPS++DR++IFQAY+ AL E P+QYRIDAQK+EEWAR QT++SLV+F
Sbjct: 116 GFVTVYDQLMDGYPSDQDRDSIFQAYVEALNEVPKQYRIDAQKMEEWARSQTSASLVDFS 175
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
KEGEVE +LKDI+ERA K FSYSRFFAVGLFRLLELA AT+PTVL+K+
Sbjct: 176 FKEGEVEAILKDISERAGSKEGFSYSRFFAVGLFRLLELAGATDPTVLDKL 226
>gi|414887097|tpg|DAA63111.1| TPA: hypothetical protein ZEAMMB73_220735 [Zea mays]
Length = 207
Score = 275 bits (703), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 124/159 (77%), Positives = 143/159 (89%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
C++T DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFALG
Sbjct: 47 CVATTGDVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFALG 106
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
FVTVYD+LMEGYPS EDR++IF+AYITAL EDP+QYR DA K+EEWAR Q SSLV+F S
Sbjct: 107 FVTVYDQLMEGYPSNEDRDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSS 166
Query: 182 KEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA 220
++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLL+ A
Sbjct: 167 RDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLDFA 205
>gi|116782547|gb|ABK22548.1| unknown [Picea sitchensis]
Length = 304
Score = 274 bits (701), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 136/181 (75%), Positives = 155/181 (85%), Gaps = 3/181 (1%)
Query: 53 SSSSRMIIQC-MS-TATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKR 110
+SSS + +C MS TA+D+P TVAETK FLK YKRPIPSIYN V+QELIVQQHLMRYKR
Sbjct: 61 ASSSLSVTRCSMSITASDIP-TVAETKSAFLKAYKRPIPSIYNNVIQELIVQQHLMRYKR 119
Query: 111 TYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG 170
TYQYD VFALGFV+VYD+LM+GYPS+ D EAIF+AYI ALKEDPEQYR DA+KLEEWA
Sbjct: 120 TYQYDAVFALGFVSVYDQLMDGYPSDGDSEAIFRAYINALKEDPEQYRSDAKKLEEWASS 179
Query: 171 QTASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK 230
Q A S+VEF S++GEVEG+LKDIAERA K FSYSRFFA+GLFRLLE ANAT+P VLEK
Sbjct: 180 QDAKSIVEFQSRDGEVEGILKDIAERAREKKIFSYSRFFAIGLFRLLERANATDPVVLEK 239
Query: 231 V 231
+
Sbjct: 240 L 240
>gi|302807588|ref|XP_002985488.1| hypothetical protein SELMODRAFT_122474 [Selaginella moellendorffii]
gi|302810785|ref|XP_002987083.1| hypothetical protein SELMODRAFT_125247 [Selaginella moellendorffii]
gi|300145248|gb|EFJ11926.1| hypothetical protein SELMODRAFT_125247 [Selaginella moellendorffii]
gi|300146694|gb|EFJ13362.1| hypothetical protein SELMODRAFT_122474 [Selaginella moellendorffii]
Length = 206
Score = 248 bits (634), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 115/161 (71%), Positives = 138/161 (85%)
Query: 71 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
PTVA+TK FLK +++PIPSIYN VLQEL+VQQHLMRY TY+YD VFALGFVTVYD+LM
Sbjct: 3 PTVADTKSAFLKAFRKPIPSIYNNVLQELLVQQHLMRYNATYKYDAVFALGFVTVYDQLM 62
Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
+GYP+ +D EAIF+AYI AL EDP+QYR DA+KLEEWA QTASSL F S +G+VE +L
Sbjct: 63 DGYPNAQDSEAIFKAYIEALGEDPDQYRKDAKKLEEWASSQTASSLASFNSGDGDVEEVL 122
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
KDIA+RA+GK +F YSRFFAVGLFRL+E ANA++P VLEK+
Sbjct: 123 KDIAQRAAGKTSFHYSRFFAVGLFRLVERANASDPAVLEKL 163
>gi|168043272|ref|XP_001774109.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674516|gb|EDQ61023.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 215
Score = 248 bits (633), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 110/159 (69%), Positives = 137/159 (86%)
Query: 71 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
PTVA+TK++F+K Y++PIPSIY+ V+QEL+VQQHLMRY TY YDP+FALGFVTVYD+LM
Sbjct: 7 PTVADTKLSFIKSYRKPIPSIYSNVIQELLVQQHLMRYNSTYVYDPIFALGFVTVYDQLM 66
Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
+GYP++EDR+AIF+AYI+AL EDPEQYR D++KLEEWA Q+ S + +F K+GEVE L
Sbjct: 67 DGYPNDEDRDAIFKAYISALNEDPEQYRKDSKKLEEWAAAQSGSGIADFAGKDGEVEAAL 126
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLE 229
KDIAERA+GK F YSRFFA+GLFRLLE A A++P VLE
Sbjct: 127 KDIAERAAGKEKFHYSRFFAIGLFRLLECAKASDPAVLE 165
>gi|168037112|ref|XP_001771049.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677737|gb|EDQ64204.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 205
Score = 235 bits (599), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 106/161 (65%), Positives = 130/161 (80%)
Query: 71 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
PTV+ETK +F+K Y++PIPSIY+ V+QEL+VQQHLMRY TY YDP+FALGFVTVYD+LM
Sbjct: 7 PTVSETKASFIKSYRKPIPSIYSNVIQELLVQQHLMRYNSTYTYDPIFALGFVTVYDQLM 66
Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
+GYP DR++IF AYI AL EDP +YR DA+KLEEWA Q+AS + +F S++GEVE L
Sbjct: 67 DGYPDATDRDSIFTAYINALNEDPVKYREDAKKLEEWASAQSASGITDFTSRDGEVEATL 126
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
K IAERA K F YSRFFA+GLFRLLE A A++P VLE +
Sbjct: 127 KSIAERAGSKDKFHYSRFFAIGLFRLLECAKASDPAVLESL 167
>gi|326492686|dbj|BAJ90199.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 239
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 96/124 (77%), Positives = 112/124 (90%)
Query: 56 SRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYD 115
SR +++C+ TA D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYD
Sbjct: 43 SRSVVRCVITAGDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYD 102
Query: 116 PVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASS 175
PVFALGFVTVYD+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q +
Sbjct: 103 PVFALGFVTVYDQLMEGYPSNEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGNL 162
Query: 176 LVEF 179
LVEF
Sbjct: 163 LVEF 166
>gi|384250113|gb|EIE23593.1| photosystem II biogenesis protein Psp29 [Coccomyxa subellipsoidea
C-169]
Length = 290
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 87/167 (52%), Positives = 121/167 (72%), Gaps = 4/167 (2%)
Query: 70 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
PPTVAETK NF + + RPIP IY+ V+QEL+VQ H+MRY ++Y YD VF LGFV+V+D++
Sbjct: 65 PPTVAETKRNFYEAFSRPIPGIYSNVIQELLVQHHIMRYNKSYSYDEVFGLGFVSVFDQV 124
Query: 130 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG-EVEG 188
+EG P E D+ A+F AYI +L E+ +QYR DA+K+E A+ + + ++ P EG E++
Sbjct: 125 LEGLP-EGDKGALFSAYIGSLGENGDQYRQDAEKVEALAKELSGPAELK-PDAEGSELQK 182
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTF 235
L IAER+S +GNF Y++FFA+GLFRLLEL A +P LE + S
Sbjct: 183 KLASIAERSS-QGNFLYTKFFAIGLFRLLELTGAKDPKALEGLVSAM 228
>gi|159471025|ref|XP_001693657.1| inositol phosphatase-like protein [Chlamydomonas reinhardtii]
gi|158283160|gb|EDP08911.1| inositol phosphatase-like protein [Chlamydomonas reinhardtii]
Length = 266
Score = 166 bits (420), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 92/180 (51%), Positives = 122/180 (67%), Gaps = 3/180 (1%)
Query: 52 SSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRT 111
SS + + + + AT PPTVAETK FL Y +PI SIY+TVLQEL+VQQH MRY +
Sbjct: 24 SSLARKAVATPVLAATFKPPTVAETKAKFLSGYNKPIASIYSTVLQELLVQQHFMRYSKN 83
Query: 112 YQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQ 171
YQY+P+FALGFV+VY++++E S E+R AIF+AY+ AL ED ++Y+ DA LE+ A G
Sbjct: 84 YQYNPIFALGFVSVYEQILESL-SAEERGAIFKAYVDALGEDADKYKRDASALEQAANGL 142
Query: 172 TASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
T SL P+ +G AS G FSY++F A+GLFRLLEL+ A EP+ LEK+
Sbjct: 143 TPESLT--PNADGNEVQKALASISSASAAGAFSYNKFVAIGLFRLLELSGAKEPSALEKL 200
>gi|303286071|ref|XP_003062325.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455842|gb|EEH53144.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 222
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 77/160 (48%), Positives = 105/160 (65%), Gaps = 8/160 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TVA+TK FLK Y PIPSI++ LQEL+V QH +RY + Y Y + +LGFV+VYD+L E
Sbjct: 45 TVADTKQKFLKSYPYPIPSIWSVALQELLVTQHFVRYSKKYSYSKLSSLGFVSVYDQLFE 104
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
G+PSEE++ IF+ ++ AL+EDP R DA +L +A G + V +++ L+
Sbjct: 105 GFPSEEEKNTIFECFVKALEEDPATVRKDAAELASFAEGASGVDGVLASPIFAQMKSLVA 164
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
D G F+YSR+ A+GLFRLLELA ATEP LEK+
Sbjct: 165 D--------GKFAYSRYDAIGLFRLLELAKATEPAALEKL 196
>gi|307108772|gb|EFN57011.1| hypothetical protein CHLNCDRAFT_143677 [Chlorella variabilis]
Length = 273
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 77/174 (44%), Positives = 112/174 (64%), Gaps = 11/174 (6%)
Query: 69 VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 128
PPTVA+ K+ F +K+P+P+IY+TV+QEL+VQQHL R+ + YQY+ V ALG V+++++
Sbjct: 49 APPTVADAKLKFNGAFKKPLPAIYSTVVQELLVQQHLFRWNKQYQYNEVTALGIVSIFEQ 108
Query: 129 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
++ G P E REA+F A+I AL+EDP+QYR DA +EE ARG++ + P G+
Sbjct: 109 VLGGLPDAE-REAVFDAFINALQEDPKQYRKDAAAMEELARGKSEVA----PDASGDKVQ 163
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTS 242
GK F Y++FFAVGLFRL+EL + +P L +T +G S
Sbjct: 164 QALAAVAAKEGK--FLYTKFFAVGLFRLVELTGSKDPKSL----TTLVKALGLS 211
>gi|427736065|ref|YP_007055609.1| photosystem II biogenesis protein Psp29 [Rivularia sp. PCC 7116]
gi|427371106|gb|AFY55062.1| photosystem II biogenesis protein Psp29 [Rivularia sp. PCC 7116]
Length = 233
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/186 (41%), Positives = 119/186 (63%), Gaps = 14/186 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+ETK F L+ RPI +IY V++EL+V+ HL+ ++YDP++ALG VT +DR M+
Sbjct: 6 TVSETKRTFYSLHTRPINTIYRRVVEELMVEMHLLGVNADFKYDPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK-----EGEV 186
GY EED+E+I+ A I +++EDP++YR DA++LE+ A+ T LV S+ + E+
Sbjct: 66 GYNPEEDKESIYNALIKSVEEDPQKYRHDAKRLEDLAKSTTGKDLVSDLSQRRLANDSEL 125
Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL--EKVRSTFFDNIGTSII 244
+GLL+ IA +S F YSR FA+GL+ LLE +++P ++ EK+R+ I +
Sbjct: 126 QGLLEGIANNSS----FKYSRLFAIGLYTLLE---SSDPEMVKDEKLRNEALKTIAAGLN 178
Query: 245 HGEDPF 250
ED
Sbjct: 179 LSEDKL 184
>gi|302852549|ref|XP_002957794.1| hypothetical protein VOLCADRAFT_107813 [Volvox carteri f.
nagariensis]
gi|300256865|gb|EFJ41122.1| hypothetical protein VOLCADRAFT_107813 [Volvox carteri f.
nagariensis]
Length = 373
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 82/198 (41%), Positives = 115/198 (58%), Gaps = 37/198 (18%)
Query: 70 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
PPTVAETK F + Y +PI SIY+TVLQEL+VQQH MRY + Y Y+ +FALGFV+VY+++
Sbjct: 43 PPTVAETKAKFFEGYSKPIASIYSTVLQELLVQQHFMRYSKDYVYNEIFALGFVSVYEQI 102
Query: 130 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG----------------QTA 173
+E P E R+AIF +Y+ AL EDPE Y+ D++++E+ A QT+
Sbjct: 103 LESLPQSE-RDAIFVSYVKALGEDPEAYKRDSERVEKAAGALSGPDALVPDAEGSDVQTS 161
Query: 174 SSLVEFPSKEGEVE--------------------GLLKDIAERASGKGNFSYSRFFAVGL 213
+ + + + GE+ G D + AS FSY++F A+GL
Sbjct: 162 AYIWAYHQRRGEMRMPWRTRTWGQGSSSLGVCSYGKALDAIKAASAADAFSYNKFVAIGL 221
Query: 214 FRLLELANATEPTVLEKV 231
FRLLEL A EP LE++
Sbjct: 222 FRLLELTGAKEPAALERL 239
>gi|356555139|ref|XP_003545894.1| PREDICTED: LOW QUALITY PROTEIN: protein THYLAKOID FORMATION1,
chloroplastic-like [Glycine max]
Length = 152
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 69/95 (72%), Positives = 78/95 (82%)
Query: 137 EDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAER 196
E R+AIFQAYI AL EDP++YRIDA+KLEEWA Q +SLVEF SKEGE E LKDIA R
Sbjct: 19 EGRDAIFQAYIKALVEDPDKYRIDARKLEEWAGVQNPTSLVEFSSKEGEAEKXLKDIAXR 78
Query: 197 ASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
A GK FSYSRFFAVGLFRL+EL NATEP +L+K+
Sbjct: 79 AGGKXEFSYSRFFAVGLFRLVELENATEPIILDKL 113
>gi|255075137|ref|XP_002501243.1| predicted protein [Micromonas sp. RCC299]
gi|226516507|gb|ACO62501.1| predicted protein [Micromonas sp. RCC299]
Length = 260
Score = 140 bits (352), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 72/164 (43%), Positives = 106/164 (64%), Gaps = 16/164 (9%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A+TK F++ Y PIPSI++ + EL+ QH +RY Y Y + +LGFV+VYD+L E
Sbjct: 51 TLADTKRKFVESYPYPIPSIWSVAVNELLANQHFVRYSTRYSYSKLSSLGFVSVYDQLFE 110
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
G+PS+E++ IF ++ AL EDPE+ R D A+ L +F + G V+ LL
Sbjct: 111 GFPSDEEKAKIFDCFVEALGEDPEKCRKD------------AAELAKFAKEAGGVDALLA 158
Query: 192 D--IAE-RASGKGN-FSYSRFFAVGLFRLLELANATEPTVLEKV 231
+AE +++G+ N F+YSR+ A+GLFR+LEL ATEP LEK+
Sbjct: 159 SPVLAEIKSNGEANKFAYSRYDAIGLFRMLELGGATEPAALEKL 202
>gi|428213026|ref|YP_007086170.1| photosystem II biogenesis protein Psp29 [Oscillatoria acuminata PCC
6304]
gi|428001407|gb|AFY82250.1| photosystem II biogenesis protein Psp29 [Oscillatoria acuminata PCC
6304]
Length = 235
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 77/178 (43%), Positives = 104/178 (58%), Gaps = 11/178 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F ++ RPI SIY V++EL+V+ HL+ + YDP++ALG VT +DR M+
Sbjct: 6 TVSDTKRAFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDFNYDPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF------PSKEGE 185
GY EED+ +IF L+ DP++YR DAQ LEE A + +V P EG+
Sbjct: 66 GYRPEEDKISIFNGICKGLEADPQKYRQDAQWLEEIASRHSGEEMVALLSRSAGPEMEGD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSI 243
+G+L IA K NF YSR FAVGLF LLE A+ E EK R I T++
Sbjct: 126 FQGILGAIA----AKPNFKYSRLFAVGLFTLLEQAD-LELVKNEKSRQEAVQKICTAL 178
>gi|443311308|ref|ZP_21040938.1| photosystem II biogenesis protein Psp29 [Synechocystis sp. PCC
7509]
gi|442778631|gb|ELR88894.1| photosystem II biogenesis protein Psp29 [Synechocystis sp. PCC
7509]
Length = 241
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 69/181 (38%), Positives = 110/181 (60%), Gaps = 3/181 (1%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI +IY V++EL+V+ HL+ + Y+P++ALG VT Y+R M+
Sbjct: 6 TVSDTKRAFYSTHTRPINTIYRRVVEELMVEMHLLSVNADFSYNPIYALGVVTSYERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
GY E D+++IFQA A+ DP QYR DA++L +A+ ++ L+++ S E ++G
Sbjct: 66 GYQPERDKDSIFQALCQAINTDPHQYRQDAERLGSFAKSLSSQDLMQWLSSEKPIDGYSD 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSIIHGEDP 249
L++ ++ + F YSR FA+G+F LLEL++ E E R F I +S+ ED
Sbjct: 126 LQEQIKQIATNQKFKYSRLFAIGVFSLLELSDP-ELVKDETKRVEAFKQISSSLHLPEDK 184
Query: 250 F 250
Sbjct: 185 L 185
>gi|17228142|ref|NP_484690.1| Thf1-like protein [Nostoc sp. PCC 7120]
gi|81772969|sp|Q8YZ41.1|THF1_ANASP RecName: Full=Protein thf1
gi|17129992|dbj|BAB72604.1| all0646 [Nostoc sp. PCC 7120]
Length = 233
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 71/187 (37%), Positives = 113/187 (60%), Gaps = 15/187 (8%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR ME
Sbjct: 6 TVSDTKRTFYALHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFME 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E D+E+IF A A++++P++YR DA++L+ A+ + LV + S ++ +
Sbjct: 66 GYQPERDKESIFSAICQAVEQEPQRYRQDAERLQAVAQSLPVNDLVAWLSQANHLQQDAD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL--EKVRSTFFDNIGTSI 243
++ L+ IA NF YSR FA+GLF LLE +N P ++ EK R+ +I +
Sbjct: 126 LQAQLQAIA----NNSNFKYSRLFAIGLFTLLEQSN---PDLVKDEKQRTEALKSIAAGL 178
Query: 244 IHGEDPF 250
+D F
Sbjct: 179 HLSDDKF 185
>gi|427727466|ref|YP_007073703.1| photosystem II biogenesis protein Psp29 [Nostoc sp. PCC 7524]
gi|427363385|gb|AFY46106.1| photosystem II biogenesis protein Psp29 [Nostoc sp. PCC 7524]
Length = 235
Score = 130 bits (327), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 70/185 (37%), Positives = 112/185 (60%), Gaps = 15/185 (8%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYSLHTRPINTIYRRVVEELMVEMHLLSVNIDFTYNPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E D+E+IF A A++++P++YR DA++L+ A+ S LV + S ++ +
Sbjct: 66 GYRPERDKESIFHAICQAVEQEPQRYRQDAERLQNLAKSLPISDLVAWLSQTTHFNQDPD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL--EKVRSTFFDNIGTSI 243
++ L+ IA NF YSR FA+GLF LLE ++P ++ EK R+ NI +
Sbjct: 126 LQAQLQAIA----NNPNFKYSRLFAIGLFSLLEY---SDPDLVKDEKQRTEALKNIANGL 178
Query: 244 IHGED 248
+D
Sbjct: 179 HLADD 183
>gi|434405136|ref|YP_007148021.1| photosystem II biogenesis protein Psp29 [Cylindrospermum stagnale
PCC 7417]
gi|428259391|gb|AFZ25341.1| photosystem II biogenesis protein Psp29 [Cylindrospermum stagnale
PCC 7417]
Length = 235
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 69/185 (37%), Positives = 111/185 (60%), Gaps = 11/185 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYTLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E D+E+IF A I A++++P++YR DA++L+ A+G L+ + S ++
Sbjct: 66 GYQPERDQESIFNAIIQAVEQEPQRYRQDAERLQAVAQGLPEQDLIAWLSQTTHSDRDAN 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSIIH 245
++ L+ IA NF YSR FA+GLF LLE++ + E +K R+ I T +
Sbjct: 126 LQAQLQAIA----NNSNFKYSRLFAIGLFSLLEVS-SPELVKDDKQRNEALKAIATGLHL 180
Query: 246 GEDPF 250
+D
Sbjct: 181 SDDKL 185
>gi|113474941|ref|YP_721002.1| Thf1-like protein [Trichodesmium erythraeum IMS101]
gi|123056927|sp|Q116P5.1|THF1_TRIEI RecName: Full=Protein thf1
gi|110165989|gb|ABG50529.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 239
Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 94/152 (61%), Gaps = 2/152 (1%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI SIYN V++EL+V+ HL+ Y Y+P +ALG VT +DR M+
Sbjct: 6 TVSDTKKTFYHFHTRPINSIYNRVIEELLVEMHLISVNVDYSYNPFYALGVVTAFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 189
GY +ED+ +IF A I +EDP +YR DA+ LE+ A +AS ++ + SK +
Sbjct: 66 GYSPQEDKTSIFNALIQGQEEDPNKYRSDAKGLEDLAGKISASDILSWICLSKNIDNTQY 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN 221
L+D S F YSR FA+GLF LLE+ +
Sbjct: 126 LQDDLRAISENSKFRYSRLFAIGLFTLLEIVD 157
>gi|158338004|ref|YP_001519180.1| Thf1-like protein [Acaryochloris marina MBIC11017]
gi|189030267|sp|B0C3M8.1|THF1_ACAM1 RecName: Full=Protein thf1
gi|158308245|gb|ABW29862.1| photosystem II biogenesis protein Psb29 [Acaryochloris marina
MBIC11017]
Length = 247
Score = 129 bits (325), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 68/175 (38%), Positives = 107/175 (61%), Gaps = 10/175 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F ++ RP+ S+Y V++EL+V+ HL+R ++YDP+FALG T +DR M+
Sbjct: 6 TVSDTKRAFYSIHTRPVNSVYRRVVEELMVEMHLLRVNEDFRYDPIFALGVTTSFDRFMD 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG-----EV 186
GY E D++AIF A A + DP Q + D Q+L E A+ ++A ++++ ++ E+
Sbjct: 66 GYQPENDKDAIFSAICKAQEADPVQMKKDGQRLTELAQSKSAQEMLDWITQAANSGGDEL 125
Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGT 241
+ L++IA+ F YSR FA+GLF LLEL+ T E+ + F NI T
Sbjct: 126 QWQLRNIAQNPK----FKYSRLFAIGLFTLLELSEGN-ITQDEESLAEFLPNICT 175
>gi|119488459|ref|ZP_01621632.1| hypothetical protein L8106_23815 [Lyngbya sp. PCC 8106]
gi|119455270|gb|EAW36410.1| hypothetical protein L8106_23815 [Lyngbya sp. PCC 8106]
Length = 241
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 63/165 (38%), Positives = 103/165 (62%), Gaps = 5/165 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI S+Y V++EL+V+ HL+ +QYDP++ALG V+ +DR M+
Sbjct: 6 TVSDTKRAFYNTHTRPINSVYRRVIEELMVEMHLLSVNVDFQYDPIYALGVVSAFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 189
GY E D+E+IF I AL++DP++YR +AQ+L+E+A+ + +V + + EV
Sbjct: 66 GYLPESDKESIFHGLINALQDDPQRYRAEAQRLQEFAQTLSVQDIVSWVDVAANSEVHND 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRST 234
L+ ++ + + YSR A+GLF L+E A +P +E +T
Sbjct: 126 LQSSFQKIATNPKYKYSRILAIGLFTLIEQA---DPQAMEDKEAT 167
>gi|354568723|ref|ZP_08987886.1| Protein thf1 [Fischerella sp. JSC-11]
gi|353539977|gb|EHC09457.1| Protein thf1 [Fischerella sp. JSC-11]
Length = 235
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/183 (38%), Positives = 108/183 (59%), Gaps = 11/183 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P+FALG VT +DR M+
Sbjct: 6 TVSDTKRTFHTLHTRPINTIYRRVVEELMVEMHLLAVNVDFSYNPIFALGVVTSFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E D+E+IF A + A++ DP+ YR DAQ+L+E A+ L+ S ++ +
Sbjct: 66 GYQPESDKESIFNALLRAIEADPQIYRQDAQRLQELAKSLPPQDLIAALSLQTQLNRDTD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSIIH 245
++ L+ IA F YSR FA+GLF LLEL++ E EK R+ +I +
Sbjct: 126 LQSHLQAIA----SNPKFKYSRLFAIGLFSLLELSDP-ELVKDEKQRTEALKSIAAGLHI 180
Query: 246 GED 248
+D
Sbjct: 181 SDD 183
>gi|440683252|ref|YP_007158047.1| Protein thf1 [Anabaena cylindrica PCC 7122]
gi|428680371|gb|AFZ59137.1| Protein thf1 [Anabaena cylindrica PCC 7122]
Length = 235
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/185 (38%), Positives = 107/185 (57%), Gaps = 11/185 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ Y Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNVDYSYNPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E D+E+IF A A+++D ++YR DA +L+ A+ L+ + S K+ +
Sbjct: 66 GYLPERDQESIFNALCQAVEQDQQRYRQDATRLQAIAQSLPVQDLIAWVSQTTHLDKDAD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSIIH 245
++ L+ IA NF YSR FA+GLF LLELA+ E EK R+ I +
Sbjct: 126 LQAQLQAIAH----NPNFKYSRLFAIGLFSLLELADP-ELVKDEKQRNEALKAIAQGLHL 180
Query: 246 GEDPF 250
ED
Sbjct: 181 SEDKL 185
>gi|428210102|ref|YP_007094455.1| photosystem II biogenesis protein Psp29 [Chroococcidiopsis
thermalis PCC 7203]
gi|428012023|gb|AFY90586.1| photosystem II biogenesis protein Psp29 [Chroococcidiopsis
thermalis PCC 7203]
Length = 250
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/181 (36%), Positives = 109/181 (60%), Gaps = 3/181 (1%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK NF + RPI +IY V++EL+V+ HL+ ++YDP++ALG VT ++R M+
Sbjct: 6 TVSDTKRNFYNQHTRPINTIYRRVVEELMVEMHLLSVNADFRYDPIYALGVVTAFERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
GY E D+E IF+A +++++P++YR DA +L + + +A L ++ + ++G
Sbjct: 66 GYQPERDKEPIFEALCQSIEDNPQRYRQDADRLRQLLQNVSAQQLFDWIDGKASLQGAED 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSIIHGEDP 249
L+ + + F YSR FA+G+F LLELA+A E EK R + T++ ED
Sbjct: 126 LQAQMQAIAQNSKFKYSRLFAIGVFTLLELADA-ELVKDEKQRVEALKQVATALHVPEDK 184
Query: 250 F 250
Sbjct: 185 L 185
>gi|411116557|ref|ZP_11389044.1| photosystem II biogenesis protein Psp29 [Oscillatoriales
cyanobacterium JSC-12]
gi|410712660|gb|EKQ70161.1| photosystem II biogenesis protein Psp29 [Oscillatoriales
cyanobacterium JSC-12]
Length = 246
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 96/154 (62%), Gaps = 4/154 (2%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F ++ RPI SIY V++EL+V+ HL+ Y Y+P++ALG VT ++R M+
Sbjct: 6 TVSDTKRAFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDYSYNPIYALGVVTSFERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E D+ IF A AL++DP +YR DAQ+L ++A+ ++A +V + + G
Sbjct: 66 GYRPENDKAPIFDAICQALQDDPNRYRHDAQRLNDFAKQKSAKDIVTWLEQAATSYG-GD 124
Query: 192 DIAERASGKGN---FSYSRFFAVGLFRLLELANA 222
D+ E+ N F YSR FA+GLF L E A+A
Sbjct: 125 DLQEQVKAIANNPKFKYSRLFAIGLFTLFETADA 158
>gi|75910773|ref|YP_325069.1| Thf1-like protein [Anabaena variabilis ATCC 29413]
gi|97202708|sp|Q3M4B2.1|THF1_ANAVT RecName: Full=Protein thf1
gi|75704498|gb|ABA24174.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 233
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 69/185 (37%), Positives = 111/185 (60%), Gaps = 15/185 (8%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYALHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E D+E+IF A A++++P++YR DA++L+ A+ + LV + S ++ +
Sbjct: 66 GYQPERDKESIFSAICQAVEQEPQRYRQDAERLKAVAQSLPVNDLVAWLSQANHLQQDAD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL--EKVRSTFFDNIGTSI 243
++ L+ IA NF YSR FA+GLF LLE +N P ++ EK R+ I +
Sbjct: 126 LQAQLQAIA----SNPNFKYSRLFAIGLFTLLEQSN---PDLVKDEKQRTEALKTIAAGL 178
Query: 244 IHGED 248
+D
Sbjct: 179 HLSDD 183
>gi|186685250|ref|YP_001868446.1| Thf1-like protein [Nostoc punctiforme PCC 73102]
gi|254784144|sp|B2J353.1|THF1_NOSP7 RecName: Full=Protein thf1
gi|186467702|gb|ACC83503.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 235
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 96/153 (62%), Gaps = 10/153 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV------EFPSKEGE 185
GY E D+E+IF A A+++DP+ YR DA++L+ A+G L+ + ++ +
Sbjct: 66 GYEPERDQESIFNALCRAIEQDPQHYRQDAERLQAIAKGLPVKDLIGWLGQTTYLDRDAD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLE 218
++ L+ IA NF Y+R FA+G+F LLE
Sbjct: 126 LQAQLQAIA----NNPNFKYNRLFAIGVFSLLE 154
>gi|334116992|ref|ZP_08491084.1| Protein thf1 [Microcoleus vaginatus FGP-2]
gi|333461812|gb|EGK90417.1| Protein thf1 [Microcoleus vaginatus FGP-2]
Length = 237
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 95/156 (60%), Gaps = 10/156 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F ++ RPI SIY V++EL+V+ HL+ +QYDP++ALG VT +DR M
Sbjct: 6 TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSANADFQYDPIYALGVVTAFDRFML 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E DR +IF A +L++DP++Y+ DAQ+LE A + L+ + + E
Sbjct: 66 GYAPEADRVSIFNALCKSLEDDPDRYKQDAQRLESLADRLSGQELLSWLDRSTSFE---- 121
Query: 192 DIAERASGKG------NFSYSRFFAVGLFRLLELAN 221
D A+ + G F YSR FA+GLF LLE A+
Sbjct: 122 DTADLQASLGAIASNPQFKYSRLFAIGLFSLLEKAD 157
>gi|145344894|ref|XP_001416959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577185|gb|ABO95252.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 203
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/165 (39%), Positives = 101/165 (61%), Gaps = 18/165 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK FL+ Y PIPS+++TV QEL+VQ H +Y +Y + +LGFV+V+D+L E
Sbjct: 1 TVSDTKAKFLQAYPYPIPSVWSTVTQELLVQGHFAKYNAKSEYSELASLGFVSVFDQLYE 60
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
G+PSE ++ IF A++ AL ED + R DA+ +L F + G V+GL
Sbjct: 61 GFPSETEKVKIFNAFLGALGEDAAKTRADAE------------ALGAFAASAGGVDGLSA 108
Query: 192 D-----IAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
+ +A + S + Y+++ A+G+FR+LELA AT+P LE +
Sbjct: 109 NPIFATMAAK-SAENKLMYTKYIAIGIFRMLELAKATDPKALEAL 152
>gi|427707894|ref|YP_007050271.1| Protein thf1 [Nostoc sp. PCC 7107]
gi|427360399|gb|AFY43121.1| Protein thf1 [Nostoc sp. PCC 7107]
Length = 235
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 109/183 (59%), Gaps = 7/183 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYSLHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV--EGL 189
GY E D+E+IFQA A++++ ++YR DA++L+ A+ A+ L+ + S+ + +
Sbjct: 66 GYQPERDKESIFQAICQAVEQEVQRYRQDAERLQALAKSLAANDLIAWLSQTNHLNQDPD 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL--EKVRSTFFDNIGTSIIHGE 247
L+ + + F Y+R FA+GLF LLE ++P ++ EK R+ I + E
Sbjct: 126 LQSQLQAIANNSQFKYNRLFAIGLFSLLE---QSDPDLVKDEKQRTDAIKTIAAGLHLSE 182
Query: 248 DPF 250
D
Sbjct: 183 DKL 185
>gi|220910509|ref|YP_002485820.1| Thf1-like protein [Cyanothece sp. PCC 7425]
gi|254784141|sp|B8HQ62.1|THF1_CYAP4 RecName: Full=Protein thf1
gi|219867120|gb|ACL47459.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7425]
Length = 236
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 69/152 (45%), Positives = 95/152 (62%), Gaps = 1/152 (0%)
Query: 70 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
P TV++TK F + RPI SIY V++EL+V+ HL+R +T+ YDPVFALG VT ++R
Sbjct: 4 PRTVSDTKRAFYHNHARPINSIYRRVVEELLVEIHLLRVNQTFVYDPVFALGVVTTFERF 63
Query: 130 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 189
M+GY D+ +IF A A + DP+Q + DAQ+L RGQ+ SL+++ S + G
Sbjct: 64 MQGYHPPADQTSIFNAICLAQELDPQQVQQDAQELLGRVRGQSLESLLDWISTAASLGGD 123
Query: 190 LKDIAERA-SGKGNFSYSRFFAVGLFRLLELA 220
+ RA + F YSR FAVGLF LLE A
Sbjct: 124 EQQNRLRAIASNPTFKYSRLFAVGLFTLLEQA 155
>gi|428223566|ref|YP_007107663.1| photosystem II biogenesis protein Psp29 [Geitlerinema sp. PCC 7407]
gi|427983467|gb|AFY64611.1| photosystem II biogenesis protein Psp29 [Geitlerinema sp. PCC 7407]
Length = 239
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 98/153 (64%), Gaps = 4/153 (2%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F ++ RPI SIY V++EL+V+ HL+ ++YDP +ALG VT Y+R M+
Sbjct: 6 TVSDTKRAFYSMHTRPINSIYRRVVEELMVEMHLLSVNVDFRYDPFYALGVVTSYERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE---GEVEG 188
GY E+D+ +IF++ A + DP YR DA++L E+ + +A L+ + S E G+ +G
Sbjct: 66 GYRPEQDKTSIFESLCRANEGDPGHYRHDAERLAEFTKNLSAEELISWLSLETPRGDDQG 125
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN 221
L + + + + F YSR FA+GLF L+E AN
Sbjct: 126 LGESL-QAIANHSQFKYSRLFAIGLFTLVEQAN 157
>gi|428317172|ref|YP_007115054.1| Protein thf1 [Oscillatoria nigro-viridis PCC 7112]
gi|428240852|gb|AFZ06638.1| Protein thf1 [Oscillatoria nigro-viridis PCC 7112]
Length = 237
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 95/156 (60%), Gaps = 10/156 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F ++ RPI SIY V++EL+V+ HL+ +QYDP++ALG VT +DR M
Sbjct: 6 TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSANADFQYDPIYALGVVTAFDRFML 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E DR +IF A ++++DP++Y+ DAQ+LE A + L+ + + E
Sbjct: 66 GYVPEADRVSIFNALCKSVEDDPDRYKQDAQRLESLADRLSGQELLSWLDRSTSFE---- 121
Query: 192 DIAERASGKG------NFSYSRFFAVGLFRLLELAN 221
D A+ + G F YSR FA+GLF LLE A+
Sbjct: 122 DTADLQASLGAIASNPQFKYSRLFAIGLFSLLEKAD 157
>gi|414076688|ref|YP_006996006.1| photosystem II biogenesis protein Psp29 [Anabaena sp. 90]
gi|413970104|gb|AFW94193.1| photosystem II biogenesis protein Psp29 [Anabaena sp. 90]
Length = 223
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 61/148 (41%), Positives = 91/148 (61%), Gaps = 2/148 (1%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + YD ++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYTLHTRPINTIYRRVVEELMVEMHLLSVNVDFSYDAIYALGVVTTFDRFMD 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV--EGL 189
GY E+D+E+IF+A A+++DP+ YR DA +L+ A A L+ S+ + +
Sbjct: 66 GYQPEQDKESIFRAICQAVEQDPQSYRQDASRLQALAASLPAKDLIASLSQASPLNQDAD 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLL 217
L+ E + NF YSR F VGLF LL
Sbjct: 126 LQKQLEAVAANSNFKYSRLFGVGLFALL 153
>gi|427719034|ref|YP_007067028.1| Protein thf1 [Calothrix sp. PCC 7507]
gi|427351470|gb|AFY34194.1| Protein thf1 [Calothrix sp. PCC 7507]
Length = 235
Score = 123 bits (308), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 67/178 (37%), Positives = 107/178 (60%), Gaps = 11/178 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+ ++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNSIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E D+E+IF A A++++P++YR DA++L A+ A+ L+ + S ++ +
Sbjct: 66 GYLPERDQESIFNALCHAVEQEPQRYRQDAERLRVLAKSLPANDLIAWLSQTTHLDQDAD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSI 243
++ L+ IA NF YSR A+GLF LLEL++ E EK R+ I T +
Sbjct: 126 LQAQLQAIA----NNPNFKYSRLLAIGLFTLLELSDP-ELVKDEKQRNEALKAIATGL 178
>gi|298491449|ref|YP_003721626.1| photosystem II biogenesis protein Psp29 ['Nostoc azollae' 0708]
gi|298233367|gb|ADI64503.1| photosystem II biogenesis protein Psp29 ['Nostoc azollae' 0708]
Length = 235
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/185 (36%), Positives = 109/185 (58%), Gaps = 11/185 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ ++Y+ ++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNVDFRYNSIYALGVVTAFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E+D+ +IF A I A+++DP++YR DA +L+ A+ L+ + S ++ +
Sbjct: 66 GYQPEQDQASIFNAIIQAVEQDPQRYRQDAARLQVVAQSLLTKDLISWLSQTTYLDQDRD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSIIH 245
++ L+ IA A F YSR FA+GLF LLE+ + +E EK R+ I +
Sbjct: 126 LQAQLQAIANNAE----FKYSRLFAIGLFSLLEMVD-SELVKDEKQRNQALKAIAQGLHL 180
Query: 246 GEDPF 250
E+
Sbjct: 181 SEEKL 185
>gi|428775508|ref|YP_007167295.1| photosystem II biogenesis protein Psp29 [Halothece sp. PCC 7418]
gi|428689787|gb|AFZ43081.1| photosystem II biogenesis protein Psp29 [Halothece sp. PCC 7418]
Length = 243
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/175 (37%), Positives = 104/175 (59%), Gaps = 7/175 (4%)
Query: 68 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 127
D T++ETK F L+ RP+ SIY V++EL+V+ HL+ ++YDP +ALG VTV+D
Sbjct: 2 DTLRTLSETKRTFYTLHTRPLNSIYRRVIEELLVEMHLLTVNIDFKYDPFYALGVVTVFD 61
Query: 128 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGE 185
M+GY E+D+E+IF A A++ DP+QYR DA+K++ A + ++ + +K +
Sbjct: 62 TFMQGYQPEKDKESIFNAICKAVESDPQQYRQDAEKVKSIADQASGEAVTAWLCEAKPLD 121
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL--EKVRSTFFDN 238
G L DI + F YSR F +G++ +LE AN P ++ +K R +N
Sbjct: 122 QAGDLNDILQGIRENPRFKYSRLFIIGIYTVLEKAN---PEIVNDDKKREEVLNN 173
>gi|376001810|ref|ZP_09779664.1| Putative thylakoid formation protein, Thf1-like [Arthrospira sp.
PCC 8005]
gi|375329721|emb|CCE15417.1| Putative thylakoid formation protein, Thf1-like [Arthrospira sp.
PCC 8005]
Length = 243
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 64/168 (38%), Positives = 96/168 (57%), Gaps = 2/168 (1%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F ++ RPI SIY V++EL+V+ HL+ ++YDP++ALG VT +DR M+
Sbjct: 6 TVSDTKRAFYNIHTRPINSIYRRVVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 189
GY E D+ +I+ A I A + DP QYR DA LE A + L E ++E +
Sbjct: 66 GYIPEADKLSIWAALIMAQESDPNQYRADATALEAQAATLSVKDLTERAKIAQESSGDDP 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFD 237
L+ + F YSR FA+GL+ LLE ++ T E +++ D
Sbjct: 126 LQSCFHAIANNPKFKYSRLFAIGLYTLLEKSDVTAAQDSEGLKNIIID 173
>gi|434395245|ref|YP_007130192.1| Protein thf1 [Gloeocapsa sp. PCC 7428]
gi|428267086|gb|AFZ33032.1| Protein thf1 [Gloeocapsa sp. PCC 7428]
Length = 251
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 66/181 (36%), Positives = 108/181 (59%), Gaps = 7/181 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI +IY V++EL+V+ HL+ + Y+P++ALG VT ++R M+
Sbjct: 6 TVSDTKRAFYTSHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTAFERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE--GL 189
GY E D+E+IF A A++ DP++YR DA++L +A+ + L+ + E E G
Sbjct: 66 GYQPERDKESIFNALCQAVESDPQRYRQDAERLGLFAKNTSTPELIAWLRGETHKEEVGD 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL--EKVRSTFFDNIGTSIIHGE 247
L+ + + +F YSR FA+G+F LLEL ++P ++ EK R +I ++ E
Sbjct: 126 LQQQIQAIAHNPHFKYSRLFAIGVFGLLEL---SDPALVKDEKQRVDALKSIAATLNISE 182
Query: 248 D 248
D
Sbjct: 183 D 183
>gi|428304539|ref|YP_007141364.1| Protein thf1 [Crinalium epipsammum PCC 9333]
gi|428246074|gb|AFZ11854.1| Protein thf1 [Crinalium epipsammum PCC 9333]
Length = 243
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 106/176 (60%), Gaps = 7/176 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + RPI SIY V++EL+V+ HL+ + Y P++ALG VT Y++ M+
Sbjct: 6 TVSDTKRDFYNNHTRPINSIYRRVVEELMVEMHLLSVNVDFAYHPIYALGVVTSYEKFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
GY E DR++IF A + A+ ED ++Y+ DA++L+ A + L+++ V+G
Sbjct: 66 GYRPERDRDSIFDALVGAVGEDSQRYKQDAEQLKALAGRLSGKELIDWIVSPTAVDGAGS 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL--EKVRSTFFDNIGTSI 243
L D + F YSR FA+GL+ LLE+ ++P+++ EK R + +G S+
Sbjct: 126 LPDQMRAIANNPQFKYSRLFAIGLYTLLEV---SDPSLVKDEKERLDALNQVGQSL 178
>gi|428778484|ref|YP_007170270.1| photosystem II biogenesis protein Psp29 [Dactylococcopsis salina
PCC 8305]
gi|428692763|gb|AFZ48913.1| photosystem II biogenesis protein Psp29 [Dactylococcopsis salina
PCC 8305]
Length = 240
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 58/156 (37%), Positives = 93/156 (59%), Gaps = 2/156 (1%)
Query: 68 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 127
D T++ETK F + RP+ SIY V++EL+V+ HL+ ++YDP++ALG TV+D
Sbjct: 2 DTLRTLSETKRTFYTQHTRPLNSIYRRVIEELLVEMHLLSVNTDFKYDPIYALGVTTVFD 61
Query: 128 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 187
M+GY E+++E+IF A A++ DP++YR DA+KL+ A + + S+ ++
Sbjct: 62 TFMQGYQPEKEKESIFNAICQAVENDPQKYRQDAEKLKSIAANHSGEEVTACLSELKPLD 121
Query: 188 GL--LKDIAERASGKGNFSYSRFFAVGLFRLLELAN 221
G L + + F YSR F +GL+ +LE AN
Sbjct: 122 GAEELTKVLQEIKNNSRFKYSRLFIIGLYTILETAN 157
>gi|452819272|gb|EME26335.1| thylakoid protein [Galdieria sulphuraria]
Length = 316
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 70/187 (37%), Positives = 101/187 (54%), Gaps = 10/187 (5%)
Query: 36 FRFRTSLFCHCVRFRASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTV 95
+R + + C + SS+S + T TVAET +FLK ++ PIPSIY T+
Sbjct: 54 YRTKRRMECRSIENLKESSAS----NSLPTQNGTVRTVAETISDFLKHFRHPIPSIYRTI 109
Query: 96 LQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPE 155
+QEL+V HL R +QYDPVFALG+ V + YP E++E +F + AL D E
Sbjct: 110 VQELLVTTHLARVAVGFQYDPVFALGYQMVTQVFFKSYPKVEEKEKLFDSMCKALLLDYE 169
Query: 156 QYRIDAQKLEEWARGQTASSLVEFPSKEGE--VEGLLKDIAERASGKGNFSYSRFFAVGL 213
+ + DA LEEW R +T ++ + G+ + LL IA+ F YSR F +GL
Sbjct: 170 RMKKDASVLEEWTRSRTEREILLAIEEGGDDPLANLLHSIAQ----NDGFVYSRLFGLGL 225
Query: 214 FRLLELA 220
R++EL
Sbjct: 226 VRMMELC 232
>gi|424513129|emb|CCO66713.1| Thf1-like protein [Bathycoccus prasinos]
Length = 222
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/165 (38%), Positives = 98/165 (59%), Gaps = 12/165 (7%)
Query: 70 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
P TVA+TK F K Y P+PSI+ TVLQEL+V H YQ++ + +LGFV+V+D+L
Sbjct: 4 PATVADTKAKFTKGYPYPLPSIWATVLQELLVGMHFTVTSSKYQHEEMRSLGFVSVFDQL 63
Query: 130 MEGYPSEED--REAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA-SSLVEFPSKEGEV 186
EGYP+E+ +E IF ++ AL ED +++R DA+KL +A QT+ ++ P
Sbjct: 64 FEGYPTEDPNAKEKIFSTFMEALGEDSKKWRADAEKLSAFATEQTSIDGIIANP------ 117
Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV 231
+ + + K + Y +F A+G FR LE++ T P L+K+
Sbjct: 118 --MFASMKSKVESK-SLVYDKFIAIGFFRALEMSKQTSPENLKKI 159
>gi|56750022|ref|YP_170723.1| Thf1-like protein [Synechococcus elongatus PCC 6301]
gi|81300364|ref|YP_400572.1| Thf1-like protein [Synechococcus elongatus PCC 7942]
gi|56684981|dbj|BAD78203.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169245|gb|ABB57585.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 280
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/186 (35%), Positives = 103/186 (55%), Gaps = 13/186 (6%)
Query: 43 FCHCVRFRASSSSSRMIIQCMSTATDVP-----PTVAETKMNFLKLYKRPIPSIYNTVLQ 97
F C+ RA ++ + C S+ VP PTV+++K F Y RPI +Y V++
Sbjct: 3 FGRCLAKRAIAAHN-----CSSSCASVPNVTSVPTVSDSKRAFYAAYPRPINPLYRRVVE 57
Query: 98 ELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQY 157
EL+V+ HL+ ++ YDP+FALG VT +D M Y E +F A A++++PEQY
Sbjct: 58 ELLVEIHLLSVNTSFVYDPLFALGVVTAFDSFMSSYRPIEAVGPLFTALTQAVRQNPEQY 117
Query: 158 RIDAQKLEEWARGQTASSLVEFPSKE---GEVEGLLKDIAERASGKGNFSYSRFFAVGLF 214
R DA + E RG + ++ ++ ++ G L++ + +G+ F YSR FA+GLF
Sbjct: 118 RHDANAIAEQVRGVGSDTIRQWLTEAEALGNAPELVRSSFQAIAGRSEFKYSRLFAIGLF 177
Query: 215 RLLELA 220
LLE A
Sbjct: 178 SLLETA 183
>gi|443317266|ref|ZP_21046682.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 6406]
gi|442783151|gb|ELR93075.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 6406]
Length = 251
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 89/153 (58%), Gaps = 1/153 (0%)
Query: 71 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
PTV++TK F + RPI S+Y V++EL+V+ HL+R + YDPV+ALG VT +DR M
Sbjct: 5 PTVSDTKRAFYSYHNRPIASVYRRVIEELMVEMHLLRVNEDFVYDPVYALGIVTTFDRFM 64
Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE-FPSKEGEVEGL 189
GY E D +IF A A +QYR DA+ + G++ +L S+ E L
Sbjct: 65 AGYRPEADEASIFAALCQANAGTADQYRRDAEVMVAEVSGRSLDALKAILISRSAEGADL 124
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA 222
LK + + + + F YSR FA+GL+ L+E +A
Sbjct: 125 LKGVLQGIADRDRFKYSRAFAIGLYTLIETVDA 157
>gi|291567260|dbj|BAI89532.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 243
Score = 117 bits (292), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 62/168 (36%), Positives = 94/168 (55%), Gaps = 2/168 (1%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F ++ RPI SIY V++EL+V+ HL+ ++YDP++ALG VT +DR M+
Sbjct: 6 TVSDTKRAFYHIHTRPINSIYRRVVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 189
GY E D+ +I+ A I A + DP QYR DA LE L + ++E +
Sbjct: 66 GYIPEADKLSIWAALIGAQESDPNQYRADATALEAQVASLAVKDLTDKAKMAQESSGDDP 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFD 237
L+ + F YSR A+GL+ LLE ++AT E +++ D
Sbjct: 126 LQSCFHAIANNPKFKYSRLLAIGLYTLLEKSDATAAQDSEGLKTILSD 173
>gi|434398071|ref|YP_007132075.1| Protein thf1 [Stanieria cyanosphaera PCC 7437]
gi|428269168|gb|AFZ35109.1| Protein thf1 [Stanieria cyanosphaera PCC 7437]
Length = 238
Score = 117 bits (292), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 68/189 (35%), Positives = 112/189 (59%), Gaps = 9/189 (4%)
Query: 60 IQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFA 119
I + TD TV++ K +F + + RPI S+Y V++EL+V+ HL+ ++ DP++
Sbjct: 4 IAVGNKTTDNIRTVSDAKRDFYQHHTRPINSVYRRVVEELLVEMHLLSVNVDFKSDPIYY 63
Query: 120 LGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
LG VT ++RLM+GY E+D+E+IF A A+ EDPE+ R A L A+ ++ LV +
Sbjct: 64 LGVVTSFERLMQGYRPEQDKESIFNALCRAVGEDPERNRAQAGSLLNLAKNKSPQELVAW 123
Query: 180 PSKEGEVEG---LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKV--RST 234
S+ +E +++ I AS +F YSR FA+GL+ LLE ++P +L+ V R+
Sbjct: 124 LSEPTPLENYHDIIEPIKAIASNP-HFKYSRLFAIGLYTLLE---ESDPEILKDVSKRNE 179
Query: 235 FFDNIGTSI 243
++I T +
Sbjct: 180 ILESIATQL 188
>gi|332705256|ref|ZP_08425337.1| photosystem II biogenesis protein Psp29 [Moorea producens 3L]
gi|332355999|gb|EGJ35458.1| photosystem II biogenesis protein Psp29 [Moorea producens 3L]
Length = 257
Score = 116 bits (291), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 88/150 (58%), Gaps = 4/150 (2%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + RPI SIY V++EL+V+ HL+ + YDP++ LG VT +DR M+
Sbjct: 6 TVSDTKRDFYTYHTRPINSIYRRVVEELMVEMHLLSVNVDFNYDPIYGLGVVTCFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLE---EWARGQTASSLVEFPSKEGEVEG 188
Y E D+E+IF A A+ + +QY+ DAQ+L+ + GQ S + P+ E
Sbjct: 66 SYQPENDKESIFNALCQAVGGEAQQYQEDAQRLKTSVDSMSGQDLISWLSSPTSENGSGD 125
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLE 218
L IA A F YSR FA+GLF LLE
Sbjct: 126 LATTIAAIAQN-SQFKYSRLFAIGLFSLLE 154
>gi|443328840|ref|ZP_21057433.1| photosystem II biogenesis protein Psp29 [Xenococcus sp. PCC 7305]
gi|442791576|gb|ELS01070.1| photosystem II biogenesis protein Psp29 [Xenococcus sp. PCC 7305]
Length = 270
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/184 (35%), Positives = 109/184 (59%), Gaps = 6/184 (3%)
Query: 63 MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
++T T++ TV++TK +F Y +PI S+Y +++EL+V+ HL+ ++ DP+F LG
Sbjct: 4 IATVTNLK-TVSDTKRSFYNNYNKPINSVYRRIVEELLVEMHLLSVNADFKSDPIFYLGV 62
Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
V+ ++RLM+GY ++D+ AIF A A+ DPE YR A L A+ ++ L+ + +
Sbjct: 63 VSCFERLMQGYQPDQDKGAIFNALCRAVDGDPESYRAQAGNLLAIAKEKSGEELIAWLGE 122
Query: 183 EGEVEGLLKDIAE---RASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNI 239
+ G ++IAE + NF YSR F +GL+ LLE A+A +K R+ F+NI
Sbjct: 123 PTAIAG-AENIAETIKSIAANANFKYSRPFGIGLYTLLEEADAKLLEDSDK-RNEIFENI 180
Query: 240 GTSI 243
++
Sbjct: 181 AKTL 184
>gi|434388267|ref|YP_007098878.1| photosystem II biogenesis protein Psp29 [Chamaesiphon minutus PCC
6605]
gi|428019257|gb|AFY95351.1| photosystem II biogenesis protein Psp29 [Chamaesiphon minutus PCC
6605]
Length = 234
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/152 (39%), Positives = 91/152 (59%), Gaps = 4/152 (2%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK NF + RPI SIY V++EL+V+ HL+ + YDP++ALG V+ +DR M
Sbjct: 6 TVSDTKRNFYSQHTRPINSIYRRVVEELMVEMHLLSTNVDFAYDPIYALGVVSSFDRFMT 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF---PSKEGEVEG 188
Y E D+++IF A ++ + +QYR DA +EE+AR S ++++ P+ +G
Sbjct: 66 SYRPEADKQSIFVALCESMGGNAQQYRTDATAVEEFARSMQGSDIIDWIAHPTADGMGAQ 125
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELA 220
L + AS F YSR F +GLF +LE A
Sbjct: 126 LATTLQSIASNP-KFKYSRLFGIGLFTILEQA 156
>gi|428313474|ref|YP_007124451.1| photosystem II biogenesis protein Psp29 [Microcoleus sp. PCC 7113]
gi|428255086|gb|AFZ21045.1| photosystem II biogenesis protein Psp29 [Microcoleus sp. PCC 7113]
Length = 241
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 67/178 (37%), Positives = 100/178 (56%), Gaps = 11/178 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + RP+ SI+ V++EL+V+ HL+ + Y+P++ALG VT ++R ME
Sbjct: 6 TVSDTKRDFYNHHTRPVNSIFRRVVEELMVEMHLLSVNVDFHYEPIYALGVVTSFNRFME 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE------GE 185
GY E D+ +IF A ++ +PEQY+ DAQ LE A T LV + S G+
Sbjct: 66 GYRPERDKASIFDALCHSVGNNPEQYKQDAQWLESMAERVTGEELVSWLSAPRPQDTLGD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSI 243
+ + IAE F YSR FA+GL+ LLE A+ +E EK R+ I +
Sbjct: 126 LYAAVAAIAENPK----FKYSRLFAIGLYTLLEKAD-SELVQDEKRRTEALKKISDGL 178
>gi|300866330|ref|ZP_07111033.1| Protein thf1 [Oscillatoria sp. PCC 6506]
gi|300335673|emb|CBN56193.1| Protein thf1 [Oscillatoria sp. PCC 6506]
Length = 267
Score = 114 bits (285), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 94/152 (61%), Gaps = 2/152 (1%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F ++ RPI SIY V++EL+V+ HL+ ++Y+P++ALG VT ++R M+
Sbjct: 36 TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDFRYNPIYALGVVTAFERFMQ 95
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 189
GY E+D+ +IF AL +DP++Y+ DA++LE A + L+ + S E G
Sbjct: 96 GYLPEQDKVSIFNGLCQALGDDPQRYQQDARRLEGLASRVSILDLLSWLEGSTSFEDTGD 155
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN 221
L+ + F YSR FA+GLF LLE+ +
Sbjct: 156 LQASITAIATNSKFKYSRLFAIGLFALLEIVD 187
>gi|97202823|sp|Q5N664.2|THF1_SYNP6 RecName: Full=Protein thf1
gi|97202830|sp|Q31MY4.2|THF1_SYNE7 RecName: Full=Protein thf1
Length = 254
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/153 (37%), Positives = 90/153 (58%), Gaps = 3/153 (1%)
Query: 71 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
PTV+++K F Y RPI +Y V++EL+V+ HL+ ++ YDP+FALG VT +D M
Sbjct: 5 PTVSDSKRAFYAAYPRPINPLYRRVVEELLVEIHLLSVNTSFVYDPLFALGVVTAFDSFM 64
Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE---GEVE 187
Y E +F A A++++PEQYR DA + E RG + ++ ++ ++ G
Sbjct: 65 SSYRPIEAVGPLFTALTQAVRQNPEQYRHDANAIAEQVRGVGSDTIRQWLTEAEALGNAP 124
Query: 188 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELA 220
L++ + +G+ F YSR FA+GLF LLE A
Sbjct: 125 ELVRSSFQAIAGRSEFKYSRLFAIGLFSLLETA 157
>gi|308801781|ref|XP_003078204.1| inositol phosphatase-like protein (ISS) [Ostreococcus tauri]
gi|116056655|emb|CAL52944.1| inositol phosphatase-like protein (ISS) [Ostreococcus tauri]
Length = 657
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 91/145 (62%), Gaps = 16/145 (11%)
Query: 91 IYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITAL 150
++ TV+QEL+VQ H +Y + +Y+ + +LGFV+VYD+L EG+PSEE++ IF A++ AL
Sbjct: 79 VWATVVQELLVQGHFQKYNKKSEYNELASLGFVSVYDQLFEGFPSEEEKGKIFNAFLGAL 138
Query: 151 KEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKD--IAERA--SGKGNFSYS 206
ED + R DA+ +L F + VEGL ++ A+ A S +G Y+
Sbjct: 139 DEDAVRTRADAE------------TLGAFATSANGVEGLKENAIFAKLAAKSAEGTLLYT 186
Query: 207 RFFAVGLFRLLELANATEPTVLEKV 231
++ A+G+FR+LELA AT+P LE +
Sbjct: 187 KYIAIGMFRMLELAKATDPAALEAL 211
>gi|428302138|ref|YP_007140444.1| Protein thf1 [Calothrix sp. PCC 6303]
gi|428238682|gb|AFZ04472.1| Protein thf1 [Calothrix sp. PCC 6303]
Length = 235
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 101/185 (54%), Gaps = 11/185 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F ++ RPI +IY V++EL+V+ HL+ + Y+P++ALG T ++R M+
Sbjct: 6 TVSDTKKTFYSIHTRPINTIYRRVVEELMVEMHLLSVNTDFTYNPIYALGVATAFERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK------EGE 185
GY E+D+E +F A +++ D ++ + +A L++ A + L+ S+ GE
Sbjct: 66 GYDPEKDKEQLFHALCQSVEIDTQKIKQEAHSLKDVAASMSVGDLISCLSRAKRFDNAGE 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSIIH 245
++ L IA F YSR FA+GLF LLE A + E EK R+ +I +
Sbjct: 126 LQNQLDAIA----SNPKFKYSRLFAIGLFSLLEAA-SPETVKDEKQRNDALVSIAKGLNI 180
Query: 246 GEDPF 250
ED
Sbjct: 181 SEDKL 185
>gi|218245998|ref|YP_002371369.1| Thf1-like protein [Cyanothece sp. PCC 8801]
gi|254784143|sp|B7K277.1|THF1_CYAP8 RecName: Full=Protein thf1
gi|218166476|gb|ACK65213.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 8801]
Length = 235
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/167 (35%), Positives = 96/167 (57%), Gaps = 3/167 (1%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + RPI SIY ++EL+V+ HL+ ++YDP++ALG V + + M+
Sbjct: 6 TVSDTKRDFYNHHTRPINSIYRRFIEELLVEMHLLCVNIDFRYDPIYALGVVASFQQFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEV-EG 188
GY EED+ +IF A A+ D E+YR +AQ L +G + S L+ ++ GE EG
Sbjct: 66 GYRPEEDKNSIFSALCQAVGGDGEKYRHEAQTLLNQVKGMSVSDLIAMGNSARTGEPGEG 125
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTF 235
+L + + + F YSR FA+GL+ ++ +A +K TF
Sbjct: 126 MLYNTLQAIAKNPQFKYSRLFAIGLYTMVMEIDADLLKEQDKRNETF 172
>gi|257059049|ref|YP_003136937.1| Thf1-like protein [Cyanothece sp. PCC 8802]
gi|256589215|gb|ACV00102.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 8802]
Length = 235
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/167 (35%), Positives = 96/167 (57%), Gaps = 3/167 (1%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + RPI SIY ++EL+V+ HL+ ++YDP++ALG V + + M+
Sbjct: 6 TVSDTKRDFYTHHTRPINSIYRRFIEELLVEMHLLCVNIDFRYDPIYALGVVASFQQFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEV-EG 188
GY EED+ +IF A A+ D E+YR +AQ L +G + S L+ ++ GE EG
Sbjct: 66 GYRPEEDKNSIFSALCQAVGGDGEKYRHEAQTLLNQVKGMSVSDLIAMGNSARTGEPGEG 125
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTF 235
+L + + + F YSR FA+GL+ ++ +A +K TF
Sbjct: 126 MLFNTLQAIANNPQFKYSRLFAIGLYTMVMEIDADLLKEQDKRNETF 172
>gi|218442064|ref|YP_002380393.1| Thf1-like protein [Cyanothece sp. PCC 7424]
gi|254784142|sp|B7KI38.1|THF1_CYAP7 RecName: Full=Protein thf1
gi|218174792|gb|ACK73525.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7424]
Length = 226
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 98/165 (59%), Gaps = 14/165 (8%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ +QYDPV+ALG VT + R M+
Sbjct: 6 TVSDSKRDFYTKHTRPINSVYRRVVEELMVEMHLLSVNSDFQYDPVYALGVVTSFQRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV-------EFPSKEG 184
GY + D+E+IF A ++ DP+QYR DA+++ E A+ +A L+ + S E
Sbjct: 66 GYRPDADKESIFNALCQSVGGDPQQYRQDAERMIESAKQLSAQQLLFNLESASDSSSGEN 125
Query: 185 EVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLE 229
++ L IA + Y+R FA+G++ +L T+P +L+
Sbjct: 126 QILQTLIGIA----NAPKYKYTRLFAIGIYTILA---ETDPEMLK 163
>gi|254423933|ref|ZP_05037651.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
7335]
gi|196191422|gb|EDX86386.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
7335]
Length = 250
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 101/185 (54%), Gaps = 8/185 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI +IY V++EL+V+ HL+ + YD ++ALG V+ YDR M+
Sbjct: 6 TVSDTKRAFYSQHTRPINAIYRRVVEELMVEAHLLLVNADFNYDSIYALGVVSTYDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS--KEGEVEG- 188
GY DR+ I++A + A + DP+QYR DA++L ++ S+ F S E + E
Sbjct: 66 GYEPAGDRDNIYRAILQANEADPDQYRRDAEEL--LGVAKSLPSIDAFKSILDEAKTESG 123
Query: 189 --LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSIIHG 246
LK +A F YSR FA+GL+ ++E +A +K R I ++I
Sbjct: 124 SDTLKANLHKAISNPKFKYSRLFAIGLYNVIESIDADMLNDKDK-RDALMAEIASTIGLN 182
Query: 247 EDPFR 251
ED +
Sbjct: 183 EDLLK 187
>gi|428223137|ref|YP_007107307.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
7502]
gi|427996477|gb|AFY75172.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
7502]
Length = 226
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 64/174 (36%), Positives = 95/174 (54%), Gaps = 4/174 (2%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TVA+ K +F K + +P+ SIY V+ EL+V+ HL+R + + YD +FALG T +DR M
Sbjct: 6 TVADAKHDFYKAFSKPVNSIYRRVVDELLVEVHLLRVSQNFGYDSIFALGLATAFDRFMA 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE--GEVEGL 189
GY E D E IF+ AL DP+Q R ++ L E ++ A F + E +++ L
Sbjct: 66 GYQPESDLEPIFKGLCQALLFDPDQIRQESAHLIELSKQFPAEVKSLFTTLEAGADLDTL 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSI 243
+ I A+ F YSR FAVG+F LLE A+ E + R +G ++
Sbjct: 126 MGQIRAIATNP-KFKYSRLFAVGVFILLETADP-EAIADQDKRQALITQVGDTL 177
>gi|428219024|ref|YP_007103489.1| Protein thf1 [Pseudanabaena sp. PCC 7367]
gi|427990806|gb|AFY71061.1| Protein thf1 [Pseudanabaena sp. PCC 7367]
Length = 260
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/177 (36%), Positives = 95/177 (53%), Gaps = 4/177 (2%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++ K +F + + RPI S+Y V+ EL+V+ HL+ +T+ YDPVFALG +T YDR M
Sbjct: 6 TVSDAKRDFFQAFPRPINSVYRRVVDELLVEMHLLTVNQTFAYDPVFALGAITAYDRFML 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
GY E +R+ I A A+ + EQ R DA L E A ++ + +F + E L
Sbjct: 66 GYEPESERDRILPAICGAVHLNAEQMRHDASSLAELAM-RSPIDVKQFLTSLETTENLEP 124
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSIIHG 246
L + F YSR FA+GLF LLE A+ + +K R +G ++ G
Sbjct: 125 LTGTIRAIAANQKFKYSRLFAIGLFTLLETADPNTMSDNDK-RQELIKQVGDALNLG 180
>gi|307155000|ref|YP_003890384.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7822]
gi|306985228|gb|ADN17109.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7822]
Length = 233
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 58/166 (34%), Positives = 97/166 (58%), Gaps = 15/166 (9%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ M+
Sbjct: 6 TVSDSKRDFYSKHTRPINSVYRRVVEELLVETHLLSVNSDFHYDPIYALGVVTSFEQFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA--------SSLVEFPSKE 183
GY E D+E+IF A ++ DP+QYR DAQ + A+ +A SS + +P +
Sbjct: 66 GYRPETDKESIFNALCQSVGGDPQQYRGDAQSILSTAKQLSAQDLLSKLQSSSIAYPQGD 125
Query: 184 GEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLE 229
++ L IA F Y+R FA+G++ +L T+P +L+
Sbjct: 126 NKIIETLVAIA----NAPKFKYTRLFAIGIYTIL---AETDPELLK 164
>gi|170077355|ref|YP_001733993.1| Thf1-like protein [Synechococcus sp. PCC 7002]
gi|254784146|sp|B1XHY6.1|THF1_SYNP2 RecName: Full=Protein thf1
gi|169885024|gb|ACA98737.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
Length = 254
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 101/177 (57%), Gaps = 7/177 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + RPI SI+ V++EL+V+ HL+ ++YDP +ALG VT ++R M+
Sbjct: 6 TVSDTKRDFYTHHTRPINSIFRRVVEELLVEMHLLSVNADFRYDPFYALGVVTSFERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
GY E D+ +IFQ+ A+ D +Y+ DA L E A+ + + L+E ++ EG
Sbjct: 66 GYRPEADKVSIFQSMCQAIGGDANRYKEDAMALVELAKRCSGTQLIECFRQDVPPEGAQE 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSIIHG 246
L + E + +F YSR FA+G++ L +EP +LE D + T++ G
Sbjct: 126 LWEKIEAIAKNDHFKYSRLFAIGVYTFL---GESEPQLLEDTEKR--DEMLTTVTAG 177
>gi|157952488|ref|YP_001497380.1| hypothetical protein NY2A_b184R [Paramecium bursaria Chlorella
virus NY2A]
gi|155122715|gb|ABT14583.1| hypothetical protein NY2A_b184R [Paramecium bursaria Chlorella
virus NY2A]
Length = 247
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 57/159 (35%), Positives = 90/159 (56%), Gaps = 11/159 (6%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 41 LMKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALG 100
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+
Sbjct: 101 IVTALDSILNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PN 154
Query: 182 KEGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLEL 219
K G ++ L DIA + YS F A+G+F+LL++
Sbjct: 155 KHGNTLQKSLYDIASN----DKYVYSSFAAIGIFKLLQM 189
>gi|126658461|ref|ZP_01729609.1| hypothetical protein CY0110_21090 [Cyanothece sp. CCY0110]
gi|126620203|gb|EAZ90924.1| hypothetical protein CY0110_21090 [Cyanothece sp. CCY0110]
Length = 246
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/148 (39%), Positives = 86/148 (58%), Gaps = 5/148 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI SIY ++EL+V+ HL+ ++YDP++ALG VT ++R M+
Sbjct: 6 TVSDTKRKFYGYHTRPINSIYRRFVEELLVEMHLLSVNVDFKYDPIYALGVVTSFERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV--EGL 189
GY E D+ +IF A A+ + EQY +A+ L A+G S+ EF K G+ +G+
Sbjct: 66 GYRPESDKASIFNALCQAVDGNSEQYHQEAEALINEAKG---LSMTEFKDKLGQEGGDGI 122
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLL 217
L + F YSR F VGL+ LL
Sbjct: 123 LWGTCNAIAQNPKFKYSRLFGVGLYTLL 150
>gi|172035357|ref|YP_001801858.1| Thf1-like protein [Cyanothece sp. ATCC 51142]
gi|354555452|ref|ZP_08974753.1| Protein thf1 [Cyanothece sp. ATCC 51472]
gi|254784140|sp|B1WNF0.1|THF1_CYAA5 RecName: Full=Protein thf1
gi|171696811|gb|ACB49792.1| photosystem II 22 kD protein [Cyanothece sp. ATCC 51142]
gi|353552511|gb|EHC21906.1| Protein thf1 [Cyanothece sp. ATCC 51472]
Length = 242
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 85/146 (58%), Gaps = 1/146 (0%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI SIY ++EL+V+ HL+ ++YDP++ALG VT ++R M+
Sbjct: 6 TVSDTKRKFYGYHTRPINSIYRRFVEELLVEMHLLSVNVDFKYDPIYALGVVTSFERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E D+ +IF A A+ + EQY +A+ L A+G + + E +EG +G+L
Sbjct: 66 GYSPESDKTSIFNALCQAVDGNSEQYHQEAEALINEAKGLSITEFKEKLGQEGG-DGILW 124
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL 217
+ F YSR F VGL+ LL
Sbjct: 125 GTCGAIAQNPKFKYSRLFGVGLYTLL 150
>gi|448931221|gb|AGE54783.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
MA-1D]
Length = 248
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/159 (35%), Positives = 90/159 (56%), Gaps = 11/159 (6%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 41 LMKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALG 100
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+
Sbjct: 101 IVTALDSILNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PN 154
Query: 182 KEGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLEL 219
K G ++ L DIA + YS F A+G+F+LL++
Sbjct: 155 KHGNTLQKSLYDIASN----DKYVYSSFAAIGIFKLLQM 189
>gi|448930219|gb|AGE53784.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
IL-3A]
gi|448933659|gb|AGE57214.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
NE-JV-4]
Length = 207
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 90/158 (56%), Gaps = 11/158 (6%)
Query: 63 MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 1 MKAITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHICRYNKNYTYSDVSALGI 60
Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K
Sbjct: 61 VTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIESLKPYAKSSNLG-----PNK 114
Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLEL 219
G ++ L DIA + YS F AVG+F+LL++
Sbjct: 115 HGNTLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQM 148
>gi|427726046|ref|YP_007073323.1| Protein thf1 [Leptolyngbya sp. PCC 7376]
gi|427357766|gb|AFY40489.1| Protein thf1 [Leptolyngbya sp. PCC 7376]
Length = 246
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 90/164 (54%), Gaps = 13/164 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++ K +F + RPI SI+ V++EL+V+ HL+ ++YDP +ALG VT Y+R M+
Sbjct: 6 TVSDAKRDFYGQHTRPINSIFRRVVEELLVEMHLVSVNVDFRYDPFYALGIVTSYERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE------FPSKEGE 185
GY E D+ +IFQA A+ E Y+ DA L E A+ + LV+ P GE
Sbjct: 66 GYRPESDKISIFQAMCQAVGGSAEFYKNDATALVELAKRCSGQQLVDCFRQDNAPEGAGE 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLE 229
+ ++ IA F YSR FA+GL+ L EP +LE
Sbjct: 126 LWAKVEAIA----ANKKFKYSRLFAIGLYTFL---GEAEPALLE 162
>gi|9631702|ref|NP_048481.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]
gi|1131477|gb|AAC96501.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]
gi|448924789|gb|AGE48370.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
AN69C]
Length = 207
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 90/158 (56%), Gaps = 11/158 (6%)
Query: 63 MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 1 MKAITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHIHRYNKNYTYSDVSALGI 60
Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K
Sbjct: 61 VTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIESLKPYAKSSHLG-----PNK 114
Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLEL 219
G ++ L DIA + YS F AVG+F+LL++
Sbjct: 115 HGNTLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQM 148
>gi|443478915|ref|ZP_21068602.1| Protein thf1 [Pseudanabaena biceps PCC 7429]
gi|443015728|gb|ELS30564.1| Protein thf1 [Pseudanabaena biceps PCC 7429]
Length = 240
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 65/182 (35%), Positives = 96/182 (52%), Gaps = 10/182 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + +P+ +Y V+ EL+V+ HL++ +T+ YD +FALGFVT +DR
Sbjct: 6 TVSDTKKDFYLAFPKPVNQVYRRVVDELLVEIHLLKVNQTFVYDAIFALGFVTTFDRFTA 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWAR---GQTASSLVEFPSKEG--EV 186
GY E DR A+F A AL+ D ++ R DA L + A + L S +
Sbjct: 66 GYKPETDRFAVFHALCAALQFDSDRIRQDAATLSDLATRSPNDIKTLLTNLDSGISLEPL 125
Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSIIHG 246
G L+ I S K NF YSR VGL+ LLE+++ E K R +G ++ G
Sbjct: 126 SGQLQII----STKENFKYSRLLGVGLYALLEISDPEEIADSAK-REELIKLVGETLKFG 180
Query: 247 ED 248
D
Sbjct: 181 SD 182
>gi|448927841|gb|AGE51413.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
CviKI]
Length = 232
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 90/158 (56%), Gaps = 11/158 (6%)
Query: 63 MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 26 MKAITSSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALGI 85
Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K
Sbjct: 86 VTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNK 139
Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLEL 219
G ++ L DI S + YS F AVG+F+LL++
Sbjct: 140 HGNTLQKSLYDI----SINDKYVYSSFAAVGIFKLLQM 173
>gi|157953365|ref|YP_001498256.1| hypothetical protein AR158_C174R [Paramecium bursaria Chlorella
virus AR158]
gi|156068013|gb|ABU43720.1| hypothetical protein AR158_C174R [Paramecium bursaria Chlorella
virus AR158]
gi|448930527|gb|AGE54091.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
IL-5-2s1]
gi|448934707|gb|AGE58259.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
NY-2B]
gi|448935079|gb|AGE58630.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
NYs1]
Length = 248
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 57/159 (35%), Positives = 90/159 (56%), Gaps = 11/159 (6%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 41 LMKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALG 100
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+
Sbjct: 101 IVTALDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PN 154
Query: 182 KEGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLEL 219
K G ++ L DIA + YS F A+G+F+LL++
Sbjct: 155 KHGNTLQKSLYDIAIN----DKYVYSSFAAIGIFKLLQM 189
>gi|448928860|gb|AGE52429.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
CvsA1]
Length = 207
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 90/158 (56%), Gaps = 11/158 (6%)
Query: 63 MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 1 MKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALGI 60
Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K
Sbjct: 61 VTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNK 114
Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLEL 219
G ++ L DI S + YS F AVG+F+LL++
Sbjct: 115 HGNTLQKSLYDI----SINDKYVYSSFAAVGIFKLLQM 148
>gi|448930916|gb|AGE54479.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
KS1B]
Length = 207
Score = 103 bits (256), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 89/158 (56%), Gaps = 11/158 (6%)
Query: 63 MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 1 MKAITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHIHRYNKNYTYSDVSALGI 60
Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
VT D ++ +P +E + I A+I +L EDPE Y + + L+ +A+ P+K
Sbjct: 61 VTTLDSVLNTFPDDE-KVCIKNAFIISLNEDPEMYYSNIEYLKPYAKSSNLG-----PNK 114
Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLEL 219
G ++ L DIA + YS F AVG+F+LL++
Sbjct: 115 HGNTLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQM 148
>gi|448931622|gb|AGE55183.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
MA-1E]
Length = 207
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 90/158 (56%), Gaps = 11/158 (6%)
Query: 63 MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 1 MKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALGI 60
Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K
Sbjct: 61 VTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYFSNIETLKPYAKSSHLG-----PNK 114
Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLEL 219
G ++ L DI S + YS F AVG+F+LL++
Sbjct: 115 HGNTLQKSLYDI----SINDKYVYSSFAAVGIFKLLQM 148
>gi|359462375|ref|ZP_09250938.1| Thf1-like protein [Acaryochloris sp. CCMEE 5410]
Length = 214
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/147 (38%), Positives = 87/147 (59%), Gaps = 10/147 (6%)
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
+V+ HL+R ++YDP+FALG T +DR M+GY E D++AIF A A + DP Q +
Sbjct: 1 MVEMHLLRVNEDFRYDPIFALGVTTSFDRFMDGYQPENDKDAIFSAICKAQEADPVQMQK 60
Query: 160 DAQKLEEWARGQTASSLVEFPSKEG-----EVEGLLKDIAERASGKGNFSYSRFFAVGLF 214
D Q+L E A+ ++A ++++ ++ E++ L++IA+ F YSR FA+GLF
Sbjct: 61 DGQRLTELAQSKSAQEMLDWITQAANSGGDELQWQLRNIAQNPK----FKYSRLFAIGLF 116
Query: 215 RLLELANATEPTVLEKVRSTFFDNIGT 241
LLEL+ T E+ + F NI T
Sbjct: 117 TLLELSEGN-ITQDEESLAEFLPNICT 142
>gi|425459592|ref|ZP_18839078.1| Protein thf1 [Microcystis aeruginosa PCC 9808]
gi|389822632|emb|CCI29709.1| Protein thf1 [Microcystis aeruginosa PCC 9808]
Length = 233
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 90/162 (55%), Gaps = 7/162 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRHDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK 230
L D F YSR FA+GL+ + LA A + EK
Sbjct: 130 -LSDSLVSVINAAKFKYSRLFAIGLYTI--LAEAQPDIIKEK 168
>gi|440752363|ref|ZP_20931566.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
TAIHU98]
gi|440176856|gb|ELP56129.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
TAIHU98]
Length = 228
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 90/162 (55%), Gaps = 7/162 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 6 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 66 GYRPGEDKPNIFNALCQAVNGNPEVYRHDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 124
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK 230
L D F YSR FA+GL+ + LA A + EK
Sbjct: 125 -LSDSLVSVINAAKFKYSRLFAIGLYTI--LAEAQPDIIKEK 163
>gi|427419843|ref|ZP_18910026.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 7375]
gi|425762556|gb|EKV03409.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 7375]
Length = 258
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/166 (36%), Positives = 93/166 (56%), Gaps = 14/166 (8%)
Query: 63 MSTATDVPP-----TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPV 117
MS PP TV++TK F + RPI S+Y V++EL+V+ HL+ + Y+P+
Sbjct: 1 MSNVLLPPPVTNLRTVSDTKRAFYNYHSRPINSLYRRVIEELMVEMHLLSVNVDFVYNPL 60
Query: 118 FALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLE----EWARGQTA 173
+ALG +T +DR M GY E+D+E+I A A++ DP+QYR DA+ L+ +
Sbjct: 61 YALGVITSFDRFMVGYEPEQDKESILSAICQAVEGDPQQYRQDAEALKSDLANLSLSDLN 120
Query: 174 SSLVEFPSKEGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLE 218
+ L + +G ++ L +A +AS K Y+R AVGL+ L E
Sbjct: 121 TQLASAKTTDGNGLQNKLHVVATQASAK----YTRLMAVGLYTLFE 162
>gi|97202816|sp|P0C1D1.1|THF1_SYNJB RecName: Full=Protein thf1
Length = 239
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 92/174 (52%), Gaps = 3/174 (1%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T++ TK F Y RPI ++Y V++EL+V+ HL T+ YDP FALG VT+YD LME
Sbjct: 6 TLSATKAAFFSAYPRPINAVYRRVVEELLVELHLTTVNSTFVYDPFFALGLVTLYDGLME 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE--GL 189
Y E REAIF A AL PE R +A+ L E ++ + E E G
Sbjct: 66 AYHPPEQREAIFNALCKALHLKPEVLRKNARDLLELMGSGDPRQRLDLLCLKPEAEDVGG 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSI 243
LK I ER + + ++YSR AVGL+ E+ + E+ F +N+ + +
Sbjct: 126 LKAILERMT-QPPYAYSRVLAVGLYTAYEVVAKSLYEEPEERTRRFLENVVSKL 178
>gi|425436789|ref|ZP_18817221.1| Protein thf1 [Microcystis aeruginosa PCC 9432]
gi|425451594|ref|ZP_18831415.1| Protein thf1 [Microcystis aeruginosa PCC 7941]
gi|389678450|emb|CCH92698.1| Protein thf1 [Microcystis aeruginosa PCC 9432]
gi|389767069|emb|CCI07461.1| Protein thf1 [Microcystis aeruginosa PCC 7941]
Length = 233
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 90/162 (55%), Gaps = 7/162 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENIIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK 230
L D F YSR FA+GL+ + LA A + EK
Sbjct: 130 -LSDSLVSVINAAKFKYSRLFAIGLYTI--LAEAQPDIIKEK 168
>gi|37520969|ref|NP_924346.1| Thf1-like protein [Gloeobacter violaceus PCC 7421]
gi|81710432|sp|Q7NKS7.1|THF1_GLOVI RecName: Full=Protein thf1
gi|35211965|dbj|BAC89341.1| glr1400 [Gloeobacter violaceus PCC 7421]
Length = 228
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/154 (38%), Positives = 88/154 (57%), Gaps = 4/154 (2%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F Y RP+ SIY V+ EL+V+ HL+ + +++DP+FA G +T Y LME
Sbjct: 6 TVSDSKRAFFAAYPRPVNSIYRRVIDELLVEVHLLITNQDFRHDPLFATGLLTAYQALME 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV-EGLL 190
GY E R+AI +A TAL+ EQ DA + A A ++E + + E +G L
Sbjct: 66 GYTPVEQRDAILRALCTALELSYEQLHTDAAQWRAIAAELPAQEVLEVMAGKREAGDGRL 125
Query: 191 KDIAERASGKGN---FSYSRFFAVGLFRLLELAN 221
K + + +G N F YSR F++GL +LE A
Sbjct: 126 KAMGDTLAGIANAERFKYSRLFSLGLANILEQAG 159
>gi|425445848|ref|ZP_18825868.1| Protein thf1 [Microcystis aeruginosa PCC 9443]
gi|389734049|emb|CCI02237.1| Protein thf1 [Microcystis aeruginosa PCC 9443]
Length = 233
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 90/162 (55%), Gaps = 7/162 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK 230
L D F YSR FA+GL+ + LA A + EK
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTI--LAEAQPDIIKEK 168
>gi|390439536|ref|ZP_10227927.1| Protein thf1 [Microcystis sp. T1-4]
gi|389837025|emb|CCI32051.1| Protein thf1 [Microcystis sp. T1-4]
Length = 233
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 90/173 (52%), Gaps = 29/173 (16%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY ED+ IF A A+ +PE YR DA+ + A KE ++ LL
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIA-------------KETNIDSLLS 117
Query: 192 DIAERASGKGN--------------FSYSRFFAVGLFRLLELANATEPTVLEK 230
+ +A G N F YSR FA+GL+ + LA A + EK
Sbjct: 118 QLQNQALGGDNQLSDSLVSLINAPKFKYSRLFAIGLYTI--LAEAQPDMIKEK 168
>gi|425453632|ref|ZP_18833389.1| Protein thf1 [Microcystis aeruginosa PCC 9807]
gi|389800936|emb|CCI19831.1| Protein thf1 [Microcystis aeruginosa PCC 9807]
Length = 233
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 90/162 (55%), Gaps = 7/162 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK 230
L D F YSR FA+GL+ + LA A + EK
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTI--LAEAQPDMIKEK 168
>gi|422302142|ref|ZP_16389506.1| Protein thf1 [Microcystis aeruginosa PCC 9806]
gi|389788699|emb|CCI15466.1| Protein thf1 [Microcystis aeruginosa PCC 9806]
Length = 233
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 90/162 (55%), Gaps = 7/162 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK 230
L D F YSR FA+GL+ + LA A + EK
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTI--LAEAQPDMIKEK 168
>gi|425470743|ref|ZP_18849603.1| Protein thf1 [Microcystis aeruginosa PCC 9701]
gi|389883502|emb|CCI36111.1| Protein thf1 [Microcystis aeruginosa PCC 9701]
Length = 233
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 90/162 (55%), Gaps = 7/162 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK 230
L D F YSR FA+GL+ + LA A + EK
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTI--LAEAQPDMIKEK 168
>gi|425441488|ref|ZP_18821762.1| Protein thf1 [Microcystis aeruginosa PCC 9717]
gi|425463770|ref|ZP_18843100.1| Protein thf1 [Microcystis aeruginosa PCC 9809]
gi|389717772|emb|CCH98181.1| Protein thf1 [Microcystis aeruginosa PCC 9717]
gi|389829228|emb|CCI29632.1| Protein thf1 [Microcystis aeruginosa PCC 9809]
Length = 233
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 90/162 (55%), Gaps = 7/162 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK 230
L D F YSR FA+GL+ + LA A + EK
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTI--LAEAQPDMIKEK 168
>gi|443669636|ref|ZP_21134837.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
DIANCHI905]
gi|159030831|emb|CAO88510.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443330085|gb|ELS44832.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
DIANCHI905]
Length = 228
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 90/162 (55%), Gaps = 7/162 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 6 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 66 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENIIAIAKETNIDSLLSQLQNPALGGNNQ- 124
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK 230
L D F YSR FA+GL+ + LA A + EK
Sbjct: 125 -LSDSLVSVINAPKFKYSRLFAIGLYTI--LAEAQPDIIKEK 163
>gi|166367182|ref|YP_001659455.1| Thf1-like protein [Microcystis aeruginosa NIES-843]
gi|166089555|dbj|BAG04263.1| Psb29 Photosystem II sub-stoichiometric subunit [Microcystis
aeruginosa NIES-843]
Length = 233
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 89/173 (51%), Gaps = 29/173 (16%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY ED+ IF A A+ +PE YR DA+ + A KE ++ LL
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIA-------------KETNIDSLLS 117
Query: 192 DIAERASGKGN--------------FSYSRFFAVGLFRLLELANATEPTVLEK 230
+ A G N F YSR FA+GL+ + LA A + EK
Sbjct: 118 QLQNPALGANNQLSDSLVSLINAPKFKYSRLFAIGLYTI--LAEAQPDIIKEK 168
>gi|86606816|ref|YP_475579.1| Thf1-like protein [Synechococcus sp. JA-3-3Ab]
gi|97202812|sp|Q2JSQ3.1|THF1_SYNJA RecName: Full=Protein thf1
gi|86555358|gb|ABD00316.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length = 239
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 74/178 (41%), Positives = 93/178 (52%), Gaps = 14/178 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T++ TK F Y RPI + Y V++EL+V+ HL + YDP FALG VT+YD LME
Sbjct: 6 TLSATKAAFFSAYPRPINAAYRRVVEELLVELHLTTVNSAFVYDPFFALGLVTLYDSLME 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG----QTASSLVEFPSKEGEVE 187
Y E REAIF A AL PE R +A+ L E R Q + L P E E
Sbjct: 66 AYHPPEQREAIFNALCKALHLKPEVLRKNARDLLELMRSGDPVQRYNLLCLKP--EAEDV 123
Query: 188 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT----EPTVLEKVRSTFFDNIGT 241
G LK I +R + + ++YSR AVGL+ E A AT EP E+ R D IG
Sbjct: 124 GGLKAILQRMT-QPPYAYSRVLAVGLYTAYE-AVATSLYKEPE--ERTRHFLEDVIGN 177
>gi|428769945|ref|YP_007161735.1| Protein thf1 [Cyanobacterium aponinum PCC 10605]
gi|428684224|gb|AFZ53691.1| Protein thf1 [Cyanobacterium aponinum PCC 10605]
Length = 234
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 90/154 (58%), Gaps = 4/154 (2%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + ++RPI SIY V++EL+V+ HL+ + DP++ LG + + M+
Sbjct: 18 TVSDTKRSFYQHHQRPINSIYRRVVEELMVEMHLLAVNVDFNPDPIYYLGVYQSFQQFMQ 77
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF---PSKEGEVEG 188
GY E D+E+IF A +++ +P++Y +Q L + G++A ++++ PS EG++E
Sbjct: 78 GYKPESDKESIFNALCQSIENNPQEYISKSQTLLNFVEGKSAQEILDWLLNPSGEGDLEA 137
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANA 222
+ F YSR FA+G + L+E +
Sbjct: 138 VASHWRSNLEN-PRFKYSRLFAIGFYTLIEKGDG 170
>gi|428203624|ref|YP_007082213.1| photosystem II biogenesis protein Psp29 [Pleurocapsa sp. PCC 7327]
gi|427981056|gb|AFY78656.1| photosystem II biogenesis protein Psp29 [Pleurocapsa sp. PCC 7327]
Length = 241
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 90/159 (56%), Gaps = 5/159 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++ K +F + RPI SIY ++ELIV+ HL+ ++YD ++ALG VT ++R M+
Sbjct: 11 TVSDAKRDFYTHHTRPINSIYRRFVEELIVEMHLLSVNTDFRYDAIYALGVVTAFERFMQ 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE--FPSKEGEVEGL 189
GY E+D+ +IF A A + EQYR +A ++ A+ + L+ S E
Sbjct: 71 GYQPEQDKSSIFAALCQATGGNWEQYRQEAGEILAQAKQMSVQELIAKINSSTPTGGENR 130
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL 228
L + + + + N+ YSR FA+GL+ LL A+ P +L
Sbjct: 131 LVETLQAIANRSNYKYSRLFAIGLYTLLAEAD---PDIL 166
>gi|428773451|ref|YP_007165239.1| photosystem II biogenesis protein Psp29 [Cyanobacterium stanieri
PCC 7202]
gi|428687730|gb|AFZ47590.1| photosystem II biogenesis protein Psp29 [Cyanobacterium stanieri
PCC 7202]
Length = 233
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 57/163 (34%), Positives = 87/163 (53%), Gaps = 14/163 (8%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++T+ F + + RPI SIY V+QEL+V+ HL+ +Q D V+A+G +++ M
Sbjct: 6 TVSDTRRAFYQYHTRPINSIYRQVVQELMVEMHLLSVNTDFQPDAVYAVGVCQSFEQFMT 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
GY EED+ +IF A A++ +P+ YR ++ L + G++A LV + GL
Sbjct: 66 GYKPEEDKTSIFNALCKAIEANPDDYRHQSESLLNFVEGKSAEDLVNWLLNPVADNGLDE 125
Query: 190 -----LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTV 227
LK I ER F YSR F +G + L+ N P V
Sbjct: 126 NIVNSLKSILERER----FKYSRLFGIGFYTLI---NKVAPDV 161
>gi|16330615|ref|NP_441343.1| Thf1-like-protein [Synechocystis sp. PCC 6803]
gi|383322356|ref|YP_005383209.1| hypothetical protein SYNGTI_1447 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325525|ref|YP_005386378.1| hypothetical protein SYNPCCP_1446 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491409|ref|YP_005409085.1| hypothetical protein SYNPCCN_1446 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436676|ref|YP_005651400.1| hypothetical protein SYNGTS_1447 [Synechocystis sp. PCC 6803]
gi|451814773|ref|YP_007451225.1| hypothetical protein MYO_114600 [Synechocystis sp. PCC 6803]
gi|81671042|sp|P73956.1|THF1_SYNY3 RecName: Full=Protein thf1
gi|1653107|dbj|BAA18023.1| sll1414 [Synechocystis sp. PCC 6803]
gi|339273708|dbj|BAK50195.1| hypothetical protein SYNGTS_1447 [Synechocystis sp. PCC 6803]
gi|359271675|dbj|BAL29194.1| hypothetical protein SYNGTI_1447 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274845|dbj|BAL32363.1| hypothetical protein SYNPCCN_1446 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278015|dbj|BAL35532.1| hypothetical protein SYNPCCP_1446 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407958541|dbj|BAM51781.1| Thf1-like-protein [Bacillus subtilis BEST7613]
gi|451780742|gb|AGF51711.1| hypothetical protein MYO_114600 [Synechocystis sp. PCC 6803]
Length = 240
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 53/147 (36%), Positives = 79/147 (53%), Gaps = 1/147 (0%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++ K F Y RPI SIY ++EL+V+ HL+ + YDP+FALG VT ++ M+
Sbjct: 6 TVSDAKRKFFTHYSRPISSIYRRFVEELLVEMHLLSVNIDFTYDPIFALGIVTSFNSFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV-EFPSKEGEVEGLL 190
GY E AIF A + ++P+Q R DA+ + A + V S++ + LL
Sbjct: 66 GYQPAEQLPAIFNALCHGVDQNPDQVRQDAKNVAASAHHIGLDAWVTAAASEQASGDNLL 125
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLL 217
+ + F YSR FA+GL+ LL
Sbjct: 126 LNTLTGIHQRHKFKYSRLFAIGLYTLL 152
>gi|423062334|ref|ZP_17051124.1| Thf1-like protein [Arthrospira platensis C1]
gi|406716242|gb|EKD11393.1| Thf1-like protein [Arthrospira platensis C1]
Length = 215
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 81/145 (55%), Gaps = 2/145 (1%)
Query: 95 VLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDP 154
+++EL+V+ HL+ ++YDP++ALG VT +DR M+GY E D+ +I+ A I A + DP
Sbjct: 1 MVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYIPEADKLSIWAALIMAQESDP 60
Query: 155 EQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVG 212
QYR DA LE A + L E ++E + L+ + F YSR FA+G
Sbjct: 61 NQYRADATALEAQAATLSVKDLTERAKIAQESSGDDPLQSCFHAIANNPKFKYSRLFAIG 120
Query: 213 LFRLLELANATEPTVLEKVRSTFFD 237
L+ LLE ++ T E +++ D
Sbjct: 121 LYTLLEKSDVTAAQDSEGLKTILSD 145
>gi|67921410|ref|ZP_00514928.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
gi|67856522|gb|EAM51763.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
Length = 245
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 55/152 (36%), Positives = 83/152 (54%), Gaps = 9/152 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + +PI SIY ++EL+V+ HL+ + YDP++ALG VT + R M+
Sbjct: 6 TVSDTKRKFYGYHTQPINSIYRRFVEELLVEMHLLSVNIDFSYDPIYALGVVTSFQRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV----- 186
GY E D+ +IF A A+ E+Y +A+ + A+G S+V+F K V
Sbjct: 66 GYSPESDKPSIFNALCQAVDGSSEKYHQEAEAILNEAKGL---SIVDFKDKLTHVTDNQV 122
Query: 187 -EGLLKDIAERASGKGNFSYSRFFAVGLFRLL 217
EG+L + F YSR A+GL+ LL
Sbjct: 123 GEGVLWGTFGAIAANPKFKYSRLLAIGLYTLL 154
>gi|119510704|ref|ZP_01629832.1| hypothetical protein N9414_22068 [Nodularia spumigena CCY9414]
gi|119464658|gb|EAW45567.1| hypothetical protein N9414_22068 [Nodularia spumigena CCY9414]
Length = 200
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 53/152 (34%), Positives = 91/152 (59%), Gaps = 15/152 (9%)
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
+V+ HL+ + Y+P++ALG VT +DR M+GY E+D+E+IFQA A++++P++YR
Sbjct: 1 MVEMHLLSVNSGFSYNPIYALGVVTSFDRFMQGYLPEQDQESIFQALCQAVEQEPQRYRE 60
Query: 160 DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 213
DA++L+ A+ + L+ + S ++ +++ L+ IA + F YSR FAVGL
Sbjct: 61 DAKRLQALAKDLPVNDLIAWLSQTTHLDRDPDLQAQLQAIAHNS----EFKYSRLFAVGL 116
Query: 214 FRLLELANATEPTVL--EKVRSTFFDNIGTSI 243
F LLE ++P ++ EK R+ I +
Sbjct: 117 FTLLE---QSDPELVKDEKQRTEALKTIAAGL 145
>gi|22298677|ref|NP_681924.1| Thf1-like protein [Thermosynechococcus elongatus BP-1]
gi|81743247|sp|Q8DJT8.1|THF1_THEEB RecName: Full=Protein thf1
gi|22294857|dbj|BAC08686.1| tlr1134 [Thermosynechococcus elongatus BP-1]
Length = 222
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/149 (38%), Positives = 84/149 (56%), Gaps = 8/149 (5%)
Query: 70 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
P TV++TK F + RPI SIY ++EL+V+ HL+R ++Y P+FALG VT +D+
Sbjct: 4 PRTVSDTKRAFYAAHTRPIHSIYRRFIEELLVEIHLLRVNVDFRYSPLFALGVVTAFDQF 63
Query: 130 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 189
MEGY E DR+ IF A A + +P+Q + DA +++ + L E S G+
Sbjct: 64 MEGYQPEGDRDRIFHALCVAEEMNPQQLKEDAASWQQYQGRPLSQILDELNS--GQPSAP 121
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLE 218
L + +GK YSR AVGL+ L+
Sbjct: 122 LNSLNH--TGK----YSRLHAVGLYAFLQ 144
>gi|443323210|ref|ZP_21052219.1| photosystem II biogenesis protein Psp29 [Gloeocapsa sp. PCC 73106]
gi|442787120|gb|ELR96844.1| photosystem II biogenesis protein Psp29 [Gloeocapsa sp. PCC 73106]
Length = 231
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 84/150 (56%), Gaps = 5/150 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + RPI SIY V++ELIV+ HL+ + ++ DP++ LG VT +DR M+
Sbjct: 6 TVSDTKRDFYAHHTRPINSIYRRVVEELIVELHLLSVNQNFRVDPIYCLGVVTSFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWA-RGQTASSLVEFPSKEGEVEGLL 190
GY EED+ +I + A+ EQYR A ++ A R L+ + VEG
Sbjct: 66 GYRPEEDKASILASLCQAVGGKLEQYRDHANQVLNLAKRLHGVDDLLAWFKHPQPVEGEF 125
Query: 191 KDIAERASG---KGNFSYSRFFAVGLFRLL 217
+AE S +F YSR F +GL+ +L
Sbjct: 126 A-LAEAVSAIALNQSFKYSRMFGIGLYTML 154
>gi|209522934|ref|ZP_03271491.1| Thf1-like protein [Arthrospira maxima CS-328]
gi|209496521|gb|EDZ96819.1| Thf1-like protein [Arthrospira maxima CS-328]
Length = 210
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 50/140 (35%), Positives = 76/140 (54%), Gaps = 2/140 (1%)
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
+V+ HL+ ++YDP++ALG VT +DR M+GY E D+ +I+ A I A + DP QYR
Sbjct: 1 MVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYIPEADKLSIWAALIMAQESDPNQYRA 60
Query: 160 DAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL 217
DA LE A + L E ++E + L+ + F YSR FA+GL+ LL
Sbjct: 61 DATALEAQAATLSVKDLTERAKIAQESSGDDPLQSCFHAIANNPKFKYSRLFAIGLYTLL 120
Query: 218 ELANATEPTVLEKVRSTFFD 237
E ++ T E +++ D
Sbjct: 121 EKSDVTAAQDSEGLKTILSD 140
>gi|282901466|ref|ZP_06309391.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281193745|gb|EFA68717.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 201
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 75/128 (58%), Gaps = 10/128 (7%)
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
+V+ HL+ + Y+ ++ALG VT +DR M+GY ED +IF A I A+++DP+ YR
Sbjct: 1 MVEMHLLSVNVDFSYNSIYALGVVTTFDRFMQGYQPSEDLVSIFNAIICAVEQDPQVYRQ 60
Query: 160 DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 213
DA KL+ A + L+ + S ++ ++ L+ IA+ NF YSR A+GL
Sbjct: 61 DAAKLKAIANSFSVKDLIAWCSQTTPLDQDANLQAELQAIAQNP----NFKYSRLLAIGL 116
Query: 214 FRLLELAN 221
F LLEL++
Sbjct: 117 FSLLELSD 124
>gi|282898285|ref|ZP_06306276.1| Protein thf1 [Raphidiopsis brookii D9]
gi|281196816|gb|EFA71721.1| Protein thf1 [Raphidiopsis brookii D9]
Length = 202
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 75/128 (58%), Gaps = 10/128 (7%)
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
+V+ HL+ + Y+ ++ALG VT +DR M+GY ED +IF A I A+++DP+ YR
Sbjct: 1 MVEMHLLSVNVDFSYNSIYALGVVTTFDRFMQGYQPSEDLVSIFNAIICAVEQDPQVYRQ 60
Query: 160 DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 213
DA KL+ A + L+ + S ++ ++ L+ IA+ NF YSR A+GL
Sbjct: 61 DAAKLKAIANSFSVKDLIAWCSQTTPLDQDANLQAELQAIAQNP----NFKYSRLLAIGL 116
Query: 214 FRLLELAN 221
F LLEL++
Sbjct: 117 FSLLELSD 124
>gi|409992261|ref|ZP_11275462.1| inositol phosphatase [Arthrospira platensis str. Paraca]
gi|409936888|gb|EKN78351.1| inositol phosphatase [Arthrospira platensis str. Paraca]
Length = 210
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 75/140 (53%), Gaps = 2/140 (1%)
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
+V+ HL+ ++YDP++ALG VT +DR M+GY E D+ +I+ A I A + DP QYR
Sbjct: 1 MVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYTPETDKLSIWAALIGAQESDPNQYRA 60
Query: 160 DAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL 217
DA LE A L + ++E + L+ + F YSR A+GL+ LL
Sbjct: 61 DATALEAQAASLAVKDLTDKAKIAQESSGDDPLQSCFHAIANNPKFKYSRLLAIGLYTLL 120
Query: 218 ELANATEPTVLEKVRSTFFD 237
E ++AT E +++ D
Sbjct: 121 EKSDATAAQDSEGLKTILSD 140
>gi|148242504|ref|YP_001227661.1| Thf1-like protein [Synechococcus sp. RCC307]
gi|147850814|emb|CAK28308.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
Length = 237
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 93/175 (53%), Gaps = 5/175 (2%)
Query: 70 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
P TVA++K F Y IP +Y V+ EL+V+ HL+ + +Q D +FA+G V+D L
Sbjct: 6 PQTVADSKRRFYAAYPHVIPGLYRRVVDELLVELHLLAGQAGFQADSLFAMGLTQVFDNL 65
Query: 130 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEG 188
M+G+ E ++ +F A + +Q R DA++L E + + + ++G+
Sbjct: 66 MQGFKPAERQKELFAAICSGAGLKADQLRKDAKQLREHLVPHGEAEIKSWIEQQGQGAPD 125
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLE---LANATEPTVLEKVRSTFFDNIG 240
+LK + ++A G+ +F YSR AVGL LL+ + +P L++ ++G
Sbjct: 126 VLKHVLQQA-GRSDFHYSRLHAVGLMGLLQDLSGGDDQDPQALQERAHQLGHSMG 179
>gi|427711975|ref|YP_007060599.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
6312]
gi|427376104|gb|AFY60056.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
6312]
Length = 245
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/156 (35%), Positives = 81/156 (51%), Gaps = 14/156 (8%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI SI+ ++EL+V+ HL+R + Y P+ ALG VT Y+ M
Sbjct: 6 TVSDTKKAFYAAHTRPIHSIFRRFVEELLVEVHLLRVNTNFVYSPLLALGIVTAYNHFMS 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA------SSLVEFPSKEGE 185
GY E DR +IF ++ A + DP+Q + DA + W + L + S+ G+
Sbjct: 66 GYRPETDRNSIFTSFAIAEEFDPQQLQADAAR---WEELAGLELEELQTRLQAWISEGGD 122
Query: 186 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA 220
L+D K YSR A+GL+ LLE A
Sbjct: 123 PWHNSLRDAVNNPQTK----YSRLQAIGLYHLLEQA 154
>gi|284929212|ref|YP_003421734.1| photosystem II biogenesis protein Psp29 [cyanobacterium UCYN-A]
gi|284809656|gb|ADB95353.1| photosystem II biogenesis protein Psp29 [cyanobacterium UCYN-A]
Length = 237
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 82/154 (53%), Gaps = 3/154 (1%)
Query: 68 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 127
D TV+ETK F + +PI SIY ++EL+V+ HL+ YQY P++ALG VT+++
Sbjct: 2 DNIRTVSETKREFYNFFTKPISSIYRRFIEELLVEMHLLSVNADYQYSPIYALGVVTLFE 61
Query: 128 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV- 186
+ M Y ++ ++ IF A + D +QYR ++ + A + S+ E +K +
Sbjct: 62 KFMYRYQPDDHQDLIFDALCKSTGGDTKQYRQESNTILNEAETLSISNFKEDFTKSAQEK 121
Query: 187 --EGLLKDIAERASGKGNFSYSRFFAVGLFRLLE 218
+ LL + F YSR A+GL+ LLE
Sbjct: 122 VNDKLLWKSYYSIAQNPKFKYSRLLAIGLYSLLE 155
>gi|254413033|ref|ZP_05026805.1| photosystem II biogenesis protein Psp29 [Coleofasciculus
chthonoplastes PCC 7420]
gi|196180197|gb|EDX75189.1| photosystem II biogenesis protein Psp29 [Coleofasciculus
chthonoplastes PCC 7420]
Length = 208
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 84/149 (56%), Gaps = 9/149 (6%)
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
+V+ HL+ ++YDP++ LG V ++R M+GY E D+E+IF A A+ +P+QY+
Sbjct: 1 MVEMHLLAVNVDFKYDPIYVLGVVASFNRFMQGYRPERDKESIFNALCQAVGGNPQQYQD 60
Query: 160 DAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAERASGKGN---FSYSRFFAVGLFRL 216
DA+KL+ +A LV++ +EG +DI + + F YSR FA+GL+ L
Sbjct: 61 DAEKLKAAVGRLSAQELVDWFGSPTPLEG-AEDIHTTVAAIADNPKFKYSRLFAIGLYTL 119
Query: 217 LELANATEPTVLEKVRST--FFDNIGTSI 243
LE A EP +++ + + IG ++
Sbjct: 120 LEQA---EPELVQDAKQSMEMLQRIGQTL 145
>gi|124023249|ref|YP_001017556.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9303]
gi|123963535|gb|ABM78291.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9303]
Length = 250
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 53/174 (30%), Positives = 85/174 (48%), Gaps = 7/174 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + IPS+Y EL+V+ HL+ +++ + D +FA+G V+D
Sbjct: 14 TIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVFDVFTR 73
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E + +F A + DP R AQK E RG + + ++G +G +
Sbjct: 74 GYRPEAHVKTLFDALCRSCGFDPNALRKQAQKTLESVRGHDLEEVQGWIQQQG--KGAPE 131
Query: 192 DIAE--RASGKGNFSYSRFFAVGLFRLLELANATE---PTVLEKVRSTFFDNIG 240
+A+ R +G F YSR AVGL LL A E P L ++ +++G
Sbjct: 132 ALAQALRNTGSNTFHYSRLMAVGLLSLLASAQGDESSDPEKLSQIAHELSESVG 185
>gi|33862947|ref|NP_894507.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9313]
gi|81577657|sp|Q7V7R3.1|THF1_PROMM RecName: Full=Protein thf1
gi|33634864|emb|CAE20850.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 243
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/176 (30%), Positives = 84/176 (47%), Gaps = 6/176 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + IPS+Y EL+V+ HL+ +++ + D +FA+G V+D
Sbjct: 6 TIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVFDVFTS 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE--VEGL 189
GY E + +F A + DP R AQ+ E RG + + ++G+ E L
Sbjct: 66 GYRPEAHVKTLFDALCRSCGFDPNALRKQAQQTLESVRGHDLEEVQGWIQQQGKGAPEAL 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATE---PTVLEKVRSTFFDNIGTS 242
K + A G F YSR AVGL LL A E P L ++ +++G S
Sbjct: 126 AKALRNTA-GSTTFHYSRLMAVGLLSLLASAQGDESSDPEKLSQIAHELSESVGFS 180
>gi|299469582|emb|CBN76436.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 226
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 61/110 (55%), Gaps = 2/110 (1%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+ET +F YK+ + + + T++ E + HL Y ++YDP+F +GF T + R M
Sbjct: 107 TVSETVADFYIYYKKVVLTQFRTIVTEYLQSTHLTVYDARFKYDPLFGVGFYTSFMRFMR 166
Query: 132 GYPSEEDREAIFQAYITALKE--DPEQYRIDAQKLEEWARGQTASSLVEF 179
YP E IF A + A+ DP+Q R D L+EWA G+T +VE
Sbjct: 167 AYPVPGQAELIFDAVVKAIGNGLDPDQMRKDTTALKEWAEGKTEEDVVEM 216
>gi|318041533|ref|ZP_07973489.1| Thf1-like protein [Synechococcus sp. CB0101]
Length = 224
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/177 (30%), Positives = 89/177 (50%), Gaps = 5/177 (2%)
Query: 69 VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 128
V TVA++K F + I IY ++ EL+V+ HL+ +++ ++ D +FA+G V+D
Sbjct: 3 VSLTVADSKRAFHSAFSYVIAPIYRRLVDELLVELHLLSHQKGFRADGLFAVGLTQVFDS 62
Query: 129 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
GY E RE +FQA +A D R A++ + + + + S +G +G
Sbjct: 63 FSTGYRPEAQREPLFQALCSANGFDGAALRAQAEQARQQVGHHSLEEVKGWLSNQG--QG 120
Query: 189 LLKDIAERASG--KGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSI 243
+ IA G + +F YSR AVGL LL+ A + + +RS + IG S+
Sbjct: 121 APELIASLLQGVQRDDFHYSRLVAVGLLSLLQSAQGADALDPQALRSAAHE-IGESM 176
>gi|116074797|ref|ZP_01472058.1| hypothetical protein RS9916_29724 [Synechococcus sp. RS9916]
gi|116068019|gb|EAU73772.1| hypothetical protein RS9916_29724 [Synechococcus sp. RS9916]
Length = 234
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/173 (28%), Positives = 86/173 (49%), Gaps = 5/173 (2%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + IPS+Y EL+V+ HL+ +++ ++ D +FA+G V+D
Sbjct: 6 TIADSKRAFHSAFPHVIPSLYRRTADELLVELHLLSHQKQFKVDALFAVGLRQVFDAFTR 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEGLL 190
GY E +++F A + DP + A E +G + + ++ +GE +
Sbjct: 66 GYRPEAHLDSLFAAICSCNGFDPAALKQLALDSEHAVQGHSFEDVQQWLRNKGEGAPAAI 125
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELA---NATEPTVLEKVRSTFFDNIG 240
+ +RA NF YSR AVGL LL A + ++P+ L K+ + +G
Sbjct: 126 TKVLKRAD-HANFHYSRLMAVGLLTLLAKAQGDDGSDPSELAKLAHELSEPLG 177
>gi|449015870|dbj|BAM79272.1| photosystem II biogenesis protein Psb29 [Cyanidioschyzon merolae
strain 10D]
Length = 327
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 76/151 (50%), Gaps = 5/151 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+ET F + KRP+ Y + E++ HL ++YD +FALGFV+VY
Sbjct: 88 TVSETVTRFYRNLKRPVVFYYQQAVDEILTTAHLALVCAMFRYDVIFALGFVSVYRDFFR 147
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE-----GEV 186
YP ++RE++F+ AL D Q +A + +G+T + L+E ++ E
Sbjct: 148 SYPRPDERESLFRCICDALDLDVGQVTKEADDALAYVQGKTEAELIEEIERDTGEDSAEA 207
Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLL 217
+ ++ + G + Y+R F +GL +++
Sbjct: 208 QPVIAALRACRRADGEYYYTRLFGIGLMKIM 238
>gi|352093979|ref|ZP_08955150.1| Protein thf1 [Synechococcus sp. WH 8016]
gi|351680319|gb|EHA63451.1| Protein thf1 [Synechococcus sp. WH 8016]
Length = 247
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 52/174 (29%), Positives = 87/174 (50%), Gaps = 5/174 (2%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + IPS+Y EL+V+ HL+ +++ ++ D +FA+G V+ +
Sbjct: 6 TIADSKRAFHTAFPYVIPSLYRRTADELLVELHLLSHQQHFKSDALFAVGLRQVFQAFTQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E + ++ A ++ DPE + A+ G T S + E+ S G G +
Sbjct: 66 GYKPEAHLDELYAAICSSNGFDPEALKQLAEGSTSAVSGHTISEVREWLSNRG--AGAPE 123
Query: 192 DIAERAS--GKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRSTFFDNIGTSI 243
+A S G +F YSR AVGL LL A EP+ ++++ T IG +
Sbjct: 124 PLASGISSVGGDSFHYSRLMAVGLLSLLSSAQGGEPSNPDELK-TLAHEIGEQL 176
>gi|317970011|ref|ZP_07971401.1| Thf1-like protein [Synechococcus sp. CB0205]
Length = 228
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/168 (29%), Positives = 79/168 (47%), Gaps = 10/168 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TVA++K F + + I +Y ++ EL+V+ HL+ +++ + D +FA+G V+D
Sbjct: 8 TVADSKRAFHQAFPYVIAPLYRRLVDELLVELHLLSHQKGFHADGLFAVGLTQVFDSFSN 67
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-----V 186
GY E RE +FQA +A D +R A + + + S GE +
Sbjct: 68 GYKPEAQREPLFQALCSANGFDGGAFRQMASDAATQVGHHSLDEVKGWLSNRGEGAPAPI 127
Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKVRST 234
GLL + +F YSR AVGL LL+ A E + +RS
Sbjct: 128 AGLLHGVQRE-----DFHYSRLVAVGLLSLLQRAQGAEAMDPQALRSA 170
>gi|397644025|gb|EJK76212.1| hypothetical protein THAOC_02035 [Thalassiosira oceanica]
Length = 293
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 71/133 (53%), Gaps = 2/133 (1%)
Query: 87 PIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAY 146
PI ++Y + +L+ HL +Q DPVF+LG VTV D L++ +P ++ + I A
Sbjct: 66 PINALYKGTITDLVGSLHLTVVTARFQRDPVFSLGLVTVLDLLLKNFPEQDTAKRIKSAM 125
Query: 147 ITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYS 206
I + + +A ++ WA+G+T + + GE + L +A A G + YS
Sbjct: 126 IESAGMVESEVDAEAAEVATWAQGKTREDIA--SALRGEGDSTLAQVANGAKGDEYWMYS 183
Query: 207 RFFAVGLFRLLEL 219
RFF +GL +++++
Sbjct: 184 RFFGIGLVKMMDI 196
>gi|113955551|ref|YP_730625.1| Thf1-like protein [Synechococcus sp. CC9311]
gi|113882902|gb|ABI47860.1| Uncharacterized protein [Synechococcus sp. CC9311]
Length = 252
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 87/176 (49%), Gaps = 7/176 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F K + IPS+Y EL+V+ HL+ +++ ++ D +FA+G V+ +
Sbjct: 11 TIADSKRAFHKSFPYVIPSLYRRTADELLVELHLLSHQQHFKSDALFAVGLRQVFMAFTQ 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E + ++ A T +PE + A+ G T + + E+ S G G +
Sbjct: 71 GYKPETHLDELYAAICTCNGFEPEALKQLAEGSTSAVSGHTINEVREWLSNRG--AGAPE 128
Query: 192 DIAERASGKG--NFSYSRFFAVGLFRLLELANATEPT---VLEKVRSTFFDNIGTS 242
+A S G +F YSR AVGL LL A EP+ L+K+ + +G S
Sbjct: 129 PLASGISSVGGESFHYSRLMAVGLLSLLSSAQGGEPSNPDELKKLAHEIGEQLGLS 184
>gi|428183151|gb|EKX52010.1| hypothetical protein GUITHDRAFT_150871, partial [Guillardia theta
CCMP2712]
Length = 309
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 78/161 (48%), Gaps = 5/161 (3%)
Query: 66 ATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 125
D+ P A + F KL+ RPIP ++ E++ HL ++YD ++A G +
Sbjct: 72 GADIEPCGAAVE-RFYKLFARPIPFVFRAPTNEILYLSHLDLVNAMFRYDVIWAAGLYST 130
Query: 126 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 185
+D E+ R +FQA + LK D + + DA + +WA+G+T + +V + +GE
Sbjct: 131 FDLFFSAL-DEDLRANLFQALMGGLKLDQSKIKSDADAVLQWAQGKTEADVVS--AIKGE 187
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT 226
+ + +F Y+R F GL +++++ EP
Sbjct: 188 DSSPVGQVLASLGKNEDFLYTRNFGAGLIKIMQVVG-VEPN 227
>gi|416383906|ref|ZP_11684537.1| hypothetical protein CWATWH0003_1368 [Crocosphaera watsonii WH
0003]
gi|357265142|gb|EHJ13943.1| hypothetical protein CWATWH0003_1368 [Crocosphaera watsonii WH
0003]
Length = 209
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 62/120 (51%), Gaps = 9/120 (7%)
Query: 104 HLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQK 163
HL+ + YDP++ALG VT + R M+GY E D+ +IF A A+ E+Y +A+
Sbjct: 2 HLLSVNIDFSYDPIYALGVVTSFQRFMQGYSPESDKPSIFNALCQAVDGSSEKYHQEAEA 61
Query: 164 LEEWARGQTASSLVEFPSKEGEV------EGLLKDIAERASGKGNFSYSRFFAVGLFRLL 217
+ A+G S+V+F K V EG+L + F YSR A+GL+ LL
Sbjct: 62 ILNEAKGL---SIVDFKDKLTHVTDNQVGEGVLWGTFGAIAANPKFKYSRLLAIGLYTLL 118
>gi|87302741|ref|ZP_01085552.1| hypothetical protein WH5701_13350 [Synechococcus sp. WH 5701]
gi|87282624|gb|EAQ74582.1| hypothetical protein WH5701_13350 [Synechococcus sp. WH 5701]
Length = 257
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/177 (27%), Positives = 86/177 (48%), Gaps = 13/177 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TVA++K F + I +Y ++ EL+V+ HL+ + + D +FA+G V+D +
Sbjct: 8 TVADSKRAFHAAFPYVIGPLYRRMVDELLVELHLLSRQSGFHSDGLFAVGLTQVFDGFAK 67
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-----V 186
GY ++ E +F A + D +Q R + + + ++ ++ G+ +
Sbjct: 68 GYRPQQQSEPLFAALCASSGFDAQQIRAQHAAAVKAVGEHSLDEVKQWLAQRGQGAPEPI 127
Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLEL---ANATEPTVLEKVRSTFFDNIG 240
G+L I +RA +F YSR FAVGL LL+ A A EP L + +++G
Sbjct: 128 AGVLAGI-DRA----DFHYSRLFAVGLLSLLQHARGAEAVEPQALRQAAHEIGESMG 179
>gi|375332109|gb|AFA52594.1| hypothetical protein [Vaucheria litorea]
Length = 249
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 84/172 (48%), Gaps = 18/172 (10%)
Query: 63 MSTATDVP----------------PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLM 106
MST DVP TV+ET +F Y++PI Y T++ +++ HL
Sbjct: 1 MSTVEDVPDTAMYKRFGHLKGKDVKTVSETIKSFCIQYQKPILPQYRTMINDVLQSTHLN 60
Query: 107 RYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEE 166
+ YD +F GF +++ +LM+ YP + + I+ A +T+L +PE+ + D + + +
Sbjct: 61 VVNGCFIYDAMFGYGFYSLFYKLMKAYPGTGEADLIYAAMVTSLDMEPEKLKEDHETISK 120
Query: 167 WARGQTASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLE 218
T + L S +GE + LL +I+ + Y++ + +GL ++
Sbjct: 121 LIENMTRADLEN--SFKGENQNLLSEISSNIKADEFYLYTKTWGIGLIEAMD 170
>gi|116070497|ref|ZP_01467766.1| hypothetical protein BL107_12665 [Synechococcus sp. BL107]
gi|116065902|gb|EAU71659.1| hypothetical protein BL107_12665 [Synechococcus sp. BL107]
Length = 215
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 40/160 (25%), Positives = 87/160 (54%), Gaps = 16/160 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + + I +Y + EL+V+ HL+ ++ +++ P+F++G TV++ +
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSSFKTTPLFSVGLCTVFETFSQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E+ +F A ++ + +R ++++ + A+ ++ ++G+
Sbjct: 66 GYRPEDHITGLFDALCSSNGYNATTFRKESKQCIDAAKSES-------------IDGMES 112
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELA--NATEPTVLE 229
+A++ G+G+ YSR A+G+FRL E A +A +P E
Sbjct: 113 HLAKQKLGEGSH-YSRLMAIGVFRLFEEAKGDAEQPDETE 151
>gi|223995057|ref|XP_002287212.1| hypothetical protein THAPSDRAFT_261275 [Thalassiosira pseudonana
CCMP1335]
gi|220976328|gb|EED94655.1| hypothetical protein THAPSDRAFT_261275 [Thalassiosira pseudonana
CCMP1335]
Length = 212
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/150 (31%), Positives = 74/150 (49%), Gaps = 2/150 (1%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV E F P+ ++Y + +L+ HL+ +Q D VF+LG V+ D +++
Sbjct: 2 TVGEAFTQFTDKLGTPVNALYKGMCTDLVGSLHLVMVNARFQRDAVFSLGLVSALDLVLK 61
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
YP E I A + ++ D +A LE WA+G+T + EG+ + L
Sbjct: 62 NYPEAETGARIKSAMLESVGLDEAVVNAEAAALEAWAQGKTKEDIASALKGEGDSQ--LA 119
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELAN 221
IA+ A G + YSRFF VGL R++E+
Sbjct: 120 AIAKAAKGDQWWMYSRFFGVGLVRIMEIVG 149
>gi|78184631|ref|YP_377066.1| Thf1-like protein [Synechococcus sp. CC9902]
gi|97202850|sp|Q3AY05.1|THF1_SYNS9 RecName: Full=Protein thf1
gi|78168925|gb|ABB26022.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 215
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/156 (26%), Positives = 81/156 (51%), Gaps = 16/156 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + + I +Y + EL+V+ HL+ ++ +++ P+FA+G TV+D
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSSFKTTPLFAVGLCTVFDTFSA 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY EE + A ++ D +R ++++ + A+ ++ V+ +
Sbjct: 66 GYRPEEHITGLLDALCSSNGYDANTFRKESKRCIDAAKTES-------------VDAMDS 112
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELA--NATEP 225
+A + G+G+ YSR A+G+ RL E A +A +P
Sbjct: 113 HLAGQKLGEGSH-YSRLMAIGVLRLFEEAKGDADQP 147
>gi|78212971|ref|YP_381750.1| Thf1-like protein [Synechococcus sp. CC9605]
gi|97202855|sp|Q3AJN7.1|THF1_SYNSC RecName: Full=Protein thf1
gi|78197430|gb|ABB35195.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 215
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 74/147 (50%), Gaps = 14/147 (9%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + + I +Y + EL+V+ HL+ ++ ++ + +F++G TV+D ++
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSRFEANELFSVGLCTVFDTFIK 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E +A+F+A ++ D + R L E A+G+ SL ++ S EG
Sbjct: 66 GYRPEAQTDALFRALCSSNGFDAAKLRKTYASLVEQAKGKDPESLKDWLSSHALKEG--- 122
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLE 218
YSR AVGL LL+
Sbjct: 123 -----------SHYSRLMAVGLMSLLK 138
>gi|260436777|ref|ZP_05790747.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. WH 8109]
gi|260414651|gb|EEX07947.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. WH 8109]
Length = 215
Score = 67.8 bits (164), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 73/151 (48%), Gaps = 14/151 (9%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + + I +Y + EL+V+ HL+ ++ ++ + +F++G TV+D +
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSRFEANGLFSVGLCTVFDTFTK 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E +A+F A ++ D + R L + A+G+ +L + S EG
Sbjct: 66 GYRPEAQTDALFSALCSSNGFDAAKLRKTNASLVDQAKGKDHETLKSWLSSHSLKEG--- 122
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELANA 222
YSR AVGL LL+ A A
Sbjct: 123 -----------SHYSRLMAVGLMSLLKAATA 142
>gi|161347491|ref|YP_001224936.2| Thf1-like protein [Synechococcus sp. WH 7803]
Length = 226
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 84/178 (47%), Gaps = 12/178 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + IPS+Y EL+V+ HL+ ++ ++ + +FA+G V+ +
Sbjct: 6 TIADSKRAFHAAFPYVIPSLYRRTADELLVELHLLSHQTQFKTNALFAVGLRQVFTAFTK 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-----V 186
GY + +F A + + E+ + A+ E+ G + + + +G+ +
Sbjct: 66 GYRPADHLPQLFDALCSCNGFNAEELKSLAEGSEQAVSGHSVDEVQTWLQAKGDGAPGPL 125
Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA--NATEPTVLEKVRSTFFDNIGTS 242
L DIA F YSR AVGLF LL A ++ +P L K T + IG S
Sbjct: 126 ATGLADIAGE-----QFHYSRLMAVGLFSLLSSAQGDSQDPEELCKTAHTIGEQIGLS 178
>gi|33865836|ref|NP_897395.1| Thf1-like protein [Synechococcus sp. WH 8102]
gi|81574513|sp|Q7U6N6.1|THF1_SYNPX RecName: Full=Protein thf1
gi|33633006|emb|CAE07817.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 212
Score = 66.6 bits (161), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 71/146 (48%), Gaps = 14/146 (9%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + + I +Y + EL+V+ HL+ ++ T+Q + +FA+G TV++R +
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRIADELLVELHLLSHQATFQANSLFAVGLKTVFERFTQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E A+ A ++ D EQ + AQ + A G + + + ++ +G
Sbjct: 66 GYRPMEHPAALLSALCSSNGFDDEQLKQAAQHCLQDAEGHSDDAFQSWLKEQSLSDGA-- 123
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL 217
YSR AVGL LL
Sbjct: 124 ------------HYSRLMAVGLLALL 137
>gi|88808604|ref|ZP_01124114.1| hypothetical protein WH7805_02902 [Synechococcus sp. WH 7805]
gi|88787592|gb|EAR18749.1| hypothetical protein WH7805_02902 [Synechococcus sp. WH 7805]
Length = 234
Score = 66.6 bits (161), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 83/176 (47%), Gaps = 7/176 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + IPS+Y EL+V+ HL+ ++ ++ + +FA+G V+ +
Sbjct: 13 TIADSKRAFHAAFPYVIPSLYRRTADELLVELHLLSHQTQFKSNALFAVGLRQVFTAFTK 72
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE--VEGL 189
GY + +F A + + +Q A+ E+ G + + + +GE E L
Sbjct: 73 GYRPADHLTELFDALCSCNGFNAQQLNSVAEGSEKAVAGHSMEEVQAWLQSKGEGAPEPL 132
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATE---PTVLEKVRSTFFDNIGTS 242
+A+ A + F YSR AVGLF LL A E P L K + + IG S
Sbjct: 133 ATGLADIAGEQ--FHYSRLMAVGLFSLLSSAQGVESQDPEDLCKTAHSIGEQIGLS 186
>gi|72382131|ref|YP_291486.1| Thf1-like protein [Prochlorococcus marinus str. NATL2A]
gi|97202784|sp|Q46L45.1|THF1_PROMT RecName: Full=Protein thf1
gi|72001981|gb|AAZ57783.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL2A]
Length = 199
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 71/149 (47%), Gaps = 12/149 (8%)
Query: 69 VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 128
V T++++K +F K + IP+IY + EL+V+ HL+ +++ ++ D +F+ G V+ +
Sbjct: 3 VRATISDSKSDFHKEFPYVIPAIYRKLADELLVELHLLSHQKNFKKDSIFSTGLKEVFSK 62
Query: 129 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
GY E +F A +P + +++L A+ T L F SK
Sbjct: 63 FTSGYKPSEHATKLFDAICNCNGFNPTEINNSSEQLVSNAKSFTKEDLNSFLSKTNN--- 119
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLL 217
KG YSR A+G+++L+
Sbjct: 120 ---------DNKGYDYYSRINAIGIYKLV 139
>gi|87124410|ref|ZP_01080259.1| hypothetical protein RS9917_12390 [Synechococcus sp. RS9917]
gi|86167982|gb|EAQ69240.1| hypothetical protein RS9917_12390 [Synechococcus sp. RS9917]
Length = 224
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 79/172 (45%), Gaps = 4/172 (2%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + IP +Y EL+V+ HL+ +++ +Q D +FA+G V+
Sbjct: 6 TIADSKRAFHTAFPFVIPPLYRRTADELLVELHLLSHQQQFQVDALFAVGLRQVFRAFTR 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEGLL 190
GY + ++F+A ++ + A + E RG + + + G+ L
Sbjct: 66 GYKPGQHLASLFEALCSSTGFHAGELESLADQSEAAVRGHSIEEVRHWLEHGGDGAPAPL 125
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELA--NATEPTVLEKVRSTFFDNIG 240
+ +RA G F YSR AVGL LL A + +P L K+ +G
Sbjct: 126 ASVLQRADSSG-FHYSRLMAVGLLSLLSEAQGDQADPEQLRKLAHELSGPLG 176
>gi|219123541|ref|XP_002182081.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406682|gb|EEC46621.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 311
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 73/147 (49%), Gaps = 2/147 (1%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV E +F + +Y ++ +++ HL+ +Q D +++LG +T D L++
Sbjct: 67 TVGEAFADFSSELGVTVNPLYKNMVTDIVGTTHLVIVNARFQRDAIWSLGILTALDLLLK 126
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
YP E I A ++ D ++ R +A+ + +WA G++ + + + EG + +
Sbjct: 127 NYPEPEVGAKIVSALFKSVGLDEDEIRNEARTISDWAVGKSKADIETALTGEG--DSPVA 184
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLE 218
IA + YSR+F +GL +++E
Sbjct: 185 AIANSIKPNDYWMYSRYFGIGLIKIME 211
>gi|124025670|ref|YP_001014786.1| Thf1-like protein [Prochlorococcus marinus str. NATL1A]
gi|166987530|sp|A2C211.1|THF1_PROM1 RecName: Full=Protein thf1
gi|123960738|gb|ABM75521.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL1A]
Length = 199
Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 71/149 (47%), Gaps = 12/149 (8%)
Query: 69 VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 128
V T++++K +F K + IP+IY + EL+V+ HL+ +++ ++ D +F+ G V+ +
Sbjct: 3 VRATISDSKSDFHKEFPYVIPAIYRKLADELLVELHLLSHQKNFKKDSIFSTGLKEVFCK 62
Query: 129 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
GY E +F A +P + +++L A+ T L F SK
Sbjct: 63 FTSGYKPSEHVTKLFDAICNCNGFNPTEINNSSEQLVSNAKSFTKEDLNSFLSKTNN--- 119
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLL 217
KG YSR A+G+++L+
Sbjct: 120 ---------DNKGYDYYSRINAIGIYKLV 139
>gi|427701945|ref|YP_007045167.1| photosystem II biogenesis protein Psp29 [Cyanobium gracile PCC
6307]
gi|427345113|gb|AFY27826.1| photosystem II biogenesis protein Psp29 [Cyanobium gracile PCC
6307]
Length = 231
Score = 63.2 bits (152), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 85/174 (48%), Gaps = 7/174 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TVA++K F + I +Y ++ EL+V+ HL+ ++ +Q D +FA+G + V+D
Sbjct: 6 TVADSKRAFHGAFPHVISPLYRRMVDELLVELHLLSRQKGFQIDALFAVGLIQVFDGFAR 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E + +FQA + D R Q+ + + + ++ +G G
Sbjct: 66 GYRPEAQKGPLFQALCASSGFDGPDLRRQCQEALAAMGRHSQAEVRQWIESQG--AGAPA 123
Query: 192 DIAERASG--KGNFSYSRFFAVGLFRLLELA---NATEPTVLEKVRSTFFDNIG 240
+A +G + +F YSR AVGL LLE A +A EP L ++ +++G
Sbjct: 124 PVATALAGIRRPDFHYSRLMAVGLLALLEQALADDAMEPQALRQLAHEIGESMG 177
>gi|147848088|emb|CAK23639.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
Length = 206
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 76/162 (46%), Gaps = 12/162 (7%)
Query: 88 IPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYI 147
IPS+Y EL+V+ HL+ ++ ++ + +FA+G V+ +GY + +F A
Sbjct: 2 IPSLYRRTADELLVELHLLSHQTQFKTNALFAVGLRQVFTAFTKGYRPADHLPQLFDALC 61
Query: 148 TALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-----VEGLLKDIAERASGKGN 202
+ + E+ + A+ E+ G + + + +G+ + L DIA
Sbjct: 62 SCNGFNAEELKSLAEGSEQAVSGHSVDEVQTWLQAKGDGAPGPLATGLADIAGE-----Q 116
Query: 203 FSYSRFFAVGLFRLLELA--NATEPTVLEKVRSTFFDNIGTS 242
F YSR AVGLF LL A ++ +P L K T + IG S
Sbjct: 117 FHYSRLMAVGLFSLLSSAQGDSQDPEELCKTAHTIGEQIGLS 158
>gi|159903384|ref|YP_001550728.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9211]
gi|254784145|sp|A9BAB2.1|THF1_PROM4 RecName: Full=Protein thf1
gi|159888560|gb|ABX08774.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9211]
Length = 221
Score = 60.1 bits (144), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 87/172 (50%), Gaps = 17/172 (9%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+E+K F K + +P++Y ++ ELIV+ +L++ + + D VFA+G +++ +
Sbjct: 6 TVSESKAIFHKEFPFVVPAVYRRLVDELIVELNLLKNQERFVADGVFAIGLTSIFLDFTK 65
Query: 132 GYPSEEDREAIFQAYITAL---KEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
GY E + + +A + EQ ++A+KL SL+ +++ E E
Sbjct: 66 GYKPENQKGILLEAICKCTGFSASNLEQIALEAKKLANGLNTNEIKSLITDNNRD-EKES 124
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKVRS 233
K I + N YSR A+G+++L+++ + ATE + L+ V +
Sbjct: 125 TYKLINK------NNHYSRIIAIGIYKLVDMQSNGFNKEEATENSYLDLVNN 170
>gi|323450067|gb|EGB05951.1| hypothetical protein AURANDRAFT_66018 [Aureococcus anophagefferens]
Length = 1032
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 52/108 (48%), Gaps = 4/108 (3%)
Query: 112 YQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQ 171
+ YD +F GFVT+ D +M YP D E I A I AL DP R D + + EW G+
Sbjct: 69 FVYDELFGFGFVTLMDMIMSPYPVAGDGEKITDALIAALDMDPATLRGDHKAVTEWLAGK 128
Query: 172 T-ASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLE 218
T A L S +G + A G+ F ++R VGL +++
Sbjct: 129 TEADVLAAVASNDGS---KVASAAATIKGQEEFHHTRPSNVGLVAVMD 173
>gi|194476659|ref|YP_002048838.1| hypothetical protein PCC_0178 [Paulinella chromatophora]
gi|171191666|gb|ACB42628.1| hypothetical protein PCC_0178 [Paulinella chromatophora]
Length = 213
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 71/168 (42%), Gaps = 29/168 (17%)
Query: 71 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
PTVA+TK F K + I + TVL EL+V+ L+ + DP+FA+G + + L
Sbjct: 8 PTVADTKRAFYKGFPYVIAPSHRTVLNELLVELFLLSPQTDIGSDPLFAVGLIQFFGVLT 67
Query: 131 EGYPSEEDREAIFQAYITALKEDP--------------EQYRIDAQKLEEWARGQTASSL 176
+ Y + R +F+A ++ D QY I ++L W+ +S
Sbjct: 68 KHYQPQNHRMLLFEALCNSIGFDSFNLRQIRKESLSELSQYNI--EELHSWSLTGADNSE 125
Query: 177 VEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATE 224
+ F + K F YSR A+GL L++ A E
Sbjct: 126 ILFTKTFIPI-------------KRRFHYSRLMAIGLLCLIKRARGVE 160
>gi|33240369|ref|NP_875311.1| Thf1-like protein [Prochlorococcus marinus subsp. marinus str.
CCMP1375]
gi|81664534|sp|Q7VC23.1|THF1_PROMA RecName: Full=Protein thf1
gi|33237896|gb|AAP99963.1| Uncharacterized protein [Prochlorococcus marinus subsp. marinus
str. CCMP1375]
Length = 214
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/151 (21%), Positives = 73/151 (48%), Gaps = 4/151 (2%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T++++K F K + IP +Y VL E +V+ +L+ + ++ D +F+ G + ++R
Sbjct: 6 TISDSKGLFHKEFPYVIPPVYRKVLDEYLVELNLLSNQSNFKIDTIFSYGLIISFERFTV 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY + I ++ + D + + + +++ + ++ + E++ +
Sbjct: 66 GYEPDSHISKILESLCNSCNIDIKAIKEYSNNIKKLINEKGIKEIINILT--AEIKKSVG 123
Query: 192 DIA-ERASGKGNFSYSRFFAVGLFRLLELAN 221
IA SGK + YSR A+G++ L+ N
Sbjct: 124 GIALSNQSGKDKY-YSRLHAIGIYELISNIN 153
>gi|123200442|gb|ABM72050.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9515]
Length = 198
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/154 (24%), Positives = 72/154 (46%), Gaps = 19/154 (12%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+E+K F + + IP +Y ++ E++V+ +L+ ++ + D +F +G + L +
Sbjct: 2 TVSESKKLFHEQFPFVIPGLYKRIVDEMLVELNLLNHQNEFIQDELFCVGLTETFKELTK 61
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYR-IDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
GY E E +F++ + P + + I + LE++ E+ LL
Sbjct: 62 GYKPESHLELLFESLCKSSNFIPSKIKEISLKTLEQYKDKSLK-----------EISILL 110
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATE 224
K+ N SR +G++ L +ANAT+
Sbjct: 111 KE-----KSTSNLYSSRILNIGIY--LIIANATD 137
>gi|161407964|ref|YP_001011157.2| Thf1-like protein [Prochlorococcus marinus str. MIT 9515]
Length = 217
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/154 (24%), Positives = 72/154 (46%), Gaps = 19/154 (12%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+E+K F + + IP +Y ++ E++V+ +L+ ++ + D +F +G + L +
Sbjct: 21 TVSESKKLFHEQFPFVIPGLYKRIVDEMLVELNLLNHQNEFIQDELFCVGLTETFKELTK 80
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYR-IDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
GY E E +F++ + P + + I + LE++ E+ LL
Sbjct: 81 GYKPESHLELLFESLCKSSNFIPSKIKEISLKTLEQYKDKSLK-----------EISILL 129
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATE 224
K+ N SR +G++ L +ANAT+
Sbjct: 130 KE-----KSTSNLYSSRILNIGIY--LIIANATD 156
>gi|157413170|ref|YP_001484036.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9215]
gi|157387745|gb|ABV50450.1| Conserved hypothetical protein [Prochlorococcus marinus str. MIT
9215]
Length = 217
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/108 (21%), Positives = 61/108 (56%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F + + IP +Y + E++V+ +L+ ++ + D +F +G + LM+
Sbjct: 21 TVSDSKKLFHEKFPYVIPGLYKRIADEMLVELNLLNHQNEFTQDFLFCVGLTETFKELMK 80
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
GY E+ + +F++ ++ + ++ +QK ++ + +T++ +V+
Sbjct: 81 GYQPEKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKDKTSTDIVKL 128
>gi|254526529|ref|ZP_05138581.1| photosystem II biogenesis protein Psp29 [Prochlorococcus marinus
str. MIT 9202]
gi|221537953|gb|EEE40406.1| photosystem II biogenesis protein Psp29 [Prochlorococcus marinus
str. MIT 9202]
Length = 202
Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/175 (20%), Positives = 86/175 (49%), Gaps = 20/175 (11%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F + + IP +Y + E++V+ +L+ ++ + D +F +G + LM+
Sbjct: 6 TVSDSKKLFHEKFPYVIPGLYKRIADEMLVELNLLNHQNEFTQDFLFCVGLTETFKELMK 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E+ + +F++ ++ + ++ +QK ++ + +T++ +V+
Sbjct: 66 GYQPEKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKDKTSTDIVKL------------ 113
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL----ELANATEPTVLEKVRSTFFDNIGTS 242
+ E+++ K SR +G++ L+ +L E + K+ S F+ + S
Sbjct: 114 -LIEKSNSK--LYPSRILNLGIYILISNAQDLKKKNESDI-NKMISDIFEQLNLS 164
>gi|126543182|gb|ABO17424.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9301]
Length = 198
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/108 (18%), Positives = 61/108 (56%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F + + IP +Y ++ E++V+ +L+ ++ + + +F +G + LM+
Sbjct: 2 TVSDSKRLFHEKFPYVIPGLYKRIVDEILVELNLLNHQNEFTQEYLFCIGLTETFKELMK 61
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
GY + + +F++ ++ + ++ +QK ++ + +T++ +++
Sbjct: 62 GYQPNKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKNKTSNDILKL 109
>gi|161407965|ref|YP_001091025.2| Thf1-like protein [Prochlorococcus marinus str. MIT 9301]
Length = 217
Score = 43.9 bits (102), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 20/108 (18%), Positives = 61/108 (56%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F + + IP +Y ++ E++V+ +L+ ++ + + +F +G + LM+
Sbjct: 21 TVSDSKRLFHEKFPYVIPGLYKRIVDEILVELNLLNHQNEFTQEYLFCIGLTETFKELMK 80
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
GY + + +F++ ++ + ++ +QK ++ + +T++ +++
Sbjct: 81 GYQPNKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKNKTSNDILKL 128
>gi|195028406|ref|XP_001987067.1| GH21711 [Drosophila grimshawi]
gi|193903067|gb|EDW01934.1| GH21711 [Drosophila grimshawi]
Length = 1053
Score = 40.4 bits (93), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 53/118 (44%), Gaps = 22/118 (18%)
Query: 40 TSLFCHCVRFRASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQEL 99
+S+ C R S+ S + I+ P +AE N +YK +
Sbjct: 85 SSIHFQCRRLDKSAQFSTLAIK-------QNPVLAEAYSNLGNVYK-----------ERG 126
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQY 157
++Q+ L Y+R + P F G++ + L+ + D EA QAYITAL+ +P+ Y
Sbjct: 127 LLQEALDNYRRAVRLKPDFIDGYINLAAALV----AARDMEAAVQAYITALQYNPDLY 180
>gi|359416122|ref|ZP_09208488.1| phosphoglyceromutase [Candidatus Haloredivivus sp. G17]
gi|358033523|gb|EHK02062.1| phosphoglyceromutase [Candidatus Haloredivivus sp. G17]
Length = 477
Score = 40.4 bits (93), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 42/96 (43%), Gaps = 9/96 (9%)
Query: 51 ASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKR 110
AS + I C + DV P AE ++ +K +++ P TV+ ++R
Sbjct: 133 ASDYDVDVEIHCFADGRDVDPKSAEEYISQIKEWQKDYPGKIATVMGRFYSMDRDHNWER 192
Query: 111 TYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAY 146
T+Q ALG EG+ E REA+ +AY
Sbjct: 193 THQAYDAMALG---------EGFEFENPREAVKKAY 219
>gi|194767414|ref|XP_001965811.1| GF13981 [Drosophila ananassae]
gi|190625935|gb|EDV41459.1| GF13981 [Drosophila ananassae]
Length = 396
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 52/118 (44%), Gaps = 22/118 (18%)
Query: 40 TSLFCHCVRFRASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQEL 99
+S+ C R S+ S + I+ P +AE N +YK +
Sbjct: 92 SSIHFQCRRLDKSAQFSTLAIK-------QNPVLAEAYSNLGNVYK-----------ERG 133
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQY 157
+Q+ L Y+R + P F G++ + L+ + D E+ QAYITAL+ +PE Y
Sbjct: 134 QLQEALDNYRRAVRLKPDFIDGYINLAAALV----AARDMESAVQAYITALQYNPELY 187
>gi|403176155|ref|XP_003334869.2| hypothetical protein PGTG_16037 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375172127|gb|EFP90450.2| hypothetical protein PGTG_16037 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1808
Score = 38.5 bits (88), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 43/94 (45%), Gaps = 14/94 (14%)
Query: 159 IDAQKLEEWARGQTASSLVEFPSKEGEVE------GLLKDIAERASGK------GNFSYS 206
+ A KLEE GQ S L EF + ++ GLL +++ GK G SY
Sbjct: 978 LTALKLEEL--GQCISVLAEFARQTDDINIALTAGGLLWQVSDHVQGKNKASKAGGESYV 1035
Query: 207 RFFAVGLFRLLELANATEPTVLEKVRSTFFDNIG 240
R + L +LLEL +A+ V + T F IG
Sbjct: 1036 RLWMYLLSKLLELVHASRQEVRDGAIQTLFRTIG 1069
>gi|195382543|ref|XP_002049989.1| GJ20442 [Drosophila virilis]
gi|194144786|gb|EDW61182.1| GJ20442 [Drosophila virilis]
Length = 1050
Score = 38.5 bits (88), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 52/118 (44%), Gaps = 22/118 (18%)
Query: 40 TSLFCHCVRFRASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQEL 99
+S+ C R S+ S + I+ P +AE N +YK +
Sbjct: 82 SSIHFQCRRLDKSAQFSTLAIK-------QNPVLAEAYSNLGNVYK-----------ERG 123
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQY 157
+Q+ L Y+R + P F G++ + L+ + D E+ QAYITAL+ +PE Y
Sbjct: 124 QLQEALDNYRRAVRLKPDFIDGYINLAAALV----AARDMESAVQAYITALQYNPELY 177
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.133 0.380
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,529,808,947
Number of Sequences: 23463169
Number of extensions: 135440769
Number of successful extensions: 345293
Number of sequences better than 100.0: 190
Number of HSP's better than 100.0 without gapping: 180
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 344940
Number of HSP's gapped (non-prelim): 197
length of query: 252
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 113
effective length of database: 9,097,814,876
effective search space: 1028053080988
effective search space used: 1028053080988
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 75 (33.5 bits)