BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022333
(299 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|356543780|ref|XP_003540338.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Glycine max]
Length = 297
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 229/295 (77%), Positives = 262/295 (88%), Gaps = 1/295 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA+LTS++F+++ S +RKV +SSTR L S+ E F FRT H V RAS+S S+M++
Sbjct: 1 MAALTSLSFSAVTHCS-ERKVTLSSTRFLASSSEIFGFRTDFSYHYVGVRASNSPSKMVV 59
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
QCMS+ATDVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFAL
Sbjct: 60 QCMSSATDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFAL 119
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVT+YD+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q +SLVEF
Sbjct: 120 GFVTIYDKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQKPTSLVEFS 179
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
SKEGEVEG+LKDIAERA GKG FSYSRFFAVGLFRLLELANATEPT+L+KLC LN+NKR
Sbjct: 180 SKEGEVEGILKDIAERAGGKGEFSYSRFFAVGLFRLLELANATEPTILDKLCVALNINKR 239
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYL 295
SVDRDLDVYR LLSKL+QAKELLKEY+DREKKKR+ER EPQKANEAI CLG+ L
Sbjct: 240 SVDRDLDVYRILLSKLVQAKELLKEYIDREKKKRDERAEPQKANEAITTCLGQQL 294
>gi|356549970|ref|XP_003543363.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Glycine max]
Length = 297
Score = 476 bits (1224), Expect = e-132, Method: Compositional matrix adjust.
Identities = 227/297 (76%), Positives = 264/297 (88%), Gaps = 1/297 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA+LTS++F+++ S +RKV +SSTR L S+ E F FRT H V RAS+S+S+M++
Sbjct: 1 MAALTSLSFSAVTHCS-ERKVTLSSTRFLASSSELFGFRTDFSYHYVGVRASNSASKMVV 59
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
QCMS+ATDVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFAL
Sbjct: 60 QCMSSATDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFAL 119
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVT+YD+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q+ +SLVEF
Sbjct: 120 GFVTIYDKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQSPTSLVEFS 179
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
SKEGE E +LKDIAERA GKG FSYSRFFAVGLFRL+ELANATEPT+L+KLCA LN+NKR
Sbjct: 180 SKEGEAERILKDIAERAGGKGEFSYSRFFAVGLFRLVELANATEPTILDKLCAALNINKR 239
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 297
SVDRDLDVYR LLSKL+QAKELLKEY+DREKKKR+ER EPQKANEAI CLG+ L+
Sbjct: 240 SVDRDLDVYRILLSKLVQAKELLKEYIDREKKKRDERVEPQKANEAITTCLGQQLHG 296
>gi|255636566|gb|ACU18621.1| unknown [Glycine max]
Length = 297
Score = 475 bits (1223), Expect = e-131, Method: Compositional matrix adjust.
Identities = 227/297 (76%), Positives = 264/297 (88%), Gaps = 1/297 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA+LTS++F+++ S +RKV +SSTR L S+ E F FRT H V RAS+S+S+M++
Sbjct: 1 MAALTSLSFSAVIHCS-ERKVTLSSTRFLASSSELFGFRTDFSYHYVGVRASNSASKMVV 59
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
QCMS+ATDVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFAL
Sbjct: 60 QCMSSATDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFAL 119
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVT+YD+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q+ +SLVEF
Sbjct: 120 GFVTIYDKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQSPTSLVEFS 179
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
SKEGE E +LKDIAERA GKG FSYSRFFAVGLFRL+ELANATEPT+L+KLCA LN+NKR
Sbjct: 180 SKEGEAERILKDIAERAGGKGEFSYSRFFAVGLFRLVELANATEPTILDKLCAALNINKR 239
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 297
SVDRDLDVYR LLSKL+QAKELLKEY+DREKKKR+ER EPQKANEAI CLG+ L+
Sbjct: 240 SVDRDLDVYRILLSKLVQAKELLKEYIDREKKKRDERVEPQKANEAITTCLGQQLHG 296
>gi|255553917|ref|XP_002517999.1| Protein THYLAKOID FORMATION1, chloroplast precursor, putative
[Ricinus communis]
gi|223542981|gb|EEF44517.1| Protein THYLAKOID FORMATION1, chloroplast precursor, putative
[Ricinus communis]
Length = 299
Score = 473 bits (1218), Expect = e-131, Method: Compositional matrix adjust.
Identities = 229/298 (76%), Positives = 261/298 (87%), Gaps = 3/298 (1%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLF-CHCVRFRAS-SSSSRM 58
MA++TSV+F++I Q+ +RK S +RS SNF+ FRF F CH V +AS SSSSRM
Sbjct: 1 MAAVTSVSFSAIAQSFTERKSVASWSRSFSSNFDSFRFPVGGFSCHYVGVKASNSSSSRM 60
Query: 59 IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVF 118
+I CMSTATDVPPTV+ETK NFL YK+PIPSIYNTVLQELIVQQHLMRYKR+Y+YDPVF
Sbjct: 61 LIHCMSTATDVPPTVSETKFNFLNSYKKPIPSIYNTVLQELIVQQHLMRYKRSYRYDPVF 120
Query: 119 ALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE 178
ALGFVTVYD+LM+GYPS+EDREAIFQAYI AL E+PEQYRIDA+KLE+WAR QT SSLV+
Sbjct: 121 ALGFVTVYDQLMQGYPSDEDREAIFQAYINALNEEPEQYRIDAKKLEDWARSQTPSSLVD 180
Query: 179 FPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVN 238
F SKEGEVEG+LKDIAERA G G+FSYSRFFA+GLFRLLEL+N+TEPTVLEKLCA LN+N
Sbjct: 181 FSSKEGEVEGILKDIAERA-GNGSFSYSRFFAIGLFRLLELSNSTEPTVLEKLCAALNIN 239
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLY 296
KR VDRDLDVYRNLLSKL+QAKELLKEYVDREKKK+EER QKANEA+K CLGE L+
Sbjct: 240 KRGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKQEERASSQKANEAVKSCLGEALH 297
>gi|449438054|ref|XP_004136805.1| PREDICTED: protein THYLAKOID FORMATION 1, chloroplastic-like
[Cucumis sativus]
gi|449493105|ref|XP_004159194.1| PREDICTED: protein THYLAKOID FORMATION 1, chloroplastic-like
[Cucumis sativus]
Length = 298
Score = 472 bits (1214), Expect = e-130, Method: Compositional matrix adjust.
Identities = 227/294 (77%), Positives = 257/294 (87%), Gaps = 2/294 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA++ S++F+++ Q S R++ + S+RS SNF GF FRTS+F H R RAS+ SSRM+I
Sbjct: 1 MAAVNSISFSTLNQCS-DRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVI 59
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
CMS TDV TVAETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTY+YDPVFAL
Sbjct: 60 HCMSAGTDVT-TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFAL 118
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LMEGYPS+EDREAIFQAYI AL EDPEQYRIDA+K EEWAR QTA+SLVEF
Sbjct: 119 GFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFA 178
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
S+EGEVE +LKDIAERA KGNFSYSRFFA+GLFRLLELANATEP++LEKLCA LN++K+
Sbjct: 179 SREGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKK 238
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEY 294
VDRDLDVYRNLLSKL+QAKELLKEYVDREKKKR+ER Q ANEAI KCLGEY
Sbjct: 239 GVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEY 292
>gi|359485791|ref|XP_002275686.2| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Vitis
vinifera]
Length = 299
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 231/297 (77%), Positives = 265/297 (89%), Gaps = 2/297 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA++TS++F+++GQ+S +RKV V +TRS S FE FRFR + + VR +SSSSSRM++
Sbjct: 1 MAAVTSLSFSALGQSS-ERKVPVPTTRSFASAFEAFRFRANFYAVGVRSSSSSSSSRMVV 59
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
QCMS+ TDVP TV+ETKMNFLK YKRPIPSIYNT+LQEL+VQQHLMRYKRTY+YD VFAL
Sbjct: 60 QCMSSVTDVP-TVSETKMNFLKNYKRPIPSIYNTLLQELMVQQHLMRYKRTYRYDAVFAL 118
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LM+GYPS+EDR+ IFQ YI AL+EDPEQYR DAQ LEEWAR QTASSLVEF
Sbjct: 119 GFVTVYDQLMDGYPSDEDRDIIFQVYIKALREDPEQYRKDAQMLEEWARSQTASSLVEFS 178
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
SKEGEVEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANATEPT+LEKLCA N++KR
Sbjct: 179 SKEGEVEGILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPTILEKLCAAFNISKR 238
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 297
SVDRDLDVYRNLL+KL+QAKELLKEYVDREKKKREER E QKANEAI KCLGEY Y+
Sbjct: 239 SVDRDLDVYRNLLTKLVQAKELLKEYVDREKKKREERVESQKANEAITKCLGEYEYT 295
>gi|224124656|ref|XP_002319386.1| predicted protein [Populus trichocarpa]
gi|222857762|gb|EEE95309.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 222/293 (75%), Positives = 254/293 (86%), Gaps = 1/293 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
M ++TS++F+++ Q+S +R VSS S +FEGFR R+ CH V +AS+S+SR I
Sbjct: 1 MTAITSLSFSAVNQSSSERNTFVSSLASRNLSFEGFRLRSVFSCHYVGVKASNSTSRTAI 60
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
CMST+TDVPPTV+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY YDPVF L
Sbjct: 61 HCMSTSTDVPPTVSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYLYDPVFGL 120
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
G VTVYD+LMEGYPS+EDREAIFQAYI ALKEDPEQYRIDA+KLEEWAR QT SSLV+F
Sbjct: 121 GLVTVYDQLMEGYPSDEDREAIFQAYIKALKEDPEQYRIDAKKLEEWARAQTHSSLVDFS 180
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
SKEGE+EG+LK IAERA+ GNFSYSRFFAVGLFRLLEL+NA+EPTVLEKLC+ LN+NKR
Sbjct: 181 SKEGEIEGILKGIAERAAS-GNFSYSRFFAVGLFRLLELSNASEPTVLEKLCSALNINKR 239
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGE 293
SVDRDLDVYR LLSKL+QAKELLKEYVDREKKK+EER E QKANE + KCLG+
Sbjct: 240 SVDRDLDVYRGLLSKLVQAKELLKEYVDREKKKQEERAESQKANEMVAKCLGD 292
>gi|224146717|ref|XP_002326111.1| predicted protein [Populus trichocarpa]
gi|222862986|gb|EEF00493.1| predicted protein [Populus trichocarpa]
Length = 296
Score = 453 bits (1166), Expect = e-125, Method: Compositional matrix adjust.
Identities = 222/293 (75%), Positives = 259/293 (88%), Gaps = 3/293 (1%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA++T V+F++I Q+S R+ + R+L FEGFRFR+S CH V RAS+S+SRM+I
Sbjct: 1 MAAVTPVSFSAISQSSSDRRAFCTVARNL--GFEGFRFRSSFSCHYVGVRASNSTSRMVI 58
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
CMST+TDVPPTVA+TK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YK+T++YDPVF L
Sbjct: 59 HCMSTSTDVPPTVADTKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKKTFRYDPVFGL 118
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LMEGYPS+EDREAIFQAYI AL+EDPEQYRIDA+KLEEWAR QT SSLV+F
Sbjct: 119 GFVTVYDQLMEGYPSDEDREAIFQAYIKALEEDPEQYRIDAKKLEEWARAQTPSSLVDFS 178
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
S+EGE+EG LKDIAER + GNFSYSRFFAVGLFRLLEL+NA+EPTVLEKLC+ LN+NKR
Sbjct: 179 SREGEIEGTLKDIAERVAS-GNFSYSRFFAVGLFRLLELSNASEPTVLEKLCSALNINKR 237
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGE 293
SVDRDLDVYR LLSKL+QA+ELLKEYVDREKKK+EER E QKA+E + KCLGE
Sbjct: 238 SVDRDLDVYRGLLSKLVQARELLKEYVDREKKKQEERAESQKASETVTKCLGE 290
>gi|388496070|gb|AFK36101.1| unknown [Lotus japonicus]
Length = 298
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 217/299 (72%), Positives = 258/299 (86%), Gaps = 3/299 (1%)
Query: 1 MASLTS-VAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMI 59
MA++TS +F+++ Q+S QRK +SST L SN +G RFR S C V +AS+S+S+++
Sbjct: 1 MATVTSSFSFSALTQSS-QRKHTLSSTCVLGSNSDGIRFRKSFLCQQVGVQASNSASKLV 59
Query: 60 IQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFA 119
++C S+ +D PP V+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMR+KR+Y+YDPVFA
Sbjct: 60 VRCSSSVSD-PPPVSETKLNFLKEYKRPIPSIYNTVLQELIVQQHLMRFKRSYRYDPVFA 118
Query: 120 LGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
LGFVTVY++LMEGYPS+EDR+AIFQ YI ALKEDP QYR DAQKLEEWAR Q+++SL+EF
Sbjct: 119 LGFVTVYEQLMEGYPSDEDRDAIFQTYIKALKEDPGQYREDAQKLEEWARTQSSTSLIEF 178
Query: 180 PSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNK 239
S+EGEVEG LKDIAERA GKG+FSYSRFFA+GLFRLLEL NA EP +LEKLCA LNV+K
Sbjct: 179 SSREGEVEGALKDIAERAGGKGDFSYSRFFAIGLFRLLELGNAMEPAILEKLCAALNVDK 238
Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYSH 298
RSVDRDLDVYRNLLSKL+QAKELLKEY DREKKK+EER EPQKANEAI KCLG+ +S
Sbjct: 239 RSVDRDLDVYRNLLSKLVQAKELLKEYADREKKKQEERAEPQKANEAITKCLGQEQFSE 297
>gi|356542877|ref|XP_003539891.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Glycine max]
Length = 291
Score = 443 bits (1140), Expect = e-122, Method: Compositional matrix adjust.
Identities = 212/284 (74%), Positives = 248/284 (87%), Gaps = 6/284 (2%)
Query: 4 LTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMIIQCM 63
++S +F+++ Q+S QR + +L SN FRFR C V RAS+S+S+M+++C
Sbjct: 5 ISSFSFSTLTQSSSQRNLT-----TLSSNSPTFRFRVGFSCLNVGVRASNSASKMVVRCS 59
Query: 64 STATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFV 123
S+ + PPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKR+Y+YD VFALGFV
Sbjct: 60 SSVAE-PPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRSYRYDAVFALGFV 118
Query: 124 TVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE 183
TVY++LMEGYPS+EDR+AIFQAYI ALKEDPEQYR+DA+KLEEWAR Q +SL+EF S+E
Sbjct: 119 TVYEQLMEGYPSDEDRDAIFQAYIQALKEDPEQYRVDAKKLEEWARSQNPNSLLEFSSRE 178
Query: 184 GEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVD 243
GEVEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANA EPT+LEKLCAVLNVNKRSVD
Sbjct: 179 GEVEGILKDIAERAGGKGDFSYSRFFAIGLFRLLELANAMEPTILEKLCAVLNVNKRSVD 238
Query: 244 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAI 287
RDLDVYRNLLSKL+QAKELLKEYVDREKKKREER EPQK+NEAI
Sbjct: 239 RDLDVYRNLLSKLVQAKELLKEYVDREKKKREERAEPQKSNEAI 282
>gi|388514959|gb|AFK45541.1| unknown [Medicago truncatula]
Length = 303
Score = 441 bits (1133), Expect = e-121, Method: Compositional matrix adjust.
Identities = 212/300 (70%), Positives = 256/300 (85%), Gaps = 3/300 (1%)
Query: 1 MASLTS-VAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRF-RASSSSSRM 58
MA++TS +F+++ Q+S ++ +SSTR S+ + RFR S+ V R S+ S++
Sbjct: 1 MAAVTSSFSFSTLTQSSQRKLTTLSSTRFFGSDSDAVRFRFSISLRYVGVVRTSNFVSKL 60
Query: 59 IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVF 118
+++C S+ +D PPTV+ETK+NFLK YKRPIPSIYN+VLQELIVQQHLMRYK++Y+YDPVF
Sbjct: 61 VVRCSSSVSD-PPTVSETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKKSYRYDPVF 119
Query: 119 ALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE 178
ALGFVTVYD+LMEGYPS+EDR+AIFQAYI ALKEDP QYR+DAQKLEEWAR Q A+SL+E
Sbjct: 120 ALGFVTVYDQLMEGYPSDEDRDAIFQAYINALKEDPAQYRVDAQKLEEWARAQNATSLIE 179
Query: 179 FPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVN 238
F S+EGEVEG LKDIAERA G G+FSYSRFFAVGLFRLLELAN EPT+LEKLC+ LN+N
Sbjct: 180 FSSREGEVEGTLKDIAERAGGNGDFSYSRFFAVGLFRLLELANTMEPTILEKLCSALNIN 239
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYSH 298
K+SVDRDLDVYRNLLSKL+QAKELLKEY+DREKKK EER EPQKANEAI KCLG+ +S+
Sbjct: 240 KKSVDRDLDVYRNLLSKLVQAKELLKEYIDREKKKIEERAEPQKANEAISKCLGQEQFSN 299
>gi|356517586|ref|XP_003527468.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Glycine max]
Length = 291
Score = 440 bits (1132), Expect = e-121, Method: Compositional matrix adjust.
Identities = 213/288 (73%), Positives = 251/288 (87%), Gaps = 7/288 (2%)
Query: 1 MASLTS-VAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMI 59
MA++TS +F+++ Q+S R ++ +L SN FR R CH V RAS+S+S+++
Sbjct: 1 MATVTSSFSFSTLTQSSSPRNLS-----TLSSNSPTFRIRVGFSCHYVGVRASNSASKIV 55
Query: 60 IQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFA 119
++C S+A PPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKR+Y+YD VFA
Sbjct: 56 VRC-SSAVAEPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRSYRYDAVFA 114
Query: 120 LGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
LGFVTVY++LMEGYPS+EDR+AIFQAYI ALKEDPEQYR+DA+KLEEWAR Q +SLV+F
Sbjct: 115 LGFVTVYEQLMEGYPSDEDRDAIFQAYIQALKEDPEQYRVDAKKLEEWARAQNPTSLVDF 174
Query: 180 PSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNK 239
S+EGEVEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANA EPT+LEKLCAVLNV+K
Sbjct: 175 SSREGEVEGILKDIAERAGGKGDFSYSRFFAIGLFRLLELANAMEPTILEKLCAVLNVDK 234
Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAI 287
RSVDRDLDVYRNLLSKL+QAKELLKEYVDREKKKREER EPQK+NEAI
Sbjct: 235 RSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKREERAEPQKSNEAI 282
>gi|388506988|gb|AFK41560.1| unknown [Medicago truncatula]
Length = 287
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 209/287 (72%), Positives = 245/287 (85%), Gaps = 2/287 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA+LTS++F++ Q S QRK +SSTR L S+ + F RT HCV R +S+S+M+I
Sbjct: 1 MAALTSLSFSATSQCS-QRKSTLSSTRFLASSSDMFGIRTDSSYHCVGVRVGNSASKMVI 59
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
QCMS+ TDVP +V+ETK+NFLK YKRPIPSIYN VLQELIVQ HLMRYK +YQYD VFAL
Sbjct: 60 QCMSSVTDVP-SVSETKLNFLKAYKRPIPSIYNNVLQELIVQHHLMRYKTSYQYDSVFAL 118
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LMEGY SEE+R+ IF+AYI ALKEDPEQYRIDA+KLE+WA+ Q + SLVEF
Sbjct: 119 GFVTVYDKLMEGYSSEEERDTIFKAYINALKEDPEQYRIDAKKLEDWAKAQNSISLVEFS 178
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
S+EGEVEG+LKDIA+RA KG FSYSRFFAVGLFRLLELANATEPT+L+KLCA LN++KR
Sbjct: 179 SREGEVEGVLKDIAKRAGEKGEFSYSRFFAVGLFRLLELANATEPTILDKLCAALNIDKR 238
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAI 287
SVDRDLDVYR LLSKL+QAKEL +E++DREKKKREER EPQKAN AI
Sbjct: 239 SVDRDLDVYRMLLSKLVQAKELQREFIDREKKKREERVEPQKANGAI 285
>gi|217073200|gb|ACJ84959.1| unknown [Medicago truncatula]
Length = 287
Score = 427 bits (1099), Expect = e-117, Method: Compositional matrix adjust.
Identities = 208/287 (72%), Positives = 244/287 (85%), Gaps = 2/287 (0%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA+LTS++F++ Q S QRK +SSTR L S+ + F RT HCV R +S+S+M+I
Sbjct: 1 MAALTSLSFSATSQCS-QRKSTLSSTRFLASSSDMFGIRTDSSYHCVGVRVGNSASKMVI 59
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
QCMS+ TDVP +V+ETK+NFLK YKRPIPSIYN VLQELIVQ HLMRYK +YQYD VFAL
Sbjct: 60 QCMSSVTDVP-SVSETKLNFLKAYKRPIPSIYNNVLQELIVQHHLMRYKTSYQYDSVFAL 118
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LMEGY SEE+R+ IF+AYI ALKEDPEQYRIDA+KLE+WA+ Q + SLVEF
Sbjct: 119 GFVTVYDKLMEGYSSEEERDTIFKAYINALKEDPEQYRIDAKKLEDWAKAQNSISLVEFS 178
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
S+E EVEG+LKDIA+RA KG FSYSRFFAVGLFRLLELANATEPT+L+KLCA LN++KR
Sbjct: 179 SREREVEGVLKDIAKRAGEKGEFSYSRFFAVGLFRLLELANATEPTILDKLCAALNIDKR 238
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAI 287
SVDRDLDVYR LLSKL+QAKEL +E++DREKKKREER EPQKAN AI
Sbjct: 239 SVDRDLDVYRMLLSKLVQAKELQREFIDREKKKREERVEPQKANGAI 285
>gi|296084957|emb|CBI28372.3| unnamed protein product [Vitis vinifera]
Length = 243
Score = 424 bits (1089), Expect = e-116, Method: Compositional matrix adjust.
Identities = 201/240 (83%), Positives = 222/240 (92%), Gaps = 1/240 (0%)
Query: 58 MIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPV 117
M++QCMS+ TDVP TV+ETKMNFLK YKRPIPSIYNT+LQEL+VQQHLMRYKRTY+YD V
Sbjct: 1 MVVQCMSSVTDVP-TVSETKMNFLKNYKRPIPSIYNTLLQELMVQQHLMRYKRTYRYDAV 59
Query: 118 FALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV 177
FALGFVTVYD+LM+GYPS+EDR+ IFQ YI AL+EDPEQYR DAQ LEEWAR QTASSLV
Sbjct: 60 FALGFVTVYDQLMDGYPSDEDRDIIFQVYIKALREDPEQYRKDAQMLEEWARSQTASSLV 119
Query: 178 EFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNV 237
EF SKEGEVEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANATEPT+LEKLCA N+
Sbjct: 120 EFSSKEGEVEGILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPTILEKLCAAFNI 179
Query: 238 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 297
+KRSVDRDLDVYRNLL+KL+QAKELLKEYVDREKKKREER E QKANEAI KCLGEY Y+
Sbjct: 180 SKRSVDRDLDVYRNLLTKLVQAKELLKEYVDREKKKREERVESQKANEAITKCLGEYEYT 239
>gi|75140959|sp|Q7XAB8.1|THF1_SOLTU RecName: Full=Protein THYLAKOID FORMATION1, chloroplastic; Flags:
Precursor
gi|33469614|gb|AAQ19850.1| light-regulated chloroplast-localized protein [Solanum tuberosum]
Length = 293
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 207/297 (69%), Positives = 248/297 (83%), Gaps = 6/297 (2%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA++TSV+F++I Q S +RK +VSS+RS+ + FRFR++ V R+S+S+SR ++
Sbjct: 1 MAAVTSVSFSAITQ-SAERKSSVSSSRSI----DTFRFRSNFSFDSVNVRSSNSTSRFVV 55
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
C S++ PTVA+TK+ FL YKRPIP++YNTVLQELIVQQHL RYK++YQYDPVFAL
Sbjct: 56 HCTSSSAADLPTVADTKLKFLTAYKRPIPTVYNTVLQELIVQQHLTRYKKSYQYDPVFAL 115
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LMEGYPSEEDR AIF+AYI ALKEDPEQYR DAQKLEEWAR Q A++LV+F
Sbjct: 116 GFVTVYDQLMEGYPSEEDRNAIFKAYIEALKEDPEQYRADAQKLEEWARTQNANTLVDFS 175
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
SKEGE+E + KDIA+RA K F YSR FAVGLFRLLELAN T+PT+LEKLCA LNVNK+
Sbjct: 176 SKEGEIENIFKDIAQRAGTKDGFCYSRLFAVGLFRLLELANVTDPTILEKLCAALNVNKK 235
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 297
SVDRDLDVYRNLLSKL+QAKELLKEYV+REKKKR ER E QKANE + KCLG+Y Y+
Sbjct: 236 SVDRDLDVYRNLLSKLVQAKELLKEYVEREKKKRGER-ETQKANETVTKCLGDYQYA 291
>gi|157142955|gb|ABV24460.1| chloroplast-localized protein [Nicotiana benthamiana]
Length = 295
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 206/302 (68%), Positives = 250/302 (82%), Gaps = 14/302 (4%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSR--- 57
MA++TSV+F++I Q S +RK +VS +RS+ + FRFR+++ C R+S+SS
Sbjct: 1 MAAVTSVSFSAIAQ-SAERKSSVSPSRSV----DTFRFRSNVSFDCFNVRSSNSSFSRST 55
Query: 58 --MIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYD 115
++ CMST PTVAETKMNFLK YKRPIP++YNTVLQELIVQQHL++YK++Y+YD
Sbjct: 56 SRFVVHCMSTDL---PTVAETKMNFLKAYKRPIPTVYNTVLQELIVQQHLIKYKKSYRYD 112
Query: 116 PVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASS 175
PVFALGFVTVYD+LMEGYPSEEDR+AIF+AYI AL EDP QYR DAQK EEWAR Q A++
Sbjct: 113 PVFALGFVTVYDQLMEGYPSEEDRDAIFKAYIEALNEDPVQYRADAQKFEEWARTQNANT 172
Query: 176 LVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVL 235
LV+F S++GEVE +LKDIA+RA K +F YSR FAVGLFRLLELAN T+PT+LEKLCA L
Sbjct: 173 LVDFSSRDGEVENILKDIAQRAGTKDSFCYSRLFAVGLFRLLELANVTDPTILEKLCASL 232
Query: 236 NVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYL 295
N+NK+SVDRDLDVYRNLLSKL+QAKELLKEYV+REKKKR ER E QKANEA+ KCLG+Y
Sbjct: 233 NINKKSVDRDLDVYRNLLSKLVQAKELLKEYVEREKKKRGER-ESQKANEAVTKCLGDYQ 291
Query: 296 YS 297
Y+
Sbjct: 292 YA 293
>gi|21592994|gb|AAM64943.1| unknown [Arabidopsis thaliana]
gi|58761181|gb|AAW82331.1| chloroplast thylakoid formation 1 [Arabidopsis thaliana]
Length = 300
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 201/300 (67%), Positives = 241/300 (80%), Gaps = 18/300 (6%)
Query: 3 SLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFR----ASSSSSRM 58
+++S++F ++GQ+ +SNF R S C +F S S+S+
Sbjct: 5 AISSLSFPALGQSD------------KISNFASSRPLASAIRICTKFSRLSLNSRSTSKS 52
Query: 59 IIQCMSTAT-DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPV 117
+I CMS T DVPP V+ETK FLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPV
Sbjct: 53 LIHCMSNVTADVPP-VSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPV 111
Query: 118 FALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV 177
FALGFVTVYD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV
Sbjct: 112 FALGFVTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLV 171
Query: 178 EFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNV 237
+F SKEG++E +LKDIA RA K FSYSRFFAVGLFRLLELA+AT+PTVL+KLCA LN+
Sbjct: 172 DFSSKEGDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNI 231
Query: 238 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 297
NK+SVDRDLDVYRNLLSKL+QA ELLKEYV+REKKK+EER + QKANE I KCLG+ LY+
Sbjct: 232 NKKSVDRDLDVYRNLLSKLVQANELLKEYVEREKKKQEERAQSQKANETISKCLGDTLYN 291
>gi|18399513|ref|NP_565491.1| protein THYLAKOID FORMATION 1 [Arabidopsis thaliana]
gi|75206547|sp|Q9SKT0.1|THF1_ARATH RecName: Full=Protein THYLAKOID FORMATION 1, chloroplastic; Flags:
Precursor
gi|4454459|gb|AAD20906.1| expressed protein [Arabidopsis thaliana]
gi|17065446|gb|AAL32877.1| Unknown protein [Arabidopsis thaliana]
gi|20148535|gb|AAM10158.1| unknown protein [Arabidopsis thaliana]
gi|330251998|gb|AEC07092.1| protein THYLAKOID FORMATION 1 [Arabidopsis thaliana]
Length = 300
Score = 406 bits (1044), Expect = e-111, Method: Compositional matrix adjust.
Identities = 201/300 (67%), Positives = 241/300 (80%), Gaps = 18/300 (6%)
Query: 3 SLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFR----ASSSSSRM 58
+++S++F ++GQ+ +SNF R S C +F S S+S+
Sbjct: 5 AISSLSFPALGQSD------------KISNFASSRPLASAIRICTKFSRLSLNSRSTSKS 52
Query: 59 IIQCMSTAT-DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPV 117
+I CMS T DVPP V+ETK FLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPV
Sbjct: 53 LIHCMSNVTADVPP-VSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPV 111
Query: 118 FALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV 177
FALGFVTVYD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV
Sbjct: 112 FALGFVTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLV 171
Query: 178 EFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNV 237
+F SKEG++E +LKDIA RA K FSYSRFFAVGLFRLLELA+AT+PTVL+KLCA LN+
Sbjct: 172 DFSSKEGDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNI 231
Query: 238 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 297
NK+SVDRDLDVYRNLLSKL+QAKELLKEYV+REKKK+ ER + QKANE I KCLG+ LY+
Sbjct: 232 NKKSVDRDLDVYRNLLSKLVQAKELLKEYVEREKKKQGERAQSQKANETISKCLGDTLYN 291
>gi|293333399|ref|NP_001168867.1| uncharacterized protein LOC100382672 [Zea mays]
gi|223973419|gb|ACN30897.1| unknown [Zea mays]
Length = 284
Score = 403 bits (1035), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/297 (68%), Positives = 235/297 (79%), Gaps = 16/297 (5%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MAS++S+ F + + R SST VS R R S S ++
Sbjct: 1 MASISSLPFAPLRGAAEWRP---SSTAPAVSG---------AVVLSARSRRGSGS---VV 45
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
+C++TA+DVPPTV ETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYD VFAL
Sbjct: 46 RCVATASDVPPTVGETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDAVFAL 105
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LMEGYPS EDR++IF+AYITAL EDP QYR DA K+E WAR Q SSLV+F
Sbjct: 106 GFVTVYDQLMEGYPSIEDRDSIFKAYITALNEDPNQYRADALKMEGWARSQNGSSLVDFS 165
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+KLCA LN+NKR
Sbjct: 166 SRDGEIESILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKLCAALNINKR 225
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 297
SVDRDLDVYRN+LSKL+QAKELLKEYVDREKKKREER+E K NEA+ K G LYS
Sbjct: 226 SVDRDLDVYRNILSKLVQAKELLKEYVDREKKKREERSETPKPNEAVTKFDGN-LYS 281
>gi|357122407|ref|XP_003562907.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Brachypodium distachyon]
Length = 286
Score = 402 bits (1034), Expect = e-110, Method: Compositional matrix adjust.
Identities = 186/238 (78%), Positives = 211/238 (88%)
Query: 55 SSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
SR +++C++ D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQY
Sbjct: 42 GSRSVVRCVAATADIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQY 101
Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
DPVFALGFVTVYD+LMEGYPS EDR+AIF++YITAL EDPEQYR DAQK+EEWAR Q S
Sbjct: 102 DPVFALGFVTVYDQLMEGYPSNEDRDAIFKSYITALNEDPEQYRADAQKMEEWARAQNGS 161
Query: 175 SLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV 234
LVEF S++GE+E +LKDI+ERA G GNFSYSRFFAVGLFRLLELANATEPTVL+KLCA
Sbjct: 162 LLVEFSSRDGEIEAVLKDISERAQGNGNFSYSRFFAVGLFRLLELANATEPTVLDKLCAA 221
Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLG 292
LN+NKRSVDRDLD+YRNLLSKL+QAKELLKEY+DREKKKREER E K NE + K G
Sbjct: 222 LNINKRSVDRDLDIYRNLLSKLVQAKELLKEYIDREKKKREERLETPKPNEPVAKFDG 279
>gi|242050546|ref|XP_002463017.1| hypothetical protein SORBIDRAFT_02g036270 [Sorghum bicolor]
gi|241926394|gb|EER99538.1| hypothetical protein SORBIDRAFT_02g036270 [Sorghum bicolor]
Length = 284
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 191/236 (80%), Positives = 213/236 (90%), Gaps = 1/236 (0%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
C++TA DVPPTVAETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYDPVF LG
Sbjct: 47 CVATAGDVPPTVAETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDPVFGLG 106
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
FVTVYD+LMEGYPS EDR++IF+AYITAL EDP QYR DA K+EEWAR Q ASSLV+F S
Sbjct: 107 FVTVYDQLMEGYPSNEDRDSIFRAYITALNEDPTQYRADALKMEEWARSQNASSLVDFSS 166
Query: 182 KEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+KLC LNV+KRS
Sbjct: 167 RDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKLCTALNVSKRS 226
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 297
VDRDLDVYRN+LSKL+QAKELLKEYVDREKKKREER+E K NEA+ K G LYS
Sbjct: 227 VDRDLDVYRNILSKLVQAKELLKEYVDREKKKREERSETPKPNEAVTKFDGN-LYS 281
>gi|297832696|ref|XP_002884230.1| hypothetical protein ARALYDRAFT_900469 [Arabidopsis lyrata subsp.
lyrata]
gi|297330070|gb|EFH60489.1| hypothetical protein ARALYDRAFT_900469 [Arabidopsis lyrata subsp.
lyrata]
Length = 298
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 189/238 (79%), Positives = 217/238 (91%), Gaps = 1/238 (0%)
Query: 60 IQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFA 119
I CMST TDVPP V+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPVFA
Sbjct: 54 IHCMSTVTDVPP-VSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFA 112
Query: 120 LGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
LGFVTVYD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV+F
Sbjct: 113 LGFVTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDF 172
Query: 180 PSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNK 239
S++GE+E LLKDIA RA+ K FSYSRFFAVGLFRLLELA+AT+PTVL+KLCA LN+NK
Sbjct: 173 SSRQGEIEALLKDIAGRAASKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINK 232
Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 297
+SVDRDLDVYRNLLSKL+QAKELL+EYV+REKKK+ ER E QKANE I KCLG+ LY+
Sbjct: 233 KSVDRDLDVYRNLLSKLVQAKELLREYVEREKKKQGERAESQKANETISKCLGDTLYN 290
>gi|326493802|dbj|BAJ85363.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 286
Score = 399 bits (1026), Expect = e-109, Method: Compositional matrix adjust.
Identities = 185/238 (77%), Positives = 213/238 (89%)
Query: 55 SSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
SR +++C+ TA D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQY
Sbjct: 42 GSRSVVRCVITAGDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQY 101
Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
DPVFALGFVTVYD+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q +
Sbjct: 102 DPVFALGFVTVYDQLMEGYPSNEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGN 161
Query: 175 SLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV 234
LVEF S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+KLCA
Sbjct: 162 LLVEFSSRDGEIESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKLCAA 221
Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLG 292
LN+NK+SVDRDLDVYRNLLSKL+QAKELLKEYV+REKKKR ER E K NEA+ K G
Sbjct: 222 LNINKKSVDRDLDVYRNLLSKLVQAKELLKEYVEREKKKRAERLETPKPNEAVAKFDG 279
>gi|397702097|gb|AFO59570.1| chloroplast Ptr ToxA-binding protein [Saccharum hybrid cultivar
GT28]
Length = 284
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 188/236 (79%), Positives = 212/236 (89%), Gaps = 1/236 (0%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
C++TA DVPPTV+ETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYDPVF LG
Sbjct: 47 CVATAGDVPPTVSETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDPVFGLG 106
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
FVTVYD+LMEGYPS EDR++IF+ YITAL EDP+QYR DA K+EEWAR Q SSLV+F S
Sbjct: 107 FVTVYDQLMEGYPSNEDRDSIFRTYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSS 166
Query: 182 KEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+KLC LNV+KRS
Sbjct: 167 RDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKLCTALNVSKRS 226
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 297
VDRDLDVYRN+LSKL+QAKELLKEYVDREKKKREER+E K NEA+ K G LYS
Sbjct: 227 VDRDLDVYRNILSKLVQAKELLKEYVDREKKKREERSETPKPNEAVTKFDGN-LYS 281
>gi|212720892|ref|NP_001131923.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
gi|194692932|gb|ACF80550.1| unknown [Zea mays]
gi|195644742|gb|ACG41839.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
gi|414887096|tpg|DAA63110.1| TPA: chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
Length = 284
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 186/236 (78%), Positives = 212/236 (89%), Gaps = 1/236 (0%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
C++T DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFALG
Sbjct: 47 CVATTGDVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFALG 106
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
FVTVYD+LMEGYPS EDR++IF+AYITAL EDP+QYR DA K+EEWAR Q SSLV+F S
Sbjct: 107 FVTVYDQLMEGYPSNEDRDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSS 166
Query: 182 KEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPT+L+KLCA LNV+KRS
Sbjct: 167 RDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLELSNATEPTILDKLCAALNVSKRS 226
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 297
VDRDLDVYRN+LSKL+QAKELLKEYVDREKKKREER+E K NEA+ K G LYS
Sbjct: 227 VDRDLDVYRNILSKLVQAKELLKEYVDREKKKREERSEAPKPNEAVTKFDGN-LYS 281
>gi|195653795|gb|ACG46365.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
Length = 284
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 198/297 (66%), Positives = 236/297 (79%), Gaps = 16/297 (5%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSSRMII 60
MA+++S+ F ++ + R SST + VS R R S S ++
Sbjct: 1 MAAISSLPFAALRGAAEWRP---SSTAAAVSG---------AVVLSARSRRGSCS---VV 45
Query: 61 QCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFAL 120
+C++T DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFAL
Sbjct: 46 RCVATTGDVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFAL 105
Query: 121 GFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP 180
GFVTVYD+LME YPS ED+++IF+AYITAL EDP+QYR DA K+EEWAR Q SSLV+F
Sbjct: 106 GFVTVYDQLMERYPSNEDKDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFS 165
Query: 181 SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPT+L+KLCA LNV+KR
Sbjct: 166 SRDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLELSNATEPTILDKLCAALNVSKR 225
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 297
SVDRDLDVYRN+LSKL+QAKELLKEYVDREKKKREER+E K NEA+ K G LYS
Sbjct: 226 SVDRDLDVYRNILSKLVQAKELLKEYVDREKKKREERSEAPKPNEAVTKFDGN-LYS 281
>gi|125558787|gb|EAZ04323.1| hypothetical protein OsI_26464 [Oryza sativa Indica Group]
Length = 287
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 191/238 (80%), Positives = 217/238 (91%)
Query: 55 SSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
SR +++C++TA DVPPTVAETKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQY
Sbjct: 43 GSRSVVRCVATAGDVPPTVAETKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKTTYQY 102
Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
D VFALGFVTVYD+LMEGYPS EDR+AIF+AYITAL EDPEQYR DAQK+EEWAR Q +
Sbjct: 103 DAVFALGFVTVYDQLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGN 162
Query: 175 SLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV 234
SLVEF SK+GE+E +LKDI+ERA GKG+FSYSRFFAVGLFRLLELANATEPT+L+KLCA
Sbjct: 163 SLVEFSSKDGEIEAILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKLCAA 222
Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLG 292
LN+NKRSVDRDLDVYRN+LSKL+QAKELLKEYV+REKKKREER+E K+NEA+ K G
Sbjct: 223 LNINKRSVDRDLDVYRNILSKLVQAKELLKEYVEREKKKREERSETPKSNEAVTKFDG 280
>gi|115472755|ref|NP_001059976.1| Os07g0558500 [Oryza sativa Japonica Group]
gi|75147522|sp|Q84PB7.1|THF1_ORYSJ RecName: Full=Protein THYLAKOID FORMATION1, chloroplastic; Flags:
Precursor
gi|29367385|gb|AAO72565.1| inositol phosphatase-like protein [Oryza sativa Japonica Group]
gi|34394010|dbj|BAC84034.1| inositol phosphatase-like protein [Oryza sativa Japonica Group]
gi|113611512|dbj|BAF21890.1| Os07g0558500 [Oryza sativa Japonica Group]
gi|125600704|gb|EAZ40280.1| hypothetical protein OsJ_24722 [Oryza sativa Japonica Group]
gi|215694285|dbj|BAG89278.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 287
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 190/238 (79%), Positives = 216/238 (90%)
Query: 55 SSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
SR +++C++TA DVPPTVAETKMNFLK YKRPI SIY+TVLQEL+VQQHLMRYK TYQY
Sbjct: 43 GSRSVVRCVATAGDVPPTVAETKMNFLKSYKRPILSIYSTVLQELLVQQHLMRYKTTYQY 102
Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
D VFALGFVTVYD+LMEGYPS EDR+AIF+AYITAL EDPEQYR DAQK+EEWAR Q +
Sbjct: 103 DAVFALGFVTVYDQLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGN 162
Query: 175 SLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV 234
SLVEF SK+GE+E +LKDI+ERA GKG+FSYSRFFAVGLFRLLELANATEPT+L+KLCA
Sbjct: 163 SLVEFSSKDGEIEAILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKLCAA 222
Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLG 292
LN+NKRSVDRDLDVYRN+LSKL+QAKELLKEYV+REKKKREER+E K+NEA+ K G
Sbjct: 223 LNINKRSVDRDLDVYRNILSKLVQAKELLKEYVEREKKKREERSETPKSNEAVTKFDG 280
>gi|52548246|gb|AAU82110.1| chloroplast inositol phosphatase-like protein [Triticum aestivum]
Length = 286
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 185/238 (77%), Positives = 214/238 (89%)
Query: 55 SSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
SR +++C++TA D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQY
Sbjct: 42 GSRSVVRCVATAGDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQY 101
Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
DPVFALGFVTVYD+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q +
Sbjct: 102 DPVFALGFVTVYDQLMEGYPSTEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGN 161
Query: 175 SLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV 234
LVEF S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+KLCA
Sbjct: 162 LLVEFSSRDGEIESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKLCAA 221
Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLG 292
LN+NK+SVDRDLDVYRNLLSKL+QAKELLKEY+ REKKKREER E K NEA+ K G
Sbjct: 222 LNINKKSVDRDLDVYRNLLSKLVQAKELLKEYIKREKKKREERLETPKPNEAVAKFDG 279
>gi|38570261|gb|AAR24582.1| chloroplast-localized Ptr ToxA-binding protein1 [Triticum aestivum]
gi|81239115|gb|ABB60085.1| chloroplast-localized Ptr ToxA-binding protein1 [Triticum aestivum]
Length = 286
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 185/238 (77%), Positives = 215/238 (90%)
Query: 55 SSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
SR +++C++TA D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQY
Sbjct: 42 GSRSVVRCVATAGDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQY 101
Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
DPVFALGFVTVYD+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q +
Sbjct: 102 DPVFALGFVTVYDQLMEGYPSTEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGN 161
Query: 175 SLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV 234
LVEF S++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+KLCA
Sbjct: 162 LLVEFSSRDGEIESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKLCAA 221
Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLG 292
LN+NK+SVDRDLDVYRNLLSKL+QAKELLKEY++REKKKREER E K NEA+ K G
Sbjct: 222 LNINKKSVDRDLDVYRNLLSKLVQAKELLKEYIEREKKKREERLETPKPNEAVAKFDG 279
>gi|157849728|gb|ABV89647.1| chloroplast light-regulated protein [Brassica rapa]
Length = 273
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 190/284 (66%), Positives = 228/284 (80%), Gaps = 17/284 (5%)
Query: 1 MASLTSVAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRFRASSSSS---- 56
MA++ S++F ++GQ+ + N + +R L S S + R+ +SS
Sbjct: 1 MAAIASLSFPALGQSG--KLSNPTPSRPLAS--------VSAISRRISRRSLTSSRPSTS 50
Query: 57 --RMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
+ +I CMS+ TDVPP V+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTY+Y
Sbjct: 51 SSKFVIHCMSSVTDVPP-VSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRY 109
Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
DPVFALGFVTVYD+LM+GYPS++DR++IFQAY+ AL E P+QYRIDAQK+EEWAR QT++
Sbjct: 110 DPVFALGFVTVYDQLMDGYPSDQDRDSIFQAYVEALNEVPKQYRIDAQKMEEWARSQTSA 169
Query: 175 SLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV 234
SLV+F KEGEVE +LKDI+ERA K FSYSRFFAVGLFRLLELA AT+PTVL+KLCA
Sbjct: 170 SLVDFSFKEGEVEAILKDISERAGSKEGFSYSRFFAVGLFRLLELAGATDPTVLDKLCAS 229
Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 278
LN+NK+SVDRDLDVYRNLLSKL+QAKELLKEYV+REKKKR ER
Sbjct: 230 LNINKKSVDRDLDVYRNLLSKLVQAKELLKEYVEREKKKRGERA 273
>gi|116782547|gb|ABK22548.1| unknown [Picea sitchensis]
Length = 304
Score = 360 bits (925), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 179/245 (73%), Positives = 207/245 (84%), Gaps = 3/245 (1%)
Query: 53 SSSSRMIIQC-MS-TATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKR 110
+SSS + +C MS TA+D+P TVAETK FLK YKRPIPSIYN V+QELIVQQHLMRYKR
Sbjct: 61 ASSSLSVTRCSMSITASDIP-TVAETKSAFLKAYKRPIPSIYNNVIQELIVQQHLMRYKR 119
Query: 111 TYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG 170
TYQYD VFALGFV+VYD+LM+GYPS+ D EAIF+AYI ALKEDPEQYR DA+KLEEWA
Sbjct: 120 TYQYDAVFALGFVSVYDQLMDGYPSDGDSEAIFRAYINALKEDPEQYRSDAKKLEEWASS 179
Query: 171 QTASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK 230
Q A S+VEF S++GEVEG+LKDIAERA K FSYSRFFA+GLFRLLE ANAT+P VLEK
Sbjct: 180 QDAKSIVEFQSRDGEVEGILKDIAERAREKKIFSYSRFFAIGLFRLLERANATDPVVLEK 239
Query: 231 LCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKC 290
LC LN++K SVDRDLD+YRN+LSKL+Q+KELLKEYV+REKKKR ER QK++EA+ K
Sbjct: 240 LCGALNISKPSVDRDLDIYRNILSKLVQSKELLKEYVEREKKKRTERESNQKSSEAVAKI 299
Query: 291 LGEYL 295
YL
Sbjct: 300 ESTYL 304
>gi|302807588|ref|XP_002985488.1| hypothetical protein SELMODRAFT_122474 [Selaginella moellendorffii]
gi|302810785|ref|XP_002987083.1| hypothetical protein SELMODRAFT_125247 [Selaginella moellendorffii]
gi|300145248|gb|EFJ11926.1| hypothetical protein SELMODRAFT_125247 [Selaginella moellendorffii]
gi|300146694|gb|EFJ13362.1| hypothetical protein SELMODRAFT_122474 [Selaginella moellendorffii]
Length = 206
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 146/199 (73%), Positives = 172/199 (86%)
Query: 71 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
PTVA+TK FLK +++PIPSIYN VLQEL+VQQHLMRY TY+YD VFALGFVTVYD+LM
Sbjct: 3 PTVADTKSAFLKAFRKPIPSIYNNVLQELLVQQHLMRYNATYKYDAVFALGFVTVYDQLM 62
Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
+GYP+ +D EAIF+AYI AL EDP+QYR DA+KLEEWA QTASSL F S +G+VE +L
Sbjct: 63 DGYPNAQDSEAIFKAYIEALGEDPDQYRKDAKKLEEWASSQTASSLASFNSGDGDVEEVL 122
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYR 250
KDIA+RA+GK +F YSRFFAVGLFRL+E ANA++P VLEKLC LNV+K SVDRDLDVYR
Sbjct: 123 KDIAQRAAGKTSFHYSRFFAVGLFRLVERANASDPAVLEKLCNALNVSKMSVDRDLDVYR 182
Query: 251 NLLSKLLQAKELLKEYVDR 269
NLL+KL QAK+LLKEY+DR
Sbjct: 183 NLLTKLSQAKDLLKEYIDR 201
>gi|168043272|ref|XP_001774109.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674516|gb|EDQ61023.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 215
Score = 310 bits (794), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 142/199 (71%), Positives = 171/199 (85%)
Query: 71 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
PTVA+TK++F+K Y++PIPSIY+ V+QEL+VQQHLMRY TY YDP+FALGFVTVYD+LM
Sbjct: 7 PTVADTKLSFIKSYRKPIPSIYSNVIQELLVQQHLMRYNSTYVYDPIFALGFVTVYDQLM 66
Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
+GYP++EDR+AIF+AYI+AL EDPEQYR D++KLEEWA Q+ S + +F K+GEVE L
Sbjct: 67 DGYPNDEDRDAIFKAYISALNEDPEQYRKDSKKLEEWAAAQSGSGIADFAGKDGEVEAAL 126
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYR 250
KDIAERA+GK F YSRFFA+GLFRLLE A A++P VLE L LNV+KRSVDRDLDVYR
Sbjct: 127 KDIAERAAGKEKFHYSRFFAIGLFRLLECAKASDPAVLETLSKALNVSKRSVDRDLDVYR 186
Query: 251 NLLSKLLQAKELLKEYVDR 269
NLLSKL Q KEL+KEYVDR
Sbjct: 187 NLLSKLAQGKELIKEYVDR 205
>gi|217072610|gb|ACJ84665.1| unknown [Medicago truncatula]
gi|388509564|gb|AFK42848.1| unknown [Medicago truncatula]
Length = 219
Score = 297 bits (760), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 144/219 (65%), Positives = 179/219 (81%), Gaps = 3/219 (1%)
Query: 1 MASLTS-VAFTSIGQTSCQRKVNVSSTRSLVSNFEGFRFRTSLFCHCVRF-RASSSSSRM 58
MA++TS +F+++ Q+S ++ +SSTR S+ + RFR S+ V R S+ S++
Sbjct: 1 MAAVTSSFSFSTLTQSSQRKLTTLSSTRFFGSDSDAVRFRFSISLRYVGVVRTSNFVSKL 60
Query: 59 IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVF 118
+++C S+ +D PPTV+ETK+NFLK YKRPIPSIYN+VLQELIVQQHLMRYK++Y+YDPVF
Sbjct: 61 VVRCSSSVSD-PPTVSETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKKSYRYDPVF 119
Query: 119 ALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE 178
ALGFVTVYD+LMEGYPS+EDR+AIFQAYI ALKEDP QYR+DAQKLEEWAR Q A+SL+E
Sbjct: 120 ALGFVTVYDQLMEGYPSDEDRDAIFQAYINALKEDPAQYRVDAQKLEEWARAQNATSLIE 179
Query: 179 FPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL 217
F S+E EVEG LKDIAERA G G+FSYSRFFAVG F L
Sbjct: 180 FSSREREVEGTLKDIAERAGGNGDFSYSRFFAVGFFDFL 218
>gi|168037112|ref|XP_001771049.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677737|gb|EDQ64204.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 205
Score = 296 bits (758), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 137/199 (68%), Positives = 162/199 (81%)
Query: 71 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
PTV+ETK +F+K Y++PIPSIY+ V+QEL+VQQHLMRY TY YDP+FALGFVTVYD+LM
Sbjct: 7 PTVSETKASFIKSYRKPIPSIYSNVIQELLVQQHLMRYNSTYTYDPIFALGFVTVYDQLM 66
Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
+GYP DR++IF AYI AL EDP +YR DA+KLEEWA Q+AS + +F S++GEVE L
Sbjct: 67 DGYPDATDRDSIFTAYINALNEDPVKYREDAKKLEEWASAQSASGITDFTSRDGEVEATL 126
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYR 250
K IAERA K F YSRFFA+GLFRLLE A A++P VLE L LNVNKRSVDRDLDVYR
Sbjct: 127 KSIAERAGSKDKFHYSRFFAIGLFRLLECAKASDPAVLESLSKALNVNKRSVDRDLDVYR 186
Query: 251 NLLSKLLQAKELLKEYVDR 269
NLLSKL Q KEL+KEY +R
Sbjct: 187 NLLSKLAQGKELIKEYNER 205
>gi|414887097|tpg|DAA63111.1| TPA: hypothetical protein ZEAMMB73_220735 [Zea mays]
Length = 207
Score = 276 bits (706), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 124/159 (77%), Positives = 143/159 (89%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
C++T DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFALG
Sbjct: 47 CVATTGDVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFALG 106
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
FVTVYD+LMEGYPS EDR++IF+AYITAL EDP+QYR DA K+EEWAR Q SSLV+F S
Sbjct: 107 FVTVYDQLMEGYPSNEDRDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSS 166
Query: 182 KEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA 220
++GE+E +LKDI+ERA GKGNFSYSRFFAVGLFRLL+ A
Sbjct: 167 RDGEIEAILKDISERAKGKGNFSYSRFFAVGLFRLLDFA 205
>gi|326492686|dbj|BAJ90199.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 239
Score = 218 bits (554), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 96/125 (76%), Positives = 112/125 (89%)
Query: 55 SSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQY 114
SR +++C+ TA D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQY
Sbjct: 42 GSRSVVRCVITAGDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQY 101
Query: 115 DPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTAS 174
DPVFALGFVTVYD+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q +
Sbjct: 102 DPVFALGFVTVYDQLMEGYPSNEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGN 161
Query: 175 SLVEF 179
LVEF
Sbjct: 162 LLVEF 166
>gi|384250113|gb|EIE23593.1| photosystem II biogenesis protein Psp29 [Coccomyxa subellipsoidea
C-169]
Length = 290
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 110/215 (51%), Positives = 152/215 (70%), Gaps = 4/215 (1%)
Query: 70 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
PPTVAETK NF + + RPIP IY+ V+QEL+VQ H+MRY ++Y YD VF LGFV+V+D++
Sbjct: 65 PPTVAETKRNFYEAFSRPIPGIYSNVIQELLVQHHIMRYNKSYSYDEVFGLGFVSVFDQV 124
Query: 130 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG-EVEG 188
+EG P E D+ A+F AYI +L E+ +QYR DA+K+E A+ + + ++ P EG E++
Sbjct: 125 LEGLP-EGDKGALFSAYIGSLGENGDQYRQDAEKVEALAKELSGPAELK-PDAEGSELQK 182
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDV 248
L IAER+S +GNF Y++FFA+GLFRLLEL A +P LE L + + + + SV RDL
Sbjct: 183 KLASIAERSS-QGNFLYTKFFAIGLFRLLELTGAKDPKALEGLVSAMKIPQESVSRDLMT 241
Query: 249 YRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 283
Y+ +LSKL AK+L+ E REKKK ER +KA
Sbjct: 242 YKGVLSKLSAAKDLMNEMYAREKKKAAEREAEKKA 276
>gi|159471025|ref|XP_001693657.1| inositol phosphatase-like protein [Chlamydomonas reinhardtii]
gi|158283160|gb|EDP08911.1| inositol phosphatase-like protein [Chlamydomonas reinhardtii]
Length = 266
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 115/226 (50%), Positives = 156/226 (69%), Gaps = 3/226 (1%)
Query: 52 SSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRT 111
SS + + + + AT PPTVAETK FL Y +PI SIY+TVLQEL+VQQH MRY +
Sbjct: 24 SSLARKAVATPVLAATFKPPTVAETKAKFLSGYNKPIASIYSTVLQELLVQQHFMRYSKN 83
Query: 112 YQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQ 171
YQY+P+FALGFV+VY++++E S E+R AIF+AY+ AL ED ++Y+ DA LE+ A G
Sbjct: 84 YQYNPIFALGFVSVYEQILESL-SAEERGAIFKAYVDALGEDADKYKRDASALEQAANGL 142
Query: 172 TASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKL 231
T SL P+ +G AS G FSY++F A+GLFRLLEL+ A EP+ LEKL
Sbjct: 143 TPESLT--PNADGNEVQKALASISSASAAGAFSYNKFVAIGLFRLLELSGAKEPSALEKL 200
Query: 232 CAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
+ V +V+RDL +Y+ +LSKL AKEL++E+V+REK+K+ ER
Sbjct: 201 VKAVGVKPEAVNRDLLMYKGVLSKLAAAKELMREFVEREKRKQAER 246
>gi|356555139|ref|XP_003545894.1| PREDICTED: LOW QUALITY PROTEIN: protein THYLAKOID FORMATION1,
chloroplastic-like [Glycine max]
Length = 152
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 99/134 (73%), Positives = 112/134 (83%)
Query: 137 EDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAER 196
E R+AIFQAYI AL EDP++YRIDA+KLEEWA Q +SLVEF SKEGE E LKDIA R
Sbjct: 19 EGRDAIFQAYIKALVEDPDKYRIDARKLEEWAGVQNPTSLVEFSSKEGEAEKXLKDIAXR 78
Query: 197 ASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKL 256
A GK FSYSRFFAVGLFRL+EL NATEP +L+KLCA LN+NKRSVD DLDVY LLS+L
Sbjct: 79 AGGKXEFSYSRFFAVGLFRLVELENATEPIILDKLCAALNINKRSVDWDLDVYCILLSEL 138
Query: 257 LQAKELLKEYVDRE 270
LQ KELLKEY+D++
Sbjct: 139 LQVKELLKEYIDKD 152
>gi|307108772|gb|EFN57011.1| hypothetical protein CHLNCDRAFT_143677 [Chlorella variabilis]
Length = 273
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 92/203 (45%), Positives = 135/203 (66%), Gaps = 7/203 (3%)
Query: 69 VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 128
PPTVA+ K+ F +K+P+P+IY+TV+QEL+VQQHL R+ + YQY+ V ALG V+++++
Sbjct: 49 APPTVADAKLKFNGAFKKPLPAIYSTVVQELLVQQHLFRWNKQYQYNEVTALGIVSIFEQ 108
Query: 129 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
++ G P E REA+F A+I AL+EDP+QYR DA +EE ARG++ + P G+
Sbjct: 109 VLGGLPDAE-REAVFDAFINALQEDPKQYRKDAAAMEELARGKSEVA----PDASGDKVQ 163
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDV 248
GK F Y++FFAVGLFRL+EL + +P L L L +++ V+ DL
Sbjct: 164 QALAAVAAKEGK--FLYTKFFAVGLFRLVELTGSKDPKSLTTLVKALGLSQERVNADLMT 221
Query: 249 YRNLLSKLLQAKELLKEYVDREK 271
Y+ +LSKL AKE++KE++ REK
Sbjct: 222 YKGVLSKLEAAKEIMKEFMAREK 244
>gi|302852549|ref|XP_002957794.1| hypothetical protein VOLCADRAFT_107813 [Volvox carteri f.
nagariensis]
gi|300256865|gb|EFJ41122.1| hypothetical protein VOLCADRAFT_107813 [Volvox carteri f.
nagariensis]
Length = 373
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 100/248 (40%), Positives = 148/248 (59%), Gaps = 43/248 (17%)
Query: 65 TATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVT 124
A PPTVAETK F + Y +PI SIY+TVLQEL+VQQH MRY + Y Y+ +FALGFV+
Sbjct: 38 AANFTPPTVAETKAKFFEGYSKPIASIYSTVLQELLVQQHFMRYSKDYVYNEIFALGFVS 97
Query: 125 VYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG-------------- 170
VY++++E P E R+AIF +Y+ AL EDPE Y+ D++++E+ A
Sbjct: 98 VYEQILESLPQSE-RDAIFVSYVKALGEDPEAYKRDSERVEKAAGALSGPDALVPDAEGS 156
Query: 171 --QTASSLVEFPSKEGEVEGLLKDIAERASGKGN-----------------------FSY 205
QT++ + + + GE+ + R G+G+ FSY
Sbjct: 157 DVQTSAYIWAYHQRRGEMRMPWRT---RTWGQGSSSLGVCSYGKALDAIKAASAADAFSY 213
Query: 206 SRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 265
++F A+GLFRLLEL A EP LE+L + + +V+RDL +Y+ +LSKL AKE+++E
Sbjct: 214 NKFVAIGLFRLLELTGAKEPAALERLVKSVGIKPEAVNRDLLMYKGVLSKLAAAKEMMRE 273
Query: 266 YVDREKKK 273
+V+REK++
Sbjct: 274 FVEREKRR 281
>gi|255075137|ref|XP_002501243.1| predicted protein [Micromonas sp. RCC299]
gi|226516507|gb|ACO62501.1| predicted protein [Micromonas sp. RCC299]
Length = 260
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 97/220 (44%), Positives = 142/220 (64%), Gaps = 17/220 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A+TK F++ Y PIPSI++ + EL+ QH +RY Y Y + +LGFV+VYD+L E
Sbjct: 51 TLADTKRKFVESYPYPIPSIWSVAVNELLANQHFVRYSTRYSYSKLSSLGFVSVYDQLFE 110
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
G+PS+E++ IF ++ AL EDPE+ R DA +L ++A+ + G V+ LL
Sbjct: 111 GFPSDEEKAKIFDCFVEALGEDPEKCRKDAAELAKFAK------------EAGGVDALLA 158
Query: 192 D--IAE-RASGKGN-FSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 247
+AE +++G+ N F+YSR+ A+GLFR+LEL ATEP LEKL + + V+ DL
Sbjct: 159 SPVLAEIKSNGEANKFAYSRYDAIGLFRMLELGGATEPAALEKLADAAGLKLKKVNGDLG 218
Query: 248 VYRNLLSKLLQAKELLKEYVDREKKKREER-TEPQKANEA 286
+Y+ LLSKL AKEL KE +REK+K ER + + AN+A
Sbjct: 219 MYKGLLSKLAAAKELQKEIFEREKRKTAERLAKKEAANDA 258
>gi|428213026|ref|YP_007086170.1| photosystem II biogenesis protein Psp29 [Oscillatoria acuminata PCC
6304]
gi|428001407|gb|AFY82250.1| photosystem II biogenesis protein Psp29 [Oscillatoria acuminata PCC
6304]
Length = 235
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 90/225 (40%), Positives = 136/225 (60%), Gaps = 17/225 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F ++ RPI SIY V++EL+V+ HL+ + YDP++ALG VT +DR M+
Sbjct: 6 TVSDTKRAFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDFNYDPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF------PSKEGE 185
GY EED+ +IF L+ DP++YR DAQ LEE A + +V P EG+
Sbjct: 66 GYRPEEDKISIFNGICKGLEADPQKYRQDAQWLEEIASRHSGEEMVALLSRSAGPEMEGD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVN 238
+G+L IA K NF YSR FAVGLF LLE A+ + ++K+C LN+
Sbjct: 126 FQGILGAIA----AKPNFKYSRLFAVGLFTLLEQADLELVKNEKSRQEAVQKICTALNLP 181
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 283
+ +DLD+YR L K++QA+ ++++ + ++KKRE+R + + A
Sbjct: 182 VDKLSKDLDLYRTNLEKMIQARSVMEDILAADRKKREDRAKQKGA 226
>gi|427736065|ref|YP_007055609.1| photosystem II biogenesis protein Psp29 [Rivularia sp. PCC 7116]
gi|427371106|gb|AFY55062.1| photosystem II biogenesis protein Psp29 [Rivularia sp. PCC 7116]
Length = 233
Score = 170 bits (431), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 88/223 (39%), Positives = 145/223 (65%), Gaps = 22/223 (9%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+ETK F L+ RPI +IY V++EL+V+ HL+ ++YDP++ALG VT +DR M+
Sbjct: 6 TVSETKRTFYSLHTRPINTIYRRVVEELMVEMHLLGVNADFKYDPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK-----EGEV 186
GY EED+E+I+ A I +++EDP++YR DA++LE+ A+ T LV S+ + E+
Sbjct: 66 GYNPEEDKESIYNALIKSVEEDPQKYRHDAKRLEDLAKSTTGKDLVSDLSQRRLANDSEL 125
Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTV----------LEKLCAVLN 236
+GLL+ IA +S F YSR FA+GL+ LLE +++P + L+ + A LN
Sbjct: 126 QGLLEGIANNSS----FKYSRLFAIGLYTLLE---SSDPEMVKDEKLRNEALKTIAAGLN 178
Query: 237 VNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+++ + +DLD+YR+ L K+ QA ++ + + ++K+RE+R +
Sbjct: 179 LSEDKLSKDLDLYRSNLDKMAQAAIVMADMIAADRKRREQRAQ 221
>gi|145344894|ref|XP_001416959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577185|gb|ABO95252.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 203
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 86/211 (40%), Positives = 131/211 (62%), Gaps = 18/211 (8%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK FL+ Y PIPS+++TV QEL+VQ H +Y +Y + +LGFV+V+D+L E
Sbjct: 1 TVSDTKAKFLQAYPYPIPSVWSTVTQELLVQGHFAKYNAKSEYSELASLGFVSVFDQLYE 60
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
G+PSE ++ IF A++ AL ED + R DA+ +L F + G V+GL
Sbjct: 61 GFPSETEKVKIFNAFLGALGEDAAKTRADAE------------ALGAFAASAGGVDGLSA 108
Query: 190 ---LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 246
+A + S + Y+++ A+G+FR+LELA AT+P LE L ++ + V+ DL
Sbjct: 109 NPIFATMAAK-SAENKLMYTKYIAIGIFRMLELAKATDPKALEALAQAGGLSFKKVNGDL 167
Query: 247 DVYRNLLSKLLQAKELLKEYVDREKKKREER 277
+Y+ LLSKL AKEL +E+++REK+K ER
Sbjct: 168 AMYKGLLSKLASAKELQEEFLEREKRKTAER 198
>gi|428223566|ref|YP_007107663.1| photosystem II biogenesis protein Psp29 [Geitlerinema sp. PCC 7407]
gi|427983467|gb|AFY64611.1| photosystem II biogenesis protein Psp29 [Geitlerinema sp. PCC 7407]
Length = 239
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 87/224 (38%), Positives = 140/224 (62%), Gaps = 11/224 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F ++ RPI SIY V++EL+V+ HL+ ++YDP +ALG VT Y+R M+
Sbjct: 6 TVSDTKRAFYSMHTRPINSIYRRVVEELMVEMHLLSVNVDFRYDPFYALGVVTSYERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE---GEVEG 188
GY E+D+ +IF++ A + DP YR DA++L E+ + +A L+ + S E G+ +G
Sbjct: 66 GYRPEQDKTSIFESLCRANEGDPGHYRHDAERLAEFTKNLSAEELISWLSLETPRGDDQG 125
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRS 241
L + + + + F YSR FA+GLF L+E AN A EK+ A L++
Sbjct: 126 LGESL-QAIANHSQFKYSRLFAIGLFTLVEQANPDLVKDEAQRTATFEKVVAALHLPADK 184
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANE 285
+ +DL++YR+ L KL QA+ ++++ + ++KKREER + QKA+E
Sbjct: 185 LQKDLELYRSNLEKLTQARIVMEDILKADRKKREEREQAQKASE 228
>gi|303286071|ref|XP_003062325.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455842|gb|EEH53144.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 222
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 80/171 (46%), Positives = 109/171 (63%), Gaps = 8/171 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TVA+TK FLK Y PIPSI++ LQEL+V QH +RY + Y Y + +LGFV+VYD+L E
Sbjct: 45 TVADTKQKFLKSYPYPIPSIWSVALQELLVTQHFVRYSKKYSYSKLSSLGFVSVYDQLFE 104
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
G+PSEE++ IF+ ++ AL+EDP R DA +L +A G + V +++ L+
Sbjct: 105 GFPSEEEKNTIFECFVKALEEDPATVRKDAAELASFAEGASGVDGVLASPIFAQMKSLVA 164
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSV 242
D G F+YSR+ A+GLFRLLELA ATEP LEKL + RS+
Sbjct: 165 D--------GKFAYSRYDAIGLFRLLELAKATEPAALEKLAESSGLQARSI 207
>gi|158338004|ref|YP_001519180.1| Thf1-like protein [Acaryochloris marina MBIC11017]
gi|189030267|sp|B0C3M8.1|THF1_ACAM1 RecName: Full=Protein thf1
gi|158308245|gb|ABW29862.1| photosystem II biogenesis protein Psb29 [Acaryochloris marina
MBIC11017]
Length = 247
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 80/216 (37%), Positives = 135/216 (62%), Gaps = 16/216 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F ++ RP+ S+Y V++EL+V+ HL+R ++YDP+FALG T +DR M+
Sbjct: 6 TVSDTKRAFYSIHTRPVNSVYRRVVEELMVEMHLLRVNEDFRYDPIFALGVTTSFDRFMD 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG-----EV 186
GY E D++AIF A A + DP Q + D Q+L E A+ ++A ++++ ++ E+
Sbjct: 66 GYQPENDKDAIFSAICKAQEADPVQMKKDGQRLTELAQSKSAQEMLDWITQAANSGGDEL 125
Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA--NATE-----PTVLEKLCAVLNVNK 239
+ L++IA+ F YSR FA+GLF LLEL+ N T+ L +C VLN+++
Sbjct: 126 QWQLRNIAQNPK----FKYSRLFAIGLFTLLELSEGNITQDEESLAEFLPNICTVLNISE 181
Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 275
+ +DL++YR L K+ Q ++ + + ++ +KK+RE
Sbjct: 182 SKLQKDLEIYRGNLDKIAQVRQAMDDILEAQKKRRE 217
>gi|411116557|ref|ZP_11389044.1| photosystem II biogenesis protein Psp29 [Oscillatoriales
cyanobacterium JSC-12]
gi|410712660|gb|EKQ70161.1| photosystem II biogenesis protein Psp29 [Oscillatoriales
cyanobacterium JSC-12]
Length = 246
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 84/216 (38%), Positives = 131/216 (60%), Gaps = 11/216 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F ++ RPI SIY V++EL+V+ HL+ Y Y+P++ALG VT ++R M+
Sbjct: 6 TVSDTKRAFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDYSYNPIYALGVVTSFERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E D+ IF A AL++DP +YR DAQ+L ++A+ ++A +V + + G
Sbjct: 66 GYRPENDKAPIFDAICQALQDDPNRYRHDAQRLNDFAKQKSAKDIVTWLEQAATSYG-GD 124
Query: 192 DIAERASGKGN---FSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRS 241
D+ E+ N F YSR FA+GLF L E A+A +L++ CA L ++
Sbjct: 125 DLQEQVKAIANNPKFKYSRLFAIGLFTLFETADAEVVKKEGEREELLKQACAALRLSHDK 184
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
V RDL++YR+ L K+ QA+ ++ + + EKKKRE +
Sbjct: 185 VQRDLELYRSNLEKVAQAQAVMADMLAAEKKKREHK 220
>gi|334116992|ref|ZP_08491084.1| Protein thf1 [Microcoleus vaginatus FGP-2]
gi|333461812|gb|EGK90417.1| Protein thf1 [Microcoleus vaginatus FGP-2]
Length = 237
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 86/219 (39%), Positives = 132/219 (60%), Gaps = 17/219 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F ++ RPI SIY V++EL+V+ HL+ +QYDP++ALG VT +DR M
Sbjct: 6 TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSANADFQYDPIYALGVVTAFDRFML 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E DR +IF A +L++DP++Y+ DAQ+LE A + L+ + + E
Sbjct: 66 GYAPEADRVSIFNALCKSLEDDPDRYKQDAQRLESLADRLSGQELLSWLDRSTSFE---- 121
Query: 192 DIAERASGKG------NFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 238
D A+ + G F YSR FA+GLF LLE A+ T + K+ A L++
Sbjct: 122 DTADLQASLGAIASNPQFKYSRLFAIGLFSLLEKADPNLVKDQETRNDAIAKVSAALHLP 181
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
+ V +DLD+YR+ L K+ QA+ +L++ + E+KKRE+R
Sbjct: 182 EDKVSKDLDLYRSNLEKMAQARIVLQDVIQAERKKREKR 220
>gi|119488459|ref|ZP_01621632.1| hypothetical protein L8106_23815 [Lyngbya sp. PCC 8106]
gi|119455270|gb|EAW36410.1| hypothetical protein L8106_23815 [Lyngbya sp. PCC 8106]
Length = 241
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 82/216 (37%), Positives = 135/216 (62%), Gaps = 11/216 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI S+Y V++EL+V+ HL+ +QYDP++ALG V+ +DR M+
Sbjct: 6 TVSDTKRAFYNTHTRPINSVYRRVIEELMVEMHLLSVNVDFQYDPIYALGVVSAFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 189
GY E D+E+IF I AL++DP++YR +AQ+L+E+A+ + +V + + EV
Sbjct: 66 GYLPESDKESIFHGLINALQDDPQRYRAEAQRLQEFAQTLSVQDIVSWVDVAANSEVHND 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN--------ATEPTVLEKLCAVLNVNKRS 241
L+ ++ + + YSR A+GLF L+E A+ AT+ T L +L + LN+
Sbjct: 126 LQSSFQKIATNPKYKYSRILAIGLFTLIEQADPQAMEDKEATQQT-LAQLASGLNLPLDK 184
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
+ +DL++YR+ L KL QA+ ++ E E+K+RE+R
Sbjct: 185 LQKDLELYRSNLEKLKQARIVMDEMTQAERKRREQR 220
>gi|428317172|ref|YP_007115054.1| Protein thf1 [Oscillatoria nigro-viridis PCC 7112]
gi|428240852|gb|AFZ06638.1| Protein thf1 [Oscillatoria nigro-viridis PCC 7112]
Length = 237
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 85/219 (38%), Positives = 132/219 (60%), Gaps = 17/219 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F ++ RPI SIY V++EL+V+ HL+ +QYDP++ALG VT +DR M
Sbjct: 6 TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSANADFQYDPIYALGVVTAFDRFML 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E DR +IF A ++++DP++Y+ DAQ+LE A + L+ + + E
Sbjct: 66 GYVPEADRVSIFNALCKSVEDDPDRYKQDAQRLESLADRLSGQELLSWLDRSTSFE---- 121
Query: 192 DIAERASGKG------NFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 238
D A+ + G F YSR FA+GLF LLE A+ T + K+ A L++
Sbjct: 122 DTADLQASLGAIASNPQFKYSRLFAIGLFSLLEKADPNLVKDQETRNDAIAKVSAGLHLP 181
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
+ V +DLD+YR+ L K+ QA+ +L++ + E+KKRE+R
Sbjct: 182 EDKVSKDLDLYRSNLEKMAQARIVLQDVIQAERKKREKR 220
>gi|354568723|ref|ZP_08987886.1| Protein thf1 [Fischerella sp. JSC-11]
gi|353539977|gb|EHC09457.1| Protein thf1 [Fischerella sp. JSC-11]
Length = 235
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 83/223 (37%), Positives = 137/223 (61%), Gaps = 21/223 (9%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P+FALG VT +DR M+
Sbjct: 6 TVSDTKRTFHTLHTRPINTIYRRVVEELMVEMHLLAVNVDFSYNPIFALGVVTSFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E D+E+IF A + A++ DP+ YR DAQ+L+E A+ L+ S ++ +
Sbjct: 66 GYQPESDKESIFNALLRAIEADPQIYRQDAQRLQELAKSLPPQDLIAALSLQTQLNRDTD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA---------TEPTVLEKLCAVLN 236
++ L+ IA F YSR FA+GLF LLEL++ TE L+ + A L+
Sbjct: 126 LQSHLQAIA----SNPKFKYSRLFAIGLFSLLELSDPELVKDEKQRTE--ALKSIAAGLH 179
Query: 237 VNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
++ +++DL++YR+ L K+ QA ++ + + ++KKRE+R++
Sbjct: 180 ISDDKLNKDLELYRSNLDKMAQALVVMADMLSADRKKREQRSQ 222
>gi|424513129|emb|CCO66713.1| Thf1-like protein [Bathycoccus prasinos]
Length = 222
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 83/220 (37%), Positives = 131/220 (59%), Gaps = 12/220 (5%)
Query: 70 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
P TVA+TK F K Y P+PSI+ TVLQEL+V H YQ++ + +LGFV+V+D+L
Sbjct: 4 PATVADTKAKFTKGYPYPLPSIWATVLQELLVGMHFTVTSSKYQHEEMRSLGFVSVFDQL 63
Query: 130 MEGYPSEED--REAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA-SSLVEFPSKEGEV 186
EGYP+E+ +E IF ++ AL ED +++R DA+KL +A QT+ ++ P
Sbjct: 64 FEGYPTEDPNAKEKIFSTFMEALGEDSKKWRADAEKLSAFATEQTSIDGIIANP------ 117
Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 246
+ + + K + Y +F A+G FR LE++ T P L+K+ V ++ DL
Sbjct: 118 --MFASMKSKVESK-SLVYDKFIAIGFFRALEMSKQTSPENLKKISEASGVTLEKINGDL 174
Query: 247 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 286
+Y+++LS++ AKEL E ++RE++K ER E + A +A
Sbjct: 175 GLYKSVLSRMNAAKELQAEVLERERRKTAERMEKKAAKDA 214
>gi|113474941|ref|YP_721002.1| Thf1-like protein [Trichodesmium erythraeum IMS101]
gi|123056927|sp|Q116P5.1|THF1_TRIEI RecName: Full=Protein thf1
gi|110165989|gb|ABG50529.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 239
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 82/216 (37%), Positives = 129/216 (59%), Gaps = 9/216 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI SIYN V++EL+V+ HL+ Y Y+P +ALG VT +DR M+
Sbjct: 6 TVSDTKKTFYHFHTRPINSIYNRVIEELLVEMHLISVNVDYSYNPFYALGVVTAFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 189
GY +ED+ +IF A I +EDP +YR DA+ LE+ A +AS ++ + SK +
Sbjct: 66 GYSPQEDKTSIFNALIQGQEEDPNKYRSDAKGLEDLAGKISASDILSWICLSKNIDNTQY 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 242
L+D S F YSR FA+GLF LLE+ + L+K+C LN+ + +
Sbjct: 126 LQDDLRAISENSKFRYSRLFAIGLFTLLEIVDTELIKEQEKRTEALKKICQSLNLVEEKL 185
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 278
+D+D+Y + L ++ QA+ +++ + +KKRE+R+
Sbjct: 186 LKDIDLYLSNLERVAQARSAMEDTLAAMRKKREKRS 221
>gi|434405136|ref|YP_007148021.1| photosystem II biogenesis protein Psp29 [Cylindrospermum stagnale
PCC 7417]
gi|428259391|gb|AFZ25341.1| photosystem II biogenesis protein Psp29 [Cylindrospermum stagnale
PCC 7417]
Length = 235
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 83/234 (35%), Positives = 139/234 (59%), Gaps = 24/234 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYTLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E D+E+IF A I A++++P++YR DA++L+ A+G L+ + S ++
Sbjct: 66 GYQPERDQESIFNAIIQAVEQEPQRYRQDAERLQAVAQGLPEQDLIAWLSQTTHSDRDAN 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA-------NATEPTVLEKLCAVLNVN 238
++ L+ IA NF YSR FA+GLF LLE++ + L+ + L+++
Sbjct: 126 LQAQLQAIA----NNSNFKYSRLFAIGLFSLLEVSSPELVKDDKQRNEALKAIATGLHLS 181
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE-------PQKANE 285
+ +DL++YR+ L K+ QA ++ + V ++KKRE+R + P ANE
Sbjct: 182 DDKLSKDLELYRSNLDKMAQALIVMADMVSADRKKREQRKQQASTPVAPPSANE 235
>gi|428775508|ref|YP_007167295.1| photosystem II biogenesis protein Psp29 [Halothece sp. PCC 7418]
gi|428689787|gb|AFZ43081.1| photosystem II biogenesis protein Psp29 [Halothece sp. PCC 7418]
Length = 243
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 81/210 (38%), Positives = 126/210 (60%), Gaps = 9/210 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T++ETK F L+ RP+ SIY V++EL+V+ HL+ ++YDP +ALG VTV+D M+
Sbjct: 6 TLSETKRTFYTLHTRPLNSIYRRVIEELLVEMHLLTVNIDFKYDPFYALGVVTVFDTFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 189
GY E+D+E+IF A A++ DP+QYR DA+K++ A + ++ + +K + G
Sbjct: 66 GYQPEKDKESIFNAICKAVESDPQQYRQDAEKVKSIADQASGEAVTAWLCEAKPLDQAGD 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 242
L DI + F YSR F +G++ +LE AN VL C LN+ K V
Sbjct: 126 LNDILQGIRENPRFKYSRLFIIGIYTVLEKANPEIVNDDKKREEVLNNCCQALNLPKEKV 185
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKK 272
D+DLD+YR+ L K+ QA+ +L++ V ++K
Sbjct: 186 DKDLDLYRSNLEKMEQARSVLEDVVRADRK 215
>gi|220910509|ref|YP_002485820.1| Thf1-like protein [Cyanothece sp. PCC 7425]
gi|254784141|sp|B8HQ62.1|THF1_CYAP4 RecName: Full=Protein thf1
gi|219867120|gb|ACL47459.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7425]
Length = 236
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 89/228 (39%), Positives = 137/228 (60%), Gaps = 14/228 (6%)
Query: 70 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
P TV++TK F + RPI SIY V++EL+V+ HL+R +T+ YDPVFALG VT ++R
Sbjct: 4 PRTVSDTKRAFYHNHARPINSIYRRVVEELLVEIHLLRVNQTFVYDPVFALGVVTTFERF 63
Query: 130 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 189
M+GY D+ +IF A A + DP+Q + DAQ+L RGQ+ SL+++ S + G
Sbjct: 64 MQGYHPPADQTSIFNAICLAQELDPQQVQQDAQELLGRVRGQSLESLLDWISTAASLGGD 123
Query: 190 LKDIAERA-SGKGNFSYSRFFAVGLFRLLELANATEP----------TVLEKLCAVLNVN 238
+ RA + F YSR FAVGLF LLE A EP VL+++ V+++
Sbjct: 124 EQQNRLRAIASNPTFKYSRLFAVGLFTLLEQA---EPELGKDEARLLQVLQQVGEVMHLP 180
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 286
+ +DL+ YR+ L K+ QA++ L++ V E+K+R++ P ++ E+
Sbjct: 181 VEKMQKDLEQYRSNLEKMTQARKTLEDIVAAERKRRQQNAAPDRSPES 228
>gi|427727466|ref|YP_007073703.1| photosystem II biogenesis protein Psp29 [Nostoc sp. PCC 7524]
gi|427363385|gb|AFY46106.1| photosystem II biogenesis protein Psp29 [Nostoc sp. PCC 7524]
Length = 235
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 83/234 (35%), Positives = 137/234 (58%), Gaps = 24/234 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYSLHTRPINTIYRRVVEELMVEMHLLSVNIDFTYNPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E D+E+IF A A++++P++YR DA++L+ A+ S LV + S ++ +
Sbjct: 66 GYRPERDKESIFHAICQAVEQEPQRYRQDAERLQNLAKSLPISDLVAWLSQTTHFNQDPD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVN 238
++ L+ IA NF YSR FA+GLF LLE ++ L+ + L++
Sbjct: 126 LQAQLQAIA----NNPNFKYSRLFAIGLFSLLEYSDPDLVKDEKQRTEALKNIANGLHLA 181
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE-------PQKANE 285
+ +DLD+YR+ L K+ QA ++ + + ++KKRE+R + P ANE
Sbjct: 182 DDKLSKDLDLYRSNLDKMTQALTVIADMISADRKKREQRQQQSSSVVAPPTANE 235
>gi|17228142|ref|NP_484690.1| Thf1-like protein [Nostoc sp. PCC 7120]
gi|81772969|sp|Q8YZ41.1|THF1_ANASP RecName: Full=Protein thf1
gi|17129992|dbj|BAB72604.1| all0646 [Nostoc sp. PCC 7120]
Length = 233
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 80/221 (36%), Positives = 134/221 (60%), Gaps = 17/221 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR ME
Sbjct: 6 TVSDTKRTFYALHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFME 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E D+E+IF A A++++P++YR DA++L+ A+ + LV + S ++ +
Sbjct: 66 GYQPERDKESIFSAICQAVEQEPQRYRQDAERLQAVAQSLPVNDLVAWLSQANHLQQDAD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVN 238
++ L+ IA NF YSR FA+GLF LLE +N L+ + A L+++
Sbjct: 126 LQAQLQAIA----NNSNFKYSRLFAIGLFTLLEQSNPDLVKDEKQRTEALKSIAAGLHLS 181
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+DL++YR+ L K+ QA ++ + + ++KKRE+R +
Sbjct: 182 DDKFSKDLELYRSNLDKMTQALAVMADMLTADRKKREQRQQ 222
>gi|443311308|ref|ZP_21040938.1| photosystem II biogenesis protein Psp29 [Synechocystis sp. PCC
7509]
gi|442778631|gb|ELR88894.1| photosystem II biogenesis protein Psp29 [Synechocystis sp. PCC
7509]
Length = 241
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/226 (34%), Positives = 141/226 (62%), Gaps = 9/226 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI +IY V++EL+V+ HL+ + Y+P++ALG VT Y+R M+
Sbjct: 6 TVSDTKRAFYSTHTRPINTIYRRVVEELMVEMHLLSVNADFSYNPIYALGVVTSYERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
GY E D+++IFQA A+ DP QYR DA++L +A+ ++ L+++ S E ++G
Sbjct: 66 GYQPERDKDSIFQALCQAINTDPHQYRQDAERLGSFAKSLSSQDLMQWLSSEKPIDGYSD 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRSV 242
L++ ++ + F YSR FA+G+F LLEL++ +++ + L++ + +
Sbjct: 126 LQEQIKQIATNQKFKYSRLFAIGVFSLLELSDPELVKDETKRVEAFKQISSSLHLPEDKL 185
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIK 288
++DL++YR + K+ QA ++++ + E+KKR+++ + Q+A A K
Sbjct: 186 NKDLELYRANVEKMNQALIVMEDMLAAERKKRQKKADEQQAALAAK 231
>gi|300866330|ref|ZP_07111033.1| Protein thf1 [Oscillatoria sp. PCC 6506]
gi|300335673|emb|CBN56193.1| Protein thf1 [Oscillatoria sp. PCC 6506]
Length = 267
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 81/217 (37%), Positives = 133/217 (61%), Gaps = 9/217 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F ++ RPI SIY V++EL+V+ HL+ ++Y+P++ALG VT ++R M+
Sbjct: 36 TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDFRYNPIYALGVVTAFERFMQ 95
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 189
GY E+D+ +IF AL +DP++Y+ DA++LE A + L+ + S E G
Sbjct: 96 GYLPEQDKVSIFNGLCQALGDDPQRYQQDARRLEGLASRVSILDLLSWLEGSTSFEDTGD 155
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN---ATEP----TVLEKLCAVLNVNKRSV 242
L+ + F YSR FA+GLF LLE+ + +P + K+CA L++ + V
Sbjct: 156 LQASITAIATNSKFKYSRLFAIGLFALLEIVDPDLVKDPEARVQAIAKVCAALHLPEEKV 215
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+DLD+YR+ L K+ QA+ +L + + ++KKRE+R E
Sbjct: 216 TKDLDLYRSNLEKIAQARIVLADVLQADRKKREKRAE 252
>gi|186685250|ref|YP_001868446.1| Thf1-like protein [Nostoc punctiforme PCC 73102]
gi|254784144|sp|B2J353.1|THF1_NOSP7 RecName: Full=Protein thf1
gi|186467702|gb|ACC83503.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 235
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 80/225 (35%), Positives = 137/225 (60%), Gaps = 17/225 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV------EFPSKEGE 185
GY E D+E+IF A A+++DP+ YR DA++L+ A+G L+ + ++ +
Sbjct: 66 GYEPERDQESIFNALCRAIEQDPQHYRQDAERLQAIAKGLPVKDLIGWLGQTTYLDRDAD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA----TEPTVLEKLCAV---LNVN 238
++ L+ IA NF Y+R FA+G+F LLE ++ E + E L A+ L+V+
Sbjct: 126 LQAQLQAIA----NNPNFKYNRLFAIGVFSLLEQSDPELVKDEKQLTEALKAIAAGLHVS 181
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 283
+++DL++YR+ L K+ QA ++ + + ++KKRE+R + A
Sbjct: 182 DDKLNKDLELYRSNLDKMAQALVVMADMLSADRKKREQRKQQSTA 226
>gi|440683252|ref|YP_007158047.1| Protein thf1 [Anabaena cylindrica PCC 7122]
gi|428680371|gb|AFZ59137.1| Protein thf1 [Anabaena cylindrica PCC 7122]
Length = 235
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 83/221 (37%), Positives = 135/221 (61%), Gaps = 17/221 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ Y Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNVDYSYNPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E D+E+IF A A+++D ++YR DA +L+ A+ L+ + S K+ +
Sbjct: 66 GYLPERDQESIFNALCQAVEQDQQRYRQDATRLQAIAQSLPVQDLIAWVSQTTHLDKDAD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA----TEPTVLEKLCAV---LNVN 238
++ L+ IA NF YSR FA+GLF LLELA+ E E L A+ L+++
Sbjct: 126 LQAQLQAIAH----NPNFKYSRLFAIGLFSLLELADPELVKDEKQRNEALKAIAQGLHLS 181
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+ + +DLD+YR+ L K+ QA ++ + + ++KKR++R +
Sbjct: 182 EDKLSKDLDLYRSNLDKMAQALIVMADILSADRKKRDQRQQ 222
>gi|75910773|ref|YP_325069.1| Thf1-like protein [Anabaena variabilis ATCC 29413]
gi|97202708|sp|Q3M4B2.1|THF1_ANAVT RecName: Full=Protein thf1
gi|75704498|gb|ABA24174.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 233
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 79/221 (35%), Positives = 135/221 (61%), Gaps = 17/221 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYALHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E D+E+IF A A++++P++YR DA++L+ A+ + LV + S ++ +
Sbjct: 66 GYQPERDKESIFSAICQAVEQEPQRYRQDAERLKAVAQSLPVNDLVAWLSQANHLQQDAD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVN 238
++ L+ IA NF YSR FA+GLF LLE +N L+ + A L+++
Sbjct: 126 LQAQLQAIA----SNPNFKYSRLFAIGLFTLLEQSNPDLVKDEKQRTEALKTIAAGLHLS 181
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+ +DL++YR+ L K+ QA ++ + + ++KKRE+R +
Sbjct: 182 DDKLSKDLELYRSNLDKMTQALAVMADMLTADRKKREQRQQ 222
>gi|428778484|ref|YP_007170270.1| photosystem II biogenesis protein Psp29 [Dactylococcopsis salina
PCC 8305]
gi|428692763|gb|AFZ48913.1| photosystem II biogenesis protein Psp29 [Dactylococcopsis salina
PCC 8305]
Length = 240
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 77/209 (36%), Positives = 123/209 (58%), Gaps = 9/209 (4%)
Query: 68 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 127
D T++ETK F + RP+ SIY V++EL+V+ HL+ ++YDP++ALG TV+D
Sbjct: 2 DTLRTLSETKRTFYTQHTRPLNSIYRRVIEELLVEMHLLSVNTDFKYDPIYALGVTTVFD 61
Query: 128 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 187
M+GY E+++E+IF A A++ DP++YR DA+KL+ A + + S+ ++
Sbjct: 62 TFMQGYQPEKEKESIFNAICQAVENDPQKYRQDAEKLKSIAANHSGEEVTACLSELKPLD 121
Query: 188 GL--LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLEKLCAVLNVN 238
G L + + F YSR F +GL+ +LE AN T VL+K C L +
Sbjct: 122 GAEELTKVLQEIKNNSRFKYSRLFIIGLYTILETANPDLVTDDKKREEVLQKCCQGLGLP 181
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
K VD+DLD+YR+ L K+ QA+ +L++ +
Sbjct: 182 KEKVDKDLDLYRSNLEKMEQARSVLEDAI 210
>gi|376001810|ref|ZP_09779664.1| Putative thylakoid formation protein, Thf1-like [Arthrospira sp.
PCC 8005]
gi|375329721|emb|CCE15417.1| Putative thylakoid formation protein, Thf1-like [Arthrospira sp.
PCC 8005]
Length = 243
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 82/215 (38%), Positives = 124/215 (57%), Gaps = 9/215 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F ++ RPI SIY V++EL+V+ HL+ ++YDP++ALG VT +DR M+
Sbjct: 6 TVSDTKRAFYNIHTRPINSIYRRVVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 189
GY E D+ +I+ A I A + DP QYR DA LE A + L E ++E +
Sbjct: 66 GYIPEADKLSIWAALIMAQESDPNQYRADATALEAQAATLSVKDLTERAKIAQESSGDDP 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 242
L+ + F YSR FA+GL+ LLE ++ T E L + L + K +
Sbjct: 126 LQSCFHAIANNPKFKYSRLFAIGLYTLLEKSDVTAAQDSEGLKNIIIDFSEALRLPKDKL 185
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
++DLD+YR L K+ QA+ +++E E+KKRE+R
Sbjct: 186 EKDLDLYRTNLEKVAQARLMVEEMTQAERKKREQR 220
>gi|428210102|ref|YP_007094455.1| photosystem II biogenesis protein Psp29 [Chroococcidiopsis
thermalis PCC 7203]
gi|428012023|gb|AFY90586.1| photosystem II biogenesis protein Psp29 [Chroococcidiopsis
thermalis PCC 7203]
Length = 250
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 75/224 (33%), Positives = 138/224 (61%), Gaps = 9/224 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK NF + RPI +IY V++EL+V+ HL+ ++YDP++ALG VT ++R M+
Sbjct: 6 TVSDTKRNFYNQHTRPINTIYRRVVEELMVEMHLLSVNADFRYDPIYALGVVTAFERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
GY E D+E IF+A +++++P++YR DA +L + + +A L ++ + ++G
Sbjct: 66 GYQPERDKEPIFEALCQSIEDNPQRYRQDADRLRQLLQNVSAQQLFDWIDGKASLQGAED 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 242
L+ + + F YSR FA+G+F LLELA+A L+++ L+V + +
Sbjct: 126 LQAQMQAIAQNSKFKYSRLFAIGVFTLLELADAELVKDEKQRVEALKQVATALHVPEDKL 185
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 286
++DL++YR+ L K+ QA + + + +++KR++R + ++A A
Sbjct: 186 NKDLELYRSNLDKIEQALITMADILSADRRKRQQRLQEKEAGVA 229
>gi|427707894|ref|YP_007050271.1| Protein thf1 [Nostoc sp. PCC 7107]
gi|427360399|gb|AFY43121.1| Protein thf1 [Nostoc sp. PCC 7107]
Length = 235
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 138/230 (60%), Gaps = 16/230 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYSLHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV--EGL 189
GY E D+E+IFQA A++++ ++YR DA++L+ A+ A+ L+ + S+ + +
Sbjct: 66 GYQPERDKESIFQAICQAVEQEVQRYRQDAERLQALAKSLAANDLIAWLSQTNHLNQDPD 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRSV 242
L+ + + F Y+R FA+GLF LLE ++ ++ + A L++++ +
Sbjct: 126 LQSQLQAIANNSQFKYNRLFAIGLFSLLEQSDPDLVKDEKQRTDAIKTIAAGLHLSEDKL 185
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE-------PQKANE 285
+DL++YR+ L K+ QA ++ + + ++KKRE+R + P ANE
Sbjct: 186 SKDLELYRSNLEKMSQALVVMADMISADRKKREQRQQQSTMPVTPPTANE 235
>gi|428219024|ref|YP_007103489.1| Protein thf1 [Pseudanabaena sp. PCC 7367]
gi|427990806|gb|AFY71061.1| Protein thf1 [Pseudanabaena sp. PCC 7367]
Length = 260
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 83/224 (37%), Positives = 129/224 (57%), Gaps = 10/224 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++ K +F + + RPI S+Y V+ EL+V+ HL+ +T+ YDPVFALG +T YDR M
Sbjct: 6 TVSDAKRDFFQAFPRPINSVYRRVVDELLVEMHLLTVNQTFAYDPVFALGAITAYDRFML 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
GY E +R+ I A A+ + EQ R DA L E A ++ + +F + E L
Sbjct: 66 GYEPESERDRILPAICGAVHLNAEQMRHDASSLAELAM-RSPIDVKQFLTSLETTENLEP 124
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELA-------NATEPTVLEKLCAVLNVNKRSV 242
L + F YSR FA+GLF LLE A N +++++ LN+ +
Sbjct: 125 LTGTIRAIAANQKFKYSRLFAIGLFTLLETADPNTMSDNDKRQELIKQVGDALNLGSEKL 184
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 286
+DLD+YR+ L K+ QA++++K+ V+ E+KK+E+R P K++ A
Sbjct: 185 IKDLDLYRSNLEKVEQARQMMKDLVEAERKKKEQRENPPKSDAA 228
>gi|291567260|dbj|BAI89532.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 243
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 80/215 (37%), Positives = 120/215 (55%), Gaps = 9/215 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F ++ RPI SIY V++EL+V+ HL+ ++YDP++ALG VT +DR M+
Sbjct: 6 TVSDTKRAFYHIHTRPINSIYRRVVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 189
GY E D+ +I+ A I A + DP QYR DA LE L + ++E +
Sbjct: 66 GYIPEADKLSIWAALIGAQESDPNQYRADATALEAQVASLAVKDLTDKAKMAQESSGDDP 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEP-------TVLEKLCAVLNVNKRSV 242
L+ + F YSR A+GL+ LLE ++AT T+L L + K +
Sbjct: 126 LQSCFHAIANNPKFKYSRLLAIGLYTLLEKSDATAAQDSEGLKTILSDFSEALRLPKDKL 185
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
+DLD+YR L K+ QA+ ++ E E+KKRE+R
Sbjct: 186 VKDLDLYRTNLEKVAQARLMVDEMTQAERKKREQR 220
>gi|434388267|ref|YP_007098878.1| photosystem II biogenesis protein Psp29 [Chamaesiphon minutus PCC
6605]
gi|428019257|gb|AFY95351.1| photosystem II biogenesis protein Psp29 [Chamaesiphon minutus PCC
6605]
Length = 234
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 80/221 (36%), Positives = 127/221 (57%), Gaps = 13/221 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK NF + RPI SIY V++EL+V+ HL+ + YDP++ALG V+ +DR M
Sbjct: 6 TVSDTKRNFYSQHTRPINSIYRRVVEELMVEMHLLSTNVDFAYDPIYALGVVSSFDRFMT 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF---PSKEGEVEG 188
Y E D+++IF A ++ + +QYR DA +EE+AR S ++++ P+ +G
Sbjct: 66 SYRPEADKQSIFVALCESMGGNAQQYRTDATAVEEFARSMQGSDIIDWIAHPTADGMGAQ 125
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
L + AS F YSR F +GLF +LE A E VL+ + L++ K
Sbjct: 126 LATTLQSIASNP-KFKYSRLFGIGLFTILEQAAPDLLKDEKKREAAVLQ-IAEALHLPKD 183
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 281
+DLD YR+ L KL+Q + ++ + + E+KKRE+R + +
Sbjct: 184 KAQKDLDTYRSNLDKLVQMEAVMADLAEAERKKREKRAQAK 224
>gi|434395245|ref|YP_007130192.1| Protein thf1 [Gloeocapsa sp. PCC 7428]
gi|428267086|gb|AFZ33032.1| Protein thf1 [Gloeocapsa sp. PCC 7428]
Length = 251
Score = 143 bits (360), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 133/221 (60%), Gaps = 10/221 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI +IY V++EL+V+ HL+ + Y+P++ALG VT ++R M+
Sbjct: 6 TVSDTKRAFYTSHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTAFERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE--GL 189
GY E D+E+IF A A++ DP++YR DA++L +A+ + L+ + E E G
Sbjct: 66 GYQPERDKESIFNALCQAVESDPQRYRQDAERLGLFAKNTSTPELIAWLRGETHKEEVGD 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRSV 242
L+ + + +F YSR FA+G+F LLEL++ L+ + A LN+++ +
Sbjct: 126 LQQQIQAIAHNPHFKYSRLFAIGVFGLLELSDPALVKDEKQRVDALKSIAATLNISEDKL 185
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 283
++DL++YR + K+ QA + + + ++KKR+++T P K
Sbjct: 186 NKDLELYRANVDKMEQALATIADILSADRKKRQQQT-PDKG 225
>gi|452819272|gb|EME26335.1| thylakoid protein [Galdieria sulphuraria]
Length = 316
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 87/245 (35%), Positives = 132/245 (53%), Gaps = 11/245 (4%)
Query: 36 FRFRTSLFCHCVRFRASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTV 95
+R + + C + SS+S + T TVAET +FLK ++ PIPSIY T+
Sbjct: 54 YRTKRRMECRSIENLKESSAS----NSLPTQNGTVRTVAETISDFLKHFRHPIPSIYRTI 109
Query: 96 LQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPE 155
+QEL+V HL R +QYDPVFALG+ V + YP E++E +F + AL D E
Sbjct: 110 VQELLVTTHLARVAVGFQYDPVFALGYQMVTQVFFKSYPKVEEKEKLFDSMCKALLLDYE 169
Query: 156 QYRIDAQKLEEWARGQTASSLVEFPSKEGE--VEGLLKDIAERASGKGNFSYSRFFAVGL 213
+ + DA LEEW R +T ++ + G+ + LL IA+ F YSR F +GL
Sbjct: 170 RMKKDASVLEEWTRSRTEREILLAIEEGGDDPLANLLHSIAQ----NDGFVYSRLFGLGL 225
Query: 214 FRLLEL-ANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKK 272
R++EL +K + L+++ +++DLD Y+ L +L QA++L E R+KK
Sbjct: 226 VRMMELCGEEANSERCQKWASALHISSLKLEQDLDTYQQSLERLKQAEQLFAELEARQKK 285
Query: 273 KREER 277
K E+
Sbjct: 286 KLAEK 290
>gi|298491449|ref|YP_003721626.1| photosystem II biogenesis protein Psp29 ['Nostoc azollae' 0708]
gi|298233367|gb|ADI64503.1| photosystem II biogenesis protein Psp29 ['Nostoc azollae' 0708]
Length = 235
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 77/225 (34%), Positives = 136/225 (60%), Gaps = 17/225 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ ++Y+ ++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNVDFRYNSIYALGVVTAFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E+D+ +IF A I A+++DP++YR DA +L+ A+ L+ + S ++ +
Sbjct: 66 GYQPEQDQASIFNAIIQAVEQDPQRYRQDAARLQVVAQSLLTKDLISWLSQTTYLDQDRD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 238
++ L+ IA A F YSR FA+GLF LLE+ ++ L+ + L+++
Sbjct: 126 LQAQLQAIANNAE----FKYSRLFAIGLFSLLEMVDSELVKDEKQRNQALKAIAQGLHLS 181
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 283
+ + +DL++YR+ L KL QA ++ + + ++KKR++R + A
Sbjct: 182 EEKLTKDLELYRSNLDKLAQALIVMADMLAADRKKRDQRQQKSTA 226
>gi|427719034|ref|YP_007067028.1| Protein thf1 [Calothrix sp. PCC 7507]
gi|427351470|gb|AFY34194.1| Protein thf1 [Calothrix sp. PCC 7507]
Length = 235
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 133/221 (60%), Gaps = 17/221 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+ ++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNSIYALGVVTTFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 185
GY E D+E+IF A A++++P++YR DA++L A+ A+ L+ + S ++ +
Sbjct: 66 GYLPERDQESIFNALCHAVEQEPQRYRQDAERLRVLAKSLPANDLIAWLSQTTHLDQDAD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 238
++ L+ IA NF YSR A+GLF LLEL++ L+ + L ++
Sbjct: 126 LQAQLQAIA----NNPNFKYSRLLAIGLFTLLELSDPELVKDEKQRNEALKAIATGLQLS 181
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+++DL++YR+ L K+ QA ++ + + ++KKRE+R +
Sbjct: 182 DEKLNKDLELYRSNLDKIAQALIVMADVLSADRKKREQRKQ 222
>gi|56750022|ref|YP_170723.1| Thf1-like protein [Synechococcus elongatus PCC 6301]
gi|81300364|ref|YP_400572.1| Thf1-like protein [Synechococcus elongatus PCC 7942]
gi|56684981|dbj|BAD78203.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169245|gb|ABB57585.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 280
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 82/250 (32%), Positives = 137/250 (54%), Gaps = 20/250 (8%)
Query: 43 FCHCVRFRASSSSSRMIIQCMSTATDVP-----PTVAETKMNFLKLYKRPIPSIYNTVLQ 97
F C+ RA ++ + C S+ VP PTV+++K F Y RPI +Y V++
Sbjct: 3 FGRCLAKRAIAAHN-----CSSSCASVPNVTSVPTVSDSKRAFYAAYPRPINPLYRRVVE 57
Query: 98 ELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQY 157
EL+V+ HL+ ++ YDP+FALG VT +D M Y E +F A A++++PEQY
Sbjct: 58 ELLVEIHLLSVNTSFVYDPLFALGVVTAFDSFMSSYRPIEAVGPLFTALTQAVRQNPEQY 117
Query: 158 RIDAQKLEEWARGQTASSLVEFPSKE---GEVEGLLKDIAERASGKGNFSYSRFFAVGLF 214
R DA + E RG + ++ ++ ++ G L++ + +G+ F YSR FA+GLF
Sbjct: 118 RHDANAIAEQVRGVGSDTIRQWLTEAEALGNAPELVRSSFQAIAGRSEFKYSRLFAIGLF 177
Query: 215 RLLELANAT---EPTVLEKLCAVL----NVNKRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
LLE A +P L+ + ++ + +DLD+YR+ L K+ QA+ ++E +
Sbjct: 178 SLLETAAPDLVQDPEALKTTVTAIAERFHLPSDKLQKDLDLYRSNLEKMEQARITMEEAI 237
Query: 268 DREKKKREER 277
+++KRE+R
Sbjct: 238 QADRRKREQR 247
>gi|443317266|ref|ZP_21046682.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 6406]
gi|442783151|gb|ELR93075.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 6406]
Length = 251
Score = 140 bits (352), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 131/236 (55%), Gaps = 13/236 (5%)
Query: 71 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
PTV++TK F + RPI S+Y V++EL+V+ HL+R + YDPV+ALG VT +DR M
Sbjct: 5 PTVSDTKRAFYSYHNRPIASVYRRVIEELMVEMHLLRVNEDFVYDPVYALGIVTTFDRFM 64
Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL-VEFPSKEGEVEGL 189
GY E D +IF A A +QYR DA+ + G++ +L S+ E L
Sbjct: 65 AGYRPEADEASIFAALCQANAGTADQYRRDAEVMVAEVSGRSLDALKAILISRSAEGADL 124
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS-------V 242
LK + + + + F YSR FA+GL+ L+E +A EKL +L S +
Sbjct: 125 LKGVLQGIADRDRFKYSRAFAIGLYTLIETVDAEILKDKEKLMELLKAVAESLPLSFDKL 184
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQK-----ANEAIKKCLGE 293
+D+++YR+ L+K+ QAK ++ + + ++KKREER + + N+A+ GE
Sbjct: 185 QKDVELYRSNLTKMEQAKIVMADILAADRKKREERAKAKADAASLPNDAVVTPSGE 240
>gi|428304539|ref|YP_007141364.1| Protein thf1 [Crinalium epipsammum PCC 9333]
gi|428246074|gb|AFZ11854.1| Protein thf1 [Crinalium epipsammum PCC 9333]
Length = 243
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 77/217 (35%), Positives = 131/217 (60%), Gaps = 9/217 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + RPI SIY V++EL+V+ HL+ + Y P++ALG VT Y++ M+
Sbjct: 6 TVSDTKRDFYNNHTRPINSIYRRVVEELMVEMHLLSVNVDFAYHPIYALGVVTSYEKFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
GY E DR++IF A + A+ ED ++Y+ DA++L+ A + L+++ V+G
Sbjct: 66 GYRPERDRDSIFDALVGAVGEDSQRYKQDAEQLKALAGRLSGKELIDWIVSPTAVDGAGS 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT----EPTVLEKLCAV---LNVNKRSV 242
L D + F YSR FA+GL+ LLE+++ + E L+ L V L++ +
Sbjct: 126 LPDQMRAIANNPQFKYSRLFAIGLYTLLEVSDPSLVKDEKERLDALNQVGQSLHLPTEKL 185
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+DLD+YR+ L K+ Q + +K+ ++ ++KKRE+R +
Sbjct: 186 HKDLDLYRSNLEKMAQVQIAMKDALEADRKKREKRDQ 222
>gi|428313474|ref|YP_007124451.1| photosystem II biogenesis protein Psp29 [Microcoleus sp. PCC 7113]
gi|428255086|gb|AFZ21045.1| photosystem II biogenesis protein Psp29 [Microcoleus sp. PCC 7113]
Length = 241
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 78/223 (34%), Positives = 130/223 (58%), Gaps = 17/223 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + RP+ SI+ V++EL+V+ HL+ + Y+P++ALG VT ++R ME
Sbjct: 6 TVSDTKRDFYNHHTRPVNSIFRRVVEELMVEMHLLSVNVDFHYEPIYALGVVTSFNRFME 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE------GE 185
GY E D+ +IF A ++ +PEQY+ DAQ LE A T LV + S G+
Sbjct: 66 GYRPERDKASIFDALCHSVGNNPEQYKQDAQWLESMAERVTGEELVSWLSAPRPQDTLGD 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVN 238
+ + IAE F YSR FA+GL+ LLE A++ L+K+ L++
Sbjct: 126 LYAAVAAIAENPK----FKYSRLFAIGLYTLLEKADSELVQDEKRRTEALKKISDGLHLP 181
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 281
+ + +DL++YR+ L K+ Q + ++++ + ++KKRE+R + Q
Sbjct: 182 EEKLQKDLELYRSNLQKMEQVRIVIEDAIQADRKKREKRIQDQ 224
>gi|414076688|ref|YP_006996006.1| photosystem II biogenesis protein Psp29 [Anabaena sp. 90]
gi|413970104|gb|AFW94193.1| photosystem II biogenesis protein Psp29 [Anabaena sp. 90]
Length = 223
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 76/213 (35%), Positives = 125/213 (58%), Gaps = 9/213 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F L+ RPI +IY V++EL+V+ HL+ + YD ++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYTLHTRPINTIYRRVVEELMVEMHLLSVNVDFSYDAIYALGVVTTFDRFMD 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV--EGL 189
GY E+D+E+IF+A A+++DP+ YR DA +L+ A A L+ S+ + +
Sbjct: 66 GYQPEQDKESIFRAICQAVEQDPQSYRQDASRLQALAASLPAKDLIASLSQASPLNQDAD 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATE--PTVLEKLCAVLNVNKRSV 242
L+ E + NF YSR F VGLF LL EL E L+ + L++++ +
Sbjct: 126 LQKQLEAVAANSNFKYSRLFGVGLFALLVQSDPELVKKDEQRAEALKAISNGLHISEDKL 185
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 275
+DL++Y + L K+ QA ++ + + ++KKR+
Sbjct: 186 IKDLELYSSNLEKMAQALIVMADILTADRKKRD 218
>gi|97202823|sp|Q5N664.2|THF1_SYNP6 RecName: Full=Protein thf1
gi|97202830|sp|Q31MY4.2|THF1_SYNE7 RecName: Full=Protein thf1
Length = 254
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 124/217 (57%), Gaps = 10/217 (4%)
Query: 71 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
PTV+++K F Y RPI +Y V++EL+V+ HL+ ++ YDP+FALG VT +D M
Sbjct: 5 PTVSDSKRAFYAAYPRPINPLYRRVVEELLVEIHLLSVNTSFVYDPLFALGVVTAFDSFM 64
Query: 131 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE---GEVE 187
Y E +F A A++++PEQYR DA + E RG + ++ ++ ++ G
Sbjct: 65 SSYRPIEAVGPLFTALTQAVRQNPEQYRHDANAIAEQVRGVGSDTIRQWLTEAEALGNAP 124
Query: 188 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT---EPTVLEKLCAVL----NVNKR 240
L++ + +G+ F YSR FA+GLF LLE A +P L+ + ++
Sbjct: 125 ELVRSSFQAIAGRSEFKYSRLFAIGLFSLLETAAPDLVQDPEALKTTVTAIAERFHLPSD 184
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
+ +DLD+YR+ L K+ QA+ ++E + +++KRE+R
Sbjct: 185 KLQKDLDLYRSNLEKMEQARITMEEAIQADRRKREQR 221
>gi|308801781|ref|XP_003078204.1| inositol phosphatase-like protein (ISS) [Ostreococcus tauri]
gi|116056655|emb|CAL52944.1| inositol phosphatase-like protein (ISS) [Ostreococcus tauri]
Length = 657
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 110/179 (61%), Gaps = 16/179 (8%)
Query: 91 IYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITAL 150
++ TV+QEL+VQ H +Y + +Y+ + +LGFV+VYD+L EG+PSEE++ IF A++ AL
Sbjct: 79 VWATVVQELLVQGHFQKYNKKSEYNELASLGFVSVYDQLFEGFPSEEEKGKIFNAFLGAL 138
Query: 151 KEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKD--IAERA--SGKGNFSYS 206
ED + R DA+ +L F + VEGL ++ A+ A S +G Y+
Sbjct: 139 DEDAVRTRADAE------------TLGAFATSANGVEGLKENAIFAKLAAKSAEGTLLYT 186
Query: 207 RFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 265
++ A+G+FR+LELA AT+P LE L ++ V DL +Y+ LLSKL AKEL +E
Sbjct: 187 KYIAIGMFRMLELAKATDPAALEALVTAGGLSMSKVSGDLSMYKGLLSKLAAAKELQEE 245
>gi|427419843|ref|ZP_18910026.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 7375]
gi|425762556|gb|EKV03409.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 7375]
Length = 258
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 84/234 (35%), Positives = 134/234 (57%), Gaps = 21/234 (8%)
Query: 63 MSTATDVPP-----TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPV 117
MS PP TV++TK F + RPI S+Y V++EL+V+ HL+ + Y+P+
Sbjct: 1 MSNVLLPPPVTNLRTVSDTKRAFYNYHSRPINSLYRRVIEELMVEMHLLSVNVDFVYNPL 60
Query: 118 FALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL- 176
+ALG +T +DR M GY E+D+E+I A A++ DP+QYR DA+ L+ + S L
Sbjct: 61 YALGVITSFDRFMVGYEPEQDKESILSAICQAVEGDPQQYRQDAEALKSDLANLSLSDLN 120
Query: 177 VEFPS-KEGEVEGL---LKDIAERASGKGNFSYSRFFAVGLFRLLELANATE-------P 225
+ S K + GL L +A +AS K Y+R AVGL+ L E + +
Sbjct: 121 TQLASAKTTDGNGLQNKLHVVATQASAK----YTRLMAVGLYTLFETVDISSLEDKDSRE 176
Query: 226 TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+L+ +L + VD+DL++YR+ L K+ QA+E++K+ ++ E+KKRE+R +
Sbjct: 177 EMLKTAAEMLALPAEKVDKDLELYRSNLDKMAQAQEVMKDILEAERKKREQRAQ 230
>gi|257059049|ref|YP_003136937.1| Thf1-like protein [Cyanothece sp. PCC 8802]
gi|256589215|gb|ACV00102.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 8802]
Length = 235
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/215 (33%), Positives = 125/215 (58%), Gaps = 10/215 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + RPI SIY ++EL+V+ HL+ ++YDP++ALG V + + M+
Sbjct: 6 TVSDTKRDFYTHHTRPINSIYRRFIEELLVEMHLLCVNIDFRYDPIYALGVVASFQQFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEV-EG 188
GY EED+ +IF A A+ D E+YR +AQ L +G + S L+ ++ GE EG
Sbjct: 66 GYRPEEDKNSIFSALCQAVGGDGEKYRHEAQTLLNQVKGMSVSDLIAMGNSARTGEPGEG 125
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 241
+L + + + F YSR FA+GL+ ++ +A +LC LN++
Sbjct: 126 MLFNTLQAIANNPQFKYSRLFAIGLYTMVMEIDADLLKEQDKRNETFSQLCNGLNLSSDK 185
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREE 276
+ +DLD+YR+ + K+ Q ++++ ++ E+KKRE+
Sbjct: 186 LQKDLDLYRSNVDKMGQLLAVIEDALEAERKKREK 220
>gi|428302138|ref|YP_007140444.1| Protein thf1 [Calothrix sp. PCC 6303]
gi|428238682|gb|AFZ04472.1| Protein thf1 [Calothrix sp. PCC 6303]
Length = 235
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 72/221 (32%), Positives = 125/221 (56%), Gaps = 17/221 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F ++ RPI +IY V++EL+V+ HL+ + Y+P++ALG T ++R M+
Sbjct: 6 TVSDTKKTFYSIHTRPINTIYRRVVEELMVEMHLLSVNTDFTYNPIYALGVATAFERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK------EGE 185
GY E+D+E +F A +++ D ++ + +A L++ A + L+ S+ GE
Sbjct: 66 GYDPEKDKEQLFHALCQSVEIDTQKIKQEAHSLKDVAASMSVGDLISCLSRAKRFDNAGE 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 238
++ L IA F YSR FA+GLF LLE A+ L + LN++
Sbjct: 126 LQNQLDAIA----SNPKFKYSRLFAIGLFSLLEAASPETVKDEKQRNDALVSIAKGLNIS 181
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+ + +DLD+YR+ L K+ QA ++ + + ++KKRE+R +
Sbjct: 182 EDKLSKDLDLYRSNLDKMAQAMVVMADMLAADRKKREQRAQ 222
>gi|218245998|ref|YP_002371369.1| Thf1-like protein [Cyanothece sp. PCC 8801]
gi|254784143|sp|B7K277.1|THF1_CYAP8 RecName: Full=Protein thf1
gi|218166476|gb|ACK65213.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 8801]
Length = 235
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/215 (33%), Positives = 125/215 (58%), Gaps = 10/215 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + RPI SIY ++EL+V+ HL+ ++YDP++ALG V + + M+
Sbjct: 6 TVSDTKRDFYNHHTRPINSIYRRFIEELLVEMHLLCVNIDFRYDPIYALGVVASFQQFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEV-EG 188
GY EED+ +IF A A+ D E+YR +AQ L +G + S L+ ++ GE EG
Sbjct: 66 GYRPEEDKNSIFSALCQAVGGDGEKYRHEAQTLLNQVKGMSVSDLIAMGNSARTGEPGEG 125
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 241
+L + + + F YSR FA+GL+ ++ +A +LC LN++
Sbjct: 126 MLYNTLQAIAKNPQFKYSRLFAIGLYTMVMEIDADLLKEQDKRNETFSQLCNGLNLSSDK 185
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREE 276
+ +DLD+YR+ + K+ Q ++++ ++ E+KKRE+
Sbjct: 186 LQKDLDLYRSNVDKMGQLLAVIEDALEAERKKREK 220
>gi|218442064|ref|YP_002380393.1| Thf1-like protein [Cyanothece sp. PCC 7424]
gi|254784142|sp|B7KI38.1|THF1_CYAP7 RecName: Full=Protein thf1
gi|218174792|gb|ACK73525.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7424]
Length = 226
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 75/220 (34%), Positives = 132/220 (60%), Gaps = 18/220 (8%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ +QYDPV+ALG VT + R M+
Sbjct: 6 TVSDSKRDFYTKHTRPINSVYRRVVEELMVEMHLLSVNSDFQYDPVYALGVVTSFQRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV-------EFPSKEG 184
GY + D+E+IF A ++ DP+QYR DA+++ E A+ +A L+ + S E
Sbjct: 66 GYRPDADKESIFNALCQSVGGDPQQYRQDAERMIESAKQLSAQQLLFNLESASDSSSGEN 125
Query: 185 EVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNV 237
++ L IA + Y+R FA+G++ +L E+ TE V++++ VL++
Sbjct: 126 QILQTLIGIA----NAPKYKYTRLFAIGIYTILAETDPEMLKNTEKREEVVKQIAKVLHL 181
Query: 238 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
+ + +DLD+YR+ L K+ Q +++E + ++KKRE++
Sbjct: 182 PEEKMQKDLDLYRSNLEKMDQLLTVIEEALQADRKKREQQ 221
>gi|307155000|ref|YP_003890384.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7822]
gi|306985228|gb|ADN17109.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7822]
Length = 233
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/232 (31%), Positives = 133/232 (57%), Gaps = 25/232 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ M+
Sbjct: 6 TVSDSKRDFYSKHTRPINSVYRRVVEELLVETHLLSVNSDFHYDPIYALGVVTSFEQFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA--------SSLVEFPSKE 183
GY E D+E+IF A ++ DP+QYR DAQ + A+ +A SS + +P +
Sbjct: 66 GYRPETDKESIFNALCQSVGGDPQQYRGDAQSILSTAKQLSAQDLLSKLQSSSIAYPQGD 125
Query: 184 GEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEP----------TVLEKLCA 233
++ L IA F Y+R FA+G++ +L T+P V++++
Sbjct: 126 NKIIETLVAIA----NAPKFKYTRLFAIGIYTILA---ETDPELLKDQQKRHEVIKQIAE 178
Query: 234 VLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANE 285
+L++ + + +DLD+YR+ L K+ Q +++E + ++KKRE+R + + E
Sbjct: 179 ILHLPEEKMQKDLDLYRSNLEKMEQLLTVIEEALQADRKKREQRDQAKTQAE 230
>gi|332705256|ref|ZP_08425337.1| photosystem II biogenesis protein Psp29 [Moorea producens 3L]
gi|332355999|gb|EGJ35458.1| photosystem II biogenesis protein Psp29 [Moorea producens 3L]
Length = 257
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 117/206 (56%), Gaps = 11/206 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + RPI SIY V++EL+V+ HL+ + YDP++ LG VT +DR M+
Sbjct: 6 TVSDTKRDFYTYHTRPINSIYRRVVEELMVEMHLLSVNVDFNYDPIYGLGVVTCFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLE---EWARGQTASSLVEFPSKEGEVEG 188
Y E D+E+IF A A+ + +QY+ DAQ+L+ + GQ S + P+ E
Sbjct: 66 SYQPENDKESIFNALCQAVGGEAQQYQEDAQRLKTSVDSMSGQDLISWLSSPTSENGSGD 125
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLEKLCAVLNVNKRS 241
L IA A F YSR FA+GLF LLE ++ V+ + + LN+
Sbjct: 126 LATTIAAIAQ-NSQFKYSRLFAIGLFSLLEQTDSELAQDQKQLEEVINNISSGLNLPSEK 184
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYV 267
+ +DL++YR+ L K+ QA+ ++++ +
Sbjct: 185 LQKDLELYRSNLEKMAQARVVIEDAI 210
>gi|359462375|ref|ZP_09250938.1| Thf1-like protein [Acaryochloris sp. CCMEE 5410]
Length = 214
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 68/188 (36%), Positives = 115/188 (61%), Gaps = 16/188 (8%)
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
+V+ HL+R ++YDP+FALG T +DR M+GY E D++AIF A A + DP Q +
Sbjct: 1 MVEMHLLRVNEDFRYDPIFALGVTTSFDRFMDGYQPENDKDAIFSAICKAQEADPVQMQK 60
Query: 160 DAQKLEEWARGQTASSLVEFPSKEG-----EVEGLLKDIAERASGKGNFSYSRFFAVGLF 214
D Q+L E A+ ++A ++++ ++ E++ L++IA+ F YSR FA+GLF
Sbjct: 61 DGQRLTELAQSKSAQEMLDWITQAANSGGDELQWQLRNIAQNPK----FKYSRLFAIGLF 116
Query: 215 RLLELA--NATE-----PTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYV 267
LLEL+ N T+ L +C VLN+++ + +DL++YR L K+ Q ++ + + +
Sbjct: 117 TLLELSEGNITQDEESLAEFLPNICTVLNISESKLQKDLEIYRGNLDKIAQVRQAMDDIL 176
Query: 268 DREKKKRE 275
+ +KK+RE
Sbjct: 177 EAQKKRRE 184
>gi|425459592|ref|ZP_18839078.1| Protein thf1 [Microcystis aeruginosa PCC 9808]
gi|389822632|emb|CCI29709.1| Protein thf1 [Microcystis aeruginosa PCC 9808]
Length = 233
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 75/219 (34%), Positives = 125/219 (57%), Gaps = 14/219 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRHDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
L D F YSR FA+GL+ +L A EP +L+K +L+++
Sbjct: 130 -LSDSLVSVINAAKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 187
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+ +DLDVYR L K+ Q +++++ ++ EKKKR+++ +
Sbjct: 188 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQ 226
>gi|37520969|ref|NP_924346.1| Thf1-like protein [Gloeobacter violaceus PCC 7421]
gi|81710432|sp|Q7NKS7.1|THF1_GLOVI RecName: Full=Protein thf1
gi|35211965|dbj|BAC89341.1| glr1400 [Gloeobacter violaceus PCC 7421]
Length = 228
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 81/221 (36%), Positives = 125/221 (56%), Gaps = 11/221 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F Y RP+ SIY V+ EL+V+ HL+ + +++DP+FA G +T Y LME
Sbjct: 6 TVSDSKRAFFAAYPRPVNSIYRRVIDELLVEVHLLITNQDFRHDPLFATGLLTAYQALME 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV-EGLL 190
GY E R+AI +A TAL+ EQ DA + A A ++E + + E +G L
Sbjct: 66 GYTPVEQRDAILRALCTALELSYEQLHTDAAQWRAIAAELPAQEVLEVMAGKREAGDGRL 125
Query: 191 KDIAERASGKGN---FSYSRFFAVGLFRLLELAN----ATEPTVLEKL---CAVLNVNKR 240
K + + +G N F YSR F++GL +LE A +E LE+L C L ++
Sbjct: 126 KAMGDTLAGIANAERFKYSRLFSLGLANILEQAGRAAAMSEKDRLERLQQICTYLKLDYN 185
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 281
V RDLD + ++L ++ ++KE++ E E++KREER Q
Sbjct: 186 RVKRDLDFFHSVLERIKRSKEVVDELSQTERRKREERAVSQ 226
>gi|440752363|ref|ZP_20931566.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
TAIHU98]
gi|440176856|gb|ELP56129.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
TAIHU98]
Length = 228
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 75/219 (34%), Positives = 125/219 (57%), Gaps = 14/219 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 6 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 66 GYRPGEDKPNIFNALCQAVNGNPEVYRHDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 124
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
L D F YSR FA+GL+ +L A EP +L+K +L+++
Sbjct: 125 -LSDSLVSVINAAKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 182
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+ +DLDVYR L K+ Q +++++ ++ EKKKR+++ +
Sbjct: 183 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQ 221
>gi|422302142|ref|ZP_16389506.1| Protein thf1 [Microcystis aeruginosa PCC 9806]
gi|389788699|emb|CCI15466.1| Protein thf1 [Microcystis aeruginosa PCC 9806]
Length = 233
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 79/226 (34%), Positives = 128/226 (56%), Gaps = 19/226 (8%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
L D F YSR FA+GL+ +L A EP +L+K +L+++
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREP-ILQKFSEILHLSSE 187
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR-----EERTEPQ 281
+ +DLDVYR+ L K+ Q +++++ ++ EKKKR E++T PQ
Sbjct: 188 KLQKDLDVYRSNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQTTPQ 233
>gi|22298677|ref|NP_681924.1| Thf1-like protein [Thermosynechococcus elongatus BP-1]
gi|81743247|sp|Q8DJT8.1|THF1_THEEB RecName: Full=Protein thf1
gi|22294857|dbj|BAC08686.1| tlr1134 [Thermosynechococcus elongatus BP-1]
Length = 222
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 85/223 (38%), Positives = 129/223 (57%), Gaps = 16/223 (7%)
Query: 70 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
P TV++TK F + RPI SIY ++EL+V+ HL+R ++Y P+FALG VT +D+
Sbjct: 4 PRTVSDTKRAFYAAHTRPIHSIYRRFIEELLVEIHLLRVNVDFRYSPLFALGVVTAFDQF 63
Query: 130 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 189
MEGY E DR+ IF A A + +P+Q + DA +++ + L E S G+
Sbjct: 64 MEGYQPEGDRDRIFHALCVAEEMNPQQLKEDAASWQQYQGRPLSQILDELNS--GQPSAP 121
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLL-ELANATEPTV-----LEKLCAVLNVNKRSVD 243
L + +GK YSR AVGL+ L ELA E T+ L++L V+ + V
Sbjct: 122 LNSLNH--TGK----YSRLHAVGLYAFLQELAG--EVTIHLNETLDQLAPVIPLPIEKVK 173
Query: 244 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 286
RDL++YR+ L K+ QA+ L+KE V++E+K+R ++T A +A
Sbjct: 174 RDLELYRSNLDKINQARSLMKELVEQERKRRAQQTSAPPAVDA 216
>gi|425436789|ref|ZP_18817221.1| Protein thf1 [Microcystis aeruginosa PCC 9432]
gi|425451594|ref|ZP_18831415.1| Protein thf1 [Microcystis aeruginosa PCC 7941]
gi|389678450|emb|CCH92698.1| Protein thf1 [Microcystis aeruginosa PCC 9432]
gi|389767069|emb|CCI07461.1| Protein thf1 [Microcystis aeruginosa PCC 7941]
Length = 233
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 75/219 (34%), Positives = 125/219 (57%), Gaps = 14/219 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENIIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
L D F YSR FA+GL+ +L A EP +L+K +L+++
Sbjct: 130 -LSDSLVSVINAAKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 187
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+ +DLDVYR L K+ Q +++++ ++ EKKKR+++ +
Sbjct: 188 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQ 226
>gi|425445848|ref|ZP_18825868.1| Protein thf1 [Microcystis aeruginosa PCC 9443]
gi|389734049|emb|CCI02237.1| Protein thf1 [Microcystis aeruginosa PCC 9443]
Length = 233
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 79/226 (34%), Positives = 127/226 (56%), Gaps = 19/226 (8%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
L D F YSR FA+GL+ +L A EP +L+K +L+++
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 187
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR-----EERTEPQ 281
+ +DLDVYR L K+ Q +++++ ++ EKKKR E++T PQ
Sbjct: 188 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQTTPQ 233
>gi|443328840|ref|ZP_21057433.1| photosystem II biogenesis protein Psp29 [Xenococcus sp. PCC 7305]
gi|442791576|gb|ELS01070.1| photosystem II biogenesis protein Psp29 [Xenococcus sp. PCC 7305]
Length = 270
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 73/227 (32%), Positives = 131/227 (57%), Gaps = 12/227 (5%)
Query: 63 MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
++T T++ TV++TK +F Y +PI S+Y +++EL+V+ HL+ ++ DP+F LG
Sbjct: 4 IATVTNLK-TVSDTKRSFYNNYNKPINSVYRRIVEELLVEMHLLSVNADFKSDPIFYLGV 62
Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
V+ ++RLM+GY ++D+ AIF A A+ DPE YR A L A+ ++ L+ + +
Sbjct: 63 VSCFERLMQGYQPDQDKGAIFNALCRAVDGDPESYRAQAGNLLAIAKEKSGEELIAWLGE 122
Query: 183 EGEVEGLLKDIAE---RASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLC 232
+ G ++IAE + NF YSR F +GL+ LLE A+A + E +
Sbjct: 123 PTAIAG-AENIAETIKSIAANANFKYSRPFGIGLYTLLEEADAKLLEDSDKRNEIFENIA 181
Query: 233 AVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
L++ + +DL++YR+ L K+ Q + +++ + +K+RE+R +
Sbjct: 182 KTLSLPGDKMKKDLELYRSNLEKMEQVLKAIEDALQASRKQREKRAQ 228
>gi|425453632|ref|ZP_18833389.1| Protein thf1 [Microcystis aeruginosa PCC 9807]
gi|389800936|emb|CCI19831.1| Protein thf1 [Microcystis aeruginosa PCC 9807]
Length = 233
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/219 (34%), Positives = 125/219 (57%), Gaps = 14/219 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
L D F YSR FA+GL+ +L A EP +L+K +L+++
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREP-ILQKFSEILHLSSE 187
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+ +DLDVYR L K+ Q +++++ ++ EKKKR+++ +
Sbjct: 188 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQ 226
>gi|443669636|ref|ZP_21134837.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
DIANCHI905]
gi|159030831|emb|CAO88510.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443330085|gb|ELS44832.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
DIANCHI905]
Length = 228
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 75/219 (34%), Positives = 125/219 (57%), Gaps = 14/219 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 6 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 66 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENIIAIAKETNIDSLLSQLQNPALGGNNQ- 124
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 240
L D F YSR FA+GL+ +L A EP +L+K +L+++
Sbjct: 125 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 182
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+ +DLDVYR L K+ Q +++++ ++ EKKKR+++ +
Sbjct: 183 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQ 221
>gi|390439536|ref|ZP_10227927.1| Protein thf1 [Microcystis sp. T1-4]
gi|389837025|emb|CCI32051.1| Protein thf1 [Microcystis sp. T1-4]
Length = 233
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/230 (33%), Positives = 125/230 (54%), Gaps = 36/230 (15%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY ED+ IF A A+ +PE YR DA+ + A KE ++ LL
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIA-------------KETNIDSLLS 117
Query: 192 DIAERASGKGN--------------FSYSRFFAVGLFRLLELANAT--------EPTVLE 229
+ +A G N F YSR FA+GL+ +L A EP +L+
Sbjct: 118 QLQNQALGGDNQLSDSLVSLINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREP-ILQ 176
Query: 230 KLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
K +L+++ + +DLDVYR L K+ Q +++++ ++ EKKKR+++ +
Sbjct: 177 KFSEILHLSGEKLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQ 226
>gi|425470743|ref|ZP_18849603.1| Protein thf1 [Microcystis aeruginosa PCC 9701]
gi|389883502|emb|CCI36111.1| Protein thf1 [Microcystis aeruginosa PCC 9701]
Length = 233
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 77/225 (34%), Positives = 125/225 (55%), Gaps = 17/225 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 241
L D F YSR FA+GL+ +L A +L+K +L+++
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREQILQKFSEILHLSSEK 188
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR-----EERTEPQ 281
+ +DLDVYR L K+ Q +++++ ++ EKKKR E++T PQ
Sbjct: 189 LQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQTTPQ 233
>gi|425441488|ref|ZP_18821762.1| Protein thf1 [Microcystis aeruginosa PCC 9717]
gi|425463770|ref|ZP_18843100.1| Protein thf1 [Microcystis aeruginosa PCC 9809]
gi|389717772|emb|CCH98181.1| Protein thf1 [Microcystis aeruginosa PCC 9717]
gi|389829228|emb|CCI29632.1| Protein thf1 [Microcystis aeruginosa PCC 9809]
Length = 233
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/218 (33%), Positives = 122/218 (55%), Gaps = 12/218 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 188
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 241
L D F YSR FA+GL+ +L A +L+K +L ++
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREQILQKFSEILRLSSEK 188
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+ +DLDVYR L K+ Q +++++ ++ EKKKR+++ +
Sbjct: 189 LQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQ 226
>gi|434398071|ref|YP_007132075.1| Protein thf1 [Stanieria cyanosphaera PCC 7437]
gi|428269168|gb|AFZ35109.1| Protein thf1 [Stanieria cyanosphaera PCC 7437]
Length = 238
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 77/235 (32%), Positives = 136/235 (57%), Gaps = 12/235 (5%)
Query: 60 IQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFA 119
I + TD TV++ K +F + + RPI S+Y V++EL+V+ HL+ ++ DP++
Sbjct: 4 IAVGNKTTDNIRTVSDAKRDFYQHHTRPINSVYRRVVEELLVEMHLLSVNVDFKSDPIYY 63
Query: 120 LGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 179
LG VT ++RLM+GY E+D+E+IF A A+ EDPE+ R A L A+ ++ LV +
Sbjct: 64 LGVVTSFERLMQGYRPEQDKESIFNALCRAVGEDPERNRAQAGSLLNLAKNKSPQELVAW 123
Query: 180 PSKEGEVEG---LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLE 229
S+ +E +++ I AS +F YSR FA+GL+ LLE ++ + +LE
Sbjct: 124 LSEPTPLENYHDIIEPIKAIASNP-HFKYSRLFAIGLYTLLEESDPEILKDVSKRNEILE 182
Query: 230 KLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDR-EKKKREERTEPQKA 283
+ L++ +++DL++YR+ L K+ Q ++++ + K+K + + EP+ A
Sbjct: 183 SIATQLHLPGEKMNKDLELYRSNLEKMEQLLSVIEDVLQAGRKQKNQPKPEPETA 237
>gi|254423933|ref|ZP_05037651.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
7335]
gi|196191422|gb|EDX86386.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
7335]
Length = 250
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 120/208 (57%), Gaps = 14/208 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI +IY V++EL+V+ HL+ + YD ++ALG V+ YDR M+
Sbjct: 6 TVSDTKRAFYSQHTRPINAIYRRVVEELMVEAHLLLVNADFNYDSIYALGVVSTYDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS--KEGEVEG- 188
GY DR+ I++A + A + DP+QYR DA++L ++ S+ F S E + E
Sbjct: 66 GYEPAGDRDNIYRAILQANEADPDQYRRDAEEL--LGVAKSLPSIDAFKSILDEAKTESG 123
Query: 189 --LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNK 239
LK +A F YSR FA+GL+ ++E +A ++ ++ + + +N+
Sbjct: 124 SDTLKANLHKAISNPKFKYSRLFAIGLYNVIESIDADMLNDKDKRDALMAEIASTIGLNE 183
Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYV 267
+ +D+D+YR L K+ QA+E++K+ +
Sbjct: 184 DLLKKDIDLYRGNLEKMAQAQEVMKDMI 211
>gi|423062334|ref|ZP_17051124.1| Thf1-like protein [Arthrospira platensis C1]
gi|406716242|gb|EKD11393.1| Thf1-like protein [Arthrospira platensis C1]
Length = 215
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 70/192 (36%), Positives = 109/192 (56%), Gaps = 9/192 (4%)
Query: 95 VLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDP 154
+++EL+V+ HL+ ++YDP++ALG VT +DR M+GY E D+ +I+ A I A + DP
Sbjct: 1 MVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYIPEADKLSIWAALIMAQESDP 60
Query: 155 EQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVG 212
QYR DA LE A + L E ++E + L+ + F YSR FA+G
Sbjct: 61 NQYRADATALEAQAATLSVKDLTERAKIAQESSGDDPLQSCFHAIANNPKFKYSRLFAIG 120
Query: 213 LFRLLELANATEP-------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 265
L+ LLE ++ T T+L L + K +++DLD+YR L K+ QA+ +++E
Sbjct: 121 LYTLLEKSDVTAAQDSEGLKTILSDFSEALRLPKDKLEKDLDLYRTNLEKVAQARLMVEE 180
Query: 266 YVDREKKKREER 277
E+KKRE+R
Sbjct: 181 MTQAERKKREQR 192
>gi|166367182|ref|YP_001659455.1| Thf1-like protein [Microcystis aeruginosa NIES-843]
gi|166089555|dbj|BAG04263.1| Psb29 Photosystem II sub-stoichiometric subunit [Microcystis
aeruginosa NIES-843]
Length = 233
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 74/229 (32%), Positives = 121/229 (52%), Gaps = 34/229 (14%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY ED+ IF A A+ +PE YR DA+ + A KE ++ LL
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIA-------------KETNIDSLLS 117
Query: 192 DIAERASGKGN--------------FSYSRFFAVGLFRLLELANAT-------EPTVLEK 230
+ A G N F YSR FA+GL+ +L A +L+K
Sbjct: 118 QLQNPALGANNQLSDSLVSLINAPKFKYSRLFAIGLYTILAEAQPDIIKEKEKREQILQK 177
Query: 231 LCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+L ++ + +DLDVYR L K+ Q +++++ ++ EKKKR+++ +
Sbjct: 178 FSEILRLSSEKLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQ 226
>gi|170077355|ref|YP_001733993.1| Thf1-like protein [Synechococcus sp. PCC 7002]
gi|254784146|sp|B1XHY6.1|THF1_SYNP2 RecName: Full=Protein thf1
gi|169885024|gb|ACA98737.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
Length = 254
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 74/215 (34%), Positives = 125/215 (58%), Gaps = 15/215 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + RPI SI+ V++EL+V+ HL+ ++YDP +ALG VT ++R M+
Sbjct: 6 TVSDTKRDFYTHHTRPINSIFRRVVEELLVEMHLLSVNADFRYDPFYALGVVTSFERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
GY E D+ +IFQ+ A+ D +Y+ DA L E A+ + + L+E ++ EG
Sbjct: 66 GYRPEADKVSIFQSMCQAIGGDANRYKEDAMALVELAKRCSGTQLIECFRQDVPPEGAQE 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK----------LCAVLNVNK 239
L + E + +F YSR FA+G++ L +EP +LE + A LN+ +
Sbjct: 126 LWEKIEAIAKNDHFKYSRLFAIGVYTFL---GESEPQLLEDTEKRDEMLTTVTAGLNLPE 182
Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR 274
+ +DLD+YR+ L K+ Q E+L++ + E+++R
Sbjct: 183 EKMKKDLDLYRSNLEKMNQVLEVLEDALAVERQRR 217
>gi|427711975|ref|YP_007060599.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
6312]
gi|427376104|gb|AFY60056.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
6312]
Length = 245
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 82/229 (35%), Positives = 125/229 (54%), Gaps = 26/229 (11%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI SI+ ++EL+V+ HL+R + Y P+ ALG VT Y+ M
Sbjct: 6 TVSDTKKAFYAAHTRPIHSIFRRFVEELLVEVHLLRVNTNFVYSPLLALGIVTAYNHFMS 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA------SSLVEFPSKEGE 185
GY E DR +IF ++ A + DP+Q + DA + W + L + S+ G+
Sbjct: 66 GYRPETDRNSIFTSFAIAEEFDPQQLQADAAR---WEELAGLELEELQTRLQAWISEGGD 122
Query: 186 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA--NATEPTV-----LEKLCAVLNV 237
L+D K YSR A+GL+ LLE A N T+ LE+L V+N+
Sbjct: 123 PWHNSLRDAVNNPQTK----YSRLQAIGLYHLLEQAAGNLTQELTTLEASLEQLSPVVNL 178
Query: 238 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 286
V +DL++YR+ L K++QA++++ E V+ E+K+RE Q ANEA
Sbjct: 179 PVDKVKKDLELYRSNLDKMIQAQKIMAELVEVERKRRE-----QAANEA 222
>gi|428223137|ref|YP_007107307.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
7502]
gi|427996477|gb|AFY75172.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
7502]
Length = 226
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 115/205 (56%), Gaps = 10/205 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TVA+ K +F K + +P+ SIY V+ EL+V+ HL+R + + YD +FALG T +DR M
Sbjct: 6 TVADAKHDFYKAFSKPVNSIYRRVVDELLVEVHLLRVSQNFGYDSIFALGLATAFDRFMA 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE--GEVEGL 189
GY E D E IF+ AL DP+Q R ++ L E ++ A F + E +++ L
Sbjct: 66 GYQPESDLEPIFKGLCQALLFDPDQIRQESAHLIELSKQFPAEVKSLFTTLEAGADLDTL 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 242
+ I A+ F YSR FAVG+F LLE A+ ++ ++ L +N +
Sbjct: 126 MGQIRAIATNP-KFKYSRLFAVGVFILLETADPEAIADQDKRQALITQVGDTLKINSERL 184
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYV 267
+DLD+YR+ L K+ Q ++++++ V
Sbjct: 185 LKDLDLYRSNLEKIQQGRQMMEDMV 209
>gi|427726046|ref|YP_007073323.1| Protein thf1 [Leptolyngbya sp. PCC 7376]
gi|427357766|gb|AFY40489.1| Protein thf1 [Leptolyngbya sp. PCC 7376]
Length = 246
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 76/220 (34%), Positives = 118/220 (53%), Gaps = 23/220 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++ K +F + RPI SI+ V++EL+V+ HL+ ++YDP +ALG VT Y+R M+
Sbjct: 6 TVSDAKRDFYGQHTRPINSIFRRVVEELLVEMHLVSVNVDFRYDPFYALGIVTSYERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE------FPSKEGE 185
GY E D+ +IFQA A+ E Y+ DA L E A+ + LV+ P GE
Sbjct: 66 GYRPESDKISIFQAMCQAVGGSAEFYKNDATALVELAKRCSGQQLVDCFRQDNAPEGAGE 125
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK----------LCAVL 235
+ ++ IA F YSR FA+GL+ L EP +LE L +
Sbjct: 126 LWAKVEAIA----ANKKFKYSRLFAIGLYTFL---GEAEPALLEDADKRDEMLATLTEAM 178
Query: 236 NVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 275
N+ + +DLD+YR+ L K+ Q ++++ + E+K+RE
Sbjct: 179 NLPGEKMKKDLDLYRSNLEKMTQVLAVIEDALVAERKRRE 218
>gi|443478915|ref|ZP_21068602.1| Protein thf1 [Pseudanabaena biceps PCC 7429]
gi|443015728|gb|ELS30564.1| Protein thf1 [Pseudanabaena biceps PCC 7429]
Length = 240
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 74/218 (33%), Positives = 121/218 (55%), Gaps = 16/218 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + +P+ +Y V+ EL+V+ HL++ +T+ YD +FALGFVT +DR
Sbjct: 6 TVSDTKKDFYLAFPKPVNQVYRRVVDELLVEIHLLKVNQTFVYDAIFALGFVTTFDRFTA 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWAR---GQTASSLVEFPSKEG--EV 186
GY E DR A+F A AL+ D ++ R DA L + A + L S +
Sbjct: 66 GYKPETDRFAVFHALCAALQFDSDRIRQDAATLSDLATRSPNDIKTLLTNLDSGISLEPL 125
Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLEKLCAVLNVNK 239
G L+ I S K NF YSR VGL+ LLE+++ E +++ + L
Sbjct: 126 SGQLQII----STKENFKYSRLLGVGLYALLEISDPEEIADSAKREELIKLVGETLKFGS 181
Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
+ +D+D+YR+ L K+ QA++++ + V+ E+KKR ++
Sbjct: 182 DRLLKDVDLYRSNLDKIEQARQMIADMVEAERKKRSQK 219
>gi|209522934|ref|ZP_03271491.1| Thf1-like protein [Arthrospira maxima CS-328]
gi|209496521|gb|EDZ96819.1| Thf1-like protein [Arthrospira maxima CS-328]
Length = 210
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/187 (36%), Positives = 104/187 (55%), Gaps = 9/187 (4%)
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
+V+ HL+ ++YDP++ALG VT +DR M+GY E D+ +I+ A I A + DP QYR
Sbjct: 1 MVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYIPEADKLSIWAALIMAQESDPNQYRA 60
Query: 160 DAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL 217
DA LE A + L E ++E + L+ + F YSR FA+GL+ LL
Sbjct: 61 DATALEAQAATLSVKDLTERAKIAQESSGDDPLQSCFHAIANNPKFKYSRLFAIGLYTLL 120
Query: 218 ELANATEP-------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDRE 270
E ++ T T+L L + K +++DLD+YR L K+ QA+ +++E E
Sbjct: 121 EKSDVTAAQDSEGLKTILSDFSEALRLPKDKLEKDLDLYRTNLEKVAQARLMVEEMTQAE 180
Query: 271 KKKREER 277
+KKRE+R
Sbjct: 181 RKKREQR 187
>gi|126658461|ref|ZP_01729609.1| hypothetical protein CY0110_21090 [Cyanothece sp. CCY0110]
gi|126620203|gb|EAZ90924.1| hypothetical protein CY0110_21090 [Cyanothece sp. CCY0110]
Length = 246
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 73/212 (34%), Positives = 119/212 (56%), Gaps = 12/212 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI SIY ++EL+V+ HL+ ++YDP++ALG VT ++R M+
Sbjct: 6 TVSDTKRKFYGYHTRPINSIYRRFVEELLVEMHLLSVNVDFKYDPIYALGVVTSFERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE--VEGL 189
GY E D+ +IF A A+ + EQY +A+ L A+G S+ EF K G+ +G+
Sbjct: 66 GYRPESDKASIFNALCQAVDGNSEQYHQEAEALINEAKG---LSMTEFKDKLGQEGGDGI 122
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNVNKRSV 242
L + F YSR F VGL+ LL EL E ++++ L + +
Sbjct: 123 LWGTCNAIAQNPKFKYSRLFGVGLYTLLMEIDPELVKEEEKRNQTIKEVSEALQFSSDKL 182
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKR 274
+DLD+YR+ L K+ Q ++++ ++ ++KKR
Sbjct: 183 QKDLDLYRSNLDKMQQLLTVIEDTLEADRKKR 214
>gi|428203624|ref|YP_007082213.1| photosystem II biogenesis protein Psp29 [Pleurocapsa sp. PCC 7327]
gi|427981056|gb|AFY78656.1| photosystem II biogenesis protein Psp29 [Pleurocapsa sp. PCC 7327]
Length = 241
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 73/225 (32%), Positives = 130/225 (57%), Gaps = 10/225 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++ K +F + RPI SIY ++ELIV+ HL+ ++YD ++ALG VT ++R M+
Sbjct: 11 TVSDAKRDFYTHHTRPINSIYRRFVEELIVEMHLLSVNTDFRYDAIYALGVVTAFERFMQ 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE--FPSKEGEVEGL 189
GY E+D+ +IF A A + EQYR +A ++ A+ + L+ S E
Sbjct: 71 GYQPEQDKSSIFAALCQATGGNWEQYRQEAGEILAQAKQMSVQELIAKINSSTPTGGENR 130
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT---EPT----VLEKLCAVLNVNKRSV 242
L + + + + N+ YSR FA+GL+ LL A+ +P L+++ L+++ +
Sbjct: 131 LVETLQAIANRSNYKYSRLFAIGLYTLLAEADPDILRDPEKRDRTLKEVTEALHLSPEKL 190
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAI 287
+DLD+YR+ L K+ Q ++L+E ++ E+KKR+++ +P++ I
Sbjct: 191 QKDLDLYRSNLDKMDQLLKVLEEALEAERKKRQQQ-KPEQGTAQI 234
>gi|97202816|sp|P0C1D1.1|THF1_SYNJB RecName: Full=Protein thf1
Length = 239
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 124/232 (53%), Gaps = 14/232 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T++ TK F Y RPI ++Y V++EL+V+ HL T+ YDP FALG VT+YD LME
Sbjct: 6 TLSATKAAFFSAYPRPINAVYRRVVEELLVELHLTTVNSTFVYDPFFALGLVTLYDGLME 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE--GL 189
Y E REAIF A AL PE R +A+ L E ++ + E E G
Sbjct: 66 AYHPPEQREAIFNALCKALHLKPEVLRKNARDLLELMGSGDPRQRLDLLCLKPEAEDVGG 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT---EPT-----VLEKLCAVLNVNKRS 241
LK I ER + + ++YSR AVGL+ E+ + EP LE + + L +
Sbjct: 126 LKAILERMT-QPPYAYSRVLAVGLYTAYEVVAKSLYEEPEERTRRFLENVVSKLPFSTER 184
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGE 293
V +DL++YR+ L ++ QA+ +++E V ++++E R Q A + LG+
Sbjct: 185 VRKDLELYRSSLDRMKQARAVVEEMVKAARRQQERR---QSAASLPETSLGD 233
>gi|172035357|ref|YP_001801858.1| Thf1-like protein [Cyanothece sp. ATCC 51142]
gi|354555452|ref|ZP_08974753.1| Protein thf1 [Cyanothece sp. ATCC 51472]
gi|254784140|sp|B1WNF0.1|THF1_CYAA5 RecName: Full=Protein thf1
gi|171696811|gb|ACB49792.1| photosystem II 22 kD protein [Cyanothece sp. ATCC 51142]
gi|353552511|gb|EHC21906.1| Protein thf1 [Cyanothece sp. ATCC 51472]
Length = 242
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 118/210 (56%), Gaps = 8/210 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + RPI SIY ++EL+V+ HL+ ++YDP++ALG VT ++R M+
Sbjct: 6 TVSDTKRKFYGYHTRPINSIYRRFVEELLVEMHLLSVNVDFKYDPIYALGVVTSFERFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E D+ +IF A A+ + EQY +A+ L A+G + + E +EG +G+L
Sbjct: 66 GYSPESDKTSIFNALCQAVDGNSEQYHQEAEALINEAKGLSITEFKEKLGQEGG-DGILW 124
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNVNKRSVDR 244
+ F YSR F VGL+ LL +L + ++++ L + + +
Sbjct: 125 GTCGAIAQNPKFKYSRLFGVGLYTLLMEIDPDLVKEEDKRNQTIKEVSDALQFSSDKLQK 184
Query: 245 DLDVYRNLLSKLLQAKELLKEYVDREKKKR 274
DLD+YR+ L K+ Q ++++ ++ ++KKR
Sbjct: 185 DLDLYRSNLDKMQQLLTVIEDTLEADRKKR 214
>gi|409992261|ref|ZP_11275462.1| inositol phosphatase [Arthrospira platensis str. Paraca]
gi|409936888|gb|EKN78351.1| inositol phosphatase [Arthrospira platensis str. Paraca]
Length = 210
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 67/187 (35%), Positives = 101/187 (54%), Gaps = 9/187 (4%)
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
+V+ HL+ ++YDP++ALG VT +DR M+GY E D+ +I+ A I A + DP QYR
Sbjct: 1 MVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYTPETDKLSIWAALIGAQESDPNQYRA 60
Query: 160 DAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL 217
DA LE A L + ++E + L+ + F YSR A+GL+ LL
Sbjct: 61 DATALEAQAASLAVKDLTDKAKIAQESSGDDPLQSCFHAIANNPKFKYSRLLAIGLYTLL 120
Query: 218 ELANATEP-------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDRE 270
E ++AT T+L L + K + +DLD+YR L K+ QA+ ++ E E
Sbjct: 121 EKSDATAAQDSEGLKTILSDFSEALRLPKDKLVKDLDLYRTNLEKVAQARLMVDEMTQAE 180
Query: 271 KKKREER 277
+KKRE+R
Sbjct: 181 RKKREQR 187
>gi|86606816|ref|YP_475579.1| Thf1-like protein [Synechococcus sp. JA-3-3Ab]
gi|97202812|sp|Q2JSQ3.1|THF1_SYNJA RecName: Full=Protein thf1
gi|86555358|gb|ABD00316.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length = 239
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 82/218 (37%), Positives = 116/218 (53%), Gaps = 15/218 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T++ TK F Y RPI + Y V++EL+V+ HL + YDP FALG VT+YD LME
Sbjct: 6 TLSATKAAFFSAYPRPINAAYRRVVEELLVELHLTTVNSAFVYDPFFALGLVTLYDSLME 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG----QTASSLVEFPSKEGEVE 187
Y E REAIF A AL PE R +A+ L E R Q + L P E E
Sbjct: 66 AYHPPEQREAIFNALCKALHLKPEVLRKNARDLLELMRSGDPVQRYNLLCLKP--EAEDV 123
Query: 188 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT---EP-----TVLEKLCAVLNVNK 239
G LK I +R + + ++YSR AVGL+ E + EP LE + L +
Sbjct: 124 GGLKAILQRMT-QPPYAYSRVLAVGLYTAYEAVATSLYKEPEERTRHFLEDVIGNLPFSP 182
Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
V +DL++YR+ L +L QA+ +++E V ++++E R
Sbjct: 183 ERVKKDLELYRSNLDRLKQARAIVEEMVKAARRQQERR 220
>gi|148242504|ref|YP_001227661.1| Thf1-like protein [Synechococcus sp. RCC307]
gi|147850814|emb|CAK28308.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
Length = 237
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 73/223 (32%), Positives = 123/223 (55%), Gaps = 10/223 (4%)
Query: 70 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 129
P TVA++K F Y IP +Y V+ EL+V+ HL+ + +Q D +FA+G V+D L
Sbjct: 6 PQTVADSKRRFYAAYPHVIPGLYRRVVDELLVELHLLAGQAGFQADSLFAMGLTQVFDNL 65
Query: 130 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEG 188
M+G+ E ++ +F A + +Q R DA++L E + + + ++G+
Sbjct: 66 MQGFKPAERQKELFAAICSGAGLKADQLRKDAKQLREHLVPHGEAEIKSWIEQQGQGAPD 125
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLE---LANATEPTVLE----KLCAVLNVNKRS 241
+LK + ++A G+ +F YSR AVGL LL+ + +P L+ +L + + K
Sbjct: 126 VLKHVLQQA-GRSDFHYSRLHAVGLMGLLQDLSGGDDQDPQALQERAHQLGHSMGLQKDK 184
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER-TEPQKA 283
+ +D+ +Y + L K+ QA ELL+E V E++KRE+R EP A
Sbjct: 185 LQKDMGLYASNLEKMSQAVELLEETVAAERRKREQRQGEPASA 227
>gi|119510704|ref|ZP_01629832.1| hypothetical protein N9414_22068 [Nodularia spumigena CCY9414]
gi|119464658|gb|EAW45567.1| hypothetical protein N9414_22068 [Nodularia spumigena CCY9414]
Length = 200
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 117/193 (60%), Gaps = 17/193 (8%)
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
+V+ HL+ + Y+P++ALG VT +DR M+GY E+D+E+IFQA A++++P++YR
Sbjct: 1 MVEMHLLSVNSGFSYNPIYALGVVTSFDRFMQGYLPEQDQESIFQALCQAVEQEPQRYRE 60
Query: 160 DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 213
DA++L+ A+ + L+ + S ++ +++ L+ IA + F YSR FAVGL
Sbjct: 61 DAKRLQALAKDLPVNDLIAWLSQTTHLDRDPDLQAQLQAIAHNS----EFKYSRLFAVGL 116
Query: 214 FRLLELANA-------TEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEY 266
F LLE ++ L+ + A L+++ +++DL++Y + L K+ QA ++ +
Sbjct: 117 FTLLEQSDPELVKDEKQRTEALKTIAAGLHLSDEKLNKDLELYSSNLEKMAQALVVMADM 176
Query: 267 VDREKKKREERTE 279
+ ++KKRE+R +
Sbjct: 177 LSADRKKREQRQQ 189
>gi|428773451|ref|YP_007165239.1| photosystem II biogenesis protein Psp29 [Cyanobacterium stanieri
PCC 7202]
gi|428687730|gb|AFZ47590.1| photosystem II biogenesis protein Psp29 [Cyanobacterium stanieri
PCC 7202]
Length = 233
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 79/234 (33%), Positives = 124/234 (52%), Gaps = 27/234 (11%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++T+ F + + RPI SIY V+QEL+V+ HL+ +Q D V+A+G +++ M
Sbjct: 6 TVSDTRRAFYQYHTRPINSIYRQVVQELMVEMHLLSVNTDFQPDAVYAVGVCQSFEQFMT 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 189
GY EED+ +IF A A++ +P+ YR ++ L + G++A LV + GL
Sbjct: 66 GYKPEEDKTSIFNALCKAIEANPDDYRHQSESLLNFVEGKSAEDLVNWLLNPVADNGLDE 125
Query: 190 -----LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTV------LEKLCAV---- 234
LK I ER F YSR F +G + L+ N P V L KL A
Sbjct: 126 NIVNSLKSILERER----FKYSRLFGIGFYTLI---NKVAPDVAKDEEKLAKLIAPYSEK 178
Query: 235 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR---EERTEPQKANE 285
L++ + +D+D+YR+ L K+ Q ++ E ++ KKKR E+ E ++ANE
Sbjct: 179 LDLPVDKLKKDVDLYRSNLDKINQMLVVIAETIEASKKKRINIEKTEEKEEANE 232
>gi|428769945|ref|YP_007161735.1| Protein thf1 [Cyanobacterium aponinum PCC 10605]
gi|428684224|gb|AFZ53691.1| Protein thf1 [Cyanobacterium aponinum PCC 10605]
Length = 234
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 63/214 (29%), Positives = 120/214 (56%), Gaps = 11/214 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + ++RPI SIY V++EL+V+ HL+ + DP++ LG + + M+
Sbjct: 18 TVSDTKRSFYQHHQRPINSIYRRVVEELMVEMHLLAVNVDFNPDPIYYLGVYQSFQQFMQ 77
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF---PSKEGEVEG 188
GY E D+E+IF A +++ +P++Y +Q L + G++A ++++ PS EG++E
Sbjct: 78 GYKPESDKESIFNALCQSIENNPQEYISKSQTLLNFVEGKSAQEILDWLLNPSGEGDLEA 137
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRS 241
+ F YSR FA+G + L+E + + ++ L L +
Sbjct: 138 VASHWRSNLENP-RFKYSRLFAIGFYTLIEKGDGEFIKDESKFTDFIQPLIDKLQLPVEK 196
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 275
+ +DLD+YR+ L K+ Q ++ + ++ E+KK++
Sbjct: 197 LKKDLDLYRSNLEKMNQMLSVMADVLEAERKKKQ 230
>gi|16330615|ref|NP_441343.1| Thf1-like-protein [Synechocystis sp. PCC 6803]
gi|383322356|ref|YP_005383209.1| hypothetical protein SYNGTI_1447 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325525|ref|YP_005386378.1| hypothetical protein SYNPCCP_1446 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491409|ref|YP_005409085.1| hypothetical protein SYNPCCN_1446 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436676|ref|YP_005651400.1| hypothetical protein SYNGTS_1447 [Synechocystis sp. PCC 6803]
gi|451814773|ref|YP_007451225.1| hypothetical protein MYO_114600 [Synechocystis sp. PCC 6803]
gi|81671042|sp|P73956.1|THF1_SYNY3 RecName: Full=Protein thf1
gi|1653107|dbj|BAA18023.1| sll1414 [Synechocystis sp. PCC 6803]
gi|339273708|dbj|BAK50195.1| hypothetical protein SYNGTS_1447 [Synechocystis sp. PCC 6803]
gi|359271675|dbj|BAL29194.1| hypothetical protein SYNGTI_1447 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274845|dbj|BAL32363.1| hypothetical protein SYNPCCN_1446 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278015|dbj|BAL35532.1| hypothetical protein SYNPCCP_1446 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407958541|dbj|BAM51781.1| Thf1-like-protein [Bacillus subtilis BEST7613]
gi|451780742|gb|AGF51711.1| hypothetical protein MYO_114600 [Synechocystis sp. PCC 6803]
Length = 240
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 72/214 (33%), Positives = 118/214 (55%), Gaps = 8/214 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++ K F Y RPI SIY ++EL+V+ HL+ + YDP+FALG VT ++ M+
Sbjct: 6 TVSDAKRKFFTHYSRPISSIYRRFVEELLVEMHLLSVNIDFTYDPIFALGIVTSFNSFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV-EFPSKEGEVEGLL 190
GY E AIF A + ++P+Q R DA+ + A + V S++ + LL
Sbjct: 66 GYQPAEQLPAIFNALCHGVDQNPDQVRQDAKNVAASAHHIGLDAWVTAAASEQASGDNLL 125
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNVNKRSVD 243
+ + F YSR FA+GL+ LL E+ + E L +L +L+++ V
Sbjct: 126 LNTLTGIHQRHKFKYSRLFAIGLYTLLADQDPEVKDNDEKRQDYLTRLSELLDLSLDKVV 185
Query: 244 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
+DLD+YR+ L K+ Q ++L++ + E+KK+E++
Sbjct: 186 KDLDLYRSNLEKVDQLLKVLEDAAEAERKKKEKQ 219
>gi|67921410|ref|ZP_00514928.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
gi|67856522|gb|EAM51763.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
Length = 245
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 72/232 (31%), Positives = 124/232 (53%), Gaps = 17/232 (7%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK F + +PI SIY ++EL+V+ HL+ + YDP++ALG VT + R M+
Sbjct: 6 TVSDTKRKFYGYHTQPINSIYRRFVEELLVEMHLLSVNIDFSYDPIYALGVVTSFQRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV----- 186
GY E D+ +IF A A+ E+Y +A+ + A+G S+V+F K V
Sbjct: 66 GYSPESDKPSIFNALCQAVDGSSEKYHQEAEAILNEAKGL---SIVDFKDKLTHVTDNQV 122
Query: 187 -EGLLKDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATE--PTVLEKLCAVLNVN 238
EG+L + F YSR A+GL+ LL +L E ++++ L +
Sbjct: 123 GEGVLWGTFGAIAANPKFKYSRLLAIGLYTLLMEIDSDLLKDEEKRTETIKEVSEALKFS 182
Query: 239 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKC 290
+ +DLD+YR+ L K+ Q ++++ ++ ++KKR TE + + E +++
Sbjct: 183 PEKLRKDLDLYRSNLDKMQQLLTVIEDSLEADRKKR-ASTEGKTSAEVVEQT 233
>gi|124023249|ref|YP_001017556.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9303]
gi|123963535|gb|ABM78291.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9303]
Length = 250
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 71/222 (31%), Positives = 117/222 (52%), Gaps = 12/222 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + IPS+Y EL+V+ HL+ +++ + D +FA+G V+D
Sbjct: 14 TIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVFDVFTR 73
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E + +F A + DP R AQK E RG + + ++G +G +
Sbjct: 74 GYRPEAHVKTLFDALCRSCGFDPNALRKQAQKTLESVRGHDLEEVQGWIQQQG--KGAPE 131
Query: 192 DIAE--RASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 242
+A+ R +G F YSR AVGL LL A E + EKL + + K V
Sbjct: 132 ALAQALRNTGSNTFHYSRLMAVGLLSLLASAQGDESSDPEKLSQIAHELSESVGFTKARV 191
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKAN 284
++DL++Y++ L K+ QA EL ++ ++ E++KRE++ E +K N
Sbjct: 192 EKDLNLYKSNLEKMAQAVELSEQILESERRKREQK-ESEKLN 232
>gi|9631702|ref|NP_048481.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]
gi|1131477|gb|AAC96501.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]
gi|448924789|gb|AGE48370.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
AN69C]
Length = 207
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 68/217 (31%), Positives = 116/217 (53%), Gaps = 12/217 (5%)
Query: 63 MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 1 MKAITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHIHRYNKNYTYSDVSALGI 60
Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K
Sbjct: 61 VTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIESLKPYAKSSHLG-----PNK 114
Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
G ++ L DIA + YS F AVG+F+LL++ ++ L +
Sbjct: 115 HGNTLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGEL 170
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 278
V +D+ + +LL K +++ + L + + E KR+ ++
Sbjct: 171 VHKDIATFFSLL-KYIESSQKLADDIREESLKRKSKS 206
>gi|282901466|ref|ZP_06309391.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281193745|gb|EFA68717.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 201
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 112/193 (58%), Gaps = 17/193 (8%)
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
+V+ HL+ + Y+ ++ALG VT +DR M+GY ED +IF A I A+++DP+ YR
Sbjct: 1 MVEMHLLSVNVDFSYNSIYALGVVTTFDRFMQGYQPSEDLVSIFNAIICAVEQDPQVYRQ 60
Query: 160 DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 213
DA KL+ A + L+ + S ++ ++ L+ IA+ NF YSR A+GL
Sbjct: 61 DAAKLKAIANSFSVKDLIAWCSQTTPLDQDANLQAELQAIAQNP----NFKYSRLLAIGL 116
Query: 214 FRLLELAN---ATEPTVLEKLCAV----LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEY 266
F LLEL++ + T + AV L +++ +++DLD+YR+ L K+ QA ++ +
Sbjct: 117 FSLLELSDPEFVKDETQRNQTIAVIAQGLKLSEDKLNKDLDLYRSNLDKMEQALIVMADM 176
Query: 267 VDREKKKREERTE 279
+ ++KKR++R +
Sbjct: 177 LAADRKKRDQRQQ 189
>gi|282898285|ref|ZP_06306276.1| Protein thf1 [Raphidiopsis brookii D9]
gi|281196816|gb|EFA71721.1| Protein thf1 [Raphidiopsis brookii D9]
Length = 202
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 112/193 (58%), Gaps = 17/193 (8%)
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
+V+ HL+ + Y+ ++ALG VT +DR M+GY ED +IF A I A+++DP+ YR
Sbjct: 1 MVEMHLLSVNVDFSYNSIYALGVVTTFDRFMQGYQPSEDLVSIFNAIICAVEQDPQVYRQ 60
Query: 160 DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 213
DA KL+ A + L+ + S ++ ++ L+ IA+ NF YSR A+GL
Sbjct: 61 DAAKLKAIANSFSVKDLIAWCSQTTPLDQDANLQAELQAIAQNP----NFKYSRLLAIGL 116
Query: 214 FRLLELAN---ATEPTVLEKLCAV----LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEY 266
F LLEL++ + T + AV L +++ +++DLD+YR+ L K+ QA ++ +
Sbjct: 117 FSLLELSDPEFVKDETERNQAIAVIAQGLKLSEDKLNKDLDLYRSNLDKMEQALIVMADM 176
Query: 267 VDREKKKREERTE 279
+ ++KKR++R +
Sbjct: 177 LAADRKKRDQRQQ 189
>gi|448927841|gb|AGE51413.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
CviKI]
Length = 232
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 116/218 (53%), Gaps = 12/218 (5%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 25 IMKAITSSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALG 84
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+
Sbjct: 85 IVTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PN 138
Query: 182 KEGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
K G ++ L DI S + YS F AVG+F+LL++ ++ L +
Sbjct: 139 KHGNTLQKSLYDI----SINDKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGE 194
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 278
V +D+ + +LL K +++ + L + + E KR+ ++
Sbjct: 195 LVHKDIATFFSLL-KYIESSQKLADDIREESLKRKSKS 231
>gi|448930219|gb|AGE53784.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
IL-3A]
gi|448933659|gb|AGE57214.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
NE-JV-4]
Length = 207
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 116/217 (53%), Gaps = 12/217 (5%)
Query: 63 MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 1 MKAITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHICRYNKNYTYSDVSALGI 60
Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K
Sbjct: 61 VTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIESLKPYAKSSNLG-----PNK 114
Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
G ++ L DIA + YS F AVG+F+LL++ ++ L +
Sbjct: 115 HGNTLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGEL 170
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 278
V +D+ + +LL K +++ + L + + E +R+ ++
Sbjct: 171 VHKDIATFFSLL-KYIESSQKLADDIREESLRRKSKS 206
>gi|448928860|gb|AGE52429.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
CvsA1]
Length = 207
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 68/217 (31%), Positives = 116/217 (53%), Gaps = 12/217 (5%)
Query: 63 MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 1 MKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALGI 60
Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K
Sbjct: 61 VTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNK 114
Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
G ++ L DI S + YS F AVG+F+LL++ ++ L +
Sbjct: 115 HGNTLQKSLYDI----SINDKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGEL 170
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 278
V +D+ + +LL K +++ + L + + E KR+ ++
Sbjct: 171 VHKDIATFFSLL-KYIESSQKLADDIREESLKRKSKS 206
>gi|254413033|ref|ZP_05026805.1| photosystem II biogenesis protein Psp29 [Coleofasciculus
chthonoplastes PCC 7420]
gi|196180197|gb|EDX75189.1| photosystem II biogenesis protein Psp29 [Coleofasciculus
chthonoplastes PCC 7420]
Length = 208
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 114/196 (58%), Gaps = 17/196 (8%)
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 159
+V+ HL+ ++YDP++ LG V ++R M+GY E D+E+IF A A+ +P+QY+
Sbjct: 1 MVEMHLLAVNVDFKYDPIYVLGVVASFNRFMQGYRPERDKESIFNALCQAVGGNPQQYQD 60
Query: 160 DAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAERASGKGN---FSYSRFFAVGLFRL 216
DA+KL+ +A LV++ +EG +DI + + F YSR FA+GL+ L
Sbjct: 61 DAEKLKAAVGRLSAQELVDWFGSPTPLEG-AEDIHTTVAAIADNPKFKYSRLFAIGLYTL 119
Query: 217 LELANATEP----------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEY 266
LE A EP +L+++ L++ + + +DL++YR+ L K+ QA+ +++
Sbjct: 120 LEQA---EPELVQDAKQSMEMLQRIGQTLHLPQEKLQKDLELYRSNLEKMAQAQIAIEDA 176
Query: 267 VDREKKKREERTEPQK 282
+ ++KKRE+R + +K
Sbjct: 177 IKADRKKREQREQEKK 192
>gi|448931622|gb|AGE55183.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
MA-1E]
Length = 207
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 68/217 (31%), Positives = 116/217 (53%), Gaps = 12/217 (5%)
Query: 63 MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 1 MKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALGI 60
Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K
Sbjct: 61 VTTLDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYFSNIETLKPYAKSSHLG-----PNK 114
Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
G ++ L DI S + YS F AVG+F+LL++ ++ L +
Sbjct: 115 HGNTLQKSLYDI----SINDKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGEL 170
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 278
V +D+ + +LL K +++ + L + + E KR+ ++
Sbjct: 171 VHKDIATFFSLL-KYIESSQKLADDIREESLKRKSKS 206
>gi|157952488|ref|YP_001497380.1| hypothetical protein NY2A_b184R [Paramecium bursaria Chlorella
virus NY2A]
gi|155122715|gb|ABT14583.1| hypothetical protein NY2A_b184R [Paramecium bursaria Chlorella
virus NY2A]
Length = 247
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 115/218 (52%), Gaps = 12/218 (5%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 41 LMKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALG 100
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+
Sbjct: 101 IVTALDSILNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PN 154
Query: 182 KEGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
K G ++ L DIA + YS F A+G+F+LL++ ++ L +
Sbjct: 155 KHGNTLQKSLYDIASN----DKYVYSSFAAIGIFKLLQMNKNYTGNSVKHLSESVGFKGE 210
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 278
V +D+ + +LL K +++ + L + + E K+ + +
Sbjct: 211 IVHKDIATFFSLL-KYIESSQKLADDIREESLKKSKSS 247
>gi|448930916|gb|AGE54479.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
KS1B]
Length = 207
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/217 (31%), Positives = 115/217 (52%), Gaps = 12/217 (5%)
Query: 63 MSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGF 122
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 1 MKAITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHIHRYNKNYTYSDVSALGI 60
Query: 123 VTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK 182
VT D ++ +P +E + I A+I +L EDPE Y + + L+ +A+ P+K
Sbjct: 61 VTTLDSVLNTFPDDE-KVCIKNAFIISLNEDPEMYYSNIEYLKPYAKSSNLG-----PNK 114
Query: 183 EGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS 241
G ++ L DIA + YS F AVG+F+LL++ ++ L +
Sbjct: 115 HGNTLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQMNGNYTGKSVKHLSESIGFKGEL 170
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 278
V +D+ + +LL K +++ + L + + E KR+ ++
Sbjct: 171 VHKDIATFFSLL-KYIESSQKLADDIREESLKRKSKS 206
>gi|33862947|ref|NP_894507.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9313]
gi|81577657|sp|Q7V7R3.1|THF1_PROMM RecName: Full=Protein thf1
gi|33634864|emb|CAE20850.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 243
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/214 (31%), Positives = 111/214 (51%), Gaps = 10/214 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + IPS+Y EL+V+ HL+ +++ + D +FA+G V+D
Sbjct: 6 TIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVFDVFTS 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE--VEGL 189
GY E + +F A + DP R AQ+ E RG + + ++G+ E L
Sbjct: 66 GYRPEAHVKTLFDALCRSCGFDPNALRKQAQQTLESVRGHDLEEVQGWIQQQGKGAPEAL 125
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 242
K + A G F YSR AVGL LL A E + EKL + + +K V
Sbjct: 126 AKALRNTA-GSTTFHYSRLMAVGLLSLLASAQGDESSDPEKLSQIAHELSESVGFSKARV 184
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREE 276
++DL++Y++ L K+ QA EL ++ ++ E++KRE+
Sbjct: 185 EKDLNLYKSNLEKMAQAVELTEQILESERRKREQ 218
>gi|448931221|gb|AGE54783.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
MA-1D]
Length = 248
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 108/202 (53%), Gaps = 11/202 (5%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 41 LMKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALG 100
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+
Sbjct: 101 IVTALDSILNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PN 154
Query: 182 KEGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
K G ++ L DIA + YS F A+G+F+LL++ ++ L +
Sbjct: 155 KHGNTLQKSLYDIASN----DKYVYSSFAAIGIFKLLQMNGNYTGNSVKHLSESIGFKGE 210
Query: 241 SVDRDLDVYRNLLSKLLQAKEL 262
V +D+ ++ +LL + +++L
Sbjct: 211 IVHKDIAMFFSLLKYIESSQKL 232
>gi|443323210|ref|ZP_21052219.1| photosystem II biogenesis protein Psp29 [Gloeocapsa sp. PCC 73106]
gi|442787120|gb|ELR96844.1| photosystem II biogenesis protein Psp29 [Gloeocapsa sp. PCC 73106]
Length = 231
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 70/218 (32%), Positives = 116/218 (53%), Gaps = 12/218 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV++TK +F + RPI SIY V++ELIV+ HL+ + ++ DP++ LG VT +DR M+
Sbjct: 6 TVSDTKRDFYAHHTRPINSIYRRVVEELIVELHLLSVNQNFRVDPIYCLGVVTSFDRFMQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWA-RGQTASSLVEFPSKEGEVEGLL 190
GY EED+ +I + A+ EQYR A ++ A R L+ + VEG
Sbjct: 66 GYRPEEDKASILASLCQAVGGKLEQYRDHANQVLNLAKRLHGVDDLLAWFKHPQPVEGEF 125
Query: 191 KDIAERASG---KGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKR 240
+AE S +F YSR F +GL+ +L N + + VL V+
Sbjct: 126 A-LAEAVSAIALNQSFKYSRMFGIGLYTMLGEKNLELLQDKPARDKITAQFAEVLPVSSD 184
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 278
+ +D ++Y+ L K+ Q +++E ++ E+KKR +++
Sbjct: 185 KLQKDFELYQANLEKMKQMIIVVEEALEAERKKRAKKS 222
>gi|157953365|ref|YP_001498256.1| hypothetical protein AR158_C174R [Paramecium bursaria Chlorella
virus AR158]
gi|156068013|gb|ABU43720.1| hypothetical protein AR158_C174R [Paramecium bursaria Chlorella
virus AR158]
gi|448930527|gb|AGE54091.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
IL-5-2s1]
gi|448934707|gb|AGE58259.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
NY-2B]
gi|448935079|gb|AGE58630.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
NYs1]
Length = 248
Score = 107 bits (266), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 109/202 (53%), Gaps = 11/202 (5%)
Query: 62 CMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALG 121
M T PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG
Sbjct: 41 LMKAITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALG 100
Query: 122 FVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS 181
VT D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+
Sbjct: 101 IVTALDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PN 154
Query: 182 KEGE-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKR 240
K G ++ L DIA + YS F A+G+F+LL++ +++L +
Sbjct: 155 KHGNTLQKSLYDIAIN----DKYVYSSFAAIGIFKLLQMNKNYTGNSVKQLSESIGFKGE 210
Query: 241 SVDRDLDVYRNLLSKLLQAKEL 262
V +D+ ++ +LL + +++L
Sbjct: 211 IVHKDIAMFFSLLKYIESSQKL 232
>gi|116074797|ref|ZP_01472058.1| hypothetical protein RS9916_29724 [Synechococcus sp. RS9916]
gi|116068019|gb|EAU73772.1| hypothetical protein RS9916_29724 [Synechococcus sp. RS9916]
Length = 234
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 66/216 (30%), Positives = 112/216 (51%), Gaps = 9/216 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + IPS+Y EL+V+ HL+ +++ ++ D +FA+G V+D
Sbjct: 6 TIADSKRAFHSAFPHVIPSLYRRTADELLVELHLLSHQKQFKVDALFAVGLRQVFDAFTR 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEGLL 190
GY E +++F A + DP + A E +G + + ++ +GE +
Sbjct: 66 GYRPEAHLDSLFAAICSCNGFDPAALKQLALDSEHAVQGHSFEDVQQWLRNKGEGAPAAI 125
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELA---NATEPTVLEKLCA----VLNVNKRSVD 243
+ +RA NF YSR AVGL LL A + ++P+ L KL L + K V+
Sbjct: 126 TKVLKRAD-HANFHYSRLMAVGLLTLLAKAQGDDGSDPSELAKLAHELSEPLGLTKERVE 184
Query: 244 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
+DL +Y L ++ QA EL++E + E++KRE + E
Sbjct: 185 KDLGIYTGNLERMAQAVELMEETLAAERRKRERQNE 220
>gi|352093979|ref|ZP_08955150.1| Protein thf1 [Synechococcus sp. WH 8016]
gi|351680319|gb|EHA63451.1| Protein thf1 [Synechococcus sp. WH 8016]
Length = 247
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 67/221 (30%), Positives = 114/221 (51%), Gaps = 11/221 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + IPS+Y EL+V+ HL+ +++ ++ D +FA+G V+ +
Sbjct: 6 TIADSKRAFHTAFPYVIPSLYRRTADELLVELHLLSHQQHFKSDALFAVGLRQVFQAFTQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E + ++ A ++ DPE + A+ G T S + E+ S G G +
Sbjct: 66 GYKPEAHLDELYAAICSSNGFDPEALKQLAEGSTSAVSGHTISEVREWLSNRG--AGAPE 123
Query: 192 DIAERAS--GKGNFSYSRFFAVGLFRLLELANATEP-------TVLEKLCAVLNVNKRSV 242
+A S G +F YSR AVGL LL A EP T+ ++ L ++K +
Sbjct: 124 PLASGISSVGGDSFHYSRLMAVGLLSLLSSAQGGEPSNPDELKTLAHEIGEQLGLSKPRL 183
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 283
D+DL +Y + L K+ QA EL++E + E++KR+ + +A
Sbjct: 184 DKDLTLYTSNLEKMAQAVELIEETLAAERRKRDRQAADSQA 224
>gi|113955551|ref|YP_730625.1| Thf1-like protein [Synechococcus sp. CC9311]
gi|113882902|gb|ABI47860.1| Uncharacterized protein [Synechococcus sp. CC9311]
Length = 252
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 112/213 (52%), Gaps = 11/213 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F K + IPS+Y EL+V+ HL+ +++ ++ D +FA+G V+ +
Sbjct: 11 TIADSKRAFHKSFPYVIPSLYRRTADELLVELHLLSHQQHFKSDALFAVGLRQVFMAFTQ 70
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E + ++ A T +PE + A+ G T + + E+ S G G +
Sbjct: 71 GYKPETHLDELYAAICTCNGFEPEALKQLAEGSTSAVSGHTINEVREWLSNRG--AGAPE 128
Query: 192 DIAERASGKG--NFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 242
+A S G +F YSR AVGL LL A EP+ ++L + L ++K +
Sbjct: 129 PLASGISSVGGESFHYSRLMAVGLLSLLSSAQGGEPSNPDELKKLAHEIGEQLGLSKPRL 188
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 275
D+DL +Y + L K+ QA EL++E + E++KR+
Sbjct: 189 DKDLSLYTSNLEKMAQAVELIEETLAAERRKRD 221
>gi|375332109|gb|AFA52594.1| hypothetical protein [Vaucheria litorea]
Length = 249
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 124/248 (50%), Gaps = 23/248 (9%)
Query: 63 MSTATDVPPT----------------VAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLM 106
MST DVP T V+ET +F Y++PI Y T++ +++ HL
Sbjct: 1 MSTVEDVPDTAMYKRFGHLKGKDVKTVSETIKSFCIQYQKPILPQYRTMINDVLQSTHLN 60
Query: 107 RYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEE 166
+ YD +F GF +++ +LM+ YP + + I+ A +T+L +PE+ + D + + +
Sbjct: 61 VVNGCFIYDAMFGYGFYSLFYKLMKAYPGTGEADLIYAAMVTSLDMEPEKLKEDHETISK 120
Query: 167 WARGQTASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA--TE 224
T + L S +GE + LL +I+ + Y++ + +GL ++ TE
Sbjct: 121 LIENMTRADLEN--SFKGENQNLLSEISSNIKADEFYLYTKTWGIGLIEAMDKVGIPLTE 178
Query: 225 PTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE--PQK 282
+ E L ++ + +DL Y+++L K+ QA++L KE REKKK ER E ++
Sbjct: 179 ENI-ESLANMIGFSPIKARQDLVQYKDVLDKVAQAEQLFKEIEIREKKKMAERLEEKAKR 237
Query: 283 ANEAIKKC 290
A EA KK
Sbjct: 238 ALEAAKKA 245
>gi|88808604|ref|ZP_01124114.1| hypothetical protein WH7805_02902 [Synechococcus sp. WH 7805]
gi|88787592|gb|EAR18749.1| hypothetical protein WH7805_02902 [Synechococcus sp. WH 7805]
Length = 234
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 109/216 (50%), Gaps = 11/216 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + IPS+Y EL+V+ HL+ ++ ++ + +FA+G V+ +
Sbjct: 13 TIADSKRAFHAAFPYVIPSLYRRTADELLVELHLLSHQTQFKSNALFAVGLRQVFTAFTK 72
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 189
GY + +F A + + +Q A+ E+ G + + + EG E L
Sbjct: 73 GYRPADHLTELFDALCSCNGFNAQQLNSVAEGSEKAVAGHSMEEVQAWLQSKGEGAPEPL 132
Query: 190 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 242
+A+ A + F YSR AVGLF LL A E E LC + +++ +
Sbjct: 133 ATGLADIAGEQ--FHYSRLMAVGLFSLLSSAQGVESQDPEDLCKTAHSIGEQIGLSRPRL 190
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 278
++DL +YRN L K+ QA EL++E + E++KRE ++
Sbjct: 191 EKDLSLYRNNLEKMAQAVELMEETLASERRKRERQS 226
>gi|318041533|ref|ZP_07973489.1| Thf1-like protein [Synechococcus sp. CB0101]
Length = 224
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 107/208 (51%), Gaps = 11/208 (5%)
Query: 69 VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 128
V TVA++K F + I IY ++ EL+V+ HL+ +++ ++ D +FA+G V+D
Sbjct: 3 VSLTVADSKRAFHSAFSYVIAPIYRRLVDELLVELHLLSHQKGFRADGLFAVGLTQVFDS 62
Query: 129 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
GY E RE +FQA +A D R A++ + + + + S +G +G
Sbjct: 63 FSTGYRPEAQREPLFQALCSANGFDGAALRAQAEQARQQVGHHSLEEVKGWLSNQG--QG 120
Query: 189 LLKDIAERASG--KGNFSYSRFFAVGLFRLLEL---ANATEPTVL----EKLCAVLNVNK 239
+ IA G + +F YSR AVGL LL+ A+A +P L ++ + + K
Sbjct: 121 APELIASLLQGVQRDDFHYSRLVAVGLLSLLQSAQGADALDPQALRSAAHEIGESMGLIK 180
Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYV 267
VD+DL +Y + K+ QA EL++E V
Sbjct: 181 DRVDKDLSLYAGNIEKMSQAVELMEETV 208
>gi|284929212|ref|YP_003421734.1| photosystem II biogenesis protein Psp29 [cyanobacterium UCYN-A]
gi|284809656|gb|ADB95353.1| photosystem II biogenesis protein Psp29 [cyanobacterium UCYN-A]
Length = 237
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 67/228 (29%), Positives = 117/228 (51%), Gaps = 14/228 (6%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+ETK F + +PI SIY ++EL+V+ HL+ YQY P++ALG VT++++ M
Sbjct: 6 TVSETKREFYNFFTKPISSIYRRFIEELLVEMHLLSVNADYQYSPIYALGVVTLFEKFMY 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE---VEG 188
Y ++ ++ IF A + D +QYR ++ + A + S+ E +K + +
Sbjct: 66 RYQPDDHQDLIFDALCKSTGGDTKQYRQESNTILNEAETLSISNFKEDFTKSAQEKVNDK 125
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLE-----LANATEP--TVLEKLCAVLNVNKRS 241
LL + F YSR A+GL+ LLE L + E +E++ L ++
Sbjct: 126 LLWKSYYSIAQNPKFKYSRLLAIGLYSLLEKISSDLVESKEEYNKAIEQIANDLGLSSER 185
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR----EERTEPQKANE 285
+ +D+++Y + L K+ Q +++ ++ +KKR EE T NE
Sbjct: 186 IQKDIELYCSNLEKMQQLLIAIEDSLEFGRKKRISQQEEDTLKTNDNE 233
>gi|317970011|ref|ZP_07971401.1| Thf1-like protein [Synechococcus sp. CB0205]
Length = 228
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 100/208 (48%), Gaps = 17/208 (8%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TVA++K F + + I +Y ++ EL+V+ HL+ +++ + D +FA+G V+D
Sbjct: 8 TVADSKRAFHQAFPYVIAPLYRRLVDELLVELHLLSHQKGFHADGLFAVGLTQVFDSFSN 67
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-----V 186
GY E RE +FQA +A D +R A + + + S GE +
Sbjct: 68 GYKPEAQREPLFQALCSANGFDGGAFRQMASDAATQVGHHSLDEVKGWLSNRGEGAPAPI 127
Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATE---PTVL----EKLCAVLNVNK 239
GLL + +F YSR AVGL LL+ A E P L ++ + + K
Sbjct: 128 AGLLHGVQRE-----DFHYSRLVAVGLLSLLQRAQGAEAMDPQALRSAAHEIGEAMGLIK 182
Query: 240 RSVDRDLDVYRNLLSKLLQAKELLKEYV 267
VD+DL +Y + K+ QA EL++E V
Sbjct: 183 ARVDKDLSLYAGNIEKMTQAVELMEETV 210
>gi|449015870|dbj|BAM79272.1| photosystem II biogenesis protein Psb29 [Cyanidioschyzon merolae
strain 10D]
Length = 327
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 112/226 (49%), Gaps = 8/226 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+ET F + KRP+ Y + E++ HL ++YD +FALGFV+VY
Sbjct: 88 TVSETVTRFYRNLKRPVVFYYQQAVDEILTTAHLALVCAMFRYDVIFALGFVSVYRDFFR 147
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE-----GEV 186
YP ++RE++F+ AL D Q +A + +G+T + L+E ++ E
Sbjct: 148 SYPRPDERESLFRCICDALDLDVGQVTKEADDALAYVQGKTEAELIEEIERDTGEDSAEA 207
Query: 187 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-TEPTVLEKLCAVLNVNKRSVDRD 245
+ ++ + G + Y+R F +GL +++ ++K +L ++ +D+D
Sbjct: 208 QPVIAALRACRRADGEYYYTRLFGIGLMKIMSSCGVEINLESVKKWANMLKISYARLDQD 267
Query: 246 LDVYRNLLSKLLQAKELLKEYVDREKKK--REERTEPQKANEAIKK 289
+ Y+ + KL QA+ + KE RE+ + E + Q+A E ++K
Sbjct: 268 IGTYQMSMEKLTQAEVMFKELEARERARIADELARKAQEAEEELRK 313
>gi|161347491|ref|YP_001224936.2| Thf1-like protein [Synechococcus sp. WH 7803]
Length = 226
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 64/224 (28%), Positives = 115/224 (51%), Gaps = 18/224 (8%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + IPS+Y EL+V+ HL+ ++ ++ + +FA+G V+ +
Sbjct: 6 TIADSKRAFHAAFPYVIPSLYRRTADELLVELHLLSHQTQFKTNALFAVGLRQVFTAFTK 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEGLL 190
GY + +F A + + E+ + A+ E+ G + + + +G+ G L
Sbjct: 66 GYRPADHLPQLFDALCSCNGFNAEELKSLAEGSEQAVSGHSVDEVQTWLQAKGDGAPGPL 125
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELA--NATEPTVLEKLCAV-------LNVNKRS 241
+G+ F YSR AVGLF LL A ++ +P E+LC + +++
Sbjct: 126 ATGLADIAGE-QFHYSRLMAVGLFSLLSSAQGDSQDP---EELCKTAHTIGEQIGLSRPR 181
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE----ERTEPQ 281
+++DL +YRN L K+ QA EL++E + E++KRE E +PQ
Sbjct: 182 LEKDLSLYRNNLEKMAQAVELMEETLASERRKRERQASENKQPQ 225
>gi|116070497|ref|ZP_01467766.1| hypothetical protein BL107_12665 [Synechococcus sp. BL107]
gi|116065902|gb|EAU71659.1| hypothetical protein BL107_12665 [Synechococcus sp. BL107]
Length = 215
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 116/218 (53%), Gaps = 22/218 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + + I +Y + EL+V+ HL+ ++ +++ P+F++G TV++ +
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSSFKTTPLFSVGLCTVFETFSQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E+ +F A ++ + +R ++++ + A+ ++ ++G+
Sbjct: 66 GYRPEDHITGLFDALCSSNGYNATTFRKESKQCIDAAKSES-------------IDGMES 112
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELA--NATEP--TVLEKLC----AVLNVNKRSVD 243
+A++ G+G+ YSR A+G+FRL E A +A +P T L K C LN V+
Sbjct: 113 HLAKQKLGEGSH-YSRLMAIGVFRLFEEAKGDAEQPDETELRKRCKEVSTTLNFPAERVE 171
Query: 244 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 281
+DL ++ ++ A EL++E + E++K+E R Q
Sbjct: 172 KDLSLFAANSERMSAAVELVQETIAAERRKKERRQAEQ 209
>gi|87124410|ref|ZP_01080259.1| hypothetical protein RS9917_12390 [Synechococcus sp. RS9917]
gi|86167982|gb|EAQ69240.1| hypothetical protein RS9917_12390 [Synechococcus sp. RS9917]
Length = 224
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 105/215 (48%), Gaps = 8/215 (3%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + IP +Y EL+V+ HL+ +++ +Q D +FA+G V+
Sbjct: 6 TIADSKRAFHTAFPFVIPPLYRRTADELLVELHLLSHQQQFQVDALFAVGLRQVFRAFTR 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEGLL 190
GY + ++F+A ++ + A + E RG + + + G+ L
Sbjct: 66 GYKPGQHLASLFEALCSSTGFHAGELESLADQSEAAVRGHSIEEVRHWLEHGGDGAPAPL 125
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANA--TEPTVLEKLC----AVLNVNKRSVDR 244
+ +RA G F YSR AVGL LL A +P L KL L + V++
Sbjct: 126 ASVLQRADSSG-FHYSRLMAVGLLSLLSEAQGDQADPEQLRKLAHELSGPLGFAQTRVEK 184
Query: 245 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 279
DL +Y + L K+ QA EL++E + E++KRE + +
Sbjct: 185 DLGLYASNLDKMAQAVELMEETLAAERRKRERQQQ 219
>gi|87302741|ref|ZP_01085552.1| hypothetical protein WH5701_13350 [Synechococcus sp. WH 5701]
gi|87282624|gb|EAQ74582.1| hypothetical protein WH5701_13350 [Synechococcus sp. WH 5701]
Length = 257
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 104/215 (48%), Gaps = 35/215 (16%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TVA++K F + I +Y ++ EL+V+ HL+ + + D +FA+G V+D +
Sbjct: 8 TVADSKRAFHAAFPYVIGPLYRRMVDELLVELHLLSRQSGFHSDGLFAVGLTQVFDGFAK 67
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYR---------IDAQKLEE---W--ARGQTASSLV 177
GY ++ E +F A + D +Q R + L+E W RGQ A
Sbjct: 68 GYRPQQQSEPLFAALCASSGFDAQQIRAQHAAAVKAVGEHSLDEVKQWLAQRGQGAPE-- 125
Query: 178 EFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLEL---ANATEPTVL----EK 230
+ G+L I +RA +F YSR FAVGL LL+ A A EP L +
Sbjct: 126 -------PIAGVLAGI-DRA----DFHYSRLFAVGLLSLLQHARGAEAVEPQALRQAAHE 173
Query: 231 LCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 265
+ + + K VD+DL +Y + L K+ QA EL++E
Sbjct: 174 IGESMGLMKERVDKDLTLYASTLEKMAQAVELMEE 208
>gi|416383906|ref|ZP_11684537.1| hypothetical protein CWATWH0003_1368 [Crocosphaera watsonii WH
0003]
gi|357265142|gb|EHJ13943.1| hypothetical protein CWATWH0003_1368 [Crocosphaera watsonii WH
0003]
Length = 209
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 103/200 (51%), Gaps = 17/200 (8%)
Query: 104 HLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQK 163
HL+ + YDP++ALG VT + R M+GY E D+ +IF A A+ E+Y +A+
Sbjct: 2 HLLSVNIDFSYDPIYALGVVTSFQRFMQGYSPESDKPSIFNALCQAVDGSSEKYHQEAEA 61
Query: 164 LEEWARGQTASSLVEFPSKEGEV------EGLLKDIAERASGKGNFSYSRFFAVGLFRLL 217
+ A+G S+V+F K V EG+L + F YSR A+GL+ LL
Sbjct: 62 ILNEAKGL---SIVDFKDKLTHVTDNQVGEGVLWGTFGAIAANPKFKYSRLLAIGLYTLL 118
Query: 218 -----ELANATE--PTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDRE 270
+L E ++++ L + + +DLD+YR+ L K+ Q ++++ ++ +
Sbjct: 119 MEIDSDLLKDEEKRTETIKEVSEALKFSPEKLRKDLDLYRSNLDKMQQLLTVIEDSLEAD 178
Query: 271 KKKREERTEPQKANEAIKKC 290
+KKR TE + + E +++
Sbjct: 179 RKKR-ASTEGKTSAEVVEQT 197
>gi|260436777|ref|ZP_05790747.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. WH 8109]
gi|260414651|gb|EEX07947.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. WH 8109]
Length = 215
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 108/218 (49%), Gaps = 22/218 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + + I +Y + EL+V+ HL+ ++ ++ + +F++G TV+D +
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSRFEANGLFSVGLCTVFDTFTK 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E +A+F A ++ D + R L + A+G+ +L + S EG
Sbjct: 66 GYRPEAQTDALFSALCSSNGFDAAKLRKTNASLVDQAKGKDHETLKSWLSSHSLKEG--- 122
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELANA------TEPTVLE--KLCAVLNVNKRSVD 243
YSR AVGL LL+ A A TE V + +L L + V+
Sbjct: 123 -----------SHYSRLMAVGLMSLLKAATADATGSDTETIVKQSKELAEGLGLPTDRVE 171
Query: 244 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 281
+DL ++ + ++ QA EL++E + EK+K+E R E Q
Sbjct: 172 KDLTLFGSNSERMDQAVELVEETIAAEKRKKERRLEEQ 209
>gi|78184631|ref|YP_377066.1| Thf1-like protein [Synechococcus sp. CC9902]
gi|97202850|sp|Q3AY05.1|THF1_SYNS9 RecName: Full=Protein thf1
gi|78168925|gb|ABB26022.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 215
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/218 (26%), Positives = 111/218 (50%), Gaps = 22/218 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + + I +Y + EL+V+ HL+ ++ +++ P+FA+G TV+D
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSSFKTTPLFAVGLCTVFDTFSA 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY EE + A ++ D +R ++++ + A+ ++ V+ +
Sbjct: 66 GYRPEEHITGLLDALCSSNGYDANTFRKESKRCIDAAKTES-------------VDAMDS 112
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELA--NATEP--TVLEKLC----AVLNVNKRSVD 243
+A + G+G+ YSR A+G+ RL E A +A +P L K C LN V+
Sbjct: 113 HLAGQKLGEGSH-YSRLMAIGVLRLFEEAKGDADQPDEADLRKRCKELSTALNFPAERVE 171
Query: 244 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 281
+DL ++ + ++ A EL++E + E++K+E R Q
Sbjct: 172 KDLSLFASNSERMSAAIELVQETIAAERRKKERRQAEQ 209
>gi|397644025|gb|EJK76212.1| hypothetical protein THAOC_02035 [Thalassiosira oceanica]
Length = 293
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/209 (27%), Positives = 98/209 (46%), Gaps = 8/209 (3%)
Query: 79 NFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEED 138
F PI ++Y + +L+ HL +Q DPVF+LG VTV D L++ +P ++
Sbjct: 58 TFTDALGTPINALYKGTITDLVGSLHLTVVTARFQRDPVFSLGLVTVLDLLLKNFPEQDT 117
Query: 139 REAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAERAS 198
+ I A I + + +A ++ WA+G+T + + GE + L +A A
Sbjct: 118 AKRIKSAMIESAGMVESEVDAEAAEVATWAQGKTREDIA--SALRGEGDSTLAQVANGAK 175
Query: 199 GKGNFSYSRFFAVGLFRLLELANATEPT-----VLEKLCAV-LNVNKRSVDRDLDVYRNL 252
G + YSRFF +GL +++++ + V+E L + D D+Y
Sbjct: 176 GDEYWMYSRFFGIGLVKMMDIVGIEQDMSVAYDVMEDWVGTCLGKPHYTACADSDLYFKQ 235
Query: 253 LSKLLQAKELLKEYVDREKKKREERTEPQ 281
KL + ++KE REKK+ +R E +
Sbjct: 236 KGKLDMMETMMKEIEIREKKRMADRLEAK 264
>gi|147848088|emb|CAK23639.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
Length = 206
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 107/208 (51%), Gaps = 18/208 (8%)
Query: 88 IPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYI 147
IPS+Y EL+V+ HL+ ++ ++ + +FA+G V+ +GY + +F A
Sbjct: 2 IPSLYRRTADELLVELHLLSHQTQFKTNALFAVGLRQVFTAFTKGYRPADHLPQLFDALC 61
Query: 148 TALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEGLLKDIAERASGKGNFSYS 206
+ + E+ + A+ E+ G + + + +G+ G L +G+ F YS
Sbjct: 62 SCNGFNAEELKSLAEGSEQAVSGHSVDEVQTWLQAKGDGAPGPLATGLADIAGE-QFHYS 120
Query: 207 RFFAVGLFRLLELA--NATEPTVLEKLCAV-------LNVNKRSVDRDLDVYRNLLSKLL 257
R AVGLF LL A ++ +P E+LC + +++ +++DL +YRN L K+
Sbjct: 121 RLMAVGLFSLLSSAQGDSQDP---EELCKTAHTIGEQIGLSRPRLEKDLSLYRNNLEKMA 177
Query: 258 QAKELLKEYVDREKKKRE----ERTEPQ 281
QA EL++E + E++KRE E +PQ
Sbjct: 178 QAVELMEETLASERRKRERQASENKQPQ 205
>gi|78212971|ref|YP_381750.1| Thf1-like protein [Synechococcus sp. CC9605]
gi|97202855|sp|Q3AJN7.1|THF1_SYNSC RecName: Full=Protein thf1
gi|78197430|gb|ABB35195.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 215
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 110/218 (50%), Gaps = 22/218 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + + I +Y + EL+V+ HL+ ++ ++ + +F++G TV+D ++
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSRFEANELFSVGLCTVFDTFIK 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E +A+F+A ++ D + R L E A+G+ SL ++ S EG
Sbjct: 66 GYRPEAQTDALFRALCSSNGFDAAKLRKTYASLVEQAKGKDPESLKDWLSSHALKEG--- 122
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLE------LANATEPTVLE--KLCAVLNVNKRSVD 243
YSR AVGL LL+ + TE V + +L L + V+
Sbjct: 123 -----------SHYSRLMAVGLMSLLKAAAADATDSDTEAIVKQSKELAEGLGLPTDRVE 171
Query: 244 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 281
+DL ++ + ++ QA EL++E + EK+K+E R E Q
Sbjct: 172 KDLTLFGSNSERMDQAVELVEETIAAEKRKKERRLEEQ 209
>gi|428183151|gb|EKX52010.1| hypothetical protein GUITHDRAFT_150871, partial [Guillardia theta
CCMP2712]
Length = 309
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 56/224 (25%), Positives = 103/224 (45%), Gaps = 17/224 (7%)
Query: 66 ATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 125
D+ P A + F KL+ RPIP ++ E++ HL ++YD ++A G +
Sbjct: 72 GADIEPCGAAVE-RFYKLFARPIPFVFRAPTNEILYLSHLDLVNAMFRYDVIWAAGLYST 130
Query: 126 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 185
+D E+ R +FQA + LK D + + DA + +WA+G+T + +V + +GE
Sbjct: 131 FDLFFSAL-DEDLRANLFQALMGGLKLDQSKIKSDADAVLQWAQGKTEADVVS--AIKGE 187
Query: 186 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTV------------LEKLCA 233
+ + +F Y+R F GL +++++ EP A
Sbjct: 188 DSSPVGQVLASLGKNEDFLYTRNFGAGLIKIMQVVG-VEPNAENAKRWAEVLGFTSNTSA 246
Query: 234 VLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
+ ++ + D+ ++ + + K+ QA +L E REKKK E+
Sbjct: 247 LSGLSASKFETDVGLFLSSVDKMQQAMQLFAEVEAREKKKIAEK 290
>gi|299469582|emb|CBN76436.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 226
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 61/109 (55%), Gaps = 2/109 (1%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+ET +F YK+ + + + T++ E + HL Y ++YDP+F +GF T + R M
Sbjct: 107 TVSETVADFYIYYKKVVLTQFRTIVTEYLQSTHLTVYDARFKYDPLFGVGFYTSFMRFMR 166
Query: 132 GYPSEEDREAIFQAYITALKE--DPEQYRIDAQKLEEWARGQTASSLVE 178
YP E IF A + A+ DP+Q R D L+EWA G+T +VE
Sbjct: 167 AYPVPGQAELIFDAVVKAIGNGLDPDQMRKDTTALKEWAEGKTEEDVVE 215
>gi|33865836|ref|NP_897395.1| Thf1-like protein [Synechococcus sp. WH 8102]
gi|81574513|sp|Q7U6N6.1|THF1_SYNPX RecName: Full=Protein thf1
gi|33633006|emb|CAE07817.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 212
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 104/211 (49%), Gaps = 19/211 (9%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T+A++K F + + I +Y + EL+V+ HL+ ++ T+Q + +FA+G TV++R +
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRIADELLVELHLLSHQATFQANSLFAVGLKTVFERFTQ 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E A+ A ++ D EQ + AQ + A G + + + ++ +G
Sbjct: 66 GYRPMEHPAALLSALCSSNGFDDEQLKQAAQHCLQDAEGHSDDAFQSWLKEQSLSDGA-- 123
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL-----EKLCAVLNVNKRSVDRDL 246
YSR AVGL LLE ++ KL L + V++DL
Sbjct: 124 ------------HYSRLMAVGLLALLEASSDESDASSLRQRAVKLSVDLGLPAERVEKDL 171
Query: 247 DVYRNLLSKLLQAKELLKEYVDREKKKREER 277
V+ + ++ QA EL++E + +++K+E+R
Sbjct: 172 TVFSSNSERMEQAVELMQETLAADRRKKEKR 202
>gi|159903384|ref|YP_001550728.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9211]
gi|254784145|sp|A9BAB2.1|THF1_PROM4 RecName: Full=Protein thf1
gi|159888560|gb|ABX08774.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9211]
Length = 221
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 115/221 (52%), Gaps = 18/221 (8%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+E+K F K + +P++Y ++ ELIV+ +L++ + + D VFA+G +++ +
Sbjct: 6 TVSESKAIFHKEFPFVVPAVYRRLVDELIVELNLLKNQERFVADGVFAIGLTSIFLDFTK 65
Query: 132 GYPSEEDREAIFQAYITAL---KEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
GY E + + +A + EQ ++A+KL SL+ +++ E E
Sbjct: 66 GYKPENQKGILLEAICKCTGFSASNLEQIALEAKKLANGLNTNEIKSLITDNNRD-EKES 124
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRS 241
K I + N YSR A+G+++L+++ + ATE + L+ L K
Sbjct: 125 TYKLINK------NNHYSRIIAIGIYKLVDMQSNGFNKEEATENSYLD-LVNNFGYTKER 177
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQK 282
V++D+++Y++ L K+ +A EL++ + EK++ +ER K
Sbjct: 178 VEKDVNLYKSSLDKIEKALELIEMNIKDEKRRNKERVSRTK 218
>gi|219123541|ref|XP_002182081.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406682|gb|EEC46621.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 311
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 55/225 (24%), Positives = 105/225 (46%), Gaps = 9/225 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV E +F + +Y ++ +++ HL+ +Q D +++LG +T D L++
Sbjct: 67 TVGEAFADFSSELGVTVNPLYKNMVTDIVGTTHLVIVNARFQRDAIWSLGILTALDLLLK 126
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
YP E I A ++ D ++ R +A+ + +WA G++ + + + EG+ +
Sbjct: 127 NYPEPEVGAKIVSALFKSVGLDEDEIRNEARTISDWAVGKSKADIETALTGEGDSP--VA 184
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELA------NATEPTVLEKLCAVLNVNKRSVDRD 245
IA + YSR+F +GL +++E + P + + L + + D
Sbjct: 185 AIANSIKPNDYWMYSRYFGIGLIKIMESTGVEMDKDEVYPVMESWMQEKLGRSSLTACAD 244
Query: 246 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKC 290
D+Y + KL + ++KE REKK+ ER E KA A++
Sbjct: 245 SDLYFKIKDKLDMMETMMKEIEIREKKRMAERLE-DKAEAALRAA 288
>gi|427701945|ref|YP_007045167.1| photosystem II biogenesis protein Psp29 [Cyanobium gracile PCC
6307]
gi|427345113|gb|AFY27826.1| photosystem II biogenesis protein Psp29 [Cyanobium gracile PCC
6307]
Length = 231
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 102/205 (49%), Gaps = 11/205 (5%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TVA++K F + I +Y ++ EL+V+ HL+ ++ +Q D +FA+G + V+D
Sbjct: 6 TVADSKRAFHGAFPHVISPLYRRMVDELLVELHLLSRQKGFQIDALFAVGLIQVFDGFAR 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E + +FQA + D R Q+ + + + ++ +G G
Sbjct: 66 GYRPEAQKGPLFQALCASSGFDGPDLRRQCQEALAAMGRHSQAEVRQWIESQG--AGAPA 123
Query: 192 DIAERASG--KGNFSYSRFFAVGLFRLLELA---NATEPTVLEKLCA----VLNVNKRSV 242
+A +G + +F YSR AVGL LLE A +A EP L +L + + + +
Sbjct: 124 PVATALAGIRRPDFHYSRLMAVGLLALLEQALADDAMEPQALRQLAHEIGESMGLLRDRL 183
Query: 243 DRDLDVYRNLLSKLLQAKELLKEYV 267
D+DL +Y + L K+ A EL++E V
Sbjct: 184 DKDLALYASNLEKMSMAVELMEETV 208
>gi|223995057|ref|XP_002287212.1| hypothetical protein THAPSDRAFT_261275 [Thalassiosira pseudonana
CCMP1335]
gi|220976328|gb|EED94655.1| hypothetical protein THAPSDRAFT_261275 [Thalassiosira pseudonana
CCMP1335]
Length = 212
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 100/213 (46%), Gaps = 10/213 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV E F P+ ++Y + +L+ HL+ +Q D VF+LG V+ D +++
Sbjct: 2 TVGEAFTQFTDKLGTPVNALYKGMCTDLVGSLHLVMVNARFQRDAVFSLGLVSALDLVLK 61
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
YP E I A + ++ D +A LE WA+G+T + EG+ + L
Sbjct: 62 NYPEAETGARIKSAMLESVGLDEAVVNAEAAALEAWAQGKTKEDIASALKGEGDSQ--LA 119
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR------- 244
IA+ A G + YSRFF VGL R++E+ E + + N + +++
Sbjct: 120 AIAKAAKGDQWWMYSRFFGVGLVRIMEIV-GVEMDMSVAYDVMENWMGKCMEKPYYTACS 178
Query: 245 DLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
D D+Y KL + ++KE REKK+ +R
Sbjct: 179 DSDLYFKTKGKLDMMETMMKEIEIREKKRMADR 211
>gi|72382131|ref|YP_291486.1| Thf1-like protein [Prochlorococcus marinus str. NATL2A]
gi|97202784|sp|Q46L45.1|THF1_PROMT RecName: Full=Protein thf1
gi|72001981|gb|AAZ57783.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL2A]
Length = 199
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 95/202 (47%), Gaps = 19/202 (9%)
Query: 69 VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 128
V T++++K +F K + IP+IY + EL+V+ HL+ +++ ++ D +F+ G V+ +
Sbjct: 3 VRATISDSKSDFHKEFPYVIPAIYRKLADELLVELHLLSHQKNFKKDSIFSTGLKEVFSK 62
Query: 129 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
GY E +F A +P + +++L A+ T L F SK
Sbjct: 63 FTSGYKPSEHATKLFDAICNCNGFNPTEINNSSEQLVSNAKSFTKEDLNSFLSKTNN--- 119
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS------- 241
KG YSR A+G+++L+ + E L +N +S
Sbjct: 120 ---------DNKGYDYYSRINAIGIYKLVSEMPLFKEVKEEDLNKEINDISKSLGYQYSR 170
Query: 242 VDRDLDVYRNLLSKLLQAKELL 263
V++D+ +Y++ + K+ QA E++
Sbjct: 171 VEKDISMYKSNIEKMKQALEII 192
>gi|124025670|ref|YP_001014786.1| Thf1-like protein [Prochlorococcus marinus str. NATL1A]
gi|166987530|sp|A2C211.1|THF1_PROM1 RecName: Full=Protein thf1
gi|123960738|gb|ABM75521.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL1A]
Length = 199
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 95/202 (47%), Gaps = 19/202 (9%)
Query: 69 VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 128
V T++++K +F K + IP+IY + EL+V+ HL+ +++ ++ D +F+ G V+ +
Sbjct: 3 VRATISDSKSDFHKEFPYVIPAIYRKLADELLVELHLLSHQKNFKKDSIFSTGLKEVFCK 62
Query: 129 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 188
GY E +F A +P + +++L A+ T L F SK
Sbjct: 63 FTSGYKPSEHVTKLFDAICNCNGFNPTEINNSSEQLVSNAKSFTKEDLNSFLSKTNN--- 119
Query: 189 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS------- 241
KG YSR A+G+++L+ + E L +N +S
Sbjct: 120 ---------DNKGYDYYSRINAIGIYKLVSEMPLFKEVKEEDLNKEINDISKSLGYQYSR 170
Query: 242 VDRDLDVYRNLLSKLLQAKELL 263
V++D+ +Y++ + K+ QA E++
Sbjct: 171 VEKDISMYKSNIEKMKQALEII 192
>gi|254526529|ref|ZP_05138581.1| photosystem II biogenesis protein Psp29 [Prochlorococcus marinus
str. MIT 9202]
gi|221537953|gb|EEE40406.1| photosystem II biogenesis protein Psp29 [Prochlorococcus marinus
str. MIT 9202]
Length = 202
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/211 (22%), Positives = 111/211 (52%), Gaps = 22/211 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F + + IP +Y + E++V+ +L+ ++ + D +F +G + LM+
Sbjct: 6 TVSDSKKLFHEKFPYVIPGLYKRIADEMLVELNLLNHQNEFTQDFLFCVGLTETFKELMK 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E+ + +F++ ++ + ++ +QK ++ + +T++ +V+
Sbjct: 66 GYQPEKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKDKTSTDIVKL------------ 113
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL----ELANATEPTVLEKLCAV---LNVNKRSVDR 244
+ E+++ K SR +G++ L+ +L E + + + + LN++ ++
Sbjct: 114 -LIEKSNSK--LYPSRILNLGIYILISNAQDLKKKNESDINKMISDIFEQLNLSANKAEK 170
Query: 245 DLDVYRNLLSKLLQAKELLKEYVDREKKKRE 275
D+ +Y++ +SK+ QAKEL++E + KKK E
Sbjct: 171 DIGIYKSSISKMEQAKELIEELRIKNKKKDE 201
>gi|33240369|ref|NP_875311.1| Thf1-like protein [Prochlorococcus marinus subsp. marinus str.
CCMP1375]
gi|81664534|sp|Q7VC23.1|THF1_PROMA RecName: Full=Protein thf1
gi|33237896|gb|AAP99963.1| Uncharacterized protein [Prochlorococcus marinus subsp. marinus
str. CCMP1375]
Length = 214
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 46/208 (22%), Positives = 102/208 (49%), Gaps = 9/208 (4%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
T++++K F K + IP +Y VL E +V+ +L+ + ++ D +F+ G + ++R
Sbjct: 6 TISDSKGLFHKEFPYVIPPVYRKVLDEYLVELNLLSNQSNFKIDTIFSYGLIISFERFTV 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY + I ++ + D + + + +++ + ++ + E++ +
Sbjct: 66 GYEPDSHISKILESLCNSCNIDIKAIKEYSNNIKKLINEKGIKEIINILT--AEIKKSVG 123
Query: 192 DIA-ERASGKGNFSYSRFFAVGLFRLLELAN-----ATEPTVLEKLCAVLNVNKRSVDRD 245
IA SGK + YSR A+G++ L+ N + ++ + L +K V++D
Sbjct: 124 GIALSNQSGKDKY-YSRLHAIGIYELISNINEDKKEGDDKEIISECVEALGFSKDRVEKD 182
Query: 246 LDVYRNLLSKLLQAKELLKEYVDREKKK 273
++ Y+N + K+ + EL+K V+ K+K
Sbjct: 183 INQYKNSMEKIKEMMELIKLTVEETKRK 210
>gi|123968337|ref|YP_001009195.1| Thf1-like protein [Prochlorococcus marinus str. AS9601]
gi|123198447|gb|ABM70088.1| conserved hypothetical protein [Prochlorococcus marinus str.
AS9601]
Length = 218
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 115/217 (52%), Gaps = 30/217 (13%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F + + IP +Y ++ E++V+ +L+ ++ + D +F +G + LM+
Sbjct: 21 TVSDSKKLFHEKFPYVIPGLYKRIVDEMLVELNLLNHQNEFTLDYLFCVGLTETFKELMK 80
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E+ + +F++ ++ +A+++ E ++ ++ LV+ SK+ +LK
Sbjct: 81 GYQPEKHLDLLFESLCSST-------NFEAKEINEISK-KSQKELVDKTSKD-----ILK 127
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELA-----------NATEPTVLEKLCAVLNVNKR 240
+ E+ + K SR +G++ L+ + N + EKL L+ NK
Sbjct: 128 LLVEKNNSK--LYPSRILNLGIYTLISNSQDFKEKNESDKNKMTSDIFEKLS--LSANK- 182
Query: 241 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 277
++D+ +Y++ +SK+ QAKEL++E ++K K +++
Sbjct: 183 -AEKDIGIYKSSISKMEQAKELIEELRIKDKNKNQKK 218
>gi|157413170|ref|YP_001484036.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9215]
gi|157387745|gb|ABV50450.1| Conserved hypothetical protein [Prochlorococcus marinus str. MIT
9215]
Length = 217
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 47/211 (22%), Positives = 110/211 (52%), Gaps = 22/211 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F + + IP +Y + E++V+ +L+ ++ + D +F +G + LM+
Sbjct: 21 TVSDSKKLFHEKFPYVIPGLYKRIADEMLVELNLLNHQNEFTQDFLFCVGLTETFKELMK 80
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E+ + +F++ ++ + ++ +QK ++ + +T++ +V+
Sbjct: 81 GYQPEKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKDKTSTDIVKL------------ 128
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL----ELANATEPTVLEKLCAV---LNVNKRSVDR 244
+ E+++ K SR +G++ L+ +L E + + + LN++ ++
Sbjct: 129 -LIEKSNSK--LYPSRILNLGIYILISNAQDLKKNNESDTNKMISDIFEKLNLSANKAEK 185
Query: 245 DLDVYRNLLSKLLQAKELLKEYVDREKKKRE 275
D+ +Y++ +SK+ QAKEL++E + KKK E
Sbjct: 186 DIGIYKSSISKMEQAKELIEELRIKNKKKDE 216
>gi|123200442|gb|ABM72050.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9515]
Length = 198
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 104/212 (49%), Gaps = 28/212 (13%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+E+K F + + IP +Y ++ E++V+ +L+ ++ + D +F +G + L +
Sbjct: 2 TVSESKKLFHEQFPFVIPGLYKRIVDEMLVELNLLNHQNEFIQDELFCVGLTETFKELTK 61
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYR-IDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
GY E E +F++ + P + + I + LE++ E+ LL
Sbjct: 62 GYKPESHLELLFESLCKSSNFIPSKIKEISLKTLEQYKDKSLK-----------EISILL 110
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL---EKLCAV------LNVNKRS 241
K+ N SR +G++ L +ANAT+ L EK A+ LN++
Sbjct: 111 KE-----KSTSNLYSSRILNIGIY--LIIANATDFKGLKDSEKNKAITDNINNLNLSVNK 163
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKK 273
++D+ +Y++ + K+ QAKELL+E + KKK
Sbjct: 164 AEKDIGIYKSSIKKMEQAKELLEEAKIQNKKK 195
>gi|161407964|ref|YP_001011157.2| Thf1-like protein [Prochlorococcus marinus str. MIT 9515]
Length = 217
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 104/212 (49%), Gaps = 28/212 (13%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+E+K F + + IP +Y ++ E++V+ +L+ ++ + D +F +G + L +
Sbjct: 21 TVSESKKLFHEQFPFVIPGLYKRIVDEMLVELNLLNHQNEFIQDELFCVGLTETFKELTK 80
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYR-IDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 190
GY E E +F++ + P + + I + LE++ E+ LL
Sbjct: 81 GYKPESHLELLFESLCKSSNFIPSKIKEISLKTLEQYKDKSLK-----------EISILL 129
Query: 191 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL---EKLCAV------LNVNKRS 241
K+ N SR +G++ L +ANAT+ L EK A+ LN++
Sbjct: 130 KE-----KSTSNLYSSRILNIGIY--LIIANATDFKGLKDSEKNKAITDNINNLNLSVNK 182
Query: 242 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKK 273
++D+ +Y++ + K+ QAKELL+E + KKK
Sbjct: 183 AEKDIGIYKSSIKKMEQAKELLEEAKIQNKKK 214
>gi|323450067|gb|EGB05951.1| hypothetical protein AURANDRAFT_66018 [Aureococcus anophagefferens]
Length = 1032
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 73/164 (44%), Gaps = 5/164 (3%)
Query: 112 YQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQ 171
+ YD +F GFVT+ D +M YP D E I A I AL DP R D + + EW G+
Sbjct: 69 FVYDELFGFGFVTLMDMIMSPYPVAGDGEKITDALIAALDMDPATLRGDHKAVTEWLAGK 128
Query: 172 T-ASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-TEPTVLE 229
T A L S +G + A G+ F ++R VGL +++ + L
Sbjct: 129 TEADVLAAVASNDGS---KVASAAATIKGQEEFHHTRPSNVGLVAVMDAVGCKPDDESLA 185
Query: 230 KLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKK 273
+ + +V R+ + + K+ A +++K EKK+
Sbjct: 186 RWTEAFGMRAPAVQRNAGLLKEYQEKVANAMQMIKSAEIMEKKR 229
>gi|97202782|sp|Q7V1W1.2|THF1_PROMP RecName: Full=Protein thf1
Length = 202
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 47/203 (23%), Positives = 102/203 (50%), Gaps = 26/203 (12%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F + + IP +Y ++ E++V+ +L+ ++ + D +F +G + L +
Sbjct: 6 TVSDSKKLFHEQFPYVIPGLYKRIVDEMLVELNLLNHQNEFIQDDLFCVGLTETFKELTK 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY EE +F++ + +P++ + ++K E + ++ E+ LLK
Sbjct: 66 GYKPEEHLRVLFESLCNSSNFEPKKIKEASKKTLEVYKDKSLK----------EISILLK 115
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELANATE---------PTVLEKLCAVLNVNKRSV 242
++ N SR +G++ L +ANAT+ ++ + LN++
Sbjct: 116 QKSD-----SNLYSSRILNLGIY--LIIANATDFKDIKDPEKNKIISDIINKLNLSFNKA 168
Query: 243 DRDLDVYRNLLSKLLQAKELLKE 265
++D+ +Y++ + K+ QAKELL+E
Sbjct: 169 EKDIGIYKSSILKMEQAKELLQE 191
>gi|33861298|ref|NP_892859.1| Thf1-like protein [Prochlorococcus marinus subsp. pastoris str.
CCMP1986]
gi|33633875|emb|CAE19200.1| conserved hypothetical protein [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
Length = 217
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 47/203 (23%), Positives = 102/203 (50%), Gaps = 26/203 (12%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F + + IP +Y ++ E++V+ +L+ ++ + D +F +G + L +
Sbjct: 21 TVSDSKKLFHEQFPYVIPGLYKRIVDEMLVELNLLNHQNEFIQDDLFCVGLTETFKELTK 80
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY EE +F++ + +P++ + ++K E + ++ E+ LLK
Sbjct: 81 GYKPEEHLRVLFESLCNSSNFEPKKIKEASKKTLEVYKDKSLK----------EISILLK 130
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELANATE---------PTVLEKLCAVLNVNKRSV 242
++ N SR +G++ L +ANAT+ ++ + LN++
Sbjct: 131 QKSD-----SNLYSSRILNLGIY--LIIANATDFKDIKDPEKNKIISDIINKLNLSFNKA 183
Query: 243 DRDLDVYRNLLSKLLQAKELLKE 265
++D+ +Y++ + K+ QAKELL+E
Sbjct: 184 EKDIGIYKSSILKMEQAKELLQE 206
>gi|97202762|sp|Q31BD6.2|THF1_PROM9 RecName: Full=Protein thf1
Length = 201
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 47/209 (22%), Positives = 112/209 (53%), Gaps = 22/209 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F + + IP +Y ++ E++V+ +L+ ++ ++ D +F +G + L +
Sbjct: 6 TVSDSKKLFHEEFPYVIPGLYKRIVDEILVELNLLNHQNEFKQDYLFCIGLTETFKELTK 65
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E+ + +F++ + +A++++E ++ S EF K + +LK
Sbjct: 66 GYKPEKHLDLLFESLCIST-------NFEAKEIKEISK----ISQKEFSDKSSK--DILK 112
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL-------ELANATEPTVLEKLCAVLNVNKRSVDR 244
+ E+++ K SR +G++ L+ E + + ++ + L++++ ++
Sbjct: 113 LLKEKSNSK--LYPSRILNLGIYILISNSQDFKENNDIEKNKMISDIFEKLSLSRNKAEK 170
Query: 245 DLDVYRNLLSKLLQAKELLKEYVDREKKK 273
D+ +Y++ +SK+ QAKEL++E ++KKK
Sbjct: 171 DIGIYKSSISKMEQAKELIQEQRIKDKKK 199
>gi|78779133|ref|YP_397245.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9312]
gi|78712632|gb|ABB49809.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9312]
Length = 216
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/209 (22%), Positives = 112/209 (53%), Gaps = 22/209 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F + + IP +Y ++ E++V+ +L+ ++ ++ D +F +G + L +
Sbjct: 21 TVSDSKKLFHEEFPYVIPGLYKRIVDEILVELNLLNHQNEFKQDYLFCIGLTETFKELTK 80
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY E+ + +F++ + +A++++E ++ S EF K + +LK
Sbjct: 81 GYKPEKHLDLLFESLCIST-------NFEAKEIKEISK----ISQKEFSDKSSK--DILK 127
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLL-------ELANATEPTVLEKLCAVLNVNKRSVDR 244
+ E+++ K SR +G++ L+ E + + ++ + L++++ ++
Sbjct: 128 LLKEKSNSK--LYPSRILNLGIYILISNSQDFKENNDIEKNKMISDIFEKLSLSRNKAEK 185
Query: 245 DLDVYRNLLSKLLQAKELLKEYVDREKKK 273
D+ +Y++ +SK+ QAKEL++E ++KKK
Sbjct: 186 DIGIYKSSISKMEQAKELIQEQRIKDKKK 214
>gi|194476659|ref|YP_002048838.1| hypothetical protein PCC_0178 [Paulinella chromatophora]
gi|171191666|gb|ACB42628.1| hypothetical protein PCC_0178 [Paulinella chromatophora]
Length = 213
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/221 (23%), Positives = 92/221 (41%), Gaps = 36/221 (16%)
Query: 71 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 130
PTVA+TK F K + I + TVL EL+V+ L+ + DP+FA+G + + L
Sbjct: 8 PTVADTKRAFYKGFPYVIAPSHRTVLNELLVELFLLSPQTDIGSDPLFAVGLIQFFGVLT 67
Query: 131 EGYPSEEDREAIFQAYITALKEDP--------------EQYRIDAQKLEEWARGQTASSL 176
+ Y + R +F+A ++ D QY I ++L W+ +S
Sbjct: 68 KHYQPQNHRMLLFEALCNSIGFDSFNLRQIRKESLSELSQYNI--EELHSWSLTGADNSE 125
Query: 177 VEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLE 229
+ F + K F YSR A+GL L++ A E +
Sbjct: 126 ILFTKTFIPI-------------KRRFHYSRLMAIGLLCLIKRARGVETLEAKELYYLTH 172
Query: 230 KLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDRE 270
L + + +DRDL VY + + K+ +++ + ++
Sbjct: 173 NLAEKMGFIRERIDRDLSVYIDTIEKMAAIIDMIPKITKKD 213
>gi|126543182|gb|ABO17424.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9301]
Length = 198
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/201 (19%), Positives = 103/201 (51%), Gaps = 22/201 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F + + IP +Y ++ E++V+ +L+ ++ + + +F +G + LM+
Sbjct: 2 TVSDSKRLFHEKFPYVIPGLYKRIVDEILVELNLLNHQNEFTQEYLFCIGLTETFKELMK 61
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY + + +F++ ++ + ++ +QK ++ + +T++ +LK
Sbjct: 62 GYQPNKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKNKTSND-------------ILK 108
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRSVDR 244
+ E+++ K SR + ++ L+ A + ++ + LN++ ++
Sbjct: 109 LLIEKSNSK--LYPSRILNLAIYILISSAQDLKEKEESGRNKIISDIFEKLNLSANKAEK 166
Query: 245 DLDVYRNLLSKLLQAKELLKE 265
D+ +Y++ +SK+ QAKEL++E
Sbjct: 167 DIGIYKSSISKMEQAKELIEE 187
>gi|161407965|ref|YP_001091025.2| Thf1-like protein [Prochlorococcus marinus str. MIT 9301]
Length = 217
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/201 (19%), Positives = 103/201 (51%), Gaps = 22/201 (10%)
Query: 72 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 131
TV+++K F + + IP +Y ++ E++V+ +L+ ++ + + +F +G + LM+
Sbjct: 21 TVSDSKRLFHEKFPYVIPGLYKRIVDEILVELNLLNHQNEFTQEYLFCIGLTETFKELMK 80
Query: 132 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 191
GY + + +F++ ++ + ++ +QK ++ + +T++ +LK
Sbjct: 81 GYQPNKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKNKTSN-------------DILK 127
Query: 192 DIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRSVDR 244
+ E+++ K SR + ++ L+ A + ++ + LN++ ++
Sbjct: 128 LLIEKSNSK--LYPSRILNLAIYILISSAQDLKEKEESGRNKIISDIFEKLNLSANKAEK 185
Query: 245 DLDVYRNLLSKLLQAKELLKE 265
D+ +Y++ +SK+ QAKEL++E
Sbjct: 186 DIGIYKSSISKMEQAKELIEE 206
>gi|256810247|ref|YP_003127616.1| CRISPR-associated protein, Csx11 family [Methanocaldococcus fervens
AG86]
gi|256793447|gb|ACV24116.1| CRISPR-associated protein, Csx11 family [Methanocaldococcus fervens
AG86]
Length = 1056
Score = 40.8 bits (94), Expect = 0.78, Method: Composition-based stats.
Identities = 28/101 (27%), Positives = 50/101 (49%), Gaps = 3/101 (2%)
Query: 164 LEEWARGQTASSLVEF-PSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA 222
LE+W + + E PSK ++ ++ E K + S S+F +LEL
Sbjct: 942 LEDWKKFIKFKEIFENKPSKLQKLVNIIYKCLEDWDNKYDDSISQFLDTSFINVLELNKK 1001
Query: 223 TEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELL 263
+ V+EKLC + +++ D+DL+ +RN L + K++L
Sbjct: 1002 SNKEVIEKLCVIFDISLE--DKDLEKFRNELINKIDRKKML 1040
>gi|119331158|ref|NP_001073230.1| uncharacterized protein LOC772071 [Gallus gallus]
gi|60098923|emb|CAH65292.1| hypothetical protein RCJMB04_15j12 [Gallus gallus]
Length = 299
Score = 40.4 bits (93), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 105/255 (41%), Gaps = 37/255 (14%)
Query: 59 IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRY--KRTYQY-- 114
+IQC + P +A+T L++R S + V + LI HLM Y +R QY
Sbjct: 45 LIQCTNEMNVNIPQLADT------LFERTANSSWVVVFKALITTHHLMMYGNERFIQYLA 98
Query: 115 --DPVFALGFVTVYDR-LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG- 170
+ +F L D+ M+GY D + Y L E YR+ A + RG
Sbjct: 99 SRNTLFNLN--NYLDKSAMQGY----DMSTFIRRYSRYLNEKALSYRLVAVDFTKMKRGI 152
Query: 171 ------QTASSLVE-FPSKEGEVEGLLK---DIAERASGKGNFSYSRFFAVGLFRLLELA 220
A L++ P + + + LL + E +G N ++ F + RL
Sbjct: 153 DGVMRTMNAEKLLKTLPIIQNQHDALLDFEANPNELTNGVINAAFMLLFKDSI-RLFAAY 211
Query: 221 NATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLK--EYVDREKKKREERT 278
N +LEK ++ K LD+Y+ L ++ + E LK E V ++ + T
Sbjct: 212 NEGIINLLEK---YFDMKKNQCREGLDMYKKFLGRMTKLSEFLKVAEQVGIDQGDIPDLT 268
Query: 279 E-PQKANEAIKKCLG 292
+ P EA+++ G
Sbjct: 269 QAPSSLLEALEQHFG 283
>gi|359416122|ref|ZP_09208488.1| phosphoglyceromutase [Candidatus Haloredivivus sp. G17]
gi|358033523|gb|EHK02062.1| phosphoglyceromutase [Candidatus Haloredivivus sp. G17]
Length = 477
Score = 40.4 bits (93), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 42/96 (43%), Gaps = 9/96 (9%)
Query: 51 ASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKR 110
AS + I C + DV P AE ++ +K +++ P TV+ ++R
Sbjct: 133 ASDYDVDVEIHCFADGRDVDPKSAEEYISQIKEWQKDYPGKIATVMGRFYSMDRDHNWER 192
Query: 111 TYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAY 146
T+Q ALG EG+ E REA+ +AY
Sbjct: 193 THQAYDAMALG---------EGFEFENPREAVKKAY 219
>gi|195028406|ref|XP_001987067.1| GH21711 [Drosophila grimshawi]
gi|193903067|gb|EDW01934.1| GH21711 [Drosophila grimshawi]
Length = 1053
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 53/118 (44%), Gaps = 22/118 (18%)
Query: 40 TSLFCHCVRFRASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQEL 99
+S+ C R S+ S + I+ P +AE N +YK +
Sbjct: 85 SSIHFQCRRLDKSAQFSTLAIK-------QNPVLAEAYSNLGNVYK-----------ERG 126
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQY 157
++Q+ L Y+R + P F G++ + L+ + D EA QAYITAL+ +P+ Y
Sbjct: 127 LLQEALDNYRRAVRLKPDFIDGYINLAAALV----AARDMEAAVQAYITALQYNPDLY 180
>gi|345306750|ref|XP_001509380.2| PREDICTED: phosphatidylinositol-binding clathrin assembly
protein-like [Ornithorhynchus anatinus]
Length = 472
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 57/224 (25%), Positives = 91/224 (40%), Gaps = 34/224 (15%)
Query: 59 IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRY--KRTYQY-- 114
+IQC + P +A+T L++R S + V + LI HLM Y +R QY
Sbjct: 18 LIQCTNEMNVNIPQLADT------LFERTANSSWVVVFKALITTHHLMMYGNERFIQYLA 71
Query: 115 --DPVFALGFVTVYDR-LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG- 170
+ +F L D+ M+GY D + Y L E YR+ A + RG
Sbjct: 72 SRNTLFNLN--NYLDKSAMQGY----DMSTFIRRYSRYLNEKALSYRLVAVDFTKMKRGI 125
Query: 171 -------QTASSLVEFPSKEGEVEGLLK---DIAERASGKGNFSYSRFFAVGLFRLLELA 220
T L P + +++ LL + E +G N ++ F + RL
Sbjct: 126 DGVMRTMNTEKLLKTLPIIQNQLDALLDFDANPNELTNGVINAAFMLLFKDSI-RLFAAY 184
Query: 221 NATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLK 264
N +LEK ++ K LD+Y+ L+++ + E LK
Sbjct: 185 NEGIINLLEK---YFDMKKNQCREGLDIYKKFLARMTKLSEFLK 225
>gi|359483284|ref|XP_003632934.1| PREDICTED: mixed-amyrin synthase-like [Vitis vinifera]
Length = 170
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 6/76 (7%)
Query: 194 AERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLL 253
AE + + NF +RF AV L L L EP +E+L +NV ++ R +V R
Sbjct: 40 AEVEAARENFWKNRFLAVLLSSKLSLETVGEPLDMEQLFDAVNVMILNLKRQFEVLR--- 96
Query: 254 SKLLQAKELLKEYVDR 269
++ E +KEYVDR
Sbjct: 97 ---MKDNESIKEYVDR 109
>gi|387015170|gb|AFJ49704.1| Phosphatidylinositol-binding clathrin assembly protein-like
[Crotalus adamanteus]
Length = 654
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 57/224 (25%), Positives = 91/224 (40%), Gaps = 34/224 (15%)
Query: 59 IIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRY--KRTYQY-- 114
+IQC + P +A+T L++R S + V + LI HLM Y +R QY
Sbjct: 45 LIQCTNEMNVNIPQLADT------LFERTANSSWVVVFKALITTHHLMMYGNERFIQYLA 98
Query: 115 --DPVFALGFVTVYDR-LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG- 170
+ +F L D+ M+GY D + Y L E YR+ A + RG
Sbjct: 99 SRNTLFNLN--NYLDKSAMQGY----DMSTFIRRYSRYLNEKALSYRLVAVDFTKMKRGI 152
Query: 171 -------QTASSLVEFPSKEGEVEGLLK---DIAERASGKGNFSYSRFFAVGLFRLLELA 220
T L P + +++ LL + E +G N ++ F + RL
Sbjct: 153 DGVMRTMNTEKLLKTLPIIQNQLDALLDFDANPNELTNGVINAAFMLLFKDSI-RLFAAY 211
Query: 221 NATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLK 264
N +LEK ++ K LD+Y+ L+++ + E LK
Sbjct: 212 NEGIINLLEKY---FDMKKNQCKEGLDIYKKFLARMTKLSEFLK 252
>gi|194767414|ref|XP_001965811.1| GF13981 [Drosophila ananassae]
gi|190625935|gb|EDV41459.1| GF13981 [Drosophila ananassae]
Length = 396
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 52/118 (44%), Gaps = 22/118 (18%)
Query: 40 TSLFCHCVRFRASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQEL 99
+S+ C R S+ S + I+ P +AE N +YK +
Sbjct: 92 SSIHFQCRRLDKSAQFSTLAIK-------QNPVLAEAYSNLGNVYK-----------ERG 133
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQY 157
+Q+ L Y+R + P F G++ + L+ + D E+ QAYITAL+ +PE Y
Sbjct: 134 QLQEALDNYRRAVRLKPDFIDGYINLAAALV----AARDMESAVQAYITALQYNPELY 187
>gi|195382543|ref|XP_002049989.1| GJ20442 [Drosophila virilis]
gi|194144786|gb|EDW61182.1| GJ20442 [Drosophila virilis]
Length = 1050
Score = 38.1 bits (87), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 52/118 (44%), Gaps = 22/118 (18%)
Query: 40 TSLFCHCVRFRASSSSSRMIIQCMSTATDVPPTVAETKMNFLKLYKRPIPSIYNTVLQEL 99
+S+ C R S+ S + I+ P +AE N +YK +
Sbjct: 82 SSIHFQCRRLDKSAQFSTLAIK-------QNPVLAEAYSNLGNVYK-----------ERG 123
Query: 100 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQY 157
+Q+ L Y+R + P F G++ + L+ + D E+ QAYITAL+ +PE Y
Sbjct: 124 QLQEALDNYRRAVRLKPDFIDGYINLAAALV----AARDMESAVQAYITALQYNPELY 177
>gi|421145705|ref|ZP_15605552.1| methylaspartate mutase [Fusobacterium nucleatum subsp. fusiforme
ATCC 51190]
gi|395487895|gb|EJG08803.1| methylaspartate mutase [Fusobacterium nucleatum subsp. fusiforme
ATCC 51190]
Length = 493
Score = 37.7 bits (86), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 26/125 (20%), Positives = 61/125 (48%), Gaps = 15/125 (12%)
Query: 65 TATDVPPTVAETK--MNFLKLYKRPIPSI------YNTVLQE----LIVQQHLMRYKRTY 112
TAT VPP ++ + + L ++ + +I Y + Q+ L +++H+ Y T+
Sbjct: 220 TATLVPPAISNSIQILEILLAVEQGVKNISIGVAQYGNITQDIASLLALKEHIQSYLDTF 279
Query: 113 QYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQT 172
+ + TV+++ + G+P EE + +Y T + + RI + ++E+A+
Sbjct: 280 SFKDI---NISTVFNQWIGGFPEEELKAYSLISYSTTIALFSKTNRIFVKNIDEYAKNSL 336
Query: 173 ASSLV 177
++++
Sbjct: 337 GNTMI 341
>gi|237741274|ref|ZP_04571755.1| methylaspartate mutase [Fusobacterium sp. 4_1_13]
gi|229430806|gb|EEO41018.1| methylaspartate mutase [Fusobacterium sp. 4_1_13]
Length = 485
Score = 37.7 bits (86), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 26/125 (20%), Positives = 61/125 (48%), Gaps = 15/125 (12%)
Query: 65 TATDVPPTVAETK--MNFLKLYKRPIPSI------YNTVLQE----LIVQQHLMRYKRTY 112
TAT VPP ++ + + L ++ + +I Y + Q+ L +++H+ Y T+
Sbjct: 220 TATLVPPAISNSIQILEILLAVEQGVKNISIGVAQYGNITQDIASLLALKEHIQSYLDTF 279
Query: 113 QYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQT 172
+ + TV+++ + G+P EE + +Y T + + RI + ++E+A+
Sbjct: 280 SFKDI---NISTVFNQWIGGFPEEELKAYSLISYSTTIALFSKTNRIFVKNIDEYAKNSL 336
Query: 173 ASSLV 177
++++
Sbjct: 337 GNTMI 341
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.132 0.372
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,133,718,933
Number of Sequences: 23463169
Number of extensions: 162177053
Number of successful extensions: 603455
Number of sequences better than 100.0: 235
Number of HSP's better than 100.0 without gapping: 187
Number of HSP's successfully gapped in prelim test: 48
Number of HSP's that attempted gapping in prelim test: 602957
Number of HSP's gapped (non-prelim): 253
length of query: 299
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 158
effective length of database: 9,050,888,538
effective search space: 1430040389004
effective search space used: 1430040389004
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)