BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 026654
(235 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|356543780|ref|XP_003540338.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Glycine max]
Length = 297
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 194/229 (84%), Positives = 214/229 (93%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
+DVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFALGFVT+Y
Sbjct: 66 TDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFALGFVTIY 125
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q +SLVEF SKEGEV
Sbjct: 126 DKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQKPTSLVEFSSKEGEV 185
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
EG+LKDIAERA GKG FSYSRFFAVGLFRLLELANATEPT+L+KLC LN+NKRSVDRDL
Sbjct: 186 EGILKDIAERAGGKGEFSYSRFFAVGLFRLLELANATEPTILDKLCVALNINKRSVDRDL 245
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYL 231
DVYR LLSKL+QAKELLKEY+DREKKKR+ER EPQKANEAI CLG+ L
Sbjct: 246 DVYRILLSKLVQAKELLKEYIDREKKKRDERAEPQKANEAITTCLGQQL 294
>gi|356549970|ref|XP_003543363.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Glycine max]
Length = 297
Score = 409 bits (1052), Expect = e-112, Method: Compositional matrix adjust.
Identities = 192/230 (83%), Positives = 215/230 (93%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
+DVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFALGFVT+Y
Sbjct: 66 TDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFALGFVTIY 125
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q+ +SLVEF SKEGE
Sbjct: 126 DKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQSPTSLVEFSSKEGEA 185
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
E +LKDIAERA GKG FSYSRFFAVGLFRL+ELANATEPT+L+KLCA LN+NKRSVDRDL
Sbjct: 186 ERILKDIAERAGGKGEFSYSRFFAVGLFRLVELANATEPTILDKLCAALNINKRSVDRDL 245
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLY 232
DVYR LLSKL+QAKELLKEY+DREKKKR+ER EPQKANEAI CLG+ L+
Sbjct: 246 DVYRILLSKLVQAKELLKEYIDREKKKRDERVEPQKANEAITTCLGQQLH 295
>gi|255636566|gb|ACU18621.1| unknown [Glycine max]
Length = 297
Score = 409 bits (1051), Expect = e-112, Method: Compositional matrix adjust.
Identities = 192/230 (83%), Positives = 215/230 (93%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
+DVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFALGFVT+Y
Sbjct: 66 TDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFALGFVTIY 125
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q+ +SLVEF SKEGE
Sbjct: 126 DKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQSPTSLVEFSSKEGEA 185
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
E +LKDIAERA GKG FSYSRFFAVGLFRL+ELANATEPT+L+KLCA LN+NKRSVDRDL
Sbjct: 186 ERILKDIAERAGGKGEFSYSRFFAVGLFRLVELANATEPTILDKLCAALNINKRSVDRDL 245
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLY 232
DVYR LLSKL+QAKELLKEY+DREKKKR+ER EPQKANEAI CLG+ L+
Sbjct: 246 DVYRILLSKLVQAKELLKEYIDREKKKRDERVEPQKANEAITTCLGQQLH 295
>gi|388514959|gb|AFK45541.1| unknown [Medicago truncatula]
Length = 303
Score = 409 bits (1051), Expect = e-112, Method: Compositional matrix adjust.
Identities = 191/233 (81%), Positives = 215/233 (92%), Gaps = 1/233 (0%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
+SD PPTV+ETK+NFLK YKRPIPSIYN+VLQELIVQQHLMRYK++Y+YDPVFALGFVTV
Sbjct: 68 VSD-PPTVSETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKKSYRYDPVFALGFVTV 126
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
YD+LMEGYPS+EDR+AIFQAYI ALKEDP QYR+DAQKLEEWAR Q A+SL+EF S+EGE
Sbjct: 127 YDQLMEGYPSDEDRDAIFQAYINALKEDPAQYRVDAQKLEEWARAQNATSLIEFSSREGE 186
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
VEG LKDIAERA G G+FSYSRFFAVGLFRLLELAN EPT+LEKLC+ LN+NK+SVDRD
Sbjct: 187 VEGTLKDIAERAGGNGDFSYSRFFAVGLFRLLELANTMEPTILEKLCSALNINKKSVDRD 246
Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYSH 234
LDVYRNLLSKL+QAKELLKEY+DREKKK EER EPQKANEAI KCLG+ +S+
Sbjct: 247 LDVYRNLLSKLVQAKELLKEYIDREKKKIEERAEPQKANEAISKCLGQEQFSN 299
>gi|359485791|ref|XP_002275686.2| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Vitis
vinifera]
Length = 299
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 195/232 (84%), Positives = 215/232 (92%), Gaps = 1/232 (0%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
++DVP TV+ETKMNFLK YKRPIPSIYNT+LQEL+VQQHLMRYKRTY+YD VFALGFVTV
Sbjct: 65 VTDVP-TVSETKMNFLKNYKRPIPSIYNTLLQELMVQQHLMRYKRTYRYDAVFALGFVTV 123
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
YD+LM+GYPS+EDR+ IFQ YI AL+EDPEQYR DAQ LEEWAR QTASSLVEF SKEGE
Sbjct: 124 YDQLMDGYPSDEDRDIIFQVYIKALREDPEQYRKDAQMLEEWARSQTASSLVEFSSKEGE 183
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
VEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANATEPT+LEKLCA N++KRSVDRD
Sbjct: 184 VEGILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPTILEKLCAAFNISKRSVDRD 243
Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
LDVYRNLL+KL+QAKELLKEYVDREKKKREER E QKANEAI KCLGEY Y+
Sbjct: 244 LDVYRNLLTKLVQAKELLKEYVDREKKKREERVESQKANEAITKCLGEYEYT 295
>gi|296084957|emb|CBI28372.3| unnamed protein product [Vitis vinifera]
Length = 243
Score = 406 bits (1044), Expect = e-111, Method: Compositional matrix adjust.
Identities = 195/232 (84%), Positives = 215/232 (92%), Gaps = 1/232 (0%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
++DVP TV+ETKMNFLK YKRPIPSIYNT+LQEL+VQQHLMRYKRTY+YD VFALGFVTV
Sbjct: 9 VTDVP-TVSETKMNFLKNYKRPIPSIYNTLLQELMVQQHLMRYKRTYRYDAVFALGFVTV 67
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
YD+LM+GYPS+EDR+ IFQ YI AL+EDPEQYR DAQ LEEWAR QTASSLVEF SKEGE
Sbjct: 68 YDQLMDGYPSDEDRDIIFQVYIKALREDPEQYRKDAQMLEEWARSQTASSLVEFSSKEGE 127
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
VEG+LKDIAERA GKG+FSYSRFFA+GLFRLLELANATEPT+LEKLCA N++KRSVDRD
Sbjct: 128 VEGILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPTILEKLCAAFNISKRSVDRD 187
Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
LDVYRNLL+KL+QAKELLKEYVDREKKKREER E QKANEAI KCLGEY Y+
Sbjct: 188 LDVYRNLLTKLVQAKELLKEYVDREKKKREERVESQKANEAITKCLGEYEYT 239
>gi|255553917|ref|XP_002517999.1| Protein THYLAKOID FORMATION1, chloroplast precursor, putative
[Ricinus communis]
gi|223542981|gb|EEF44517.1| Protein THYLAKOID FORMATION1, chloroplast precursor, putative
[Ricinus communis]
Length = 299
Score = 405 bits (1042), Expect = e-111, Method: Compositional matrix adjust.
Identities = 191/230 (83%), Positives = 213/230 (92%), Gaps = 1/230 (0%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
+DVPPTV+ETK NFL YK+PIPSIYNTVLQELIVQQHLMRYKR+Y+YDPVFALGFVTVY
Sbjct: 69 TDVPPTVSETKFNFLNSYKKPIPSIYNTVLQELIVQQHLMRYKRSYRYDPVFALGFVTVY 128
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LM+GYPS+EDREAIFQAYI AL E+PEQYRIDA+KLE+WAR QT SSLV+F SKEGEV
Sbjct: 129 DQLMQGYPSDEDREAIFQAYINALNEEPEQYRIDAKKLEDWARSQTPSSLVDFSSKEGEV 188
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
EG+LKDIAERA G G+FSYSRFFA+GLFRLLEL+N+TEPTVLEKLCA LN+NKR VDRDL
Sbjct: 189 EGILKDIAERA-GNGSFSYSRFFAIGLFRLLELSNSTEPTVLEKLCAALNINKRGVDRDL 247
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLY 232
DVYRNLLSKL+QAKELLKEYVDREKKK+EER QKANEA+K CLGE L+
Sbjct: 248 DVYRNLLSKLVQAKELLKEYVDREKKKQEERASSQKANEAVKSCLGEALH 297
>gi|449438054|ref|XP_004136805.1| PREDICTED: protein THYLAKOID FORMATION 1, chloroplastic-like
[Cucumis sativus]
gi|449493105|ref|XP_004159194.1| PREDICTED: protein THYLAKOID FORMATION 1, chloroplastic-like
[Cucumis sativus]
Length = 298
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 191/223 (85%), Positives = 207/223 (92%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TVAETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTY+YDPVFALGFVTVYD+LME
Sbjct: 70 TVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLME 129
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GYPS+EDREAIFQAYI AL EDPEQYRIDA+K EEWAR QTA+SLVEF S+EGEVE +LK
Sbjct: 130 GYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILK 189
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRN 187
DIAERA KGNFSYSRFFA+GLFRLLELANATEP++LEKLCA LN++K+ VDRDLDVYRN
Sbjct: 190 DIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRN 249
Query: 188 LLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEY 230
LLSKL+QAKELLKEYVDREKKKR+ER Q ANEAI KCLGEY
Sbjct: 250 LLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEY 292
>gi|356542877|ref|XP_003539891.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Glycine max]
Length = 291
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 189/220 (85%), Positives = 209/220 (95%)
Query: 6 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
PPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKR+Y+YD VFALGFVTVY++L
Sbjct: 65 PPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRSYRYDAVFALGFVTVYEQL 124
Query: 66 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 125
MEGYPS+EDR+AIFQAYI ALKEDPEQYR+DA+KLEEWAR Q +SL+EF S+EGEVEG+
Sbjct: 125 MEGYPSDEDRDAIFQAYIQALKEDPEQYRVDAKKLEEWARSQNPNSLLEFSSREGEVEGI 184
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVY 185
LKDIAERA GKG+FSYSRFFA+GLFRLLELANA EPT+LEKLCAVLNVNKRSVDRDLDVY
Sbjct: 185 LKDIAERAGGKGDFSYSRFFAIGLFRLLELANAMEPTILEKLCAVLNVNKRSVDRDLDVY 244
Query: 186 RNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKK 225
RNLLSKL+QAKELLKEYVDREKKKREER EPQK+NEAI +
Sbjct: 245 RNLLSKLVQAKELLKEYVDREKKKREERAEPQKSNEAITQ 284
>gi|224124656|ref|XP_002319386.1| predicted protein [Populus trichocarpa]
gi|222857762|gb|EEE95309.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 191/227 (84%), Positives = 209/227 (92%), Gaps = 1/227 (0%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
+DVPPTV+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY YDPVF LG VTVY
Sbjct: 67 TDVPPTVSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYLYDPVFGLGLVTVY 126
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LMEGYPS+EDREAIFQAYI ALKEDPEQYRIDA+KLEEWAR QT SSLV+F SKEGE+
Sbjct: 127 DQLMEGYPSDEDREAIFQAYIKALKEDPEQYRIDAKKLEEWARAQTHSSLVDFSSKEGEI 186
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
EG+LK IAERA+ GNFSYSRFFAVGLFRLLEL+NA+EPTVLEKLC+ LN+NKRSVDRDL
Sbjct: 187 EGILKGIAERAAS-GNFSYSRFFAVGLFRLLELSNASEPTVLEKLCSALNINKRSVDRDL 245
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGE 229
DVYR LLSKL+QAKELLKEYVDREKKK+EER E QKANE + KCLG+
Sbjct: 246 DVYRGLLSKLVQAKELLKEYVDREKKKQEERAESQKANEMVAKCLGD 292
>gi|356517586|ref|XP_003527468.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Glycine max]
Length = 291
Score = 399 bits (1026), Expect = e-109, Method: Compositional matrix adjust.
Identities = 188/220 (85%), Positives = 209/220 (95%)
Query: 6 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
PPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMRYKR+Y+YD VFALGFVTVY++L
Sbjct: 65 PPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRSYRYDAVFALGFVTVYEQL 124
Query: 66 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 125
MEGYPS+EDR+AIFQAYI ALKEDPEQYR+DA+KLEEWAR Q +SLV+F S+EGEVEG+
Sbjct: 125 MEGYPSDEDRDAIFQAYIQALKEDPEQYRVDAKKLEEWARAQNPTSLVDFSSREGEVEGI 184
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVY 185
LKDIAERA GKG+FSYSRFFA+GLFRLLELANA EPT+LEKLCAVLNV+KRSVDRDLDVY
Sbjct: 185 LKDIAERAGGKGDFSYSRFFAIGLFRLLELANAMEPTILEKLCAVLNVDKRSVDRDLDVY 244
Query: 186 RNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKK 225
RNLLSKL+QAKELLKEYVDREKKKREER EPQK+NEAI +
Sbjct: 245 RNLLSKLVQAKELLKEYVDREKKKREERAEPQKSNEAITQ 284
>gi|388496070|gb|AFK36101.1| unknown [Lotus japonicus]
Length = 298
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 189/232 (81%), Positives = 212/232 (91%), Gaps = 1/232 (0%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
+SD PP V+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLMR+KR+Y+YDPVFALGFVTV
Sbjct: 66 VSD-PPPVSETKLNFLKEYKRPIPSIYNTVLQELIVQQHLMRFKRSYRYDPVFALGFVTV 124
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
Y++LMEGYPS+EDR+AIFQ YI ALKEDP QYR DAQKLEEWAR Q+++SL+EF S+EGE
Sbjct: 125 YEQLMEGYPSDEDRDAIFQTYIKALKEDPGQYREDAQKLEEWARTQSSTSLIEFSSREGE 184
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
VEG LKDIAERA GKG+FSYSRFFA+GLFRLLEL NA EP +LEKLCA LNV+KRSVDRD
Sbjct: 185 VEGALKDIAERAGGKGDFSYSRFFAIGLFRLLELGNAMEPAILEKLCAALNVDKRSVDRD 244
Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
LDVYRNLLSKL+QAKELLKEY DREKKK+EER EPQKANEAI KCLG+ +S
Sbjct: 245 LDVYRNLLSKLVQAKELLKEYADREKKKQEERAEPQKANEAITKCLGQEQFS 296
>gi|224146717|ref|XP_002326111.1| predicted protein [Populus trichocarpa]
gi|222862986|gb|EEF00493.1| predicted protein [Populus trichocarpa]
Length = 296
Score = 395 bits (1015), Expect = e-108, Method: Compositional matrix adjust.
Identities = 187/227 (82%), Positives = 211/227 (92%), Gaps = 1/227 (0%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
+DVPPTVA+TK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YK+T++YDPVF LGFVTVY
Sbjct: 65 TDVPPTVADTKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKKTFRYDPVFGLGFVTVY 124
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LMEGYPS+EDREAIFQAYI AL+EDPEQYRIDA+KLEEWAR QT SSLV+F S+EGE+
Sbjct: 125 DQLMEGYPSDEDREAIFQAYIKALEEDPEQYRIDAKKLEEWARAQTPSSLVDFSSREGEI 184
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
EG LKDIAER + GNFSYSRFFAVGLFRLLEL+NA+EPTVLEKLC+ LN+NKRSVDRDL
Sbjct: 185 EGTLKDIAERVAS-GNFSYSRFFAVGLFRLLELSNASEPTVLEKLCSALNINKRSVDRDL 243
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGE 229
DVYR LLSKL+QA+ELLKEYVDREKKK+EER E QKA+E + KCLGE
Sbjct: 244 DVYRGLLSKLVQARELLKEYVDREKKKQEERAESQKASETVTKCLGE 290
>gi|242050546|ref|XP_002463017.1| hypothetical protein SORBIDRAFT_02g036270 [Sorghum bicolor]
gi|241926394|gb|EER99538.1| hypothetical protein SORBIDRAFT_02g036270 [Sorghum bicolor]
Length = 284
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 188/230 (81%), Positives = 208/230 (90%), Gaps = 1/230 (0%)
Query: 4 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
DVPPTVAETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYDPVF LGFVTVYD
Sbjct: 53 DVPPTVAETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDPVFGLGFVTVYD 112
Query: 64 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
+LMEGYPS EDR++IF+AYITAL EDP QYR DA K+EEWAR Q ASSLV+F S++GE+E
Sbjct: 113 QLMEGYPSNEDRDSIFRAYITALNEDPTQYRADALKMEEWARSQNASSLVDFSSRDGEIE 172
Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
+LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+KLC LNV+KRSVDRDLD
Sbjct: 173 AILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKLCTALNVSKRSVDRDLD 232
Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
VYRN+LSKL+QAKELLKEYVDREKKKREER+E K NEA+ K G LYS
Sbjct: 233 VYRNILSKLVQAKELLKEYVDREKKKREERSETPKPNEAVTKFDGN-LYS 281
>gi|357122407|ref|XP_003562907.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like
[Brachypodium distachyon]
Length = 286
Score = 388 bits (997), Expect = e-106, Method: Compositional matrix adjust.
Identities = 183/226 (80%), Positives = 204/226 (90%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
+D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYDPVFALGFVTVY
Sbjct: 54 ADIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYDPVFALGFVTVY 113
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LMEGYPS EDR+AIF++YITAL EDPEQYR DAQK+EEWAR Q S LVEF S++GE+
Sbjct: 114 DQLMEGYPSNEDRDAIFKSYITALNEDPEQYRADAQKMEEWARAQNGSLLVEFSSRDGEI 173
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
E +LKDI+ERA G GNFSYSRFFAVGLFRLLELANATEPTVL+KLCA LN+NKRSVDRDL
Sbjct: 174 EAVLKDISERAQGNGNFSYSRFFAVGLFRLLELANATEPTVLDKLCAALNINKRSVDRDL 233
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLG 228
D+YRNLLSKL+QAKELLKEY+DREKKKREER E K NE + K G
Sbjct: 234 DIYRNLLSKLVQAKELLKEYIDREKKKREERLETPKPNEPVAKFDG 279
>gi|297832696|ref|XP_002884230.1| hypothetical protein ARALYDRAFT_900469 [Arabidopsis lyrata subsp.
lyrata]
gi|297330070|gb|EFH60489.1| hypothetical protein ARALYDRAFT_900469 [Arabidopsis lyrata subsp.
lyrata]
Length = 298
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 183/232 (78%), Positives = 213/232 (91%), Gaps = 1/232 (0%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
++DVPP V+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPVFALGFVTV
Sbjct: 60 VTDVPP-VSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFVTV 118
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
YD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV+F S++GE
Sbjct: 119 YDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSRQGE 178
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
+E LLKDIA RA+ K FSYSRFFAVGLFRLLELA+AT+PTVL+KLCA LN+NK+SVDRD
Sbjct: 179 IEALLKDIAGRAASKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSVDRD 238
Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
LDVYRNLLSKL+QAKELL+EYV+REKKK+ ER E QKANE I KCLG+ LY+
Sbjct: 239 LDVYRNLLSKLVQAKELLREYVEREKKKQGERAESQKANETISKCLGDTLYN 290
>gi|397702097|gb|AFO59570.1| chloroplast Ptr ToxA-binding protein [Saccharum hybrid cultivar
GT28]
Length = 284
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 185/230 (80%), Positives = 207/230 (90%), Gaps = 1/230 (0%)
Query: 4 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
DVPPTV+ETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYDPVF LGFVTVYD
Sbjct: 53 DVPPTVSETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDPVFGLGFVTVYD 112
Query: 64 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
+LMEGYPS EDR++IF+ YITAL EDP+QYR DA K+EEWAR Q SSLV+F S++GE+E
Sbjct: 113 QLMEGYPSNEDRDSIFRTYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSSRDGEIE 172
Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
+LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+KLC LNV+KRSVDRDLD
Sbjct: 173 AILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKLCTALNVSKRSVDRDLD 232
Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
VYRN+LSKL+QAKELLKEYVDREKKKREER+E K NEA+ K G LYS
Sbjct: 233 VYRNILSKLVQAKELLKEYVDREKKKREERSETPKPNEAVTKFDGN-LYS 281
>gi|293333399|ref|NP_001168867.1| uncharacterized protein LOC100382672 [Zea mays]
gi|223973419|gb|ACN30897.1| unknown [Zea mays]
Length = 284
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 187/231 (80%), Positives = 207/231 (89%), Gaps = 1/231 (0%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
SDVPPTV ETK+NFLK YKRPIPSIY+TVLQEL+VQQHLMRYKRTYQYD VFALGFVTVY
Sbjct: 52 SDVPPTVGETKLNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKRTYQYDAVFALGFVTVY 111
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LMEGYPS EDR++IF+AYITAL EDP QYR DA K+E WAR Q SSLV+F S++GE+
Sbjct: 112 DQLMEGYPSIEDRDSIFKAYITALNEDPNQYRADALKMEGWARSQNGSSLVDFSSRDGEI 171
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
E +LKDI+ERA GKGNFSYSRFFAVGLFRLLELANATEPTVL+KLCA LN+NKRSVDRDL
Sbjct: 172 ESILKDISERAKGKGNFSYSRFFAVGLFRLLELANATEPTVLDKLCAALNINKRSVDRDL 231
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
DVYRN+LSKL+QAKELLKEYVDREKKKREER+E K NEA+ K G LYS
Sbjct: 232 DVYRNILSKLVQAKELLKEYVDREKKKREERSETPKPNEAVTKFDGN-LYS 281
>gi|21592994|gb|AAM64943.1| unknown [Arabidopsis thaliana]
gi|58761181|gb|AAW82331.1| chloroplast thylakoid formation 1 [Arabidopsis thaliana]
Length = 300
Score = 386 bits (991), Expect = e-105, Method: Compositional matrix adjust.
Identities = 182/233 (78%), Positives = 211/233 (90%), Gaps = 1/233 (0%)
Query: 1 MISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVT 60
+ +DVPP V+ETK FLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPVFALGFVT
Sbjct: 60 VTADVPP-VSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFVT 118
Query: 61 VYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG 120
VYD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV+F SKEG
Sbjct: 119 VYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSKEG 178
Query: 121 EVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
++E +LKDIA RA K FSYSRFFAVGLFRLLELA+AT+PTVL+KLCA LN+NK+SVDR
Sbjct: 179 DIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSVDR 238
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
DLDVYRNLLSKL+QA ELLKEYV+REKKK+EER + QKANE I KCLG+ LY+
Sbjct: 239 DLDVYRNLLSKLVQANELLKEYVEREKKKQEERAQSQKANETISKCLGDTLYN 291
>gi|212720892|ref|NP_001131923.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
gi|194692932|gb|ACF80550.1| unknown [Zea mays]
gi|195644742|gb|ACG41839.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
gi|414887096|tpg|DAA63110.1| TPA: chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
Length = 284
Score = 386 bits (991), Expect = e-105, Method: Compositional matrix adjust.
Identities = 184/230 (80%), Positives = 208/230 (90%), Gaps = 1/230 (0%)
Query: 4 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFALGFVTVYD
Sbjct: 53 DVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFALGFVTVYD 112
Query: 64 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
+LMEGYPS EDR++IF+AYITAL EDP+QYR DA K+EEWAR Q SSLV+F S++GE+E
Sbjct: 113 QLMEGYPSNEDRDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSSRDGEIE 172
Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
+LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPT+L+KLCA LNV+KRSVDRDLD
Sbjct: 173 AILKDISERAKGKGNFSYSRFFAVGLFRLLELSNATEPTILDKLCAALNVSKRSVDRDLD 232
Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
VYRN+LSKL+QAKELLKEYVDREKKKREER+E K NEA+ K G LYS
Sbjct: 233 VYRNILSKLVQAKELLKEYVDREKKKREERSEAPKPNEAVTKFDGN-LYS 281
>gi|18399513|ref|NP_565491.1| protein THYLAKOID FORMATION 1 [Arabidopsis thaliana]
gi|75206547|sp|Q9SKT0.1|THF1_ARATH RecName: Full=Protein THYLAKOID FORMATION 1, chloroplastic; Flags:
Precursor
gi|4454459|gb|AAD20906.1| expressed protein [Arabidopsis thaliana]
gi|17065446|gb|AAL32877.1| Unknown protein [Arabidopsis thaliana]
gi|20148535|gb|AAM10158.1| unknown protein [Arabidopsis thaliana]
gi|330251998|gb|AEC07092.1| protein THYLAKOID FORMATION 1 [Arabidopsis thaliana]
Length = 300
Score = 385 bits (988), Expect = e-105, Method: Compositional matrix adjust.
Identities = 182/233 (78%), Positives = 211/233 (90%), Gaps = 1/233 (0%)
Query: 1 MISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVT 60
+ +DVPP V+ETK FLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPVFALGFVT
Sbjct: 60 VTADVPP-VSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFVT 118
Query: 61 VYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG 120
VYD+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV+F SKEG
Sbjct: 119 VYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSKEG 178
Query: 121 EVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
++E +LKDIA RA K FSYSRFFAVGLFRLLELA+AT+PTVL+KLCA LN+NK+SVDR
Sbjct: 179 DIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSVDR 238
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
DLDVYRNLLSKL+QAKELLKEYV+REKKK+ ER + QKANE I KCLG+ LY+
Sbjct: 239 DLDVYRNLLSKLVQAKELLKEYVEREKKKQGERAQSQKANETISKCLGDTLYN 291
>gi|326493802|dbj|BAJ85363.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 286
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 180/226 (79%), Positives = 204/226 (90%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYDPVFALGFVTVY
Sbjct: 54 GDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYDPVFALGFVTVY 113
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q + LVEF S++GE+
Sbjct: 114 DQLMEGYPSNEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGNLLVEFSSRDGEI 173
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+KLCA LN+NK+SVDRDL
Sbjct: 174 ESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKLCAALNINKKSVDRDL 233
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLG 228
DVYRNLLSKL+QAKELLKEYV+REKKKR ER E K NEA+ K G
Sbjct: 234 DVYRNLLSKLVQAKELLKEYVEREKKKRAERLETPKPNEAVAKFDG 279
>gi|195653795|gb|ACG46365.1| chloroplast-localized Ptr ToxA-binding protein1 [Zea mays]
Length = 284
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 182/230 (79%), Positives = 207/230 (90%), Gaps = 1/230 (0%)
Query: 4 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFALGFVTVYD
Sbjct: 53 DVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFALGFVTVYD 112
Query: 64 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
+LME YPS ED+++IF+AYITAL EDP+QYR DA K+EEWAR Q SSLV+F S++GE+E
Sbjct: 113 QLMERYPSNEDKDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSSRDGEIE 172
Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
+LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPT+L+KLCA LNV+KRSVDRDLD
Sbjct: 173 AILKDISERAKGKGNFSYSRFFAVGLFRLLELSNATEPTILDKLCAALNVSKRSVDRDLD 232
Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
VYRN+LSKL+QAKELLKEYVDREKKKREER+E K NEA+ K G LYS
Sbjct: 233 VYRNILSKLVQAKELLKEYVDREKKKREERSEAPKPNEAVTKFDGN-LYS 281
>gi|75140959|sp|Q7XAB8.1|THF1_SOLTU RecName: Full=Protein THYLAKOID FORMATION1, chloroplastic; Flags:
Precursor
gi|33469614|gb|AAQ19850.1| light-regulated chloroplast-localized protein [Solanum tuberosum]
Length = 293
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 179/227 (78%), Positives = 201/227 (88%), Gaps = 1/227 (0%)
Query: 7 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
PTVA+TK+ FL YKRPIP++YNTVLQELIVQQHL RYK++YQYDPVFALGFVTVYD+LM
Sbjct: 66 PTVADTKLKFLTAYKRPIPTVYNTVLQELIVQQHLTRYKKSYQYDPVFALGFVTVYDQLM 125
Query: 67 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 126
EGYPSEEDR AIF+AYI ALKEDPEQYR DAQKLEEWAR Q A++LV+F SKEGE+E +
Sbjct: 126 EGYPSEEDRNAIFKAYIEALKEDPEQYRADAQKLEEWARTQNANTLVDFSSKEGEIENIF 185
Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYR 186
KDIA+RA K F YSR FAVGLFRLLELAN T+PT+LEKLCA LNVNK+SVDRDLDVYR
Sbjct: 186 KDIAQRAGTKDGFCYSRLFAVGLFRLLELANVTDPTILEKLCAALNVNKKSVDRDLDVYR 245
Query: 187 NLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
NLLSKL+QAKELLKEYV+REKKKR ER E QKANE + KCLG+Y Y+
Sbjct: 246 NLLSKLVQAKELLKEYVEREKKKRGER-ETQKANETVTKCLGDYQYA 291
>gi|157142955|gb|ABV24460.1| chloroplast-localized protein [Nicotiana benthamiana]
Length = 295
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 180/233 (77%), Positives = 209/233 (89%), Gaps = 2/233 (0%)
Query: 1 MISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVT 60
M +D+P TVAETKMNFLK YKRPIP++YNTVLQELIVQQHL++YK++Y+YDPVFALGFVT
Sbjct: 63 MSTDLP-TVAETKMNFLKAYKRPIPTVYNTVLQELIVQQHLIKYKKSYRYDPVFALGFVT 121
Query: 61 VYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG 120
VYD+LMEGYPSEEDR+AIF+AYI AL EDP QYR DAQK EEWAR Q A++LV+F S++G
Sbjct: 122 VYDQLMEGYPSEEDRDAIFKAYIEALNEDPVQYRADAQKFEEWARTQNANTLVDFSSRDG 181
Query: 121 EVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
EVE +LKDIA+RA K +F YSR FAVGLFRLLELAN T+PT+LEKLCA LN+NK+SVDR
Sbjct: 182 EVENILKDIAQRAGTKDSFCYSRLFAVGLFRLLELANVTDPTILEKLCASLNINKKSVDR 241
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYS 233
DLDVYRNLLSKL+QAKELLKEYV+REKKKR ER E QKANEA+ KCLG+Y Y+
Sbjct: 242 DLDVYRNLLSKLVQAKELLKEYVEREKKKRGER-ESQKANEAVTKCLGDYQYA 293
>gi|125558787|gb|EAZ04323.1| hypothetical protein OsI_26464 [Oryza sativa Indica Group]
Length = 287
Score = 372 bits (956), Expect = e-101, Method: Compositional matrix adjust.
Identities = 186/225 (82%), Positives = 207/225 (92%)
Query: 4 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
DVPPTVAETKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYD VFALGFVTVYD
Sbjct: 56 DVPPTVAETKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKTTYQYDAVFALGFVTVYD 115
Query: 64 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
+LMEGYPS EDR+AIF+AYITAL EDPEQYR DAQK+EEWAR Q +SLVEF SK+GE+E
Sbjct: 116 QLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGNSLVEFSSKDGEIE 175
Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
+LKDI+ERA GKG+FSYSRFFAVGLFRLLELANATEPT+L+KLCA LN+NKRSVDRDLD
Sbjct: 176 AILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKLCAALNINKRSVDRDLD 235
Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLG 228
VYRN+LSKL+QAKELLKEYV+REKKKREER+E K+NEA+ K G
Sbjct: 236 VYRNILSKLVQAKELLKEYVEREKKKREERSETPKSNEAVTKFDG 280
>gi|388506988|gb|AFK41560.1| unknown [Medicago truncatula]
Length = 287
Score = 372 bits (954), Expect = e-101, Method: Compositional matrix adjust.
Identities = 176/222 (79%), Positives = 201/222 (90%), Gaps = 1/222 (0%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
++DVP +V+ETK+NFLK YKRPIPSIYN VLQELIVQ HLMRYK +YQYD VFALGFVTV
Sbjct: 65 VTDVP-SVSETKLNFLKAYKRPIPSIYNNVLQELIVQHHLMRYKTSYQYDSVFALGFVTV 123
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
YD+LMEGY SEE+R+ IF+AYI ALKEDPEQYRIDA+KLE+WA+ Q + SLVEF S+EGE
Sbjct: 124 YDKLMEGYSSEEERDTIFKAYINALKEDPEQYRIDAKKLEDWAKAQNSISLVEFSSREGE 183
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
VEG+LKDIA+RA KG FSYSRFFAVGLFRLLELANATEPT+L+KLCA LN++KRSVDRD
Sbjct: 184 VEGVLKDIAKRAGEKGEFSYSRFFAVGLFRLLELANATEPTILDKLCAALNIDKRSVDRD 243
Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAI 223
LDVYR LLSKL+QAKEL +E++DREKKKREER EPQKAN AI
Sbjct: 244 LDVYRMLLSKLVQAKELQREFIDREKKKREERVEPQKANGAI 285
>gi|115472755|ref|NP_001059976.1| Os07g0558500 [Oryza sativa Japonica Group]
gi|75147522|sp|Q84PB7.1|THF1_ORYSJ RecName: Full=Protein THYLAKOID FORMATION1, chloroplastic; Flags:
Precursor
gi|29367385|gb|AAO72565.1| inositol phosphatase-like protein [Oryza sativa Japonica Group]
gi|34394010|dbj|BAC84034.1| inositol phosphatase-like protein [Oryza sativa Japonica Group]
gi|113611512|dbj|BAF21890.1| Os07g0558500 [Oryza sativa Japonica Group]
gi|125600704|gb|EAZ40280.1| hypothetical protein OsJ_24722 [Oryza sativa Japonica Group]
gi|215694285|dbj|BAG89278.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 287
Score = 368 bits (944), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 185/225 (82%), Positives = 206/225 (91%)
Query: 4 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
DVPPTVAETKMNFLK YKRPI SIY+TVLQEL+VQQHLMRYK TYQYD VFALGFVTVYD
Sbjct: 56 DVPPTVAETKMNFLKSYKRPILSIYSTVLQELLVQQHLMRYKTTYQYDAVFALGFVTVYD 115
Query: 64 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
+LMEGYPS EDR+AIF+AYITAL EDPEQYR DAQK+EEWAR Q +SLVEF SK+GE+E
Sbjct: 116 QLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGNSLVEFSSKDGEIE 175
Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
+LKDI+ERA GKG+FSYSRFFAVGLFRLLELANATEPT+L+KLCA LN+NKRSVDRDLD
Sbjct: 176 AILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKLCAALNINKRSVDRDLD 235
Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLG 228
VYRN+LSKL+QAKELLKEYV+REKKKREER+E K+NEA+ K G
Sbjct: 236 VYRNILSKLVQAKELLKEYVEREKKKREERSETPKSNEAVTKFDG 280
>gi|217073200|gb|ACJ84959.1| unknown [Medicago truncatula]
Length = 287
Score = 368 bits (944), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 175/222 (78%), Positives = 200/222 (90%), Gaps = 1/222 (0%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
++DVP +V+ETK+NFLK YKRPIPSIYN VLQELIVQ HLMRYK +YQYD VFALGFVTV
Sbjct: 65 VTDVP-SVSETKLNFLKAYKRPIPSIYNNVLQELIVQHHLMRYKTSYQYDSVFALGFVTV 123
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
YD+LMEGY SEE+R+ IF+AYI ALKEDPEQYRIDA+KLE+WA+ Q + SLVEF S+E E
Sbjct: 124 YDKLMEGYSSEEERDTIFKAYINALKEDPEQYRIDAKKLEDWAKAQNSISLVEFSSRERE 183
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
VEG+LKDIA+RA KG FSYSRFFAVGLFRLLELANATEPT+L+KLCA LN++KRSVDRD
Sbjct: 184 VEGVLKDIAKRAGEKGEFSYSRFFAVGLFRLLELANATEPTILDKLCAALNIDKRSVDRD 243
Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAI 223
LDVYR LLSKL+QAKEL +E++DREKKKREER EPQKAN AI
Sbjct: 244 LDVYRMLLSKLVQAKELQREFIDREKKKREERVEPQKANGAI 285
>gi|52548246|gb|AAU82110.1| chloroplast inositol phosphatase-like protein [Triticum aestivum]
Length = 286
Score = 365 bits (936), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 180/227 (79%), Positives = 204/227 (89%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYDPVFALGFVTVY
Sbjct: 54 GDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYDPVFALGFVTVY 113
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q + LVEF S++GE+
Sbjct: 114 DQLMEGYPSTEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGNLLVEFSSRDGEI 173
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+KLCA LN+NK+SVDRDL
Sbjct: 174 ESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKLCAALNINKKSVDRDL 233
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGE 229
DVYRNLLSKL+QAKELLKEY+ REKKKREER E K NEA+ K G
Sbjct: 234 DVYRNLLSKLVQAKELLKEYIKREKKKREERLETPKPNEAVAKFDGS 280
>gi|38570261|gb|AAR24582.1| chloroplast-localized Ptr ToxA-binding protein1 [Triticum aestivum]
gi|81239115|gb|ABB60085.1| chloroplast-localized Ptr ToxA-binding protein1 [Triticum aestivum]
Length = 286
Score = 365 bits (936), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 180/227 (79%), Positives = 205/227 (90%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYDPVFALGFVTVY
Sbjct: 54 GDIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYDPVFALGFVTVY 113
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q + LVEF S++GE+
Sbjct: 114 DQLMEGYPSTEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGNLLVEFSSRDGEI 173
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
E +LKDI+ERA GKGNFSYSRFFAVGLFRLLEL+NATEPTVL+KLCA LN+NK+SVDRDL
Sbjct: 174 ESILKDISERAQGKGNFSYSRFFAVGLFRLLELSNATEPTVLDKLCAALNINKKSVDRDL 233
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGE 229
DVYRNLLSKL+QAKELLKEY++REKKKREER E K NEA+ K G
Sbjct: 234 DVYRNLLSKLVQAKELLKEYIEREKKKREERLETPKPNEAVAKFDGS 280
>gi|157849728|gb|ABV89647.1| chloroplast light-regulated protein [Brassica rapa]
Length = 273
Score = 358 bits (919), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 171/213 (80%), Positives = 195/213 (91%), Gaps = 1/213 (0%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
++DVPP V+ETK NFLK YKRPIPSIYNTVLQELIVQQHLMRYKRTY+YDPVFALGFVTV
Sbjct: 62 VTDVPP-VSETKSNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTV 120
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
YD+LM+GYPS++DR++IFQAY+ AL E P+QYRIDAQK+EEWAR QT++SLV+F KEGE
Sbjct: 121 YDQLMDGYPSDQDRDSIFQAYVEALNEVPKQYRIDAQKMEEWARSQTSASLVDFSFKEGE 180
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRD 181
VE +LKDI+ERA K FSYSRFFAVGLFRLLELA AT+PTVL+KLCA LN+NK+SVDRD
Sbjct: 181 VEAILKDISERAGSKEGFSYSRFFAVGLFRLLELAGATDPTVLDKLCASLNINKKSVDRD 240
Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERT 214
LDVYRNLLSKL+QAKELLKEYV+REKKKR ER
Sbjct: 241 LDVYRNLLSKLVQAKELLKEYVEREKKKRGERA 273
>gi|116782547|gb|ABK22548.1| unknown [Picea sitchensis]
Length = 304
Score = 352 bits (904), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 172/229 (75%), Positives = 196/229 (85%), Gaps = 1/229 (0%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
SD+P TVAETK FLK YKRPIPSIYN V+QELIVQQHLMRYKRTYQYD VFALGFV+VY
Sbjct: 77 SDIP-TVAETKSAFLKAYKRPIPSIYNNVIQELIVQQHLMRYKRTYQYDAVFALGFVSVY 135
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LM+GYPS+ D EAIF+AYI ALKEDPEQYR DA+KLEEWA Q A S+VEF S++GEV
Sbjct: 136 DQLMDGYPSDGDSEAIFRAYINALKEDPEQYRSDAKKLEEWASSQDAKSIVEFQSRDGEV 195
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
EG+LKDIAERA K FSYSRFFA+GLFRLLE ANAT+P VLEKLC LN++K SVDRDL
Sbjct: 196 EGILKDIAERAREKKIFSYSRFFAIGLFRLLERANATDPVVLEKLCGALNISKPSVDRDL 255
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYL 231
D+YRN+LSKL+Q+KELLKEYV+REKKKR ER QK++EA+ K YL
Sbjct: 256 DIYRNILSKLVQSKELLKEYVEREKKKRTERESNQKSSEAVAKIESTYL 304
>gi|302807588|ref|XP_002985488.1| hypothetical protein SELMODRAFT_122474 [Selaginella moellendorffii]
gi|302810785|ref|XP_002987083.1| hypothetical protein SELMODRAFT_125247 [Selaginella moellendorffii]
gi|300145248|gb|EFJ11926.1| hypothetical protein SELMODRAFT_125247 [Selaginella moellendorffii]
gi|300146694|gb|EFJ13362.1| hypothetical protein SELMODRAFT_122474 [Selaginella moellendorffii]
Length = 206
Score = 310 bits (793), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 146/199 (73%), Positives = 172/199 (86%)
Query: 7 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
PTVA+TK FLK +++PIPSIYN VLQEL+VQQHLMRY TY+YD VFALGFVTVYD+LM
Sbjct: 3 PTVADTKSAFLKAFRKPIPSIYNNVLQELLVQQHLMRYNATYKYDAVFALGFVTVYDQLM 62
Query: 67 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 126
+GYP+ +D EAIF+AYI AL EDP+QYR DA+KLEEWA QTASSL F S +G+VE +L
Sbjct: 63 DGYPNAQDSEAIFKAYIEALGEDPDQYRKDAKKLEEWASSQTASSLASFNSGDGDVEEVL 122
Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYR 186
KDIA+RA+GK +F YSRFFAVGLFRL+E ANA++P VLEKLC LNV+K SVDRDLDVYR
Sbjct: 123 KDIAQRAAGKTSFHYSRFFAVGLFRLVERANASDPAVLEKLCNALNVSKMSVDRDLDVYR 182
Query: 187 NLLSKLLQAKELLKEYVDR 205
NLL+KL QAK+LLKEY+DR
Sbjct: 183 NLLTKLSQAKDLLKEYIDR 201
>gi|168043272|ref|XP_001774109.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674516|gb|EDQ61023.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 215
Score = 308 bits (789), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 144/210 (68%), Positives = 175/210 (83%)
Query: 1 MISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVT 60
M+ PTVA+TK++F+K Y++PIPSIY+ V+QEL+VQQHLMRY TY YDP+FALGFVT
Sbjct: 1 MVRADVPTVADTKLSFIKSYRKPIPSIYSNVIQELLVQQHLMRYNSTYVYDPIFALGFVT 60
Query: 61 VYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG 120
VYD+LM+GYP++EDR+AIF+AYI+AL EDPEQYR D++KLEEWA Q+ S + +F K+G
Sbjct: 61 VYDQLMDGYPNDEDRDAIFKAYISALNEDPEQYRKDSKKLEEWAAAQSGSGIADFAGKDG 120
Query: 121 EVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
EVE LKDIAERA+GK F YSRFFA+GLFRLLE A A++P VLE L LNV+KRSVDR
Sbjct: 121 EVEAALKDIAERAAGKEKFHYSRFFAIGLFRLLECAKASDPAVLETLSKALNVSKRSVDR 180
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKR 210
DLDVYRNLLSKL Q KEL+KEYVDR +R
Sbjct: 181 DLDVYRNLLSKLAQGKELIKEYVDRWVIRR 210
>gi|168037112|ref|XP_001771049.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677737|gb|EDQ64204.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 205
Score = 295 bits (754), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 137/199 (68%), Positives = 162/199 (81%)
Query: 7 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
PTV+ETK +F+K Y++PIPSIY+ V+QEL+VQQHLMRY TY YDP+FALGFVTVYD+LM
Sbjct: 7 PTVSETKASFIKSYRKPIPSIYSNVIQELLVQQHLMRYNSTYTYDPIFALGFVTVYDQLM 66
Query: 67 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 126
+GYP DR++IF AYI AL EDP +YR DA+KLEEWA Q+AS + +F S++GEVE L
Sbjct: 67 DGYPDATDRDSIFTAYINALNEDPVKYREDAKKLEEWASAQSASGITDFTSRDGEVEATL 126
Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYR 186
K IAERA K F YSRFFA+GLFRLLE A A++P VLE L LNVNKRSVDRDLDVYR
Sbjct: 127 KSIAERAGSKDKFHYSRFFAIGLFRLLECAKASDPAVLESLSKALNVNKRSVDRDLDVYR 186
Query: 187 NLLSKLLQAKELLKEYVDR 205
NLLSKL Q KEL+KEY +R
Sbjct: 187 NLLSKLAQGKELIKEYNER 205
>gi|414887097|tpg|DAA63111.1| TPA: hypothetical protein ZEAMMB73_220735 [Zea mays]
Length = 207
Score = 268 bits (684), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 122/154 (79%), Positives = 139/154 (90%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
DVPPTVAETK+NFLK YKRPIPSIY+ VLQEL+VQQHLMRYK+TYQYD VFALGFVTVY
Sbjct: 52 GDVPPTVAETKLNFLKSYKRPIPSIYSAVLQELLVQQHLMRYKKTYQYDAVFALGFVTVY 111
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LMEGYPS EDR++IF+AYITAL EDP+QYR DA K+EEWAR Q SSLV+F S++GE+
Sbjct: 112 DQLMEGYPSNEDRDSIFKAYITALNEDPDQYRADALKMEEWARSQNGSSLVDFSSRDGEI 171
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA 156
E +LKDI+ERA GKGNFSYSRFFAVGLFRLL+ A
Sbjct: 172 EAILKDISERAKGKGNFSYSRFFAVGLFRLLDFA 205
>gi|217072610|gb|ACJ84665.1| unknown [Medicago truncatula]
gi|388509564|gb|AFK42848.1| unknown [Medicago truncatula]
Length = 219
Score = 266 bits (681), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 123/153 (80%), Positives = 138/153 (90%), Gaps = 1/153 (0%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
+SD PPTV+ETK+NFLK YKRPIPSIYN+VLQELIVQQHLMRYK++Y+YDPVFALGFVTV
Sbjct: 68 VSD-PPTVSETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKKSYRYDPVFALGFVTV 126
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
YD+LMEGYPS+EDR+AIFQAYI ALKEDP QYR+DAQKLEEWAR Q A+SL+EF S+E E
Sbjct: 127 YDQLMEGYPSDEDRDAIFQAYINALKEDPAQYRVDAQKLEEWARAQNATSLIEFSSRERE 186
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLE 154
VEG LKDIAERA G G+FSYSRFFAVG F L
Sbjct: 187 VEGTLKDIAERAGGNGDFSYSRFFAVGFFDFLS 219
>gi|384250113|gb|EIE23593.1| photosystem II biogenesis protein Psp29 [Coccomyxa subellipsoidea
C-169]
Length = 290
Score = 214 bits (545), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 110/215 (51%), Positives = 152/215 (70%), Gaps = 4/215 (1%)
Query: 6 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
PPTVAETK NF + + RPIP IY+ V+QEL+VQ H+MRY ++Y YD VF LGFV+V+D++
Sbjct: 65 PPTVAETKRNFYEAFSRPIPGIYSNVIQELLVQHHIMRYNKSYSYDEVFGLGFVSVFDQV 124
Query: 66 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG-EVEG 124
+EG P E D+ A+F AYI +L E+ +QYR DA+K+E A+ + + ++ P EG E++
Sbjct: 125 LEGLP-EGDKGALFSAYIGSLGENGDQYRQDAEKVEALAKELSGPAELK-PDAEGSELQK 182
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDV 184
L IAER+S +GNF Y++FFA+GLFRLLEL A +P LE L + + + + SV RDL
Sbjct: 183 KLASIAERSS-QGNFLYTKFFAIGLFRLLELTGAKDPKALEGLVSAMKIPQESVSRDLMT 241
Query: 185 YRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
Y+ +LSKL AK+L+ E REKKK ER +KA
Sbjct: 242 YKGVLSKLSAAKDLMNEMYAREKKKAAEREAEKKA 276
>gi|159471025|ref|XP_001693657.1| inositol phosphatase-like protein [Chlamydomonas reinhardtii]
gi|158283160|gb|EDP08911.1| inositol phosphatase-like protein [Chlamydomonas reinhardtii]
Length = 266
Score = 208 bits (530), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 111/208 (53%), Positives = 148/208 (71%), Gaps = 3/208 (1%)
Query: 6 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
PPTVAETK FL Y +PI SIY+TVLQEL+VQQH MRY + YQY+P+FALGFV+VY+++
Sbjct: 42 PPTVAETKAKFLSGYNKPIASIYSTVLQELLVQQHFMRYSKNYQYNPIFALGFVSVYEQI 101
Query: 66 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 125
+E S E+R AIF+AY+ AL ED ++Y+ DA LE+ A G T SL P+ +G
Sbjct: 102 LESL-SAEERGAIFKAYVDALGEDADKYKRDASALEQAANGLTPESLT--PNADGNEVQK 158
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVY 185
AS G FSY++F A+GLFRLLEL+ A EP+ LEKL + V +V+RDL +Y
Sbjct: 159 ALASISSASAAGAFSYNKFVAIGLFRLLELSGAKEPSALEKLVKAVGVKPEAVNRDLLMY 218
Query: 186 RNLLSKLLQAKELLKEYVDREKKKREER 213
+ +LSKL AKEL++E+V+REK+K+ ER
Sbjct: 219 KGVLSKLAAAKELMREFVEREKRKQAER 246
>gi|326492686|dbj|BAJ90199.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 239
Score = 205 bits (521), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 91/112 (81%), Positives = 103/112 (91%)
Query: 4 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
D+PPTVA+TKMNFLK YKRPIPSIY+TVLQEL+VQQHLMRYK TYQYDPVFALGFVTVYD
Sbjct: 55 DIPPTVADTKMNFLKSYKRPIPSIYSTVLQELLVQQHLMRYKSTYQYDPVFALGFVTVYD 114
Query: 64 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF 115
+LMEGYPS EDR+AIF++Y+TAL EDPEQYR DAQ++EEWAR Q + LVEF
Sbjct: 115 QLMEGYPSNEDRDAIFKSYVTALNEDPEQYRADAQRMEEWARSQNGNLLVEF 166
>gi|356555139|ref|XP_003545894.1| PREDICTED: LOW QUALITY PROTEIN: protein THYLAKOID FORMATION1,
chloroplastic-like [Glycine max]
Length = 152
Score = 198 bits (504), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 99/134 (73%), Positives = 112/134 (83%)
Query: 73 EDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAER 132
E R+AIFQAYI AL EDP++YRIDA+KLEEWA Q +SLVEF SKEGE E LKDIA R
Sbjct: 19 EGRDAIFQAYIKALVEDPDKYRIDARKLEEWAGVQNPTSLVEFSSKEGEAEKXLKDIAXR 78
Query: 133 ASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKL 192
A GK FSYSRFFAVGLFRL+EL NATEP +L+KLCA LN+NKRSVD DLDVY LLS+L
Sbjct: 79 AGGKXEFSYSRFFAVGLFRLVELENATEPIILDKLCAALNINKRSVDWDLDVYCILLSEL 138
Query: 193 LQAKELLKEYVDRE 206
LQ KELLKEY+D++
Sbjct: 139 LQVKELLKEYIDKD 152
>gi|307108772|gb|EFN57011.1| hypothetical protein CHLNCDRAFT_143677 [Chlorella variabilis]
Length = 273
Score = 183 bits (464), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 92/203 (45%), Positives = 135/203 (66%), Gaps = 7/203 (3%)
Query: 5 VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 64
PPTVA+ K+ F +K+P+P+IY+TV+QEL+VQQHL R+ + YQY+ V ALG V+++++
Sbjct: 49 APPTVADAKLKFNGAFKKPLPAIYSTVVQELLVQQHLFRWNKQYQYNEVTALGIVSIFEQ 108
Query: 65 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 124
++ G P E REA+F A+I AL+EDP+QYR DA +EE ARG++ + P G+
Sbjct: 109 VLGGLPDAE-REAVFDAFINALQEDPKQYRKDAAAMEELARGKSEVA----PDASGDKVQ 163
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDV 184
GK F Y++FFAVGLFRL+EL + +P L L L +++ V+ DL
Sbjct: 164 QALAAVAAKEGK--FLYTKFFAVGLFRLVELTGSKDPKSLTTLVKALGLSQERVNADLMT 221
Query: 185 YRNLLSKLLQAKELLKEYVDREK 207
Y+ +LSKL AKE++KE++ REK
Sbjct: 222 YKGVLSKLEAAKEIMKEFMAREK 244
>gi|302852549|ref|XP_002957794.1| hypothetical protein VOLCADRAFT_107813 [Volvox carteri f.
nagariensis]
gi|300256865|gb|EFJ41122.1| hypothetical protein VOLCADRAFT_107813 [Volvox carteri f.
nagariensis]
Length = 373
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 99/243 (40%), Positives = 147/243 (60%), Gaps = 43/243 (17%)
Query: 6 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
PPTVAETK F + Y +PI SIY+TVLQEL+VQQH MRY + Y Y+ +FALGFV+VY+++
Sbjct: 43 PPTVAETKAKFFEGYSKPIASIYSTVLQELLVQQHFMRYSKDYVYNEIFALGFVSVYEQI 102
Query: 66 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG----------------QTA 109
+E P E R+AIF +Y+ AL EDPE Y+ D++++E+ A QT+
Sbjct: 103 LESLPQSE-RDAIFVSYVKALGEDPEAYKRDSERVEKAAGALSGPDALVPDAEGSDVQTS 161
Query: 110 SSLVEFPSKEGEVEGLLKDIAERASGKGN-----------------------FSYSRFFA 146
+ + + + GE+ + R G+G+ FSY++F A
Sbjct: 162 AYIWAYHQRRGEMRMPWRT---RTWGQGSSSLGVCSYGKALDAIKAASAADAFSYNKFVA 218
Query: 147 VGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDRE 206
+GLFRLLEL A EP LE+L + + +V+RDL +Y+ +LSKL AKE+++E+V+RE
Sbjct: 219 IGLFRLLELTGAKEPAALERLVKSVGIKPEAVNRDLLMYKGVLSKLAAAKEMMREFVERE 278
Query: 207 KKK 209
K++
Sbjct: 279 KRR 281
>gi|255075137|ref|XP_002501243.1| predicted protein [Micromonas sp. RCC299]
gi|226516507|gb|ACO62501.1| predicted protein [Micromonas sp. RCC299]
Length = 260
Score = 177 bits (450), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 97/220 (44%), Positives = 142/220 (64%), Gaps = 17/220 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T+A+TK F++ Y PIPSI++ + EL+ QH +RY Y Y + +LGFV+VYD+L E
Sbjct: 51 TLADTKRKFVESYPYPIPSIWSVAVNELLANQHFVRYSTRYSYSKLSSLGFVSVYDQLFE 110
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
G+PS+E++ IF ++ AL EDPE+ R DA +L ++A+ + G V+ LL
Sbjct: 111 GFPSDEEKAKIFDCFVEALGEDPEKCRKDAAELAKFAK------------EAGGVDALLA 158
Query: 128 D--IAE-RASGKGN-FSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
+AE +++G+ N F+YSR+ A+GLFR+LEL ATEP LEKL + + V+ DL
Sbjct: 159 SPVLAEIKSNGEANKFAYSRYDAIGLFRMLELGGATEPAALEKLADAAGLKLKKVNGDLG 218
Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREER-TEPQKANEA 222
+Y+ LLSKL AKEL KE +REK+K ER + + AN+A
Sbjct: 219 MYKGLLSKLAAAKELQKEIFEREKRKTAERLAKKEAANDA 258
>gi|428213026|ref|YP_007086170.1| photosystem II biogenesis protein Psp29 [Oscillatoria acuminata PCC
6304]
gi|428001407|gb|AFY82250.1| photosystem II biogenesis protein Psp29 [Oscillatoria acuminata PCC
6304]
Length = 235
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 90/225 (40%), Positives = 136/225 (60%), Gaps = 17/225 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F ++ RPI SIY V++EL+V+ HL+ + YDP++ALG VT +DR M+
Sbjct: 6 TVSDTKRAFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDFNYDPIYALGVVTTFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF------PSKEGE 121
GY EED+ +IF L+ DP++YR DAQ LEE A + +V P EG+
Sbjct: 66 GYRPEEDKISIFNGICKGLEADPQKYRQDAQWLEEIASRHSGEEMVALLSRSAGPEMEGD 125
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVN 174
+G+L IA K NF YSR FAVGLF LLE A+ + ++K+C LN+
Sbjct: 126 FQGILGAIA----AKPNFKYSRLFAVGLFTLLEQADLELVKNEKSRQEAVQKICTALNLP 181
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
+ +DLD+YR L K++QA+ ++++ + ++KKRE+R + + A
Sbjct: 182 VDKLSKDLDLYRTNLEKMIQARSVMEDILAADRKKREDRAKQKGA 226
>gi|427736065|ref|YP_007055609.1| photosystem II biogenesis protein Psp29 [Rivularia sp. PCC 7116]
gi|427371106|gb|AFY55062.1| photosystem II biogenesis protein Psp29 [Rivularia sp. PCC 7116]
Length = 233
Score = 169 bits (429), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 88/223 (39%), Positives = 145/223 (65%), Gaps = 22/223 (9%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+ETK F L+ RPI +IY V++EL+V+ HL+ ++YDP++ALG VT +DR M+
Sbjct: 6 TVSETKRTFYSLHTRPINTIYRRVVEELMVEMHLLGVNADFKYDPIYALGVVTTFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK-----EGEV 122
GY EED+E+I+ A I +++EDP++YR DA++LE+ A+ T LV S+ + E+
Sbjct: 66 GYNPEEDKESIYNALIKSVEEDPQKYRHDAKRLEDLAKSTTGKDLVSDLSQRRLANDSEL 125
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTV----------LEKLCAVLN 172
+GLL+ IA +S F YSR FA+GL+ LLE +++P + L+ + A LN
Sbjct: 126 QGLLEGIANNSS----FKYSRLFAIGLYTLLE---SSDPEMVKDEKLRNEALKTIAAGLN 178
Query: 173 VNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
+++ + +DLD+YR+ L K+ QA ++ + + ++K+RE+R +
Sbjct: 179 LSEDKLSKDLDLYRSNLDKMAQAAIVMADMIAADRKRREQRAQ 221
>gi|145344894|ref|XP_001416959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577185|gb|ABO95252.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 203
Score = 165 bits (418), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 87/210 (41%), Positives = 131/210 (62%), Gaps = 16/210 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK FL+ Y PIPS+++TV QEL+VQ H +Y +Y + +LGFV+V+D+L E
Sbjct: 1 TVSDTKAKFLQAYPYPIPSVWSTVTQELLVQGHFAKYNAKSEYSELASLGFVSVFDQLYE 60
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
G+PSE ++ IF A++ AL ED + R DA+ +L F + G V+GL
Sbjct: 61 GFPSETEKVKIFNAFLGALGEDAAKTRADAE------------ALGAFAASAGGVDGLSA 108
Query: 128 D--IAERA--SGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
+ A A S + Y+++ A+G+FR+LELA AT+P LE L ++ + V+ DL
Sbjct: 109 NPIFATMAAKSAENKLMYTKYIAIGIFRMLELAKATDPKALEALAQAGGLSFKKVNGDLA 168
Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREER 213
+Y+ LLSKL AKEL +E+++REK+K ER
Sbjct: 169 MYKGLLSKLASAKELQEEFLEREKRKTAER 198
>gi|428223566|ref|YP_007107663.1| photosystem II biogenesis protein Psp29 [Geitlerinema sp. PCC 7407]
gi|427983467|gb|AFY64611.1| photosystem II biogenesis protein Psp29 [Geitlerinema sp. PCC 7407]
Length = 239
Score = 160 bits (406), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 87/224 (38%), Positives = 140/224 (62%), Gaps = 11/224 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F ++ RPI SIY V++EL+V+ HL+ ++YDP +ALG VT Y+R M+
Sbjct: 6 TVSDTKRAFYSMHTRPINSIYRRVVEELMVEMHLLSVNVDFRYDPFYALGVVTSYERFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE---GEVEG 124
GY E+D+ +IF++ A + DP YR DA++L E+ + +A L+ + S E G+ +G
Sbjct: 66 GYRPEQDKTSIFESLCRANEGDPGHYRHDAERLAEFTKNLSAEELISWLSLETPRGDDQG 125
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRS 177
L + + + + F YSR FA+GLF L+E AN A EK+ A L++
Sbjct: 126 LGESL-QAIANHSQFKYSRLFAIGLFTLVEQANPDLVKDEAQRTATFEKVVAALHLPADK 184
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANE 221
+ +DL++YR+ L KL QA+ ++++ + ++KKREER + QKA+E
Sbjct: 185 LQKDLELYRSNLEKLTQARIVMEDILKADRKKREEREQAQKASE 228
>gi|303286071|ref|XP_003062325.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455842|gb|EEH53144.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 222
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 80/171 (46%), Positives = 109/171 (63%), Gaps = 8/171 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TVA+TK FLK Y PIPSI++ LQEL+V QH +RY + Y Y + +LGFV+VYD+L E
Sbjct: 45 TVADTKQKFLKSYPYPIPSIWSVALQELLVTQHFVRYSKKYSYSKLSSLGFVSVYDQLFE 104
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
G+PSEE++ IF+ ++ AL+EDP R DA +L +A G + V +++ L+
Sbjct: 105 GFPSEEEKNTIFECFVKALEEDPATVRKDAAELASFAEGASGVDGVLASPIFAQMKSLVA 164
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSV 178
D G F+YSR+ A+GLFRLLELA ATEP LEKL + RS+
Sbjct: 165 D--------GKFAYSRYDAIGLFRLLELAKATEPAALEKLAESSGLQARSI 207
>gi|158338004|ref|YP_001519180.1| Thf1-like protein [Acaryochloris marina MBIC11017]
gi|189030267|sp|B0C3M8.1|THF1_ACAM1 RecName: Full=Protein thf1
gi|158308245|gb|ABW29862.1| photosystem II biogenesis protein Psb29 [Acaryochloris marina
MBIC11017]
Length = 247
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 80/216 (37%), Positives = 135/216 (62%), Gaps = 16/216 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F ++ RP+ S+Y V++EL+V+ HL+R ++YDP+FALG T +DR M+
Sbjct: 6 TVSDTKRAFYSIHTRPVNSVYRRVVEELMVEMHLLRVNEDFRYDPIFALGVTTSFDRFMD 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG-----EV 122
GY E D++AIF A A + DP Q + D Q+L E A+ ++A ++++ ++ E+
Sbjct: 66 GYQPENDKDAIFSAICKAQEADPVQMKKDGQRLTELAQSKSAQEMLDWITQAANSGGDEL 125
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA--NATE-----PTVLEKLCAVLNVNK 175
+ L++IA+ F YSR FA+GLF LLEL+ N T+ L +C VLN+++
Sbjct: 126 QWQLRNIAQNPK----FKYSRLFAIGLFTLLELSEGNITQDEESLAEFLPNICTVLNISE 181
Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 211
+ +DL++YR L K+ Q ++ + + ++ +KK+RE
Sbjct: 182 SKLQKDLEIYRGNLDKIAQVRQAMDDILEAQKKRRE 217
>gi|334116992|ref|ZP_08491084.1| Protein thf1 [Microcoleus vaginatus FGP-2]
gi|333461812|gb|EGK90417.1| Protein thf1 [Microcoleus vaginatus FGP-2]
Length = 237
Score = 157 bits (398), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 85/215 (39%), Positives = 130/215 (60%), Gaps = 9/215 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK +F ++ RPI SIY V++EL+V+ HL+ +QYDP++ALG VT +DR M
Sbjct: 6 TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSANADFQYDPIYALGVVTAFDRFML 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E DR +IF A +L++DP++Y+ DAQ+LE A + L+ + + E
Sbjct: 66 GYAPEADRVSIFNALCKSLEDDPDRYKQDAQRLESLADRLSGQELLSWLDRSTSFEDTAD 125
Query: 128 DIAERASGKGN--FSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 178
A + N F YSR FA+GLF LLE A+ T + K+ A L++ + V
Sbjct: 126 LQASLGAIASNPQFKYSRLFAIGLFSLLEKADPNLVKDQETRNDAIAKVSAALHLPEDKV 185
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
+DLD+YR+ L K+ QA+ +L++ + E+KKRE+R
Sbjct: 186 SKDLDLYRSNLEKMAQARIVLQDVIQAERKKREKR 220
>gi|411116557|ref|ZP_11389044.1| photosystem II biogenesis protein Psp29 [Oscillatoriales
cyanobacterium JSC-12]
gi|410712660|gb|EKQ70161.1| photosystem II biogenesis protein Psp29 [Oscillatoriales
cyanobacterium JSC-12]
Length = 246
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 84/216 (38%), Positives = 131/216 (60%), Gaps = 11/216 (5%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F ++ RPI SIY V++EL+V+ HL+ Y Y+P++ALG VT ++R M+
Sbjct: 6 TVSDTKRAFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDYSYNPIYALGVVTSFERFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E D+ IF A AL++DP +YR DAQ+L ++A+ ++A +V + + G
Sbjct: 66 GYRPENDKAPIFDAICQALQDDPNRYRHDAQRLNDFAKQKSAKDIVTWLEQAATSYG-GD 124
Query: 128 DIAERASGKGN---FSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRS 177
D+ E+ N F YSR FA+GLF L E A+A +L++ CA L ++
Sbjct: 125 DLQEQVKAIANNPKFKYSRLFAIGLFTLFETADAEVVKKEGEREELLKQACAALRLSHDK 184
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
V RDL++YR+ L K+ QA+ ++ + + EKKKRE +
Sbjct: 185 VQRDLELYRSNLEKVAQAQAVMADMLAAEKKKREHK 220
>gi|119488459|ref|ZP_01621632.1| hypothetical protein L8106_23815 [Lyngbya sp. PCC 8106]
gi|119455270|gb|EAW36410.1| hypothetical protein L8106_23815 [Lyngbya sp. PCC 8106]
Length = 241
Score = 155 bits (393), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 82/216 (37%), Positives = 135/216 (62%), Gaps = 11/216 (5%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F + RPI S+Y V++EL+V+ HL+ +QYDP++ALG V+ +DR M+
Sbjct: 6 TVSDTKRAFYNTHTRPINSVYRRVIEELMVEMHLLSVNVDFQYDPIYALGVVSAFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 125
GY E D+E+IF I AL++DP++YR +AQ+L+E+A+ + +V + + EV
Sbjct: 66 GYLPESDKESIFHGLINALQDDPQRYRAEAQRLQEFAQTLSVQDIVSWVDVAANSEVHND 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN--------ATEPTVLEKLCAVLNVNKRS 177
L+ ++ + + YSR A+GLF L+E A+ AT+ T L +L + LN+
Sbjct: 126 LQSSFQKIATNPKYKYSRILAIGLFTLIEQADPQAMEDKEATQQT-LAQLASGLNLPLDK 184
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
+ +DL++YR+ L KL QA+ ++ E E+K+RE+R
Sbjct: 185 LQKDLELYRSNLEKLKQARIVMDEMTQAERKRREQR 220
>gi|428317172|ref|YP_007115054.1| Protein thf1 [Oscillatoria nigro-viridis PCC 7112]
gi|428240852|gb|AFZ06638.1| Protein thf1 [Oscillatoria nigro-viridis PCC 7112]
Length = 237
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 84/215 (39%), Positives = 130/215 (60%), Gaps = 9/215 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK +F ++ RPI SIY V++EL+V+ HL+ +QYDP++ALG VT +DR M
Sbjct: 6 TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSANADFQYDPIYALGVVTAFDRFML 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E DR +IF A ++++DP++Y+ DAQ+LE A + L+ + + E
Sbjct: 66 GYVPEADRVSIFNALCKSVEDDPDRYKQDAQRLESLADRLSGQELLSWLDRSTSFEDTAD 125
Query: 128 DIAERASGKGN--FSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 178
A + N F YSR FA+GLF LLE A+ T + K+ A L++ + V
Sbjct: 126 LQASLGAIASNPQFKYSRLFAIGLFSLLEKADPNLVKDQETRNDAIAKVSAGLHLPEDKV 185
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
+DLD+YR+ L K+ QA+ +L++ + E+KKRE+R
Sbjct: 186 SKDLDLYRSNLEKMAQARIVLQDVIQAERKKREKR 220
>gi|354568723|ref|ZP_08987886.1| Protein thf1 [Fischerella sp. JSC-11]
gi|353539977|gb|EHC09457.1| Protein thf1 [Fischerella sp. JSC-11]
Length = 235
Score = 153 bits (386), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 81/221 (36%), Positives = 135/221 (61%), Gaps = 17/221 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P+FALG VT +DR M+
Sbjct: 6 TVSDTKRTFHTLHTRPINTIYRRVVEELMVEMHLLAVNVDFSYNPIFALGVVTSFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
GY E D+E+IF A + A++ DP+ YR DAQ+L+E A+ L+ S ++ +
Sbjct: 66 GYQPESDKESIFNALLRAIEADPQIYRQDAQRLQELAKSLPPQDLIAALSLQTQLNRDTD 125
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 174
++ L+ IA F YSR FA+GLF LLEL++ L+ + A L+++
Sbjct: 126 LQSHLQAIA----SNPKFKYSRLFAIGLFSLLELSDPELVKDEKQRTEALKSIAAGLHIS 181
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
+++DL++YR+ L K+ QA ++ + + ++KKRE+R++
Sbjct: 182 DDKLNKDLELYRSNLDKMAQALVVMADMLSADRKKREQRSQ 222
>gi|113474941|ref|YP_721002.1| Thf1-like protein [Trichodesmium erythraeum IMS101]
gi|123056927|sp|Q116P5.1|THF1_TRIEI RecName: Full=Protein thf1
gi|110165989|gb|ABG50529.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 239
Score = 152 bits (385), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 82/216 (37%), Positives = 129/216 (59%), Gaps = 9/216 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F + RPI SIYN V++EL+V+ HL+ Y Y+P +ALG VT +DR M+
Sbjct: 6 TVSDTKKTFYHFHTRPINSIYNRVIEELLVEMHLISVNVDYSYNPFYALGVVTAFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 125
GY +ED+ +IF A I +EDP +YR DA+ LE+ A +AS ++ + SK +
Sbjct: 66 GYSPQEDKTSIFNALIQGQEEDPNKYRSDAKGLEDLAGKISASDILSWICLSKNIDNTQY 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 178
L+D S F YSR FA+GLF LLE+ + L+K+C LN+ + +
Sbjct: 126 LQDDLRAISENSKFRYSRLFAIGLFTLLEIVDTELIKEQEKRTEALKKICQSLNLVEEKL 185
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 214
+D+D+Y + L ++ QA+ +++ + +KKRE+R+
Sbjct: 186 LKDIDLYLSNLERVAQARSAMEDTLAAMRKKREKRS 221
>gi|424513129|emb|CCO66713.1| Thf1-like protein [Bathycoccus prasinos]
Length = 222
Score = 152 bits (385), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 83/220 (37%), Positives = 131/220 (59%), Gaps = 12/220 (5%)
Query: 6 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
P TVA+TK F K Y P+PSI+ TVLQEL+V H YQ++ + +LGFV+V+D+L
Sbjct: 4 PATVADTKAKFTKGYPYPLPSIWATVLQELLVGMHFTVTSSKYQHEEMRSLGFVSVFDQL 63
Query: 66 MEGYPSEE--DREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA-SSLVEFPSKEGEV 122
EGYP+E+ +E IF ++ AL ED +++R DA+KL +A QT+ ++ P
Sbjct: 64 FEGYPTEDPNAKEKIFSTFMEALGEDSKKWRADAEKLSAFATEQTSIDGIIANP------ 117
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
+ + + K + Y +F A+G FR LE++ T P L+K+ V ++ DL
Sbjct: 118 --MFASMKSKVESK-SLVYDKFIAIGFFRALEMSKQTSPENLKKISEASGVTLEKINGDL 174
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
+Y+++LS++ AKEL E ++RE++K ER E + A +A
Sbjct: 175 GLYKSVLSRMNAAKELQAEVLERERRKTAERMEKKAAKDA 214
>gi|434405136|ref|YP_007148021.1| photosystem II biogenesis protein Psp29 [Cylindrospermum stagnale
PCC 7417]
gi|428259391|gb|AFZ25341.1| photosystem II biogenesis protein Psp29 [Cylindrospermum stagnale
PCC 7417]
Length = 235
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 83/234 (35%), Positives = 139/234 (59%), Gaps = 24/234 (10%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYTLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNPIYALGVVTTFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
GY E D+E+IF A I A++++P++YR DA++L+ A+G L+ + S ++
Sbjct: 66 GYQPERDQESIFNAIIQAVEQEPQRYRQDAERLQAVAQGLPEQDLIAWLSQTTHSDRDAN 125
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA-------NATEPTVLEKLCAVLNVN 174
++ L+ IA NF YSR FA+GLF LLE++ + L+ + L+++
Sbjct: 126 LQAQLQAIA----NNSNFKYSRLFAIGLFSLLEVSSPELVKDDKQRNEALKAIATGLHLS 181
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE-------PQKANE 221
+ +DL++YR+ L K+ QA ++ + V ++KKRE+R + P ANE
Sbjct: 182 DDKLSKDLELYRSNLDKMAQALIVMADMVSADRKKREQRKQQASTPVAPPSANE 235
>gi|428775508|ref|YP_007167295.1| photosystem II biogenesis protein Psp29 [Halothece sp. PCC 7418]
gi|428689787|gb|AFZ43081.1| photosystem II biogenesis protein Psp29 [Halothece sp. PCC 7418]
Length = 243
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 82/214 (38%), Positives = 127/214 (59%), Gaps = 9/214 (4%)
Query: 4 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
D T++ETK F L+ RP+ SIY V++EL+V+ HL+ ++YDP +ALG VTV+D
Sbjct: 2 DTLRTLSETKRTFYTLHTRPLNSIYRRVIEELLVEMHLLTVNIDFKYDPFYALGVVTVFD 61
Query: 64 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGE 121
M+GY E+D+E+IF A A++ DP+QYR DA+K++ A + ++ + +K +
Sbjct: 62 TFMQGYQPEKDKESIFNAICKAVESDPQQYRQDAEKVKSIADQASGEAVTAWLCEAKPLD 121
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVN 174
G L DI + F YSR F +G++ +LE AN VL C LN+
Sbjct: 122 QAGDLNDILQGIRENPRFKYSRLFIIGIYTVLEKANPEIVNDDKKREEVLNNCCQALNLP 181
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKK 208
K VD+DLD+YR+ L K+ QA+ +L++ V ++K
Sbjct: 182 KEKVDKDLDLYRSNLEKMEQARSVLEDVVRADRK 215
>gi|220910509|ref|YP_002485820.1| Thf1-like protein [Cyanothece sp. PCC 7425]
gi|254784141|sp|B8HQ62.1|THF1_CYAP4 RecName: Full=Protein thf1
gi|219867120|gb|ACL47459.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7425]
Length = 236
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 89/228 (39%), Positives = 137/228 (60%), Gaps = 14/228 (6%)
Query: 6 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
P TV++TK F + RPI SIY V++EL+V+ HL+R +T+ YDPVFALG VT ++R
Sbjct: 4 PRTVSDTKRAFYHNHARPINSIYRRVVEELLVEIHLLRVNQTFVYDPVFALGVVTTFERF 63
Query: 66 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 125
M+GY D+ +IF A A + DP+Q + DAQ+L RGQ+ SL+++ S + G
Sbjct: 64 MQGYHPPADQTSIFNAICLAQELDPQQVQQDAQELLGRVRGQSLESLLDWISTAASLGGD 123
Query: 126 LKDIAERA-SGKGNFSYSRFFAVGLFRLLELANATEP----------TVLEKLCAVLNVN 174
+ RA + F YSR FAVGLF LLE A EP VL+++ V+++
Sbjct: 124 EQQNRLRAIASNPTFKYSRLFAVGLFTLLEQA---EPELGKDEARLLQVLQQVGEVMHLP 180
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
+ +DL+ YR+ L K+ QA++ L++ V E+K+R++ P ++ E+
Sbjct: 181 VEKMQKDLEQYRSNLEKMTQARKTLEDIVAAERKRRQQNAAPDRSPES 228
>gi|17228142|ref|NP_484690.1| Thf1-like protein [Nostoc sp. PCC 7120]
gi|81772969|sp|Q8YZ41.1|THF1_ANASP RecName: Full=Protein thf1
gi|17129992|dbj|BAB72604.1| all0646 [Nostoc sp. PCC 7120]
Length = 233
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 80/221 (36%), Positives = 134/221 (60%), Gaps = 17/221 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR ME
Sbjct: 6 TVSDTKRTFYALHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFME 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
GY E D+E+IF A A++++P++YR DA++L+ A+ + LV + S ++ +
Sbjct: 66 GYQPERDKESIFSAICQAVEQEPQRYRQDAERLQAVAQSLPVNDLVAWLSQANHLQQDAD 125
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 174
++ L+ IA NF YSR FA+GLF LLE +N L+ + A L+++
Sbjct: 126 LQAQLQAIA----NNSNFKYSRLFAIGLFTLLEQSNPDLVKDEKQRTEALKSIAAGLHLS 181
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
+DL++YR+ L K+ QA ++ + + ++KKRE+R +
Sbjct: 182 DDKFSKDLELYRSNLDKMTQALAVMADMLTADRKKREQRQQ 222
>gi|443311308|ref|ZP_21040938.1| photosystem II biogenesis protein Psp29 [Synechocystis sp. PCC
7509]
gi|442778631|gb|ELR88894.1| photosystem II biogenesis protein Psp29 [Synechocystis sp. PCC
7509]
Length = 241
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 77/228 (33%), Positives = 141/228 (61%), Gaps = 9/228 (3%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F + RPI +IY V++EL+V+ HL+ + Y+P++ALG VT Y+R M+
Sbjct: 6 TVSDTKRAFYSTHTRPINTIYRRVVEELMVEMHLLSVNADFSYNPIYALGVVTSYERFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 125
GY E D+++IFQA A+ DP QYR DA++L +A+ ++ L+++ S E ++G
Sbjct: 66 GYQPERDKDSIFQALCQAINTDPHQYRQDAERLGSFAKSLSSQDLMQWLSSEKPIDGYSD 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRSV 178
L++ ++ + F YSR FA+G+F LLEL++ +++ + L++ + +
Sbjct: 126 LQEQIKQIATNQKFKYSRLFAIGVFSLLELSDPELVKDETKRVEAFKQISSSLHLPEDKL 185
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKC 226
++DL++YR + K+ QA ++++ + E+KKR+++ + Q+A A K
Sbjct: 186 NKDLELYRANVEKMNQALIVMEDMLAAERKKRQKKADEQQAALAAKSS 233
>gi|427727466|ref|YP_007073703.1| photosystem II biogenesis protein Psp29 [Nostoc sp. PCC 7524]
gi|427363385|gb|AFY46106.1| photosystem II biogenesis protein Psp29 [Nostoc sp. PCC 7524]
Length = 235
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 83/234 (35%), Positives = 137/234 (58%), Gaps = 24/234 (10%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYSLHTRPINTIYRRVVEELMVEMHLLSVNIDFTYNPIYALGVVTTFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
GY E D+E+IF A A++++P++YR DA++L+ A+ S LV + S ++ +
Sbjct: 66 GYRPERDKESIFHAICQAVEQEPQRYRQDAERLQNLAKSLPISDLVAWLSQTTHFNQDPD 125
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVN 174
++ L+ IA NF YSR FA+GLF LLE ++ L+ + L++
Sbjct: 126 LQAQLQAIA----NNPNFKYSRLFAIGLFSLLEYSDPDLVKDEKQRTEALKNIANGLHLA 181
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE-------PQKANE 221
+ +DLD+YR+ L K+ QA ++ + + ++KKRE+R + P ANE
Sbjct: 182 DDKLSKDLDLYRSNLDKMTQALTVIADMISADRKKREQRQQQSSSVVAPPTANE 235
>gi|300866330|ref|ZP_07111033.1| Protein thf1 [Oscillatoria sp. PCC 6506]
gi|300335673|emb|CBN56193.1| Protein thf1 [Oscillatoria sp. PCC 6506]
Length = 267
Score = 149 bits (377), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 81/217 (37%), Positives = 133/217 (61%), Gaps = 9/217 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK +F ++ RPI SIY V++EL+V+ HL+ ++Y+P++ALG VT ++R M+
Sbjct: 36 TVSDTKRSFYTIHTRPINSIYRRVVEELMVEMHLLSVNVDFRYNPIYALGVVTAFERFMQ 95
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 125
GY E+D+ +IF AL +DP++Y+ DA++LE A + L+ + S E G
Sbjct: 96 GYLPEQDKVSIFNGLCQALGDDPQRYQQDARRLEGLASRVSILDLLSWLEGSTSFEDTGD 155
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN---ATEP----TVLEKLCAVLNVNKRSV 178
L+ + F YSR FA+GLF LLE+ + +P + K+CA L++ + V
Sbjct: 156 LQASITAIATNSKFKYSRLFAIGLFALLEIVDPDLVKDPEARVQAIAKVCAALHLPEEKV 215
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
+DLD+YR+ L K+ QA+ +L + + ++KKRE+R E
Sbjct: 216 TKDLDLYRSNLEKIAQARIVLADVLQADRKKREKRAE 252
>gi|186685250|ref|YP_001868446.1| Thf1-like protein [Nostoc punctiforme PCC 73102]
gi|254784144|sp|B2J353.1|THF1_NOSP7 RecName: Full=Protein thf1
gi|186467702|gb|ACC83503.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 235
Score = 149 bits (376), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 80/227 (35%), Positives = 137/227 (60%), Gaps = 21/227 (9%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNPIYALGVVTTFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV------EFPSKEGE 121
GY E D+E+IF A A+++DP+ YR DA++L+ A+G L+ + ++ +
Sbjct: 66 GYEPERDQESIFNALCRAIEQDPQHYRQDAERLQAIAKGLPVKDLIGWLGQTTYLDRDAD 125
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA---------TEPTVLEKLCAVLN 172
++ L+ IA NF Y+R FA+G+F LLE ++ TE L+ + A L+
Sbjct: 126 LQAQLQAIA----NNPNFKYNRLFAIGVFSLLEQSDPELVKDEKQLTE--ALKAIAAGLH 179
Query: 173 VNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
V+ +++DL++YR+ L K+ QA ++ + + ++KKRE+R + A
Sbjct: 180 VSDDKLNKDLELYRSNLDKMAQALVVMADMLSADRKKREQRKQQSTA 226
>gi|75910773|ref|YP_325069.1| Thf1-like protein [Anabaena variabilis ATCC 29413]
gi|97202708|sp|Q3M4B2.1|THF1_ANAVT RecName: Full=Protein thf1
gi|75704498|gb|ABA24174.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 233
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 79/221 (35%), Positives = 135/221 (61%), Gaps = 17/221 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYALHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
GY E D+E+IF A A++++P++YR DA++L+ A+ + LV + S ++ +
Sbjct: 66 GYQPERDKESIFSAICQAVEQEPQRYRQDAERLKAVAQSLPVNDLVAWLSQANHLQQDAD 125
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 174
++ L+ IA NF YSR FA+GLF LLE +N L+ + A L+++
Sbjct: 126 LQAQLQAIA----SNPNFKYSRLFAIGLFTLLEQSNPDLVKDEKQRTEALKTIAAGLHLS 181
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
+ +DL++YR+ L K+ QA ++ + + ++KKRE+R +
Sbjct: 182 DDKLSKDLELYRSNLDKMTQALAVMADMLTADRKKREQRQQ 222
>gi|440683252|ref|YP_007158047.1| Protein thf1 [Anabaena cylindrica PCC 7122]
gi|428680371|gb|AFZ59137.1| Protein thf1 [Anabaena cylindrica PCC 7122]
Length = 235
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 80/221 (36%), Positives = 133/221 (60%), Gaps = 17/221 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F L+ RPI +IY V++EL+V+ HL+ Y Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNVDYSYNPIYALGVVTTFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
GY E D+E+IF A A+++D ++YR DA +L+ A+ L+ + S K+ +
Sbjct: 66 GYLPERDQESIFNALCQAVEQDQQRYRQDATRLQAIAQSLPVQDLIAWVSQTTHLDKDAD 125
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 174
++ L+ IA NF YSR FA+GLF LLELA+ L+ + L+++
Sbjct: 126 LQAQLQAIAH----NPNFKYSRLFAIGLFSLLELADPELVKDEKQRNEALKAIAQGLHLS 181
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
+ + +DLD+YR+ L K+ QA ++ + + ++KKR++R +
Sbjct: 182 EDKLSKDLDLYRSNLDKMAQALIVMADILSADRKKRDQRQQ 222
>gi|428778484|ref|YP_007170270.1| photosystem II biogenesis protein Psp29 [Dactylococcopsis salina
PCC 8305]
gi|428692763|gb|AFZ48913.1| photosystem II biogenesis protein Psp29 [Dactylococcopsis salina
PCC 8305]
Length = 240
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/209 (36%), Positives = 123/209 (58%), Gaps = 9/209 (4%)
Query: 4 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
D T++ETK F + RP+ SIY V++EL+V+ HL+ ++YDP++ALG TV+D
Sbjct: 2 DTLRTLSETKRTFYTQHTRPLNSIYRRVIEELLVEMHLLSVNTDFKYDPIYALGVTTVFD 61
Query: 64 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
M+GY E+++E+IF A A++ DP++YR DA+KL+ A + + S+ ++
Sbjct: 62 TFMQGYQPEKEKESIFNAICQAVENDPQKYRQDAEKLKSIAANHSGEEVTACLSELKPLD 121
Query: 124 GL--LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLEKLCAVLNVN 174
G L + + F YSR F +GL+ +LE AN T VL+K C L +
Sbjct: 122 GAEELTKVLQEIKNNSRFKYSRLFIIGLYTILETANPDLVTDDKKREEVLQKCCQGLGLP 181
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYV 203
K VD+DLD+YR+ L K+ QA+ +L++ +
Sbjct: 182 KEKVDKDLDLYRSNLEKMEQARSVLEDAI 210
>gi|376001810|ref|ZP_09779664.1| Putative thylakoid formation protein, Thf1-like [Arthrospira sp.
PCC 8005]
gi|375329721|emb|CCE15417.1| Putative thylakoid formation protein, Thf1-like [Arthrospira sp.
PCC 8005]
Length = 243
Score = 145 bits (367), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 80/215 (37%), Positives = 123/215 (57%), Gaps = 9/215 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F ++ RPI SIY V++EL+V+ HL+ ++YDP++ALG VT +DR M+
Sbjct: 6 TVSDTKRAFYNIHTRPINSIYRRVVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 125
GY E D+ +I+ A I A + DP QYR DA LE A + L E ++E +
Sbjct: 66 GYIPEADKLSIWAALIMAQESDPNQYRADATALEAQAATLSVKDLTERAKIAQESSGDDP 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEP-------TVLEKLCAVLNVNKRSV 178
L+ + F YSR FA+GL+ LLE ++ T ++ L + K +
Sbjct: 126 LQSCFHAIANNPKFKYSRLFAIGLYTLLEKSDVTAAQDSEGLKNIIIDFSEALRLPKDKL 185
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
++DLD+YR L K+ QA+ +++E E+KKRE+R
Sbjct: 186 EKDLDLYRTNLEKVAQARLMVEEMTQAERKKREQR 220
>gi|428210102|ref|YP_007094455.1| photosystem II biogenesis protein Psp29 [Chroococcidiopsis
thermalis PCC 7203]
gi|428012023|gb|AFY90586.1| photosystem II biogenesis protein Psp29 [Chroococcidiopsis
thermalis PCC 7203]
Length = 250
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/224 (33%), Positives = 138/224 (61%), Gaps = 9/224 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK NF + RPI +IY V++EL+V+ HL+ ++YDP++ALG VT ++R M+
Sbjct: 6 TVSDTKRNFYNQHTRPINTIYRRVVEELMVEMHLLSVNADFRYDPIYALGVVTAFERFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 125
GY E D+E IF+A +++++P++YR DA +L + + +A L ++ + ++G
Sbjct: 66 GYQPERDKEPIFEALCQSIEDNPQRYRQDADRLRQLLQNVSAQQLFDWIDGKASLQGAED 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 178
L+ + + F YSR FA+G+F LLELA+A L+++ L+V + +
Sbjct: 126 LQAQMQAIAQNSKFKYSRLFAIGVFTLLELADAELVKDEKQRVEALKQVATALHVPEDKL 185
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
++DL++YR+ L K+ QA + + + +++KR++R + ++A A
Sbjct: 186 NKDLELYRSNLDKIEQALITMADILSADRRKRQQRLQEKEAGVA 229
>gi|427707894|ref|YP_007050271.1| Protein thf1 [Nostoc sp. PCC 7107]
gi|427360399|gb|AFY43121.1| Protein thf1 [Nostoc sp. PCC 7107]
Length = 235
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 138/230 (60%), Gaps = 16/230 (6%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+P++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYSLHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTTFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV--EGL 125
GY E D+E+IFQA A++++ ++YR DA++L+ A+ A+ L+ + S+ + +
Sbjct: 66 GYQPERDKESIFQAICQAVEQEVQRYRQDAERLQALAKSLAANDLIAWLSQTNHLNQDPD 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRSV 178
L+ + + F Y+R FA+GLF LLE ++ ++ + A L++++ +
Sbjct: 126 LQSQLQAIANNSQFKYNRLFAIGLFSLLEQSDPDLVKDEKQRTDAIKTIAAGLHLSEDKL 185
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE-------PQKANE 221
+DL++YR+ L K+ QA ++ + + ++KKRE+R + P ANE
Sbjct: 186 SKDLELYRSNLEKMSQALVVMADMISADRKKREQRQQQSTMPVTPPTANE 235
>gi|434388267|ref|YP_007098878.1| photosystem II biogenesis protein Psp29 [Chamaesiphon minutus PCC
6605]
gi|428019257|gb|AFY95351.1| photosystem II biogenesis protein Psp29 [Chamaesiphon minutus PCC
6605]
Length = 234
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/221 (36%), Positives = 127/221 (57%), Gaps = 13/221 (5%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK NF + RPI SIY V++EL+V+ HL+ + YDP++ALG V+ +DR M
Sbjct: 6 TVSDTKRNFYSQHTRPINSIYRRVVEELMVEMHLLSTNVDFAYDPIYALGVVSSFDRFMT 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF---PSKEGEVEG 124
Y E D+++IF A ++ + +QYR DA +EE+AR S ++++ P+ +G
Sbjct: 66 SYRPEADKQSIFVALCESMGGNAQQYRTDATAVEEFARSMQGSDIIDWIAHPTADGMGAQ 125
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELA--------NATEPTVLEKLCAVLNVNKR 176
L + AS F YSR F +GLF +LE A E VL+ + L++ K
Sbjct: 126 LATTLQSIASNP-KFKYSRLFGIGLFTILEQAAPDLLKDEKKREAAVLQ-IAEALHLPKD 183
Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 217
+DLD YR+ L KL+Q + ++ + + E+KKRE+R + +
Sbjct: 184 KAQKDLDTYRSNLDKLVQMEAVMADLAEAERKKREKRAQAK 224
>gi|428219024|ref|YP_007103489.1| Protein thf1 [Pseudanabaena sp. PCC 7367]
gi|427990806|gb|AFY71061.1| Protein thf1 [Pseudanabaena sp. PCC 7367]
Length = 260
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 83/224 (37%), Positives = 129/224 (57%), Gaps = 10/224 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++ K +F + + RPI S+Y V+ EL+V+ HL+ +T+ YDPVFALG +T YDR M
Sbjct: 6 TVSDAKRDFFQAFPRPINSVYRRVVDELLVEMHLLTVNQTFAYDPVFALGAITAYDRFML 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 125
GY E +R+ I A A+ + EQ R DA L E A ++ + +F + E L
Sbjct: 66 GYEPESERDRILPAICGAVHLNAEQMRHDASSLAELAM-RSPIDVKQFLTSLETTENLEP 124
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELA-------NATEPTVLEKLCAVLNVNKRSV 178
L + F YSR FA+GLF LLE A N +++++ LN+ +
Sbjct: 125 LTGTIRAIAANQKFKYSRLFAIGLFTLLETADPNTMSDNDKRQELIKQVGDALNLGSEKL 184
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
+DLD+YR+ L K+ QA++++K+ V+ E+KK+E+R P K++ A
Sbjct: 185 IKDLDLYRSNLEKVEQARQMMKDLVEAERKKKEQRENPPKSDAA 228
>gi|291567260|dbj|BAI89532.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 243
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 80/215 (37%), Positives = 120/215 (55%), Gaps = 9/215 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F ++ RPI SIY V++EL+V+ HL+ ++YDP++ALG VT +DR M+
Sbjct: 6 TVSDTKRAFYHIHTRPINSIYRRVVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGL 125
GY E D+ +I+ A I A + DP QYR DA LE L + ++E +
Sbjct: 66 GYIPEADKLSIWAALIGAQESDPNQYRADATALEAQVASLAVKDLTDKAKMAQESSGDDP 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEP-------TVLEKLCAVLNVNKRSV 178
L+ + F YSR A+GL+ LLE ++AT T+L L + K +
Sbjct: 126 LQSCFHAIANNPKFKYSRLLAIGLYTLLEKSDATAAQDSEGLKTILSDFSEALRLPKDKL 185
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
+DLD+YR L K+ QA+ ++ E E+KKRE+R
Sbjct: 186 VKDLDLYRTNLEKVAQARLMVDEMTQAERKKREQR 220
>gi|434395245|ref|YP_007130192.1| Protein thf1 [Gloeocapsa sp. PCC 7428]
gi|428267086|gb|AFZ33032.1| Protein thf1 [Gloeocapsa sp. PCC 7428]
Length = 251
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 133/221 (60%), Gaps = 10/221 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F + RPI +IY V++EL+V+ HL+ + Y+P++ALG VT ++R M+
Sbjct: 6 TVSDTKRAFYTSHTRPINTIYRRVVEELMVEMHLLSVNVDFSYNPIYALGVVTAFERFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE--GL 125
GY E D+E+IF A A++ DP++YR DA++L +A+ + L+ + E E G
Sbjct: 66 GYQPERDKESIFNALCQAVESDPQRYRQDAERLGLFAKNTSTPELIAWLRGETHKEEVGD 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRSV 178
L+ + + +F YSR FA+G+F LLEL++ L+ + A LN+++ +
Sbjct: 126 LQQQIQAIAHNPHFKYSRLFAIGVFGLLELSDPALVKDEKQRVDALKSIAATLNISEDKL 185
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
++DL++YR + K+ QA + + + ++KKR+++T P K
Sbjct: 186 NKDLELYRANVDKMEQALATIADILSADRKKRQQQT-PDKG 225
>gi|298491449|ref|YP_003721626.1| photosystem II biogenesis protein Psp29 ['Nostoc azollae' 0708]
gi|298233367|gb|ADI64503.1| photosystem II biogenesis protein Psp29 ['Nostoc azollae' 0708]
Length = 235
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 77/225 (34%), Positives = 136/225 (60%), Gaps = 17/225 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F L+ RPI +IY V++EL+V+ HL+ ++Y+ ++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNVDFRYNSIYALGVVTAFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
GY E+D+ +IF A I A+++DP++YR DA +L+ A+ L+ + S ++ +
Sbjct: 66 GYQPEQDQASIFNAIIQAVEQDPQRYRQDAARLQVVAQSLLTKDLISWLSQTTYLDQDRD 125
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 174
++ L+ IA A F YSR FA+GLF LLE+ ++ L+ + L+++
Sbjct: 126 LQAQLQAIANNAE----FKYSRLFAIGLFSLLEMVDSELVKDEKQRNQALKAIAQGLHLS 181
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
+ + +DL++YR+ L KL QA ++ + + ++KKR++R + A
Sbjct: 182 EEKLTKDLELYRSNLDKLAQALIVMADMLAADRKKRDQRQQKSTA 226
>gi|452819272|gb|EME26335.1| thylakoid protein [Galdieria sulphuraria]
Length = 316
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 81/209 (38%), Positives = 120/209 (57%), Gaps = 7/209 (3%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TVAET +FLK ++ PIPSIY T++QEL+V HL R +QYDPVFALG+ V +
Sbjct: 86 TVAETISDFLKHFRHPIPSIYRTIVQELLVTTHLARVAVGFQYDPVFALGYQMVTQVFFK 145
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE--VEGL 125
YP E++E +F + AL D E+ + DA LEEW R +T ++ + G+ + L
Sbjct: 146 SYPKVEEKEKLFDSMCKALLLDYERMKKDASVLEEWTRSRTEREILLAIEEGGDDPLANL 205
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELAN-ATEPTVLEKLCAVLNVNKRSVDRDLDV 184
L IA+ F YSR F +GL R++EL +K + L+++ +++DLD
Sbjct: 206 LHSIAQ----NDGFVYSRLFGLGLVRMMELCGEEANSERCQKWASALHISSLKLEQDLDT 261
Query: 185 YRNLLSKLLQAKELLKEYVDREKKKREER 213
Y+ L +L QA++L E R+KKK E+
Sbjct: 262 YQQSLERLKQAEQLFAELEARQKKKLAEK 290
>gi|427719034|ref|YP_007067028.1| Protein thf1 [Calothrix sp. PCC 7507]
gi|427351470|gb|AFY34194.1| Protein thf1 [Calothrix sp. PCC 7507]
Length = 235
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 133/221 (60%), Gaps = 17/221 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F L+ RPI +IY V++EL+V+ HL+ + Y+ ++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYNLHTRPINTIYRRVVEELMVEMHLLSVNIDFSYNSIYALGVVTTFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS------KEGE 121
GY E D+E+IF A A++++P++YR DA++L A+ A+ L+ + S ++ +
Sbjct: 66 GYLPERDQESIFNALCHAVEQEPQRYRQDAERLRVLAKSLPANDLIAWLSQTTHLDQDAD 125
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVN 174
++ L+ IA NF YSR A+GLF LLEL++ L+ + L ++
Sbjct: 126 LQAQLQAIA----NNPNFKYSRLLAIGLFTLLELSDPELVKDEKQRNEALKAIATGLQLS 181
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
+++DL++YR+ L K+ QA ++ + + ++KKRE+R +
Sbjct: 182 DEKLNKDLELYRSNLDKIAQALIVMADVLSADRKKREQRKQ 222
>gi|443317266|ref|ZP_21046682.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 6406]
gi|442783151|gb|ELR93075.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 6406]
Length = 251
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 131/236 (55%), Gaps = 13/236 (5%)
Query: 7 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
PTV++TK F + RPI S+Y V++EL+V+ HL+R + YDPV+ALG VT +DR M
Sbjct: 5 PTVSDTKRAFYSYHNRPIASVYRRVIEELMVEMHLLRVNEDFVYDPVYALGIVTTFDRFM 64
Query: 67 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL-VEFPSKEGEVEGL 125
GY E D +IF A A +QYR DA+ + G++ +L S+ E L
Sbjct: 65 AGYRPEADEASIFAALCQANAGTADQYRRDAEVMVAEVSGRSLDALKAILISRSAEGADL 124
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS-------V 178
LK + + + + F YSR FA+GL+ L+E +A EKL +L S +
Sbjct: 125 LKGVLQGIADRDRFKYSRAFAIGLYTLIETVDAEILKDKEKLMELLKAVAESLPLSFDKL 184
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQK-----ANEAIKKCLGE 229
+D+++YR+ L+K+ QAK ++ + + ++KKREER + + N+A+ GE
Sbjct: 185 QKDVELYRSNLTKMEQAKIVMADILAADRKKREERAKAKADAASLPNDAVVTPSGE 240
>gi|428304539|ref|YP_007141364.1| Protein thf1 [Crinalium epipsammum PCC 9333]
gi|428246074|gb|AFZ11854.1| Protein thf1 [Crinalium epipsammum PCC 9333]
Length = 243
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 77/217 (35%), Positives = 131/217 (60%), Gaps = 9/217 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK +F + RPI SIY V++EL+V+ HL+ + Y P++ALG VT Y++ M+
Sbjct: 6 TVSDTKRDFYNNHTRPINSIYRRVVEELMVEMHLLSVNVDFAYHPIYALGVVTSYEKFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 125
GY E DR++IF A + A+ ED ++Y+ DA++L+ A + L+++ V+G
Sbjct: 66 GYRPERDRDSIFDALVGAVGEDSQRYKQDAEQLKALAGRLSGKELIDWIVSPTAVDGAGS 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT----EPTVLEKLCAV---LNVNKRSV 178
L D + F YSR FA+GL+ LLE+++ + E L+ L V L++ +
Sbjct: 126 LPDQMRAIANNPQFKYSRLFAIGLYTLLEVSDPSLVKDEKERLDALNQVGQSLHLPTEKL 185
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
+DLD+YR+ L K+ Q + +K+ ++ ++KKRE+R +
Sbjct: 186 HKDLDLYRSNLEKMAQVQIAMKDALEADRKKREKRDQ 222
>gi|428313474|ref|YP_007124451.1| photosystem II biogenesis protein Psp29 [Microcoleus sp. PCC 7113]
gi|428255086|gb|AFZ21045.1| photosystem II biogenesis protein Psp29 [Microcoleus sp. PCC 7113]
Length = 241
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 78/223 (34%), Positives = 130/223 (58%), Gaps = 17/223 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK +F + RP+ SI+ V++EL+V+ HL+ + Y+P++ALG VT ++R ME
Sbjct: 6 TVSDTKRDFYNHHTRPVNSIFRRVVEELMVEMHLLSVNVDFHYEPIYALGVVTSFNRFME 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE------GE 121
GY E D+ +IF A ++ +PEQY+ DAQ LE A T LV + S G+
Sbjct: 66 GYRPERDKASIFDALCHSVGNNPEQYKQDAQWLESMAERVTGEELVSWLSAPRPQDTLGD 125
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVN 174
+ + IAE F YSR FA+GL+ LLE A++ L+K+ L++
Sbjct: 126 LYAAVAAIAENP----KFKYSRLFAIGLYTLLEKADSELVQDEKRRTEALKKISDGLHLP 181
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 217
+ + +DL++YR+ L K+ Q + ++++ + ++KKRE+R + Q
Sbjct: 182 EEKLQKDLELYRSNLQKMEQVRIVIEDAIQADRKKREKRIQDQ 224
>gi|414076688|ref|YP_006996006.1| photosystem II biogenesis protein Psp29 [Anabaena sp. 90]
gi|413970104|gb|AFW94193.1| photosystem II biogenesis protein Psp29 [Anabaena sp. 90]
Length = 223
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 76/213 (35%), Positives = 125/213 (58%), Gaps = 9/213 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F L+ RPI +IY V++EL+V+ HL+ + YD ++ALG VT +DR M+
Sbjct: 6 TVSDTKRTFYTLHTRPINTIYRRVVEELMVEMHLLSVNVDFSYDAIYALGVVTTFDRFMD 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV--EGL 125
GY E+D+E+IF+A A+++DP+ YR DA +L+ A A L+ S+ + +
Sbjct: 66 GYQPEQDKESIFRAICQAVEQDPQSYRQDASRLQALAASLPAKDLIASLSQASPLNQDAD 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATE--PTVLEKLCAVLNVNKRSV 178
L+ E + NF YSR F VGLF LL EL E L+ + L++++ +
Sbjct: 126 LQKQLEAVAANSNFKYSRLFGVGLFALLVQSDPELVKKDEQRAEALKAISNGLHISEDKL 185
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 211
+DL++Y + L K+ QA ++ + + ++KKR+
Sbjct: 186 IKDLELYSSNLEKMAQALIVMADILTADRKKRD 218
>gi|56750022|ref|YP_170723.1| Thf1-like protein [Synechococcus elongatus PCC 6301]
gi|81300364|ref|YP_400572.1| Thf1-like protein [Synechococcus elongatus PCC 7942]
gi|56684981|dbj|BAD78203.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169245|gb|ABB57585.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 280
Score = 135 bits (341), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 124/217 (57%), Gaps = 10/217 (4%)
Query: 7 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
PTV+++K F Y RPI +Y V++EL+V+ HL+ ++ YDP+FALG VT +D M
Sbjct: 31 PTVSDSKRAFYAAYPRPINPLYRRVVEELLVEIHLLSVNTSFVYDPLFALGVVTAFDSFM 90
Query: 67 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE---GEVE 123
Y E +F A A++++PEQYR DA + E RG + ++ ++ ++ G
Sbjct: 91 SSYRPIEAVGPLFTALTQAVRQNPEQYRHDANAIAEQVRGVGSDTIRQWLTEAEALGNAP 150
Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELAN---ATEPTVLEKLCAVL----NVNKR 176
L++ + +G+ F YSR FA+GLF LLE A +P L+ + ++
Sbjct: 151 ELVRSSFQAIAGRSEFKYSRLFAIGLFSLLETAAPDLVQDPEALKTTVTAIAERFHLPSD 210
Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
+ +DLD+YR+ L K+ QA+ ++E + +++KRE+R
Sbjct: 211 KLQKDLDLYRSNLEKMEQARITMEEAIQADRRKREQR 247
>gi|97202823|sp|Q5N664.2|THF1_SYNP6 RecName: Full=Protein thf1
gi|97202830|sp|Q31MY4.2|THF1_SYNE7 RecName: Full=Protein thf1
Length = 254
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 124/217 (57%), Gaps = 10/217 (4%)
Query: 7 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
PTV+++K F Y RPI +Y V++EL+V+ HL+ ++ YDP+FALG VT +D M
Sbjct: 5 PTVSDSKRAFYAAYPRPINPLYRRVVEELLVEIHLLSVNTSFVYDPLFALGVVTAFDSFM 64
Query: 67 EGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE---GEVE 123
Y E +F A A++++PEQYR DA + E RG + ++ ++ ++ G
Sbjct: 65 SSYRPIEAVGPLFTALTQAVRQNPEQYRHDANAIAEQVRGVGSDTIRQWLTEAEALGNAP 124
Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELAN---ATEPTVLEKLCAVL----NVNKR 176
L++ + +G+ F YSR FA+GLF LLE A +P L+ + ++
Sbjct: 125 ELVRSSFQAIAGRSEFKYSRLFAIGLFSLLETAAPDLVQDPEALKTTVTAIAERFHLPSD 184
Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
+ +DLD+YR+ L K+ QA+ ++E + +++KRE+R
Sbjct: 185 KLQKDLDLYRSNLEKMEQARITMEEAIQADRRKREQR 221
>gi|427419843|ref|ZP_18910026.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 7375]
gi|425762556|gb|EKV03409.1| photosystem II biogenesis protein Psp29 [Leptolyngbya sp. PCC 7375]
Length = 258
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 80/220 (36%), Positives = 130/220 (59%), Gaps = 16/220 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F + RPI S+Y V++EL+V+ HL+ + Y+P++ALG +T +DR M
Sbjct: 15 TVSDTKRAFYNYHSRPINSLYRRVIEELMVEMHLLSVNVDFVYNPLYALGVITSFDRFMV 74
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL-VEFPS-KEGEVEGL 125
GY E+D+E+I A A++ DP+QYR DA+ L+ + S L + S K + GL
Sbjct: 75 GYEPEQDKESILSAICQAVEGDPQQYRQDAEALKSDLANLSLSDLNTQLASAKTTDGNGL 134
Query: 126 ---LKDIAERASGKGNFSYSRFFAVGLFRLLELANATE-------PTVLEKLCAVLNVNK 175
L +A +AS K Y+R AVGL+ L E + + +L+ +L +
Sbjct: 135 QNKLHVVATQASAK----YTRLMAVGLYTLFETVDISSLEDKDSREEMLKTAAEMLALPA 190
Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
VD+DL++YR+ L K+ QA+E++K+ ++ E+KKRE+R +
Sbjct: 191 EKVDKDLELYRSNLDKMAQAQEVMKDILEAERKKREQRAQ 230
>gi|308801781|ref|XP_003078204.1| inositol phosphatase-like protein (ISS) [Ostreococcus tauri]
gi|116056655|emb|CAL52944.1| inositol phosphatase-like protein (ISS) [Ostreococcus tauri]
Length = 657
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 77/201 (38%), Positives = 118/201 (58%), Gaps = 16/201 (7%)
Query: 27 IYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITAL 86
++ TV+QEL+VQ H +Y + +Y+ + +LGFV+VYD+L EG+PSEE++ IF A++ AL
Sbjct: 79 VWATVVQELLVQGHFQKYNKKSEYNELASLGFVSVYDQLFEGFPSEEEKGKIFNAFLGAL 138
Query: 87 KEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKD--IAERA--SGKGNFSYS 142
ED + R DA+ +L F + VEGL ++ A+ A S +G Y+
Sbjct: 139 DEDAVRTRADAE------------TLGAFATSANGVEGLKENAIFAKLAAKSAEGTLLYT 186
Query: 143 RFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEY 202
++ A+G+FR+LELA AT+P LE L ++ V DL +Y+ LLSKL AKEL +E
Sbjct: 187 KYIAIGMFRMLELAKATDPAALEALVTAGGLSMSKVSGDLSMYKGLLSKLAAAKELQEEL 246
Query: 203 VDREKKKREERTEPQKANEAI 223
+ + R +A +AI
Sbjct: 247 CETFRSTPRARMSFTEAFKAI 267
>gi|257059049|ref|YP_003136937.1| Thf1-like protein [Cyanothece sp. PCC 8802]
gi|256589215|gb|ACV00102.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 8802]
Length = 235
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 73/215 (33%), Positives = 125/215 (58%), Gaps = 10/215 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK +F + RPI SIY ++EL+V+ HL+ ++YDP++ALG V + + M+
Sbjct: 6 TVSDTKRDFYTHHTRPINSIYRRFIEELLVEMHLLCVNIDFRYDPIYALGVVASFQQFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGE-VEG 124
GY EED+ +IF A A+ D E+YR +AQ L +G + S L+ ++ GE EG
Sbjct: 66 GYRPEEDKNSIFSALCQAVGGDGEKYRHEAQTLLNQVKGMSVSDLIAMGNSARTGEPGEG 125
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 177
+L + + + F YSR FA+GL+ ++ +A +LC LN++
Sbjct: 126 MLFNTLQAIANNPQFKYSRLFAIGLYTMVMEIDADLLKEQDKRNETFSQLCNGLNLSSDK 185
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREE 212
+ +DLD+YR+ + K+ Q ++++ ++ E+KKRE+
Sbjct: 186 LQKDLDLYRSNVDKMGQLLAVIEDALEAERKKREK 220
>gi|218245998|ref|YP_002371369.1| Thf1-like protein [Cyanothece sp. PCC 8801]
gi|254784143|sp|B7K277.1|THF1_CYAP8 RecName: Full=Protein thf1
gi|218166476|gb|ACK65213.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 8801]
Length = 235
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 73/215 (33%), Positives = 125/215 (58%), Gaps = 10/215 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK +F + RPI SIY ++EL+V+ HL+ ++YDP++ALG V + + M+
Sbjct: 6 TVSDTKRDFYNHHTRPINSIYRRFIEELLVEMHLLCVNIDFRYDPIYALGVVASFQQFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGE-VEG 124
GY EED+ +IF A A+ D E+YR +AQ L +G + S L+ ++ GE EG
Sbjct: 66 GYRPEEDKNSIFSALCQAVGGDGEKYRHEAQTLLNQVKGMSVSDLIAMGNSARTGEPGEG 125
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 177
+L + + + F YSR FA+GL+ ++ +A +LC LN++
Sbjct: 126 MLYNTLQAIAKNPQFKYSRLFAIGLYTMVMEIDADLLKEQDKRNETFSQLCNGLNLSSDK 185
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREE 212
+ +DLD+YR+ + K+ Q ++++ ++ E+KKRE+
Sbjct: 186 LQKDLDLYRSNVDKMGQLLAVIEDALEAERKKREK 220
>gi|428302138|ref|YP_007140444.1| Protein thf1 [Calothrix sp. PCC 6303]
gi|428238682|gb|AFZ04472.1| Protein thf1 [Calothrix sp. PCC 6303]
Length = 235
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 75/228 (32%), Positives = 130/228 (57%), Gaps = 18/228 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F ++ RPI +IY V++EL+V+ HL+ + Y+P++ALG T ++R M+
Sbjct: 6 TVSDTKKTFYSIHTRPINTIYRRVVEELMVEMHLLSVNTDFTYNPIYALGVATAFERFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSK------EGE 121
GY E+D+E +F A +++ D ++ + +A L++ A + L+ S+ GE
Sbjct: 66 GYDPEKDKEQLFHALCQSVEIDTQKIKQEAHSLKDVAASMSVGDLISCLSRAKRFDNAGE 125
Query: 122 VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVN 174
++ L IA F YSR FA+GLF LLE A+ L + LN++
Sbjct: 126 LQNQLDAIA----SNPKFKYSRLFAIGLFSLLEAASPETVKDEKQRNDALVSIAKGLNIS 181
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
+ + +DLD+YR+ L K+ QA ++ + + ++KKRE+R + QK++ A
Sbjct: 182 EDKLSKDLDLYRSNLDKMAQAMVVMADMLAADRKKREQRAQ-QKSSVA 228
>gi|307155000|ref|YP_003890384.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7822]
gi|306985228|gb|ADN17109.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7822]
Length = 233
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 74/233 (31%), Positives = 133/233 (57%), Gaps = 25/233 (10%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ M+
Sbjct: 6 TVSDSKRDFYSKHTRPINSVYRRVVEELLVETHLLSVNSDFHYDPIYALGVVTSFEQFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA--------SSLVEFPSKE 119
GY E D+E+IF A ++ DP+QYR DAQ + A+ +A SS + +P +
Sbjct: 66 GYRPETDKESIFNALCQSVGGDPQQYRGDAQSILSTAKQLSAQDLLSKLQSSSIAYPQGD 125
Query: 120 GEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEP----------TVLEKLCA 169
++ L IA F Y+R FA+G++ +L T+P V++++
Sbjct: 126 NKIIETLVAIA----NAPKFKYTRLFAIGIYTILA---ETDPELLKDQQKRHEVIKQIAE 178
Query: 170 VLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
+L++ + + +DLD+YR+ L K+ Q +++E + ++KKRE+R + + E
Sbjct: 179 ILHLPEEKMQKDLDLYRSNLEKMEQLLTVIEEALQADRKKREQRDQAKTQAET 231
>gi|218442064|ref|YP_002380393.1| Thf1-like protein [Cyanothece sp. PCC 7424]
gi|254784142|sp|B7KI38.1|THF1_CYAP7 RecName: Full=Protein thf1
gi|218174792|gb|ACK73525.1| photosystem II biogenesis protein Psp29 [Cyanothece sp. PCC 7424]
Length = 226
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 75/220 (34%), Positives = 132/220 (60%), Gaps = 18/220 (8%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K +F + RPI S+Y V++EL+V+ HL+ +QYDPV+ALG VT + R M+
Sbjct: 6 TVSDSKRDFYTKHTRPINSVYRRVVEELMVEMHLLSVNSDFQYDPVYALGVVTSFQRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLV-------EFPSKEG 120
GY + D+E+IF A ++ DP+QYR DA+++ E A+ +A L+ + S E
Sbjct: 66 GYRPDADKESIFNALCQSVGGDPQQYRQDAERMIESAKQLSAQQLLFNLESASDSSSGEN 125
Query: 121 EVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNV 173
++ L IA + Y+R FA+G++ +L E+ TE V++++ VL++
Sbjct: 126 QILQTLIGIA----NAPKYKYTRLFAIGIYTILAETDPEMLKNTEKREEVVKQIAKVLHL 181
Query: 174 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
+ + +DLD+YR+ L K+ Q +++E + ++KKRE++
Sbjct: 182 PEEKMQKDLDLYRSNLEKMDQLLTVIEEALQADRKKREQQ 221
>gi|332705256|ref|ZP_08425337.1| photosystem II biogenesis protein Psp29 [Moorea producens 3L]
gi|332355999|gb|EGJ35458.1| photosystem II biogenesis protein Psp29 [Moorea producens 3L]
Length = 257
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 117/206 (56%), Gaps = 11/206 (5%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK +F + RPI SIY V++EL+V+ HL+ + YDP++ LG VT +DR M+
Sbjct: 6 TVSDTKRDFYTYHTRPINSIYRRVVEELMVEMHLLSVNVDFNYDPIYGLGVVTCFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLE---EWARGQTASSLVEFPSKEGEVEG 124
Y E D+E+IF A A+ + +QY+ DAQ+L+ + GQ S + P+ E
Sbjct: 66 SYQPENDKESIFNALCQAVGGEAQQYQEDAQRLKTSVDSMSGQDLISWLSSPTSENGSGD 125
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLEKLCAVLNVNKRS 177
L IA A F YSR FA+GLF LLE ++ V+ + + LN+
Sbjct: 126 LATTIAAIAQ-NSQFKYSRLFAIGLFSLLEQTDSELAQDQKQLEEVINNISSGLNLPSEK 184
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYV 203
+ +DL++YR+ L K+ QA+ ++++ +
Sbjct: 185 LQKDLELYRSNLEKMAQARVVIEDAI 210
>gi|37520969|ref|NP_924346.1| Thf1-like protein [Gloeobacter violaceus PCC 7421]
gi|81710432|sp|Q7NKS7.1|THF1_GLOVI RecName: Full=Protein thf1
gi|35211965|dbj|BAC89341.1| glr1400 [Gloeobacter violaceus PCC 7421]
Length = 228
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 81/221 (36%), Positives = 125/221 (56%), Gaps = 11/221 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K F Y RP+ SIY V+ EL+V+ HL+ + +++DP+FA G +T Y LME
Sbjct: 6 TVSDSKRAFFAAYPRPVNSIYRRVIDELLVEVHLLITNQDFRHDPLFATGLLTAYQALME 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV-EGLL 126
GY E R+AI +A TAL+ EQ DA + A A ++E + + E +G L
Sbjct: 66 GYTPVEQRDAILRALCTALELSYEQLHTDAAQWRAIAAELPAQEVLEVMAGKREAGDGRL 125
Query: 127 KDIAERASGKGN---FSYSRFFAVGLFRLLELAN----ATEPTVLEKL---CAVLNVNKR 176
K + + +G N F YSR F++GL +LE A +E LE+L C L ++
Sbjct: 126 KAMGDTLAGIANAERFKYSRLFSLGLANILEQAGRAAAMSEKDRLERLQQICTYLKLDYN 185
Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 217
V RDLD + ++L ++ ++KE++ E E++KREER Q
Sbjct: 186 RVKRDLDFFHSVLERIKRSKEVVDELSQTERRKREERAVSQ 226
>gi|359462375|ref|ZP_09250938.1| Thf1-like protein [Acaryochloris sp. CCMEE 5410]
Length = 214
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 68/188 (36%), Positives = 115/188 (61%), Gaps = 16/188 (8%)
Query: 36 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 95
+V+ HL+R ++YDP+FALG T +DR M+GY E D++AIF A A + DP Q +
Sbjct: 1 MVEMHLLRVNEDFRYDPIFALGVTTSFDRFMDGYQPENDKDAIFSAICKAQEADPVQMQK 60
Query: 96 DAQKLEEWARGQTASSLVEFPSKEG-----EVEGLLKDIAERASGKGNFSYSRFFAVGLF 150
D Q+L E A+ ++A ++++ ++ E++ L++IA+ F YSR FA+GLF
Sbjct: 61 DGQRLTELAQSKSAQEMLDWITQAANSGGDELQWQLRNIAQNP----KFKYSRLFAIGLF 116
Query: 151 RLLELA--NATE-----PTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYV 203
LLEL+ N T+ L +C VLN+++ + +DL++YR L K+ Q ++ + + +
Sbjct: 117 TLLELSEGNITQDEESLAEFLPNICTVLNISESKLQKDLEIYRGNLDKIAQVRQAMDDIL 176
Query: 204 DREKKKRE 211
+ +KK+RE
Sbjct: 177 EAQKKRRE 184
>gi|425459592|ref|ZP_18839078.1| Protein thf1 [Microcystis aeruginosa PCC 9808]
gi|389822632|emb|CCI29709.1| Protein thf1 [Microcystis aeruginosa PCC 9808]
Length = 233
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 75/217 (34%), Positives = 124/217 (57%), Gaps = 14/217 (6%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRHDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 176
L D F YSR FA+GL+ +L A EP +L+K +L+++
Sbjct: 130 -LSDSLVSVINAAKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 187
Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
+ +DLDVYR L K+ Q +++++ ++ EKKKR+++
Sbjct: 188 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQK 224
>gi|440752363|ref|ZP_20931566.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
TAIHU98]
gi|440176856|gb|ELP56129.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
TAIHU98]
Length = 228
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 75/217 (34%), Positives = 124/217 (57%), Gaps = 14/217 (6%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 6 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 66 GYRPGEDKPNIFNALCQAVNGNPEVYRHDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 124
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 176
L D F YSR FA+GL+ +L A EP +L+K +L+++
Sbjct: 125 -LSDSLVSVINAAKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 182
Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
+ +DLDVYR L K+ Q +++++ ++ EKKKR+++
Sbjct: 183 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQK 219
>gi|22298677|ref|NP_681924.1| Thf1-like protein [Thermosynechococcus elongatus BP-1]
gi|81743247|sp|Q8DJT8.1|THF1_THEEB RecName: Full=Protein thf1
gi|22294857|dbj|BAC08686.1| tlr1134 [Thermosynechococcus elongatus BP-1]
Length = 222
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 85/223 (38%), Positives = 129/223 (57%), Gaps = 16/223 (7%)
Query: 6 PPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRL 65
P TV++TK F + RPI SIY ++EL+V+ HL+R ++Y P+FALG VT +D+
Sbjct: 4 PRTVSDTKRAFYAAHTRPIHSIYRRFIEELLVEIHLLRVNVDFRYSPLFALGVVTAFDQF 63
Query: 66 MEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL 125
MEGY E DR+ IF A A + +P+Q + DA +++ + L E S G+
Sbjct: 64 MEGYQPEGDRDRIFHALCVAEEMNPQQLKEDAASWQQYQGRPLSQILDELNS--GQPSAP 121
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLL-ELANATEPTV-----LEKLCAVLNVNKRSVD 179
L + +GK YSR AVGL+ L ELA E T+ L++L V+ + V
Sbjct: 122 LNSLNH--TGK----YSRLHAVGLYAFLQELAG--EVTIHLNETLDQLAPVIPLPIEKVK 173
Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
RDL++YR+ L K+ QA+ L+KE V++E+K+R ++T A +A
Sbjct: 174 RDLELYRSNLDKINQARSLMKELVEQERKRRAQQTSAPPAVDA 216
>gi|425436789|ref|ZP_18817221.1| Protein thf1 [Microcystis aeruginosa PCC 9432]
gi|425451594|ref|ZP_18831415.1| Protein thf1 [Microcystis aeruginosa PCC 7941]
gi|389678450|emb|CCH92698.1| Protein thf1 [Microcystis aeruginosa PCC 9432]
gi|389767069|emb|CCI07461.1| Protein thf1 [Microcystis aeruginosa PCC 7941]
Length = 233
Score = 130 bits (327), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 75/223 (33%), Positives = 127/223 (56%), Gaps = 14/223 (6%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENIIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 176
L D F YSR FA+GL+ +L A EP +L+K +L+++
Sbjct: 130 -LSDSLVSVINAAKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 187
Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
+ +DLDVYR L K+ Q +++++ ++ EKKKR+++ + ++
Sbjct: 188 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQT 230
>gi|422302142|ref|ZP_16389506.1| Protein thf1 [Microcystis aeruginosa PCC 9806]
gi|389788699|emb|CCI15466.1| Protein thf1 [Microcystis aeruginosa PCC 9806]
Length = 233
Score = 130 bits (327), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 79/226 (34%), Positives = 128/226 (56%), Gaps = 19/226 (8%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 176
L D F YSR FA+GL+ +L A EP +L+K +L+++
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREP-ILQKFSEILHLSSE 187
Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR-----EERTEPQ 217
+ +DLDVYR+ L K+ Q +++++ ++ EKKKR E++T PQ
Sbjct: 188 KLQKDLDVYRSNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQTTPQ 233
>gi|425445848|ref|ZP_18825868.1| Protein thf1 [Microcystis aeruginosa PCC 9443]
gi|389734049|emb|CCI02237.1| Protein thf1 [Microcystis aeruginosa PCC 9443]
Length = 233
Score = 130 bits (326), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 79/226 (34%), Positives = 127/226 (56%), Gaps = 19/226 (8%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 176
L D F YSR FA+GL+ +L A EP +L+K +L+++
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 187
Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR-----EERTEPQ 217
+ +DLDVYR L K+ Q +++++ ++ EKKKR E++T PQ
Sbjct: 188 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQTTPQ 233
>gi|443328840|ref|ZP_21057433.1| photosystem II biogenesis protein Psp29 [Xenococcus sp. PCC 7305]
gi|442791576|gb|ELS01070.1| photosystem II biogenesis protein Psp29 [Xenococcus sp. PCC 7305]
Length = 270
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 71/218 (32%), Positives = 125/218 (57%), Gaps = 11/218 (5%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK +F Y +PI S+Y +++EL+V+ HL+ ++ DP+F LG V+ ++RLM+
Sbjct: 12 TVSDTKRSFYNNYNKPINSVYRRIVEELLVEMHLLSVNADFKSDPIFYLGVVSCFERLMQ 71
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY ++D+ AIF A A+ DPE YR A L A+ ++ L+ + + + G +
Sbjct: 72 GYQPDQDKGAIFNALCRAVDGDPESYRAQAGNLLAIAKEKSGEELIAWLGEPTAIAG-AE 130
Query: 128 DIAE---RASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRS 177
+IAE + NF YSR F +GL+ LLE A+A + E + L++
Sbjct: 131 NIAETIKSIAANANFKYSRPFGIGLYTLLEEADAKLLEDSDKRNEIFENIAKTLSLPGDK 190
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
+ +DL++YR+ L K+ Q + +++ + +K+RE+R +
Sbjct: 191 MKKDLELYRSNLEKMEQVLKAIEDALQASRKQREKRAQ 228
>gi|425453632|ref|ZP_18833389.1| Protein thf1 [Microcystis aeruginosa PCC 9807]
gi|389800936|emb|CCI19831.1| Protein thf1 [Microcystis aeruginosa PCC 9807]
Length = 233
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/217 (34%), Positives = 124/217 (57%), Gaps = 14/217 (6%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 176
L D F YSR FA+GL+ +L A EP +L+K +L+++
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREP-ILQKFSEILHLSSE 187
Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
+ +DLDVYR L K+ Q +++++ ++ EKKKR+++
Sbjct: 188 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQK 224
>gi|443669636|ref|ZP_21134837.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
DIANCHI905]
gi|159030831|emb|CAO88510.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443330085|gb|ELS44832.1| photosystem II biogenesis protein Psp29 [Microcystis aeruginosa
DIANCHI905]
Length = 228
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/217 (34%), Positives = 124/217 (57%), Gaps = 14/217 (6%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 6 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 66 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENIIAIAKETNIDSLLSQLQNPALGGNNQ- 124
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT--------EPTVLEKLCAVLNVNKR 176
L D F YSR FA+GL+ +L A EP +L+K +L+++
Sbjct: 125 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDIIKEKEKREP-ILQKFSEILHLSSE 182
Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
+ +DLDVYR L K+ Q +++++ ++ EKKKR+++
Sbjct: 183 KLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQK 219
>gi|390439536|ref|ZP_10227927.1| Protein thf1 [Microcystis sp. T1-4]
gi|389837025|emb|CCI32051.1| Protein thf1 [Microcystis sp. T1-4]
Length = 233
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/234 (32%), Positives = 127/234 (54%), Gaps = 36/234 (15%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY ED+ IF A A+ +PE YR DA+ + A KE ++ LL
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIA-------------KETNIDSLLS 117
Query: 128 DIAERASGKGN--------------FSYSRFFAVGLFRLLELANAT--------EPTVLE 165
+ +A G N F YSR FA+GL+ +L A EP +L+
Sbjct: 118 QLQNQALGGDNQLSDSLVSLINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREP-ILQ 176
Query: 166 KLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
K +L+++ + +DLDVYR L K+ Q +++++ ++ EKKKR+++ + ++
Sbjct: 177 KFSEILHLSGEKLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQT 230
>gi|425470743|ref|ZP_18849603.1| Protein thf1 [Microcystis aeruginosa PCC 9701]
gi|389883502|emb|CCI36111.1| Protein thf1 [Microcystis aeruginosa PCC 9701]
Length = 233
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 77/225 (34%), Positives = 125/225 (55%), Gaps = 17/225 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 177
L D F YSR FA+GL+ +L A +L+K +L+++
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREQILQKFSEILHLSSEK 188
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR-----EERTEPQ 217
+ +DLDVYR L K+ Q +++++ ++ EKKKR E++T PQ
Sbjct: 189 LQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQTTPQ 233
>gi|425441488|ref|ZP_18821762.1| Protein thf1 [Microcystis aeruginosa PCC 9717]
gi|425463770|ref|ZP_18843100.1| Protein thf1 [Microcystis aeruginosa PCC 9809]
gi|389717772|emb|CCH98181.1| Protein thf1 [Microcystis aeruginosa PCC 9717]
gi|389829228|emb|CCI29632.1| Protein thf1 [Microcystis aeruginosa PCC 9809]
Length = 233
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 73/222 (32%), Positives = 124/222 (55%), Gaps = 12/222 (5%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSL---VEFPSKEGEVEG 124
GY ED+ IF A A+ +PE YR DA+ + A+ SL ++ P+ G +
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIAKETNIDSLLSQLQNPALGGNNQ- 129
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNKRS 177
L D F YSR FA+GL+ +L A +L+K +L ++
Sbjct: 130 -LSDSLVSVINAPKFKYSRLFAIGLYTILAEAQPDMIKEKEKREQILQKFSEILRLSSEK 188
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
+ +DLDVYR L K+ Q +++++ ++ EKKKR+++ + ++
Sbjct: 189 LQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQT 230
>gi|423062334|ref|ZP_17051124.1| Thf1-like protein [Arthrospira platensis C1]
gi|406716242|gb|EKD11393.1| Thf1-like protein [Arthrospira platensis C1]
Length = 215
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 70/192 (36%), Positives = 109/192 (56%), Gaps = 9/192 (4%)
Query: 31 VLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDP 90
+++EL+V+ HL+ ++YDP++ALG VT +DR M+GY E D+ +I+ A I A + DP
Sbjct: 1 MVEELMVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYIPEADKLSIWAALIMAQESDP 60
Query: 91 EQYRIDAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVG 148
QYR DA LE A + L E ++E + L+ + F YSR FA+G
Sbjct: 61 NQYRADATALEAQAATLSVKDLTERAKIAQESSGDDPLQSCFHAIANNPKFKYSRLFAIG 120
Query: 149 LFRLLELANATEP-------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKE 201
L+ LLE ++ T T+L L + K +++DLD+YR L K+ QA+ +++E
Sbjct: 121 LYTLLEKSDVTAAQDSEGLKTILSDFSEALRLPKDKLEKDLDLYRTNLEKVAQARLMVEE 180
Query: 202 YVDREKKKREER 213
E+KKRE+R
Sbjct: 181 MTQAERKKREQR 192
>gi|170077355|ref|YP_001733993.1| Thf1-like protein [Synechococcus sp. PCC 7002]
gi|254784146|sp|B1XHY6.1|THF1_SYNP2 RecName: Full=Protein thf1
gi|169885024|gb|ACA98737.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
Length = 254
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 74/215 (34%), Positives = 125/215 (58%), Gaps = 15/215 (6%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK +F + RPI SI+ V++EL+V+ HL+ ++YDP +ALG VT ++R M+
Sbjct: 6 TVSDTKRDFYTHHTRPINSIFRRVVEELLVEMHLLSVNADFRYDPFYALGVVTSFERFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 125
GY E D+ +IFQ+ A+ D +Y+ DA L E A+ + + L+E ++ EG
Sbjct: 66 GYRPEADKVSIFQSMCQAIGGDANRYKEDAMALVELAKRCSGTQLIECFRQDVPPEGAQE 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEK----------LCAVLNVNK 175
L + E + +F YSR FA+G++ L +EP +LE + A LN+ +
Sbjct: 126 LWEKIEAIAKNDHFKYSRLFAIGVYTFL---GESEPQLLEDTEKRDEMLTTVTAGLNLPE 182
Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR 210
+ +DLD+YR+ L K+ Q E+L++ + E+++R
Sbjct: 183 EKMKKDLDLYRSNLEKMNQVLEVLEDALAVERQRR 217
>gi|434398071|ref|YP_007132075.1| Protein thf1 [Stanieria cyanosphaera PCC 7437]
gi|428269168|gb|AFZ35109.1| Protein thf1 [Stanieria cyanosphaera PCC 7437]
Length = 238
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 74/223 (33%), Positives = 132/223 (59%), Gaps = 12/223 (5%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++ K +F + + RPI S+Y V++EL+V+ HL+ ++ DP++ LG VT ++RLM+
Sbjct: 16 TVSDAKRDFYQHHTRPINSVYRRVVEELLVEMHLLSVNVDFKSDPIYYLGVVTSFERLMQ 75
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG--- 124
GY E+D+E+IF A A+ EDPE+ R A L A+ ++ LV + S+ +E
Sbjct: 76 GYRPEQDKESIFNALCRAVGEDPERNRAQAGSLLNLAKNKSPQELVAWLSEPTPLENYHD 135
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRS 177
+++ I AS +F YSR FA+GL+ LLE ++ + +LE + L++
Sbjct: 136 IIEPIKAIASNP-HFKYSRLFAIGLYTLLEESDPEILKDVSKRNEILESIATQLHLPGEK 194
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDR-EKKKREERTEPQKA 219
+++DL++YR+ L K+ Q ++++ + K+K + + EP+ A
Sbjct: 195 MNKDLELYRSNLEKMEQLLSVIEDVLQAGRKQKNQPKPEPETA 237
>gi|254423933|ref|ZP_05037651.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
7335]
gi|196191422|gb|EDX86386.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
7335]
Length = 250
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 120/208 (57%), Gaps = 14/208 (6%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F + RPI +IY V++EL+V+ HL+ + YD ++ALG V+ YDR M+
Sbjct: 6 TVSDTKRAFYSQHTRPINAIYRRVVEELMVEAHLLLVNADFNYDSIYALGVVSTYDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPS--KEGEVEG- 124
GY DR+ I++A + A + DP+QYR DA++L ++ S+ F S E + E
Sbjct: 66 GYEPAGDRDNIYRAILQANEADPDQYRRDAEEL--LGVAKSLPSIDAFKSILDEAKTESG 123
Query: 125 --LLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVNK 175
LK +A F YSR FA+GL+ ++E +A ++ ++ + + +N+
Sbjct: 124 SDTLKANLHKAISNPKFKYSRLFAIGLYNVIESIDADMLNDKDKRDALMAEIASTIGLNE 183
Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYV 203
+ +D+D+YR L K+ QA+E++K+ +
Sbjct: 184 DLLKKDIDLYRGNLEKMAQAQEVMKDMI 211
>gi|166367182|ref|YP_001659455.1| Thf1-like protein [Microcystis aeruginosa NIES-843]
gi|166089555|dbj|BAG04263.1| Psb29 Photosystem II sub-stoichiometric subunit [Microcystis
aeruginosa NIES-843]
Length = 233
Score = 126 bits (316), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 74/233 (31%), Positives = 123/233 (52%), Gaps = 34/233 (14%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K +F + RPI S+Y V++EL+V+ HL+ + YDP++ALG VT +++ ME
Sbjct: 11 TVSDSKRDFYTRHTRPINSVYRRVVEELLVEMHLLSVNVDFHYDPIYALGVVTSFEKFME 70
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY ED+ IF A A+ +PE YR DA+ + A KE ++ LL
Sbjct: 71 GYRPGEDKPNIFNALCQAVNGNPEVYRRDAENMIAIA-------------KETNIDSLLS 117
Query: 128 DIAERASGKGN--------------FSYSRFFAVGLFRLLELANAT-------EPTVLEK 166
+ A G N F YSR FA+GL+ +L A +L+K
Sbjct: 118 QLQNPALGANNQLSDSLVSLINAPKFKYSRLFAIGLYTILAEAQPDIIKEKEKREQILQK 177
Query: 167 LCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
+L ++ + +DLDVYR L K+ Q +++++ ++ EKKKR+++ + ++
Sbjct: 178 FSEILRLSSEKLQKDLDVYRGNLDKMDQLLKVIEDALEAEKKKRQQKEQEKQT 230
>gi|427711975|ref|YP_007060599.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
6312]
gi|427376104|gb|AFY60056.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
6312]
Length = 245
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 81/229 (35%), Positives = 123/229 (53%), Gaps = 26/229 (11%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F + RPI SI+ ++EL+V+ HL+R + Y P+ ALG VT Y+ M
Sbjct: 6 TVSDTKKAFYAAHTRPIHSIFRRFVEELLVEVHLLRVNTNFVYSPLLALGIVTAYNHFMS 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTA------SSLVEFPSKEGE 121
GY E DR +IF ++ A + DP+Q + DA + W + L + S+ G+
Sbjct: 66 GYRPETDRNSIFTSFAIAEEFDPQQLQADAAR---WEELAGLELEELQTRLQAWISEGGD 122
Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNV 173
L+D K YSR A+GL+ LLE A T LE+L V+N+
Sbjct: 123 PWHNSLRDAVNNPQTK----YSRLQAIGLYHLLEQAAGNLTQELTTLEASLEQLSPVVNL 178
Query: 174 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEA 222
V +DL++YR+ L K++QA++++ E V+ E+K+RE Q ANEA
Sbjct: 179 PVDKVKKDLELYRSNLDKMIQAQKIMAELVEVERKRRE-----QAANEA 222
>gi|428223137|ref|YP_007107307.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
7502]
gi|427996477|gb|AFY75172.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. PCC
7502]
Length = 226
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 115/205 (56%), Gaps = 10/205 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TVA+ K +F K + +P+ SIY V+ EL+V+ HL+R + + YD +FALG T +DR M
Sbjct: 6 TVADAKHDFYKAFSKPVNSIYRRVVDELLVEVHLLRVSQNFGYDSIFALGLATAFDRFMA 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE--GEVEGL 125
GY E D E IF+ AL DP+Q R ++ L E ++ A F + E +++ L
Sbjct: 66 GYQPESDLEPIFKGLCQALLFDPDQIRQESAHLIELSKQFPAEVKSLFTTLEAGADLDTL 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANA-------TEPTVLEKLCAVLNVNKRSV 178
+ I A+ F YSR FAVG+F LLE A+ ++ ++ L +N +
Sbjct: 126 MGQIRAIATNP-KFKYSRLFAVGVFILLETADPEAIADQDKRQALITQVGDTLKINSERL 184
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYV 203
+DLD+YR+ L K+ Q ++++++ V
Sbjct: 185 LKDLDLYRSNLEKIQQGRQMMEDMV 209
>gi|443478915|ref|ZP_21068602.1| Protein thf1 [Pseudanabaena biceps PCC 7429]
gi|443015728|gb|ELS30564.1| Protein thf1 [Pseudanabaena biceps PCC 7429]
Length = 240
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 74/218 (33%), Positives = 121/218 (55%), Gaps = 16/218 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK +F + +P+ +Y V+ EL+V+ HL++ +T+ YD +FALGFVT +DR
Sbjct: 6 TVSDTKKDFYLAFPKPVNQVYRRVVDELLVEIHLLKVNQTFVYDAIFALGFVTTFDRFTA 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWAR---GQTASSLVEFPSKEG--EV 122
GY E DR A+F A AL+ D ++ R DA L + A + L S +
Sbjct: 66 GYKPETDRFAVFHALCAALQFDSDRIRQDAATLSDLATRSPNDIKTLLTNLDSGISLEPL 125
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLEKLCAVLNVNK 175
G L+ I S K NF YSR VGL+ LLE+++ E +++ + L
Sbjct: 126 SGQLQII----STKENFKYSRLLGVGLYALLEISDPEEIADSAKREELIKLVGETLKFGS 181
Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
+ +D+D+YR+ L K+ QA++++ + V+ E+KKR ++
Sbjct: 182 DRLLKDVDLYRSNLDKIEQARQMIADMVEAERKKRSQK 219
>gi|427726046|ref|YP_007073323.1| Protein thf1 [Leptolyngbya sp. PCC 7376]
gi|427357766|gb|AFY40489.1| Protein thf1 [Leptolyngbya sp. PCC 7376]
Length = 246
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 75/216 (34%), Positives = 117/216 (54%), Gaps = 15/216 (6%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++ K +F + RPI SI+ V++EL+V+ HL+ ++YDP +ALG VT Y+R M+
Sbjct: 6 TVSDAKRDFYGQHTRPINSIFRRVVEELLVEMHLVSVNVDFRYDPFYALGIVTSYERFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 125
GY E D+ +IFQA A+ E Y+ DA L E A+ + LV+ ++ EG
Sbjct: 66 GYRPESDKISIFQAMCQAVGGSAEFYKNDATALVELAKRCSGQQLVDCFRQDNAPEGAGE 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLE----------KLCAVLNVNK 175
L E + F YSR FA+GL+ L EP +LE L +N+
Sbjct: 126 LWAKVEAIAANKKFKYSRLFAIGLYTFL---GEAEPALLEDADKRDEMLATLTEAMNLPG 182
Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 211
+ +DLD+YR+ L K+ Q ++++ + E+K+RE
Sbjct: 183 EKMKKDLDLYRSNLEKMTQVLAVIEDALVAERKRRE 218
>gi|126658461|ref|ZP_01729609.1| hypothetical protein CY0110_21090 [Cyanothece sp. CCY0110]
gi|126620203|gb|EAZ90924.1| hypothetical protein CY0110_21090 [Cyanothece sp. CCY0110]
Length = 246
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 73/212 (34%), Positives = 119/212 (56%), Gaps = 12/212 (5%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F + RPI SIY ++EL+V+ HL+ ++YDP++ALG VT ++R M+
Sbjct: 6 TVSDTKRKFYGYHTRPINSIYRRFVEELLVEMHLLSVNVDFKYDPIYALGVVTSFERFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE--VEGL 125
GY E D+ +IF A A+ + EQY +A+ L A+G S+ EF K G+ +G+
Sbjct: 66 GYRPESDKASIFNALCQAVDGNSEQYHQEAEALINEAKG---LSMTEFKDKLGQEGGDGI 122
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNVNKRSV 178
L + F YSR F VGL+ LL EL E ++++ L + +
Sbjct: 123 LWGTCNAIAQNPKFKYSRLFGVGLYTLLMEIDPELVKEEEKRNQTIKEVSEALQFSSDKL 182
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKR 210
+DLD+YR+ L K+ Q ++++ ++ ++KKR
Sbjct: 183 QKDLDLYRSNLDKMQQLLTVIEDTLEADRKKR 214
>gi|428203624|ref|YP_007082213.1| photosystem II biogenesis protein Psp29 [Pleurocapsa sp. PCC 7327]
gi|427981056|gb|AFY78656.1| photosystem II biogenesis protein Psp29 [Pleurocapsa sp. PCC 7327]
Length = 241
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 73/225 (32%), Positives = 130/225 (57%), Gaps = 10/225 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++ K +F + RPI SIY ++ELIV+ HL+ ++YD ++ALG VT ++R M+
Sbjct: 11 TVSDAKRDFYTHHTRPINSIYRRFVEELIVEMHLLSVNTDFRYDAIYALGVVTAFERFMQ 70
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE--FPSKEGEVEGL 125
GY E+D+ +IF A A + EQYR +A ++ A+ + L+ S E
Sbjct: 71 GYQPEQDKSSIFAALCQATGGNWEQYRQEAGEILAQAKQMSVQELIAKINSSTPTGGENR 130
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT---EPT----VLEKLCAVLNVNKRSV 178
L + + + + N+ YSR FA+GL+ LL A+ +P L+++ L+++ +
Sbjct: 131 LVETLQAIANRSNYKYSRLFAIGLYTLLAEADPDILRDPEKRDRTLKEVTEALHLSPEKL 190
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAI 223
+DLD+YR+ L K+ Q ++L+E ++ E+KKR+++ +P++ I
Sbjct: 191 QKDLDLYRSNLDKMDQLLKVLEEALEAERKKRQQQ-KPEQGTAQI 234
>gi|209522934|ref|ZP_03271491.1| Thf1-like protein [Arthrospira maxima CS-328]
gi|209496521|gb|EDZ96819.1| Thf1-like protein [Arthrospira maxima CS-328]
Length = 210
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/187 (36%), Positives = 104/187 (55%), Gaps = 9/187 (4%)
Query: 36 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 95
+V+ HL+ ++YDP++ALG VT +DR M+GY E D+ +I+ A I A + DP QYR
Sbjct: 1 MVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYIPEADKLSIWAALIMAQESDPNQYRA 60
Query: 96 DAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL 153
DA LE A + L E ++E + L+ + F YSR FA+GL+ LL
Sbjct: 61 DATALEAQAATLSVKDLTERAKIAQESSGDDPLQSCFHAIANNPKFKYSRLFAIGLYTLL 120
Query: 154 ELANATEP-------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDRE 206
E ++ T T+L L + K +++DLD+YR L K+ QA+ +++E E
Sbjct: 121 EKSDVTAAQDSEGLKTILSDFSEALRLPKDKLEKDLDLYRTNLEKVAQARLMVEEMTQAE 180
Query: 207 KKKREER 213
+KKRE+R
Sbjct: 181 RKKREQR 187
>gi|97202816|sp|P0C1D1.1|THF1_SYNJB RecName: Full=Protein thf1
Length = 239
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 124/232 (53%), Gaps = 14/232 (6%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T++ TK F Y RPI ++Y V++EL+V+ HL T+ YDP FALG VT+YD LME
Sbjct: 6 TLSATKAAFFSAYPRPINAVYRRVVEELLVELHLTTVNSTFVYDPFFALGLVTLYDGLME 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE--GL 125
Y E REAIF A AL PE R +A+ L E ++ + E E G
Sbjct: 66 AYHPPEQREAIFNALCKALHLKPEVLRKNARDLLELMGSGDPRQRLDLLCLKPEAEDVGG 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANAT---EP-----TVLEKLCAVLNVNKRS 177
LK I ER + + ++YSR AVGL+ E+ + EP LE + + L +
Sbjct: 126 LKAILERMT-QPPYAYSRVLAVGLYTAYEVVAKSLYEEPEERTRRFLENVVSKLPFSTER 184
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGE 229
V +DL++YR+ L ++ QA+ +++E V ++++E R Q A + LG+
Sbjct: 185 VRKDLELYRSSLDRMKQARAVVEEMVKAARRQQERR---QSAASLPETSLGD 233
>gi|172035357|ref|YP_001801858.1| Thf1-like protein [Cyanothece sp. ATCC 51142]
gi|354555452|ref|ZP_08974753.1| Protein thf1 [Cyanothece sp. ATCC 51472]
gi|254784140|sp|B1WNF0.1|THF1_CYAA5 RecName: Full=Protein thf1
gi|171696811|gb|ACB49792.1| photosystem II 22 kD protein [Cyanothece sp. ATCC 51142]
gi|353552511|gb|EHC21906.1| Protein thf1 [Cyanothece sp. ATCC 51472]
Length = 242
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 118/210 (56%), Gaps = 8/210 (3%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F + RPI SIY ++EL+V+ HL+ ++YDP++ALG VT ++R M+
Sbjct: 6 TVSDTKRKFYGYHTRPINSIYRRFVEELLVEMHLLSVNVDFKYDPIYALGVVTSFERFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E D+ +IF A A+ + EQY +A+ L A+G + + E +EG +G+L
Sbjct: 66 GYSPESDKTSIFNALCQAVDGNSEQYHQEAEALINEAKGLSITEFKEKLGQEGG-DGILW 124
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNVNKRSVDR 180
+ F YSR F VGL+ LL +L + ++++ L + + +
Sbjct: 125 GTCGAIAQNPKFKYSRLFGVGLYTLLMEIDPDLVKEEDKRNQTIKEVSDALQFSSDKLQK 184
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKR 210
DLD+YR+ L K+ Q ++++ ++ ++KKR
Sbjct: 185 DLDLYRSNLDKMQQLLTVIEDTLEADRKKR 214
>gi|409992261|ref|ZP_11275462.1| inositol phosphatase [Arthrospira platensis str. Paraca]
gi|409936888|gb|EKN78351.1| inositol phosphatase [Arthrospira platensis str. Paraca]
Length = 210
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 67/187 (35%), Positives = 101/187 (54%), Gaps = 9/187 (4%)
Query: 36 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 95
+V+ HL+ ++YDP++ALG VT +DR M+GY E D+ +I+ A I A + DP QYR
Sbjct: 1 MVEMHLLSVNVDFKYDPIYALGVVTAFDRFMQGYTPETDKLSIWAALIGAQESDPNQYRA 60
Query: 96 DAQKLEEWARGQTASSLVEFP--SKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLL 153
DA LE A L + ++E + L+ + F YSR A+GL+ LL
Sbjct: 61 DATALEAQAASLAVKDLTDKAKIAQESSGDDPLQSCFHAIANNPKFKYSRLLAIGLYTLL 120
Query: 154 ELANATEP-------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDRE 206
E ++AT T+L L + K + +DLD+YR L K+ QA+ ++ E E
Sbjct: 121 EKSDATAAQDSEGLKTILSDFSEALRLPKDKLVKDLDLYRTNLEKVAQARLMVDEMTQAE 180
Query: 207 KKKREER 213
+KKRE+R
Sbjct: 181 RKKREQR 187
>gi|148242504|ref|YP_001227661.1| Thf1-like protein [Synechococcus sp. RCC307]
gi|147850814|emb|CAK28308.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
Length = 237
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 74/228 (32%), Positives = 125/228 (54%), Gaps = 10/228 (4%)
Query: 1 MISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVT 60
M+ P TVA++K F Y IP +Y V+ EL+V+ HL+ + +Q D +FA+G
Sbjct: 1 MVLSNPQTVADSKRRFYAAYPHVIPGLYRRVVDELLVELHLLAGQAGFQADSLFAMGLTQ 60
Query: 61 VYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEG 120
V+D LM+G+ E ++ +F A + +Q R DA++L E + + + ++G
Sbjct: 61 VFDNLMQGFKPAERQKELFAAICSGAGLKADQLRKDAKQLREHLVPHGEAEIKSWIEQQG 120
Query: 121 E-VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLE---LANATEPTVLE----KLCAVLN 172
+ +LK + ++A G+ +F YSR AVGL LL+ + +P L+ +L +
Sbjct: 121 QGAPDVLKHVLQQA-GRSDFHYSRLHAVGLMGLLQDLSGGDDQDPQALQERAHQLGHSMG 179
Query: 173 VNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER-TEPQKA 219
+ K + +D+ +Y + L K+ QA ELL+E V E++KRE+R EP A
Sbjct: 180 LQKDKLQKDMGLYASNLEKMSQAVELLEETVAAERRKREQRQGEPASA 227
>gi|86606816|ref|YP_475579.1| Thf1-like protein [Synechococcus sp. JA-3-3Ab]
gi|97202812|sp|Q2JSQ3.1|THF1_SYNJA RecName: Full=Protein thf1
gi|86555358|gb|ABD00316.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length = 239
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 82/218 (37%), Positives = 116/218 (53%), Gaps = 15/218 (6%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T++ TK F Y RPI + Y V++EL+V+ HL + YDP FALG VT+YD LME
Sbjct: 6 TLSATKAAFFSAYPRPINAAYRRVVEELLVELHLTTVNSAFVYDPFFALGLVTLYDSLME 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARG----QTASSLVEFPSKEGEVE 123
Y E REAIF A AL PE R +A+ L E R Q + L P E E
Sbjct: 66 AYHPPEQREAIFNALCKALHLKPEVLRKNARDLLELMRSGDPVQRYNLLCLKP--EAEDV 123
Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT---EP-----TVLEKLCAVLNVNK 175
G LK I +R + + ++YSR AVGL+ E + EP LE + L +
Sbjct: 124 GGLKAILQRMT-QPPYAYSRVLAVGLYTAYEAVATSLYKEPEERTRHFLEDVIGNLPFSP 182
Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
V +DL++YR+ L +L QA+ +++E V ++++E R
Sbjct: 183 ERVKKDLELYRSNLDRLKQARAIVEEMVKAARRQQERR 220
>gi|428773451|ref|YP_007165239.1| photosystem II biogenesis protein Psp29 [Cyanobacterium stanieri
PCC 7202]
gi|428687730|gb|AFZ47590.1| photosystem II biogenesis protein Psp29 [Cyanobacterium stanieri
PCC 7202]
Length = 233
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 78/234 (33%), Positives = 123/234 (52%), Gaps = 27/234 (11%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++T+ F + + RPI SIY V+QEL+V+ HL+ +Q D V+A+G +++ M
Sbjct: 6 TVSDTRRAFYQYHTRPINSIYRQVVQELMVEMHLLSVNTDFQPDAVYAVGVCQSFEQFMT 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGL-- 125
GY EED+ +IF A A++ +P+ YR ++ L + G++A LV + GL
Sbjct: 66 GYKPEEDKTSIFNALCKAIEANPDDYRHQSESLLNFVEGKSAEDLVNWLLNPVADNGLDE 125
Query: 126 -----LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL---EKLCAVLNVNKRS 177
LK I ER F YSR F +G + L+ N P V EKL ++
Sbjct: 126 NIVNSLKSILERER----FKYSRLFGIGFYTLI---NKVAPDVAKDEEKLAKLIAPYSEK 178
Query: 178 VD-------RDLDVYRNLLSKLLQAKELLKEYVDREKKKR---EERTEPQKANE 221
+D +D+D+YR+ L K+ Q ++ E ++ KKKR E+ E ++ANE
Sbjct: 179 LDLPVDKLKKDVDLYRSNLDKINQMLVVIAETIEASKKKRINIEKTEEKEEANE 232
>gi|119510704|ref|ZP_01629832.1| hypothetical protein N9414_22068 [Nodularia spumigena CCY9414]
gi|119464658|gb|EAW45567.1| hypothetical protein N9414_22068 [Nodularia spumigena CCY9414]
Length = 200
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 117/193 (60%), Gaps = 17/193 (8%)
Query: 36 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 95
+V+ HL+ + Y+P++ALG VT +DR M+GY E+D+E+IFQA A++++P++YR
Sbjct: 1 MVEMHLLSVNSGFSYNPIYALGVVTSFDRFMQGYLPEQDQESIFQALCQAVEQEPQRYRE 60
Query: 96 DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 149
DA++L+ A+ + L+ + S ++ +++ L+ IA + F YSR FAVGL
Sbjct: 61 DAKRLQALAKDLPVNDLIAWLSQTTHLDRDPDLQAQLQAIAHNS----EFKYSRLFAVGL 116
Query: 150 FRLLELANA-------TEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEY 202
F LLE ++ L+ + A L+++ +++DL++Y + L K+ QA ++ +
Sbjct: 117 FTLLEQSDPELVKDEKQRTEALKTIAAGLHLSDEKLNKDLELYSSNLEKMAQALVVMADM 176
Query: 203 VDREKKKREERTE 215
+ ++KKRE+R +
Sbjct: 177 LSADRKKREQRQQ 189
>gi|428769945|ref|YP_007161735.1| Protein thf1 [Cyanobacterium aponinum PCC 10605]
gi|428684224|gb|AFZ53691.1| Protein thf1 [Cyanobacterium aponinum PCC 10605]
Length = 234
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 63/214 (29%), Positives = 120/214 (56%), Gaps = 11/214 (5%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK +F + ++RPI SIY V++EL+V+ HL+ + DP++ LG + + M+
Sbjct: 18 TVSDTKRSFYQHHQRPINSIYRRVVEELMVEMHLLAVNVDFNPDPIYYLGVYQSFQQFMQ 77
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF---PSKEGEVEG 124
GY E D+E+IF A +++ +P++Y +Q L + G++A ++++ PS EG++E
Sbjct: 78 GYKPESDKESIFNALCQSIENNPQEYISKSQTLLNFVEGKSAQEILDWLLNPSGEGDLEA 137
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRS 177
+ F YSR FA+G + L+E + + ++ L L +
Sbjct: 138 VASHWRSNLENP-RFKYSRLFAIGFYTLIEKGDGEFIKDESKFTDFIQPLIDKLQLPVEK 196
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 211
+ +DLD+YR+ L K+ Q ++ + ++ E+KK++
Sbjct: 197 LKKDLDLYRSNLEKMNQMLSVMADVLEAERKKKQ 230
>gi|16330615|ref|NP_441343.1| Thf1-like-protein [Synechocystis sp. PCC 6803]
gi|383322356|ref|YP_005383209.1| hypothetical protein SYNGTI_1447 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325525|ref|YP_005386378.1| hypothetical protein SYNPCCP_1446 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491409|ref|YP_005409085.1| hypothetical protein SYNPCCN_1446 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436676|ref|YP_005651400.1| hypothetical protein SYNGTS_1447 [Synechocystis sp. PCC 6803]
gi|451814773|ref|YP_007451225.1| hypothetical protein MYO_114600 [Synechocystis sp. PCC 6803]
gi|81671042|sp|P73956.1|THF1_SYNY3 RecName: Full=Protein thf1
gi|1653107|dbj|BAA18023.1| sll1414 [Synechocystis sp. PCC 6803]
gi|339273708|dbj|BAK50195.1| hypothetical protein SYNGTS_1447 [Synechocystis sp. PCC 6803]
gi|359271675|dbj|BAL29194.1| hypothetical protein SYNGTI_1447 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274845|dbj|BAL32363.1| hypothetical protein SYNPCCN_1446 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278015|dbj|BAL35532.1| hypothetical protein SYNPCCP_1446 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407958541|dbj|BAM51781.1| Thf1-like-protein [Bacillus subtilis BEST7613]
gi|451780742|gb|AGF51711.1| hypothetical protein MYO_114600 [Synechocystis sp. PCC 6803]
Length = 240
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 72/214 (33%), Positives = 118/214 (55%), Gaps = 8/214 (3%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++ K F Y RPI SIY ++EL+V+ HL+ + YDP+FALG VT ++ M+
Sbjct: 6 TVSDAKRKFFTHYSRPISSIYRRFVEELLVEMHLLSVNIDFTYDPIFALGIVTSFNSFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVE-FPSKEGEVEGLL 126
GY E AIF A + ++P+Q R DA+ + A + V S++ + LL
Sbjct: 66 GYQPAEQLPAIFNALCHGVDQNPDQVRQDAKNVAASAHHIGLDAWVTAAASEQASGDNLL 125
Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLL-----ELANATEP--TVLEKLCAVLNVNKRSVD 179
+ + F YSR FA+GL+ LL E+ + E L +L +L+++ V
Sbjct: 126 LNTLTGIHQRHKFKYSRLFAIGLYTLLADQDPEVKDNDEKRQDYLTRLSELLDLSLDKVV 185
Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
+DLD+YR+ L K+ Q ++L++ + E+KK+E++
Sbjct: 186 KDLDLYRSNLEKVDQLLKVLEDAAEAERKKKEKQ 219
>gi|67921410|ref|ZP_00514928.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
gi|67856522|gb|EAM51763.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
Length = 245
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 70/232 (30%), Positives = 123/232 (53%), Gaps = 17/232 (7%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK F + +PI SIY ++EL+V+ HL+ + YDP++ALG VT + R M+
Sbjct: 6 TVSDTKRKFYGYHTQPINSIYRRFVEELLVEMHLLSVNIDFSYDPIYALGVVTSFQRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV----- 122
GY E D+ +IF A A+ E+Y +A+ + A+G S+V+F K V
Sbjct: 66 GYSPESDKPSIFNALCQAVDGSSEKYHQEAEAILNEAKGL---SIVDFKDKLTHVTDNQV 122
Query: 123 -EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-------EPTVLEKLCAVLNVN 174
EG+L + F YSR A+GL+ LL ++ ++++ L +
Sbjct: 123 GEGVLWGTFGAIAANPKFKYSRLLAIGLYTLLMEIDSDLLKDEEKRTETIKEVSEALKFS 182
Query: 175 KRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKC 226
+ +DLD+YR+ L K+ Q ++++ ++ ++KKR TE + + E +++
Sbjct: 183 PEKLRKDLDLYRSNLDKMQQLLTVIEDSLEADRKKRAS-TEGKTSAEVVEQT 233
>gi|282901466|ref|ZP_06309391.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281193745|gb|EFA68717.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 201
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 112/193 (58%), Gaps = 17/193 (8%)
Query: 36 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 95
+V+ HL+ + Y+ ++ALG VT +DR M+GY ED +IF A I A+++DP+ YR
Sbjct: 1 MVEMHLLSVNVDFSYNSIYALGVVTTFDRFMQGYQPSEDLVSIFNAIICAVEQDPQVYRQ 60
Query: 96 DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 149
DA KL+ A + L+ + S ++ ++ L+ IA+ NF YSR A+GL
Sbjct: 61 DAAKLKAIANSFSVKDLIAWCSQTTPLDQDANLQAELQAIAQNP----NFKYSRLLAIGL 116
Query: 150 FRLLELAN---ATEPTVLEKLCAV----LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEY 202
F LLEL++ + T + AV L +++ +++DLD+YR+ L K+ QA ++ +
Sbjct: 117 FSLLELSDPEFVKDETQRNQTIAVIAQGLKLSEDKLNKDLDLYRSNLDKMEQALIVMADM 176
Query: 203 VDREKKKREERTE 215
+ ++KKR++R +
Sbjct: 177 LAADRKKRDQRQQ 189
>gi|282898285|ref|ZP_06306276.1| Protein thf1 [Raphidiopsis brookii D9]
gi|281196816|gb|EFA71721.1| Protein thf1 [Raphidiopsis brookii D9]
Length = 202
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 112/193 (58%), Gaps = 17/193 (8%)
Query: 36 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 95
+V+ HL+ + Y+ ++ALG VT +DR M+GY ED +IF A I A+++DP+ YR
Sbjct: 1 MVEMHLLSVNVDFSYNSIYALGVVTTFDRFMQGYQPSEDLVSIFNAIICAVEQDPQVYRQ 60
Query: 96 DAQKLEEWARGQTASSLVEFPS------KEGEVEGLLKDIAERASGKGNFSYSRFFAVGL 149
DA KL+ A + L+ + S ++ ++ L+ IA+ NF YSR A+GL
Sbjct: 61 DAAKLKAIANSFSVKDLIAWCSQTTPLDQDANLQAELQAIAQNP----NFKYSRLLAIGL 116
Query: 150 FRLLELAN---ATEPTVLEKLCAV----LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEY 202
F LLEL++ + T + AV L +++ +++DLD+YR+ L K+ QA ++ +
Sbjct: 117 FSLLELSDPEFVKDETERNQAIAVIAQGLKLSEDKLNKDLDLYRSNLDKMEQALIVMADM 176
Query: 203 VDREKKKREERTE 215
+ ++KKR++R +
Sbjct: 177 LAADRKKRDQRQQ 189
>gi|124023249|ref|YP_001017556.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9303]
gi|123963535|gb|ABM78291.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9303]
Length = 250
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 71/222 (31%), Positives = 117/222 (52%), Gaps = 12/222 (5%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T+A++K F + IPS+Y EL+V+ HL+ +++ + D +FA+G V+D
Sbjct: 14 TIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVFDVFTR 73
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E + +F A + DP R AQK E RG + + ++G +G +
Sbjct: 74 GYRPEAHVKTLFDALCRSCGFDPNALRKQAQKTLESVRGHDLEEVQGWIQQQG--KGAPE 131
Query: 128 DIAE--RASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 178
+A+ R +G F YSR AVGL LL A E + EKL + + K V
Sbjct: 132 ALAQALRNTGSNTFHYSRLMAVGLLSLLASAQGDESSDPEKLSQIAHELSESVGFTKARV 191
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKAN 220
++DL++Y++ L K+ QA EL ++ ++ E++KRE++ E +K N
Sbjct: 192 EKDLNLYKSNLEKMAQAVELSEQILESERRKREQK-ESEKLN 232
>gi|9631702|ref|NP_048481.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]
gi|1131477|gb|AAC96501.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]
gi|448924789|gb|AGE48370.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
AN69C]
Length = 207
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 116/215 (53%), Gaps = 12/215 (5%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
I+ PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG VT
Sbjct: 4 ITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHIHRYNKNYTYSDVSALGIVTT 63
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K G
Sbjct: 64 LDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIESLKPYAKSSHLG-----PNKHGN 117
Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
++ L DIA + YS F AVG+F+LL++ ++ L + V +
Sbjct: 118 TLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGELVHK 173
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
D+ + +LL K +++ + L + + E KR+ ++
Sbjct: 174 DIATFFSLL-KYIESSQKLADDIREESLKRKSKSS 207
>gi|448930219|gb|AGE53784.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
IL-3A]
gi|448933659|gb|AGE57214.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
NE-JV-4]
Length = 207
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 66/215 (30%), Positives = 116/215 (53%), Gaps = 12/215 (5%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
I+ PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG VT
Sbjct: 4 ITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHICRYNKNYTYSDVSALGIVTT 63
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K G
Sbjct: 64 LDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIESLKPYAKSSNLG-----PNKHGN 117
Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
++ L DIA + YS F AVG+F+LL++ ++ L + V +
Sbjct: 118 TLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGELVHK 173
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
D+ + +LL K +++ + L + + E +R+ ++
Sbjct: 174 DIATFFSLL-KYIESSQKLADDIREESLRRKSKSS 207
>gi|448927841|gb|AGE51413.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
CviKI]
Length = 232
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 116/215 (53%), Gaps = 12/215 (5%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
I+ PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG VT
Sbjct: 29 ITSSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALGIVTT 88
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K G
Sbjct: 89 LDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNKHGN 142
Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
++ L DI S + YS F AVG+F+LL++ ++ L + V +
Sbjct: 143 TLQKSLYDI----SINDKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGELVHK 198
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
D+ + +LL K +++ + L + + E KR+ ++
Sbjct: 199 DIATFFSLL-KYIESSQKLADDIREESLKRKSKSS 232
>gi|254413033|ref|ZP_05026805.1| photosystem II biogenesis protein Psp29 [Coleofasciculus
chthonoplastes PCC 7420]
gi|196180197|gb|EDX75189.1| photosystem II biogenesis protein Psp29 [Coleofasciculus
chthonoplastes PCC 7420]
Length = 208
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 114/196 (58%), Gaps = 17/196 (8%)
Query: 36 IVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRI 95
+V+ HL+ ++YDP++ LG V ++R M+GY E D+E+IF A A+ +P+QY+
Sbjct: 1 MVEMHLLAVNVDFKYDPIYVLGVVASFNRFMQGYRPERDKESIFNALCQAVGGNPQQYQD 60
Query: 96 DAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAERASGKGN---FSYSRFFAVGLFRL 152
DA+KL+ +A LV++ +EG +DI + + F YSR FA+GL+ L
Sbjct: 61 DAEKLKAAVGRLSAQELVDWFGSPTPLEG-AEDIHTTVAAIADNPKFKYSRLFAIGLYTL 119
Query: 153 LELANATEP----------TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEY 202
LE A EP +L+++ L++ + + +DL++YR+ L K+ QA+ +++
Sbjct: 120 LEQA---EPELVQDAKQSMEMLQRIGQTLHLPQEKLQKDLELYRSNLEKMAQAQIAIEDA 176
Query: 203 VDREKKKREERTEPQK 218
+ ++KKRE+R + +K
Sbjct: 177 IKADRKKREQREQEKK 192
>gi|448928860|gb|AGE52429.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
CvsA1]
Length = 207
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 116/215 (53%), Gaps = 12/215 (5%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
I+ PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG VT
Sbjct: 4 ITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALGIVTT 63
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K G
Sbjct: 64 LDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNKHGN 117
Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
++ L DI S + YS F AVG+F+LL++ ++ L + V +
Sbjct: 118 TLQKSLYDI----SINDKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGELVHK 173
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
D+ + +LL K +++ + L + + E KR+ ++
Sbjct: 174 DIATFFSLL-KYIESSQKLADDIREESLKRKSKSS 207
>gi|448931622|gb|AGE55183.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
MA-1E]
Length = 207
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 116/215 (53%), Gaps = 12/215 (5%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
I+ PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG VT
Sbjct: 4 ITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHICRYNKNYTYSDVSALGIVTT 63
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K G
Sbjct: 64 LDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYFSNIETLKPYAKSSHLG-----PNKHGN 117
Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
++ L DI S + YS F AVG+F+LL++ ++ L + V +
Sbjct: 118 TLQKSLYDI----SINDKYVYSSFAAVGIFKLLQMNGNYTGNSVKHLSESIGFKGELVHK 173
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
D+ + +LL K +++ + L + + E KR+ ++
Sbjct: 174 DIATFFSLL-KYIESSQKLADDIREESLKRKSKSS 207
>gi|443323210|ref|ZP_21052219.1| photosystem II biogenesis protein Psp29 [Gloeocapsa sp. PCC 73106]
gi|442787120|gb|ELR96844.1| photosystem II biogenesis protein Psp29 [Gloeocapsa sp. PCC 73106]
Length = 231
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/218 (32%), Positives = 116/218 (53%), Gaps = 12/218 (5%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV++TK +F + RPI SIY V++ELIV+ HL+ + ++ DP++ LG VT +DR M+
Sbjct: 6 TVSDTKRDFYAHHTRPINSIYRRVVEELIVELHLLSVNQNFRVDPIYCLGVVTSFDRFMQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWA-RGQTASSLVEFPSKEGEVEGLL 126
GY EED+ +I + A+ EQYR A ++ A R L+ + VEG
Sbjct: 66 GYRPEEDKASILASLCQAVGGKLEQYRDHANQVLNLAKRLHGVDDLLAWFKHPQPVEGEF 125
Query: 127 KDIAERASG---KGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKR 176
+AE S +F YSR F +GL+ +L N + + VL V+
Sbjct: 126 A-LAEAVSAIALNQSFKYSRMFGIGLYTMLGEKNLELLQDKPARDKITAQFAEVLPVSSD 184
Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 214
+ +D ++Y+ L K+ Q +++E ++ E+KKR +++
Sbjct: 185 KLQKDFELYQANLEKMKQMIIVVEEALEAERKKRAKKS 222
>gi|33862947|ref|NP_894507.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9313]
gi|81577657|sp|Q7V7R3.1|THF1_PROMM RecName: Full=Protein thf1
gi|33634864|emb|CAE20850.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 243
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/214 (31%), Positives = 111/214 (51%), Gaps = 10/214 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T+A++K F + IPS+Y EL+V+ HL+ +++ + D +FA+G V+D
Sbjct: 6 TIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVFDVFTS 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV--EGL 125
GY E + +F A + DP R AQ+ E RG + + ++G+ E L
Sbjct: 66 GYRPEAHVKTLFDALCRSCGFDPNALRKQAQQTLESVRGHDLEEVQGWIQQQGKGAPEAL 125
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 178
K + A G F YSR AVGL LL A E + EKL + + +K V
Sbjct: 126 AKALRNTA-GSTTFHYSRLMAVGLLSLLASAQGDESSDPEKLSQIAHELSESVGFSKARV 184
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREE 212
++DL++Y++ L K+ QA EL ++ ++ E++KRE+
Sbjct: 185 EKDLNLYKSNLEKMAQAVELTEQILESERRKREQ 218
>gi|448930916|gb|AGE54479.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
KS1B]
Length = 207
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 115/215 (53%), Gaps = 12/215 (5%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
I+ PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG VT
Sbjct: 4 ITTSPPTVSDTKRIFYANYKKLLLPLYNTPIQNMLVKQHIHRYNKNYTYSDVSALGIVTT 63
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
D ++ +P +E + I A+I +L EDPE Y + + L+ +A+ P+K G
Sbjct: 64 LDSVLNTFPDDE-KVCIKNAFIISLNEDPEMYYSNIEYLKPYAKSSNLG-----PNKHGN 117
Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
++ L DIA + YS F AVG+F+LL++ ++ L + V +
Sbjct: 118 TLQKSLYDIAIN----DKYVYSSFAAVGIFKLLQMNGNYTGKSVKHLSESIGFKGELVHK 173
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
D+ + +LL K +++ + L + + E KR+ ++
Sbjct: 174 DIATFFSLL-KYIESSQKLADDIREESLKRKSKSS 207
>gi|157952488|ref|YP_001497380.1| hypothetical protein NY2A_b184R [Paramecium bursaria Chlorella
virus NY2A]
gi|155122715|gb|ABT14583.1| hypothetical protein NY2A_b184R [Paramecium bursaria Chlorella
virus NY2A]
Length = 247
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/214 (30%), Positives = 115/214 (53%), Gaps = 12/214 (5%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
I+ PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG VT
Sbjct: 45 ITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALGIVTA 104
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K G
Sbjct: 105 LDSILNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNKHGN 158
Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
++ L DIA + YS F A+G+F+LL++ ++ L + V +
Sbjct: 159 TLQKSLYDIASN----DKYVYSSFAAIGIFKLLQMNKNYTGNSVKHLSESVGFKGEIVHK 214
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 214
D+ + +LL K +++ + L + + E K+ + +
Sbjct: 215 DIATFFSLL-KYIESSQKLADDIREESLKKSKSS 247
>gi|448931221|gb|AGE54783.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
MA-1D]
Length = 248
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 108/198 (54%), Gaps = 11/198 (5%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
I+ PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG VT
Sbjct: 45 ITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALGIVTA 104
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K G
Sbjct: 105 LDSILNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNKHGN 158
Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
++ L DIA + YS F A+G+F+LL++ ++ L + V +
Sbjct: 159 TLQKSLYDIASN----DKYVYSSFAAIGIFKLLQMNGNYTGNSVKHLSESIGFKGEIVHK 214
Query: 181 DLDVYRNLLSKLLQAKEL 198
D+ ++ +LL + +++L
Sbjct: 215 DIAMFFSLLKYIESSQKL 232
>gi|116074797|ref|ZP_01472058.1| hypothetical protein RS9916_29724 [Synechococcus sp. RS9916]
gi|116068019|gb|EAU73772.1| hypothetical protein RS9916_29724 [Synechococcus sp. RS9916]
Length = 234
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 66/216 (30%), Positives = 112/216 (51%), Gaps = 9/216 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T+A++K F + IPS+Y EL+V+ HL+ +++ ++ D +FA+G V+D
Sbjct: 6 TIADSKRAFHSAFPHVIPSLYRRTADELLVELHLLSHQKQFKVDALFAVGLRQVFDAFTR 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEGLL 126
GY E +++F A + DP + A E +G + + ++ +GE +
Sbjct: 66 GYRPEAHLDSLFAAICSCNGFDPAALKQLALDSEHAVQGHSFEDVQQWLRNKGEGAPAAI 125
Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLLELA---NATEPTVLEKLCA----VLNVNKRSVD 179
+ +RA NF YSR AVGL LL A + ++P+ L KL L + K V+
Sbjct: 126 TKVLKRAD-HANFHYSRLMAVGLLTLLAKAQGDDGSDPSELAKLAHELSEPLGLTKERVE 184
Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
+DL +Y L ++ QA EL++E + E++KRE + E
Sbjct: 185 KDLGIYTGNLERMAQAVELMEETLAAERRKRERQNE 220
>gi|157953365|ref|YP_001498256.1| hypothetical protein AR158_C174R [Paramecium bursaria Chlorella
virus AR158]
gi|156068013|gb|ABU43720.1| hypothetical protein AR158_C174R [Paramecium bursaria Chlorella
virus AR158]
gi|448930527|gb|AGE54091.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
IL-5-2s1]
gi|448934707|gb|AGE58259.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
NY-2B]
gi|448935079|gb|AGE58630.1| thylakoid formation protein [Paramecium bursaria Chlorella virus
NYs1]
Length = 248
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 109/198 (55%), Gaps = 11/198 (5%)
Query: 2 ISDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTV 61
I+ PPTV++TK F YK+ + +YNT +Q ++V+QH+ RY + Y Y V ALG VT
Sbjct: 45 ITTSPPTVSDTKRIFYANYKKLLLPMYNTPIQNMLVKQHIHRYNKNYTYSDVSALGIVTA 104
Query: 62 YDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE 121
D ++ +P +E + +I A+I +L EDPE Y + + L+ +A+ P+K G
Sbjct: 105 LDSVLNTFPDDE-KTSIKNAFIISLNEDPEMYYSNIETLKPYAKSSHLG-----PNKHGN 158
Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDR 180
++ L DIA + YS F A+G+F+LL++ +++L + V +
Sbjct: 159 TLQKSLYDIAIN----DKYVYSSFAAIGIFKLLQMNKNYTGNSVKQLSESIGFKGEIVHK 214
Query: 181 DLDVYRNLLSKLLQAKEL 198
D+ ++ +LL + +++L
Sbjct: 215 DIAMFFSLLKYIESSQKL 232
>gi|352093979|ref|ZP_08955150.1| Protein thf1 [Synechococcus sp. WH 8016]
gi|351680319|gb|EHA63451.1| Protein thf1 [Synechococcus sp. WH 8016]
Length = 247
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 67/221 (30%), Positives = 114/221 (51%), Gaps = 11/221 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T+A++K F + IPS+Y EL+V+ HL+ +++ ++ D +FA+G V+ +
Sbjct: 6 TIADSKRAFHTAFPYVIPSLYRRTADELLVELHLLSHQQHFKSDALFAVGLRQVFQAFTQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E + ++ A ++ DPE + A+ G T S + E+ S G G +
Sbjct: 66 GYKPEAHLDELYAAICSSNGFDPEALKQLAEGSTSAVSGHTISEVREWLSNRG--AGAPE 123
Query: 128 DIAERAS--GKGNFSYSRFFAVGLFRLLELANATEP-------TVLEKLCAVLNVNKRSV 178
+A S G +F YSR AVGL LL A EP T+ ++ L ++K +
Sbjct: 124 PLASGISSVGGDSFHYSRLMAVGLLSLLSSAQGGEPSNPDELKTLAHEIGEQLGLSKPRL 183
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKA 219
D+DL +Y + L K+ QA EL++E + E++KR+ + +A
Sbjct: 184 DKDLTLYTSNLEKMAQAVELIEETLAAERRKRDRQAADSQA 224
>gi|113955551|ref|YP_730625.1| Thf1-like protein [Synechococcus sp. CC9311]
gi|113882902|gb|ABI47860.1| Uncharacterized protein [Synechococcus sp. CC9311]
Length = 252
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 111/213 (52%), Gaps = 11/213 (5%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T+A++K F K + IPS+Y EL+V+ HL+ +++ ++ D +FA+G V+ +
Sbjct: 11 TIADSKRAFHKSFPYVIPSLYRRTADELLVELHLLSHQQHFKSDALFAVGLRQVFMAFTQ 70
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E + ++ A T +PE + A+ G T + + E+ S G G +
Sbjct: 71 GYKPETHLDELYAAICTCNGFEPEALKQLAEGSTSAVSGHTINEVREWLSNRG--AGAPE 128
Query: 128 DIAERASGKG--NFSYSRFFAVGLFRLLELANATEPT-------VLEKLCAVLNVNKRSV 178
+A S G +F YSR AVGL LL A EP+ + ++ L ++K +
Sbjct: 129 PLASGISSVGGESFHYSRLMAVGLLSLLSSAQGGEPSNPDELKKLAHEIGEQLGLSKPRL 188
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE 211
D+DL +Y + L K+ QA EL++E + E++KR+
Sbjct: 189 DKDLSLYTSNLEKMAQAVELIEETLAAERRKRD 221
>gi|375332109|gb|AFA52594.1| hypothetical protein [Vaucheria litorea]
Length = 249
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 118/223 (52%), Gaps = 7/223 (3%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+ET +F Y++PI Y T++ +++ HL + YD +F GF +++ +LM+
Sbjct: 26 TVSETIKSFCIQYQKPILPQYRTMINDVLQSTHLNVVNGCFIYDAMFGYGFYSLFYKLMK 85
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
YP + + I+ A +T+L +PE+ + D + + + T + L S +GE + LL
Sbjct: 86 AYPGTGEADLIYAAMVTSLDMEPEKLKEDHETISKLIENMTRADLEN--SFKGENQNLLS 143
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANA--TEPTVLEKLCAVLNVNKRSVDRDLDVY 185
+I+ + Y++ + +GL ++ TE + E L ++ + +DL Y
Sbjct: 144 EISSNIKADEFYLYTKTWGIGLIEAMDKVGIPLTEENI-ESLANMIGFSPIKARQDLVQY 202
Query: 186 RNLLSKLLQAKELLKEYVDREKKKREERTE--PQKANEAIKKC 226
+++L K+ QA++L KE REKKK ER E ++A EA KK
Sbjct: 203 KDVLDKVAQAEQLFKEIEIREKKKMAERLEEKAKRALEAAKKA 245
>gi|88808604|ref|ZP_01124114.1| hypothetical protein WH7805_02902 [Synechococcus sp. WH 7805]
gi|88787592|gb|EAR18749.1| hypothetical protein WH7805_02902 [Synechococcus sp. WH 7805]
Length = 234
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 109/216 (50%), Gaps = 11/216 (5%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T+A++K F + IPS+Y EL+V+ HL+ ++ ++ + +FA+G V+ +
Sbjct: 13 TIADSKRAFHAAFPYVIPSLYRRTADELLVELHLLSHQTQFKSNALFAVGLRQVFTAFTK 72
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEF--PSKEGEVEGL 125
GY + +F A + + +Q A+ E+ G + + + EG E L
Sbjct: 73 GYRPADHLTELFDALCSCNGFNAQQLNSVAEGSEKAVAGHSMEEVQAWLQSKGEGAPEPL 132
Query: 126 LKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAV-------LNVNKRSV 178
+A+ A + F YSR AVGLF LL A E E LC + +++ +
Sbjct: 133 ATGLADIAGEQ--FHYSRLMAVGLFSLLSSAQGVESQDPEDLCKTAHSIGEQIGLSRPRL 190
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERT 214
++DL +YRN L K+ QA EL++E + E++KRE ++
Sbjct: 191 EKDLSLYRNNLEKMAQAVELMEETLASERRKRERQS 226
>gi|284929212|ref|YP_003421734.1| photosystem II biogenesis protein Psp29 [cyanobacterium UCYN-A]
gi|284809656|gb|ADB95353.1| photosystem II biogenesis protein Psp29 [cyanobacterium UCYN-A]
Length = 237
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 68/232 (29%), Positives = 118/232 (50%), Gaps = 14/232 (6%)
Query: 4 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
D TV+ETK F + +PI SIY ++EL+V+ HL+ YQY P++ALG VT+++
Sbjct: 2 DNIRTVSETKREFYNFFTKPISSIYRRFIEELLVEMHLLSVNADYQYSPIYALGVVTLFE 61
Query: 64 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-- 121
+ M Y ++ ++ IF A + D +QYR ++ + A + S+ E +K +
Sbjct: 62 KFMYRYQPDDHQDLIFDALCKSTGGDTKQYRQESNTILNEAETLSISNFKEDFTKSAQEK 121
Query: 122 -VEGLLKDIAERASGKGNFSYSRFFAVGLFRLLE-----LANATEP--TVLEKLCAVLNV 173
+ LL + F YSR A+GL+ LLE L + E +E++ L +
Sbjct: 122 VNDKLLWKSYYSIAQNPKFKYSRLLAIGLYSLLEKISSDLVESKEEYNKAIEQIANDLGL 181
Query: 174 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKR----EERTEPQKANE 221
+ + +D+++Y + L K+ Q +++ ++ +KKR EE T NE
Sbjct: 182 SSERIQKDIELYCSNLEKMQQLLIAIEDSLEFGRKKRISQQEEDTLKTNDNE 233
>gi|318041533|ref|ZP_07973489.1| Thf1-like protein [Synechococcus sp. CB0101]
Length = 224
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 107/208 (51%), Gaps = 11/208 (5%)
Query: 5 VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 64
V TVA++K F + I IY ++ EL+V+ HL+ +++ ++ D +FA+G V+D
Sbjct: 3 VSLTVADSKRAFHSAFSYVIAPIYRRLVDELLVELHLLSHQKGFRADGLFAVGLTQVFDS 62
Query: 65 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 124
GY E RE +FQA +A D R A++ + + + + S +G +G
Sbjct: 63 FSTGYRPEAQREPLFQALCSANGFDGAALRAQAEQARQQVGHHSLEEVKGWLSNQG--QG 120
Query: 125 LLKDIAERASG--KGNFSYSRFFAVGLFRLLEL---ANATEPTVL----EKLCAVLNVNK 175
+ IA G + +F YSR AVGL LL+ A+A +P L ++ + + K
Sbjct: 121 APELIASLLQGVQRDDFHYSRLVAVGLLSLLQSAQGADALDPQALRSAAHEIGESMGLIK 180
Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYV 203
VD+DL +Y + K+ QA EL++E V
Sbjct: 181 DRVDKDLSLYAGNIEKMSQAVELMEETV 208
>gi|317970011|ref|ZP_07971401.1| Thf1-like protein [Synechococcus sp. CB0205]
Length = 228
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 100/208 (48%), Gaps = 17/208 (8%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TVA++K F + + I +Y ++ EL+V+ HL+ +++ + D +FA+G V+D
Sbjct: 8 TVADSKRAFHQAFPYVIAPLYRRLVDELLVELHLLSHQKGFHADGLFAVGLTQVFDSFSN 67
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-----V 122
GY E RE +FQA +A D +R A + + + S GE +
Sbjct: 68 GYKPEAQREPLFQALCSANGFDGGAFRQMASDAATQVGHHSLDEVKGWLSNRGEGAPAPI 127
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATE---PTVL----EKLCAVLNVNK 175
GLL + +F YSR AVGL LL+ A E P L ++ + + K
Sbjct: 128 AGLLHGVQRE-----DFHYSRLVAVGLLSLLQRAQGAEAMDPQALRSAAHEIGEAMGLIK 182
Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKEYV 203
VD+DL +Y + K+ QA EL++E V
Sbjct: 183 ARVDKDLSLYAGNIEKMTQAVELMEETV 210
>gi|449015870|dbj|BAM79272.1| photosystem II biogenesis protein Psb29 [Cyanidioschyzon merolae
strain 10D]
Length = 327
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 112/226 (49%), Gaps = 8/226 (3%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+ET F + KRP+ Y + E++ HL ++YD +FALGFV+VY
Sbjct: 88 TVSETVTRFYRNLKRPVVFYYQQAVDEILTTAHLALVCAMFRYDVIFALGFVSVYRDFFR 147
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKE-----GEV 122
YP ++RE++F+ AL D Q +A + +G+T + L+E ++ E
Sbjct: 148 SYPRPDERESLFRCICDALDLDVGQVTKEADDALAYVQGKTEAELIEEIERDTGEDSAEA 207
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA-TEPTVLEKLCAVLNVNKRSVDRD 181
+ ++ + G + Y+R F +GL +++ ++K +L ++ +D+D
Sbjct: 208 QPVIAALRACRRADGEYYYTRLFGIGLMKIMSSCGVEINLESVKKWANMLKISYARLDQD 267
Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKK--REERTEPQKANEAIKK 225
+ Y+ + KL QA+ + KE RE+ + E + Q+A E ++K
Sbjct: 268 IGTYQMSMEKLTQAEVMFKELEARERARIADELARKAQEAEEELRK 313
>gi|116070497|ref|ZP_01467766.1| hypothetical protein BL107_12665 [Synechococcus sp. BL107]
gi|116065902|gb|EAU71659.1| hypothetical protein BL107_12665 [Synechococcus sp. BL107]
Length = 215
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 116/218 (53%), Gaps = 22/218 (10%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T+A++K F + + I +Y + EL+V+ HL+ ++ +++ P+F++G TV++ +
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSSFKTTPLFSVGLCTVFETFSQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E+ +F A ++ + +R ++++ + A+ ++ ++G+
Sbjct: 66 GYRPEDHITGLFDALCSSNGYNATTFRKESKQCIDAAKSES-------------IDGMES 112
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELA--NATEP--TVLEKLC----AVLNVNKRSVD 179
+A++ G+G+ YSR A+G+FRL E A +A +P T L K C LN V+
Sbjct: 113 HLAKQKLGEGSH-YSRLMAIGVFRLFEEAKGDAEQPDETELRKRCKEVSTTLNFPAERVE 171
Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 217
+DL ++ ++ A EL++E + E++K+E R Q
Sbjct: 172 KDLSLFAANSERMSAAVELVQETIAAERRKKERRQAEQ 209
>gi|161347491|ref|YP_001224936.2| Thf1-like protein [Synechococcus sp. WH 7803]
Length = 226
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/228 (28%), Positives = 115/228 (50%), Gaps = 26/228 (11%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T+A++K F + IPS+Y EL+V+ HL+ ++ ++ + +FA+G V+ +
Sbjct: 6 TIADSKRAFHAAFPYVIPSLYRRTADELLVELHLLSHQTQFKTNALFAVGLRQVFTAFTK 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-----V 122
GY + +F A + + E+ + A+ E+ G + + + +G+ +
Sbjct: 66 GYRPADHLPQLFDALCSCNGFNAEELKSLAEGSEQAVSGHSVDEVQTWLQAKGDGAPGPL 125
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELA--NATEPTVLEKLCAV-------LNV 173
L DIA F YSR AVGLF LL A ++ +P E+LC + +
Sbjct: 126 ATGLADIAGE-----QFHYSRLMAVGLFSLLSSAQGDSQDP---EELCKTAHTIGEQIGL 177
Query: 174 NKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKRE----ERTEPQ 217
++ +++DL +YRN L K+ QA EL++E + E++KRE E +PQ
Sbjct: 178 SRPRLEKDLSLYRNNLEKMAQAVELMEETLASERRKRERQASENKQPQ 225
>gi|87124410|ref|ZP_01080259.1| hypothetical protein RS9917_12390 [Synechococcus sp. RS9917]
gi|86167982|gb|EAQ69240.1| hypothetical protein RS9917_12390 [Synechococcus sp. RS9917]
Length = 224
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 105/215 (48%), Gaps = 8/215 (3%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T+A++K F + IP +Y EL+V+ HL+ +++ +Q D +FA+G V+
Sbjct: 6 TIADSKRAFHTAFPFVIPPLYRRTADELLVELHLLSHQQQFQVDALFAVGLRQVFRAFTR 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-VEGLL 126
GY + ++F+A ++ + A + E RG + + + G+ L
Sbjct: 66 GYKPGQHLASLFEALCSSTGFHAGELESLADQSEAAVRGHSIEEVRHWLEHGGDGAPAPL 125
Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLLELANA--TEPTVLEKLC----AVLNVNKRSVDR 180
+ +RA G F YSR AVGL LL A +P L KL L + V++
Sbjct: 126 ASVLQRADSSG-FHYSRLMAVGLLSLLSEAQGDQADPEQLRKLAHELSGPLGFAQTRVEK 184
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKREERTE 215
DL +Y + L K+ QA EL++E + E++KRE + +
Sbjct: 185 DLGLYASNLDKMAQAVELMEETLAAERRKRERQQQ 219
>gi|87302741|ref|ZP_01085552.1| hypothetical protein WH5701_13350 [Synechococcus sp. WH 5701]
gi|87282624|gb|EAQ74582.1| hypothetical protein WH5701_13350 [Synechococcus sp. WH 5701]
Length = 257
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 104/206 (50%), Gaps = 17/206 (8%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TVA++K F + I +Y ++ EL+V+ HL+ + + D +FA+G V+D +
Sbjct: 8 TVADSKRAFHAAFPYVIGPLYRRMVDELLVELHLLSRQSGFHSDGLFAVGLTQVFDGFAK 67
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-----V 122
GY ++ E +F A + D +Q R + + + ++ ++ G+ +
Sbjct: 68 GYRPQQQSEPLFAALCASSGFDAQQIRAQHAAAVKAVGEHSLDEVKQWLAQRGQGAPEPI 127
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLEL---ANATEPTVL----EKLCAVLNVNK 175
G+L I +RA +F YSR FAVGL LL+ A A EP L ++ + + K
Sbjct: 128 AGVLAGI-DRA----DFHYSRLFAVGLLSLLQHARGAEAVEPQALRQAAHEIGESMGLMK 182
Query: 176 RSVDRDLDVYRNLLSKLLQAKELLKE 201
VD+DL +Y + L K+ QA EL++E
Sbjct: 183 ERVDKDLTLYASTLEKMAQAVELMEE 208
>gi|78184631|ref|YP_377066.1| Thf1-like protein [Synechococcus sp. CC9902]
gi|97202850|sp|Q3AY05.1|THF1_SYNS9 RecName: Full=Protein thf1
gi|78168925|gb|ABB26022.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 215
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/218 (26%), Positives = 111/218 (50%), Gaps = 22/218 (10%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T+A++K F + + I +Y + EL+V+ HL+ ++ +++ P+FA+G TV+D
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSSFKTTPLFAVGLCTVFDTFSA 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY EE + A ++ D +R ++++ + A+ ++ V+ +
Sbjct: 66 GYRPEEHITGLLDALCSSNGYDANTFRKESKRCIDAAKTES-------------VDAMDS 112
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELA--NATEP--TVLEKLC----AVLNVNKRSVD 179
+A + G+G+ YSR A+G+ RL E A +A +P L K C LN V+
Sbjct: 113 HLAGQKLGEGSH-YSRLMAIGVLRLFEEAKGDADQPDEADLRKRCKELSTALNFPAERVE 171
Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 217
+DL ++ + ++ A EL++E + E++K+E R Q
Sbjct: 172 KDLSLFASNSERMSAAIELVQETIAAERRKKERRQAEQ 209
>gi|416383906|ref|ZP_11684537.1| hypothetical protein CWATWH0003_1368 [Crocosphaera watsonii WH
0003]
gi|357265142|gb|EHJ13943.1| hypothetical protein CWATWH0003_1368 [Crocosphaera watsonii WH
0003]
Length = 209
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 103/200 (51%), Gaps = 17/200 (8%)
Query: 40 HLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQK 99
HL+ + YDP++ALG VT + R M+GY E D+ +IF A A+ E+Y +A+
Sbjct: 2 HLLSVNIDFSYDPIYALGVVTSFQRFMQGYSPESDKPSIFNALCQAVDGSSEKYHQEAEA 61
Query: 100 LEEWARGQTASSLVEFPSKEGEV------EGLLKDIAERASGKGNFSYSRFFAVGLFRLL 153
+ A+G S+V+F K V EG+L + F YSR A+GL+ LL
Sbjct: 62 ILNEAKGL---SIVDFKDKLTHVTDNQVGEGVLWGTFGAIAANPKFKYSRLLAIGLYTLL 118
Query: 154 -----ELANATE--PTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDRE 206
+L E ++++ L + + +DLD+YR+ L K+ Q ++++ ++ +
Sbjct: 119 MEIDSDLLKDEEKRTETIKEVSEALKFSPEKLRKDLDLYRSNLDKMQQLLTVIEDSLEAD 178
Query: 207 KKKREERTEPQKANEAIKKC 226
+KKR TE + + E +++
Sbjct: 179 RKKR-ASTEGKTSAEVVEQT 197
>gi|260436777|ref|ZP_05790747.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. WH 8109]
gi|260414651|gb|EEX07947.1| photosystem II biogenesis protein Psp29 [Synechococcus sp. WH 8109]
Length = 215
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 108/218 (49%), Gaps = 22/218 (10%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T+A++K F + + I +Y + EL+V+ HL+ ++ ++ + +F++G TV+D +
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSRFEANGLFSVGLCTVFDTFTK 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E +A+F A ++ D + R L + A+G+ +L + S EG
Sbjct: 66 GYRPEAQTDALFSALCSSNGFDAAKLRKTNASLVDQAKGKDHETLKSWLSSHSLKEG--- 122
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANA------TEPTVLE--KLCAVLNVNKRSVD 179
YSR AVGL LL+ A A TE V + +L L + V+
Sbjct: 123 -----------SHYSRLMAVGLMSLLKAATADATGSDTETIVKQSKELAEGLGLPTDRVE 171
Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 217
+DL ++ + ++ QA EL++E + EK+K+E R E Q
Sbjct: 172 KDLTLFGSNSERMDQAVELVEETIAAEKRKKERRLEEQ 209
>gi|397644025|gb|EJK76212.1| hypothetical protein THAOC_02035 [Thalassiosira oceanica]
Length = 293
Score = 86.7 bits (213), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 57/209 (27%), Positives = 98/209 (46%), Gaps = 8/209 (3%)
Query: 15 NFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEED 74
F PI ++Y + +L+ HL +Q DPVF+LG VTV D L++ +P ++
Sbjct: 58 TFTDALGTPINALYKGTITDLVGSLHLTVVTARFQRDPVFSLGLVTVLDLLLKNFPEQDT 117
Query: 75 REAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAERAS 134
+ I A I + + +A ++ WA+G+T + + GE + L +A A
Sbjct: 118 AKRIKSAMIESAGMVESEVDAEAAEVATWAQGKTREDIA--SALRGEGDSTLAQVANGAK 175
Query: 135 GKGNFSYSRFFAVGLFRLLELANATEPT-----VLEKLCAV-LNVNKRSVDRDLDVYRNL 188
G + YSRFF +GL +++++ + V+E L + D D+Y
Sbjct: 176 GDEYWMYSRFFGIGLVKMMDIVGIEQDMSVAYDVMEDWVGTCLGKPHYTACADSDLYFKQ 235
Query: 189 LSKLLQAKELLKEYVDREKKKREERTEPQ 217
KL + ++KE REKK+ +R E +
Sbjct: 236 KGKLDMMETMMKEIEIREKKRMADRLEAK 264
>gi|147848088|emb|CAK23639.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
Length = 206
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 107/212 (50%), Gaps = 26/212 (12%)
Query: 24 IPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYI 83
IPS+Y EL+V+ HL+ ++ ++ + +FA+G V+ +GY + +F A
Sbjct: 2 IPSLYRRTADELLVELHLLSHQTQFKTNALFAVGLRQVFTAFTKGYRPADHLPQLFDALC 61
Query: 84 TALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGE-----VEGLLKDIAERASGKGN 138
+ + E+ + A+ E+ G + + + +G+ + L DIA
Sbjct: 62 SCNGFNAEELKSLAEGSEQAVSGHSVDEVQTWLQAKGDGAPGPLATGLADIAGE-----Q 116
Query: 139 FSYSRFFAVGLFRLLELA--NATEPTVLEKLCAV-------LNVNKRSVDRDLDVYRNLL 189
F YSR AVGLF LL A ++ +P E+LC + +++ +++DL +YRN L
Sbjct: 117 FHYSRLMAVGLFSLLSSAQGDSQDP---EELCKTAHTIGEQIGLSRPRLEKDLSLYRNNL 173
Query: 190 SKLLQAKELLKEYVDREKKKRE----ERTEPQ 217
K+ QA EL++E + E++KRE E +PQ
Sbjct: 174 EKMAQAVELMEETLASERRKRERQASENKQPQ 205
>gi|78212971|ref|YP_381750.1| Thf1-like protein [Synechococcus sp. CC9605]
gi|97202855|sp|Q3AJN7.1|THF1_SYNSC RecName: Full=Protein thf1
gi|78197430|gb|ABB35195.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 215
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 110/218 (50%), Gaps = 22/218 (10%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T+A++K F + + I +Y + EL+V+ HL+ ++ ++ + +F++G TV+D ++
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRLADELLVELHLLSHQSRFEANELFSVGLCTVFDTFIK 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E +A+F+A ++ D + R L E A+G+ SL ++ S EG
Sbjct: 66 GYRPEAQTDALFRALCSSNGFDAAKLRKTYASLVEQAKGKDPESLKDWLSSHALKEG--- 122
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLE------LANATEPTVLE--KLCAVLNVNKRSVD 179
YSR AVGL LL+ + TE V + +L L + V+
Sbjct: 123 -----------SHYSRLMAVGLMSLLKAAAADATDSDTEAIVKQSKELAEGLGLPTDRVE 171
Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQ 217
+DL ++ + ++ QA EL++E + EK+K+E R E Q
Sbjct: 172 KDLTLFGSNSERMDQAVELVEETIAAEKRKKERRLEEQ 209
>gi|428183151|gb|EKX52010.1| hypothetical protein GUITHDRAFT_150871, partial [Guillardia theta
CCMP2712]
Length = 309
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 56/223 (25%), Positives = 104/223 (46%), Gaps = 17/223 (7%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
+D+ P A + F KL+ RPIP ++ E++ HL ++YD ++A G + +
Sbjct: 73 ADIEPCGAAVE-RFYKLFARPIPFVFRAPTNEILYLSHLDLVNAMFRYDVIWAAGLYSTF 131
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D E+ R +FQA + LK D + + DA + +WA+G+T + +V + +GE
Sbjct: 132 DLFFSAL-DEDLRANLFQALMGGLKLDQSKIKSDADAVLQWAQGKTEADVVS--AIKGED 188
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTV------------LEKLCAV 170
+ + +F Y+R F GL +++++ EP A+
Sbjct: 189 SSPVGQVLASLGKNEDFLYTRNFGAGLIKIMQVVG-VEPNAENAKRWAEVLGFTSNTSAL 247
Query: 171 LNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
++ + D+ ++ + + K+ QA +L E REKKK E+
Sbjct: 248 SGLSASKFETDVGLFLSSVDKMQQAMQLFAEVEAREKKKIAEK 290
>gi|299469582|emb|CBN76436.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 226
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 61/109 (55%), Gaps = 2/109 (1%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+ET +F YK+ + + + T++ E + HL Y ++YDP+F +GF T + R M
Sbjct: 107 TVSETVADFYIYYKKVVLTQFRTIVTEYLQSTHLTVYDARFKYDPLFGVGFYTSFMRFMR 166
Query: 68 GYPSEEDREAIFQAYITALKE--DPEQYRIDAQKLEEWARGQTASSLVE 114
YP E IF A + A+ DP+Q R D L+EWA G+T +VE
Sbjct: 167 AYPVPGQAELIFDAVVKAIGNGLDPDQMRKDTTALKEWAEGKTEEDVVE 215
>gi|33865836|ref|NP_897395.1| Thf1-like protein [Synechococcus sp. WH 8102]
gi|81574513|sp|Q7U6N6.1|THF1_SYNPX RecName: Full=Protein thf1
gi|33633006|emb|CAE07817.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 212
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 104/211 (49%), Gaps = 19/211 (9%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T+A++K F + + I +Y + EL+V+ HL+ ++ T+Q + +FA+G TV++R +
Sbjct: 6 TIADSKRAFHQAFPHVIAPLYRRIADELLVELHLLSHQATFQANSLFAVGLKTVFERFTQ 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E A+ A ++ D EQ + AQ + A G + + + ++ +G
Sbjct: 66 GYRPMEHPAALLSALCSSNGFDDEQLKQAAQHCLQDAEGHSDDAFQSWLKEQSLSDGA-- 123
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL-----EKLCAVLNVNKRSVDRDL 182
YSR AVGL LLE ++ KL L + V++DL
Sbjct: 124 ------------HYSRLMAVGLLALLEASSDESDASSLRQRAVKLSVDLGLPAERVEKDL 171
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREER 213
V+ + ++ QA EL++E + +++K+E+R
Sbjct: 172 TVFSSNSERMEQAVELMQETLAADRRKKEKR 202
>gi|159903384|ref|YP_001550728.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9211]
gi|254784145|sp|A9BAB2.1|THF1_PROM4 RecName: Full=Protein thf1
gi|159888560|gb|ABX08774.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9211]
Length = 221
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 115/221 (52%), Gaps = 18/221 (8%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+E+K F K + +P++Y ++ ELIV+ +L++ + + D VFA+G +++ +
Sbjct: 6 TVSESKAIFHKEFPFVVPAVYRRLVDELIVELNLLKNQERFVADGVFAIGLTSIFLDFTK 65
Query: 68 GYPSEEDREAIFQAYITAL---KEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 124
GY E + + +A + EQ ++A+KL SL+ +++ E E
Sbjct: 66 GYKPENQKGILLEAICKCTGFSASNLEQIALEAKKLANGLNTNEIKSLITDNNRD-EKES 124
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRS 177
K I + N YSR A+G+++L+++ + ATE + L+ L K
Sbjct: 125 TYKLINK------NNHYSRIIAIGIYKLVDMQSNGFNKEEATENSYLD-LVNNFGYTKER 177
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQK 218
V++D+++Y++ L K+ +A EL++ + EK++ +ER K
Sbjct: 178 VEKDVNLYKSSLDKIEKALELIEMNIKDEKRRNKERVSRTK 218
>gi|219123541|ref|XP_002182081.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406682|gb|EEC46621.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 311
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/225 (24%), Positives = 104/225 (46%), Gaps = 9/225 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV E +F + +Y ++ +++ HL+ +Q D +++LG +T D L++
Sbjct: 67 TVGEAFADFSSELGVTVNPLYKNMVTDIVGTTHLVIVNARFQRDAIWSLGILTALDLLLK 126
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
YP E I A ++ D ++ R +A+ + +WA G++ + + + EG+ +
Sbjct: 127 NYPEPEVGAKIVSALFKSVGLDEDEIRNEARTISDWAVGKSKADIETALTGEGDSP--VA 184
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANA------TEPTVLEKLCAVLNVNKRSVDRD 181
IA + YSR+F +GL +++E P + + L + + D
Sbjct: 185 AIANSIKPNDYWMYSRYFGIGLIKIMESTGVEMDKDEVYPVMESWMQEKLGRSSLTACAD 244
Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKC 226
D+Y + KL + ++KE REKK+ ER E KA A++
Sbjct: 245 SDLYFKIKDKLDMMETMMKEIEIREKKRMAERLE-DKAEAALRAA 288
>gi|427701945|ref|YP_007045167.1| photosystem II biogenesis protein Psp29 [Cyanobium gracile PCC
6307]
gi|427345113|gb|AFY27826.1| photosystem II biogenesis protein Psp29 [Cyanobium gracile PCC
6307]
Length = 231
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 102/205 (49%), Gaps = 11/205 (5%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TVA++K F + I +Y ++ EL+V+ HL+ ++ +Q D +FA+G + V+D
Sbjct: 6 TVADSKRAFHGAFPHVISPLYRRMVDELLVELHLLSRQKGFQIDALFAVGLIQVFDGFAR 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E + +FQA + D R Q+ + + + ++ +G G
Sbjct: 66 GYRPEAQKGPLFQALCASSGFDGPDLRRQCQEALAAMGRHSQAEVRQWIESQG--AGAPA 123
Query: 128 DIAERASG--KGNFSYSRFFAVGLFRLLELA---NATEPTVLEKLCA----VLNVNKRSV 178
+A +G + +F YSR AVGL LLE A +A EP L +L + + + +
Sbjct: 124 PVATALAGIRRPDFHYSRLMAVGLLALLEQALADDAMEPQALRQLAHEIGESMGLLRDRL 183
Query: 179 DRDLDVYRNLLSKLLQAKELLKEYV 203
D+DL +Y + L K+ A EL++E V
Sbjct: 184 DKDLALYASNLEKMSMAVELMEETV 208
>gi|223995057|ref|XP_002287212.1| hypothetical protein THAPSDRAFT_261275 [Thalassiosira pseudonana
CCMP1335]
gi|220976328|gb|EED94655.1| hypothetical protein THAPSDRAFT_261275 [Thalassiosira pseudonana
CCMP1335]
Length = 212
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 99/212 (46%), Gaps = 8/212 (3%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV E F P+ ++Y + +L+ HL+ +Q D VF+LG V+ D +++
Sbjct: 2 TVGEAFTQFTDKLGTPVNALYKGMCTDLVGSLHLVMVNARFQRDAVFSLGLVSALDLVLK 61
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
YP E I A + ++ D +A LE WA+G+T + EG+ + L
Sbjct: 62 NYPEAETGARIKSAMLESVGLDEAVVNAEAAALEAWAQGKTKEDIASALKGEGDSQ--LA 119
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELAN-----ATEPTVLEK-LCAVLNVNKRSVDRD 181
IA+ A G + YSRFF VGL R++E+ + V+E + + + D
Sbjct: 120 AIAKAAKGDQWWMYSRFFGVGLVRIMEIVGVEMDMSVAYDVMENWMGKCMEKPYYTACSD 179
Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
D+Y KL + ++KE REKK+ +R
Sbjct: 180 SDLYFKTKGKLDMMETMMKEIEIREKKRMADR 211
>gi|72382131|ref|YP_291486.1| Thf1-like protein [Prochlorococcus marinus str. NATL2A]
gi|97202784|sp|Q46L45.1|THF1_PROMT RecName: Full=Protein thf1
gi|72001981|gb|AAZ57783.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL2A]
Length = 199
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 95/202 (47%), Gaps = 19/202 (9%)
Query: 5 VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 64
V T++++K +F K + IP+IY + EL+V+ HL+ +++ ++ D +F+ G V+ +
Sbjct: 3 VRATISDSKSDFHKEFPYVIPAIYRKLADELLVELHLLSHQKNFKKDSIFSTGLKEVFSK 62
Query: 65 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 124
GY E +F A +P + +++L A+ T L F SK
Sbjct: 63 FTSGYKPSEHATKLFDAICNCNGFNPTEINNSSEQLVSNAKSFTKEDLNSFLSKTNN--- 119
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS------- 177
KG YSR A+G+++L+ + E L +N +S
Sbjct: 120 ---------DNKGYDYYSRINAIGIYKLVSEMPLFKEVKEEDLNKEINDISKSLGYQYSR 170
Query: 178 VDRDLDVYRNLLSKLLQAKELL 199
V++D+ +Y++ + K+ QA E++
Sbjct: 171 VEKDISMYKSNIEKMKQALEII 192
>gi|124025670|ref|YP_001014786.1| Thf1-like protein [Prochlorococcus marinus str. NATL1A]
gi|166987530|sp|A2C211.1|THF1_PROM1 RecName: Full=Protein thf1
gi|123960738|gb|ABM75521.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL1A]
Length = 199
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 95/202 (47%), Gaps = 19/202 (9%)
Query: 5 VPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDR 64
V T++++K +F K + IP+IY + EL+V+ HL+ +++ ++ D +F+ G V+ +
Sbjct: 3 VRATISDSKSDFHKEFPYVIPAIYRKLADELLVELHLLSHQKNFKKDSIFSTGLKEVFCK 62
Query: 65 LMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEG 124
GY E +F A +P + +++L A+ T L F SK
Sbjct: 63 FTSGYKPSEHVTKLFDAICNCNGFNPTEINNSSEQLVSNAKSFTKEDLNSFLSKTNN--- 119
Query: 125 LLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRS------- 177
KG YSR A+G+++L+ + E L +N +S
Sbjct: 120 ---------DNKGYDYYSRINAIGIYKLVSEMPLFKEVKEEDLNKEINDISKSLGYQYSR 170
Query: 178 VDRDLDVYRNLLSKLLQAKELL 199
V++D+ +Y++ + K+ QA E++
Sbjct: 171 VEKDISMYKSNIEKMKQALEII 192
>gi|33240369|ref|NP_875311.1| Thf1-like protein [Prochlorococcus marinus subsp. marinus str.
CCMP1375]
gi|81664534|sp|Q7VC23.1|THF1_PROMA RecName: Full=Protein thf1
gi|33237896|gb|AAP99963.1| Uncharacterized protein [Prochlorococcus marinus subsp. marinus
str. CCMP1375]
Length = 214
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/208 (22%), Positives = 102/208 (49%), Gaps = 9/208 (4%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
T++++K F K + IP +Y VL E +V+ +L+ + ++ D +F+ G + ++R
Sbjct: 6 TISDSKGLFHKEFPYVIPPVYRKVLDEYLVELNLLSNQSNFKIDTIFSYGLIISFERFTV 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY + I ++ + D + + + +++ + ++ + E++ +
Sbjct: 66 GYEPDSHISKILESLCNSCNIDIKAIKEYSNNIKKLINEKGIKEIINILT--AEIKKSVG 123
Query: 128 DIA-ERASGKGNFSYSRFFAVGLFRLLELAN-----ATEPTVLEKLCAVLNVNKRSVDRD 181
IA SGK + YSR A+G++ L+ N + ++ + L +K V++D
Sbjct: 124 GIALSNQSGKDKY-YSRLHAIGIYELISNINEDKKEGDDKEIISECVEALGFSKDRVEKD 182
Query: 182 LDVYRNLLSKLLQAKELLKEYVDREKKK 209
++ Y+N + K+ + EL+K V+ K+K
Sbjct: 183 INQYKNSMEKIKEMMELIKLTVEETKRK 210
>gi|254526529|ref|ZP_05138581.1| photosystem II biogenesis protein Psp29 [Prochlorococcus marinus
str. MIT 9202]
gi|221537953|gb|EEE40406.1| photosystem II biogenesis protein Psp29 [Prochlorococcus marinus
str. MIT 9202]
Length = 202
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/211 (22%), Positives = 111/211 (52%), Gaps = 22/211 (10%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K F + + IP +Y + E++V+ +L+ ++ + D +F +G + LM+
Sbjct: 6 TVSDSKKLFHEKFPYVIPGLYKRIADEMLVELNLLNHQNEFTQDFLFCVGLTETFKELMK 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E+ + +F++ ++ + ++ +QK ++ + +T++ +V+
Sbjct: 66 GYQPEKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKDKTSTDIVKL------------ 113
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLL----ELANATEPTVLEKLCAV---LNVNKRSVDR 180
+ E+++ K SR +G++ L+ +L E + + + + LN++ ++
Sbjct: 114 -LIEKSNSK--LYPSRILNLGIYILISNAQDLKKKNESDINKMISDIFEQLNLSANKAEK 170
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKRE 211
D+ +Y++ +SK+ QAKEL++E + KKK E
Sbjct: 171 DIGIYKSSISKMEQAKELIEELRIKNKKKDE 201
>gi|157413170|ref|YP_001484036.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9215]
gi|157387745|gb|ABV50450.1| Conserved hypothetical protein [Prochlorococcus marinus str. MIT
9215]
Length = 217
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 47/211 (22%), Positives = 111/211 (52%), Gaps = 22/211 (10%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K F + + IP +Y + E++V+ +L+ ++ + D +F +G + LM+
Sbjct: 21 TVSDSKKLFHEKFPYVIPGLYKRIADEMLVELNLLNHQNEFTQDFLFCVGLTETFKELMK 80
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E+ + +F++ ++ + ++ +QK ++ + +T++ +V+
Sbjct: 81 GYQPEKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKDKTSTDIVKL------------ 128
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELA------NATEPT-VLEKLCAVLNVNKRSVDR 180
+ E+++ K SR +G++ L+ A N ++ ++ + LN++ ++
Sbjct: 129 -LIEKSNSK--LYPSRILNLGIYILISNAQDLKKNNESDTNKMISDIFEKLNLSANKAEK 185
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKKRE 211
D+ +Y++ +SK+ QAKEL++E + KKK E
Sbjct: 186 DIGIYKSSISKMEQAKELIEELRIKNKKKDE 216
>gi|123968337|ref|YP_001009195.1| Thf1-like protein [Prochlorococcus marinus str. AS9601]
gi|123198447|gb|ABM70088.1| conserved hypothetical protein [Prochlorococcus marinus str.
AS9601]
Length = 218
Score = 67.8 bits (164), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 115/217 (52%), Gaps = 30/217 (13%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K F + + IP +Y ++ E++V+ +L+ ++ + D +F +G + LM+
Sbjct: 21 TVSDSKKLFHEKFPYVIPGLYKRIVDEMLVELNLLNHQNEFTLDYLFCVGLTETFKELMK 80
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E+ + +F++ ++ +A+++ E ++ ++ LV+ SK+ +LK
Sbjct: 81 GYQPEKHLDLLFESLCSST-------NFEAKEINEISK-KSQKELVDKTSKD-----ILK 127
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELA-----------NATEPTVLEKLCAVLNVNKR 176
+ E+ + K SR +G++ L+ + N + EKL L+ NK
Sbjct: 128 LLVEKNNSK--LYPSRILNLGIYTLISNSQDFKEKNESDKNKMTSDIFEKLS--LSANK- 182
Query: 177 SVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
++D+ +Y++ +SK+ QAKEL++E ++K K +++
Sbjct: 183 -AEKDIGIYKSSISKMEQAKELIEELRIKDKNKNQKK 218
>gi|123200442|gb|ABM72050.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9515]
Length = 198
Score = 67.0 bits (162), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 104/212 (49%), Gaps = 28/212 (13%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+E+K F + + IP +Y ++ E++V+ +L+ ++ + D +F +G + L +
Sbjct: 2 TVSESKKLFHEQFPFVIPGLYKRIVDEMLVELNLLNHQNEFIQDELFCVGLTETFKELTK 61
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYR-IDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 126
GY E E +F++ + P + + I + LE++ E+ LL
Sbjct: 62 GYKPESHLELLFESLCKSSNFIPSKIKEISLKTLEQYKDKSLK-----------EISILL 110
Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL---EKLCAV------LNVNKRS 177
K+ N SR +G++ L +ANAT+ L EK A+ LN++
Sbjct: 111 KE-----KSTSNLYSSRILNIGIY--LIIANATDFKGLKDSEKNKAITDNINNLNLSVNK 163
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKK 209
++D+ +Y++ + K+ QAKELL+E + KKK
Sbjct: 164 AEKDIGIYKSSIKKMEQAKELLEEAKIQNKKK 195
>gi|161407964|ref|YP_001011157.2| Thf1-like protein [Prochlorococcus marinus str. MIT 9515]
Length = 217
Score = 66.6 bits (161), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 104/212 (49%), Gaps = 28/212 (13%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+E+K F + + IP +Y ++ E++V+ +L+ ++ + D +F +G + L +
Sbjct: 21 TVSESKKLFHEQFPFVIPGLYKRIVDEMLVELNLLNHQNEFIQDELFCVGLTETFKELTK 80
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYR-IDAQKLEEWARGQTASSLVEFPSKEGEVEGLL 126
GY E E +F++ + P + + I + LE++ E+ LL
Sbjct: 81 GYKPESHLELLFESLCKSSNFIPSKIKEISLKTLEQYKDKSLK-----------EISILL 129
Query: 127 KDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVL---EKLCAV------LNVNKRS 177
K+ N SR +G++ L +ANAT+ L EK A+ LN++
Sbjct: 130 KE-----KSTSNLYSSRILNIGIY--LIIANATDFKGLKDSEKNKAITDNINNLNLSVNK 182
Query: 178 VDRDLDVYRNLLSKLLQAKELLKEYVDREKKK 209
++D+ +Y++ + K+ QAKELL+E + KKK
Sbjct: 183 AEKDIGIYKSSIKKMEQAKELLEEAKIQNKKK 214
>gi|97202782|sp|Q7V1W1.2|THF1_PROMP RecName: Full=Protein thf1
Length = 202
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/203 (23%), Positives = 102/203 (50%), Gaps = 26/203 (12%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K F + + IP +Y ++ E++V+ +L+ ++ + D +F +G + L +
Sbjct: 6 TVSDSKKLFHEQFPYVIPGLYKRIVDEMLVELNLLNHQNEFIQDDLFCVGLTETFKELTK 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY EE +F++ + +P++ + ++K E + ++ E+ LLK
Sbjct: 66 GYKPEEHLRVLFESLCNSSNFEPKKIKEASKKTLEVYKDKSLK----------EISILLK 115
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANATE---------PTVLEKLCAVLNVNKRSV 178
++ N SR +G++ L +ANAT+ ++ + LN++
Sbjct: 116 QKSD-----SNLYSSRILNLGIY--LIIANATDFKDIKDPEKNKIISDIINKLNLSFNKA 168
Query: 179 DRDLDVYRNLLSKLLQAKELLKE 201
++D+ +Y++ + K+ QAKELL+E
Sbjct: 169 EKDIGIYKSSILKMEQAKELLQE 191
>gi|33861298|ref|NP_892859.1| Thf1-like protein [Prochlorococcus marinus subsp. pastoris str.
CCMP1986]
gi|33633875|emb|CAE19200.1| conserved hypothetical protein [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
Length = 217
Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/203 (23%), Positives = 102/203 (50%), Gaps = 26/203 (12%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K F + + IP +Y ++ E++V+ +L+ ++ + D +F +G + L +
Sbjct: 21 TVSDSKKLFHEQFPYVIPGLYKRIVDEMLVELNLLNHQNEFIQDDLFCVGLTETFKELTK 80
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY EE +F++ + +P++ + ++K E + ++ E+ LLK
Sbjct: 81 GYKPEEHLRVLFESLCNSSNFEPKKIKEASKKTLEVYKDKSLK----------EISILLK 130
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELANATE---------PTVLEKLCAVLNVNKRSV 178
++ N SR +G++ L +ANAT+ ++ + LN++
Sbjct: 131 QKSD-----SNLYSSRILNLGIY--LIIANATDFKDIKDPEKNKIISDIINKLNLSFNKA 183
Query: 179 DRDLDVYRNLLSKLLQAKELLKE 201
++D+ +Y++ + K+ QAKELL+E
Sbjct: 184 EKDIGIYKSSILKMEQAKELLQE 206
>gi|97202762|sp|Q31BD6.2|THF1_PROM9 RecName: Full=Protein thf1
Length = 201
Score = 63.5 bits (153), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 47/209 (22%), Positives = 112/209 (53%), Gaps = 22/209 (10%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K F + + IP +Y ++ E++V+ +L+ ++ ++ D +F +G + L +
Sbjct: 6 TVSDSKKLFHEEFPYVIPGLYKRIVDEILVELNLLNHQNEFKQDYLFCIGLTETFKELTK 65
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E+ + +F++ + +A++++E ++ S EF K + +LK
Sbjct: 66 GYKPEKHLDLLFESLCIST-------NFEAKEIKEISK----ISQKEFSDKSSK--DILK 112
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLL-------ELANATEPTVLEKLCAVLNVNKRSVDR 180
+ E+++ K SR +G++ L+ E + + ++ + L++++ ++
Sbjct: 113 LLKEKSNSK--LYPSRILNLGIYILISNSQDFKENNDIEKNKMISDIFEKLSLSRNKAEK 170
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKK 209
D+ +Y++ +SK+ QAKEL++E ++KKK
Sbjct: 171 DIGIYKSSISKMEQAKELIQEQRIKDKKK 199
>gi|194476659|ref|YP_002048838.1| hypothetical protein PCC_0178 [Paulinella chromatophora]
gi|171191666|gb|ACB42628.1| hypothetical protein PCC_0178 [Paulinella chromatophora]
Length = 213
Score = 63.5 bits (153), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 53/207 (25%), Positives = 86/207 (41%), Gaps = 36/207 (17%)
Query: 7 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
PTVA+TK F K + I + TVL EL+V+ L+ + DP+FA+G + + L
Sbjct: 8 PTVADTKRAFYKGFPYVIAPSHRTVLNELLVELFLLSPQTDIGSDPLFAVGLIQFFGVLT 67
Query: 67 EGYPSEEDREAIFQAYITALKEDP--------------EQYRIDAQKLEEWARGQTASSL 112
+ Y + R +F+A ++ D QY I ++L W+ +S
Sbjct: 68 KHYQPQNHRMLLFEALCNSIGFDSFNLRQIRKESLSELSQYNI--EELHSWSLTGADNSE 125
Query: 113 VEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT-------VLE 165
+ F + K F YSR A+GL L++ A E +
Sbjct: 126 ILFTKTFIPI-------------KRRFHYSRLMAIGLLCLIKRARGVETLEAKELYYLTH 172
Query: 166 KLCAVLNVNKRSVDRDLDVYRNLLSKL 192
L + + +DRDL VY + + K+
Sbjct: 173 NLAEKMGFIRERIDRDLSVYIDTIEKM 199
>gi|78779133|ref|YP_397245.1| Thf1-like protein [Prochlorococcus marinus str. MIT 9312]
gi|78712632|gb|ABB49809.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9312]
Length = 216
Score = 63.2 bits (152), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 47/209 (22%), Positives = 112/209 (53%), Gaps = 22/209 (10%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K F + + IP +Y ++ E++V+ +L+ ++ ++ D +F +G + L +
Sbjct: 21 TVSDSKKLFHEEFPYVIPGLYKRIVDEILVELNLLNHQNEFKQDYLFCIGLTETFKELTK 80
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY E+ + +F++ + +A++++E ++ S EF K + +LK
Sbjct: 81 GYKPEKHLDLLFESLCIST-------NFEAKEIKEISK----ISQKEFSDKSSK--DILK 127
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLL-------ELANATEPTVLEKLCAVLNVNKRSVDR 180
+ E+++ K SR +G++ L+ E + + ++ + L++++ ++
Sbjct: 128 LLKEKSNSK--LYPSRILNLGIYILISNSQDFKENNDIEKNKMISDIFEKLSLSRNKAEK 185
Query: 181 DLDVYRNLLSKLLQAKELLKEYVDREKKK 209
D+ +Y++ +SK+ QAKEL++E ++KKK
Sbjct: 186 DIGIYKSSISKMEQAKELIQEQRIKDKKK 214
>gi|323450067|gb|EGB05951.1| hypothetical protein AURANDRAFT_66018 [Aureococcus anophagefferens]
Length = 1032
Score = 62.8 bits (151), Expect = 1e-07, Method: Composition-based stats.
Identities = 46/164 (28%), Positives = 73/164 (44%), Gaps = 5/164 (3%)
Query: 48 YQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQ 107
+ YD +F GFVT+ D +M YP D E I A I AL DP R D + + EW G+
Sbjct: 69 FVYDELFGFGFVTLMDMIMSPYPVAGDGEKITDALIAALDMDPATLRGDHKAVTEWLAGK 128
Query: 108 T-ASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANAT-EPTVLE 165
T A L S +G + A G+ F ++R VGL +++ + L
Sbjct: 129 TEADVLAAVASNDGSK---VASAAATIKGQEEFHHTRPSNVGLVAVMDAVGCKPDDESLA 185
Query: 166 KLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKK 209
+ + +V R+ + + K+ A +++K EKK+
Sbjct: 186 RWTEAFGMRAPAVQRNAGLLKEYQEKVANAMQMIKSAEIMEKKR 229
>gi|126543182|gb|ABO17424.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9301]
Length = 198
Score = 60.1 bits (144), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 40/201 (19%), Positives = 103/201 (51%), Gaps = 22/201 (10%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K F + + IP +Y ++ E++V+ +L+ ++ + + +F +G + LM+
Sbjct: 2 TVSDSKRLFHEKFPYVIPGLYKRIVDEILVELNLLNHQNEFTQEYLFCIGLTETFKELMK 61
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY + + +F++ ++ + ++ +QK ++ + +T++ +LK
Sbjct: 62 GYQPNKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKNKTSN-------------DILK 108
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRSVDR 180
+ E+++ K SR + ++ L+ A + ++ + LN++ ++
Sbjct: 109 LLIEKSNSK--LYPSRILNLAIYILISSAQDLKEKEESGRNKIISDIFEKLNLSANKAEK 166
Query: 181 DLDVYRNLLSKLLQAKELLKE 201
D+ +Y++ +SK+ QAKEL++E
Sbjct: 167 DIGIYKSSISKMEQAKELIEE 187
>gi|161407965|ref|YP_001091025.2| Thf1-like protein [Prochlorococcus marinus str. MIT 9301]
Length = 217
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/201 (19%), Positives = 103/201 (51%), Gaps = 22/201 (10%)
Query: 8 TVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLME 67
TV+++K F + + IP +Y ++ E++V+ +L+ ++ + + +F +G + LM+
Sbjct: 21 TVSDSKRLFHEKFPYVIPGLYKRIVDEILVELNLLNHQNEFTQEYLFCIGLTETFKELMK 80
Query: 68 GYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLK 127
GY + + +F++ ++ + ++ +QK ++ + +T++ +LK
Sbjct: 81 GYQPNKHLDLLFESLCSSTNFEAKEINEISQKSQKEFKNKTSN-------------DILK 127
Query: 128 DIAERASGKGNFSYSRFFAVGLFRLLELAN-------ATEPTVLEKLCAVLNVNKRSVDR 180
+ E+++ K SR + ++ L+ A + ++ + LN++ ++
Sbjct: 128 LLIEKSNSK--LYPSRILNLAIYILISSAQDLKEKEESGRNKIISDIFEKLNLSANKAEK 185
Query: 181 DLDVYRNLLSKLLQAKELLKE 201
D+ +Y++ +SK+ QAKEL++E
Sbjct: 186 DIGIYKSSISKMEQAKELIEE 206
>gi|256810247|ref|YP_003127616.1| CRISPR-associated protein, Csx11 family [Methanocaldococcus fervens
AG86]
gi|256793447|gb|ACV24116.1| CRISPR-associated protein, Csx11 family [Methanocaldococcus fervens
AG86]
Length = 1056
Score = 41.6 bits (96), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 50/101 (49%), Gaps = 3/101 (2%)
Query: 100 LEEWARGQTASSLVEF-PSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANA 158
LE+W + + E PSK ++ ++ E K + S S+F +LEL
Sbjct: 942 LEDWKKFIKFKEIFENKPSKLQKLVNIIYKCLEDWDNKYDDSISQFLDTSFINVLELNKK 1001
Query: 159 TEPTVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELL 199
+ V+EKLC + +++ D+DL+ +RN L + K++L
Sbjct: 1002 SNKEVIEKLCVIFDISLE--DKDLEKFRNELINKIDRKKML 1040
>gi|359483284|ref|XP_003632934.1| PREDICTED: mixed-amyrin synthase-like [Vitis vinifera]
Length = 170
Score = 39.7 bits (91), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 6/76 (7%)
Query: 130 AERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLDVYRNLL 189
AE + + NF +RF AV L L L EP +E+L +NV ++ R +V R
Sbjct: 40 AEVEAARENFWKNRFLAVLLSSKLSLETVGEPLDMEQLFDAVNVMILNLKRQFEVLR--- 96
Query: 190 SKLLQAKELLKEYVDR 205
++ E +KEYVDR
Sbjct: 97 ---MKDNESIKEYVDR 109
>gi|195028406|ref|XP_001987067.1| GH21711 [Drosophila grimshawi]
gi|193903067|gb|EDW01934.1| GH21711 [Drosophila grimshawi]
Length = 1053
Score = 39.3 bits (90), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 43/89 (48%), Gaps = 15/89 (16%)
Query: 7 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
P +AE N +YK + ++Q+ L Y+R + P F G++ + L+
Sbjct: 109 PVLAEAYSNLGNVYK-----------ERGLLQEALDNYRRAVRLKPDFIDGYINLAAALV 157
Query: 67 EGYPSEEDREAIFQAYITALKEDPEQYRI 95
+ D EA QAYITAL+ +P+ Y +
Sbjct: 158 ----AARDMEAAVQAYITALQYNPDLYCV 182
>gi|194767414|ref|XP_001965811.1| GF13981 [Drosophila ananassae]
gi|190625935|gb|EDV41459.1| GF13981 [Drosophila ananassae]
Length = 396
Score = 37.4 bits (85), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 42/89 (47%), Gaps = 15/89 (16%)
Query: 7 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
P +AE N +YK + +Q+ L Y+R + P F G++ + L+
Sbjct: 116 PVLAEAYSNLGNVYK-----------ERGQLQEALDNYRRAVRLKPDFIDGYINLAAALV 164
Query: 67 EGYPSEEDREAIFQAYITALKEDPEQYRI 95
+ D E+ QAYITAL+ +PE Y +
Sbjct: 165 ----AARDMESAVQAYITALQYNPELYCV 189
>gi|195382543|ref|XP_002049989.1| GJ20442 [Drosophila virilis]
gi|194144786|gb|EDW61182.1| GJ20442 [Drosophila virilis]
Length = 1050
Score = 37.0 bits (84), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 42/89 (47%), Gaps = 15/89 (16%)
Query: 7 PTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLM 66
P +AE N +YK + +Q+ L Y+R + P F G++ + L+
Sbjct: 106 PVLAEAYSNLGNVYK-----------ERGQLQEALDNYRRAVRLKPDFIDGYINLAAALV 154
Query: 67 EGYPSEEDREAIFQAYITALKEDPEQYRI 95
+ D E+ QAYITAL+ +PE Y +
Sbjct: 155 ----AARDMESAVQAYITALQYNPELYCV 179
>gi|194880104|ref|XP_001974366.1| GG21695 [Drosophila erecta]
gi|190657553|gb|EDV54766.1| GG21695 [Drosophila erecta]
Length = 428
Score = 37.0 bits (84), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 54/128 (42%), Gaps = 10/128 (7%)
Query: 103 WARGQTASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATE-- 160
W R T L +E E+ L D+ E+A + + +S L + EL N+
Sbjct: 291 WIRSCTDQRLCRLNGREDEIRKELHDLEEQALQEESVQHSSQLMYSL-EVEELRNSIRNW 349
Query: 161 ----PTVLEK---LCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREER 213
T LE +C V + + V DL Y + L + ++ +D+EK REER
Sbjct: 350 QERLDTDLENADVMCTVSRLALQKVKDDLKFYMEQKAMYLSRIDEVQAIIDQEKMTREER 409
Query: 214 TEPQKANE 221
P + NE
Sbjct: 410 VSPCRENE 417
>gi|226940415|ref|YP_002795489.1| peroxiredoxin/glutaredoxin family protein [Laribacter hongkongensis
HLHK9]
gi|226715342|gb|ACO74480.1| Probable peroxiredoxin/glutaredoxin family protein [Laribacter
hongkongensis HLHK9]
Length = 245
Score = 36.6 bits (83), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 31/121 (25%), Positives = 57/121 (47%), Gaps = 12/121 (9%)
Query: 96 DAQKLEEWARGQTASSLVEFPSKEGEVE---GLLKDIAERASGKGNFSYSRFFAVGLFR- 151
D + EWA+ Q ++++V P GE G+L D A+ GK ++ YS G+ +
Sbjct: 82 DTFVMNEWAKDQESANIVMVPDGNGEFTEGMGMLVDKADLGFGKRSWRYSMLVKDGVVQK 141
Query: 152 -LLELANATEP---TVLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLK----EYV 203
+E +P + + + A +N N + D+ + ++ +AKELL +Y+
Sbjct: 142 MFIEPQEPGDPFKVSDADTMLAYINPNAKKPDQVVVFSKDGCPFCAKAKELLSGKGYDYI 201
Query: 204 D 204
D
Sbjct: 202 D 202
>gi|444321392|ref|XP_004181352.1| hypothetical protein TBLA_0F02940 [Tetrapisispora blattae CBS 6284]
gi|387514396|emb|CCH61833.1| hypothetical protein TBLA_0F02940 [Tetrapisispora blattae CBS 6284]
Length = 2621
Score = 36.6 bits (83), Expect = 8.6, Method: Composition-based stats.
Identities = 18/58 (31%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 180 RDLDVYRNLLSKLLQAKELLKEYVDREKKKRE-----ERTEPQKANEAIKKCLGEYLY 232
+ L ++ +++ + Q E+ K+Y +K+K ERT P + NEA+KK E +Y
Sbjct: 1996 KSLKIFDDMIKQFTQTSEISKKYSASDKEKSSSDILYERTSPPEMNEALKKIFEEGIY 2053
>gi|227496429|ref|ZP_03926715.1| recombination factor protein RarA [Actinomyces urogenitalis DSM
15434]
gi|226834048|gb|EEH66431.1| recombination factor protein RarA [Actinomyces urogenitalis DSM
15434]
Length = 455
Score = 36.6 bits (83), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 1/39 (2%)
Query: 42 MRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQ 80
+ Y YQYDP A GF D + EGYP E+REA ++
Sbjct: 388 LGYGEGYQYDPDTAEGFSGA-DYMPEGYPPREEREAFYE 425
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.134 0.375
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,506,343,023
Number of Sequences: 23463169
Number of extensions: 141391676
Number of successful extensions: 556381
Number of sequences better than 100.0: 269
Number of HSP's better than 100.0 without gapping: 202
Number of HSP's successfully gapped in prelim test: 67
Number of HSP's that attempted gapping in prelim test: 555841
Number of HSP's gapped (non-prelim): 328
length of query: 235
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 97
effective length of database: 9,121,278,045
effective search space: 884763970365
effective search space used: 884763970365
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 74 (33.1 bits)