BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 031338
(161 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225469306|ref|XP_002269823.1| PREDICTED: ycf20-like protein isoform 2 [Vitis vinifera]
gi|225469308|ref|XP_002269782.1| PREDICTED: ycf20-like protein isoform 1 [Vitis vinifera]
gi|297741124|emb|CBI31855.3| unnamed protein product [Vitis vinifera]
Length = 155
Score = 236 bits (602), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 122/156 (78%), Positives = 136/156 (87%), Gaps = 1/156 (0%)
Query: 6 MARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVR 65
MA ++ + S T L +FGL+ T S G+ + S SS F S++AVQEN GPRRLVDI+R
Sbjct: 1 MAHSLSLPS-TCLLKFGLATGTRVSHGKCSMLASRSSPRFFSVQAVQENEGPRRLVDIIR 59
Query: 66 LVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT 125
+VP+LSRNYF+SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVL+TEYVT
Sbjct: 60 IVPQLSRNYFQSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLITEYVT 119
Query: 126 RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 120 RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 155
>gi|255539575|ref|XP_002510852.1| conserved hypothetical protein [Ricinus communis]
gi|223549967|gb|EEF51454.1| conserved hypothetical protein [Ricinus communis]
Length = 155
Score = 234 bits (598), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 120/156 (76%), Positives = 134/156 (85%), Gaps = 1/156 (0%)
Query: 6 MARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVR 65
MA ++G++ T+ + G Q S G+ G+ S SS G SI+AVQ+NGGP+RLVDI+R
Sbjct: 1 MALSIGLIQ-TVPLKLGFGFQRRVSCGKYGMLISRSSRGLLSIQAVQDNGGPQRLVDIIR 59
Query: 66 LVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT 125
VPE SRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAV+CVLLTEYVT
Sbjct: 60 RVPEFSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVICVLLTEYVT 119
Query: 126 RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
RFYYSRPKVTFP+ALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 120 RFYYSRPKVTFPVALLNNFKMGFTYGLFIDAFKLAS 155
>gi|356566265|ref|XP_003551354.1| PREDICTED: uncharacterized protein ycf20-like [Glycine max]
Length = 151
Score = 234 bits (596), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 122/142 (85%), Positives = 128/142 (90%), Gaps = 3/142 (2%)
Query: 20 QFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPS 79
FG P+T S+ R +S SSAG R IRAVQENGGPRRLVDI+RLVPE SRNYFRSPS
Sbjct: 13 SFGPLPKTKVSVSGRLVS--RSSAGLR-IRAVQENGGPRRLVDIIRLVPEFSRNYFRSPS 69
Query: 80 RRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLA 139
RRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT+FYYSRPKVTFP+A
Sbjct: 70 RRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTKFYYSRPKVTFPVA 129
Query: 140 LLNNFKMGFTYGLFIDAFKLAS 161
LLNNFKMGFTYGLFIDAFKLAS
Sbjct: 130 LLNNFKMGFTYGLFIDAFKLAS 151
>gi|224122468|ref|XP_002318844.1| predicted protein [Populus trichocarpa]
gi|222859517|gb|EEE97064.1| predicted protein [Populus trichocarpa]
Length = 155
Score = 233 bits (593), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 119/156 (76%), Positives = 132/156 (84%), Gaps = 1/156 (0%)
Query: 6 MARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVR 65
MA++V +++ T + G +PQ S + G S SS G IRAVQEN GPRRL+DI+R
Sbjct: 1 MAQSVSLIT-TCPLKLGFAPQRRVSSVKYGTSVFRSSTGRLRIRAVQENEGPRRLIDIIR 59
Query: 66 LVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT 125
+PELSRNYFR PSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAV+CVLLTEYVT
Sbjct: 60 TIPELSRNYFRRPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVT 119
Query: 126 RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
RFYYSRPKVTFP+ALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 120 RFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS 155
>gi|449451787|ref|XP_004143642.1| PREDICTED: uncharacterized protein ycf20-like [Cucumis sativus]
gi|449530953|ref|XP_004172456.1| PREDICTED: uncharacterized protein ycf20-like [Cucumis sativus]
Length = 156
Score = 228 bits (582), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 120/158 (75%), Positives = 133/158 (84%), Gaps = 4/158 (2%)
Query: 6 MARAVGIVSPTILFQFGLSPQTIASLGRRGI--SFSSSSAGFRSIRAVQENGGPRRLVDI 63
MA++ +V T L +G +P+ S R + S S FR ++AVQ+ GGPRRLVDI
Sbjct: 1 MAQSASLVH-TSLLNYGFTPKMNPSYWRYDMVKSRPSPPRSFR-VKAVQDTGGPRRLVDI 58
Query: 64 VRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEY 123
+RLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAV+CVLLTEY
Sbjct: 59 IRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEY 118
Query: 124 VTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
VTRFYYSRPKVTFP+ALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 119 VTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS 156
>gi|212274997|ref|NP_001130729.1| hypothetical protein [Zea mays]
gi|194689966|gb|ACF79067.1| unknown [Zea mays]
gi|195626582|gb|ACG35121.1| hypothetical protein [Zea mays]
gi|223950487|gb|ACN29327.1| unknown [Zea mays]
gi|414586642|tpg|DAA37213.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
gi|414586643|tpg|DAA37214.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
gi|414586644|tpg|DAA37215.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
gi|414586645|tpg|DAA37216.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
gi|414586646|tpg|DAA37217.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
Length = 155
Score = 213 bits (541), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 110/153 (71%), Positives = 125/153 (81%), Gaps = 6/153 (3%)
Query: 9 AVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVRLVP 68
AVG+ S + + QT + ++ SS + IRAVQ NGG RRLVDI+R++P
Sbjct: 9 AVGLPSYGLYLETRFLTQTYRNFAQK------SSYKYSRIRAVQGNGGRRRLVDIIRIIP 62
Query: 69 ELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFY 128
ELSR+YF+S SRRALFGGISLLGGFYVAQTISLSFG LGVNDVIAAV+CVLLTEYVT+FY
Sbjct: 63 ELSRDYFKSRSRRALFGGISLLGGFYVAQTISLSFGTLGVNDVIAAVVCVLLTEYVTKFY 122
Query: 129 YSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
YSRPKVTFP+ALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 123 YSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS 155
>gi|115459106|ref|NP_001053153.1| Os04g0488600 [Oryza sativa Japonica Group]
gi|38344242|emb|CAD41335.2| OJ991113_30.19 [Oryza sativa Japonica Group]
gi|113564724|dbj|BAF15067.1| Os04g0488600 [Oryza sativa Japonica Group]
gi|215707021|dbj|BAG93481.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737413|dbj|BAG96543.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 155
Score = 211 bits (538), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 106/121 (87%), Positives = 111/121 (91%)
Query: 41 SSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTIS 100
SS F +RAVQ N G RRLVDI+R +PELSRNYFRS SRRALFGGISLLGGFYVAQTIS
Sbjct: 35 SSYKFIKVRAVQGNDGRRRLVDIIRTIPELSRNYFRSRSRRALFGGISLLGGFYVAQTIS 94
Query: 101 LSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLA 160
LSFGALGVNDVIAAV+CVLLTEYVT+FYYSRPKVTFP ALLNNFKMGFTYGLFIDAFKLA
Sbjct: 95 LSFGALGVNDVIAAVVCVLLTEYVTKFYYSRPKVTFPFALLNNFKMGFTYGLFIDAFKLA 154
Query: 161 S 161
S
Sbjct: 155 S 155
>gi|125548804|gb|EAY94626.1| hypothetical protein OsI_16403 [Oryza sativa Indica Group]
gi|125590821|gb|EAZ31171.1| hypothetical protein OsJ_15270 [Oryza sativa Japonica Group]
Length = 190
Score = 211 bits (536), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 106/121 (87%), Positives = 111/121 (91%)
Query: 41 SSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTIS 100
SS F +RAVQ N G RRLVDI+R +PELSRNYFRS SRRALFGGISLLGGFYVAQTIS
Sbjct: 70 SSYKFIKVRAVQGNDGRRRLVDIIRTIPELSRNYFRSRSRRALFGGISLLGGFYVAQTIS 129
Query: 101 LSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLA 160
LSFGALGVNDVIAAV+CVLLTEYVT+FYYSRPKVTFP ALLNNFKMGFTYGLFIDAFKLA
Sbjct: 130 LSFGALGVNDVIAAVVCVLLTEYVTKFYYSRPKVTFPFALLNNFKMGFTYGLFIDAFKLA 189
Query: 161 S 161
S
Sbjct: 190 S 190
>gi|297795157|ref|XP_002865463.1| hypothetical protein ARALYDRAFT_917392 [Arabidopsis lyrata subsp.
lyrata]
gi|297311298|gb|EFH41722.1| hypothetical protein ARALYDRAFT_917392 [Arabidopsis lyrata subsp.
lyrata]
Length = 159
Score = 209 bits (532), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 103/118 (87%), Positives = 109/118 (92%), Gaps = 2/118 (1%)
Query: 46 RSIRAVQENGGP--RRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSF 103
RSIRA+QE GP RRL+DI+R VPE+SRNYF+ PSRR LFGGISLLGGFYVAQTISLSF
Sbjct: 42 RSIRALQETEGPTPRRLIDIIRSVPEISRNYFKKPSRRTLFGGISLLGGFYVAQTISLSF 101
Query: 104 GALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
GALGVNDVIAAVLCVLLTEYVTRFYYSR VTFP+ALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 102 GALGVNDVIAAVLCVLLTEYVTRFYYSRTTVTFPIALLNNFKMGFTYGLFIDAFKLAS 159
>gi|15239166|ref|NP_199119.1| uncharacterized protein [Arabidopsis thaliana]
gi|9757831|dbj|BAB08268.1| unnamed protein product [Arabidopsis thaliana]
gi|88900426|gb|ABD57525.1| At5g43050 [Arabidopsis thaliana]
gi|332007521|gb|AED94904.1| uncharacterized protein [Arabidopsis thaliana]
Length = 158
Score = 208 bits (529), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 103/118 (87%), Positives = 109/118 (92%), Gaps = 2/118 (1%)
Query: 46 RSIRAVQENGGP--RRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSF 103
RSIRA+QE GP RRL+DI+R VPE+SRNYF+ PSRR LFGGISLLGGFYVAQTISLSF
Sbjct: 41 RSIRALQETEGPTPRRLIDIIRSVPEISRNYFKKPSRRTLFGGISLLGGFYVAQTISLSF 100
Query: 104 GALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
GALGVNDVIAAVLCVLLTEYVTRFYYSR VTFP+ALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 101 GALGVNDVIAAVLCVLLTEYVTRFYYSRTTVTFPIALLNNFKMGFTYGLFIDAFKLAS 158
>gi|357164342|ref|XP_003580023.1| PREDICTED: ycf20-like protein-like [Brachypodium distachyon]
Length = 155
Score = 205 bits (522), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 102/122 (83%), Positives = 111/122 (90%)
Query: 40 SSSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTI 99
SS +RA+Q N G RRLVDI+R++PELSRNYFRS SRRALFGGI+LLGGFYVAQTI
Sbjct: 34 KSSYRLLKVRALQGNDGRRRLVDIIRIIPELSRNYFRSGSRRALFGGIALLGGFYVAQTI 93
Query: 100 SLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKL 159
SLSFGAL VNDVIAAV+CVLLTEYVT+FYYSRPKVTFP+ALLNNFKMGFTYGLFIDAFKL
Sbjct: 94 SLSFGALAVNDVIAAVVCVLLTEYVTKFYYSRPKVTFPVALLNNFKMGFTYGLFIDAFKL 153
Query: 160 AS 161
AS
Sbjct: 154 AS 155
>gi|294460167|gb|ADE75666.1| unknown [Picea sitchensis]
Length = 200
Score = 197 bits (502), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 96/130 (73%), Positives = 112/130 (86%)
Query: 32 GRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLG 91
G+ I S+ +I+AVQ + PRRLVD ++++PE SRNYF+SP RRALFGGISLLG
Sbjct: 71 GKSCIMPPKSTFQSSNIKAVQGSRQPRRLVDAIKIIPEFSRNYFKSPFRRALFGGISLLG 130
Query: 92 GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYG 151
GFYVAQTISLSFGALGVNDVIAAVLCV++TEY T+FYY+R KVTFP+ALLNNFKMGFTYG
Sbjct: 131 GFYVAQTISLSFGALGVNDVIAAVLCVIVTEYATKFYYTRSKVTFPVALLNNFKMGFTYG 190
Query: 152 LFIDAFKLAS 161
LFIDAFKLA+
Sbjct: 191 LFIDAFKLAN 200
>gi|167999037|ref|XP_001752224.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696619|gb|EDQ82957.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 103
Score = 167 bits (422), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 78/103 (75%), Positives = 92/103 (89%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RLVD+VR +P ++R YF+ P RRALFGGI++LGGFYVAQTISLSFGALGVNDVIAA +CV
Sbjct: 1 RLVDVVRRLPRIARTYFKDPWRRALFGGIAILGGFYVAQTISLSFGALGVNDVIAAAMCV 60
Query: 119 LLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
L TEY+TRF +R K++F +AL+NNFKMGFTYGLFIDAFKLAS
Sbjct: 61 LFTEYITRFTRTRKKISFTVALVNNFKMGFTYGLFIDAFKLAS 103
>gi|302816212|ref|XP_002989785.1| hypothetical protein SELMODRAFT_49272 [Selaginella moellendorffii]
gi|302816919|ref|XP_002990137.1| hypothetical protein SELMODRAFT_49273 [Selaginella moellendorffii]
gi|300142150|gb|EFJ08854.1| hypothetical protein SELMODRAFT_49273 [Selaginella moellendorffii]
gi|300142351|gb|EFJ09052.1| hypothetical protein SELMODRAFT_49272 [Selaginella moellendorffii]
Length = 104
Score = 162 bits (410), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 80/104 (76%), Positives = 93/104 (89%)
Query: 58 RRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLC 117
+RL+D+V VP + NYFRSP RRALFGGISLLGGF+VAQTISLSFGA+GVNDVIA+ LC
Sbjct: 1 QRLLDLVGRVPGAATNYFRSPWRRALFGGISLLGGFFVAQTISLSFGAVGVNDVIASALC 60
Query: 118 VLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
VLLTEY+TRFY++R + + +ALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 61 VLLTEYMTRFYFTRQRPSLGVALLNNFKMGFTYGLFIDAFKLAS 104
>gi|384248674|gb|EIE22157.1| hypothetical protein COCSUDRAFT_47768 [Coccomyxa subellipsoidea
C-169]
Length = 169
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/112 (45%), Positives = 63/112 (56%), Gaps = 2/112 (1%)
Query: 50 AVQENGGPR-RLVDIVRLVPELSRNYF-RSPSRRALFGGISLLGGFYVAQTISLSFGALG 107
+VQ G R RL + VP YF P+RR L+ I+ G+Y A T+SLSFGAL
Sbjct: 56 SVQARGSGRTRLATRITTVPRRVDLYFADQPARRILWACIAFACGYYCANTVSLSFGALA 115
Query: 108 VNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKL 159
+NDV+AA L V E V+ YY+ PK T L L FK+G L DAFKL
Sbjct: 116 INDVVAAALTVAFCEVVSSLYYAAPKQTLKLLFLQCFKIGIICALIADAFKL 167
>gi|307110443|gb|EFN58679.1| hypothetical protein CHLNCDRAFT_140276 [Chlorella variabilis]
Length = 117
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 66/117 (56%), Gaps = 2/117 (1%)
Query: 46 RSIRAVQENGGPRRLVDIVRLVPE-LSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFG 104
RSI ++ RRL+ V P+ L R S RR ++ GISL GFY +LSFG
Sbjct: 2 RSI-CIKARDRQRRLLSYVLSAPQRLDRYLESSWWRRWMWAGISLSAGFYAGNIATLSFG 60
Query: 105 ALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
AL VNDV+AAV+ ++ E VT YYS K + + LNNFKMG + DA KL S
Sbjct: 61 ALAVNDVLAAVITLMFCEVVTHLYYSTEKPSLFIWFLNNFKMGLIAAMLADAAKLGS 117
>gi|303271617|ref|XP_003055170.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463144|gb|EEH60422.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 194
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 51/109 (46%), Positives = 61/109 (55%), Gaps = 3/109 (2%)
Query: 56 GPR--RLVDIVRLVPELSRNYFR-SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVI 112
GPR R+ I+ +P YFR RR L+ IS GFY A ISL+FG LG+NDV+
Sbjct: 86 GPRETRIASILTTLPIRIELYFRVRKGRRTLWKAISAFAGFYTANVISLTFGVLGINDVV 145
Query: 113 AAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
A CV E VTR YY++ K LN FK+G Y L DAFKL S
Sbjct: 146 AGAFCVGFFELVTRAYYNKIKPGKYWGFLNWFKLGLIYALVTDAFKLGS 194
>gi|412990920|emb|CCO18292.1| predicted protein [Bathycoccus prasinos]
Length = 253
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 63/104 (60%), Gaps = 1/104 (0%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRAL-FGGISLLGGFYVAQTISLSFGALGVNDVIAAVLC 117
R+ ++ +P+ YF+ S R L + I + GFY+A I+LSFGALG+NDV+A LC
Sbjct: 150 RIASVLTSLPQRLDLYFQRRSYRMLLWKTIFIFVGFYLASVITLSFGALGINDVVAGALC 209
Query: 118 VLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
V + E +TR+YY + L +N FK+G Y L DAFKL S
Sbjct: 210 VFVYETITRWYYKTSRPGRKLEFVNCFKLGLVYSLIADAFKLGS 253
>gi|308807413|ref|XP_003081017.1| unnamed protein product [Ostreococcus tauri]
gi|116059479|emb|CAL55186.1| unnamed protein product [Ostreococcus tauri]
Length = 178
Score = 84.3 bits (207), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 51/114 (44%), Positives = 66/114 (57%), Gaps = 5/114 (4%)
Query: 49 RAVQENGGPRRLVDIVRLVPELSRNYF-RSPSRRALFGGISLLGGFYVAQTISLSFGALG 107
RA +E RLV ++ +P YF R R L+ IS+ GFY+A ++LSFGALG
Sbjct: 69 RAERET----RLVSVLTSLPRRLELYFARRSYRVTLWRAISVFFGFYLANVMTLSFGALG 124
Query: 108 VNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+NDV+A L V E VTR YYS + L +N FK+G+ Y L DAFKL S
Sbjct: 125 INDVVAGALSVAFYEIVTRIYYSTLQQNKWLEFVNWFKVGYCYSLIADAFKLGS 178
>gi|145350586|ref|XP_001419683.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579915|gb|ABO97976.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 104
Score = 82.0 bits (201), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 47/104 (45%), Positives = 61/104 (58%), Gaps = 1/104 (0%)
Query: 59 RLVDIVRLVPELSRNYF-RSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLC 117
RLV ++ +P YF R R L+ IS GFY+A ++LSFGA+G+NDV+A L
Sbjct: 1 RLVSVLTSLPRRFNLYFQRRNYRVTLWRAISAYFGFYLANVMTLSFGAIGINDVVAGALS 60
Query: 118 VLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
V E VTR YY+ + L LN FK+G+ Y L DAFKL S
Sbjct: 61 VAFFEVVTRIYYASAETNKWLEFLNWFKVGYCYSLIADAFKLGS 104
>gi|255072633|ref|XP_002499991.1| predicted protein [Micromonas sp. RCC299]
gi|226515253|gb|ACO61249.1| predicted protein [Micromonas sp. RCC299]
Length = 181
Score = 79.7 bits (195), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 62/165 (37%), Positives = 79/165 (47%), Gaps = 17/165 (10%)
Query: 4 ALMARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRS----IRAVQENGGPR- 58
A ARA+G F+ G ASL RR ++ R RAV GPR
Sbjct: 27 ATSARALG-------FKGGEGMGQSASLARRQVAMVGPKVAGRGGAMVTRAVT---GPRE 76
Query: 59 -RLVDIVRLVPELSRNYFR-SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVL 116
RL I+ +P+ YFR R+ L+ I GFY A +SL+FG LG+NDV+A
Sbjct: 77 TRLGSILTTIPQRLDLYFRVRRGRKLLWKIIFAATGFYSANVLSLTFGVLGINDVVAGAF 136
Query: 117 CVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
C+ E VTR +Y + LN FK+G Y L DAFKL S
Sbjct: 137 CLAFAEIVTRSFYKKTNPGKYWDFLNWFKLGLCYALITDAFKLGS 181
>gi|443475153|ref|ZP_21065112.1| protein of unknown function DUF565 [Pseudanabaena biceps PCC 7429]
gi|443020076|gb|ELS34079.1| protein of unknown function DUF565 [Pseudanabaena biceps PCC 7429]
Length = 117
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 61/113 (53%), Gaps = 10/113 (8%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL IV + + +P RR ISLL GF++A IS S GA D++AA + V
Sbjct: 5 RLSQIVNINIRRLSQWSSNPWRRTSLIAISLLLGFFLASVISTSTGAKSELDILAAAITV 64
Query: 119 LLTEYVTRFYYSRPKVT--------FPLA--LLNNFKMGFTYGLFIDAFKLAS 161
E RF YS+ ++T PL +LN+ K+G TYGLF++AFKL S
Sbjct: 65 FFVELTNRFVYSQKRLTRSDGTITPRPLTSEMLNSLKLGITYGLFLEAFKLGS 117
>gi|427707469|ref|YP_007049846.1| hypothetical protein Nos7107_2073 [Nostoc sp. PCC 7107]
gi|427359974|gb|AFY42696.1| hypothetical protein Nos7107_2073 [Nostoc sp. PCC 7107]
Length = 110
Score = 65.5 bits (158), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 49/90 (54%), Gaps = 3/90 (3%)
Query: 75 FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR--- 131
FR+P RR IS L GF++ IS G D+ A VLLTE + R +YSR
Sbjct: 21 FRNPWRRLSILIISFLFGFFLGTAISTIAGQKAELDIFVAAFLVLLTELINRIFYSRNLF 80
Query: 132 PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
K L LN K+GFTY LF++AFKL S
Sbjct: 81 AKRPLWLECLNTLKVGFTYSLFVEAFKLGS 110
>gi|443312485|ref|ZP_21042102.1| Protein of unknown function (DUF565) [Synechocystis sp. PCC 7509]
gi|442777463|gb|ELR87739.1| Protein of unknown function (DUF565) [Synechocystis sp. PCC 7509]
Length = 109
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 55/95 (57%), Gaps = 3/95 (3%)
Query: 69 ELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFY 128
+LSR F +P RR ISLL GF++ +S + G D+IAA + V L+E V R Y
Sbjct: 16 QLSRWLF-NPWRRLSLLAISLLFGFFLGTAVSTTAGQAADWDIIAAGILVALSEIVDRIY 74
Query: 129 YSR--PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
Y R +F + LN K+G TY LFI+AFKL S
Sbjct: 75 YRRRSGDRSFWVESLNALKIGLTYSLFIEAFKLGS 109
>gi|378787253|gb|AFC39884.1| Ycf20 [Porphyra umbilicalis]
Length = 108
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/78 (47%), Positives = 51/78 (65%), Gaps = 1/78 (1%)
Query: 85 GGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYS-RPKVTFPLALLNN 143
G ISLL GF+++ ++S G G ++AA L V TE V++ YS + K + L+NN
Sbjct: 31 GLISLLLGFFISTSLSTIPGQTGDWGIVAASLIVAGTELVSKIVYSNKEKCVIKINLVNN 90
Query: 144 FKMGFTYGLFIDAFKLAS 161
FK+G TYGLF+DAFKL S
Sbjct: 91 FKIGITYGLFVDAFKLGS 108
>gi|67920532|ref|ZP_00514052.1| Protein of unknown function DUF565 [Crocosphaera watsonii WH 8501]
gi|67858016|gb|EAM53255.1| Protein of unknown function DUF565 [Crocosphaera watsonii WH 8501]
Length = 110
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL +V + F +P RR IS+L GF+V Q +S++ G D+ + +
Sbjct: 5 RLNTLVEVTQTKFNETFSNPWRRISLSLISVLLGFFVGQAVSITAGQQTYWDITVGIFLL 64
Query: 119 LLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ TE ++R YSR K + L +LN FK+G TY L+++A KL S
Sbjct: 65 IFTEGISRITYSRSKKEGRSLSLDILNLFKIGVTYSLYVEALKLGS 110
>gi|218246762|ref|YP_002372133.1| hypothetical protein PCC8801_1939 [Cyanothece sp. PCC 8801]
gi|257059804|ref|YP_003137692.1| hypothetical protein Cyan8802_1966 [Cyanothece sp. PCC 8802]
gi|218167240|gb|ACK65977.1| protein of unknown function DUF565 [Cyanothece sp. PCC 8801]
gi|256589970|gb|ACV00857.1| protein of unknown function DUF565 [Cyanothece sp. PCC 8802]
Length = 111
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 54/91 (59%), Gaps = 4/91 (4%)
Query: 75 FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR--- 131
F +P RR ISLL GF+V Q ++ S G D+ A +L TE+++R +YSR
Sbjct: 21 FSNPWRRIALSLISLLLGFFVGQAVTTSVGQSAYWDITVAGFFLLFTEFISRTFYSRLSI 80
Query: 132 -PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
K F L++LN FKMG YGL+++A KL S
Sbjct: 81 NNKRLFWLSMLNTFKMGVIYGLYLEALKLGS 111
>gi|452824347|gb|EME31350.1| hypothetical protein isoform 1 [Galdieria sulphuraria]
Length = 203
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/84 (40%), Positives = 52/84 (61%)
Query: 78 PSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP 137
P ++ L G +++L GF A +++L FGAL D + A + + TE TR+YY +F
Sbjct: 120 PKKQLLSGLMAMLFGFTCASSLTLIFGALSFLDPLIAGVLAVCTELYTRYYYQESSPSFS 179
Query: 138 LALLNNFKMGFTYGLFIDAFKLAS 161
L L+N FK+G GL +DAFK++S
Sbjct: 180 LRLVNAFKIGVMTGLILDAFKISS 203
>gi|428298335|ref|YP_007136641.1| hypothetical protein Cal6303_1627 [Calothrix sp. PCC 6303]
gi|428234879|gb|AFZ00669.1| protein of unknown function DUF565 [Calothrix sp. PCC 6303]
Length = 109
Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 56/105 (53%), Gaps = 2/105 (1%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL ++ + ++ F +P RR IS L GF++ +S + G D++AA +
Sbjct: 5 RLNNLFDAIARRTQEVFVNPWRRLSLLVISFLSGFFLGTAVSTTAGQKAELDIVAAAFIL 64
Query: 119 LLTEYVTRFYYSRPKVTFPLAL--LNNFKMGFTYGLFIDAFKLAS 161
++TE V+R YY R T L L LN K+G Y LFI+A KL S
Sbjct: 65 IITETVSRIYYRRGNGTNSLGLEVLNTLKIGLMYSLFIEALKLGS 109
>gi|16329594|ref|NP_440322.1| hypothetical protein sll1509 [Synechocystis sp. PCC 6803]
gi|383321335|ref|YP_005382188.1| hypothetical protein SYNGTI_0426 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383324505|ref|YP_005385358.1| hypothetical protein SYNPCCP_0426 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383490389|ref|YP_005408065.1| hypothetical protein SYNPCCN_0426 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384435655|ref|YP_005650379.1| hypothetical protein SYNGTS_0426 [Synechocystis sp. PCC 6803]
gi|451813753|ref|YP_007450205.1| YCF20 protein [Synechocystis sp. PCC 6803]
gi|6136503|sp|P72983.1|YC20L_SYNY3 RecName: Full=Ycf20-like protein
gi|1652077|dbj|BAA17002.1| ycf20 [Synechocystis sp. PCC 6803]
gi|339272687|dbj|BAK49174.1| hypothetical protein SYNGTS_0426 [Synechocystis sp. PCC 6803]
gi|359270654|dbj|BAL28173.1| hypothetical protein SYNGTI_0426 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359273825|dbj|BAL31343.1| hypothetical protein SYNPCCN_0426 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359276995|dbj|BAL34512.1| hypothetical protein SYNPCCP_0426 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407957475|dbj|BAM50715.1| hypothetical protein BEST7613_1784 [Bacillus subtilis BEST7613]
gi|451779722|gb|AGF50691.1| YCF20 protein [Synechocystis sp. PCC 6803]
Length = 109
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 56/105 (53%), Gaps = 2/105 (1%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL IV + + +FR+P RR +S L GF+V ++ + G DV+ A +
Sbjct: 5 RLNTIVEVRGQQLSQFFRNPWRRISLSLLSFLFGFFVGTAVATTAGQNSQWDVVCAAFIL 64
Query: 119 LLTEYVTRFYYSRPKVTFPL--ALLNNFKMGFTYGLFIDAFKLAS 161
L E V R++Y R L +LN FKMG +Y LF++AFKL S
Sbjct: 65 LFCELVNRWFYRRGVKMGDLQAEVLNIFKMGVSYSLFLEAFKLGS 109
>gi|440682864|ref|YP_007157659.1| protein of unknown function DUF565 [Anabaena cylindrica PCC 7122]
gi|428679983|gb|AFZ58749.1| protein of unknown function DUF565 [Anabaena cylindrica PCC 7122]
Length = 110
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 58/106 (54%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL +++ + L +F +P RR I+ L GF++ +S + G D++ A + V
Sbjct: 5 RLNNLLDTIARLLGQWFVNPWRRLSLLVINFLFGFFLGSAVSTTTGQKAELDIVVAAVLV 64
Query: 119 LLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
LLTE +R +YSR K + LN K+GF Y LF++AFKL S
Sbjct: 65 LLTEITSRIFYSRTFFAKQALWVEALNFLKVGFIYSLFLEAFKLGS 110
>gi|90994406|ref|YP_536896.1| hypothetical chloroplast protein 20 [Pyropia yezoensis]
gi|122194735|sp|Q1XDS2.1|YCF20_PORYE RecName: Full=Uncharacterized protein ycf20
gi|90818970|dbj|BAE92339.1| unnamed protein product [Pyropia yezoensis]
Length = 108
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 55/97 (56%), Gaps = 2/97 (2%)
Query: 66 LVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT 125
L+ L+ + S S G ISLL GF+++ +S G G +IAA L V TE +
Sbjct: 13 LIKNLNNKLYYSLSELTT-GLISLLLGFFISTGLSTIPGQTGDWGIIAASLIVAATELTS 71
Query: 126 RFYYSRPK-VTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ YS K + + L NNFK+G TYGLF+DAFKL S
Sbjct: 72 KIVYSSHKQLNIKINLFNNFKIGITYGLFVDAFKLGS 108
>gi|428201383|ref|YP_007079972.1| hypothetical protein Ple7327_0997 [Pleurocapsa sp. PCC 7327]
gi|427978815|gb|AFY76415.1| Protein of unknown function (DUF565) [Pleurocapsa sp. PCC 7327]
Length = 107
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 51/88 (57%)
Query: 74 YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPK 133
+F +P RR I +L GF+ +S + G D+IAA +L E+V+R +Y R +
Sbjct: 20 FFSNPWRRISLILIGVLFGFFAGSALSTTAGQAAEWDIIAAGFVLLFVEFVSRIFYRRER 79
Query: 134 VTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ L +LN FK+G Y LF++AFKL S
Sbjct: 80 RSLFLEVLNFFKIGMIYSLFLEAFKLGS 107
>gi|428316563|ref|YP_007114445.1| protein of unknown function DUF565 [Oscillatoria nigro-viridis PCC
7112]
gi|428240243|gb|AFZ06029.1| protein of unknown function DUF565 [Oscillatoria nigro-viridis PCC
7112]
Length = 110
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 60/112 (53%), Gaps = 6/112 (5%)
Query: 53 ENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVI 112
+N RL+D++ + R + +P RR ISLL G ++ IS G D+I
Sbjct: 2 QNTRINRLIDVL---GDRFRGWLSNPWRRISLLIISLLFGTFLGTAISTIAGQSADWDII 58
Query: 113 AAVLCVLLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
AA L VLLTE+ R Y + +F + +LN K+G TY LF++AFKL S
Sbjct: 59 AAGLLVLLTEFANRLVYGSLRSDDRSFWVQMLNALKIGLTYNLFVEAFKLGS 110
>gi|416377790|ref|ZP_11683661.1| hypothetical protein CWATWH0003_0506 [Crocosphaera watsonii WH
0003]
gi|357266150|gb|EHJ14818.1| hypothetical protein CWATWH0003_0506 [Crocosphaera watsonii WH
0003]
Length = 110
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL +V + F +P RR IS+L GF+V Q +S++ G D+ + +
Sbjct: 5 RLNTLVEVTQTKFNETFSNPWRRISLSLISVLLGFFVGQAVSITAGQKTYWDINVGIFLL 64
Query: 119 LLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ TE ++R YSR K + L +LN FK+G TY L+++A KL S
Sbjct: 65 IFTEGISRITYSRSKKEGRSLSLDILNLFKIGVTYSLYVEALKLGS 110
>gi|172037433|ref|YP_001803934.1| hypothetical protein cce_2520 [Cyanothece sp. ATCC 51142]
gi|354553685|ref|ZP_08972991.1| protein of unknown function DUF565 [Cyanothece sp. ATCC 51472]
gi|171698887|gb|ACB51868.1| hypothetical protein cce_2520 [Cyanothece sp. ATCC 51142]
gi|353554402|gb|EHC23792.1| protein of unknown function DUF565 [Cyanothece sp. ATCC 51472]
Length = 110
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 59/106 (55%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL +V + F +P RR ISLL GF+V Q++S++ G D+ + +
Sbjct: 5 RLNTLVEVTQTKFNQTFSNPWRRISLSLISLLLGFFVGQSVSITAGQEAYWDITVGIFLL 64
Query: 119 LLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ TE ++R YS+ K + L +LN FK+G TYGL+I+A KL S
Sbjct: 65 IFTEGISRIAYSQNKKKGRSLGLDILNLFKIGMTYGLYIEALKLGS 110
>gi|334118318|ref|ZP_08492408.1| protein of unknown function DUF565 [Microcoleus vaginatus FGP-2]
gi|333460303|gb|EGK88913.1| protein of unknown function DUF565 [Microcoleus vaginatus FGP-2]
Length = 110
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 59/113 (52%), Gaps = 6/113 (5%)
Query: 52 QENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDV 111
+N RL+D++ + R + +P RR ISLL G ++ IS G D+
Sbjct: 1 MQNTRINRLIDVL---GDRFRGWLSNPWRRISLLIISLLFGTFLGTAISTIAGQSADWDI 57
Query: 112 IAAVLCVLLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
IAA L VLLTE V Y P+ + + +LN K+G TY LF++AFKL S
Sbjct: 58 IAAGLLVLLTELVNWLVYGGPRPATGSLWVEMLNALKIGLTYNLFVEAFKLGS 110
>gi|428223860|ref|YP_007107957.1| hypothetical protein GEI7407_0404 [Geitlerinema sp. PCC 7407]
gi|427983761|gb|AFY64905.1| hypothetical protein GEI7407_0404 [Geitlerinema sp. PCC 7407]
Length = 110
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 60/112 (53%), Gaps = 6/112 (5%)
Query: 53 ENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVI 112
+N L+D+ L+ +L R + +P RR ISLL GF+ +S + G G DV+
Sbjct: 2 QNTRLSSLIDV--LLAQL-RRWLLNPWRRVSLVVISLLFGFFTGNALSTTTGQSGDIDVL 58
Query: 113 AAVLCVLLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
A + V LTE V+ F Y R + + LN FK+G Y LF++AFKL S
Sbjct: 59 VAAITVSLTEAVSWFVYGRRWGERQPLLIDSLNAFKLGLIYSLFLEAFKLGS 110
>gi|414078888|ref|YP_006998206.1| hypothetical protein ANA_C13740 [Anabaena sp. 90]
gi|413972304|gb|AFW96393.1| hypothetical protein ANA_C13740 [Anabaena sp. 90]
Length = 110
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL +++ + +F +P RR IS L GF++ +S + G D++ A V
Sbjct: 5 RLNNLLETIFSSLSEWFLNPWRRLSLILISFLFGFFLGSAVSTTAGQKAELDIVVAGFLV 64
Query: 119 LLTEYVTRFYYSRP---KVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
LLTE +R +YSR + F + +LN K+GF Y LF++A KL S
Sbjct: 65 LLTEITSRIFYSRSFFVRQAFWVEILNYLKVGFIYSLFLEALKLGS 110
>gi|78211846|ref|YP_380625.1| hypothetical protein Syncc9605_0294 [Synechococcus sp. CC9605]
gi|78196305|gb|ABB34070.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 110
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 50/88 (56%), Gaps = 1/88 (1%)
Query: 73 NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEY-VTRFYYSR 131
N+ R+P RR I+ L GF + I+ G LG D +AA+L VL TE + R S
Sbjct: 22 NWARNPWRRLSLLAIAGLIGFLIGTAITPVAGVLGQMDPVAALLVVLGTELTIRRQRSSE 81
Query: 132 PKVTFPLALLNNFKMGFTYGLFIDAFKL 159
P + P LL+ ++GF YGLF++ FKL
Sbjct: 82 PSLKLPQQLLDLGRIGFLYGLFLEGFKL 109
>gi|434397744|ref|YP_007131748.1| hypothetical protein Sta7437_1212 [Stanieria cyanosphaera PCC 7437]
gi|428268841|gb|AFZ34782.1| hypothetical protein Sta7437_1212 [Stanieria cyanosphaera PCC 7437]
Length = 116
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/111 (38%), Positives = 59/111 (53%), Gaps = 12/111 (10%)
Query: 60 LVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVL 119
LVD+V EL F +P RR IS+L GF++ I+ + G DVI A L +
Sbjct: 9 LVDVVGARIEL---LFSNPWRRFALSIISILFGFFMGSAIATTAGQDASWDVIGAALILF 65
Query: 120 LTEYVTRFYYSRP-------KVTFPL--ALLNNFKMGFTYGLFIDAFKLAS 161
TE ++ F Y RP + PL ++N FK+G TY LF++AFKL S
Sbjct: 66 FTELLSNFVYRRPSRDRTETNLRRPLLVDVVNLFKIGLTYSLFLEAFKLGS 116
>gi|282899121|ref|ZP_06307102.1| protein of unknown function DUF565 [Cylindrospermopsis raciborskii
CS-505]
gi|281196037|gb|EFA70953.1| protein of unknown function DUF565 [Cylindrospermopsis raciborskii
CS-505]
Length = 110
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 56/106 (52%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL +++ + +F +P RR IS L GF++ +S + G D+ AA + V
Sbjct: 5 RLNNLLDTIATNVAQWFINPWRRLCLLLISWLFGFFLGSVVSTTAGQQAQLDIWAAAVLV 64
Query: 119 LLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
LLTE +R +YSR + + LN K+G Y LF++AFKL S
Sbjct: 65 LLTEVASRLFYSRGFFSQRAIWVESLNWLKIGLIYSLFLEAFKLGS 110
>gi|282897978|ref|ZP_06305973.1| Protein of unknown function DUF565 [Raphidiopsis brookii D9]
gi|281197122|gb|EFA72023.1| Protein of unknown function DUF565 [Raphidiopsis brookii D9]
Length = 120
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 56/106 (52%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL +++ + +F +P RR IS L GF++ +S + G D+ AA + V
Sbjct: 15 RLNNLLDTIATNVAQWFVNPWRRLCLLLISWLFGFFLGSVVSTTAGQQAQLDIWAAAVLV 74
Query: 119 LLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
LLTE +R +YSR + + LN K+G Y LF++AFKL S
Sbjct: 75 LLTEVASRLFYSRGFFSQRAIWVESLNWLKIGLIYSLFLEAFKLGS 120
>gi|409992567|ref|ZP_11275749.1| hypothetical protein APPUASWS_15812 [Arthrospira platensis str.
Paraca]
gi|409936561|gb|EKN78043.1| hypothetical protein APPUASWS_15812 [Arthrospira platensis str.
Paraca]
Length = 111
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 58/99 (58%), Gaps = 4/99 (4%)
Query: 67 VPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTR 126
+ + + +FR+P RR ISLL GF++ I+ G +G DV A + +L+TE+++
Sbjct: 13 IAQQTLQWFRNPWRRISVLLISLLFGFFLGVAIATIVGQVGSWDVSVAFITLLVTEWISW 72
Query: 127 FYY---SRPKV-TFPLALLNNFKMGFTYGLFIDAFKLAS 161
Y SR ++ + L LLN+ K+G TY LF+ AF L S
Sbjct: 73 LSYRKVSRSEINSLGLDLLNSLKIGMTYSLFLLAFLLGS 111
>gi|11465680|ref|NP_053824.1| ORF20 [Porphyra purpurea]
gi|1723324|sp|P51214.1|YCF20_PORPU RecName: Full=Uncharacterized protein ycf20
gi|1276680|gb|AAC08100.1| hypothetical chloroplast ORF 20 (chloroplast) [Porphyra purpurea]
Length = 108
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/92 (44%), Positives = 53/92 (57%), Gaps = 3/92 (3%)
Query: 71 SRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYS 130
SR Y+ A G ISLL GF+++ +S G G +IAA L V E V++ YS
Sbjct: 19 SRLYYSLNELTA--GLISLLLGFFISTGLSTIPGQTGDWGIIAASLIVAAIELVSKIVYS 76
Query: 131 -RPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ K + LLNN K+G TYGLF+DAFKL S
Sbjct: 77 NKKKYGVRINLLNNLKIGITYGLFVDAFKLGS 108
>gi|427720360|ref|YP_007068354.1| hypothetical protein Cal7507_5179 [Calothrix sp. PCC 7507]
gi|427352796|gb|AFY35520.1| protein of unknown function DUF565 [Calothrix sp. PCC 7507]
Length = 110
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 50/92 (54%), Gaps = 3/92 (3%)
Query: 73 NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR- 131
+F +P RR IS L GF++ +S + G D++ A V LTE +R +Y+R
Sbjct: 19 EWFTNPWRRLSLLVISFLFGFFLGTAVSTTAGQKADWDIVVAAFLVFLTEISSRIFYNRS 78
Query: 132 --PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
K + + LN K+GFTY LFI+A KL S
Sbjct: 79 FLSKRSLWVEALNLLKVGFTYSLFIEALKLGS 110
>gi|218437362|ref|YP_002375691.1| hypothetical protein PCC7424_0355 [Cyanothece sp. PCC 7424]
gi|218170090|gb|ACK68823.1| protein of unknown function DUF565 [Cyanothece sp. PCC 7424]
Length = 116
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 61/111 (54%), Gaps = 12/111 (10%)
Query: 60 LVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVL 119
LVD + V +L R ++R+P RR I LGG +VAQ +S + G D++ A +L
Sbjct: 9 LVDTI--VFKLQR-FYRNPWRRISVLVIGFLGGIFVAQALSTTGGQEADWDIVMATFVLL 65
Query: 120 LTEYVTRFYYSRPK-------VTFPL--ALLNNFKMGFTYGLFIDAFKLAS 161
TE V+ F Y P+ + + L LN FK+G TY +F++AFKL S
Sbjct: 66 FTEGVSIFVYRLPRKQSSNLDIRYSLFTESLNMFKIGITYSMFLEAFKLGS 116
>gi|298493111|ref|YP_003723288.1| hypothetical protein Aazo_5069 ['Nostoc azollae' 0708]
gi|298235029|gb|ADI66165.1| protein of unknown function DUF565 ['Nostoc azollae' 0708]
Length = 110
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 56/106 (52%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL +++ + +F +P RR IS L GF++ +S + G D++ A V
Sbjct: 5 RLNNLLDTIARSLGQWFVNPWRRFSLLLISWLFGFFMGSAVSTTAGQKAELDIVVAAFLV 64
Query: 119 LLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
LLTE +R +YSR + + LN K+GF Y LF++AFKL S
Sbjct: 65 LLTEITSRIFYSRNFFARQAIWVESLNLLKIGFIYSLFLEAFKLGS 110
>gi|434405265|ref|YP_007148150.1| Protein of unknown function (DUF565) [Cylindrospermum stagnale PCC
7417]
gi|428259520|gb|AFZ25470.1| Protein of unknown function (DUF565) [Cylindrospermum stagnale PCC
7417]
Length = 110
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 51/92 (55%), Gaps = 3/92 (3%)
Query: 73 NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR- 131
+F +P RR IS L GF++ +S + G D++ A VLLTE +R +YSR
Sbjct: 19 QWFINPWRRLSLLVISFLFGFFLGTAVSTTAGQKAELDIVVAGFLVLLTEVTSRIFYSRG 78
Query: 132 --PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ + + LN K+GF Y LFI+AFKL S
Sbjct: 79 FLARRSLWVESLNLLKVGFIYSLFIEAFKLGS 110
>gi|126654810|ref|ZP_01726344.1| hypothetical protein CY0110_10252 [Cyanothece sp. CCY0110]
gi|126623545|gb|EAZ94249.1| hypothetical protein CY0110_10252 [Cyanothece sp. CCY0110]
Length = 110
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL +V + F +P RR ISLL GF+V Q++S++ G D+ + +
Sbjct: 5 RLNTLVEVTQTKFNQTFSNPWRRISLSLISLLLGFFVGQSVSITAGQEAYWDITVGIFLL 64
Query: 119 LLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ TE ++R YSR + L +LN FK+G TY L+++A KL S
Sbjct: 65 IFTEGISRIAYSRSSKKARSLSLDILNLFKIGITYSLYVEALKLGS 110
>gi|449020096|dbj|BAM83498.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 206
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 46/78 (58%)
Query: 83 LFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLN 142
+G +LL G + A +IS GAL D IA + V + E++ YY RP+ + + +LN
Sbjct: 128 FWGSCALLFGAFSATSISTIVGALAEWDPIAGAVIVFIMEWIATQYYRRPRPSNIIRILN 187
Query: 143 NFKMGFTYGLFIDAFKLA 160
+++G T+G +DA KLA
Sbjct: 188 AYQVGLTFGFILDALKLA 205
>gi|427730935|ref|YP_007077172.1| hypothetical protein Nos7524_3796 [Nostoc sp. PCC 7524]
gi|427366854|gb|AFY49575.1| Protein of unknown function (DUF565) [Nostoc sp. PCC 7524]
Length = 110
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 55/102 (53%), Gaps = 6/102 (5%)
Query: 66 LVPELSRN---YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTE 122
L ++RN +F +P RR IS L GF++ IS G G D++ A V+LTE
Sbjct: 9 LFDAIARNLGLWFLNPWRRLSLLVISFLFGFFLGTAISTIAGQRGTLDIMIAGFLVVLTE 68
Query: 123 YVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+R +YSR + + LN K+GF Y LFI+AFKL S
Sbjct: 69 VTSRIFYSRSFFARRELWVESLNLLKVGFIYSLFIEAFKLGS 110
>gi|291566840|dbj|BAI89112.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 111
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 61/113 (53%), Gaps = 7/113 (6%)
Query: 53 ENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVI 112
+N R D + + + +FR+P RR ISL GF++ I+ G +G DV
Sbjct: 2 QNTRLNRFTDAI---AQQTLQWFRNPWRRISVLLISLFFGFFLGVAIATIVGQVGSWDVS 58
Query: 113 AAVLCVLLTEYVTRFYY---SRPKV-TFPLALLNNFKMGFTYGLFIDAFKLAS 161
A + +L+TE+++ Y SR ++ + L LLN+ K+G TY LF+ AF L S
Sbjct: 59 VAFITLLVTEWISWLSYRKVSRSEINSLWLDLLNSMKIGMTYSLFLLAFLLGS 111
>gi|209527452|ref|ZP_03275957.1| protein of unknown function DUF565 [Arthrospira maxima CS-328]
gi|376003361|ref|ZP_09781173.1| putative (predicted) allophycocyanin linker protein [Arthrospira
sp. PCC 8005]
gi|423066926|ref|ZP_17055716.1| hypothetical protein SPLC1_S530040 [Arthrospira platensis C1]
gi|209492125|gb|EDZ92475.1| protein of unknown function DUF565 [Arthrospira maxima CS-328]
gi|375328283|emb|CCE16926.1| putative (predicted) allophycocyanin linker protein [Arthrospira
sp. PCC 8005]
gi|406711212|gb|EKD06413.1| hypothetical protein SPLC1_S530040 [Arthrospira platensis C1]
Length = 111
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 60/113 (53%), Gaps = 7/113 (6%)
Query: 53 ENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVI 112
+N R D + + + +FR+P RR ISLL GF++ I+ G G DV
Sbjct: 2 QNTRLNRFTDAI---AQQTLQWFRNPWRRISVLLISLLFGFFLGVAIATIVGQAGSWDVT 58
Query: 113 AAVLCVLLTEYVTRFYYSR-PKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
A + +L+TE+++ Y + P+ + L LLN+ K+G TY LF+ AF L S
Sbjct: 59 VAFITLLVTEWISWLSYRKVPRSEINSLWLDLLNSLKIGMTYSLFLLAFLLGS 111
>gi|443328441|ref|ZP_21057038.1| Protein of unknown function (DUF565) [Xenococcus sp. PCC 7305]
gi|442791895|gb|ELS01385.1| Protein of unknown function (DUF565) [Xenococcus sp. PCC 7305]
Length = 114
Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 50/94 (53%), Gaps = 7/94 (7%)
Query: 75 FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV 134
F +P RR I+LL GF++ I S G V D+ A + ++ E +R+ Y + K
Sbjct: 21 FNNPWRRISLNIIALLMGFFMGSAIVSSAGQDAVWDITGAAVLLIFAELSSRWIYGQNKK 80
Query: 135 TFP-------LALLNNFKMGFTYGLFIDAFKLAS 161
+ L +LN FK+G TY LF++AFKL S
Sbjct: 81 SSAIIRQSLFLDILNKFKIGITYSLFLEAFKLNS 114
>gi|427420155|ref|ZP_18910338.1| Protein of unknown function (DUF565) [Leptolyngbya sp. PCC 7375]
gi|425762868|gb|EKV03721.1| Protein of unknown function (DUF565) [Leptolyngbya sp. PCC 7375]
Length = 110
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 56/106 (52%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL +V + + +P RR ISLL G + A IS + G D++ + + V
Sbjct: 5 RLSTLVDQAGQRFSQWVFNPWRRISLAVISLLFGNFFATAISATAGQTAEIDILISAILV 64
Query: 119 LLTEYVTRFYY---SRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ E+++ Y+ S + T LLN+FK+G YGLF++AFKL S
Sbjct: 65 AMVEFISWLYHRSRSNGQRTILPELLNSFKLGMVYGLFVEAFKLGS 110
>gi|307153416|ref|YP_003888800.1| hypothetical protein Cyan7822_3584 [Cyanothece sp. PCC 7822]
gi|306983644|gb|ADN15525.1| protein of unknown function DUF565 [Cyanothece sp. PCC 7822]
Length = 116
Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/99 (39%), Positives = 51/99 (51%), Gaps = 9/99 (9%)
Query: 72 RNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY-- 129
+ + +P RR I LGGF+ AQ IS + G DV+ A L +L TE + F Y
Sbjct: 18 QRFLSNPWRRISVLLIGFLGGFFAAQAISTTGGQKADLDVVMAALILLFTEGASIFVYRF 77
Query: 130 -SRP------KVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
RP + + LN FKMG YGLF++AFKL S
Sbjct: 78 SGRPSKAGEVRQWLFVETLNLFKMGMIYGLFLEAFKLGS 116
>gi|186685288|ref|YP_001868484.1| hypothetical protein Npun_R5211 [Nostoc punctiforme PCC 73102]
gi|186467740|gb|ACC83541.1| protein of unknown function DUF565 [Nostoc punctiforme PCC 73102]
Length = 110
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL +++ + +F +P RR IS L GF++ +S + G D++ A V
Sbjct: 5 RLNNLLDAIATRLGQWFLNPWRRLSLLIISFLFGFFLGNAVSTTAGQRAELDIVVAGFLV 64
Query: 119 LLTEYVTRFYYSRPKVT---FPLALLNNFKMGFTYGLFIDAFKLAS 161
+LTE +R +YS+ T + LN K+GFTY LF++AFKL S
Sbjct: 65 VLTEVTSRIFYSQSFFTRRSLFVDSLNLLKVGFTYSLFLEAFKLGS 110
>gi|434393240|ref|YP_007128187.1| hypothetical protein Glo7428_2520 [Gloeocapsa sp. PCC 7428]
gi|428265081|gb|AFZ31027.1| hypothetical protein Glo7428_2520 [Gloeocapsa sp. PCC 7428]
Length = 110
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 59 RLVDIVR-LVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLC 117
RL D+ + +L R +F +P R+ IS+L GF++ IS G D+I A +
Sbjct: 5 RLNDLFNSMAGQLGR-WFFNPWRKLSLLVISVLFGFFLGSAISTIAGQAADWDIIGAGVL 63
Query: 118 VLLTEYVTRFYYS---RPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
V +E V R YYS R + L LN K+G TY LF++AFKL S
Sbjct: 64 VAWSEIVDRIYYSRTWRKQRGLWLEALNAGKIGLTYSLFLEAFKLGS 110
>gi|427736638|ref|YP_007056182.1| hypothetical protein Riv7116_3162 [Rivularia sp. PCC 7116]
gi|427371679|gb|AFY55635.1| Protein of unknown function (DUF565) [Rivularia sp. PCC 7116]
Length = 110
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 48/91 (52%), Gaps = 3/91 (3%)
Query: 74 YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP- 132
+F +P RR IS L GF++ + + G V D++ A V LTE +R Y RP
Sbjct: 20 WFLNPWRRISLLMISFLFGFFMGTVAATTAGQRAVLDIVVAAFLVFLTEIGSRIVYRRPP 79
Query: 133 --KVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ + LN K+GF Y LF++AFKL S
Sbjct: 80 KERRLLWVESLNFLKVGFIYSLFVEAFKLGS 110
>gi|260436046|ref|ZP_05790016.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
gi|260413920|gb|EEX07216.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
Length = 134
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 49/87 (56%), Gaps = 1/87 (1%)
Query: 74 YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEY-VTRFYYSRP 132
+ R+P RR ++ L GF + I+ G LG D IAA++ VL TE + R S P
Sbjct: 47 WARNPWRRFSLLALAGLIGFLIGSAITSVAGVLGQMDPIAALVVVLGTELTIRRRRSSEP 106
Query: 133 KVTFPLALLNNFKMGFTYGLFIDAFKL 159
+ P LL+ ++GF YGLF++ FKL
Sbjct: 107 SLKLPQQLLDLGRIGFLYGLFLEGFKL 133
>gi|428306677|ref|YP_007143502.1| hypothetical protein Cri9333_3154 [Crinalium epipsammum PCC 9333]
gi|428248212|gb|AFZ13992.1| protein of unknown function DUF565 [Crinalium epipsammum PCC 9333]
Length = 118
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 59/114 (51%), Gaps = 11/114 (9%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL ++V ++ R + +P RR ISLL G ++ I + G DV++A + V
Sbjct: 5 RLNNLVDVILGQLRLWLSNPWRRLSLIVISLLLGIFLGTAIPTTSGQTAEWDVVSAGVLV 64
Query: 119 LLTEYVTRFYYSRPKVT-----------FPLALLNNFKMGFTYGLFIDAFKLAS 161
+ TE V+RF Y R + F LLN+FK+G Y LF++AFKL S
Sbjct: 65 IFTEAVSRFVYGRNRRPPANEEALSRGLFVAELLNSFKIGLIYSLFVEAFKLGS 118
>gi|422304172|ref|ZP_16391521.1| Ycf20-like protein [Microcystis aeruginosa PCC 9806]
gi|389790773|emb|CCI13392.1| Ycf20-like protein [Microcystis aeruginosa PCC 9806]
Length = 113
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 52/107 (48%), Gaps = 4/107 (3%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL + + ++F +P RR I +L G + Q I + G DV AA L V
Sbjct: 7 RLSTLANVTSSRFNSFFGNPWRRISLQIICVLFGIFSGQAIVTTAGQTAQWDVTAAGLLV 66
Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
L TE ++R Y + P +L N K+G TY LF++AFK+ S
Sbjct: 67 LFTEVISRIVYRKSSQAKPAPILRESFNLLKIGITYSLFLEAFKIGS 113
>gi|81299804|ref|YP_400012.1| hypothetical protein Synpcc7942_0995 [Synechococcus elongatus PCC
7942]
gi|81168685|gb|ABB57025.1| conserved hypothetical protein YCF20 [Synechococcus elongatus PCC
7942]
Length = 131
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 46/94 (48%), Gaps = 5/94 (5%)
Query: 73 NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY--- 129
+F+ P RR I LL GF++ T+SL G GV D A + + E + YY
Sbjct: 38 QWFQQPWRRLSLQIIFLLFGFWLTNTLSLVAGQSGVWDPSFAAVITISVETCSWLYYRGP 97
Query: 130 --SRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+P + P LL N K+G YGL ++ KL S
Sbjct: 98 GAGKPDRSLPFLLLQNLKLGALYGLALETLKLGS 131
>gi|425442753|ref|ZP_18822990.1| Ycf20-like protein [Microcystis aeruginosa PCC 9717]
gi|425466273|ref|ZP_18845576.1| Ycf20-like protein [Microcystis aeruginosa PCC 9809]
gi|389716121|emb|CCH99606.1| Ycf20-like protein [Microcystis aeruginosa PCC 9717]
gi|389831302|emb|CCI26073.1| Ycf20-like protein [Microcystis aeruginosa PCC 9809]
Length = 113
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 52/107 (48%), Gaps = 4/107 (3%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL + + ++F +P RR I +L G + Q I + G DV AA L V
Sbjct: 7 RLSTLANVTSSRFNSFFGNPWRRISLQIICVLFGIFSGQAIVTTAGQTAQWDVTAAGLLV 66
Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
L TE ++R Y + P +L N K+G TY LF++AFK+ S
Sbjct: 67 LFTEAISRIVYRKSSQAKPAPILRESFNLLKIGITYSLFLEAFKIGS 113
>gi|56750558|ref|YP_171259.1| hypothetical protein syc0549_c [Synechococcus elongatus PCC 6301]
gi|56685517|dbj|BAD78739.1| hypothetical protein YCF20 [Synechococcus elongatus PCC 6301]
Length = 112
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 46/94 (48%), Gaps = 5/94 (5%)
Query: 73 NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY--- 129
+F+ P RR I LL GF++ T+SL G GV D A + + E + YY
Sbjct: 19 QWFQQPWRRLSLQIIFLLFGFWLTNTLSLVAGQSGVWDPSFAAVITISVETCSWLYYRGP 78
Query: 130 --SRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+P + P LL N K+G YGL ++ KL S
Sbjct: 79 GAGKPDRSLPFLLLQNLKLGALYGLALETLKLGS 112
>gi|166363096|ref|YP_001655369.1| hypothetical protein MAE_03550 [Microcystis aeruginosa NIES-843]
gi|166085469|dbj|BAG00177.1| hypothetical protein MAE_03550 [Microcystis aeruginosa NIES-843]
Length = 111
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 52/107 (48%), Gaps = 4/107 (3%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL + + ++F +P RR I +L G + Q I + G DV AA L V
Sbjct: 5 RLSTLANVTSSRFNSFFGNPWRRISLQIICVLFGIFSGQAIVTTAGQTAQWDVTAAGLLV 64
Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
L TE ++R Y + P +L N K+G TY LF++AFK+ S
Sbjct: 65 LFTEAISRIVYRKSSQAKPAPILRESFNLLKIGITYSLFLEAFKIGS 111
>gi|300868146|ref|ZP_07112779.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300333849|emb|CBN57959.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 110
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 6/112 (5%)
Query: 53 ENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVI 112
+N RL+D++ E ++ ++P RR ISLL G ++ IS G D+
Sbjct: 2 QNTRLNRLIDVL---SEQLGDWLQNPWRRISLLIISLLFGSFLGGAISTIAGQAAEWDIF 58
Query: 113 AAVLCVLLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
AA L V LTE++ Y ++ + + +LN K+G Y LF++AFKL S
Sbjct: 59 AAGLLVALTEFINWIVYRGTRLVGRSLRIDILNCLKIGLIYSLFVEAFKLGS 110
>gi|302763017|ref|XP_002964930.1| hypothetical protein SELMODRAFT_68854 [Selaginella moellendorffii]
gi|300167163|gb|EFJ33768.1| hypothetical protein SELMODRAFT_68854 [Selaginella moellendorffii]
Length = 109
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 47/91 (51%), Gaps = 4/91 (4%)
Query: 75 FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV 134
FR R LL GFY A ++ G G DV+ A + V +TE + Y P V
Sbjct: 19 FREIRRNLPLKVFLLLLGFYSANALATVLGQTGDWDVLVAAIIVAVTEGIGFLMYKMPPV 78
Query: 135 T----FPLALLNNFKMGFTYGLFIDAFKLAS 161
T F + +LN +K G ++GLF+DAFKL S
Sbjct: 79 TEKLRFLVRMLNYWKAGLSFGLFVDAFKLGS 109
>gi|354568968|ref|ZP_08988128.1| protein of unknown function DUF565 [Fischerella sp. JSC-11]
gi|353539180|gb|EHC08672.1| protein of unknown function DUF565 [Fischerella sp. JSC-11]
Length = 110
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 55/106 (51%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL ++ ++ + +P RR IS L GF++ ++ + G D++ A V
Sbjct: 5 RLNNLFNVLTRRLAEWILNPWRRISILIISFLLGFFLGSAVATTAGQTAEWDIVVAGTLV 64
Query: 119 LLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
L TE +R +Y+R + F + +N K+G TY LF++AFKL S
Sbjct: 65 LATEIASRIFYNRSVLARQVFWVQAINYLKVGLTYSLFLEAFKLGS 110
>gi|113477257|ref|YP_723318.1| hypothetical protein Tery_3800 [Trichodesmium erythraeum IMS101]
gi|110168305|gb|ABG52845.1| protein of unknown function DUF565 [Trichodesmium erythraeum
IMS101]
Length = 110
Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 56/106 (52%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL + ++ + + +P RR ISLL G + A +S G G DV+ + C+
Sbjct: 5 RLNRLFNVILKRFEQWLVNPWRRISILLISLLFGNFAASAVSTIAGQEGYLDVLYTLACL 64
Query: 119 LLTEYVTRFYYSRP-KVT--FPLALLNNFKMGFTYGLFIDAFKLAS 161
+TE + Y R KV + +LN+ K+GF YGLF++AFKL S
Sbjct: 65 FITEALNWLVYGRRGKVARFLIIDILNSLKIGFIYGLFLEAFKLGS 110
>gi|428208508|ref|YP_007092861.1| hypothetical protein Chro_3535 [Chroococcidiopsis thermalis PCC
7203]
gi|428010429|gb|AFY88992.1| hypothetical protein Chro_3535 [Chroococcidiopsis thermalis PCC
7203]
Length = 110
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 55/106 (51%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
R+ ++ + ++ +F +P RR +SLL GF++ S G D+I A + V
Sbjct: 5 RINGLLGAIAAATQRWFFNPWRRLSVLIVSLLFGFFLGSAFSTIAGQAADWDIIGAGVLV 64
Query: 119 LLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+E + R YY+R + L LN K+G TY LF++AFKL S
Sbjct: 65 AGSEIIDRIYYNRIWQARRGLWLEALNALKIGLTYSLFLEAFKLGS 110
>gi|434389053|ref|YP_007099664.1| Protein of unknown function (DUF565) [Chamaesiphon minutus PCC
6605]
gi|428020043|gb|AFY96137.1| Protein of unknown function (DUF565) [Chamaesiphon minutus PCC
6605]
Length = 120
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 13/102 (12%)
Query: 73 NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR- 131
+ ++P RR ISLL G ++A T+S + G D+ A++ V + E V+ Y
Sbjct: 19 QWGQNPWRRISLVIISLLSGNFLATTVSTTTGQKADLDITVALILVTIVETVSWIAYGSN 78
Query: 132 --PKVTFP----------LALLNNFKMGFTYGLFIDAFKLAS 161
P+ P +A+LN+ K+G YGLF++AFKL S
Sbjct: 79 LGPRRPDPEAILGQRPLWIAILNSLKLGLIYGLFVEAFKLGS 120
>gi|119512830|ref|ZP_01631897.1| hypothetical protein N9414_02411 [Nodularia spumigena CCY9414]
gi|119462503|gb|EAW43473.1| hypothetical protein N9414_02411 [Nodularia spumigena CCY9414]
Length = 110
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 51/92 (55%), Gaps = 3/92 (3%)
Query: 73 NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYS-- 130
+F +P RR I+ GF++ IS G G+ D++ A + V+LTE +R +YS
Sbjct: 19 QWFLNPWRRLSLLLINFFFGFFLGTAISTIAGQRGLLDIVIAGILVVLTEVTSRIFYSPS 78
Query: 131 -RPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
K + + LN K+GF Y +F+++FKL S
Sbjct: 79 FLSKRSLLVESLNILKVGFIYSMFVESFKLGS 110
>gi|390437653|ref|ZP_10226185.1| Ycf20-like protein [Microcystis sp. T1-4]
gi|389838936|emb|CCI30307.1| Ycf20-like protein [Microcystis sp. T1-4]
Length = 113
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 52/107 (48%), Gaps = 4/107 (3%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL + + ++F +P RR I +L G + Q I + G DV AA L +
Sbjct: 7 RLSTLANVTSSRFNSFFGNPWRRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLL 66
Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
L TE ++R Y + P +L N K+G TY LF++AFK+ S
Sbjct: 67 LFTEAISRIVYRKSAQAKPAPILRESFNLLKIGITYSLFLEAFKIGS 113
>gi|425446456|ref|ZP_18826460.1| Ycf20-like protein [Microcystis aeruginosa PCC 9443]
gi|425456084|ref|ZP_18835795.1| Ycf20-like protein [Microcystis aeruginosa PCC 9807]
gi|389733318|emb|CCI02893.1| Ycf20-like protein [Microcystis aeruginosa PCC 9443]
gi|389802905|emb|CCI18105.1| Ycf20-like protein [Microcystis aeruginosa PCC 9807]
Length = 113
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 4/107 (3%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL + + +F +P RR I +L G + Q I + G DV AA L +
Sbjct: 7 RLSTLANVTSSRFNRFFGNPWRRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLL 66
Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
L TE ++R Y + P +L N K+G TY LF++AFK+ S
Sbjct: 67 LFTEAISRIVYRKSSQAKPAPILRESFNLLKIGITYSLFLEAFKIGS 113
>gi|428779733|ref|YP_007171519.1| hypothetical protein Dacsa_1477 [Dactylococcopsis salina PCC 8305]
gi|428694012|gb|AFZ50162.1| Protein of unknown function (DUF565) [Dactylococcopsis salina PCC
8305]
Length = 112
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 53/96 (55%), Gaps = 5/96 (5%)
Query: 71 SRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYS 130
++ +F +P R+ +SLL G ++A I S G G D+I A + ++ TE ++ F Y
Sbjct: 17 TQQFFLNPWRKITLLLLSLLSGIFIAVAIVTSAGQNGRLDIIVAAVLLIFTEGISWFVYR 76
Query: 131 RPKVTFP-----LALLNNFKMGFTYGLFIDAFKLAS 161
R T L +LN FK+GFTY L+I AF L S
Sbjct: 77 RYSSTDERNLGFLEVLNTFKIGFTYSLYIQAFILGS 112
>gi|428220739|ref|YP_007104909.1| hypothetical protein Syn7502_00625 [Synechococcus sp. PCC 7502]
gi|427994079|gb|AFY72774.1| Protein of unknown function (DUF565) [Synechococcus sp. PCC 7502]
Length = 109
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 49/90 (54%), Gaps = 2/90 (2%)
Query: 74 YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPK 133
+F +P RR IS L GF++A IS GA D A +L+TE ++ Y R K
Sbjct: 20 WFSNPWRRISLYIISPLLGFFLASIISTVSGAKSAFDPYVAAALLLVTELISFVAYRRTK 79
Query: 134 V--TFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ L +LN K+G YG+F++AFKL S
Sbjct: 80 EKRSLYLEILNLLKIGLIYGMFLEAFKLGS 109
>gi|302809637|ref|XP_002986511.1| hypothetical protein SELMODRAFT_48834 [Selaginella moellendorffii]
gi|300145694|gb|EFJ12368.1| hypothetical protein SELMODRAFT_48834 [Selaginella moellendorffii]
Length = 109
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 46/91 (50%), Gaps = 4/91 (4%)
Query: 75 FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV 134
FR R LL GFY A ++ G G DV+ A + V + E + Y P V
Sbjct: 19 FREIRRNLPLKVFLLLLGFYSANALATVLGQTGDWDVLVAAIIVAVIEGIGFLMYKMPPV 78
Query: 135 T----FPLALLNNFKMGFTYGLFIDAFKLAS 161
T F + +LN +K G ++GLF+DAFKL S
Sbjct: 79 TEKLRFLVRMLNYWKAGLSFGLFVDAFKLGS 109
>gi|78185616|ref|YP_378050.1| hypothetical protein Syncc9902_2049 [Synechococcus sp. CC9902]
gi|78169910|gb|ABB27007.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 110
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 49/88 (55%), Gaps = 1/88 (1%)
Query: 73 NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEY-VTRFYYSR 131
+ ++P RR I+L GF + ++ G L D +A+++ VL TE+ + R S
Sbjct: 22 QWTQNPWRRLSLLAIALFSGFLLGSAVTSVAGVLSQMDPVASLIVVLGTEFTLRRVRRSG 81
Query: 132 PKVTFPLALLNNFKMGFTYGLFIDAFKL 159
T P LL+ ++GF YGLF++AFKL
Sbjct: 82 ASWTLPEQLLDLSRIGFLYGLFLEAFKL 109
>gi|425469331|ref|ZP_18848277.1| Ycf20-like protein [Microcystis aeruginosa PCC 9701]
gi|389881508|emb|CCI37952.1| Ycf20-like protein [Microcystis aeruginosa PCC 9701]
Length = 113
Score = 52.4 bits (124), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 4/107 (3%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL + + ++F +P RR I +L G + Q I + G DV AA L +
Sbjct: 7 RLSTLANVTSSRFNSFFGNPWRRISLQLICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLL 66
Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
L TE +R Y + P +L N K+G TY LF++AFK+ S
Sbjct: 67 LFTEATSRIVYRKSSQAKPAPILRESLNLLKIGITYSLFLEAFKIGS 113
>gi|425462004|ref|ZP_18841478.1| Ycf20-like protein [Microcystis aeruginosa PCC 9808]
gi|389825104|emb|CCI25466.1| Ycf20-like protein [Microcystis aeruginosa PCC 9808]
Length = 113
Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 4/107 (3%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL + + ++F +P RR I +L G + Q I + G DV AA L +
Sbjct: 7 RLSTLANVTSSRFNSFFGNPWRRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLL 66
Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
L TE +R Y + P +L N K+G TY LF++AFK+ S
Sbjct: 67 LFTEATSRIVYRKSSQAKPAPILRESLNLLKIGITYSLFLEAFKIGS 113
>gi|440752489|ref|ZP_20931692.1| putative protein Ycf20 [Microcystis aeruginosa TAIHU98]
gi|443648037|ref|ZP_21129867.1| putative protein Ycf20 [Microcystis aeruginosa DIANCHI905]
gi|440176982|gb|ELP56255.1| putative protein Ycf20 [Microcystis aeruginosa TAIHU98]
gi|443335303|gb|ELS49778.1| putative protein Ycf20 [Microcystis aeruginosa DIANCHI905]
Length = 111
Score = 52.0 bits (123), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 4/107 (3%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL + + ++F +P RR I +L G + Q I + G DV AA L +
Sbjct: 5 RLSTLANVTSSRFNSFFGNPWRRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLL 64
Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
L TE +R Y + P +L N K+G TY LF++AFK+ S
Sbjct: 65 LFTEATSRIVYRKSSQAKPAPILRESFNLLKIGITYSLFLEAFKIGS 111
>gi|425436198|ref|ZP_18816636.1| Ycf20-like protein [Microcystis aeruginosa PCC 9432]
gi|425449828|ref|ZP_18829661.1| Ycf20-like protein [Microcystis aeruginosa PCC 7941]
gi|159028644|emb|CAO88115.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|389679122|emb|CCH92059.1| Ycf20-like protein [Microcystis aeruginosa PCC 9432]
gi|389769599|emb|CCI05595.1| Ycf20-like protein [Microcystis aeruginosa PCC 7941]
Length = 113
Score = 52.0 bits (123), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 4/107 (3%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL + + ++F +P RR I +L G + Q I + G DV AA L +
Sbjct: 7 RLSTLANVTSSRFNSFFGNPWRRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLL 66
Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
L TE +R Y + P +L N K+G TY LF++AFK+ S
Sbjct: 67 LFTEATSRIVYRKSSQAKPAPILRESFNLLKIGITYSLFLEAFKIGS 113
>gi|220909685|ref|YP_002484996.1| hypothetical protein Cyan7425_4325 [Cyanothece sp. PCC 7425]
gi|219866296|gb|ACL46635.1| protein of unknown function DUF565 [Cyanothece sp. PCC 7425]
Length = 110
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 49/93 (52%), Gaps = 3/93 (3%)
Query: 72 RNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR 131
R R+P RR +SLL G Y+ ++ + G LG D++A+ L L+ E ++ +YS
Sbjct: 18 RQQLRNPWRRLSVLVMSLLFGVYLGTALASTAGQLGEIDIVASALVALVAEVISWLFYSD 77
Query: 132 P---KVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
K T +LN K G YGLF+ AF L S
Sbjct: 78 RWNFKATLFGEILNAVKFGVLYGLFLIAFLLGS 110
>gi|75907255|ref|YP_321551.1| hypothetical protein Ava_1032 [Anabaena variabilis ATCC 29413]
gi|75700980|gb|ABA20656.1| Protein of unknown function DUF565 [Anabaena variabilis ATCC 29413]
Length = 110
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 54/106 (50%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL ++ + +F +P RR IS L G ++ IS + G D++ A V
Sbjct: 5 RLNNLFDTIARQLGQWFLNPWRRLSLLLISFLFGIFLGTAISTTAGQRSELDIVIAAFLV 64
Query: 119 LLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
LTE +R +Y + + + + LN K+GFTY LFI+AFKL S
Sbjct: 65 FLTEVTSRIFYMQGFFARRSLLVESLNLLKVGFTYSLFIEAFKLGS 110
>gi|427722143|ref|YP_007069420.1| hypothetical protein Lepto7376_0138 [Leptolyngbya sp. PCC 7376]
gi|427353863|gb|AFY36586.1| hypothetical protein Lepto7376_0138 [Leptolyngbya sp. PCC 7376]
Length = 120
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 48/101 (47%), Gaps = 13/101 (12%)
Query: 74 YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPK 133
+F++P RR ISLL G ++ + + G D + ++ E V+R YYSRPK
Sbjct: 20 FFQNPWRRFALTIISLLFGNFIGTVVPTTSGQKAAWDPVVGAGILIFIELVSRIYYSRPK 79
Query: 134 V-------------TFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ LLN FK+G YGL ++A +L S
Sbjct: 80 LFQTSTDQDISKGNHLCFDLLNIFKIGLIYGLIVEALELGS 120
>gi|254417601|ref|ZP_05031338.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196175623|gb|EDX70650.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 118
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 59/122 (48%), Gaps = 18/122 (14%)
Query: 53 ENGGPRRLVDIVRLVPELSR--NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVND 110
+N LVD++ L+R +F +P RR + LL G ++ IS + G D
Sbjct: 2 QNTRLNTLVDVI-----LARLAQWFTNPWRRLSIILLGLLSGIFLGSAISTTAGQTADWD 56
Query: 111 VIAAVLCVLLTEYVTRFYYSRPKVTFP-----------LALLNNFKMGFTYGLFIDAFKL 159
VIAA + ++ TE V+R Y R + P L +N K+G Y +F++AFKL
Sbjct: 57 VIAAGILIIFTETVSRIVYGRNRQRLPSLSRASRLSWFLETVNALKIGLVYSMFLEAFKL 116
Query: 160 AS 161
S
Sbjct: 117 GS 118
>gi|411119550|ref|ZP_11391930.1| Protein of unknown function (DUF565) [Oscillatoriales
cyanobacterium JSC-12]
gi|410711413|gb|EKQ68920.1| Protein of unknown function (DUF565) [Oscillatoriales
cyanobacterium JSC-12]
Length = 110
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 46/92 (50%), Gaps = 3/92 (3%)
Query: 73 NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP 132
N+ ++P RR ISLL G ++ IS G D++ A + +TE V R Y
Sbjct: 19 NWLQNPWRRWSVLIISLLFGNFLGTAISTISGQQADLDILVAGILTGVTEVVNRIAYGTS 78
Query: 133 KVT---FPLALLNNFKMGFTYGLFIDAFKLAS 161
T P+ LN K+G Y LF++AFKL S
Sbjct: 79 AATRRSLPVQFLNALKVGVVYSLFVEAFKLGS 110
>gi|158337933|ref|YP_001519109.1| hypothetical protein AM1_4820 [Acaryochloris marina MBIC11017]
gi|359463702|ref|ZP_09252265.1| hypothetical protein ACCM5_33604 [Acaryochloris sp. CCMEE 5410]
gi|158308174|gb|ABW29791.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 110
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 54/107 (50%), Gaps = 5/107 (4%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL +V + + + R+P RR I+LL G Y+ + + G L D+ A L +
Sbjct: 5 RLNTLVSGLGQQLKQELRNPWRRISVLVIALLFGIYIGVSFAAIAGQLAYLDITVAALVL 64
Query: 119 LLTEYVTRFYYSRPKVTFPLAL----LNNFKMGFTYGLFIDAFKLAS 161
L TE V + + R + AL LN FK+G TYGLF+ AF L S
Sbjct: 65 LFTE-VVNWIFHRNRGNARKALWGEALNTFKLGLTYGLFVIAFMLGS 110
>gi|284928790|ref|YP_003421312.1| hypothetical protein UCYN_02050 [cyanobacterium UCYN-A]
gi|284809249|gb|ADB94954.1| Protein of unknown function (DUF565) [cyanobacterium UCYN-A]
Length = 110
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 47/95 (49%), Gaps = 4/95 (4%)
Query: 70 LSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY 129
LSR F +P R S+L GF++ Q I+L G ++ ++ ++ TE ++ Y
Sbjct: 17 LSRT-FSNPWRNLFLNFTSILVGFFIGQAIALIAGQESYWEITVGIILLVFTEVSSKIIY 75
Query: 130 SR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
R K L N FKMG Y L+I+A KL S
Sbjct: 76 GRKRTKKKILWLETFNFFKMGIVYSLYIEALKLGS 110
>gi|302790716|ref|XP_002977125.1| hypothetical protein SELMODRAFT_59700 [Selaginella moellendorffii]
gi|300155101|gb|EFJ21734.1| hypothetical protein SELMODRAFT_59700 [Selaginella moellendorffii]
Length = 109
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 46/91 (50%), Gaps = 4/91 (4%)
Query: 75 FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV 134
FR R LL GFY A ++ G G D++ A + V + E + Y P V
Sbjct: 19 FREIRRNLPLKVFLLLLGFYSANALATVLGQTGDWDMLVAAIIVAVIEGIGFLMYKMPPV 78
Query: 135 T----FPLALLNNFKMGFTYGLFIDAFKLAS 161
T F + +LN +K G ++GLF+DAFK+ S
Sbjct: 79 TEKLRFLVRMLNYWKAGLSFGLFVDAFKVGS 109
>gi|119485209|ref|ZP_01619594.1| hypothetical protein L8106_07164 [Lyngbya sp. PCC 8106]
gi|119457437|gb|EAW38562.1| hypothetical protein L8106_07164 [Lyngbya sp. PCC 8106]
Length = 110
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 54/106 (50%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL + ++ E + R+P RR IS+L G + IS G DV AA + +
Sbjct: 5 RLNRLTNVLSERLGRWLRNPWRRISLLTISVLFGSFSGIVISTIAGQQAEWDVTAAFILL 64
Query: 119 LLTEYVTRFYY-SRPKVTFPLAL--LNNFKMGFTYGLFIDAFKLAS 161
L+TE + Y S+P + PL + LN FK+G Y LF+ AF L S
Sbjct: 65 LITEGINWLTYRSKPLIRQPLWIENLNAFKIGVVYSLFLLAFLLGS 110
>gi|224101247|ref|XP_002312200.1| predicted protein [Populus trichocarpa]
gi|222852020|gb|EEE89567.1| predicted protein [Populus trichocarpa]
Length = 213
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 42/79 (53%), Gaps = 6/79 (7%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP------KVTFPLALLN 142
LL GFY A ++ G G DV+ A + V E + Y +P ++ + +A++N
Sbjct: 135 LLLGFYTANALATILGQTGDWDVLVAGIVVAAIEGIGMLMYKKPSSLSTRRLQYFVAMMN 194
Query: 143 NFKMGFTYGLFIDAFKLAS 161
+K G GLF+DAFKL S
Sbjct: 195 YWKAGVCLGLFVDAFKLGS 213
>gi|17230364|ref|NP_486912.1| hypothetical protein alr2872 [Nostoc sp. PCC 7120]
gi|17131966|dbj|BAB74571.1| alr2872 [Nostoc sp. PCC 7120]
Length = 110
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 48/92 (52%), Gaps = 3/92 (3%)
Query: 73 NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR- 131
+F +P RR IS L G ++ IS + G D++ A V LTE +R +Y +
Sbjct: 19 QWFLNPWRRLSLLLISFLFGIFLGTAISTTAGQRSELDIVIAAFLVFLTEVTSRIFYMQG 78
Query: 132 --PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ + + LN K+GF Y LFI+AFKL S
Sbjct: 79 FFARRSLLVESLNLLKVGFIYSLFIEAFKLGS 110
>gi|291335270|gb|ADD94889.1| hypothetical protein BL107_08104 [uncultured marine bacterium
MedDCM-OCT-S09-C166]
Length = 80
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 40/69 (57%), Gaps = 1/69 (1%)
Query: 92 GFYVAQTISLSFGALGVNDVIAAVLCVLLTEY-VTRFYYSRPKVTFPLALLNNFKMGFTY 150
GF + I+ G L D +AA+L VL TE+ + R S P + P LL+ ++GF Y
Sbjct: 11 GFLIGSAITSVSGVLAQMDPVAALLVVLGTEFTIRRRRSSEPSLKLPQQLLDLGRIGFLY 70
Query: 151 GLFIDAFKL 159
GLF++ FKL
Sbjct: 71 GLFLEGFKL 79
>gi|160331763|ref|XP_001712588.1| hypothetical protein HAN_3g461 [Hemiselmis andersenii]
gi|159766037|gb|ABW98263.1| hypothetical protein HAN_3g461 [Hemiselmis andersenii]
Length = 146
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 36/57 (63%)
Query: 104 GALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLA 160
G++ D +AA + + E T+ +YS K+T L LLN+FK+G G+F+DAFK+
Sbjct: 89 GSVADWDPLAAAVLLCWIELFTKLFYSSEKLTNFLKLLNSFKIGIILGMFVDAFKIT 145
>gi|428773704|ref|YP_007165492.1| hypothetical protein Cyast_1888 [Cyanobacterium stanieri PCC 7202]
gi|428687983|gb|AFZ47843.1| protein of unknown function (DUF565) [Cyanobacterium stanieri PCC
7202]
Length = 109
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 45/91 (49%), Gaps = 2/91 (2%)
Query: 73 NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP 132
++F +P RR I+ L GF++A T + G G DV A ++ TE Y
Sbjct: 19 DFFSNPWRRLSLNIIAFLLGFFLASTTAAIAGQAGKWDVTYAFFFLVFTEGSNIIIYKNR 78
Query: 133 KVTFPL--ALLNNFKMGFTYGLFIDAFKLAS 161
PL LN FK+GF Y L+++ FKL S
Sbjct: 79 NPNKPLWRTTLNAFKIGFAYCLYLEGFKLGS 109
>gi|434389388|ref|YP_007099999.1| Protein of unknown function (DUF565) [Chamaesiphon minutus PCC
6605]
gi|428020378|gb|AFY96472.1| Protein of unknown function (DUF565) [Chamaesiphon minutus PCC
6605]
Length = 121
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 54/111 (48%), Gaps = 15/111 (13%)
Query: 65 RLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYV 124
RL + R + +P RR ISLL G ++A I+ S G DV A++ V + E +
Sbjct: 12 RLTRQFGR-WVENPWRRLSIIIISLLFGNFLASAIATSTGQRANLDVFASLTLVAIVESI 70
Query: 125 TRFYYS----------RPKVT----FPLALLNNFKMGFTYGLFIDAFKLAS 161
Y R ++ +A+LN+ K+G YGLF++AFKL S
Sbjct: 71 GWLTYGTIFGRRSTEGRDQIIGQRPISIAILNSLKLGLMYGLFVEAFKLGS 121
>gi|22298711|ref|NP_681958.1| hypothetical protein tlr1168 [Thermosynechococcus elongatus BP-1]
gi|22294892|dbj|BAC08720.1| ycf20 [Thermosynechococcus elongatus BP-1]
Length = 110
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 3/93 (3%)
Query: 72 RNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYS- 130
+ ++P RR I+ L G ++ IS S G LG D+IA+ + + E ++ +YS
Sbjct: 18 QEQLKNPWRRLATLSIAFLFGVFLGLAISSSAGQLGYLDIIASSMVAIAAEVISALFYSD 77
Query: 131 RPKVTFPL--ALLNNFKMGFTYGLFIDAFKLAS 161
R K+ L +LN K G YGLF+ AF L S
Sbjct: 78 RWKLRQTLFGEMLNALKFGLLYGLFLVAFLLGS 110
>gi|168010839|ref|XP_001758111.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690567|gb|EDQ76933.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 129
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 43/79 (54%), Gaps = 6/79 (7%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR------PKVTFPLALLN 142
LL GF+ A ++ G G DVIAA + V + E V Y+R + + L+N
Sbjct: 51 LLFGFFSANALATILGQTGDWDVIAAGILVAIIEGVGHLVYNRMPAFLGERGKPIIELVN 110
Query: 143 NFKMGFTYGLFIDAFKLAS 161
+K+GF + LF+DAFK+ S
Sbjct: 111 YWKIGFEFALFVDAFKVGS 129
>gi|116072099|ref|ZP_01469367.1| hypothetical protein BL107_08104 [Synechococcus sp. BL107]
gi|116065722|gb|EAU71480.1| hypothetical protein BL107_08104 [Synechococcus sp. BL107]
Length = 80
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 44/74 (59%), Gaps = 1/74 (1%)
Query: 87 ISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEY-VTRFYYSRPKVTFPLALLNNFK 145
I+LL GF + ++ G L D +A+++ VL TE+ + R S T P LL+ +
Sbjct: 6 IALLSGFLLGSAVTSVAGVLSQMDPVASLIVVLGTEFTLRRVRRSGGSWTLPEQLLDLSR 65
Query: 146 MGFTYGLFIDAFKL 159
+GF YGLF++AFKL
Sbjct: 66 IGFLYGLFLEAFKL 79
>gi|428776926|ref|YP_007168713.1| hypothetical protein PCC7418_2350 [Halothece sp. PCC 7418]
gi|428691205|gb|AFZ44499.1| hypothetical protein PCC7418_2350 [Halothece sp. PCC 7418]
Length = 112
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 51/95 (53%), Gaps = 5/95 (5%)
Query: 72 RNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR 131
R +F +P R+ ISLL G ++ I S G G D+I A L ++LTE ++ F Y +
Sbjct: 18 RQFFVNPWRKITSLLISLLLGIFMGIAIVTSAGQNGRLDIIVAALLLILTEVLSWFVYRQ 77
Query: 132 -----PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ + + ++N FK+G Y L+I AF L S
Sbjct: 78 NLEQNEQRSGLIEVMNTFKIGLVYSLYIQAFVLGS 112
>gi|443323738|ref|ZP_21052741.1| Protein of unknown function (DUF565) [Gloeocapsa sp. PCC 73106]
gi|442786524|gb|ELR96254.1| Protein of unknown function (DUF565) [Gloeocapsa sp. PCC 73106]
Length = 107
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 52/103 (50%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL ++V + +N+ +P R + ++ GF+ I + G + DV +A +
Sbjct: 5 RLDNLVNALGLRLQNFLANPWRNIVLSLTTVFFGFFSGTAIISTAGQRSLMDVSSAASLL 64
Query: 119 LLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
++ E + R+ Y K F +LN FK+G TY +F+ AF + S
Sbjct: 65 VVFELINRWVYQSKKPRFWTNILNIFKIGVTYSIFLQAFIVGS 107
>gi|428211916|ref|YP_007085060.1| hypothetical protein Oscil6304_1433 [Oscillatoria acuminata PCC
6304]
gi|428000297|gb|AFY81140.1| Protein of unknown function (DUF565) [Oscillatoria acuminata PCC
6304]
Length = 110
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 50/98 (51%), Gaps = 4/98 (4%)
Query: 67 VPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTR 126
V EL R R+P RR IS L G +V I+ G DV+ A + ++LTE +
Sbjct: 14 VSEL-RQSLRNPWRRISVVLISFLLGNFVGTAITTIAGQNANLDVMVAAVLIVLTEVYSY 72
Query: 127 FYY---SRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
F+Y + K + +N+FK+G Y LF+ AF L S
Sbjct: 73 FFYRIHDQMKRSLLAQGINSFKIGMIYNLFVLAFTLGS 110
>gi|428768705|ref|YP_007160495.1| hypothetical protein Cyan10605_0303 [Cyanobacterium aponinum PCC
10605]
gi|428682984|gb|AFZ52451.1| protein of unknown function DUF565 [Cyanobacterium aponinum PCC
10605]
Length = 110
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 48/94 (51%), Gaps = 5/94 (5%)
Query: 72 RNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY-- 129
+++ +P R+ F I L G+ + +S S G DV A ++ TE + Y
Sbjct: 18 KSFLNNPWRKLSFVIIGFLSGYITSDLLSTSLAQAGKWDVPMATTYLIFTEITSMIVYGG 77
Query: 130 --SRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+R K+ + LLN+FK+G +GL++ A LAS
Sbjct: 78 SKNRNKIMWA-DLLNSFKIGLAFGLYLSAMTLAS 110
>gi|427713331|ref|YP_007061955.1| hypothetical protein Syn6312_2297 [Synechococcus sp. PCC 6312]
gi|427377460|gb|AFY61412.1| Protein of unknown function (DUF565) [Synechococcus sp. PCC 6312]
Length = 110
Score = 45.4 bits (106), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 3/106 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL ++ + + R+P RR I+LL G Y+ ++ + G +G D++A+ L
Sbjct: 5 RLTTLLNRTGDNLQQQLRNPWRRLALMMIALLFGIYLGAALASTAGQVGYIDIVASALVA 64
Query: 119 LLTEYVTRFYYSRP---KVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ +E ++ +Y + T LN K+G YGLF+ AF L S
Sbjct: 65 IASEVLSAVFYGNRWNFQKTILGETLNAIKVGVLYGLFLIAFLLGS 110
>gi|18408276|ref|NP_564851.1| Ycf20-like protein [Arabidopsis thaliana]
gi|209572697|sp|O80813.2|YC20L_ARATH RecName: Full=Ycf20-like protein
gi|107738233|gb|ABF83666.1| At1g65420 [Arabidopsis thaliana]
gi|332196250|gb|AEE34371.1| Ycf20-like protein [Arabidopsis thaliana]
Length = 197
Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 7/80 (8%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP-------KVTFPLALL 141
LL GFY A ++ G G DV+ A + V E + Y +P K+ + +
Sbjct: 118 LLLGFYTANALATILGQTGDWDVLVAGIVVAAIEGIGMLMYKKPSSSMFSGKLQSFVVFM 177
Query: 142 NNFKMGFTYGLFIDAFKLAS 161
N +K G GLF+DAFKL S
Sbjct: 178 NFWKAGVCLGLFVDAFKLGS 197
>gi|33864834|ref|NP_896393.1| hypothetical protein SYNW0298 [Synechococcus sp. WH 8102]
gi|33632357|emb|CAE06813.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 107
Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 47/94 (50%), Gaps = 2/94 (2%)
Query: 66 LVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT 125
LV +L + +P RR I+ L GF V I+ G LG D + A++ VL TE
Sbjct: 15 LVQQL-ETWAGNPWRRLSVLCIAALFGFLVGSAITSVAGVLGQMDPVGALIVVLGTEATV 73
Query: 126 RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKL 159
R P T +L ++G YGLF++AFKL
Sbjct: 74 RLRIVGPS-TLLQQVLTVSRIGLLYGLFLEAFKL 106
>gi|21593789|gb|AAM65756.1| antigen receptor, putative [Arabidopsis thaliana]
Length = 197
Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 7/80 (8%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP-------KVTFPLALL 141
LL GFY A ++ G G DV+ A + V E + Y +P K+ + +
Sbjct: 118 LLLGFYTANALATILGQTGDWDVLVAGIVVAAIEGIGMLMYKKPSSSMFSGKLQSFVVFM 177
Query: 142 NNFKMGFTYGLFIDAFKLAS 161
N +K G GLF+DAFKL S
Sbjct: 178 NFWKAGVCLGLFVDAFKLGS 197
>gi|297841047|ref|XP_002888405.1| hypothetical protein ARALYDRAFT_894093 [Arabidopsis lyrata subsp.
lyrata]
gi|297334246|gb|EFH64664.1| hypothetical protein ARALYDRAFT_894093 [Arabidopsis lyrata subsp.
lyrata]
Length = 198
Score = 45.1 bits (105), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 7/80 (8%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP-------KVTFPLALL 141
LL GFY A ++ G G DV+ A + V E + Y +P K+ + +
Sbjct: 119 LLLGFYTANALATILGQTGDWDVLVAGIVVAAIEGIGMLMYKKPSSSMFSGKLQSFVVFM 178
Query: 142 NNFKMGFTYGLFIDAFKLAS 161
N +K G GLF+DAFKL S
Sbjct: 179 NFWKAGVCLGLFVDAFKLGS 198
>gi|71842309|ref|YP_277397.1| hypothetical chloroplast RF20 [Emiliania huxleyi]
gi|60101552|gb|AAX13896.1| hypothetical chloroplast RF20 [Emiliania huxleyi]
gi|336286220|gb|AEI29556.1| hypothetical chloroplast RF20 [Emiliania huxleyi]
Length = 90
Score = 45.1 bits (105), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 39/81 (48%), Gaps = 10/81 (12%)
Query: 91 GGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYS----------RPKVTFPLAL 140
GGF +A + G ++ A L V E ++ YYS + + L L
Sbjct: 10 GGFLLASIFDTTMGEFQDWSILGAALTVASVETASKIYYSFISRVQLLSSKTYYSETLFL 69
Query: 141 LNNFKMGFTYGLFIDAFKLAS 161
LN+FK+G YGL +DAFKL S
Sbjct: 70 LNSFKLGLIYGLIVDAFKLGS 90
>gi|108773201|ref|YP_635793.1| hypothetical protein RF20 [Chara vulgaris]
gi|108773301|ref|YP_635808.1| hypothetical protein RF20 [Chara vulgaris]
gi|77157857|gb|ABA61898.1| hypothetical protein RF20 [Chara vulgaris]
gi|77157957|gb|ABA61998.1| hypothetical protein RF20 [Chara vulgaris]
Length = 137
Score = 44.7 bits (104), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 44/84 (52%), Gaps = 11/84 (13%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPK--VTFPLA------- 139
LL GF+ A T+S + D++ AV+ ++ E + Y P + F +
Sbjct: 54 LLLGFFHANTLSTILAQVADWDILIAVILSVMVESIGLLIYQIPSRILLFTICKERRKKS 113
Query: 140 --LLNNFKMGFTYGLFIDAFKLAS 161
LLN +K+GFTY LF+D+FK+ S
Sbjct: 114 LFLLNVWKIGFTYALFVDSFKVGS 137
>gi|87301895|ref|ZP_01084729.1| hypothetical protein WH5701_01180 [Synechococcus sp. WH 5701]
gi|87283463|gb|EAQ75418.1| hypothetical protein WH5701_01180 [Synechococcus sp. WH 5701]
Length = 135
Score = 44.3 bits (103), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 52/104 (50%), Gaps = 3/104 (2%)
Query: 59 RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
RL V L+ E R++ +P RR I+ G F+ + + GAL D +AA++CV
Sbjct: 31 RLHRQVALLGERLRHWAENPWRRLSLVLITGFGAFFFGGAVGMLTGALNYLDPLAALVCV 90
Query: 119 LLTEY---VTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKL 159
+ E R+ RP+ L L++ ++G YGL + FKL
Sbjct: 91 VPIELSIRCRRWLMQRPQDRLSLQLVDAGRIGLLYGLLQEGFKL 134
>gi|11466439|ref|NP_038445.1| hypothetical chloroplast RF20 [Mesostigma viride]
gi|12230812|sp|Q9MUL5.1|YCF20_MESVI RecName: Full=Uncharacterized protein ycf20; Short=RF20
gi|7259582|gb|AAF43883.1|AF166114_95 hypothetical chloroplast RF20 (chloroplast) [Mesostigma viride]
Length = 116
Score = 44.3 bits (103), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 42/79 (53%), Gaps = 9/79 (11%)
Query: 92 GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP----KVTFPLALL-----N 142
GF++A ++ FG G DV+ A + V + E + YS+ K F ++ L N
Sbjct: 38 GFFIATVLATVFGQTGDWDVLVAGILVAMIEILGNKMYSKKYISKKQVFDISFLSLIGIN 97
Query: 143 NFKMGFTYGLFIDAFKLAS 161
K+G +GLF+DAFKL S
Sbjct: 98 YIKIGLIFGLFVDAFKLGS 116
>gi|3335348|gb|AAC27150.1|AAC27150 T8F5.20 [Arabidopsis thaliana]
Length = 121
Score = 44.3 bits (103), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 7/80 (8%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP-------KVTFPLALL 141
LL GFY A ++ G G DV+ A + V E + Y +P K+ + +
Sbjct: 42 LLLGFYTANALATILGQTGDWDVLVAGIVVAAIEGIGMLMYKKPSSSMFSGKLQSFVVFM 101
Query: 142 NNFKMGFTYGLFIDAFKLAS 161
N +K G GLF+DAFKL S
Sbjct: 102 NFWKAGVCLGLFVDAFKLGS 121
>gi|443316319|ref|ZP_21045768.1| Protein of unknown function (DUF565) [Leptolyngbya sp. PCC 6406]
gi|442784083|gb|ELR93974.1| Protein of unknown function (DUF565) [Leptolyngbya sp. PCC 6406]
Length = 112
Score = 44.3 bits (103), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 49/104 (47%), Gaps = 8/104 (7%)
Query: 66 LVPELSRNYFR---SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTE 122
LV L+ + R +P RR I LL G + I+ G DV+ + + E
Sbjct: 9 LVDTLANQFGRWVFNPWRRLSLVIIGLLFGNFFGIAIASIAGQTADLDVVVSAFLLAAVE 68
Query: 123 YVTRFYYSRPK-----VTFPLALLNNFKMGFTYGLFIDAFKLAS 161
+ Y RP+ + L +LN FK+G TY LF++AFKL S
Sbjct: 69 ITSWIVYRRPRPKEGQSSLLLEVLNAFKIGITYALFVEAFKLGS 112
>gi|330040530|ref|XP_003239945.1| hypothetical protein CPARA_3gp389 [Cryptomonas paramecium]
gi|327206871|gb|AEA39047.1| hypothetical protein CPARA_3gp389 [Cryptomonas paramecium]
Length = 137
Score = 44.3 bits (103), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 40/72 (55%)
Query: 90 LGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFT 149
L GF++A + G++ D +AA + + E+ +YS + L+NNFK+G
Sbjct: 66 LFGFFLANSAITIVGSVADWDPLAAAIMLCWIEFFNCKFYSSDFNSIYFNLVNNFKIGII 125
Query: 150 YGLFIDAFKLAS 161
+G+F+DA KL S
Sbjct: 126 FGIFVDALKLTS 137
>gi|332707378|ref|ZP_08427428.1| protein of unknown function, DUF565 [Moorea producens 3L]
gi|332353869|gb|EGJ33359.1| protein of unknown function, DUF565 [Moorea producens 3L]
Length = 116
Score = 44.3 bits (103), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 51/99 (51%), Gaps = 13/99 (13%)
Query: 74 YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR-- 131
+F +P RR IS L GF V Q + + G G +++ +L V TE V R YS
Sbjct: 20 WFANPWRRLSMVLISFLFGFLVGQALCTTAGT-GQEWILSGMLIVF-TEAVNRLIYSGKF 77
Query: 132 PK---------VTFPLALLNNFKMGFTYGLFIDAFKLAS 161
PK V++ ++N FK+G Y ++++AFKL S
Sbjct: 78 PKGQLNQTVDRVSWLADVVNAFKIGLVYSMYLEAFKLGS 116
>gi|115478380|ref|NP_001062785.1| Os09g0287500 [Oryza sativa Japonica Group]
gi|50251186|dbj|BAD29674.1| antigen receptor-like protein [Oryza sativa Japonica Group]
gi|50252920|dbj|BAD29174.1| antigen receptor-like protein [Oryza sativa Japonica Group]
gi|113631018|dbj|BAF24699.1| Os09g0287500 [Oryza sativa Japonica Group]
gi|215765531|dbj|BAG87228.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641230|gb|EEE69362.1| hypothetical protein OsJ_28696 [Oryza sativa Japonica Group]
Length = 196
Score = 44.3 bits (103), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 48/103 (46%), Gaps = 11/103 (10%)
Query: 65 RLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYV 124
L+ +L+ +SP++ L +L GFY A ++ G G DV A + V E +
Sbjct: 99 ELLEKLNSARNKSPTKIFL-----VLLGFYTANALATVLGQTGDWDVFVAAIVVATIEGI 153
Query: 125 TRFYYSRPKVTFP------LALLNNFKMGFTYGLFIDAFKLAS 161
Y +P P + ++N +K G G F+DAFK+ S
Sbjct: 154 GMLMYRKPASRPPGRFWSMITMVNYWKAGVCLGFFVDAFKVGS 196
>gi|428313684|ref|YP_007124661.1| hypothetical protein Mic7113_5625 [Microcoleus sp. PCC 7113]
gi|428255296|gb|AFZ21255.1| Protein of unknown function (DUF565) [Microcoleus sp. PCC 7113]
Length = 118
Score = 43.9 bits (102), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 48/99 (48%), Gaps = 11/99 (11%)
Query: 74 YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPK 133
+F +P R +SLL G ++ I + G D++ A + +L TE ++ Y++
Sbjct: 20 WFANPWRHLSLVLMSLLLGTFLGSAIPATAGQAAKLDLVGAGVLILFTEMISWIVYAKSP 79
Query: 134 VTFPLA-----------LLNNFKMGFTYGLFIDAFKLAS 161
PL +LN K+G TY +F++AFKL S
Sbjct: 80 QGRPLTENSRGLSLLTQILNAMKIGMTYSMFVEAFKLGS 118
>gi|70671583|gb|AAZ06217.1| putative antigen receptor [Oryza sativa Indica Group]
gi|70671616|gb|AAZ06247.1| putative antigen receptor [Oryza sativa Indica Group]
gi|125563084|gb|EAZ08464.1| hypothetical protein OsI_30729 [Oryza sativa Indica Group]
Length = 197
Score = 43.9 bits (102), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 48/103 (46%), Gaps = 11/103 (10%)
Query: 65 RLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYV 124
L+ +L+ +SP++ L +L GFY A ++ G G DV A + V E +
Sbjct: 100 ELLEKLNSARNKSPTKIFL-----VLLGFYTANALATVLGQTGDWDVFVAAIVVATIEGI 154
Query: 125 TRFYYSRPKVTFP------LALLNNFKMGFTYGLFIDAFKLAS 161
Y +P P + ++N +K G G F+DAFK+ S
Sbjct: 155 GMLMYRKPASRPPGRFWSMITMVNYWKAGVCLGFFVDAFKVGS 197
>gi|399949854|gb|AFP65511.1| hypothetical protein CMESO_344 [Chroomonas mesostigmatica CCMP1168]
Length = 152
Score = 43.9 bits (102), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 34/56 (60%)
Query: 104 GALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKL 159
G++ D +AA + + E T+ +Y+ K T L+N+FK+G G+F+DAFK+
Sbjct: 95 GSVADWDPLAAAVMLCWIELFTKIFYATEKSTLYFRLINSFKIGINLGMFVDAFKI 150
>gi|318042681|ref|ZP_07974637.1| hypothetical protein SCB01_13289 [Synechococcus sp. CB0101]
Length = 115
Score = 43.5 bits (101), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 54/123 (43%), Gaps = 13/123 (10%)
Query: 40 SSSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTI 99
SSS + R Q GG P L + R+P RR I LL F++ +
Sbjct: 2 SSSRPLQRTRLQQSLGGA---------WPRLE-QWARNPWRRLSVLLIVLLSTFFLGNAV 51
Query: 100 SLSFGALGVNDVIAAVLCVLLTEYVTRF---YYSRPKVTFPLALLNNFKMGFTYGLFIDA 156
S GA D AA++CV+L E R R L LL+ ++GF YGL ++
Sbjct: 52 STIIGARAFLDPPAALICVVLLELAIRARKPLLRRGGDRLGLELLDMSRIGFCYGLLLEG 111
Query: 157 FKL 159
FKL
Sbjct: 112 FKL 114
>gi|79315375|ref|NP_001030875.1| uncharacterized protein [Arabidopsis thaliana]
gi|145332877|ref|NP_001078304.1| uncharacterized protein [Arabidopsis thaliana]
gi|9663002|emb|CAC00746.1| putative protein [Arabidopsis thaliana]
gi|45752730|gb|AAS76263.1| At3g56830 [Arabidopsis thaliana]
gi|110740493|dbj|BAE98352.1| hypothetical protein [Arabidopsis thaliana]
gi|332646050|gb|AEE79571.1| uncharacterized protein [Arabidopsis thaliana]
gi|332646051|gb|AEE79572.1| uncharacterized protein [Arabidopsis thaliana]
Length = 188
Score = 43.5 bits (101), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 55/120 (45%), Gaps = 18/120 (15%)
Query: 44 GFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSF 103
G R I+A+Q + LV I + +L + F LL GFY A S
Sbjct: 81 GLRLIKAIQVLRS-KLLVKIQEIKKDLPKKLF------------FLLVGFYSATAFSTFI 127
Query: 104 GALGVNDVIAAVLCVLLTEYVTRFYY--SRP---KVTFPLALLNNFKMGFTYGLFIDAFK 158
G G DV++A L VL+ E + Y S P K+ + + N +K G GLF+D+FK
Sbjct: 128 GQTGDWDVLSAGLAVLVVECIGALMYRASIPLINKMRSTITMFNYWKTGLALGLFLDSFK 187
>gi|302763747|ref|XP_002965295.1| hypothetical protein SELMODRAFT_7627 [Selaginella moellendorffii]
gi|300167528|gb|EFJ34133.1| hypothetical protein SELMODRAFT_7627 [Selaginella moellendorffii]
Length = 105
Score = 43.5 bits (101), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 4/88 (4%)
Query: 75 FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV 134
FR R LL GFY ++ G G DV+ + V + E + Y P V
Sbjct: 18 FRETRRNLPLKVFLLLLGFYSGNALATVLGQTGDWDVLVGAIVVAVIEGIGFLMYKMPPV 77
Query: 135 T----FPLALLNNFKMGFTYGLFIDAFK 158
T F + +LN +K+G ++GLF+D K
Sbjct: 78 TEKLRFLVKMLNYWKVGLSFGLFVDGSK 105
>gi|42565978|ref|NP_191242.2| uncharacterized protein [Arabidopsis thaliana]
gi|332646049|gb|AEE79570.1| uncharacterized protein [Arabidopsis thaliana]
Length = 230
Score = 43.1 bits (100), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 55/120 (45%), Gaps = 18/120 (15%)
Query: 44 GFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSF 103
G R I+A+Q + LV I + +L + F LL GFY A S
Sbjct: 81 GLRLIKAIQVLRS-KLLVKIQEIKKDLPKKLF------------FLLVGFYSATAFSTFI 127
Query: 104 GALGVNDVIAAVLCVLLTEYVTRFYY--SRP---KVTFPLALLNNFKMGFTYGLFIDAFK 158
G G DV++A L VL+ E + Y S P K+ + + N +K G GLF+D+FK
Sbjct: 128 GQTGDWDVLSAGLAVLVVECIGALMYRASIPLINKMRSTITMFNYWKTGLALGLFLDSFK 187
>gi|302809861|ref|XP_002986623.1| hypothetical protein SELMODRAFT_48840 [Selaginella moellendorffii]
gi|300145806|gb|EFJ12480.1| hypothetical protein SELMODRAFT_48840 [Selaginella moellendorffii]
Length = 106
Score = 42.0 bits (97), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 4/88 (4%)
Query: 75 FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV 134
FR R LL GFY ++ G G DV+ + V + E + Y P V
Sbjct: 19 FREIRRNLPLKVFLLLLGFYSGNALATVLGQTGDWDVLVGAIIVAVIEGIGFLMYKIPPV 78
Query: 135 T----FPLALLNNFKMGFTYGLFIDAFK 158
T F + +LN +K+G ++GLF+D K
Sbjct: 79 TEKLRFLVKMLNYWKVGLSFGLFVDGSK 106
>gi|116075669|ref|ZP_01472928.1| hypothetical protein RS9916_39426 [Synechococcus sp. RS9916]
gi|116066984|gb|EAU72739.1| hypothetical protein RS9916_39426 [Synechococcus sp. RS9916]
Length = 113
Score = 42.0 bits (97), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 43/89 (48%), Gaps = 3/89 (3%)
Query: 74 YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPK 133
+ +P R+A I +L F V ++ G G+ D +AA++ V E + R PK
Sbjct: 24 WSTNPWRKASLLLIVMLSSFVVGSSVGAIAGVSGLMDPVAALITVAAWETMVRLRRPWPK 83
Query: 134 ---VTFPLALLNNFKMGFTYGLFIDAFKL 159
L +L+ ++G YGL ++ FKL
Sbjct: 84 RADTLLGLQMLDMTRIGLIYGLLLEGFKL 112
>gi|224063751|ref|XP_002301276.1| predicted protein [Populus trichocarpa]
gi|222843002|gb|EEE80549.1| predicted protein [Populus trichocarpa]
Length = 152
Score = 41.6 bits (96), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 56/121 (46%), Gaps = 27/121 (22%)
Query: 46 RSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGA 105
R+I+A Q G R + E+ +N P + F L GFY A + G
Sbjct: 50 RAIQAFQTRLGVR--------IREIRKNL---PMKLLFF-----LVGFYCATAFATVIGQ 93
Query: 106 LGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP--------LALLNNFKMGFTYGLFIDAF 157
G D+++A L V++ E + Y + +FP +++ N +K G T GLF+D+F
Sbjct: 94 TGDWDILSAALAVVVVEGIGALMY---RASFPFVKSIRSIVSVFNYWKAGLTLGLFLDSF 150
Query: 158 K 158
K
Sbjct: 151 K 151
>gi|388511147|gb|AFK43635.1| unknown [Medicago truncatula]
Length = 201
Score = 41.6 bits (96), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 39/79 (49%), Gaps = 6/79 (7%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLN 142
LL GFY A ++ G G DV+ A + V E + Y +P T L ++N
Sbjct: 123 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLIYKKPPTTRTGRLQSFLVMVN 182
Query: 143 NFKMGFTYGLFIDAFKLAS 161
+K G GLF+DAFKL S
Sbjct: 183 YWKAGICLGLFVDAFKLGS 201
>gi|255553945|ref|XP_002518013.1| conserved hypothetical protein [Ricinus communis]
gi|223542995|gb|EEF44531.1| conserved hypothetical protein [Ricinus communis]
Length = 238
Score = 41.6 bits (96), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 65/147 (44%), Gaps = 27/147 (18%)
Query: 23 LSPQTIASLGRRGISFSSSS---AGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPS 79
++ Q +S G+ SSS G R +RA+Q IV + E+ +N P
Sbjct: 83 ITWQIRSSADSSGLDPSSSGRTRGGTRLLRAIQA-----LQTKIVARIQEIRKNL---PI 134
Query: 80 RRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPL- 138
+ F L GFY A + G G D+++A L V + E + Y + +FPL
Sbjct: 135 KLLFF-----LVGFYCATAFATVIGQTGDWDILSAALAVAVVEGIGALMY---RASFPLF 186
Query: 139 -------ALLNNFKMGFTYGLFIDAFK 158
+ N +K G + GLF+D+FK
Sbjct: 187 NNMRSLITMFNYWKAGLSLGLFLDSFK 213
>gi|357476287|ref|XP_003608429.1| Ycf20-like protein [Medicago truncatula]
gi|355509484|gb|AES90626.1| Ycf20-like protein [Medicago truncatula]
Length = 209
Score = 41.2 bits (95), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 39/79 (49%), Gaps = 6/79 (7%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLN 142
LL GFY A ++ G G DV+ A + V E + Y +P T L ++N
Sbjct: 131 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLIYKKPPTTRTGRLQSFLVMVN 190
Query: 143 NFKMGFTYGLFIDAFKLAS 161
+K G GLF+DAFKL S
Sbjct: 191 YWKAGICLGLFVDAFKLGS 209
>gi|283794933|ref|YP_003359286.1| hypothetical chloroplast RF20 [Cryptomonas paramecium]
gi|253981905|gb|ACT46822.1| hypothetical chloroplast RF20 [Cryptomonas paramecium]
Length = 105
Score = 41.2 bits (95), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 50/105 (47%), Gaps = 13/105 (12%)
Query: 70 LSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT-RFY 128
+ + F + L I +L G ++A T+S + G G ++ + + V E ++ R Y
Sbjct: 1 MRKQQFAHQLIQLLINFILILFGCFIATTLSTTLGQTGDWGMLISGIIVTFLEIISKRTY 60
Query: 129 ------------YSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
Y K+ + + LLN+ K+G YG FI+AFKL S
Sbjct: 61 TIKKLIMILPKNYQSHKIKYFVVLLNDIKIGVLYGFFIEAFKLGS 105
>gi|222622484|gb|EEE56616.1| hypothetical protein OsJ_05998 [Oryza sativa Japonica Group]
Length = 191
Score = 41.2 bits (95), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 6/76 (7%)
Query: 92 GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLNNFK 145
GFY A ++ G G DV+ A + V E + Y +P P +A++N +K
Sbjct: 116 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPMSRPPGRFQSLIAMVNYWK 175
Query: 146 MGFTYGLFIDAFKLAS 161
G GLF+DAFKL S
Sbjct: 176 AGVCLGLFVDAFKLGS 191
>gi|218190363|gb|EEC72790.1| hypothetical protein OsI_06472 [Oryza sativa Indica Group]
Length = 191
Score = 40.8 bits (94), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 6/76 (7%)
Query: 92 GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLNNFK 145
GFY A ++ G G DV+ A + V E + Y +P P +A++N +K
Sbjct: 116 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPMSRPPGRFQSLIAMVNYWK 175
Query: 146 MGFTYGLFIDAFKLAS 161
G GLF+DAFKL S
Sbjct: 176 AGVCLGLFVDAFKLGS 191
>gi|356496360|ref|XP_003517036.1| PREDICTED: uncharacterized protein LOC100306594 [Glycine max]
Length = 197
Score = 40.8 bits (94), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 5/74 (6%)
Query: 90 LGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY--SRPKVTFP---LALLNNF 144
L GFY A + G G D+++A L V + E + Y S P V+ ++L N +
Sbjct: 123 LVGFYCATAFATVIGQTGDWDILSAALAVAVVEGIGALMYRASLPLVSKSRSLISLFNYW 182
Query: 145 KMGFTYGLFIDAFK 158
K G T GLF+D+FK
Sbjct: 183 KAGLTLGLFLDSFK 196
>gi|296089191|emb|CBI38894.3| unnamed protein product [Vitis vinifera]
Length = 243
Score = 40.8 bits (94), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 70/169 (41%), Gaps = 28/169 (16%)
Query: 10 VGIVSPTILFQFGLSPQTIASLG-----RRGISFSSSSAGFRSIRAVQENG--GPRRLVD 62
VGIV+ FGL +A L +R + SS S+ NG G RLV
Sbjct: 59 VGIVTCLHKTSFGLCFFRVAQLSLVNTSKRMVWSVRSSVDGNSLDPTPTNGTDGRTRLVR 118
Query: 63 IVRLVP--------ELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAA 114
+++ + EL ++ P + F L GFY A + G G D+++A
Sbjct: 119 VIQGIQIKLSTRILELRKDL---PMKILFF-----LVGFYCATAFATVIGQTGDWDILSA 170
Query: 115 VLCVLLTEYVTRFYYSR-----PKVTFPLALLNNFKMGFTYGLFIDAFK 158
L V + E + Y KV + + N +K G GLF+D+FK
Sbjct: 171 ALAVFVVELIGALMYRASIHLFDKVRRLITMFNYWKAGLCLGLFLDSFK 219
>gi|357118772|ref|XP_003561123.1| PREDICTED: ycf20-like protein-like [Brachypodium distachyon]
Length = 219
Score = 40.4 bits (93), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 5/72 (6%)
Query: 92 GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP-----KVTFPLALLNNFKM 146
GFY A + G G D+++A L V + E++ Y ++ + + N +K
Sbjct: 124 GFYCATAFATVIGQTGDWDILSAGLAVAIVEFIGALMYRASFAFFGRIKNMITIFNYWKA 183
Query: 147 GFTYGLFIDAFK 158
G T GLF+D+FK
Sbjct: 184 GLTLGLFLDSFK 195
>gi|225453999|ref|XP_002274644.1| PREDICTED: ycf20-like protein-like [Vitis vinifera]
Length = 217
Score = 40.4 bits (93), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 70/169 (41%), Gaps = 28/169 (16%)
Query: 10 VGIVSPTILFQFGLSPQTIASLG-----RRGISFSSSSAGFRSIRAVQENG--GPRRLVD 62
VGIV+ FGL +A L +R + SS S+ NG G RLV
Sbjct: 33 VGIVTCLHKTSFGLCFFRVAQLSLVNTSKRMVWSVRSSVDGNSLDPTPTNGTDGRTRLVR 92
Query: 63 IVRLVP--------ELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAA 114
+++ + EL ++ P + F L GFY A + G G D+++A
Sbjct: 93 VIQGIQIKLSTRILELRKDL---PMKILFF-----LVGFYCATAFATVIGQTGDWDILSA 144
Query: 115 VLCVLLTEYVTRFYYSR-----PKVTFPLALLNNFKMGFTYGLFIDAFK 158
L V + E + Y KV + + N +K G GLF+D+FK
Sbjct: 145 ALAVFVVELIGALMYRASIHLFDKVRRLITMFNYWKAGLCLGLFLDSFK 193
>gi|50251788|dbj|BAD27720.1| antigen receptor-like [Oryza sativa Japonica Group]
Length = 217
Score = 40.4 bits (93), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 6/76 (7%)
Query: 92 GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLNNFK 145
GFY A ++ G G DV+ A + V E + Y +P P +A++N +K
Sbjct: 142 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPMSRPPGRFQSLIAMVNYWK 201
Query: 146 MGFTYGLFIDAFKLAS 161
G GLF+DAFKL S
Sbjct: 202 AGVCLGLFVDAFKLGS 217
>gi|357157876|ref|XP_003577943.1| PREDICTED: ycf20-like protein-like [Brachypodium distachyon]
Length = 144
Score = 40.0 bits (92), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 6/76 (7%)
Query: 92 GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLNNFK 145
GFY A ++ G G DV+ A + V E + Y +P P +A++N +K
Sbjct: 69 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPVSRPPGRFQSLIAMVNYWK 128
Query: 146 MGFTYGLFIDAFKLAS 161
G GLF+DAFKL S
Sbjct: 129 AGVCLGLFVDAFKLGS 144
>gi|297820458|ref|XP_002878112.1| hypothetical protein ARALYDRAFT_486123 [Arabidopsis lyrata subsp.
lyrata]
gi|297323950|gb|EFH54371.1| hypothetical protein ARALYDRAFT_486123 [Arabidopsis lyrata subsp.
lyrata]
Length = 188
Score = 40.0 bits (92), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 39/75 (52%), Gaps = 5/75 (6%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY--SRP---KVTFPLALLNN 143
LL GFY A S G G DV++A + VL+ E + Y S P K+ + + N
Sbjct: 113 LLVGFYSATAFSTFIGQTGDWDVLSAGVAVLVVECIGALMYRASIPLINKMRGTITMFNY 172
Query: 144 FKMGFTYGLFIDAFK 158
+K G GLF+D+FK
Sbjct: 173 WKTGLALGLFLDSFK 187
>gi|212275874|ref|NP_001131010.1| uncharacterized protein LOC100192115 [Zea mays]
gi|194690700|gb|ACF79434.1| unknown [Zea mays]
gi|194695206|gb|ACF81687.1| unknown [Zea mays]
gi|195654383|gb|ACG46659.1| hypothetical protein [Zea mays]
Length = 232
Score = 40.0 bits (92), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 6/76 (7%)
Query: 92 GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLNNFK 145
GFY A ++ G G DV+ A + V E + Y +P P ++++N +K
Sbjct: 157 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPIARPPGRFQSLISMVNYWK 216
Query: 146 MGFTYGLFIDAFKLAS 161
G GLF+DAFKL S
Sbjct: 217 AGVCLGLFVDAFKLGS 232
>gi|449526593|ref|XP_004170298.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
Length = 169
Score = 39.7 bits (91), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 6/79 (7%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP------KVTFPLALLN 142
LL GFY A ++ G G DV+ A + V E++ Y +P ++ + ++N
Sbjct: 91 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEWIGILMYRKPPPLLTRRLKSFIVMVN 150
Query: 143 NFKMGFTYGLFIDAFKLAS 161
+K G GLF+DAFKL S
Sbjct: 151 YWKAGVCLGLFVDAFKLGS 169
>gi|242048754|ref|XP_002462123.1| hypothetical protein SORBIDRAFT_02g019530 [Sorghum bicolor]
gi|241925500|gb|EER98644.1| hypothetical protein SORBIDRAFT_02g019530 [Sorghum bicolor]
Length = 201
Score = 39.7 bits (91), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 6/76 (7%)
Query: 92 GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLNNFK 145
GFY A ++ G G DV+ A + V E + Y +P P ++++N +K
Sbjct: 126 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPITRPPGRFQSLISMVNYWK 185
Query: 146 MGFTYGLFIDAFKLAS 161
G GLF+DAFKL S
Sbjct: 186 AGVCLGLFVDAFKLGS 201
>gi|356506255|ref|XP_003521902.1| PREDICTED: ycf20-like protein-like [Glycine max]
Length = 197
Score = 39.7 bits (91), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 38/77 (49%), Gaps = 11/77 (14%)
Query: 90 LGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPL--------ALL 141
L GFY A + G G D+++A L V + E + Y + + PL +L
Sbjct: 123 LVGFYCATAFATVIGQTGDWDILSAALAVAVVEGIGALMY---RASLPLVRKSRSLISLF 179
Query: 142 NNFKMGFTYGLFIDAFK 158
N +K G T GLF+D+FK
Sbjct: 180 NYWKAGLTLGLFLDSFK 196
>gi|449442657|ref|XP_004139097.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
Length = 212
Score = 39.7 bits (91), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 6/79 (7%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP------KVTFPLALLN 142
LL GFY A ++ G G DV+ A + V E++ Y +P ++ + ++N
Sbjct: 134 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEWIGILMYRKPPPLLTRRLKSFIVMVN 193
Query: 143 NFKMGFTYGLFIDAFKLAS 161
+K G GLF+DAFKL S
Sbjct: 194 YWKAGVCLGLFVDAFKLGS 212
>gi|255547482|ref|XP_002514798.1| conserved hypothetical protein [Ricinus communis]
gi|223545849|gb|EEF47352.1| conserved hypothetical protein [Ricinus communis]
Length = 201
Score = 39.7 bits (91), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 40/81 (49%), Gaps = 10/81 (12%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP--------LAL 140
LL GFY A ++ G G DV+ A + V E + Y +P T P + +
Sbjct: 123 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKP--TSPSTGRLQSFVVM 180
Query: 141 LNNFKMGFTYGLFIDAFKLAS 161
+N +K G GLF+DAFKL S
Sbjct: 181 MNYWKAGVCLGLFVDAFKLGS 201
>gi|126695592|ref|YP_001090478.1| hypothetical protein P9301_02541 [Prochlorococcus marinus str. MIT
9301]
gi|126542635|gb|ABO16877.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9301]
Length = 113
Score = 39.7 bits (91), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 77 SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT---RFYYSRPK 133
+P RR I+LL G+++ ++ + + + D +AA L V+ E + R + S K
Sbjct: 27 NPWRRYSISLITLLIGYFIGSSLGMVSAVVELMDPVAAFLSVVFIEILITLRRNFRSERK 86
Query: 134 VTFPLALLNNFKMGFTYGLFIDAFKL 159
F + L++ ++G YG F ++ KL
Sbjct: 87 KKFLILFLDSLRLGLFYGFFTESLKL 112
>gi|148239449|ref|YP_001224836.1| hypothetical protein SynWH7803_1113 [Synechococcus sp. WH 7803]
gi|147847988|emb|CAK23539.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
Length = 112
Score = 39.3 bits (90), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 43/87 (49%), Gaps = 3/87 (3%)
Query: 78 PSRRALFGGISLLGGFYVAQTISL-SFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTF 136
P RR G +SLL GF V +++ + G + IA + VL+ E + R +
Sbjct: 26 PWRRRSVGLLSLLFGFIVGSNVTMYWYQKSGQDRPIAVLTMVLILEVLVRLRSRVRTDPW 85
Query: 137 PLAL--LNNFKMGFTYGLFIDAFKLAS 161
PL L+N ++G Y + +AFKL S
Sbjct: 86 PLGWLALDNLRIGTVYAVVFEAFKLGS 112
>gi|359807488|ref|NP_001241142.1| uncharacterized protein LOC100814758 [Glycine max]
gi|255639749|gb|ACU20168.1| unknown [Glycine max]
Length = 207
Score = 39.3 bits (90), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 40/79 (50%), Gaps = 6/79 (7%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP------KVTFPLALLN 142
LL GFY A ++ G G DV+ A + V E + Y +P ++ L L+N
Sbjct: 129 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLIYRKPPTEKTGRLQSFLVLVN 188
Query: 143 NFKMGFTYGLFIDAFKLAS 161
+K G GLF+DAFKL S
Sbjct: 189 YWKAGICLGLFVDAFKLGS 207
>gi|33239713|ref|NP_874655.1| hypothetical protein Pro0261 [Prochlorococcus marinus subsp.
marinus str. CCMP1375]
gi|33237238|gb|AAP99307.1| Uncharacterized secreted or membrane protein [Prochlorococcus
marinus subsp. marinus str. CCMP1375]
Length = 111
Score = 39.3 bits (90), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 52/111 (46%), Gaps = 7/111 (6%)
Query: 52 QENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDV 111
Q R +++++ + S N P RR I L F +I + G L + D
Sbjct: 4 QSTSLNRNIINLIMRLEVWSDN----PWRRYSMLIIIFLSAFLFGSSIGMINGVLALMDP 59
Query: 112 IAAVLCVLLTEYVTRF---YYSRPKVTFPLALLNNFKMGFTYGLFIDAFKL 159
I A V L E + RF + + L++L++F+MGF YGLF + FKL
Sbjct: 60 IGAFFTVTLIEVLVRFRKVNVQKKGSSISLSILDSFRMGFIYGLFTEGFKL 110
>gi|159902795|ref|YP_001550139.1| hypothetical protein P9211_02541 [Prochlorococcus marinus str. MIT
9211]
gi|159887971|gb|ABX08185.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9211]
Length = 113
Score = 39.3 bits (90), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 43/86 (50%), Gaps = 3/86 (3%)
Query: 77 SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRF---YYSRPK 133
+P RR I G F ++ + L + D+I A VLL E++ R + +
Sbjct: 27 NPWRRYSLLLIIFFGAFVFGSSVGMINAVLELMDLIGAFFTVLLLEFMVRLRQAWNRKNS 86
Query: 134 VTFPLALLNNFKMGFTYGLFIDAFKL 159
+ L ++++ ++G YGLF++ FKL
Sbjct: 87 SSILLQIIDSARIGLLYGLFMEGFKL 112
>gi|449498939|ref|XP_004160676.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
Length = 221
Score = 39.3 bits (90), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 58/140 (41%), Gaps = 5/140 (3%)
Query: 24 SPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRAL 83
P + + + +S S G + G RL+ I++ + +
Sbjct: 57 KPSVLKNFKKMALSIRGSVEGNPLDPPSSNSNGRTRLIRIIQAIQTKLNARINELRKNLP 116
Query: 84 FGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY--SRP---KVTFPL 138
+ L GFY A + G G D+++A L V++ E + Y S P K+ +
Sbjct: 117 LKILFFLVGFYSATAFATVIGQTGDWDILSAALAVVVVEGIGALMYRASLPLLNKIKDLI 176
Query: 139 ALLNNFKMGFTYGLFIDAFK 158
+ N +K G + GLF+D+FK
Sbjct: 177 TMFNYWKAGLSMGLFLDSFK 196
>gi|449454528|ref|XP_004145006.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
gi|449474824|ref|XP_004154295.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
Length = 221
Score = 38.9 bits (89), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 59/129 (45%), Gaps = 25/129 (19%)
Query: 39 SSSSAG----FRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFY 94
SS+S G R I+A+Q R + EL +N P + F L GFY
Sbjct: 84 SSNSNGRTRLIRIIQAIQTKLNAR--------INELRKNL---PLKILFF-----LVGFY 127
Query: 95 VAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY--SRP---KVTFPLALLNNFKMGFT 149
A + G G D+++A L V++ E + Y S P K+ + + N +K G +
Sbjct: 128 SATAFATVIGQTGDWDILSAALAVVVVEGIGALMYRASLPLLNKIKDLITMFNYWKAGLS 187
Query: 150 YGLFIDAFK 158
GLF+D+FK
Sbjct: 188 MGLFLDSFK 196
>gi|148243301|ref|YP_001228458.1| hypothetical protein SynRCC307_2202 [Synechococcus sp. RCC307]
gi|147851611|emb|CAK29105.1| Uncharacterized conserved membrane protein [Synechococcus sp.
RCC307]
Length = 110
Score = 38.9 bits (89), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 39/89 (43%), Gaps = 3/89 (3%)
Query: 73 NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT---RFYY 129
+ +P RR + LL GF +I G L V D + A++ V +E R
Sbjct: 19 QWAENPWRRFSLLVLVLLAGFSAGVSIGSVAGVLMVIDPVGALVVVAASELAIKARRPLL 78
Query: 130 SRPKVTFPLALLNNFKMGFTYGLFIDAFK 158
R PL LL+ +MG YGL ++ K
Sbjct: 79 KRGGDKLPLGLLDMARMGIVYGLLLEGIK 107
>gi|242037377|ref|XP_002466083.1| hypothetical protein SORBIDRAFT_01g000870 [Sorghum bicolor]
gi|241919937|gb|EER93081.1| hypothetical protein SORBIDRAFT_01g000870 [Sorghum bicolor]
Length = 204
Score = 38.5 bits (88), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 11/75 (14%)
Query: 92 GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPL--------ALLNN 143
GFY A + G G D+++A L V + E + Y + +F L + N
Sbjct: 109 GFYCATAFATVIGQTGDWDILSAGLAVAVVEVIGALMY---RASFALLGRIRNMITIFNY 165
Query: 144 FKMGFTYGLFIDAFK 158
+K G T GLF+D+FK
Sbjct: 166 WKAGLTLGLFLDSFK 180
>gi|72382195|ref|YP_291550.1| hypothetical protein PMN2A_0355 [Prochlorococcus marinus str.
NATL2A]
gi|72002045|gb|AAZ57847.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL2A]
Length = 106
Score = 38.1 bits (87), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 50/85 (58%), Gaps = 2/85 (2%)
Query: 77 SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTF 136
S S+R+L +SLL GFY + VN + A++ +L+ E V R Y PK+T
Sbjct: 24 SWSKRSLLV-LSLLFGFYFTNSFLSFLLDKSVNTIFLAIIILLIMELVIRSYLI-PKLTI 81
Query: 137 PLALLNNFKMGFTYGLFIDAFKLAS 161
P+ ++NN ++G TY L ++A+KL S
Sbjct: 82 PVMIINNIRIGSTYALILEAYKLGS 106
>gi|413932398|gb|AFW66949.1| hypothetical protein ZEAMMB73_929985 [Zea mays]
gi|413932399|gb|AFW66950.1| hypothetical protein ZEAMMB73_929985 [Zea mays]
gi|413932400|gb|AFW66951.1| hypothetical protein ZEAMMB73_929985 [Zea mays]
Length = 212
Score = 38.1 bits (87), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 35/72 (48%), Gaps = 5/72 (6%)
Query: 92 GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP-----KVTFPLALLNNFKM 146
GFY A + G G D+++A L V + E + Y ++ + + N +K
Sbjct: 116 GFYCATAFATVIGQTGDWDILSAGLAVAIVEVIGALMYRASFAFLGRIGNMITVFNYWKA 175
Query: 147 GFTYGLFIDAFK 158
G T GLF+D+FK
Sbjct: 176 GLTLGLFLDSFK 187
>gi|33240432|ref|NP_875374.1| hypothetical protein Pro0982 [Prochlorococcus marinus subsp.
marinus str. CCMP1375]
gi|33237959|gb|AAQ00027.1| Uncharacterized secreted or membrane protein [Prochlorococcus
marinus subsp. marinus str. CCMP1375]
Length = 111
Score = 38.1 bits (87), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 4/87 (4%)
Query: 78 PSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRF---YYSRPKV 134
P + G ISLL G+Y+A ++ F L + + V+L E R + K
Sbjct: 26 PWKVRSIGLISLLVGYYLASILTTYFLVLFTQRSLVVIPLVILVEISVRLRRALLNNQKE 85
Query: 135 TFPLALLNNFKMGFTYGLFIDAFKLAS 161
++ L +L+N ++G TY + ++AFKL S
Sbjct: 86 SY-LLILDNIRIGITYAVVLEAFKLGS 111
>gi|195615956|gb|ACG29808.1| hypothetical protein [Zea mays]
Length = 231
Score = 37.7 bits (86), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 37/75 (49%), Gaps = 6/75 (8%)
Query: 93 FYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLNNFKM 146
FY A ++ G G DV+ A + V E + Y +P P ++++N +K
Sbjct: 157 FYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPIARPPGRFQSLISMVNYWKA 216
Query: 147 GFTYGLFIDAFKLAS 161
G GLF+DAFKL S
Sbjct: 217 GVCLGLFVDAFKLGS 231
>gi|255070727|ref|XP_002507445.1| predicted protein [Micromonas sp. RCC299]
gi|226522720|gb|ACO68703.1| predicted protein [Micromonas sp. RCC299]
Length = 152
Score = 37.7 bits (86), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 33/63 (52%), Gaps = 12/63 (19%)
Query: 110 DVIAAVLCVLLTEYVTRFYYS--RPKVTF---------PLALLNNFKMGFTYGLFIDAFK 158
D + V L E+++ YS RP+ PL +LNNFK+G YGLF DAFK
Sbjct: 91 DAPICLTIVFLAEFLSSVLYSPKRPRGILKFLKLNDLVPL-VLNNFKLGTLYGLFCDAFK 149
Query: 159 LAS 161
+ S
Sbjct: 150 VGS 152
>gi|317968664|ref|ZP_07970054.1| hypothetical protein SCB02_03913 [Synechococcus sp. CB0205]
Length = 80
Score = 37.7 bits (86), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 38/76 (50%), Gaps = 3/76 (3%)
Query: 87 ISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVT---FPLALLNN 143
I LL F++ I+ GA D AA+LCVL E R + T L LL+
Sbjct: 4 IVLLSTFFLGNAIATLTGARSFLDPPAALLCVLAIEAAIRLRGPLLRNTSNRLGLQLLDM 63
Query: 144 FKMGFTYGLFIDAFKL 159
++GF YGL ++ FKL
Sbjct: 64 SRVGFCYGLLLEGFKL 79
>gi|115456649|ref|NP_001051925.1| Os03g0852600 [Oryza sativa Japonica Group]
gi|113550396|dbj|BAF13839.1| Os03g0852600, partial [Oryza sativa Japonica Group]
Length = 206
Score = 37.4 bits (85), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 35/72 (48%), Gaps = 5/72 (6%)
Query: 92 GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP-----KVTFPLALLNNFKM 146
GFY A + G G D+++A L V + E + Y ++ + + N +K
Sbjct: 112 GFYCATAFATVIGQTGDWDILSAGLAVAIVEGIGALMYRASFAFLGRIRKMITIFNYWKA 171
Query: 147 GFTYGLFIDAFK 158
G T GLF+D+FK
Sbjct: 172 GLTLGLFLDSFK 183
>gi|88808713|ref|ZP_01124223.1| hypothetical protein WH7805_03447 [Synechococcus sp. WH 7805]
gi|88787701|gb|EAR18858.1| hypothetical protein WH7805_03447 [Synechococcus sp. WH 7805]
Length = 112
Score = 36.6 bits (83), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 80 RRALFGGISLLGGFYVAQTISL-SFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV-TFP 137
RR G +SLL GF V +++ F G A ++ VL+ E R SR ++ +P
Sbjct: 28 RRRSVGLLSLLFGFIVGSNVTMYWFQKSGQQRPFAVLMVVLVLEMQVRLR-SRVRIDQWP 86
Query: 138 LALL--NNFKMGFTYGLFIDAFKLAS 161
L+ L +N ++G Y + +AFKL S
Sbjct: 87 LSWLALDNLRIGTVYAVVFEAFKLGS 112
>gi|124112127|ref|YP_001019166.1| hypothetical chloroplast RF20 [Chlorokybus atmophyticus]
gi|124012234|gb|ABM87970.1| hypothetical chloroplast RF20 [Chlorokybus atmophyticus]
Length = 127
Score = 36.6 bits (83), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 40/92 (43%), Gaps = 18/92 (19%)
Query: 88 SLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPL--------- 138
LL GF++A + G +V+ A V+L E F Y + TF L
Sbjct: 36 CLLMGFFMATFFATILGFPKNWNVLNATFLVILIEVTGFFLYGKTSATFNLYTNKNLHSN 95
Query: 139 ---------ALLNNFKMGFTYGLFIDAFKLAS 161
A +N K+G +GLF+DAFKL S
Sbjct: 96 KKKNINLWVANINYLKIGLIFGLFVDAFKLGS 127
>gi|308802359|ref|XP_003078493.1| unnamed protein product [Ostreococcus tauri]
gi|116056945|emb|CAL53234.1| unnamed protein product [Ostreococcus tauri]
Length = 130
Score = 36.6 bits (83), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 40/85 (47%), Gaps = 13/85 (15%)
Query: 87 ISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYS--------RPKVTFPL 138
ISLL GF + I D + +LL+E+V+ Y R PL
Sbjct: 49 ISLLVGFSMCALIP---HPESPQDAFISFTIILLSEFVSSVLYGVDNQGGLLRWLCEGPL 105
Query: 139 A--LLNNFKMGFTYGLFIDAFKLAS 161
A ++N FK+G YGLF DAFK+ S
Sbjct: 106 APMIVNCFKIGVLYGLFCDAFKVGS 130
>gi|327283939|ref|XP_003226697.1| PREDICTED: cell death activator CIDE-B-like [Anolis carolinensis]
Length = 221
Score = 35.8 bits (81), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 51/117 (43%), Gaps = 7/117 (5%)
Query: 2 IEALMARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLV 61
I +L+ G + T F L T + +G +S S G RS E PR
Sbjct: 72 IMSLVLEEDGTLLETEEFFEALEDNTTVMVLEKGQKWSHSKGGGRSYALSGEK--PRNSR 129
Query: 62 DIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
DI R+ ++ Y +P R LFG +++ FY ++S F LG V+ +L V
Sbjct: 130 DIARITFDI---YKLNP--RDLFGSLNISATFYGLYSMSCDFKCLGPKKVLREILRV 181
>gi|124025743|ref|YP_001014859.1| hypothetical protein NATL1_10361 [Prochlorococcus marinus str.
NATL1A]
gi|123960811|gb|ABM75594.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL1A]
Length = 106
Score = 35.8 bits (81), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 14/33 (42%), Positives = 24/33 (72%)
Query: 129 YSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
Y P++T P+ ++NN ++G TY L ++A+KL S
Sbjct: 74 YLIPRLTIPVMIINNIRIGSTYALILEAYKLGS 106
>gi|465500|sp|P34813.1|YCF20_AGLNE RecName: Full=Uncharacterized protein ycf20
gi|304023|gb|AAA18512.1| unknown [Aglaothamnion neglectum]
Length = 100
Score = 35.0 bits (79), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 87 ISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV--TFPLALLNNF 144
ISL G++ I G +I A+L V + E ++ YS K + +N
Sbjct: 24 ISLFLGYFFCTIICSIPKETGDWGLITALLIVGINETMSALIYSYKKYENNIIVKTINGI 83
Query: 145 KMGFTYGLFIDAFKLAS 161
K+G YGLF+D+FKL S
Sbjct: 84 KIGIIYGLFVDSFKLGS 100
>gi|359495755|ref|XP_002267846.2| PREDICTED: ycf20-like protein [Vitis vinifera]
gi|298205056|emb|CBI38352.3| unnamed protein product [Vitis vinifera]
Length = 225
Score = 35.0 bits (79), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 6/79 (7%)
Query: 89 LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP------KVTFPLALLN 142
LL GFY A ++ G G DV+ A + V E + Y + ++ + ++N
Sbjct: 147 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLVYKKSSSQSTGRLQSLVVMIN 206
Query: 143 NFKMGFTYGLFIDAFKLAS 161
+K G GLF+DAFKL S
Sbjct: 207 YWKAGVCLGLFVDAFKLGS 225
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.328 0.142 0.408
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,331,713,557
Number of Sequences: 23463169
Number of extensions: 89851568
Number of successful extensions: 280003
Number of sequences better than 100.0: 200
Number of HSP's better than 100.0 without gapping: 90
Number of HSP's successfully gapped in prelim test: 110
Number of HSP's that attempted gapping in prelim test: 279772
Number of HSP's gapped (non-prelim): 204
length of query: 161
length of database: 8,064,228,071
effective HSP length: 123
effective length of query: 38
effective length of database: 9,473,225,580
effective search space: 359982572040
effective search space used: 359982572040
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 71 (32.0 bits)