BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 031338
         (161 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225469306|ref|XP_002269823.1| PREDICTED: ycf20-like protein isoform 2 [Vitis vinifera]
 gi|225469308|ref|XP_002269782.1| PREDICTED: ycf20-like protein isoform 1 [Vitis vinifera]
 gi|297741124|emb|CBI31855.3| unnamed protein product [Vitis vinifera]
          Length = 155

 Score =  236 bits (602), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 122/156 (78%), Positives = 136/156 (87%), Gaps = 1/156 (0%)

Query: 6   MARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVR 65
           MA ++ + S T L +FGL+  T  S G+  +  S SS  F S++AVQEN GPRRLVDI+R
Sbjct: 1   MAHSLSLPS-TCLLKFGLATGTRVSHGKCSMLASRSSPRFFSVQAVQENEGPRRLVDIIR 59

Query: 66  LVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT 125
           +VP+LSRNYF+SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVL+TEYVT
Sbjct: 60  IVPQLSRNYFQSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLITEYVT 119

Query: 126 RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 120 RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 155


>gi|255539575|ref|XP_002510852.1| conserved hypothetical protein [Ricinus communis]
 gi|223549967|gb|EEF51454.1| conserved hypothetical protein [Ricinus communis]
          Length = 155

 Score =  234 bits (598), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 120/156 (76%), Positives = 134/156 (85%), Gaps = 1/156 (0%)

Query: 6   MARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVR 65
           MA ++G++  T+  + G   Q   S G+ G+  S SS G  SI+AVQ+NGGP+RLVDI+R
Sbjct: 1   MALSIGLIQ-TVPLKLGFGFQRRVSCGKYGMLISRSSRGLLSIQAVQDNGGPQRLVDIIR 59

Query: 66  LVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT 125
            VPE SRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAV+CVLLTEYVT
Sbjct: 60  RVPEFSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVICVLLTEYVT 119

Query: 126 RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           RFYYSRPKVTFP+ALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 120 RFYYSRPKVTFPVALLNNFKMGFTYGLFIDAFKLAS 155


>gi|356566265|ref|XP_003551354.1| PREDICTED: uncharacterized protein ycf20-like [Glycine max]
          Length = 151

 Score =  234 bits (596), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 122/142 (85%), Positives = 128/142 (90%), Gaps = 3/142 (2%)

Query: 20  QFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPS 79
            FG  P+T  S+  R +S   SSAG R IRAVQENGGPRRLVDI+RLVPE SRNYFRSPS
Sbjct: 13  SFGPLPKTKVSVSGRLVS--RSSAGLR-IRAVQENGGPRRLVDIIRLVPEFSRNYFRSPS 69

Query: 80  RRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLA 139
           RRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT+FYYSRPKVTFP+A
Sbjct: 70  RRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTKFYYSRPKVTFPVA 129

Query: 140 LLNNFKMGFTYGLFIDAFKLAS 161
           LLNNFKMGFTYGLFIDAFKLAS
Sbjct: 130 LLNNFKMGFTYGLFIDAFKLAS 151


>gi|224122468|ref|XP_002318844.1| predicted protein [Populus trichocarpa]
 gi|222859517|gb|EEE97064.1| predicted protein [Populus trichocarpa]
          Length = 155

 Score =  233 bits (593), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 119/156 (76%), Positives = 132/156 (84%), Gaps = 1/156 (0%)

Query: 6   MARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVR 65
           MA++V +++ T   + G +PQ   S  + G S   SS G   IRAVQEN GPRRL+DI+R
Sbjct: 1   MAQSVSLIT-TCPLKLGFAPQRRVSSVKYGTSVFRSSTGRLRIRAVQENEGPRRLIDIIR 59

Query: 66  LVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT 125
            +PELSRNYFR PSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAV+CVLLTEYVT
Sbjct: 60  TIPELSRNYFRRPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVT 119

Query: 126 RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           RFYYSRPKVTFP+ALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 120 RFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS 155


>gi|449451787|ref|XP_004143642.1| PREDICTED: uncharacterized protein ycf20-like [Cucumis sativus]
 gi|449530953|ref|XP_004172456.1| PREDICTED: uncharacterized protein ycf20-like [Cucumis sativus]
          Length = 156

 Score =  228 bits (582), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 120/158 (75%), Positives = 133/158 (84%), Gaps = 4/158 (2%)

Query: 6   MARAVGIVSPTILFQFGLSPQTIASLGRRGI--SFSSSSAGFRSIRAVQENGGPRRLVDI 63
           MA++  +V  T L  +G +P+   S  R  +  S  S    FR ++AVQ+ GGPRRLVDI
Sbjct: 1   MAQSASLVH-TSLLNYGFTPKMNPSYWRYDMVKSRPSPPRSFR-VKAVQDTGGPRRLVDI 58

Query: 64  VRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEY 123
           +RLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAV+CVLLTEY
Sbjct: 59  IRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEY 118

Query: 124 VTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           VTRFYYSRPKVTFP+ALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 119 VTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS 156


>gi|212274997|ref|NP_001130729.1| hypothetical protein [Zea mays]
 gi|194689966|gb|ACF79067.1| unknown [Zea mays]
 gi|195626582|gb|ACG35121.1| hypothetical protein [Zea mays]
 gi|223950487|gb|ACN29327.1| unknown [Zea mays]
 gi|414586642|tpg|DAA37213.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
 gi|414586643|tpg|DAA37214.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
 gi|414586644|tpg|DAA37215.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
 gi|414586645|tpg|DAA37216.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
 gi|414586646|tpg|DAA37217.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays]
          Length = 155

 Score =  213 bits (541), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 110/153 (71%), Positives = 125/153 (81%), Gaps = 6/153 (3%)

Query: 9   AVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVRLVP 68
           AVG+ S  +  +     QT  +  ++      SS  +  IRAVQ NGG RRLVDI+R++P
Sbjct: 9   AVGLPSYGLYLETRFLTQTYRNFAQK------SSYKYSRIRAVQGNGGRRRLVDIIRIIP 62

Query: 69  ELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFY 128
           ELSR+YF+S SRRALFGGISLLGGFYVAQTISLSFG LGVNDVIAAV+CVLLTEYVT+FY
Sbjct: 63  ELSRDYFKSRSRRALFGGISLLGGFYVAQTISLSFGTLGVNDVIAAVVCVLLTEYVTKFY 122

Query: 129 YSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           YSRPKVTFP+ALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 123 YSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS 155


>gi|115459106|ref|NP_001053153.1| Os04g0488600 [Oryza sativa Japonica Group]
 gi|38344242|emb|CAD41335.2| OJ991113_30.19 [Oryza sativa Japonica Group]
 gi|113564724|dbj|BAF15067.1| Os04g0488600 [Oryza sativa Japonica Group]
 gi|215707021|dbj|BAG93481.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737413|dbj|BAG96543.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 155

 Score =  211 bits (538), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 106/121 (87%), Positives = 111/121 (91%)

Query: 41  SSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTIS 100
           SS  F  +RAVQ N G RRLVDI+R +PELSRNYFRS SRRALFGGISLLGGFYVAQTIS
Sbjct: 35  SSYKFIKVRAVQGNDGRRRLVDIIRTIPELSRNYFRSRSRRALFGGISLLGGFYVAQTIS 94

Query: 101 LSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLA 160
           LSFGALGVNDVIAAV+CVLLTEYVT+FYYSRPKVTFP ALLNNFKMGFTYGLFIDAFKLA
Sbjct: 95  LSFGALGVNDVIAAVVCVLLTEYVTKFYYSRPKVTFPFALLNNFKMGFTYGLFIDAFKLA 154

Query: 161 S 161
           S
Sbjct: 155 S 155


>gi|125548804|gb|EAY94626.1| hypothetical protein OsI_16403 [Oryza sativa Indica Group]
 gi|125590821|gb|EAZ31171.1| hypothetical protein OsJ_15270 [Oryza sativa Japonica Group]
          Length = 190

 Score =  211 bits (536), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 106/121 (87%), Positives = 111/121 (91%)

Query: 41  SSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTIS 100
           SS  F  +RAVQ N G RRLVDI+R +PELSRNYFRS SRRALFGGISLLGGFYVAQTIS
Sbjct: 70  SSYKFIKVRAVQGNDGRRRLVDIIRTIPELSRNYFRSRSRRALFGGISLLGGFYVAQTIS 129

Query: 101 LSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLA 160
           LSFGALGVNDVIAAV+CVLLTEYVT+FYYSRPKVTFP ALLNNFKMGFTYGLFIDAFKLA
Sbjct: 130 LSFGALGVNDVIAAVVCVLLTEYVTKFYYSRPKVTFPFALLNNFKMGFTYGLFIDAFKLA 189

Query: 161 S 161
           S
Sbjct: 190 S 190


>gi|297795157|ref|XP_002865463.1| hypothetical protein ARALYDRAFT_917392 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311298|gb|EFH41722.1| hypothetical protein ARALYDRAFT_917392 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 159

 Score =  209 bits (532), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 103/118 (87%), Positives = 109/118 (92%), Gaps = 2/118 (1%)

Query: 46  RSIRAVQENGGP--RRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSF 103
           RSIRA+QE  GP  RRL+DI+R VPE+SRNYF+ PSRR LFGGISLLGGFYVAQTISLSF
Sbjct: 42  RSIRALQETEGPTPRRLIDIIRSVPEISRNYFKKPSRRTLFGGISLLGGFYVAQTISLSF 101

Query: 104 GALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           GALGVNDVIAAVLCVLLTEYVTRFYYSR  VTFP+ALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 102 GALGVNDVIAAVLCVLLTEYVTRFYYSRTTVTFPIALLNNFKMGFTYGLFIDAFKLAS 159


>gi|15239166|ref|NP_199119.1| uncharacterized protein [Arabidopsis thaliana]
 gi|9757831|dbj|BAB08268.1| unnamed protein product [Arabidopsis thaliana]
 gi|88900426|gb|ABD57525.1| At5g43050 [Arabidopsis thaliana]
 gi|332007521|gb|AED94904.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 158

 Score =  208 bits (529), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 103/118 (87%), Positives = 109/118 (92%), Gaps = 2/118 (1%)

Query: 46  RSIRAVQENGGP--RRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSF 103
           RSIRA+QE  GP  RRL+DI+R VPE+SRNYF+ PSRR LFGGISLLGGFYVAQTISLSF
Sbjct: 41  RSIRALQETEGPTPRRLIDIIRSVPEISRNYFKKPSRRTLFGGISLLGGFYVAQTISLSF 100

Query: 104 GALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           GALGVNDVIAAVLCVLLTEYVTRFYYSR  VTFP+ALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 101 GALGVNDVIAAVLCVLLTEYVTRFYYSRTTVTFPIALLNNFKMGFTYGLFIDAFKLAS 158


>gi|357164342|ref|XP_003580023.1| PREDICTED: ycf20-like protein-like [Brachypodium distachyon]
          Length = 155

 Score =  205 bits (522), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 102/122 (83%), Positives = 111/122 (90%)

Query: 40  SSSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTI 99
            SS     +RA+Q N G RRLVDI+R++PELSRNYFRS SRRALFGGI+LLGGFYVAQTI
Sbjct: 34  KSSYRLLKVRALQGNDGRRRLVDIIRIIPELSRNYFRSGSRRALFGGIALLGGFYVAQTI 93

Query: 100 SLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKL 159
           SLSFGAL VNDVIAAV+CVLLTEYVT+FYYSRPKVTFP+ALLNNFKMGFTYGLFIDAFKL
Sbjct: 94  SLSFGALAVNDVIAAVVCVLLTEYVTKFYYSRPKVTFPVALLNNFKMGFTYGLFIDAFKL 153

Query: 160 AS 161
           AS
Sbjct: 154 AS 155


>gi|294460167|gb|ADE75666.1| unknown [Picea sitchensis]
          Length = 200

 Score =  197 bits (502), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 96/130 (73%), Positives = 112/130 (86%)

Query: 32  GRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLG 91
           G+  I    S+    +I+AVQ +  PRRLVD ++++PE SRNYF+SP RRALFGGISLLG
Sbjct: 71  GKSCIMPPKSTFQSSNIKAVQGSRQPRRLVDAIKIIPEFSRNYFKSPFRRALFGGISLLG 130

Query: 92  GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYG 151
           GFYVAQTISLSFGALGVNDVIAAVLCV++TEY T+FYY+R KVTFP+ALLNNFKMGFTYG
Sbjct: 131 GFYVAQTISLSFGALGVNDVIAAVLCVIVTEYATKFYYTRSKVTFPVALLNNFKMGFTYG 190

Query: 152 LFIDAFKLAS 161
           LFIDAFKLA+
Sbjct: 191 LFIDAFKLAN 200


>gi|167999037|ref|XP_001752224.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696619|gb|EDQ82957.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 103

 Score =  167 bits (422), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 78/103 (75%), Positives = 92/103 (89%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RLVD+VR +P ++R YF+ P RRALFGGI++LGGFYVAQTISLSFGALGVNDVIAA +CV
Sbjct: 1   RLVDVVRRLPRIARTYFKDPWRRALFGGIAILGGFYVAQTISLSFGALGVNDVIAAAMCV 60

Query: 119 LLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           L TEY+TRF  +R K++F +AL+NNFKMGFTYGLFIDAFKLAS
Sbjct: 61  LFTEYITRFTRTRKKISFTVALVNNFKMGFTYGLFIDAFKLAS 103


>gi|302816212|ref|XP_002989785.1| hypothetical protein SELMODRAFT_49272 [Selaginella moellendorffii]
 gi|302816919|ref|XP_002990137.1| hypothetical protein SELMODRAFT_49273 [Selaginella moellendorffii]
 gi|300142150|gb|EFJ08854.1| hypothetical protein SELMODRAFT_49273 [Selaginella moellendorffii]
 gi|300142351|gb|EFJ09052.1| hypothetical protein SELMODRAFT_49272 [Selaginella moellendorffii]
          Length = 104

 Score =  162 bits (410), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 80/104 (76%), Positives = 93/104 (89%)

Query: 58  RRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLC 117
           +RL+D+V  VP  + NYFRSP RRALFGGISLLGGF+VAQTISLSFGA+GVNDVIA+ LC
Sbjct: 1   QRLLDLVGRVPGAATNYFRSPWRRALFGGISLLGGFFVAQTISLSFGAVGVNDVIASALC 60

Query: 118 VLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           VLLTEY+TRFY++R + +  +ALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 61  VLLTEYMTRFYFTRQRPSLGVALLNNFKMGFTYGLFIDAFKLAS 104


>gi|384248674|gb|EIE22157.1| hypothetical protein COCSUDRAFT_47768 [Coccomyxa subellipsoidea
           C-169]
          Length = 169

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 51/112 (45%), Positives = 63/112 (56%), Gaps = 2/112 (1%)

Query: 50  AVQENGGPR-RLVDIVRLVPELSRNYF-RSPSRRALFGGISLLGGFYVAQTISLSFGALG 107
           +VQ  G  R RL   +  VP     YF   P+RR L+  I+   G+Y A T+SLSFGAL 
Sbjct: 56  SVQARGSGRTRLATRITTVPRRVDLYFADQPARRILWACIAFACGYYCANTVSLSFGALA 115

Query: 108 VNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKL 159
           +NDV+AA L V   E V+  YY+ PK T  L  L  FK+G    L  DAFKL
Sbjct: 116 INDVVAAALTVAFCEVVSSLYYAAPKQTLKLLFLQCFKIGIICALIADAFKL 167


>gi|307110443|gb|EFN58679.1| hypothetical protein CHLNCDRAFT_140276 [Chlorella variabilis]
          Length = 117

 Score = 89.0 bits (219), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 52/117 (44%), Positives = 66/117 (56%), Gaps = 2/117 (1%)

Query: 46  RSIRAVQENGGPRRLVDIVRLVPE-LSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFG 104
           RSI  ++     RRL+  V   P+ L R    S  RR ++ GISL  GFY     +LSFG
Sbjct: 2   RSI-CIKARDRQRRLLSYVLSAPQRLDRYLESSWWRRWMWAGISLSAGFYAGNIATLSFG 60

Query: 105 ALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           AL VNDV+AAV+ ++  E VT  YYS  K +  +  LNNFKMG    +  DA KL S
Sbjct: 61  ALAVNDVLAAVITLMFCEVVTHLYYSTEKPSLFIWFLNNFKMGLIAAMLADAAKLGS 117


>gi|303271617|ref|XP_003055170.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463144|gb|EEH60422.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 194

 Score = 88.6 bits (218), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 51/109 (46%), Positives = 61/109 (55%), Gaps = 3/109 (2%)

Query: 56  GPR--RLVDIVRLVPELSRNYFR-SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVI 112
           GPR  R+  I+  +P     YFR    RR L+  IS   GFY A  ISL+FG LG+NDV+
Sbjct: 86  GPRETRIASILTTLPIRIELYFRVRKGRRTLWKAISAFAGFYTANVISLTFGVLGINDVV 145

Query: 113 AAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           A   CV   E VTR YY++ K       LN FK+G  Y L  DAFKL S
Sbjct: 146 AGAFCVGFFELVTRAYYNKIKPGKYWGFLNWFKLGLIYALVTDAFKLGS 194


>gi|412990920|emb|CCO18292.1| predicted protein [Bathycoccus prasinos]
          Length = 253

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 63/104 (60%), Gaps = 1/104 (0%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRAL-FGGISLLGGFYVAQTISLSFGALGVNDVIAAVLC 117
           R+  ++  +P+    YF+  S R L +  I +  GFY+A  I+LSFGALG+NDV+A  LC
Sbjct: 150 RIASVLTSLPQRLDLYFQRRSYRMLLWKTIFIFVGFYLASVITLSFGALGINDVVAGALC 209

Query: 118 VLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           V + E +TR+YY   +    L  +N FK+G  Y L  DAFKL S
Sbjct: 210 VFVYETITRWYYKTSRPGRKLEFVNCFKLGLVYSLIADAFKLGS 253


>gi|308807413|ref|XP_003081017.1| unnamed protein product [Ostreococcus tauri]
 gi|116059479|emb|CAL55186.1| unnamed protein product [Ostreococcus tauri]
          Length = 178

 Score = 84.3 bits (207), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 51/114 (44%), Positives = 66/114 (57%), Gaps = 5/114 (4%)

Query: 49  RAVQENGGPRRLVDIVRLVPELSRNYF-RSPSRRALFGGISLLGGFYVAQTISLSFGALG 107
           RA +E     RLV ++  +P     YF R   R  L+  IS+  GFY+A  ++LSFGALG
Sbjct: 69  RAERET----RLVSVLTSLPRRLELYFARRSYRVTLWRAISVFFGFYLANVMTLSFGALG 124

Query: 108 VNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           +NDV+A  L V   E VTR YYS  +    L  +N FK+G+ Y L  DAFKL S
Sbjct: 125 INDVVAGALSVAFYEIVTRIYYSTLQQNKWLEFVNWFKVGYCYSLIADAFKLGS 178


>gi|145350586|ref|XP_001419683.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579915|gb|ABO97976.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 104

 Score = 82.0 bits (201), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 47/104 (45%), Positives = 61/104 (58%), Gaps = 1/104 (0%)

Query: 59  RLVDIVRLVPELSRNYF-RSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLC 117
           RLV ++  +P     YF R   R  L+  IS   GFY+A  ++LSFGA+G+NDV+A  L 
Sbjct: 1   RLVSVLTSLPRRFNLYFQRRNYRVTLWRAISAYFGFYLANVMTLSFGAIGINDVVAGALS 60

Query: 118 VLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           V   E VTR YY+  +    L  LN FK+G+ Y L  DAFKL S
Sbjct: 61  VAFFEVVTRIYYASAETNKWLEFLNWFKVGYCYSLIADAFKLGS 104


>gi|255072633|ref|XP_002499991.1| predicted protein [Micromonas sp. RCC299]
 gi|226515253|gb|ACO61249.1| predicted protein [Micromonas sp. RCC299]
          Length = 181

 Score = 79.7 bits (195), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 62/165 (37%), Positives = 79/165 (47%), Gaps = 17/165 (10%)

Query: 4   ALMARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRS----IRAVQENGGPR- 58
           A  ARA+G       F+ G      ASL RR ++        R      RAV    GPR 
Sbjct: 27  ATSARALG-------FKGGEGMGQSASLARRQVAMVGPKVAGRGGAMVTRAVT---GPRE 76

Query: 59  -RLVDIVRLVPELSRNYFR-SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVL 116
            RL  I+  +P+    YFR    R+ L+  I    GFY A  +SL+FG LG+NDV+A   
Sbjct: 77  TRLGSILTTIPQRLDLYFRVRRGRKLLWKIIFAATGFYSANVLSLTFGVLGINDVVAGAF 136

Query: 117 CVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           C+   E VTR +Y +         LN FK+G  Y L  DAFKL S
Sbjct: 137 CLAFAEIVTRSFYKKTNPGKYWDFLNWFKLGLCYALITDAFKLGS 181


>gi|443475153|ref|ZP_21065112.1| protein of unknown function DUF565 [Pseudanabaena biceps PCC 7429]
 gi|443020076|gb|ELS34079.1| protein of unknown function DUF565 [Pseudanabaena biceps PCC 7429]
          Length = 117

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 45/113 (39%), Positives = 61/113 (53%), Gaps = 10/113 (8%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  IV +       +  +P RR     ISLL GF++A  IS S GA    D++AA + V
Sbjct: 5   RLSQIVNINIRRLSQWSSNPWRRTSLIAISLLLGFFLASVISTSTGAKSELDILAAAITV 64

Query: 119 LLTEYVTRFYYSRPKVT--------FPLA--LLNNFKMGFTYGLFIDAFKLAS 161
              E   RF YS+ ++T         PL   +LN+ K+G TYGLF++AFKL S
Sbjct: 65  FFVELTNRFVYSQKRLTRSDGTITPRPLTSEMLNSLKLGITYGLFLEAFKLGS 117


>gi|427707469|ref|YP_007049846.1| hypothetical protein Nos7107_2073 [Nostoc sp. PCC 7107]
 gi|427359974|gb|AFY42696.1| hypothetical protein Nos7107_2073 [Nostoc sp. PCC 7107]
          Length = 110

 Score = 65.5 bits (158), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 40/90 (44%), Positives = 49/90 (54%), Gaps = 3/90 (3%)

Query: 75  FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR--- 131
           FR+P RR     IS L GF++   IS   G     D+  A   VLLTE + R +YSR   
Sbjct: 21  FRNPWRRLSILIISFLFGFFLGTAISTIAGQKAELDIFVAAFLVLLTELINRIFYSRNLF 80

Query: 132 PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
            K    L  LN  K+GFTY LF++AFKL S
Sbjct: 81  AKRPLWLECLNTLKVGFTYSLFVEAFKLGS 110


>gi|443312485|ref|ZP_21042102.1| Protein of unknown function (DUF565) [Synechocystis sp. PCC 7509]
 gi|442777463|gb|ELR87739.1| Protein of unknown function (DUF565) [Synechocystis sp. PCC 7509]
          Length = 109

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 42/95 (44%), Positives = 55/95 (57%), Gaps = 3/95 (3%)

Query: 69  ELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFY 128
           +LSR  F +P RR     ISLL GF++   +S + G     D+IAA + V L+E V R Y
Sbjct: 16  QLSRWLF-NPWRRLSLLAISLLFGFFLGTAVSTTAGQAADWDIIAAGILVALSEIVDRIY 74

Query: 129 YSR--PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           Y R     +F +  LN  K+G TY LFI+AFKL S
Sbjct: 75  YRRRSGDRSFWVESLNALKIGLTYSLFIEAFKLGS 109


>gi|378787253|gb|AFC39884.1| Ycf20 [Porphyra umbilicalis]
          Length = 108

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/78 (47%), Positives = 51/78 (65%), Gaps = 1/78 (1%)

Query: 85  GGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYS-RPKVTFPLALLNN 143
           G ISLL GF+++ ++S   G  G   ++AA L V  TE V++  YS + K    + L+NN
Sbjct: 31  GLISLLLGFFISTSLSTIPGQTGDWGIVAASLIVAGTELVSKIVYSNKEKCVIKINLVNN 90

Query: 144 FKMGFTYGLFIDAFKLAS 161
           FK+G TYGLF+DAFKL S
Sbjct: 91  FKIGITYGLFVDAFKLGS 108


>gi|67920532|ref|ZP_00514052.1| Protein of unknown function DUF565 [Crocosphaera watsonii WH 8501]
 gi|67858016|gb|EAM53255.1| Protein of unknown function DUF565 [Crocosphaera watsonii WH 8501]
          Length = 110

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +V +        F +P RR     IS+L GF+V Q +S++ G     D+   +  +
Sbjct: 5   RLNTLVEVTQTKFNETFSNPWRRISLSLISVLLGFFVGQAVSITAGQQTYWDITVGIFLL 64

Query: 119 LLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
           + TE ++R  YSR K    +  L +LN FK+G TY L+++A KL S
Sbjct: 65  IFTEGISRITYSRSKKEGRSLSLDILNLFKIGVTYSLYVEALKLGS 110


>gi|218246762|ref|YP_002372133.1| hypothetical protein PCC8801_1939 [Cyanothece sp. PCC 8801]
 gi|257059804|ref|YP_003137692.1| hypothetical protein Cyan8802_1966 [Cyanothece sp. PCC 8802]
 gi|218167240|gb|ACK65977.1| protein of unknown function DUF565 [Cyanothece sp. PCC 8801]
 gi|256589970|gb|ACV00857.1| protein of unknown function DUF565 [Cyanothece sp. PCC 8802]
          Length = 111

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 54/91 (59%), Gaps = 4/91 (4%)

Query: 75  FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR--- 131
           F +P RR     ISLL GF+V Q ++ S G     D+  A   +L TE+++R +YSR   
Sbjct: 21  FSNPWRRIALSLISLLLGFFVGQAVTTSVGQSAYWDITVAGFFLLFTEFISRTFYSRLSI 80

Query: 132 -PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
             K  F L++LN FKMG  YGL+++A KL S
Sbjct: 81  NNKRLFWLSMLNTFKMGVIYGLYLEALKLGS 111


>gi|452824347|gb|EME31350.1| hypothetical protein isoform 1 [Galdieria sulphuraria]
          Length = 203

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/84 (40%), Positives = 52/84 (61%)

Query: 78  PSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP 137
           P ++ L G +++L GF  A +++L FGAL   D + A +  + TE  TR+YY     +F 
Sbjct: 120 PKKQLLSGLMAMLFGFTCASSLTLIFGALSFLDPLIAGVLAVCTELYTRYYYQESSPSFS 179

Query: 138 LALLNNFKMGFTYGLFIDAFKLAS 161
           L L+N FK+G   GL +DAFK++S
Sbjct: 180 LRLVNAFKIGVMTGLILDAFKISS 203


>gi|428298335|ref|YP_007136641.1| hypothetical protein Cal6303_1627 [Calothrix sp. PCC 6303]
 gi|428234879|gb|AFZ00669.1| protein of unknown function DUF565 [Calothrix sp. PCC 6303]
          Length = 109

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 56/105 (53%), Gaps = 2/105 (1%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL ++   +   ++  F +P RR     IS L GF++   +S + G     D++AA   +
Sbjct: 5   RLNNLFDAIARRTQEVFVNPWRRLSLLVISFLSGFFLGTAVSTTAGQKAELDIVAAAFIL 64

Query: 119 LLTEYVTRFYYSRPKVTFPLAL--LNNFKMGFTYGLFIDAFKLAS 161
           ++TE V+R YY R   T  L L  LN  K+G  Y LFI+A KL S
Sbjct: 65  IITETVSRIYYRRGNGTNSLGLEVLNTLKIGLMYSLFIEALKLGS 109


>gi|16329594|ref|NP_440322.1| hypothetical protein sll1509 [Synechocystis sp. PCC 6803]
 gi|383321335|ref|YP_005382188.1| hypothetical protein SYNGTI_0426 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383324505|ref|YP_005385358.1| hypothetical protein SYNPCCP_0426 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383490389|ref|YP_005408065.1| hypothetical protein SYNPCCN_0426 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384435655|ref|YP_005650379.1| hypothetical protein SYNGTS_0426 [Synechocystis sp. PCC 6803]
 gi|451813753|ref|YP_007450205.1| YCF20 protein [Synechocystis sp. PCC 6803]
 gi|6136503|sp|P72983.1|YC20L_SYNY3 RecName: Full=Ycf20-like protein
 gi|1652077|dbj|BAA17002.1| ycf20 [Synechocystis sp. PCC 6803]
 gi|339272687|dbj|BAK49174.1| hypothetical protein SYNGTS_0426 [Synechocystis sp. PCC 6803]
 gi|359270654|dbj|BAL28173.1| hypothetical protein SYNGTI_0426 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359273825|dbj|BAL31343.1| hypothetical protein SYNPCCN_0426 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359276995|dbj|BAL34512.1| hypothetical protein SYNPCCP_0426 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407957475|dbj|BAM50715.1| hypothetical protein BEST7613_1784 [Bacillus subtilis BEST7613]
 gi|451779722|gb|AGF50691.1| YCF20 protein [Synechocystis sp. PCC 6803]
          Length = 109

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 56/105 (53%), Gaps = 2/105 (1%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  IV +  +    +FR+P RR     +S L GF+V   ++ + G     DV+ A   +
Sbjct: 5   RLNTIVEVRGQQLSQFFRNPWRRISLSLLSFLFGFFVGTAVATTAGQNSQWDVVCAAFIL 64

Query: 119 LLTEYVTRFYYSRPKVTFPL--ALLNNFKMGFTYGLFIDAFKLAS 161
           L  E V R++Y R      L   +LN FKMG +Y LF++AFKL S
Sbjct: 65  LFCELVNRWFYRRGVKMGDLQAEVLNIFKMGVSYSLFLEAFKLGS 109


>gi|440682864|ref|YP_007157659.1| protein of unknown function DUF565 [Anabaena cylindrica PCC 7122]
 gi|428679983|gb|AFZ58749.1| protein of unknown function DUF565 [Anabaena cylindrica PCC 7122]
          Length = 110

 Score = 63.2 bits (152), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 58/106 (54%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL +++  +  L   +F +P RR     I+ L GF++   +S + G     D++ A + V
Sbjct: 5   RLNNLLDTIARLLGQWFVNPWRRLSLLVINFLFGFFLGSAVSTTTGQKAELDIVVAAVLV 64

Query: 119 LLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           LLTE  +R +YSR    K    +  LN  K+GF Y LF++AFKL S
Sbjct: 65  LLTEITSRIFYSRTFFAKQALWVEALNFLKVGFIYSLFLEAFKLGS 110


>gi|90994406|ref|YP_536896.1| hypothetical chloroplast protein 20 [Pyropia yezoensis]
 gi|122194735|sp|Q1XDS2.1|YCF20_PORYE RecName: Full=Uncharacterized protein ycf20
 gi|90818970|dbj|BAE92339.1| unnamed protein product [Pyropia yezoensis]
          Length = 108

 Score = 63.2 bits (152), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 55/97 (56%), Gaps = 2/97 (2%)

Query: 66  LVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT 125
           L+  L+   + S S     G ISLL GF+++  +S   G  G   +IAA L V  TE  +
Sbjct: 13  LIKNLNNKLYYSLSELTT-GLISLLLGFFISTGLSTIPGQTGDWGIIAASLIVAATELTS 71

Query: 126 RFYYSRPK-VTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           +  YS  K +   + L NNFK+G TYGLF+DAFKL S
Sbjct: 72  KIVYSSHKQLNIKINLFNNFKIGITYGLFVDAFKLGS 108


>gi|428201383|ref|YP_007079972.1| hypothetical protein Ple7327_0997 [Pleurocapsa sp. PCC 7327]
 gi|427978815|gb|AFY76415.1| Protein of unknown function (DUF565) [Pleurocapsa sp. PCC 7327]
          Length = 107

 Score = 63.2 bits (152), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 51/88 (57%)

Query: 74  YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPK 133
           +F +P RR     I +L GF+    +S + G     D+IAA   +L  E+V+R +Y R +
Sbjct: 20  FFSNPWRRISLILIGVLFGFFAGSALSTTAGQAAEWDIIAAGFVLLFVEFVSRIFYRRER 79

Query: 134 VTFPLALLNNFKMGFTYGLFIDAFKLAS 161
            +  L +LN FK+G  Y LF++AFKL S
Sbjct: 80  RSLFLEVLNFFKIGMIYSLFLEAFKLGS 107


>gi|428316563|ref|YP_007114445.1| protein of unknown function DUF565 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428240243|gb|AFZ06029.1| protein of unknown function DUF565 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 110

 Score = 63.2 bits (152), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 42/112 (37%), Positives = 60/112 (53%), Gaps = 6/112 (5%)

Query: 53  ENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVI 112
           +N    RL+D++    +  R +  +P RR     ISLL G ++   IS   G     D+I
Sbjct: 2   QNTRINRLIDVL---GDRFRGWLSNPWRRISLLIISLLFGTFLGTAISTIAGQSADWDII 58

Query: 113 AAVLCVLLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
           AA L VLLTE+  R  Y   +    +F + +LN  K+G TY LF++AFKL S
Sbjct: 59  AAGLLVLLTEFANRLVYGSLRSDDRSFWVQMLNALKIGLTYNLFVEAFKLGS 110


>gi|416377790|ref|ZP_11683661.1| hypothetical protein CWATWH0003_0506 [Crocosphaera watsonii WH
           0003]
 gi|357266150|gb|EHJ14818.1| hypothetical protein CWATWH0003_0506 [Crocosphaera watsonii WH
           0003]
          Length = 110

 Score = 63.2 bits (152), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +V +        F +P RR     IS+L GF+V Q +S++ G     D+   +  +
Sbjct: 5   RLNTLVEVTQTKFNETFSNPWRRISLSLISVLLGFFVGQAVSITAGQKTYWDINVGIFLL 64

Query: 119 LLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
           + TE ++R  YSR K    +  L +LN FK+G TY L+++A KL S
Sbjct: 65  IFTEGISRITYSRSKKEGRSLSLDILNLFKIGVTYSLYVEALKLGS 110


>gi|172037433|ref|YP_001803934.1| hypothetical protein cce_2520 [Cyanothece sp. ATCC 51142]
 gi|354553685|ref|ZP_08972991.1| protein of unknown function DUF565 [Cyanothece sp. ATCC 51472]
 gi|171698887|gb|ACB51868.1| hypothetical protein cce_2520 [Cyanothece sp. ATCC 51142]
 gi|353554402|gb|EHC23792.1| protein of unknown function DUF565 [Cyanothece sp. ATCC 51472]
          Length = 110

 Score = 62.0 bits (149), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 59/106 (55%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +V +        F +P RR     ISLL GF+V Q++S++ G     D+   +  +
Sbjct: 5   RLNTLVEVTQTKFNQTFSNPWRRISLSLISLLLGFFVGQSVSITAGQEAYWDITVGIFLL 64

Query: 119 LLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
           + TE ++R  YS+ K    +  L +LN FK+G TYGL+I+A KL S
Sbjct: 65  IFTEGISRIAYSQNKKKGRSLGLDILNLFKIGMTYGLYIEALKLGS 110


>gi|334118318|ref|ZP_08492408.1| protein of unknown function DUF565 [Microcoleus vaginatus FGP-2]
 gi|333460303|gb|EGK88913.1| protein of unknown function DUF565 [Microcoleus vaginatus FGP-2]
          Length = 110

 Score = 61.6 bits (148), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 42/113 (37%), Positives = 59/113 (52%), Gaps = 6/113 (5%)

Query: 52  QENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDV 111
            +N    RL+D++    +  R +  +P RR     ISLL G ++   IS   G     D+
Sbjct: 1   MQNTRINRLIDVL---GDRFRGWLSNPWRRISLLIISLLFGTFLGTAISTIAGQSADWDI 57

Query: 112 IAAVLCVLLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
           IAA L VLLTE V    Y  P+    +  + +LN  K+G TY LF++AFKL S
Sbjct: 58  IAAGLLVLLTELVNWLVYGGPRPATGSLWVEMLNALKIGLTYNLFVEAFKLGS 110


>gi|428223860|ref|YP_007107957.1| hypothetical protein GEI7407_0404 [Geitlerinema sp. PCC 7407]
 gi|427983761|gb|AFY64905.1| hypothetical protein GEI7407_0404 [Geitlerinema sp. PCC 7407]
          Length = 110

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 42/112 (37%), Positives = 60/112 (53%), Gaps = 6/112 (5%)

Query: 53  ENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVI 112
           +N     L+D+  L+ +L R +  +P RR     ISLL GF+    +S + G  G  DV+
Sbjct: 2   QNTRLSSLIDV--LLAQL-RRWLLNPWRRVSLVVISLLFGFFTGNALSTTTGQSGDIDVL 58

Query: 113 AAVLCVLLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
            A + V LTE V+ F Y R    +    +  LN FK+G  Y LF++AFKL S
Sbjct: 59  VAAITVSLTEAVSWFVYGRRWGERQPLLIDSLNAFKLGLIYSLFLEAFKLGS 110


>gi|414078888|ref|YP_006998206.1| hypothetical protein ANA_C13740 [Anabaena sp. 90]
 gi|413972304|gb|AFW96393.1| hypothetical protein ANA_C13740 [Anabaena sp. 90]
          Length = 110

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL +++  +      +F +P RR     IS L GF++   +S + G     D++ A   V
Sbjct: 5   RLNNLLETIFSSLSEWFLNPWRRLSLILISFLFGFFLGSAVSTTAGQKAELDIVVAGFLV 64

Query: 119 LLTEYVTRFYYSRP---KVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           LLTE  +R +YSR    +  F + +LN  K+GF Y LF++A KL S
Sbjct: 65  LLTEITSRIFYSRSFFVRQAFWVEILNYLKVGFIYSLFLEALKLGS 110


>gi|78211846|ref|YP_380625.1| hypothetical protein Syncc9605_0294 [Synechococcus sp. CC9605]
 gi|78196305|gb|ABB34070.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 110

 Score = 60.5 bits (145), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 36/88 (40%), Positives = 50/88 (56%), Gaps = 1/88 (1%)

Query: 73  NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEY-VTRFYYSR 131
           N+ R+P RR     I+ L GF +   I+   G LG  D +AA+L VL TE  + R   S 
Sbjct: 22  NWARNPWRRLSLLAIAGLIGFLIGTAITPVAGVLGQMDPVAALLVVLGTELTIRRQRSSE 81

Query: 132 PKVTFPLALLNNFKMGFTYGLFIDAFKL 159
           P +  P  LL+  ++GF YGLF++ FKL
Sbjct: 82  PSLKLPQQLLDLGRIGFLYGLFLEGFKL 109


>gi|434397744|ref|YP_007131748.1| hypothetical protein Sta7437_1212 [Stanieria cyanosphaera PCC 7437]
 gi|428268841|gb|AFZ34782.1| hypothetical protein Sta7437_1212 [Stanieria cyanosphaera PCC 7437]
          Length = 116

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/111 (38%), Positives = 59/111 (53%), Gaps = 12/111 (10%)

Query: 60  LVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVL 119
           LVD+V    EL    F +P RR     IS+L GF++   I+ + G     DVI A L + 
Sbjct: 9   LVDVVGARIEL---LFSNPWRRFALSIISILFGFFMGSAIATTAGQDASWDVIGAALILF 65

Query: 120 LTEYVTRFYYSRP-------KVTFPL--ALLNNFKMGFTYGLFIDAFKLAS 161
            TE ++ F Y RP        +  PL   ++N FK+G TY LF++AFKL S
Sbjct: 66  FTELLSNFVYRRPSRDRTETNLRRPLLVDVVNLFKIGLTYSLFLEAFKLGS 116


>gi|282899121|ref|ZP_06307102.1| protein of unknown function DUF565 [Cylindrospermopsis raciborskii
           CS-505]
 gi|281196037|gb|EFA70953.1| protein of unknown function DUF565 [Cylindrospermopsis raciborskii
           CS-505]
          Length = 110

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 56/106 (52%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL +++  +      +F +P RR     IS L GF++   +S + G     D+ AA + V
Sbjct: 5   RLNNLLDTIATNVAQWFINPWRRLCLLLISWLFGFFLGSVVSTTAGQQAQLDIWAAAVLV 64

Query: 119 LLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           LLTE  +R +YSR    +    +  LN  K+G  Y LF++AFKL S
Sbjct: 65  LLTEVASRLFYSRGFFSQRAIWVESLNWLKIGLIYSLFLEAFKLGS 110


>gi|282897978|ref|ZP_06305973.1| Protein of unknown function DUF565 [Raphidiopsis brookii D9]
 gi|281197122|gb|EFA72023.1| Protein of unknown function DUF565 [Raphidiopsis brookii D9]
          Length = 120

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 56/106 (52%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL +++  +      +F +P RR     IS L GF++   +S + G     D+ AA + V
Sbjct: 15  RLNNLLDTIATNVAQWFVNPWRRLCLLLISWLFGFFLGSVVSTTAGQQAQLDIWAAAVLV 74

Query: 119 LLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           LLTE  +R +YSR    +    +  LN  K+G  Y LF++AFKL S
Sbjct: 75  LLTEVASRLFYSRGFFSQRAIWVESLNWLKIGLIYSLFLEAFKLGS 120


>gi|409992567|ref|ZP_11275749.1| hypothetical protein APPUASWS_15812 [Arthrospira platensis str.
           Paraca]
 gi|409936561|gb|EKN78043.1| hypothetical protein APPUASWS_15812 [Arthrospira platensis str.
           Paraca]
          Length = 111

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/99 (37%), Positives = 58/99 (58%), Gaps = 4/99 (4%)

Query: 67  VPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTR 126
           + + +  +FR+P RR     ISLL GF++   I+   G +G  DV  A + +L+TE+++ 
Sbjct: 13  IAQQTLQWFRNPWRRISVLLISLLFGFFLGVAIATIVGQVGSWDVSVAFITLLVTEWISW 72

Query: 127 FYY---SRPKV-TFPLALLNNFKMGFTYGLFIDAFKLAS 161
             Y   SR ++ +  L LLN+ K+G TY LF+ AF L S
Sbjct: 73  LSYRKVSRSEINSLGLDLLNSLKIGMTYSLFLLAFLLGS 111


>gi|11465680|ref|NP_053824.1| ORF20 [Porphyra purpurea]
 gi|1723324|sp|P51214.1|YCF20_PORPU RecName: Full=Uncharacterized protein ycf20
 gi|1276680|gb|AAC08100.1| hypothetical chloroplast ORF 20 (chloroplast) [Porphyra purpurea]
          Length = 108

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 41/92 (44%), Positives = 53/92 (57%), Gaps = 3/92 (3%)

Query: 71  SRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYS 130
           SR Y+      A  G ISLL GF+++  +S   G  G   +IAA L V   E V++  YS
Sbjct: 19  SRLYYSLNELTA--GLISLLLGFFISTGLSTIPGQTGDWGIIAASLIVAAIELVSKIVYS 76

Query: 131 -RPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
            + K    + LLNN K+G TYGLF+DAFKL S
Sbjct: 77  NKKKYGVRINLLNNLKIGITYGLFVDAFKLGS 108


>gi|427720360|ref|YP_007068354.1| hypothetical protein Cal7507_5179 [Calothrix sp. PCC 7507]
 gi|427352796|gb|AFY35520.1| protein of unknown function DUF565 [Calothrix sp. PCC 7507]
          Length = 110

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 50/92 (54%), Gaps = 3/92 (3%)

Query: 73  NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR- 131
            +F +P RR     IS L GF++   +S + G     D++ A   V LTE  +R +Y+R 
Sbjct: 19  EWFTNPWRRLSLLVISFLFGFFLGTAVSTTAGQKADWDIVVAAFLVFLTEISSRIFYNRS 78

Query: 132 --PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
              K +  +  LN  K+GFTY LFI+A KL S
Sbjct: 79  FLSKRSLWVEALNLLKVGFTYSLFIEALKLGS 110


>gi|218437362|ref|YP_002375691.1| hypothetical protein PCC7424_0355 [Cyanothece sp. PCC 7424]
 gi|218170090|gb|ACK68823.1| protein of unknown function DUF565 [Cyanothece sp. PCC 7424]
          Length = 116

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 61/111 (54%), Gaps = 12/111 (10%)

Query: 60  LVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVL 119
           LVD +  V +L R ++R+P RR     I  LGG +VAQ +S + G     D++ A   +L
Sbjct: 9   LVDTI--VFKLQR-FYRNPWRRISVLVIGFLGGIFVAQALSTTGGQEADWDIVMATFVLL 65

Query: 120 LTEYVTRFYYSRPK-------VTFPL--ALLNNFKMGFTYGLFIDAFKLAS 161
            TE V+ F Y  P+       + + L    LN FK+G TY +F++AFKL S
Sbjct: 66  FTEGVSIFVYRLPRKQSSNLDIRYSLFTESLNMFKIGITYSMFLEAFKLGS 116


>gi|298493111|ref|YP_003723288.1| hypothetical protein Aazo_5069 ['Nostoc azollae' 0708]
 gi|298235029|gb|ADI66165.1| protein of unknown function DUF565 ['Nostoc azollae' 0708]
          Length = 110

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 56/106 (52%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL +++  +      +F +P RR     IS L GF++   +S + G     D++ A   V
Sbjct: 5   RLNNLLDTIARSLGQWFVNPWRRFSLLLISWLFGFFMGSAVSTTAGQKAELDIVVAAFLV 64

Query: 119 LLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           LLTE  +R +YSR    +    +  LN  K+GF Y LF++AFKL S
Sbjct: 65  LLTEITSRIFYSRNFFARQAIWVESLNLLKIGFIYSLFLEAFKLGS 110


>gi|434405265|ref|YP_007148150.1| Protein of unknown function (DUF565) [Cylindrospermum stagnale PCC
           7417]
 gi|428259520|gb|AFZ25470.1| Protein of unknown function (DUF565) [Cylindrospermum stagnale PCC
           7417]
          Length = 110

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 51/92 (55%), Gaps = 3/92 (3%)

Query: 73  NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR- 131
            +F +P RR     IS L GF++   +S + G     D++ A   VLLTE  +R +YSR 
Sbjct: 19  QWFINPWRRLSLLVISFLFGFFLGTAVSTTAGQKAELDIVVAGFLVLLTEVTSRIFYSRG 78

Query: 132 --PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
              + +  +  LN  K+GF Y LFI+AFKL S
Sbjct: 79  FLARRSLWVESLNLLKVGFIYSLFIEAFKLGS 110


>gi|126654810|ref|ZP_01726344.1| hypothetical protein CY0110_10252 [Cyanothece sp. CCY0110]
 gi|126623545|gb|EAZ94249.1| hypothetical protein CY0110_10252 [Cyanothece sp. CCY0110]
          Length = 110

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +V +        F +P RR     ISLL GF+V Q++S++ G     D+   +  +
Sbjct: 5   RLNTLVEVTQTKFNQTFSNPWRRISLSLISLLLGFFVGQSVSITAGQEAYWDITVGIFLL 64

Query: 119 LLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
           + TE ++R  YSR      +  L +LN FK+G TY L+++A KL S
Sbjct: 65  IFTEGISRIAYSRSSKKARSLSLDILNLFKIGITYSLYVEALKLGS 110


>gi|449020096|dbj|BAM83498.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 206

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 46/78 (58%)

Query: 83  LFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLN 142
            +G  +LL G + A +IS   GAL   D IA  + V + E++   YY RP+ +  + +LN
Sbjct: 128 FWGSCALLFGAFSATSISTIVGALAEWDPIAGAVIVFIMEWIATQYYRRPRPSNIIRILN 187

Query: 143 NFKMGFTYGLFIDAFKLA 160
            +++G T+G  +DA KLA
Sbjct: 188 AYQVGLTFGFILDALKLA 205


>gi|427730935|ref|YP_007077172.1| hypothetical protein Nos7524_3796 [Nostoc sp. PCC 7524]
 gi|427366854|gb|AFY49575.1| Protein of unknown function (DUF565) [Nostoc sp. PCC 7524]
          Length = 110

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 55/102 (53%), Gaps = 6/102 (5%)

Query: 66  LVPELSRN---YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTE 122
           L   ++RN   +F +P RR     IS L GF++   IS   G  G  D++ A   V+LTE
Sbjct: 9   LFDAIARNLGLWFLNPWRRLSLLVISFLFGFFLGTAISTIAGQRGTLDIMIAGFLVVLTE 68

Query: 123 YVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
             +R +YSR    +    +  LN  K+GF Y LFI+AFKL S
Sbjct: 69  VTSRIFYSRSFFARRELWVESLNLLKVGFIYSLFIEAFKLGS 110


>gi|291566840|dbj|BAI89112.1| hypothetical protein [Arthrospira platensis NIES-39]
          Length = 111

 Score = 58.5 bits (140), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/113 (34%), Positives = 61/113 (53%), Gaps = 7/113 (6%)

Query: 53  ENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVI 112
           +N    R  D +    + +  +FR+P RR     ISL  GF++   I+   G +G  DV 
Sbjct: 2   QNTRLNRFTDAI---AQQTLQWFRNPWRRISVLLISLFFGFFLGVAIATIVGQVGSWDVS 58

Query: 113 AAVLCVLLTEYVTRFYY---SRPKV-TFPLALLNNFKMGFTYGLFIDAFKLAS 161
            A + +L+TE+++   Y   SR ++ +  L LLN+ K+G TY LF+ AF L S
Sbjct: 59  VAFITLLVTEWISWLSYRKVSRSEINSLWLDLLNSMKIGMTYSLFLLAFLLGS 111


>gi|209527452|ref|ZP_03275957.1| protein of unknown function DUF565 [Arthrospira maxima CS-328]
 gi|376003361|ref|ZP_09781173.1| putative (predicted) allophycocyanin linker protein [Arthrospira
           sp. PCC 8005]
 gi|423066926|ref|ZP_17055716.1| hypothetical protein SPLC1_S530040 [Arthrospira platensis C1]
 gi|209492125|gb|EDZ92475.1| protein of unknown function DUF565 [Arthrospira maxima CS-328]
 gi|375328283|emb|CCE16926.1| putative (predicted) allophycocyanin linker protein [Arthrospira
           sp. PCC 8005]
 gi|406711212|gb|EKD06413.1| hypothetical protein SPLC1_S530040 [Arthrospira platensis C1]
          Length = 111

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/113 (34%), Positives = 60/113 (53%), Gaps = 7/113 (6%)

Query: 53  ENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVI 112
           +N    R  D +    + +  +FR+P RR     ISLL GF++   I+   G  G  DV 
Sbjct: 2   QNTRLNRFTDAI---AQQTLQWFRNPWRRISVLLISLLFGFFLGVAIATIVGQAGSWDVT 58

Query: 113 AAVLCVLLTEYVTRFYYSR-PKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
            A + +L+TE+++   Y + P+    +  L LLN+ K+G TY LF+ AF L S
Sbjct: 59  VAFITLLVTEWISWLSYRKVPRSEINSLWLDLLNSLKIGMTYSLFLLAFLLGS 111


>gi|443328441|ref|ZP_21057038.1| Protein of unknown function (DUF565) [Xenococcus sp. PCC 7305]
 gi|442791895|gb|ELS01385.1| Protein of unknown function (DUF565) [Xenococcus sp. PCC 7305]
          Length = 114

 Score = 57.8 bits (138), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 50/94 (53%), Gaps = 7/94 (7%)

Query: 75  FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV 134
           F +P RR     I+LL GF++   I  S G   V D+  A + ++  E  +R+ Y + K 
Sbjct: 21  FNNPWRRISLNIIALLMGFFMGSAIVSSAGQDAVWDITGAAVLLIFAELSSRWIYGQNKK 80

Query: 135 TFP-------LALLNNFKMGFTYGLFIDAFKLAS 161
           +         L +LN FK+G TY LF++AFKL S
Sbjct: 81  SSAIIRQSLFLDILNKFKIGITYSLFLEAFKLNS 114


>gi|427420155|ref|ZP_18910338.1| Protein of unknown function (DUF565) [Leptolyngbya sp. PCC 7375]
 gi|425762868|gb|EKV03721.1| Protein of unknown function (DUF565) [Leptolyngbya sp. PCC 7375]
          Length = 110

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 56/106 (52%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +V    +    +  +P RR     ISLL G + A  IS + G     D++ + + V
Sbjct: 5   RLSTLVDQAGQRFSQWVFNPWRRISLAVISLLFGNFFATAISATAGQTAEIDILISAILV 64

Query: 119 LLTEYVTRFYY---SRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
            + E+++  Y+   S  + T    LLN+FK+G  YGLF++AFKL S
Sbjct: 65  AMVEFISWLYHRSRSNGQRTILPELLNSFKLGMVYGLFVEAFKLGS 110


>gi|307153416|ref|YP_003888800.1| hypothetical protein Cyan7822_3584 [Cyanothece sp. PCC 7822]
 gi|306983644|gb|ADN15525.1| protein of unknown function DUF565 [Cyanothece sp. PCC 7822]
          Length = 116

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/99 (39%), Positives = 51/99 (51%), Gaps = 9/99 (9%)

Query: 72  RNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY-- 129
           + +  +P RR     I  LGGF+ AQ IS + G     DV+ A L +L TE  + F Y  
Sbjct: 18  QRFLSNPWRRISVLLIGFLGGFFAAQAISTTGGQKADLDVVMAALILLFTEGASIFVYRF 77

Query: 130 -SRP------KVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
             RP      +    +  LN FKMG  YGLF++AFKL S
Sbjct: 78  SGRPSKAGEVRQWLFVETLNLFKMGMIYGLFLEAFKLGS 116


>gi|186685288|ref|YP_001868484.1| hypothetical protein Npun_R5211 [Nostoc punctiforme PCC 73102]
 gi|186467740|gb|ACC83541.1| protein of unknown function DUF565 [Nostoc punctiforme PCC 73102]
          Length = 110

 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL +++  +      +F +P RR     IS L GF++   +S + G     D++ A   V
Sbjct: 5   RLNNLLDAIATRLGQWFLNPWRRLSLLIISFLFGFFLGNAVSTTAGQRAELDIVVAGFLV 64

Query: 119 LLTEYVTRFYYSRPKVT---FPLALLNNFKMGFTYGLFIDAFKLAS 161
           +LTE  +R +YS+   T     +  LN  K+GFTY LF++AFKL S
Sbjct: 65  VLTEVTSRIFYSQSFFTRRSLFVDSLNLLKVGFTYSLFLEAFKLGS 110


>gi|434393240|ref|YP_007128187.1| hypothetical protein Glo7428_2520 [Gloeocapsa sp. PCC 7428]
 gi|428265081|gb|AFZ31027.1| hypothetical protein Glo7428_2520 [Gloeocapsa sp. PCC 7428]
          Length = 110

 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/107 (38%), Positives = 57/107 (53%), Gaps = 5/107 (4%)

Query: 59  RLVDIVR-LVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLC 117
           RL D+   +  +L R +F +P R+     IS+L GF++   IS   G     D+I A + 
Sbjct: 5   RLNDLFNSMAGQLGR-WFFNPWRKLSLLVISVLFGFFLGSAISTIAGQAADWDIIGAGVL 63

Query: 118 VLLTEYVTRFYYS---RPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           V  +E V R YYS   R +    L  LN  K+G TY LF++AFKL S
Sbjct: 64  VAWSEIVDRIYYSRTWRKQRGLWLEALNAGKIGLTYSLFLEAFKLGS 110


>gi|427736638|ref|YP_007056182.1| hypothetical protein Riv7116_3162 [Rivularia sp. PCC 7116]
 gi|427371679|gb|AFY55635.1| Protein of unknown function (DUF565) [Rivularia sp. PCC 7116]
          Length = 110

 Score = 55.8 bits (133), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 48/91 (52%), Gaps = 3/91 (3%)

Query: 74  YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP- 132
           +F +P RR     IS L GF++    + + G   V D++ A   V LTE  +R  Y RP 
Sbjct: 20  WFLNPWRRISLLMISFLFGFFMGTVAATTAGQRAVLDIVVAAFLVFLTEIGSRIVYRRPP 79

Query: 133 --KVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
             +    +  LN  K+GF Y LF++AFKL S
Sbjct: 80  KERRLLWVESLNFLKVGFIYSLFVEAFKLGS 110


>gi|260436046|ref|ZP_05790016.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
 gi|260413920|gb|EEX07216.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
          Length = 134

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 49/87 (56%), Gaps = 1/87 (1%)

Query: 74  YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEY-VTRFYYSRP 132
           + R+P RR     ++ L GF +   I+   G LG  D IAA++ VL TE  + R   S P
Sbjct: 47  WARNPWRRFSLLALAGLIGFLIGSAITSVAGVLGQMDPIAALVVVLGTELTIRRRRSSEP 106

Query: 133 KVTFPLALLNNFKMGFTYGLFIDAFKL 159
            +  P  LL+  ++GF YGLF++ FKL
Sbjct: 107 SLKLPQQLLDLGRIGFLYGLFLEGFKL 133


>gi|428306677|ref|YP_007143502.1| hypothetical protein Cri9333_3154 [Crinalium epipsammum PCC 9333]
 gi|428248212|gb|AFZ13992.1| protein of unknown function DUF565 [Crinalium epipsammum PCC 9333]
          Length = 118

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/114 (35%), Positives = 59/114 (51%), Gaps = 11/114 (9%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL ++V ++    R +  +P RR     ISLL G ++   I  + G     DV++A + V
Sbjct: 5   RLNNLVDVILGQLRLWLSNPWRRLSLIVISLLLGIFLGTAIPTTSGQTAEWDVVSAGVLV 64

Query: 119 LLTEYVTRFYYSRPKVT-----------FPLALLNNFKMGFTYGLFIDAFKLAS 161
           + TE V+RF Y R +             F   LLN+FK+G  Y LF++AFKL S
Sbjct: 65  IFTEAVSRFVYGRNRRPPANEEALSRGLFVAELLNSFKIGLIYSLFVEAFKLGS 118


>gi|422304172|ref|ZP_16391521.1| Ycf20-like protein [Microcystis aeruginosa PCC 9806]
 gi|389790773|emb|CCI13392.1| Ycf20-like protein [Microcystis aeruginosa PCC 9806]
          Length = 113

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 52/107 (48%), Gaps = 4/107 (3%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +  +      ++F +P RR     I +L G +  Q I  + G     DV AA L V
Sbjct: 7   RLSTLANVTSSRFNSFFGNPWRRISLQIICVLFGIFSGQAIVTTAGQTAQWDVTAAGLLV 66

Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
           L TE ++R  Y +     P  +L    N  K+G TY LF++AFK+ S
Sbjct: 67  LFTEVISRIVYRKSSQAKPAPILRESFNLLKIGITYSLFLEAFKIGS 113


>gi|81299804|ref|YP_400012.1| hypothetical protein Synpcc7942_0995 [Synechococcus elongatus PCC
           7942]
 gi|81168685|gb|ABB57025.1| conserved hypothetical protein YCF20 [Synechococcus elongatus PCC
           7942]
          Length = 131

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 46/94 (48%), Gaps = 5/94 (5%)

Query: 73  NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY--- 129
            +F+ P RR     I LL GF++  T+SL  G  GV D   A +  +  E  +  YY   
Sbjct: 38  QWFQQPWRRLSLQIIFLLFGFWLTNTLSLVAGQSGVWDPSFAAVITISVETCSWLYYRGP 97

Query: 130 --SRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
              +P  + P  LL N K+G  YGL ++  KL S
Sbjct: 98  GAGKPDRSLPFLLLQNLKLGALYGLALETLKLGS 131


>gi|425442753|ref|ZP_18822990.1| Ycf20-like protein [Microcystis aeruginosa PCC 9717]
 gi|425466273|ref|ZP_18845576.1| Ycf20-like protein [Microcystis aeruginosa PCC 9809]
 gi|389716121|emb|CCH99606.1| Ycf20-like protein [Microcystis aeruginosa PCC 9717]
 gi|389831302|emb|CCI26073.1| Ycf20-like protein [Microcystis aeruginosa PCC 9809]
          Length = 113

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 52/107 (48%), Gaps = 4/107 (3%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +  +      ++F +P RR     I +L G +  Q I  + G     DV AA L V
Sbjct: 7   RLSTLANVTSSRFNSFFGNPWRRISLQIICVLFGIFSGQAIVTTAGQTAQWDVTAAGLLV 66

Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
           L TE ++R  Y +     P  +L    N  K+G TY LF++AFK+ S
Sbjct: 67  LFTEAISRIVYRKSSQAKPAPILRESFNLLKIGITYSLFLEAFKIGS 113


>gi|56750558|ref|YP_171259.1| hypothetical protein syc0549_c [Synechococcus elongatus PCC 6301]
 gi|56685517|dbj|BAD78739.1| hypothetical protein YCF20 [Synechococcus elongatus PCC 6301]
          Length = 112

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 46/94 (48%), Gaps = 5/94 (5%)

Query: 73  NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY--- 129
            +F+ P RR     I LL GF++  T+SL  G  GV D   A +  +  E  +  YY   
Sbjct: 19  QWFQQPWRRLSLQIIFLLFGFWLTNTLSLVAGQSGVWDPSFAAVITISVETCSWLYYRGP 78

Query: 130 --SRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
              +P  + P  LL N K+G  YGL ++  KL S
Sbjct: 79  GAGKPDRSLPFLLLQNLKLGALYGLALETLKLGS 112


>gi|166363096|ref|YP_001655369.1| hypothetical protein MAE_03550 [Microcystis aeruginosa NIES-843]
 gi|166085469|dbj|BAG00177.1| hypothetical protein MAE_03550 [Microcystis aeruginosa NIES-843]
          Length = 111

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 52/107 (48%), Gaps = 4/107 (3%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +  +      ++F +P RR     I +L G +  Q I  + G     DV AA L V
Sbjct: 5   RLSTLANVTSSRFNSFFGNPWRRISLQIICVLFGIFSGQAIVTTAGQTAQWDVTAAGLLV 64

Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
           L TE ++R  Y +     P  +L    N  K+G TY LF++AFK+ S
Sbjct: 65  LFTEAISRIVYRKSSQAKPAPILRESFNLLKIGITYSLFLEAFKIGS 111


>gi|300868146|ref|ZP_07112779.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300333849|emb|CBN57959.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 110

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 6/112 (5%)

Query: 53  ENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVI 112
           +N    RL+D++    E   ++ ++P RR     ISLL G ++   IS   G     D+ 
Sbjct: 2   QNTRLNRLIDVL---SEQLGDWLQNPWRRISLLIISLLFGSFLGGAISTIAGQAAEWDIF 58

Query: 113 AAVLCVLLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS 161
           AA L V LTE++    Y   ++   +  + +LN  K+G  Y LF++AFKL S
Sbjct: 59  AAGLLVALTEFINWIVYRGTRLVGRSLRIDILNCLKIGLIYSLFVEAFKLGS 110


>gi|302763017|ref|XP_002964930.1| hypothetical protein SELMODRAFT_68854 [Selaginella moellendorffii]
 gi|300167163|gb|EFJ33768.1| hypothetical protein SELMODRAFT_68854 [Selaginella moellendorffii]
          Length = 109

 Score = 54.3 bits (129), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 47/91 (51%), Gaps = 4/91 (4%)

Query: 75  FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV 134
           FR   R        LL GFY A  ++   G  G  DV+ A + V +TE +    Y  P V
Sbjct: 19  FREIRRNLPLKVFLLLLGFYSANALATVLGQTGDWDVLVAAIIVAVTEGIGFLMYKMPPV 78

Query: 135 T----FPLALLNNFKMGFTYGLFIDAFKLAS 161
           T    F + +LN +K G ++GLF+DAFKL S
Sbjct: 79  TEKLRFLVRMLNYWKAGLSFGLFVDAFKLGS 109


>gi|354568968|ref|ZP_08988128.1| protein of unknown function DUF565 [Fischerella sp. JSC-11]
 gi|353539180|gb|EHC08672.1| protein of unknown function DUF565 [Fischerella sp. JSC-11]
          Length = 110

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 55/106 (51%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL ++  ++      +  +P RR     IS L GF++   ++ + G     D++ A   V
Sbjct: 5   RLNNLFNVLTRRLAEWILNPWRRISILIISFLLGFFLGSAVATTAGQTAEWDIVVAGTLV 64

Query: 119 LLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           L TE  +R +Y+R    +  F +  +N  K+G TY LF++AFKL S
Sbjct: 65  LATEIASRIFYNRSVLARQVFWVQAINYLKVGLTYSLFLEAFKLGS 110


>gi|113477257|ref|YP_723318.1| hypothetical protein Tery_3800 [Trichodesmium erythraeum IMS101]
 gi|110168305|gb|ABG52845.1| protein of unknown function DUF565 [Trichodesmium erythraeum
           IMS101]
          Length = 110

 Score = 53.5 bits (127), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 56/106 (52%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +  ++ +    +  +P RR     ISLL G + A  +S   G  G  DV+  + C+
Sbjct: 5   RLNRLFNVILKRFEQWLVNPWRRISILLISLLFGNFAASAVSTIAGQEGYLDVLYTLACL 64

Query: 119 LLTEYVTRFYYSRP-KVT--FPLALLNNFKMGFTYGLFIDAFKLAS 161
            +TE +    Y R  KV     + +LN+ K+GF YGLF++AFKL S
Sbjct: 65  FITEALNWLVYGRRGKVARFLIIDILNSLKIGFIYGLFLEAFKLGS 110


>gi|428208508|ref|YP_007092861.1| hypothetical protein Chro_3535 [Chroococcidiopsis thermalis PCC
           7203]
 gi|428010429|gb|AFY88992.1| hypothetical protein Chro_3535 [Chroococcidiopsis thermalis PCC
           7203]
          Length = 110

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 55/106 (51%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           R+  ++  +   ++ +F +P RR     +SLL GF++    S   G     D+I A + V
Sbjct: 5   RINGLLGAIAAATQRWFFNPWRRLSVLIVSLLFGFFLGSAFSTIAGQAADWDIIGAGVLV 64

Query: 119 LLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
             +E + R YY+R    +    L  LN  K+G TY LF++AFKL S
Sbjct: 65  AGSEIIDRIYYNRIWQARRGLWLEALNALKIGLTYSLFLEAFKLGS 110


>gi|434389053|ref|YP_007099664.1| Protein of unknown function (DUF565) [Chamaesiphon minutus PCC
           6605]
 gi|428020043|gb|AFY96137.1| Protein of unknown function (DUF565) [Chamaesiphon minutus PCC
           6605]
          Length = 120

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 13/102 (12%)

Query: 73  NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR- 131
            + ++P RR     ISLL G ++A T+S + G     D+  A++ V + E V+   Y   
Sbjct: 19  QWGQNPWRRISLVIISLLSGNFLATTVSTTTGQKADLDITVALILVTIVETVSWIAYGSN 78

Query: 132 --PKVTFP----------LALLNNFKMGFTYGLFIDAFKLAS 161
             P+   P          +A+LN+ K+G  YGLF++AFKL S
Sbjct: 79  LGPRRPDPEAILGQRPLWIAILNSLKLGLIYGLFVEAFKLGS 120


>gi|119512830|ref|ZP_01631897.1| hypothetical protein N9414_02411 [Nodularia spumigena CCY9414]
 gi|119462503|gb|EAW43473.1| hypothetical protein N9414_02411 [Nodularia spumigena CCY9414]
          Length = 110

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 51/92 (55%), Gaps = 3/92 (3%)

Query: 73  NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYS-- 130
            +F +P RR     I+   GF++   IS   G  G+ D++ A + V+LTE  +R +YS  
Sbjct: 19  QWFLNPWRRLSLLLINFFFGFFLGTAISTIAGQRGLLDIVIAGILVVLTEVTSRIFYSPS 78

Query: 131 -RPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
              K +  +  LN  K+GF Y +F+++FKL S
Sbjct: 79  FLSKRSLLVESLNILKVGFIYSMFVESFKLGS 110


>gi|390437653|ref|ZP_10226185.1| Ycf20-like protein [Microcystis sp. T1-4]
 gi|389838936|emb|CCI30307.1| Ycf20-like protein [Microcystis sp. T1-4]
          Length = 113

 Score = 52.8 bits (125), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 52/107 (48%), Gaps = 4/107 (3%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +  +      ++F +P RR     I +L G +  Q I  + G     DV AA L +
Sbjct: 7   RLSTLANVTSSRFNSFFGNPWRRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLL 66

Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
           L TE ++R  Y +     P  +L    N  K+G TY LF++AFK+ S
Sbjct: 67  LFTEAISRIVYRKSAQAKPAPILRESFNLLKIGITYSLFLEAFKIGS 113


>gi|425446456|ref|ZP_18826460.1| Ycf20-like protein [Microcystis aeruginosa PCC 9443]
 gi|425456084|ref|ZP_18835795.1| Ycf20-like protein [Microcystis aeruginosa PCC 9807]
 gi|389733318|emb|CCI02893.1| Ycf20-like protein [Microcystis aeruginosa PCC 9443]
 gi|389802905|emb|CCI18105.1| Ycf20-like protein [Microcystis aeruginosa PCC 9807]
          Length = 113

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 4/107 (3%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +  +       +F +P RR     I +L G +  Q I  + G     DV AA L +
Sbjct: 7   RLSTLANVTSSRFNRFFGNPWRRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLL 66

Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
           L TE ++R  Y +     P  +L    N  K+G TY LF++AFK+ S
Sbjct: 67  LFTEAISRIVYRKSSQAKPAPILRESFNLLKIGITYSLFLEAFKIGS 113


>gi|428779733|ref|YP_007171519.1| hypothetical protein Dacsa_1477 [Dactylococcopsis salina PCC 8305]
 gi|428694012|gb|AFZ50162.1| Protein of unknown function (DUF565) [Dactylococcopsis salina PCC
           8305]
          Length = 112

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 53/96 (55%), Gaps = 5/96 (5%)

Query: 71  SRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYS 130
           ++ +F +P R+     +SLL G ++A  I  S G  G  D+I A + ++ TE ++ F Y 
Sbjct: 17  TQQFFLNPWRKITLLLLSLLSGIFIAVAIVTSAGQNGRLDIIVAAVLLIFTEGISWFVYR 76

Query: 131 RPKVTFP-----LALLNNFKMGFTYGLFIDAFKLAS 161
           R   T       L +LN FK+GFTY L+I AF L S
Sbjct: 77  RYSSTDERNLGFLEVLNTFKIGFTYSLYIQAFILGS 112


>gi|428220739|ref|YP_007104909.1| hypothetical protein Syn7502_00625 [Synechococcus sp. PCC 7502]
 gi|427994079|gb|AFY72774.1| Protein of unknown function (DUF565) [Synechococcus sp. PCC 7502]
          Length = 109

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 35/90 (38%), Positives = 49/90 (54%), Gaps = 2/90 (2%)

Query: 74  YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPK 133
           +F +P RR     IS L GF++A  IS   GA    D   A   +L+TE ++   Y R K
Sbjct: 20  WFSNPWRRISLYIISPLLGFFLASIISTVSGAKSAFDPYVAAALLLVTELISFVAYRRTK 79

Query: 134 V--TFPLALLNNFKMGFTYGLFIDAFKLAS 161
              +  L +LN  K+G  YG+F++AFKL S
Sbjct: 80  EKRSLYLEILNLLKIGLIYGMFLEAFKLGS 109


>gi|302809637|ref|XP_002986511.1| hypothetical protein SELMODRAFT_48834 [Selaginella moellendorffii]
 gi|300145694|gb|EFJ12368.1| hypothetical protein SELMODRAFT_48834 [Selaginella moellendorffii]
          Length = 109

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 46/91 (50%), Gaps = 4/91 (4%)

Query: 75  FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV 134
           FR   R        LL GFY A  ++   G  G  DV+ A + V + E +    Y  P V
Sbjct: 19  FREIRRNLPLKVFLLLLGFYSANALATVLGQTGDWDVLVAAIIVAVIEGIGFLMYKMPPV 78

Query: 135 T----FPLALLNNFKMGFTYGLFIDAFKLAS 161
           T    F + +LN +K G ++GLF+DAFKL S
Sbjct: 79  TEKLRFLVRMLNYWKAGLSFGLFVDAFKLGS 109


>gi|78185616|ref|YP_378050.1| hypothetical protein Syncc9902_2049 [Synechococcus sp. CC9902]
 gi|78169910|gb|ABB27007.1| conserved hypothetical protein [Synechococcus sp. CC9902]
          Length = 110

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 49/88 (55%), Gaps = 1/88 (1%)

Query: 73  NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEY-VTRFYYSR 131
            + ++P RR     I+L  GF +   ++   G L   D +A+++ VL TE+ + R   S 
Sbjct: 22  QWTQNPWRRLSLLAIALFSGFLLGSAVTSVAGVLSQMDPVASLIVVLGTEFTLRRVRRSG 81

Query: 132 PKVTFPLALLNNFKMGFTYGLFIDAFKL 159
              T P  LL+  ++GF YGLF++AFKL
Sbjct: 82  ASWTLPEQLLDLSRIGFLYGLFLEAFKL 109


>gi|425469331|ref|ZP_18848277.1| Ycf20-like protein [Microcystis aeruginosa PCC 9701]
 gi|389881508|emb|CCI37952.1| Ycf20-like protein [Microcystis aeruginosa PCC 9701]
          Length = 113

 Score = 52.4 bits (124), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 4/107 (3%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +  +      ++F +P RR     I +L G +  Q I  + G     DV AA L +
Sbjct: 7   RLSTLANVTSSRFNSFFGNPWRRISLQLICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLL 66

Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
           L TE  +R  Y +     P  +L    N  K+G TY LF++AFK+ S
Sbjct: 67  LFTEATSRIVYRKSSQAKPAPILRESLNLLKIGITYSLFLEAFKIGS 113


>gi|425462004|ref|ZP_18841478.1| Ycf20-like protein [Microcystis aeruginosa PCC 9808]
 gi|389825104|emb|CCI25466.1| Ycf20-like protein [Microcystis aeruginosa PCC 9808]
          Length = 113

 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 4/107 (3%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +  +      ++F +P RR     I +L G +  Q I  + G     DV AA L +
Sbjct: 7   RLSTLANVTSSRFNSFFGNPWRRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLL 66

Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
           L TE  +R  Y +     P  +L    N  K+G TY LF++AFK+ S
Sbjct: 67  LFTEATSRIVYRKSSQAKPAPILRESLNLLKIGITYSLFLEAFKIGS 113


>gi|440752489|ref|ZP_20931692.1| putative protein Ycf20 [Microcystis aeruginosa TAIHU98]
 gi|443648037|ref|ZP_21129867.1| putative protein Ycf20 [Microcystis aeruginosa DIANCHI905]
 gi|440176982|gb|ELP56255.1| putative protein Ycf20 [Microcystis aeruginosa TAIHU98]
 gi|443335303|gb|ELS49778.1| putative protein Ycf20 [Microcystis aeruginosa DIANCHI905]
          Length = 111

 Score = 52.0 bits (123), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 4/107 (3%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +  +      ++F +P RR     I +L G +  Q I  + G     DV AA L +
Sbjct: 5   RLSTLANVTSSRFNSFFGNPWRRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLL 64

Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
           L TE  +R  Y +     P  +L    N  K+G TY LF++AFK+ S
Sbjct: 65  LFTEATSRIVYRKSSQAKPAPILRESFNLLKIGITYSLFLEAFKIGS 111


>gi|425436198|ref|ZP_18816636.1| Ycf20-like protein [Microcystis aeruginosa PCC 9432]
 gi|425449828|ref|ZP_18829661.1| Ycf20-like protein [Microcystis aeruginosa PCC 7941]
 gi|159028644|emb|CAO88115.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|389679122|emb|CCH92059.1| Ycf20-like protein [Microcystis aeruginosa PCC 9432]
 gi|389769599|emb|CCI05595.1| Ycf20-like protein [Microcystis aeruginosa PCC 7941]
          Length = 113

 Score = 52.0 bits (123), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 4/107 (3%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +  +      ++F +P RR     I +L G +  Q I  + G     DV AA L +
Sbjct: 7   RLSTLANVTSSRFNSFFGNPWRRISLQIICVLFGVFSGQAIVTTAGQTAQWDVTAAGLLL 66

Query: 119 LLTEYVTRFYYSRPKVTFPLALL----NNFKMGFTYGLFIDAFKLAS 161
           L TE  +R  Y +     P  +L    N  K+G TY LF++AFK+ S
Sbjct: 67  LFTEATSRIVYRKSSQAKPAPILRESFNLLKIGITYSLFLEAFKIGS 113


>gi|220909685|ref|YP_002484996.1| hypothetical protein Cyan7425_4325 [Cyanothece sp. PCC 7425]
 gi|219866296|gb|ACL46635.1| protein of unknown function DUF565 [Cyanothece sp. PCC 7425]
          Length = 110

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 49/93 (52%), Gaps = 3/93 (3%)

Query: 72  RNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR 131
           R   R+P RR     +SLL G Y+   ++ + G LG  D++A+ L  L+ E ++  +YS 
Sbjct: 18  RQQLRNPWRRLSVLVMSLLFGVYLGTALASTAGQLGEIDIVASALVALVAEVISWLFYSD 77

Query: 132 P---KVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
               K T    +LN  K G  YGLF+ AF L S
Sbjct: 78  RWNFKATLFGEILNAVKFGVLYGLFLIAFLLGS 110


>gi|75907255|ref|YP_321551.1| hypothetical protein Ava_1032 [Anabaena variabilis ATCC 29413]
 gi|75700980|gb|ABA20656.1| Protein of unknown function DUF565 [Anabaena variabilis ATCC 29413]
          Length = 110

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 54/106 (50%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL ++   +      +F +P RR     IS L G ++   IS + G     D++ A   V
Sbjct: 5   RLNNLFDTIARQLGQWFLNPWRRLSLLLISFLFGIFLGTAISTTAGQRSELDIVIAAFLV 64

Query: 119 LLTEYVTRFYYSR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
            LTE  +R +Y +    + +  +  LN  K+GFTY LFI+AFKL S
Sbjct: 65  FLTEVTSRIFYMQGFFARRSLLVESLNLLKVGFTYSLFIEAFKLGS 110


>gi|427722143|ref|YP_007069420.1| hypothetical protein Lepto7376_0138 [Leptolyngbya sp. PCC 7376]
 gi|427353863|gb|AFY36586.1| hypothetical protein Lepto7376_0138 [Leptolyngbya sp. PCC 7376]
          Length = 120

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 48/101 (47%), Gaps = 13/101 (12%)

Query: 74  YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPK 133
           +F++P RR     ISLL G ++   +  + G     D +     ++  E V+R YYSRPK
Sbjct: 20  FFQNPWRRFALTIISLLFGNFIGTVVPTTSGQKAAWDPVVGAGILIFIELVSRIYYSRPK 79

Query: 134 V-------------TFPLALLNNFKMGFTYGLFIDAFKLAS 161
           +                  LLN FK+G  YGL ++A +L S
Sbjct: 80  LFQTSTDQDISKGNHLCFDLLNIFKIGLIYGLIVEALELGS 120


>gi|254417601|ref|ZP_05031338.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196175623|gb|EDX70650.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 118

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 59/122 (48%), Gaps = 18/122 (14%)

Query: 53  ENGGPRRLVDIVRLVPELSR--NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVND 110
           +N     LVD++     L+R   +F +P RR     + LL G ++   IS + G     D
Sbjct: 2   QNTRLNTLVDVI-----LARLAQWFTNPWRRLSIILLGLLSGIFLGSAISTTAGQTADWD 56

Query: 111 VIAAVLCVLLTEYVTRFYYSRPKVTFP-----------LALLNNFKMGFTYGLFIDAFKL 159
           VIAA + ++ TE V+R  Y R +   P           L  +N  K+G  Y +F++AFKL
Sbjct: 57  VIAAGILIIFTETVSRIVYGRNRQRLPSLSRASRLSWFLETVNALKIGLVYSMFLEAFKL 116

Query: 160 AS 161
            S
Sbjct: 117 GS 118


>gi|411119550|ref|ZP_11391930.1| Protein of unknown function (DUF565) [Oscillatoriales
           cyanobacterium JSC-12]
 gi|410711413|gb|EKQ68920.1| Protein of unknown function (DUF565) [Oscillatoriales
           cyanobacterium JSC-12]
          Length = 110

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 46/92 (50%), Gaps = 3/92 (3%)

Query: 73  NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP 132
           N+ ++P RR     ISLL G ++   IS   G     D++ A +   +TE V R  Y   
Sbjct: 19  NWLQNPWRRWSVLIISLLFGNFLGTAISTISGQQADLDILVAGILTGVTEVVNRIAYGTS 78

Query: 133 KVT---FPLALLNNFKMGFTYGLFIDAFKLAS 161
             T    P+  LN  K+G  Y LF++AFKL S
Sbjct: 79  AATRRSLPVQFLNALKVGVVYSLFVEAFKLGS 110


>gi|158337933|ref|YP_001519109.1| hypothetical protein AM1_4820 [Acaryochloris marina MBIC11017]
 gi|359463702|ref|ZP_09252265.1| hypothetical protein ACCM5_33604 [Acaryochloris sp. CCMEE 5410]
 gi|158308174|gb|ABW29791.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 110

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/107 (35%), Positives = 54/107 (50%), Gaps = 5/107 (4%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +V  + +  +   R+P RR     I+LL G Y+  + +   G L   D+  A L +
Sbjct: 5   RLNTLVSGLGQQLKQELRNPWRRISVLVIALLFGIYIGVSFAAIAGQLAYLDITVAALVL 64

Query: 119 LLTEYVTRFYYSRPKVTFPLAL----LNNFKMGFTYGLFIDAFKLAS 161
           L TE V  + + R +     AL    LN FK+G TYGLF+ AF L S
Sbjct: 65  LFTE-VVNWIFHRNRGNARKALWGEALNTFKLGLTYGLFVIAFMLGS 110


>gi|284928790|ref|YP_003421312.1| hypothetical protein UCYN_02050 [cyanobacterium UCYN-A]
 gi|284809249|gb|ADB94954.1| Protein of unknown function (DUF565) [cyanobacterium UCYN-A]
          Length = 110

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 47/95 (49%), Gaps = 4/95 (4%)

Query: 70  LSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY 129
           LSR  F +P R       S+L GF++ Q I+L  G     ++   ++ ++ TE  ++  Y
Sbjct: 17  LSRT-FSNPWRNLFLNFTSILVGFFIGQAIALIAGQESYWEITVGIILLVFTEVSSKIIY 75

Query: 130 SR---PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
            R    K    L   N FKMG  Y L+I+A KL S
Sbjct: 76  GRKRTKKKILWLETFNFFKMGIVYSLYIEALKLGS 110


>gi|302790716|ref|XP_002977125.1| hypothetical protein SELMODRAFT_59700 [Selaginella moellendorffii]
 gi|300155101|gb|EFJ21734.1| hypothetical protein SELMODRAFT_59700 [Selaginella moellendorffii]
          Length = 109

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 46/91 (50%), Gaps = 4/91 (4%)

Query: 75  FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV 134
           FR   R        LL GFY A  ++   G  G  D++ A + V + E +    Y  P V
Sbjct: 19  FREIRRNLPLKVFLLLLGFYSANALATVLGQTGDWDMLVAAIIVAVIEGIGFLMYKMPPV 78

Query: 135 T----FPLALLNNFKMGFTYGLFIDAFKLAS 161
           T    F + +LN +K G ++GLF+DAFK+ S
Sbjct: 79  TEKLRFLVRMLNYWKAGLSFGLFVDAFKVGS 109


>gi|119485209|ref|ZP_01619594.1| hypothetical protein L8106_07164 [Lyngbya sp. PCC 8106]
 gi|119457437|gb|EAW38562.1| hypothetical protein L8106_07164 [Lyngbya sp. PCC 8106]
          Length = 110

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 54/106 (50%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  +  ++ E    + R+P RR     IS+L G +    IS   G     DV AA + +
Sbjct: 5   RLNRLTNVLSERLGRWLRNPWRRISLLTISVLFGSFSGIVISTIAGQQAEWDVTAAFILL 64

Query: 119 LLTEYVTRFYY-SRPKVTFPLAL--LNNFKMGFTYGLFIDAFKLAS 161
           L+TE +    Y S+P +  PL +  LN FK+G  Y LF+ AF L S
Sbjct: 65  LITEGINWLTYRSKPLIRQPLWIENLNAFKIGVVYSLFLLAFLLGS 110


>gi|224101247|ref|XP_002312200.1| predicted protein [Populus trichocarpa]
 gi|222852020|gb|EEE89567.1| predicted protein [Populus trichocarpa]
          Length = 213

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 42/79 (53%), Gaps = 6/79 (7%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP------KVTFPLALLN 142
           LL GFY A  ++   G  G  DV+ A + V   E +    Y +P      ++ + +A++N
Sbjct: 135 LLLGFYTANALATILGQTGDWDVLVAGIVVAAIEGIGMLMYKKPSSLSTRRLQYFVAMMN 194

Query: 143 NFKMGFTYGLFIDAFKLAS 161
            +K G   GLF+DAFKL S
Sbjct: 195 YWKAGVCLGLFVDAFKLGS 213


>gi|17230364|ref|NP_486912.1| hypothetical protein alr2872 [Nostoc sp. PCC 7120]
 gi|17131966|dbj|BAB74571.1| alr2872 [Nostoc sp. PCC 7120]
          Length = 110

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 48/92 (52%), Gaps = 3/92 (3%)

Query: 73  NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR- 131
            +F +P RR     IS L G ++   IS + G     D++ A   V LTE  +R +Y + 
Sbjct: 19  QWFLNPWRRLSLLLISFLFGIFLGTAISTTAGQRSELDIVIAAFLVFLTEVTSRIFYMQG 78

Query: 132 --PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
              + +  +  LN  K+GF Y LFI+AFKL S
Sbjct: 79  FFARRSLLVESLNLLKVGFIYSLFIEAFKLGS 110


>gi|291335270|gb|ADD94889.1| hypothetical protein BL107_08104 [uncultured marine bacterium
           MedDCM-OCT-S09-C166]
          Length = 80

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 40/69 (57%), Gaps = 1/69 (1%)

Query: 92  GFYVAQTISLSFGALGVNDVIAAVLCVLLTEY-VTRFYYSRPKVTFPLALLNNFKMGFTY 150
           GF +   I+   G L   D +AA+L VL TE+ + R   S P +  P  LL+  ++GF Y
Sbjct: 11  GFLIGSAITSVSGVLAQMDPVAALLVVLGTEFTIRRRRSSEPSLKLPQQLLDLGRIGFLY 70

Query: 151 GLFIDAFKL 159
           GLF++ FKL
Sbjct: 71  GLFLEGFKL 79


>gi|160331763|ref|XP_001712588.1| hypothetical protein HAN_3g461 [Hemiselmis andersenii]
 gi|159766037|gb|ABW98263.1| hypothetical protein HAN_3g461 [Hemiselmis andersenii]
          Length = 146

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 36/57 (63%)

Query: 104 GALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLA 160
           G++   D +AA + +   E  T+ +YS  K+T  L LLN+FK+G   G+F+DAFK+ 
Sbjct: 89  GSVADWDPLAAAVLLCWIELFTKLFYSSEKLTNFLKLLNSFKIGIILGMFVDAFKIT 145


>gi|428773704|ref|YP_007165492.1| hypothetical protein Cyast_1888 [Cyanobacterium stanieri PCC 7202]
 gi|428687983|gb|AFZ47843.1| protein of unknown function (DUF565) [Cyanobacterium stanieri PCC
           7202]
          Length = 109

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 45/91 (49%), Gaps = 2/91 (2%)

Query: 73  NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP 132
           ++F +P RR     I+ L GF++A T +   G  G  DV  A   ++ TE      Y   
Sbjct: 19  DFFSNPWRRLSLNIIAFLLGFFLASTTAAIAGQAGKWDVTYAFFFLVFTEGSNIIIYKNR 78

Query: 133 KVTFPL--ALLNNFKMGFTYGLFIDAFKLAS 161
               PL    LN FK+GF Y L+++ FKL S
Sbjct: 79  NPNKPLWRTTLNAFKIGFAYCLYLEGFKLGS 109


>gi|434389388|ref|YP_007099999.1| Protein of unknown function (DUF565) [Chamaesiphon minutus PCC
           6605]
 gi|428020378|gb|AFY96472.1| Protein of unknown function (DUF565) [Chamaesiphon minutus PCC
           6605]
          Length = 121

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 54/111 (48%), Gaps = 15/111 (13%)

Query: 65  RLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYV 124
           RL  +  R +  +P RR     ISLL G ++A  I+ S G     DV A++  V + E +
Sbjct: 12  RLTRQFGR-WVENPWRRLSIIIISLLFGNFLASAIATSTGQRANLDVFASLTLVAIVESI 70

Query: 125 TRFYYS----------RPKVT----FPLALLNNFKMGFTYGLFIDAFKLAS 161
               Y           R ++       +A+LN+ K+G  YGLF++AFKL S
Sbjct: 71  GWLTYGTIFGRRSTEGRDQIIGQRPISIAILNSLKLGLMYGLFVEAFKLGS 121


>gi|22298711|ref|NP_681958.1| hypothetical protein tlr1168 [Thermosynechococcus elongatus BP-1]
 gi|22294892|dbj|BAC08720.1| ycf20 [Thermosynechococcus elongatus BP-1]
          Length = 110

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 3/93 (3%)

Query: 72  RNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYS- 130
           +   ++P RR     I+ L G ++   IS S G LG  D+IA+ +  +  E ++  +YS 
Sbjct: 18  QEQLKNPWRRLATLSIAFLFGVFLGLAISSSAGQLGYLDIIASSMVAIAAEVISALFYSD 77

Query: 131 RPKVTFPL--ALLNNFKMGFTYGLFIDAFKLAS 161
           R K+   L   +LN  K G  YGLF+ AF L S
Sbjct: 78  RWKLRQTLFGEMLNALKFGLLYGLFLVAFLLGS 110


>gi|168010839|ref|XP_001758111.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690567|gb|EDQ76933.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 129

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 43/79 (54%), Gaps = 6/79 (7%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR------PKVTFPLALLN 142
           LL GF+ A  ++   G  G  DVIAA + V + E V    Y+R       +    + L+N
Sbjct: 51  LLFGFFSANALATILGQTGDWDVIAAGILVAIIEGVGHLVYNRMPAFLGERGKPIIELVN 110

Query: 143 NFKMGFTYGLFIDAFKLAS 161
            +K+GF + LF+DAFK+ S
Sbjct: 111 YWKIGFEFALFVDAFKVGS 129


>gi|116072099|ref|ZP_01469367.1| hypothetical protein BL107_08104 [Synechococcus sp. BL107]
 gi|116065722|gb|EAU71480.1| hypothetical protein BL107_08104 [Synechococcus sp. BL107]
          Length = 80

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/74 (39%), Positives = 44/74 (59%), Gaps = 1/74 (1%)

Query: 87  ISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEY-VTRFYYSRPKVTFPLALLNNFK 145
           I+LL GF +   ++   G L   D +A+++ VL TE+ + R   S    T P  LL+  +
Sbjct: 6   IALLSGFLLGSAVTSVAGVLSQMDPVASLIVVLGTEFTLRRVRRSGGSWTLPEQLLDLSR 65

Query: 146 MGFTYGLFIDAFKL 159
           +GF YGLF++AFKL
Sbjct: 66  IGFLYGLFLEAFKL 79


>gi|428776926|ref|YP_007168713.1| hypothetical protein PCC7418_2350 [Halothece sp. PCC 7418]
 gi|428691205|gb|AFZ44499.1| hypothetical protein PCC7418_2350 [Halothece sp. PCC 7418]
          Length = 112

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/95 (34%), Positives = 51/95 (53%), Gaps = 5/95 (5%)

Query: 72  RNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR 131
           R +F +P R+     ISLL G ++   I  S G  G  D+I A L ++LTE ++ F Y +
Sbjct: 18  RQFFVNPWRKITSLLISLLLGIFMGIAIVTSAGQNGRLDIIVAALLLILTEVLSWFVYRQ 77

Query: 132 -----PKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
                 + +  + ++N FK+G  Y L+I AF L S
Sbjct: 78  NLEQNEQRSGLIEVMNTFKIGLVYSLYIQAFVLGS 112


>gi|443323738|ref|ZP_21052741.1| Protein of unknown function (DUF565) [Gloeocapsa sp. PCC 73106]
 gi|442786524|gb|ELR96254.1| Protein of unknown function (DUF565) [Gloeocapsa sp. PCC 73106]
          Length = 107

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 52/103 (50%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL ++V  +    +N+  +P R  +    ++  GF+    I  + G   + DV +A   +
Sbjct: 5   RLDNLVNALGLRLQNFLANPWRNIVLSLTTVFFGFFSGTAIISTAGQRSLMDVSSAASLL 64

Query: 119 LLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           ++ E + R+ Y   K  F   +LN FK+G TY +F+ AF + S
Sbjct: 65  VVFELINRWVYQSKKPRFWTNILNIFKIGVTYSIFLQAFIVGS 107


>gi|428211916|ref|YP_007085060.1| hypothetical protein Oscil6304_1433 [Oscillatoria acuminata PCC
           6304]
 gi|428000297|gb|AFY81140.1| Protein of unknown function (DUF565) [Oscillatoria acuminata PCC
           6304]
          Length = 110

 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 35/98 (35%), Positives = 50/98 (51%), Gaps = 4/98 (4%)

Query: 67  VPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTR 126
           V EL R   R+P RR     IS L G +V   I+   G     DV+ A + ++LTE  + 
Sbjct: 14  VSEL-RQSLRNPWRRISVVLISFLLGNFVGTAITTIAGQNANLDVMVAAVLIVLTEVYSY 72

Query: 127 FYY---SRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           F+Y    + K +     +N+FK+G  Y LF+ AF L S
Sbjct: 73  FFYRIHDQMKRSLLAQGINSFKIGMIYNLFVLAFTLGS 110


>gi|428768705|ref|YP_007160495.1| hypothetical protein Cyan10605_0303 [Cyanobacterium aponinum PCC
           10605]
 gi|428682984|gb|AFZ52451.1| protein of unknown function DUF565 [Cyanobacterium aponinum PCC
           10605]
          Length = 110

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 48/94 (51%), Gaps = 5/94 (5%)

Query: 72  RNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY-- 129
           +++  +P R+  F  I  L G+  +  +S S    G  DV  A   ++ TE  +   Y  
Sbjct: 18  KSFLNNPWRKLSFVIIGFLSGYITSDLLSTSLAQAGKWDVPMATTYLIFTEITSMIVYGG 77

Query: 130 --SRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
             +R K+ +   LLN+FK+G  +GL++ A  LAS
Sbjct: 78  SKNRNKIMWA-DLLNSFKIGLAFGLYLSAMTLAS 110


>gi|427713331|ref|YP_007061955.1| hypothetical protein Syn6312_2297 [Synechococcus sp. PCC 6312]
 gi|427377460|gb|AFY61412.1| Protein of unknown function (DUF565) [Synechococcus sp. PCC 6312]
          Length = 110

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 3/106 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  ++    +  +   R+P RR     I+LL G Y+   ++ + G +G  D++A+ L  
Sbjct: 5   RLTTLLNRTGDNLQQQLRNPWRRLALMMIALLFGIYLGAALASTAGQVGYIDIVASALVA 64

Query: 119 LLTEYVTRFYYSRP---KVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           + +E ++  +Y      + T     LN  K+G  YGLF+ AF L S
Sbjct: 65  IASEVLSAVFYGNRWNFQKTILGETLNAIKVGVLYGLFLIAFLLGS 110


>gi|18408276|ref|NP_564851.1| Ycf20-like protein [Arabidopsis thaliana]
 gi|209572697|sp|O80813.2|YC20L_ARATH RecName: Full=Ycf20-like protein
 gi|107738233|gb|ABF83666.1| At1g65420 [Arabidopsis thaliana]
 gi|332196250|gb|AEE34371.1| Ycf20-like protein [Arabidopsis thaliana]
          Length = 197

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 7/80 (8%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP-------KVTFPLALL 141
           LL GFY A  ++   G  G  DV+ A + V   E +    Y +P       K+   +  +
Sbjct: 118 LLLGFYTANALATILGQTGDWDVLVAGIVVAAIEGIGMLMYKKPSSSMFSGKLQSFVVFM 177

Query: 142 NNFKMGFTYGLFIDAFKLAS 161
           N +K G   GLF+DAFKL S
Sbjct: 178 NFWKAGVCLGLFVDAFKLGS 197


>gi|33864834|ref|NP_896393.1| hypothetical protein SYNW0298 [Synechococcus sp. WH 8102]
 gi|33632357|emb|CAE06813.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
          Length = 107

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 47/94 (50%), Gaps = 2/94 (2%)

Query: 66  LVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT 125
           LV +L   +  +P RR     I+ L GF V   I+   G LG  D + A++ VL TE   
Sbjct: 15  LVQQL-ETWAGNPWRRLSVLCIAALFGFLVGSAITSVAGVLGQMDPVGALIVVLGTEATV 73

Query: 126 RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKL 159
           R     P  T    +L   ++G  YGLF++AFKL
Sbjct: 74  RLRIVGPS-TLLQQVLTVSRIGLLYGLFLEAFKL 106


>gi|21593789|gb|AAM65756.1| antigen receptor, putative [Arabidopsis thaliana]
          Length = 197

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 7/80 (8%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP-------KVTFPLALL 141
           LL GFY A  ++   G  G  DV+ A + V   E +    Y +P       K+   +  +
Sbjct: 118 LLLGFYTANALATILGQTGDWDVLVAGIVVAAIEGIGMLMYKKPSSSMFSGKLQSFVVFM 177

Query: 142 NNFKMGFTYGLFIDAFKLAS 161
           N +K G   GLF+DAFKL S
Sbjct: 178 NFWKAGVCLGLFVDAFKLGS 197


>gi|297841047|ref|XP_002888405.1| hypothetical protein ARALYDRAFT_894093 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334246|gb|EFH64664.1| hypothetical protein ARALYDRAFT_894093 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 198

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 7/80 (8%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP-------KVTFPLALL 141
           LL GFY A  ++   G  G  DV+ A + V   E +    Y +P       K+   +  +
Sbjct: 119 LLLGFYTANALATILGQTGDWDVLVAGIVVAAIEGIGMLMYKKPSSSMFSGKLQSFVVFM 178

Query: 142 NNFKMGFTYGLFIDAFKLAS 161
           N +K G   GLF+DAFKL S
Sbjct: 179 NFWKAGVCLGLFVDAFKLGS 198


>gi|71842309|ref|YP_277397.1| hypothetical chloroplast RF20 [Emiliania huxleyi]
 gi|60101552|gb|AAX13896.1| hypothetical chloroplast RF20 [Emiliania huxleyi]
 gi|336286220|gb|AEI29556.1| hypothetical chloroplast RF20 [Emiliania huxleyi]
          Length = 90

 Score = 45.1 bits (105), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 39/81 (48%), Gaps = 10/81 (12%)

Query: 91  GGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYS----------RPKVTFPLAL 140
           GGF +A     + G      ++ A L V   E  ++ YYS          +   +  L L
Sbjct: 10  GGFLLASIFDTTMGEFQDWSILGAALTVASVETASKIYYSFISRVQLLSSKTYYSETLFL 69

Query: 141 LNNFKMGFTYGLFIDAFKLAS 161
           LN+FK+G  YGL +DAFKL S
Sbjct: 70  LNSFKLGLIYGLIVDAFKLGS 90


>gi|108773201|ref|YP_635793.1| hypothetical protein RF20 [Chara vulgaris]
 gi|108773301|ref|YP_635808.1| hypothetical protein RF20 [Chara vulgaris]
 gi|77157857|gb|ABA61898.1| hypothetical protein RF20 [Chara vulgaris]
 gi|77157957|gb|ABA61998.1| hypothetical protein RF20 [Chara vulgaris]
          Length = 137

 Score = 44.7 bits (104), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 44/84 (52%), Gaps = 11/84 (13%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPK--VTFPLA------- 139
           LL GF+ A T+S     +   D++ AV+  ++ E +    Y  P   + F +        
Sbjct: 54  LLLGFFHANTLSTILAQVADWDILIAVILSVMVESIGLLIYQIPSRILLFTICKERRKKS 113

Query: 140 --LLNNFKMGFTYGLFIDAFKLAS 161
             LLN +K+GFTY LF+D+FK+ S
Sbjct: 114 LFLLNVWKIGFTYALFVDSFKVGS 137


>gi|87301895|ref|ZP_01084729.1| hypothetical protein WH5701_01180 [Synechococcus sp. WH 5701]
 gi|87283463|gb|EAQ75418.1| hypothetical protein WH5701_01180 [Synechococcus sp. WH 5701]
          Length = 135

 Score = 44.3 bits (103), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 52/104 (50%), Gaps = 3/104 (2%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL   V L+ E  R++  +P RR     I+  G F+    + +  GAL   D +AA++CV
Sbjct: 31  RLHRQVALLGERLRHWAENPWRRLSLVLITGFGAFFFGGAVGMLTGALNYLDPLAALVCV 90

Query: 119 LLTEY---VTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKL 159
           +  E      R+   RP+    L L++  ++G  YGL  + FKL
Sbjct: 91  VPIELSIRCRRWLMQRPQDRLSLQLVDAGRIGLLYGLLQEGFKL 134


>gi|11466439|ref|NP_038445.1| hypothetical chloroplast RF20 [Mesostigma viride]
 gi|12230812|sp|Q9MUL5.1|YCF20_MESVI RecName: Full=Uncharacterized protein ycf20; Short=RF20
 gi|7259582|gb|AAF43883.1|AF166114_95 hypothetical chloroplast RF20 (chloroplast) [Mesostigma viride]
          Length = 116

 Score = 44.3 bits (103), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 42/79 (53%), Gaps = 9/79 (11%)

Query: 92  GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP----KVTFPLALL-----N 142
           GF++A  ++  FG  G  DV+ A + V + E +    YS+     K  F ++ L     N
Sbjct: 38  GFFIATVLATVFGQTGDWDVLVAGILVAMIEILGNKMYSKKYISKKQVFDISFLSLIGIN 97

Query: 143 NFKMGFTYGLFIDAFKLAS 161
             K+G  +GLF+DAFKL S
Sbjct: 98  YIKIGLIFGLFVDAFKLGS 116


>gi|3335348|gb|AAC27150.1|AAC27150 T8F5.20 [Arabidopsis thaliana]
          Length = 121

 Score = 44.3 bits (103), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 7/80 (8%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP-------KVTFPLALL 141
           LL GFY A  ++   G  G  DV+ A + V   E +    Y +P       K+   +  +
Sbjct: 42  LLLGFYTANALATILGQTGDWDVLVAGIVVAAIEGIGMLMYKKPSSSMFSGKLQSFVVFM 101

Query: 142 NNFKMGFTYGLFIDAFKLAS 161
           N +K G   GLF+DAFKL S
Sbjct: 102 NFWKAGVCLGLFVDAFKLGS 121


>gi|443316319|ref|ZP_21045768.1| Protein of unknown function (DUF565) [Leptolyngbya sp. PCC 6406]
 gi|442784083|gb|ELR93974.1| Protein of unknown function (DUF565) [Leptolyngbya sp. PCC 6406]
          Length = 112

 Score = 44.3 bits (103), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 49/104 (47%), Gaps = 8/104 (7%)

Query: 66  LVPELSRNYFR---SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTE 122
           LV  L+  + R   +P RR     I LL G +    I+   G     DV+ +   +   E
Sbjct: 9   LVDTLANQFGRWVFNPWRRLSLVIIGLLFGNFFGIAIASIAGQTADLDVVVSAFLLAAVE 68

Query: 123 YVTRFYYSRPK-----VTFPLALLNNFKMGFTYGLFIDAFKLAS 161
             +   Y RP+      +  L +LN FK+G TY LF++AFKL S
Sbjct: 69  ITSWIVYRRPRPKEGQSSLLLEVLNAFKIGITYALFVEAFKLGS 112


>gi|330040530|ref|XP_003239945.1| hypothetical protein CPARA_3gp389 [Cryptomonas paramecium]
 gi|327206871|gb|AEA39047.1| hypothetical protein CPARA_3gp389 [Cryptomonas paramecium]
          Length = 137

 Score = 44.3 bits (103), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 40/72 (55%)

Query: 90  LGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFT 149
           L GF++A +     G++   D +AA + +   E+    +YS    +    L+NNFK+G  
Sbjct: 66  LFGFFLANSAITIVGSVADWDPLAAAIMLCWIEFFNCKFYSSDFNSIYFNLVNNFKIGII 125

Query: 150 YGLFIDAFKLAS 161
           +G+F+DA KL S
Sbjct: 126 FGIFVDALKLTS 137


>gi|332707378|ref|ZP_08427428.1| protein of unknown function, DUF565 [Moorea producens 3L]
 gi|332353869|gb|EGJ33359.1| protein of unknown function, DUF565 [Moorea producens 3L]
          Length = 116

 Score = 44.3 bits (103), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 34/99 (34%), Positives = 51/99 (51%), Gaps = 13/99 (13%)

Query: 74  YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSR-- 131
           +F +P RR     IS L GF V Q +  + G  G   +++ +L V  TE V R  YS   
Sbjct: 20  WFANPWRRLSMVLISFLFGFLVGQALCTTAGT-GQEWILSGMLIVF-TEAVNRLIYSGKF 77

Query: 132 PK---------VTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           PK         V++   ++N FK+G  Y ++++AFKL S
Sbjct: 78  PKGQLNQTVDRVSWLADVVNAFKIGLVYSMYLEAFKLGS 116


>gi|115478380|ref|NP_001062785.1| Os09g0287500 [Oryza sativa Japonica Group]
 gi|50251186|dbj|BAD29674.1| antigen receptor-like protein [Oryza sativa Japonica Group]
 gi|50252920|dbj|BAD29174.1| antigen receptor-like protein [Oryza sativa Japonica Group]
 gi|113631018|dbj|BAF24699.1| Os09g0287500 [Oryza sativa Japonica Group]
 gi|215765531|dbj|BAG87228.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641230|gb|EEE69362.1| hypothetical protein OsJ_28696 [Oryza sativa Japonica Group]
          Length = 196

 Score = 44.3 bits (103), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 48/103 (46%), Gaps = 11/103 (10%)

Query: 65  RLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYV 124
            L+ +L+    +SP++  L     +L GFY A  ++   G  G  DV  A + V   E +
Sbjct: 99  ELLEKLNSARNKSPTKIFL-----VLLGFYTANALATVLGQTGDWDVFVAAIVVATIEGI 153

Query: 125 TRFYYSRPKVTFP------LALLNNFKMGFTYGLFIDAFKLAS 161
               Y +P    P      + ++N +K G   G F+DAFK+ S
Sbjct: 154 GMLMYRKPASRPPGRFWSMITMVNYWKAGVCLGFFVDAFKVGS 196


>gi|428313684|ref|YP_007124661.1| hypothetical protein Mic7113_5625 [Microcoleus sp. PCC 7113]
 gi|428255296|gb|AFZ21255.1| Protein of unknown function (DUF565) [Microcoleus sp. PCC 7113]
          Length = 118

 Score = 43.9 bits (102), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 48/99 (48%), Gaps = 11/99 (11%)

Query: 74  YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPK 133
           +F +P R      +SLL G ++   I  + G     D++ A + +L TE ++   Y++  
Sbjct: 20  WFANPWRHLSLVLMSLLLGTFLGSAIPATAGQAAKLDLVGAGVLILFTEMISWIVYAKSP 79

Query: 134 VTFPLA-----------LLNNFKMGFTYGLFIDAFKLAS 161
              PL            +LN  K+G TY +F++AFKL S
Sbjct: 80  QGRPLTENSRGLSLLTQILNAMKIGMTYSMFVEAFKLGS 118


>gi|70671583|gb|AAZ06217.1| putative antigen receptor [Oryza sativa Indica Group]
 gi|70671616|gb|AAZ06247.1| putative antigen receptor [Oryza sativa Indica Group]
 gi|125563084|gb|EAZ08464.1| hypothetical protein OsI_30729 [Oryza sativa Indica Group]
          Length = 197

 Score = 43.9 bits (102), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 48/103 (46%), Gaps = 11/103 (10%)

Query: 65  RLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYV 124
            L+ +L+    +SP++  L     +L GFY A  ++   G  G  DV  A + V   E +
Sbjct: 100 ELLEKLNSARNKSPTKIFL-----VLLGFYTANALATVLGQTGDWDVFVAAIVVATIEGI 154

Query: 125 TRFYYSRPKVTFP------LALLNNFKMGFTYGLFIDAFKLAS 161
               Y +P    P      + ++N +K G   G F+DAFK+ S
Sbjct: 155 GMLMYRKPASRPPGRFWSMITMVNYWKAGVCLGFFVDAFKVGS 197


>gi|399949854|gb|AFP65511.1| hypothetical protein CMESO_344 [Chroomonas mesostigmatica CCMP1168]
          Length = 152

 Score = 43.9 bits (102), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 34/56 (60%)

Query: 104 GALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKL 159
           G++   D +AA + +   E  T+ +Y+  K T    L+N+FK+G   G+F+DAFK+
Sbjct: 95  GSVADWDPLAAAVMLCWIELFTKIFYATEKSTLYFRLINSFKIGINLGMFVDAFKI 150


>gi|318042681|ref|ZP_07974637.1| hypothetical protein SCB01_13289 [Synechococcus sp. CB0101]
          Length = 115

 Score = 43.5 bits (101), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 54/123 (43%), Gaps = 13/123 (10%)

Query: 40  SSSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTI 99
           SSS   +  R  Q  GG           P L   + R+P RR     I LL  F++   +
Sbjct: 2   SSSRPLQRTRLQQSLGGA---------WPRLE-QWARNPWRRLSVLLIVLLSTFFLGNAV 51

Query: 100 SLSFGALGVNDVIAAVLCVLLTEYVTRF---YYSRPKVTFPLALLNNFKMGFTYGLFIDA 156
           S   GA    D  AA++CV+L E   R       R      L LL+  ++GF YGL ++ 
Sbjct: 52  STIIGARAFLDPPAALICVVLLELAIRARKPLLRRGGDRLGLELLDMSRIGFCYGLLLEG 111

Query: 157 FKL 159
           FKL
Sbjct: 112 FKL 114


>gi|79315375|ref|NP_001030875.1| uncharacterized protein [Arabidopsis thaliana]
 gi|145332877|ref|NP_001078304.1| uncharacterized protein [Arabidopsis thaliana]
 gi|9663002|emb|CAC00746.1| putative protein [Arabidopsis thaliana]
 gi|45752730|gb|AAS76263.1| At3g56830 [Arabidopsis thaliana]
 gi|110740493|dbj|BAE98352.1| hypothetical protein [Arabidopsis thaliana]
 gi|332646050|gb|AEE79571.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332646051|gb|AEE79572.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 188

 Score = 43.5 bits (101), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 39/120 (32%), Positives = 55/120 (45%), Gaps = 18/120 (15%)

Query: 44  GFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSF 103
           G R I+A+Q     + LV I  +  +L +  F             LL GFY A   S   
Sbjct: 81  GLRLIKAIQVLRS-KLLVKIQEIKKDLPKKLF------------FLLVGFYSATAFSTFI 127

Query: 104 GALGVNDVIAAVLCVLLTEYVTRFYY--SRP---KVTFPLALLNNFKMGFTYGLFIDAFK 158
           G  G  DV++A L VL+ E +    Y  S P   K+   + + N +K G   GLF+D+FK
Sbjct: 128 GQTGDWDVLSAGLAVLVVECIGALMYRASIPLINKMRSTITMFNYWKTGLALGLFLDSFK 187


>gi|302763747|ref|XP_002965295.1| hypothetical protein SELMODRAFT_7627 [Selaginella moellendorffii]
 gi|300167528|gb|EFJ34133.1| hypothetical protein SELMODRAFT_7627 [Selaginella moellendorffii]
          Length = 105

 Score = 43.5 bits (101), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 4/88 (4%)

Query: 75  FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV 134
           FR   R        LL GFY    ++   G  G  DV+   + V + E +    Y  P V
Sbjct: 18  FRETRRNLPLKVFLLLLGFYSGNALATVLGQTGDWDVLVGAIVVAVIEGIGFLMYKMPPV 77

Query: 135 T----FPLALLNNFKMGFTYGLFIDAFK 158
           T    F + +LN +K+G ++GLF+D  K
Sbjct: 78  TEKLRFLVKMLNYWKVGLSFGLFVDGSK 105


>gi|42565978|ref|NP_191242.2| uncharacterized protein [Arabidopsis thaliana]
 gi|332646049|gb|AEE79570.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 230

 Score = 43.1 bits (100), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 39/120 (32%), Positives = 55/120 (45%), Gaps = 18/120 (15%)

Query: 44  GFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSF 103
           G R I+A+Q     + LV I  +  +L +  F             LL GFY A   S   
Sbjct: 81  GLRLIKAIQVLRS-KLLVKIQEIKKDLPKKLF------------FLLVGFYSATAFSTFI 127

Query: 104 GALGVNDVIAAVLCVLLTEYVTRFYY--SRP---KVTFPLALLNNFKMGFTYGLFIDAFK 158
           G  G  DV++A L VL+ E +    Y  S P   K+   + + N +K G   GLF+D+FK
Sbjct: 128 GQTGDWDVLSAGLAVLVVECIGALMYRASIPLINKMRSTITMFNYWKTGLALGLFLDSFK 187


>gi|302809861|ref|XP_002986623.1| hypothetical protein SELMODRAFT_48840 [Selaginella moellendorffii]
 gi|300145806|gb|EFJ12480.1| hypothetical protein SELMODRAFT_48840 [Selaginella moellendorffii]
          Length = 106

 Score = 42.0 bits (97), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 4/88 (4%)

Query: 75  FRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV 134
           FR   R        LL GFY    ++   G  G  DV+   + V + E +    Y  P V
Sbjct: 19  FREIRRNLPLKVFLLLLGFYSGNALATVLGQTGDWDVLVGAIIVAVIEGIGFLMYKIPPV 78

Query: 135 T----FPLALLNNFKMGFTYGLFIDAFK 158
           T    F + +LN +K+G ++GLF+D  K
Sbjct: 79  TEKLRFLVKMLNYWKVGLSFGLFVDGSK 106


>gi|116075669|ref|ZP_01472928.1| hypothetical protein RS9916_39426 [Synechococcus sp. RS9916]
 gi|116066984|gb|EAU72739.1| hypothetical protein RS9916_39426 [Synechococcus sp. RS9916]
          Length = 113

 Score = 42.0 bits (97), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 43/89 (48%), Gaps = 3/89 (3%)

Query: 74  YFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPK 133
           +  +P R+A    I +L  F V  ++    G  G+ D +AA++ V   E + R     PK
Sbjct: 24  WSTNPWRKASLLLIVMLSSFVVGSSVGAIAGVSGLMDPVAALITVAAWETMVRLRRPWPK 83

Query: 134 ---VTFPLALLNNFKMGFTYGLFIDAFKL 159
                  L +L+  ++G  YGL ++ FKL
Sbjct: 84  RADTLLGLQMLDMTRIGLIYGLLLEGFKL 112


>gi|224063751|ref|XP_002301276.1| predicted protein [Populus trichocarpa]
 gi|222843002|gb|EEE80549.1| predicted protein [Populus trichocarpa]
          Length = 152

 Score = 41.6 bits (96), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 56/121 (46%), Gaps = 27/121 (22%)

Query: 46  RSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGA 105
           R+I+A Q   G R        + E+ +N    P +   F     L GFY A   +   G 
Sbjct: 50  RAIQAFQTRLGVR--------IREIRKNL---PMKLLFF-----LVGFYCATAFATVIGQ 93

Query: 106 LGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP--------LALLNNFKMGFTYGLFIDAF 157
            G  D+++A L V++ E +    Y   + +FP        +++ N +K G T GLF+D+F
Sbjct: 94  TGDWDILSAALAVVVVEGIGALMY---RASFPFVKSIRSIVSVFNYWKAGLTLGLFLDSF 150

Query: 158 K 158
           K
Sbjct: 151 K 151


>gi|388511147|gb|AFK43635.1| unknown [Medicago truncatula]
          Length = 201

 Score = 41.6 bits (96), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 39/79 (49%), Gaps = 6/79 (7%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLN 142
           LL GFY A  ++   G  G  DV+ A + V   E +    Y +P  T        L ++N
Sbjct: 123 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLIYKKPPTTRTGRLQSFLVMVN 182

Query: 143 NFKMGFTYGLFIDAFKLAS 161
            +K G   GLF+DAFKL S
Sbjct: 183 YWKAGICLGLFVDAFKLGS 201


>gi|255553945|ref|XP_002518013.1| conserved hypothetical protein [Ricinus communis]
 gi|223542995|gb|EEF44531.1| conserved hypothetical protein [Ricinus communis]
          Length = 238

 Score = 41.6 bits (96), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 42/147 (28%), Positives = 65/147 (44%), Gaps = 27/147 (18%)

Query: 23  LSPQTIASLGRRGISFSSSS---AGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPS 79
           ++ Q  +S    G+  SSS     G R +RA+Q          IV  + E+ +N    P 
Sbjct: 83  ITWQIRSSADSSGLDPSSSGRTRGGTRLLRAIQA-----LQTKIVARIQEIRKNL---PI 134

Query: 80  RRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPL- 138
           +   F     L GFY A   +   G  G  D+++A L V + E +    Y   + +FPL 
Sbjct: 135 KLLFF-----LVGFYCATAFATVIGQTGDWDILSAALAVAVVEGIGALMY---RASFPLF 186

Query: 139 -------ALLNNFKMGFTYGLFIDAFK 158
                   + N +K G + GLF+D+FK
Sbjct: 187 NNMRSLITMFNYWKAGLSLGLFLDSFK 213


>gi|357476287|ref|XP_003608429.1| Ycf20-like protein [Medicago truncatula]
 gi|355509484|gb|AES90626.1| Ycf20-like protein [Medicago truncatula]
          Length = 209

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 39/79 (49%), Gaps = 6/79 (7%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLN 142
           LL GFY A  ++   G  G  DV+ A + V   E +    Y +P  T        L ++N
Sbjct: 131 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLIYKKPPTTRTGRLQSFLVMVN 190

Query: 143 NFKMGFTYGLFIDAFKLAS 161
            +K G   GLF+DAFKL S
Sbjct: 191 YWKAGICLGLFVDAFKLGS 209


>gi|283794933|ref|YP_003359286.1| hypothetical chloroplast RF20 [Cryptomonas paramecium]
 gi|253981905|gb|ACT46822.1| hypothetical chloroplast RF20 [Cryptomonas paramecium]
          Length = 105

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 50/105 (47%), Gaps = 13/105 (12%)

Query: 70  LSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT-RFY 128
           + +  F     + L   I +L G ++A T+S + G  G   ++ + + V   E ++ R Y
Sbjct: 1   MRKQQFAHQLIQLLINFILILFGCFIATTLSTTLGQTGDWGMLISGIIVTFLEIISKRTY 60

Query: 129 ------------YSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
                       Y   K+ + + LLN+ K+G  YG FI+AFKL S
Sbjct: 61  TIKKLIMILPKNYQSHKIKYFVVLLNDIKIGVLYGFFIEAFKLGS 105


>gi|222622484|gb|EEE56616.1| hypothetical protein OsJ_05998 [Oryza sativa Japonica Group]
          Length = 191

 Score = 41.2 bits (95), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 6/76 (7%)

Query: 92  GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLNNFK 145
           GFY A  ++   G  G  DV+ A + V   E +    Y +P    P      +A++N +K
Sbjct: 116 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPMSRPPGRFQSLIAMVNYWK 175

Query: 146 MGFTYGLFIDAFKLAS 161
            G   GLF+DAFKL S
Sbjct: 176 AGVCLGLFVDAFKLGS 191


>gi|218190363|gb|EEC72790.1| hypothetical protein OsI_06472 [Oryza sativa Indica Group]
          Length = 191

 Score = 40.8 bits (94), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 6/76 (7%)

Query: 92  GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLNNFK 145
           GFY A  ++   G  G  DV+ A + V   E +    Y +P    P      +A++N +K
Sbjct: 116 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPMSRPPGRFQSLIAMVNYWK 175

Query: 146 MGFTYGLFIDAFKLAS 161
            G   GLF+DAFKL S
Sbjct: 176 AGVCLGLFVDAFKLGS 191


>gi|356496360|ref|XP_003517036.1| PREDICTED: uncharacterized protein LOC100306594 [Glycine max]
          Length = 197

 Score = 40.8 bits (94), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 5/74 (6%)

Query: 90  LGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY--SRPKVTFP---LALLNNF 144
           L GFY A   +   G  G  D+++A L V + E +    Y  S P V+     ++L N +
Sbjct: 123 LVGFYCATAFATVIGQTGDWDILSAALAVAVVEGIGALMYRASLPLVSKSRSLISLFNYW 182

Query: 145 KMGFTYGLFIDAFK 158
           K G T GLF+D+FK
Sbjct: 183 KAGLTLGLFLDSFK 196


>gi|296089191|emb|CBI38894.3| unnamed protein product [Vitis vinifera]
          Length = 243

 Score = 40.8 bits (94), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 47/169 (27%), Positives = 70/169 (41%), Gaps = 28/169 (16%)

Query: 10  VGIVSPTILFQFGLSPQTIASLG-----RRGISFSSSSAGFRSIRAVQENG--GPRRLVD 62
           VGIV+      FGL    +A L      +R +    SS    S+     NG  G  RLV 
Sbjct: 59  VGIVTCLHKTSFGLCFFRVAQLSLVNTSKRMVWSVRSSVDGNSLDPTPTNGTDGRTRLVR 118

Query: 63  IVRLVP--------ELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAA 114
           +++ +         EL ++    P +   F     L GFY A   +   G  G  D+++A
Sbjct: 119 VIQGIQIKLSTRILELRKDL---PMKILFF-----LVGFYCATAFATVIGQTGDWDILSA 170

Query: 115 VLCVLLTEYVTRFYYSR-----PKVTFPLALLNNFKMGFTYGLFIDAFK 158
            L V + E +    Y        KV   + + N +K G   GLF+D+FK
Sbjct: 171 ALAVFVVELIGALMYRASIHLFDKVRRLITMFNYWKAGLCLGLFLDSFK 219


>gi|357118772|ref|XP_003561123.1| PREDICTED: ycf20-like protein-like [Brachypodium distachyon]
          Length = 219

 Score = 40.4 bits (93), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 5/72 (6%)

Query: 92  GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP-----KVTFPLALLNNFKM 146
           GFY A   +   G  G  D+++A L V + E++    Y        ++   + + N +K 
Sbjct: 124 GFYCATAFATVIGQTGDWDILSAGLAVAIVEFIGALMYRASFAFFGRIKNMITIFNYWKA 183

Query: 147 GFTYGLFIDAFK 158
           G T GLF+D+FK
Sbjct: 184 GLTLGLFLDSFK 195


>gi|225453999|ref|XP_002274644.1| PREDICTED: ycf20-like protein-like [Vitis vinifera]
          Length = 217

 Score = 40.4 bits (93), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 47/169 (27%), Positives = 70/169 (41%), Gaps = 28/169 (16%)

Query: 10  VGIVSPTILFQFGLSPQTIASLG-----RRGISFSSSSAGFRSIRAVQENG--GPRRLVD 62
           VGIV+      FGL    +A L      +R +    SS    S+     NG  G  RLV 
Sbjct: 33  VGIVTCLHKTSFGLCFFRVAQLSLVNTSKRMVWSVRSSVDGNSLDPTPTNGTDGRTRLVR 92

Query: 63  IVRLVP--------ELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAA 114
           +++ +         EL ++    P +   F     L GFY A   +   G  G  D+++A
Sbjct: 93  VIQGIQIKLSTRILELRKDL---PMKILFF-----LVGFYCATAFATVIGQTGDWDILSA 144

Query: 115 VLCVLLTEYVTRFYYSR-----PKVTFPLALLNNFKMGFTYGLFIDAFK 158
            L V + E +    Y        KV   + + N +K G   GLF+D+FK
Sbjct: 145 ALAVFVVELIGALMYRASIHLFDKVRRLITMFNYWKAGLCLGLFLDSFK 193


>gi|50251788|dbj|BAD27720.1| antigen receptor-like [Oryza sativa Japonica Group]
          Length = 217

 Score = 40.4 bits (93), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 6/76 (7%)

Query: 92  GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLNNFK 145
           GFY A  ++   G  G  DV+ A + V   E +    Y +P    P      +A++N +K
Sbjct: 142 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPMSRPPGRFQSLIAMVNYWK 201

Query: 146 MGFTYGLFIDAFKLAS 161
            G   GLF+DAFKL S
Sbjct: 202 AGVCLGLFVDAFKLGS 217


>gi|357157876|ref|XP_003577943.1| PREDICTED: ycf20-like protein-like [Brachypodium distachyon]
          Length = 144

 Score = 40.0 bits (92), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 6/76 (7%)

Query: 92  GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLNNFK 145
           GFY A  ++   G  G  DV+ A + V   E +    Y +P    P      +A++N +K
Sbjct: 69  GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPVSRPPGRFQSLIAMVNYWK 128

Query: 146 MGFTYGLFIDAFKLAS 161
            G   GLF+DAFKL S
Sbjct: 129 AGVCLGLFVDAFKLGS 144


>gi|297820458|ref|XP_002878112.1| hypothetical protein ARALYDRAFT_486123 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323950|gb|EFH54371.1| hypothetical protein ARALYDRAFT_486123 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 188

 Score = 40.0 bits (92), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 39/75 (52%), Gaps = 5/75 (6%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY--SRP---KVTFPLALLNN 143
           LL GFY A   S   G  G  DV++A + VL+ E +    Y  S P   K+   + + N 
Sbjct: 113 LLVGFYSATAFSTFIGQTGDWDVLSAGVAVLVVECIGALMYRASIPLINKMRGTITMFNY 172

Query: 144 FKMGFTYGLFIDAFK 158
           +K G   GLF+D+FK
Sbjct: 173 WKTGLALGLFLDSFK 187


>gi|212275874|ref|NP_001131010.1| uncharacterized protein LOC100192115 [Zea mays]
 gi|194690700|gb|ACF79434.1| unknown [Zea mays]
 gi|194695206|gb|ACF81687.1| unknown [Zea mays]
 gi|195654383|gb|ACG46659.1| hypothetical protein [Zea mays]
          Length = 232

 Score = 40.0 bits (92), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 6/76 (7%)

Query: 92  GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLNNFK 145
           GFY A  ++   G  G  DV+ A + V   E +    Y +P    P      ++++N +K
Sbjct: 157 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPIARPPGRFQSLISMVNYWK 216

Query: 146 MGFTYGLFIDAFKLAS 161
            G   GLF+DAFKL S
Sbjct: 217 AGVCLGLFVDAFKLGS 232


>gi|449526593|ref|XP_004170298.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
          Length = 169

 Score = 39.7 bits (91), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 6/79 (7%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP------KVTFPLALLN 142
           LL GFY A  ++   G  G  DV+ A + V   E++    Y +P      ++   + ++N
Sbjct: 91  LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEWIGILMYRKPPPLLTRRLKSFIVMVN 150

Query: 143 NFKMGFTYGLFIDAFKLAS 161
            +K G   GLF+DAFKL S
Sbjct: 151 YWKAGVCLGLFVDAFKLGS 169


>gi|242048754|ref|XP_002462123.1| hypothetical protein SORBIDRAFT_02g019530 [Sorghum bicolor]
 gi|241925500|gb|EER98644.1| hypothetical protein SORBIDRAFT_02g019530 [Sorghum bicolor]
          Length = 201

 Score = 39.7 bits (91), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 6/76 (7%)

Query: 92  GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLNNFK 145
           GFY A  ++   G  G  DV+ A + V   E +    Y +P    P      ++++N +K
Sbjct: 126 GFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPITRPPGRFQSLISMVNYWK 185

Query: 146 MGFTYGLFIDAFKLAS 161
            G   GLF+DAFKL S
Sbjct: 186 AGVCLGLFVDAFKLGS 201


>gi|356506255|ref|XP_003521902.1| PREDICTED: ycf20-like protein-like [Glycine max]
          Length = 197

 Score = 39.7 bits (91), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 38/77 (49%), Gaps = 11/77 (14%)

Query: 90  LGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPL--------ALL 141
           L GFY A   +   G  G  D+++A L V + E +    Y   + + PL        +L 
Sbjct: 123 LVGFYCATAFATVIGQTGDWDILSAALAVAVVEGIGALMY---RASLPLVRKSRSLISLF 179

Query: 142 NNFKMGFTYGLFIDAFK 158
           N +K G T GLF+D+FK
Sbjct: 180 NYWKAGLTLGLFLDSFK 196


>gi|449442657|ref|XP_004139097.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
          Length = 212

 Score = 39.7 bits (91), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 6/79 (7%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP------KVTFPLALLN 142
           LL GFY A  ++   G  G  DV+ A + V   E++    Y +P      ++   + ++N
Sbjct: 134 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEWIGILMYRKPPPLLTRRLKSFIVMVN 193

Query: 143 NFKMGFTYGLFIDAFKLAS 161
            +K G   GLF+DAFKL S
Sbjct: 194 YWKAGVCLGLFVDAFKLGS 212


>gi|255547482|ref|XP_002514798.1| conserved hypothetical protein [Ricinus communis]
 gi|223545849|gb|EEF47352.1| conserved hypothetical protein [Ricinus communis]
          Length = 201

 Score = 39.7 bits (91), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 40/81 (49%), Gaps = 10/81 (12%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP--------LAL 140
           LL GFY A  ++   G  G  DV+ A + V   E +    Y +P  T P        + +
Sbjct: 123 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYKKP--TSPSTGRLQSFVVM 180

Query: 141 LNNFKMGFTYGLFIDAFKLAS 161
           +N +K G   GLF+DAFKL S
Sbjct: 181 MNYWKAGVCLGLFVDAFKLGS 201


>gi|126695592|ref|YP_001090478.1| hypothetical protein P9301_02541 [Prochlorococcus marinus str. MIT
           9301]
 gi|126542635|gb|ABO16877.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9301]
          Length = 113

 Score = 39.7 bits (91), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 77  SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT---RFYYSRPK 133
           +P RR     I+LL G+++  ++ +    + + D +AA L V+  E +    R + S  K
Sbjct: 27  NPWRRYSISLITLLIGYFIGSSLGMVSAVVELMDPVAAFLSVVFIEILITLRRNFRSERK 86

Query: 134 VTFPLALLNNFKMGFTYGLFIDAFKL 159
             F +  L++ ++G  YG F ++ KL
Sbjct: 87  KKFLILFLDSLRLGLFYGFFTESLKL 112


>gi|148239449|ref|YP_001224836.1| hypothetical protein SynWH7803_1113 [Synechococcus sp. WH 7803]
 gi|147847988|emb|CAK23539.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
          Length = 112

 Score = 39.3 bits (90), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 43/87 (49%), Gaps = 3/87 (3%)

Query: 78  PSRRALFGGISLLGGFYVAQTISL-SFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTF 136
           P RR   G +SLL GF V   +++  +   G +  IA +  VL+ E + R         +
Sbjct: 26  PWRRRSVGLLSLLFGFIVGSNVTMYWYQKSGQDRPIAVLTMVLILEVLVRLRSRVRTDPW 85

Query: 137 PLAL--LNNFKMGFTYGLFIDAFKLAS 161
           PL    L+N ++G  Y +  +AFKL S
Sbjct: 86  PLGWLALDNLRIGTVYAVVFEAFKLGS 112


>gi|359807488|ref|NP_001241142.1| uncharacterized protein LOC100814758 [Glycine max]
 gi|255639749|gb|ACU20168.1| unknown [Glycine max]
          Length = 207

 Score = 39.3 bits (90), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 40/79 (50%), Gaps = 6/79 (7%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP------KVTFPLALLN 142
           LL GFY A  ++   G  G  DV+ A + V   E +    Y +P      ++   L L+N
Sbjct: 129 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLIYRKPPTEKTGRLQSFLVLVN 188

Query: 143 NFKMGFTYGLFIDAFKLAS 161
            +K G   GLF+DAFKL S
Sbjct: 189 YWKAGICLGLFVDAFKLGS 207


>gi|33239713|ref|NP_874655.1| hypothetical protein Pro0261 [Prochlorococcus marinus subsp.
           marinus str. CCMP1375]
 gi|33237238|gb|AAP99307.1| Uncharacterized secreted or membrane protein [Prochlorococcus
           marinus subsp. marinus str. CCMP1375]
          Length = 111

 Score = 39.3 bits (90), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 52/111 (46%), Gaps = 7/111 (6%)

Query: 52  QENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDV 111
           Q     R +++++  +   S N    P RR     I  L  F    +I +  G L + D 
Sbjct: 4   QSTSLNRNIINLIMRLEVWSDN----PWRRYSMLIIIFLSAFLFGSSIGMINGVLALMDP 59

Query: 112 IAAVLCVLLTEYVTRF---YYSRPKVTFPLALLNNFKMGFTYGLFIDAFKL 159
           I A   V L E + RF      +   +  L++L++F+MGF YGLF + FKL
Sbjct: 60  IGAFFTVTLIEVLVRFRKVNVQKKGSSISLSILDSFRMGFIYGLFTEGFKL 110


>gi|159902795|ref|YP_001550139.1| hypothetical protein P9211_02541 [Prochlorococcus marinus str. MIT
           9211]
 gi|159887971|gb|ABX08185.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9211]
          Length = 113

 Score = 39.3 bits (90), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 43/86 (50%), Gaps = 3/86 (3%)

Query: 77  SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRF---YYSRPK 133
           +P RR     I   G F    ++ +    L + D+I A   VLL E++ R    +  +  
Sbjct: 27  NPWRRYSLLLIIFFGAFVFGSSVGMINAVLELMDLIGAFFTVLLLEFMVRLRQAWNRKNS 86

Query: 134 VTFPLALLNNFKMGFTYGLFIDAFKL 159
            +  L ++++ ++G  YGLF++ FKL
Sbjct: 87  SSILLQIIDSARIGLLYGLFMEGFKL 112


>gi|449498939|ref|XP_004160676.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
          Length = 221

 Score = 39.3 bits (90), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 33/140 (23%), Positives = 58/140 (41%), Gaps = 5/140 (3%)

Query: 24  SPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRAL 83
            P  + +  +  +S   S  G         + G  RL+ I++ +            +   
Sbjct: 57  KPSVLKNFKKMALSIRGSVEGNPLDPPSSNSNGRTRLIRIIQAIQTKLNARINELRKNLP 116

Query: 84  FGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY--SRP---KVTFPL 138
              +  L GFY A   +   G  G  D+++A L V++ E +    Y  S P   K+   +
Sbjct: 117 LKILFFLVGFYSATAFATVIGQTGDWDILSAALAVVVVEGIGALMYRASLPLLNKIKDLI 176

Query: 139 ALLNNFKMGFTYGLFIDAFK 158
            + N +K G + GLF+D+FK
Sbjct: 177 TMFNYWKAGLSMGLFLDSFK 196


>gi|449454528|ref|XP_004145006.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
 gi|449474824|ref|XP_004154295.1| PREDICTED: ycf20-like protein-like [Cucumis sativus]
          Length = 221

 Score = 38.9 bits (89), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 59/129 (45%), Gaps = 25/129 (19%)

Query: 39  SSSSAG----FRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFY 94
           SS+S G     R I+A+Q     R        + EL +N    P +   F     L GFY
Sbjct: 84  SSNSNGRTRLIRIIQAIQTKLNAR--------INELRKNL---PLKILFF-----LVGFY 127

Query: 95  VAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYY--SRP---KVTFPLALLNNFKMGFT 149
            A   +   G  G  D+++A L V++ E +    Y  S P   K+   + + N +K G +
Sbjct: 128 SATAFATVIGQTGDWDILSAALAVVVVEGIGALMYRASLPLLNKIKDLITMFNYWKAGLS 187

Query: 150 YGLFIDAFK 158
            GLF+D+FK
Sbjct: 188 MGLFLDSFK 196


>gi|148243301|ref|YP_001228458.1| hypothetical protein SynRCC307_2202 [Synechococcus sp. RCC307]
 gi|147851611|emb|CAK29105.1| Uncharacterized conserved membrane protein [Synechococcus sp.
           RCC307]
          Length = 110

 Score = 38.9 bits (89), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 39/89 (43%), Gaps = 3/89 (3%)

Query: 73  NYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT---RFYY 129
            +  +P RR     + LL GF    +I    G L V D + A++ V  +E      R   
Sbjct: 19  QWAENPWRRFSLLVLVLLAGFSAGVSIGSVAGVLMVIDPVGALVVVAASELAIKARRPLL 78

Query: 130 SRPKVTFPLALLNNFKMGFTYGLFIDAFK 158
            R     PL LL+  +MG  YGL ++  K
Sbjct: 79  KRGGDKLPLGLLDMARMGIVYGLLLEGIK 107


>gi|242037377|ref|XP_002466083.1| hypothetical protein SORBIDRAFT_01g000870 [Sorghum bicolor]
 gi|241919937|gb|EER93081.1| hypothetical protein SORBIDRAFT_01g000870 [Sorghum bicolor]
          Length = 204

 Score = 38.5 bits (88), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 11/75 (14%)

Query: 92  GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPL--------ALLNN 143
           GFY A   +   G  G  D+++A L V + E +    Y   + +F L         + N 
Sbjct: 109 GFYCATAFATVIGQTGDWDILSAGLAVAVVEVIGALMY---RASFALLGRIRNMITIFNY 165

Query: 144 FKMGFTYGLFIDAFK 158
           +K G T GLF+D+FK
Sbjct: 166 WKAGLTLGLFLDSFK 180


>gi|72382195|ref|YP_291550.1| hypothetical protein PMN2A_0355 [Prochlorococcus marinus str.
           NATL2A]
 gi|72002045|gb|AAZ57847.1| conserved hypothetical protein [Prochlorococcus marinus str.
           NATL2A]
          Length = 106

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 32/85 (37%), Positives = 50/85 (58%), Gaps = 2/85 (2%)

Query: 77  SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTF 136
           S S+R+L   +SLL GFY   +         VN +  A++ +L+ E V R Y   PK+T 
Sbjct: 24  SWSKRSLLV-LSLLFGFYFTNSFLSFLLDKSVNTIFLAIIILLIMELVIRSYLI-PKLTI 81

Query: 137 PLALLNNFKMGFTYGLFIDAFKLAS 161
           P+ ++NN ++G TY L ++A+KL S
Sbjct: 82  PVMIINNIRIGSTYALILEAYKLGS 106


>gi|413932398|gb|AFW66949.1| hypothetical protein ZEAMMB73_929985 [Zea mays]
 gi|413932399|gb|AFW66950.1| hypothetical protein ZEAMMB73_929985 [Zea mays]
 gi|413932400|gb|AFW66951.1| hypothetical protein ZEAMMB73_929985 [Zea mays]
          Length = 212

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 35/72 (48%), Gaps = 5/72 (6%)

Query: 92  GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP-----KVTFPLALLNNFKM 146
           GFY A   +   G  G  D+++A L V + E +    Y        ++   + + N +K 
Sbjct: 116 GFYCATAFATVIGQTGDWDILSAGLAVAIVEVIGALMYRASFAFLGRIGNMITVFNYWKA 175

Query: 147 GFTYGLFIDAFK 158
           G T GLF+D+FK
Sbjct: 176 GLTLGLFLDSFK 187


>gi|33240432|ref|NP_875374.1| hypothetical protein Pro0982 [Prochlorococcus marinus subsp.
           marinus str. CCMP1375]
 gi|33237959|gb|AAQ00027.1| Uncharacterized secreted or membrane protein [Prochlorococcus
           marinus subsp. marinus str. CCMP1375]
          Length = 111

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 4/87 (4%)

Query: 78  PSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRF---YYSRPKV 134
           P +    G ISLL G+Y+A  ++  F  L     +  +  V+L E   R      +  K 
Sbjct: 26  PWKVRSIGLISLLVGYYLASILTTYFLVLFTQRSLVVIPLVILVEISVRLRRALLNNQKE 85

Query: 135 TFPLALLNNFKMGFTYGLFIDAFKLAS 161
           ++ L +L+N ++G TY + ++AFKL S
Sbjct: 86  SY-LLILDNIRIGITYAVVLEAFKLGS 111


>gi|195615956|gb|ACG29808.1| hypothetical protein [Zea mays]
          Length = 231

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 37/75 (49%), Gaps = 6/75 (8%)

Query: 93  FYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFP------LALLNNFKM 146
           FY A  ++   G  G  DV+ A + V   E +    Y +P    P      ++++N +K 
Sbjct: 157 FYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLMYRKPIARPPGRFQSLISMVNYWKA 216

Query: 147 GFTYGLFIDAFKLAS 161
           G   GLF+DAFKL S
Sbjct: 217 GVCLGLFVDAFKLGS 231


>gi|255070727|ref|XP_002507445.1| predicted protein [Micromonas sp. RCC299]
 gi|226522720|gb|ACO68703.1| predicted protein [Micromonas sp. RCC299]
          Length = 152

 Score = 37.7 bits (86), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 33/63 (52%), Gaps = 12/63 (19%)

Query: 110 DVIAAVLCVLLTEYVTRFYYS--RPKVTF---------PLALLNNFKMGFTYGLFIDAFK 158
           D    +  V L E+++   YS  RP+            PL +LNNFK+G  YGLF DAFK
Sbjct: 91  DAPICLTIVFLAEFLSSVLYSPKRPRGILKFLKLNDLVPL-VLNNFKLGTLYGLFCDAFK 149

Query: 159 LAS 161
           + S
Sbjct: 150 VGS 152


>gi|317968664|ref|ZP_07970054.1| hypothetical protein SCB02_03913 [Synechococcus sp. CB0205]
          Length = 80

 Score = 37.7 bits (86), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 38/76 (50%), Gaps = 3/76 (3%)

Query: 87  ISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVT---FPLALLNN 143
           I LL  F++   I+   GA    D  AA+LCVL  E   R      + T     L LL+ 
Sbjct: 4   IVLLSTFFLGNAIATLTGARSFLDPPAALLCVLAIEAAIRLRGPLLRNTSNRLGLQLLDM 63

Query: 144 FKMGFTYGLFIDAFKL 159
            ++GF YGL ++ FKL
Sbjct: 64  SRVGFCYGLLLEGFKL 79


>gi|115456649|ref|NP_001051925.1| Os03g0852600 [Oryza sativa Japonica Group]
 gi|113550396|dbj|BAF13839.1| Os03g0852600, partial [Oryza sativa Japonica Group]
          Length = 206

 Score = 37.4 bits (85), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 35/72 (48%), Gaps = 5/72 (6%)

Query: 92  GFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP-----KVTFPLALLNNFKM 146
           GFY A   +   G  G  D+++A L V + E +    Y        ++   + + N +K 
Sbjct: 112 GFYCATAFATVIGQTGDWDILSAGLAVAIVEGIGALMYRASFAFLGRIRKMITIFNYWKA 171

Query: 147 GFTYGLFIDAFK 158
           G T GLF+D+FK
Sbjct: 172 GLTLGLFLDSFK 183


>gi|88808713|ref|ZP_01124223.1| hypothetical protein WH7805_03447 [Synechococcus sp. WH 7805]
 gi|88787701|gb|EAR18858.1| hypothetical protein WH7805_03447 [Synechococcus sp. WH 7805]
          Length = 112

 Score = 36.6 bits (83), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 80  RRALFGGISLLGGFYVAQTISL-SFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV-TFP 137
           RR   G +SLL GF V   +++  F   G     A ++ VL+ E   R   SR ++  +P
Sbjct: 28  RRRSVGLLSLLFGFIVGSNVTMYWFQKSGQQRPFAVLMVVLVLEMQVRLR-SRVRIDQWP 86

Query: 138 LALL--NNFKMGFTYGLFIDAFKLAS 161
           L+ L  +N ++G  Y +  +AFKL S
Sbjct: 87  LSWLALDNLRIGTVYAVVFEAFKLGS 112


>gi|124112127|ref|YP_001019166.1| hypothetical chloroplast RF20 [Chlorokybus atmophyticus]
 gi|124012234|gb|ABM87970.1| hypothetical chloroplast RF20 [Chlorokybus atmophyticus]
          Length = 127

 Score = 36.6 bits (83), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 40/92 (43%), Gaps = 18/92 (19%)

Query: 88  SLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPL--------- 138
            LL GF++A   +   G     +V+ A   V+L E    F Y +   TF L         
Sbjct: 36  CLLMGFFMATFFATILGFPKNWNVLNATFLVILIEVTGFFLYGKTSATFNLYTNKNLHSN 95

Query: 139 ---------ALLNNFKMGFTYGLFIDAFKLAS 161
                    A +N  K+G  +GLF+DAFKL S
Sbjct: 96  KKKNINLWVANINYLKIGLIFGLFVDAFKLGS 127


>gi|308802359|ref|XP_003078493.1| unnamed protein product [Ostreococcus tauri]
 gi|116056945|emb|CAL53234.1| unnamed protein product [Ostreococcus tauri]
          Length = 130

 Score = 36.6 bits (83), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 40/85 (47%), Gaps = 13/85 (15%)

Query: 87  ISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYS--------RPKVTFPL 138
           ISLL GF +   I          D   +   +LL+E+V+   Y         R     PL
Sbjct: 49  ISLLVGFSMCALIP---HPESPQDAFISFTIILLSEFVSSVLYGVDNQGGLLRWLCEGPL 105

Query: 139 A--LLNNFKMGFTYGLFIDAFKLAS 161
           A  ++N FK+G  YGLF DAFK+ S
Sbjct: 106 APMIVNCFKIGVLYGLFCDAFKVGS 130


>gi|327283939|ref|XP_003226697.1| PREDICTED: cell death activator CIDE-B-like [Anolis carolinensis]
          Length = 221

 Score = 35.8 bits (81), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 51/117 (43%), Gaps = 7/117 (5%)

Query: 2   IEALMARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLV 61
           I +L+    G +  T  F   L   T   +  +G  +S S  G RS     E   PR   
Sbjct: 72  IMSLVLEEDGTLLETEEFFEALEDNTTVMVLEKGQKWSHSKGGGRSYALSGEK--PRNSR 129

Query: 62  DIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           DI R+  ++   Y  +P  R LFG +++   FY   ++S  F  LG   V+  +L V
Sbjct: 130 DIARITFDI---YKLNP--RDLFGSLNISATFYGLYSMSCDFKCLGPKKVLREILRV 181


>gi|124025743|ref|YP_001014859.1| hypothetical protein NATL1_10361 [Prochlorococcus marinus str.
           NATL1A]
 gi|123960811|gb|ABM75594.1| conserved hypothetical protein [Prochlorococcus marinus str.
           NATL1A]
          Length = 106

 Score = 35.8 bits (81), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 14/33 (42%), Positives = 24/33 (72%)

Query: 129 YSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           Y  P++T P+ ++NN ++G TY L ++A+KL S
Sbjct: 74  YLIPRLTIPVMIINNIRIGSTYALILEAYKLGS 106


>gi|465500|sp|P34813.1|YCF20_AGLNE RecName: Full=Uncharacterized protein ycf20
 gi|304023|gb|AAA18512.1| unknown [Aglaothamnion neglectum]
          Length = 100

 Score = 35.0 bits (79), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 87  ISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKV--TFPLALLNNF 144
           ISL  G++    I       G   +I A+L V + E ++   YS  K      +  +N  
Sbjct: 24  ISLFLGYFFCTIICSIPKETGDWGLITALLIVGINETMSALIYSYKKYENNIIVKTINGI 83

Query: 145 KMGFTYGLFIDAFKLAS 161
           K+G  YGLF+D+FKL S
Sbjct: 84  KIGIIYGLFVDSFKLGS 100


>gi|359495755|ref|XP_002267846.2| PREDICTED: ycf20-like protein [Vitis vinifera]
 gi|298205056|emb|CBI38352.3| unnamed protein product [Vitis vinifera]
          Length = 225

 Score = 35.0 bits (79), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 6/79 (7%)

Query: 89  LLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRP------KVTFPLALLN 142
           LL GFY A  ++   G  G  DV+ A + V   E +    Y +       ++   + ++N
Sbjct: 147 LLLGFYTANALATILGQTGDWDVLVAGVVVAAIEGIGMLVYKKSSSQSTGRLQSLVVMIN 206

Query: 143 NFKMGFTYGLFIDAFKLAS 161
            +K G   GLF+DAFKL S
Sbjct: 207 YWKAGVCLGLFVDAFKLGS 225


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.328    0.142    0.408 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,331,713,557
Number of Sequences: 23463169
Number of extensions: 89851568
Number of successful extensions: 280003
Number of sequences better than 100.0: 200
Number of HSP's better than 100.0 without gapping: 90
Number of HSP's successfully gapped in prelim test: 110
Number of HSP's that attempted gapping in prelim test: 279772
Number of HSP's gapped (non-prelim): 204
length of query: 161
length of database: 8,064,228,071
effective HSP length: 123
effective length of query: 38
effective length of database: 9,473,225,580
effective search space: 359982572040
effective search space used: 359982572040
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 71 (32.0 bits)