BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 029353
         (194 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225460141|ref|XP_002275923.1| PREDICTED: uncharacterized protein LOC100243104 isoform 1 [Vitis
           vinifera]
 gi|359495234|ref|XP_003634940.1| PREDICTED: uncharacterized protein LOC100243104 isoform 2 [Vitis
           vinifera]
 gi|297741015|emb|CBI31327.3| unnamed protein product [Vitis vinifera]
          Length = 226

 Score =  209 bits (531), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 113/197 (57%), Positives = 140/197 (71%), Gaps = 18/197 (9%)

Query: 5   SSISSSSLRTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRISSSENNR 64
           S +++S+    PS L P L         ++I++HF     T++R    + LR+SSS  N 
Sbjct: 2   SLLTNSTTLLRPSKLSPPL----LHPHTASIATHF-----TQRRSRRCNLLRVSSSPQNP 52

Query: 65  TAVDVASPLPTTSNVDGD-DGG-------GAVVVRRFYAGINGRDLASVEELIADDCVYE 116
           TA  +A P+ TT + + + DGG       G+ VVR FYAGIN  DL SVEELIA +CVYE
Sbjct: 53  TAT-IAEPVTTTQSDEANGDGGDESILLSGSSVVRAFYAGINSHDLDSVEELIAHNCVYE 111

Query: 117 DLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFS 176
           DLIFP+PF+GRKA +DFF KF D+IS +LQFVIDDIS  DSSAVGVTWHLEWKG+PFPFS
Sbjct: 112 DLIFPQPFVGRKAIMDFFNKFIDTISMELQFVIDDISEADSSAVGVTWHLEWKGRPFPFS 171

Query: 177 KGCSFYKLEVVNGKRQI 193
           KGCSFY+LEV+NGKRQI
Sbjct: 172 KGCSFYRLEVLNGKRQI 188


>gi|255574237|ref|XP_002528033.1| protein transporter, putative [Ricinus communis]
 gi|223532563|gb|EEF34351.1| protein transporter, putative [Ricinus communis]
          Length = 218

 Score =  195 bits (496), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 94/150 (62%), Positives = 113/150 (75%), Gaps = 5/150 (3%)

Query: 44  TTRKRLAPLSKLRISSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLA 103
           + RK     + + +SSS+N        +P P T +   D    + +VRRFY GIN  DLA
Sbjct: 35  SKRKAWTLRTPVSVSSSDNQ-----TVTPSPATKDDAADTQLASDIVRRFYEGINDHDLA 89

Query: 104 SVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVT 163
           SVE+LIA++CVYEDLIFP PF+GRKA L+FFKKF DSIS DLQFVIDDIS EDS AVGVT
Sbjct: 90  SVEDLIAENCVYEDLIFPHPFVGRKAILEFFKKFIDSISKDLQFVIDDISTEDSLAVGVT 149

Query: 164 WHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
           WHLEWKG+PFPFSKGCSFY+L+V+N K+QI
Sbjct: 150 WHLEWKGRPFPFSKGCSFYRLQVLNSKKQI 179


>gi|224146286|ref|XP_002325950.1| predicted protein [Populus trichocarpa]
 gi|222862825|gb|EEF00332.1| predicted protein [Populus trichocarpa]
          Length = 164

 Score =  187 bits (474), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 94/138 (68%), Positives = 106/138 (76%), Gaps = 12/138 (8%)

Query: 56  RISSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVY 115
           + SSSEN    V VA  + + S+V          VR FY GING DL SVEELIA++CVY
Sbjct: 1   KASSSENQ--TVTVAPSVESASDV----------VRSFYEGINGHDLDSVEELIAENCVY 48

Query: 116 EDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPF 175
           EDLIFPRPF+GRKA L+FF KF D++S DLQFVIDDIS EDS AVGVTWHLEWKGK FPF
Sbjct: 49  EDLIFPRPFVGRKAILEFFNKFIDTVSKDLQFVIDDISNEDSFAVGVTWHLEWKGKSFPF 108

Query: 176 SKGCSFYKLEVVNGKRQI 193
           SKGCSFY+L+ VNGKRQI
Sbjct: 109 SKGCSFYRLDAVNGKRQI 126


>gi|449450034|ref|XP_004142769.1| PREDICTED: uncharacterized protein LOC101216887 [Cucumis sativus]
          Length = 218

 Score =  185 bits (469), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 101/157 (64%), Positives = 118/157 (75%), Gaps = 3/157 (1%)

Query: 37  SHFISTPTTRKRLAPLSKLRISSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAG 96
           SH  ++P+  K  AP S LR+SSS ++  AV V SP PT + +D       VV R FY G
Sbjct: 27  SHSFTSPSIPKP-APYSPLRVSSSSSDNPAVTVPSP-PTDAPLDTLRSASNVV-RDFYDG 83

Query: 97  INGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAED 156
           +N  DLASVE LIA++CVYEDLIF RPF+GRK  L FFKKF+DSIS DLQFVIDDIS ED
Sbjct: 84  VNRHDLASVEPLIAENCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTED 143

Query: 157 SSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
           SSA+GV WHLEWKGK FPFSKGCSFY+L  V+GK+QI
Sbjct: 144 SSALGVLWHLEWKGKEFPFSKGCSFYRLADVDGKKQI 180


>gi|356495768|ref|XP_003516745.1| PREDICTED: uncharacterized protein LOC100802494 [Glycine max]
          Length = 221

 Score =  184 bits (467), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 88/144 (61%), Positives = 109/144 (75%), Gaps = 4/144 (2%)

Query: 53  SKLRISSSENNRTAVDVASPLPTTSNVDGDD--GGGAVVVRRFYAGINGRDLASVEELIA 110
           SK R+      + +V   +   TT++ D +D     + VVR FY GIN  D+ SVE L+A
Sbjct: 43  SKWRVVDGVGEKQSVTTLAA--TTTSFDDNDVVDSNSAVVRNFYGGINAHDVDSVEYLMA 100

Query: 111 DDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKG 170
           ++CVYEDL+FPRPF+GRK  L+FFKKF++S S  LQFVIDD+S EDSS+VGV WHLEWKG
Sbjct: 101 ENCVYEDLVFPRPFVGRKEILEFFKKFTNSTSKGLQFVIDDLSTEDSSSVGVIWHLEWKG 160

Query: 171 KPFPFSKGCSFYKLEVVNGKRQIT 194
           KPFPFSKGCSFY+LEV+NGKRQIT
Sbjct: 161 KPFPFSKGCSFYRLEVINGKRQIT 184


>gi|449483825|ref|XP_004156703.1| PREDICTED: uncharacterized LOC101216887 [Cucumis sativus]
          Length = 218

 Score =  183 bits (464), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 100/157 (63%), Positives = 117/157 (74%), Gaps = 3/157 (1%)

Query: 37  SHFISTPTTRKRLAPLSKLRISSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAG 96
           SH  ++P+  K  AP S LR+SSS ++  AV V SP PT + +D       VV R FY G
Sbjct: 27  SHSFTSPSIPKP-APYSPLRVSSSSSDNPAVTVPSP-PTDAPLDTLRSASNVV-RDFYDG 83

Query: 97  INGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAED 156
           +N  DLASVE LIA++CVYEDLIF RPF+GRK  L FFKKF+DSIS DLQFVIDDIS ED
Sbjct: 84  VNRHDLASVEPLIAENCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTED 143

Query: 157 SSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
           SSA+GV WHLEWKGK FPFSKGC FY+L  V+GK+QI
Sbjct: 144 SSALGVLWHLEWKGKEFPFSKGCRFYRLADVDGKKQI 180


>gi|356539020|ref|XP_003537998.1| PREDICTED: uncharacterized protein LOC100813246 [Glycine max]
          Length = 227

 Score =  182 bits (463), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 80/109 (73%), Positives = 95/109 (87%)

Query: 86  GAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDL 145
            + V++ FY GIN  D+ SVE LIA++CVYEDL+FPRPF+GRK  L+FFKKF++S S DL
Sbjct: 82  ASAVLKNFYGGINAHDVDSVEYLIAENCVYEDLVFPRPFVGRKEILEFFKKFTNSTSKDL 141

Query: 146 QFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQIT 194
           QFVIDD+S EDSS+VGV WHLEWKGKPFPFSKGCSFY+LEV+NGKRQIT
Sbjct: 142 QFVIDDLSTEDSSSVGVIWHLEWKGKPFPFSKGCSFYRLEVINGKRQIT 190


>gi|294461593|gb|ADE76357.1| unknown [Picea sitchensis]
          Length = 233

 Score =  174 bits (441), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 73/105 (69%), Positives = 94/105 (89%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           +VR FY+ IN R+L  V ELIA++CVY+DL+FP+PF+GRK  +DFFKKF+D+ S+DLQFV
Sbjct: 91  IVREFYSRINSRELEFVGELIAENCVYDDLVFPQPFVGRKDIIDFFKKFTDATSTDLQFV 150

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
           IDDI++EDSSAVG+TWHLEW+GKPFPFS+GCSFY+ E+ NGK+QI
Sbjct: 151 IDDITSEDSSAVGITWHLEWRGKPFPFSRGCSFYRCEIFNGKQQI 195


>gi|15217513|ref|NP_177304.1| nuclear transport factor 2-like protein [Arabidopsis thaliana]
 gi|12323719|gb|AAG51815.1|AC016163_4 unknown protein; 43994-42987 [Arabidopsis thaliana]
 gi|37202096|gb|AAQ89663.1| At1g71480 [Arabidopsis thaliana]
 gi|51969762|dbj|BAD43573.1| unknown protein [Arabidopsis thaliana]
 gi|51969880|dbj|BAD43632.1| unknown protein [Arabidopsis thaliana]
 gi|51971487|dbj|BAD44408.1| unknown protein [Arabidopsis thaliana]
 gi|51971841|dbj|BAD44585.1| unknown protein [Arabidopsis thaliana]
 gi|51971939|dbj|BAD44634.1| unknown protein [Arabidopsis thaliana]
 gi|222423003|dbj|BAH19485.1| AT1G71480 [Arabidopsis thaliana]
 gi|332197085|gb|AEE35206.1| nuclear transport factor 2-like protein [Arabidopsis thaliana]
          Length = 216

 Score =  174 bits (440), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 79/105 (75%), Positives = 90/105 (85%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           VV  FYA +N  DL+SV +LIA DCVYEDL+F  PF+GRKA LDFF KF +S S+DLQFV
Sbjct: 74  VVSSFYAAVNVHDLSSVTDLIAQDCVYEDLVFSSPFVGRKAILDFFGKFIESTSTDLQFV 133

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
           IDDIS EDSSAVGV+WHLEWKGK FPFSKGCSFY+LEV++GKRQI
Sbjct: 134 IDDISTEDSSAVGVSWHLEWKGKNFPFSKGCSFYRLEVIDGKRQI 178


>gi|224135761|ref|XP_002327297.1| predicted protein [Populus trichocarpa]
 gi|222835667|gb|EEE74102.1| predicted protein [Populus trichocarpa]
          Length = 150

 Score =  170 bits (431), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 74/105 (70%), Positives = 87/105 (82%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           V+RRFY G+N  DL +VEE+IA++CVYEDLIFP P +G K  ++FF++F DS SSDL FV
Sbjct: 8   VIRRFYDGLNSHDLVAVEEIIAENCVYEDLIFPHPCVGHKEVIEFFQQFFDSTSSDLHFV 67

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
           IDDIS EDS AVGV WH+EWKGKP P SKGCSFY+LE VNGKRQI
Sbjct: 68  IDDISGEDSQAVGVIWHIEWKGKPMPCSKGCSFYRLETVNGKRQI 112


>gi|168009022|ref|XP_001757205.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691703|gb|EDQ78064.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 148

 Score =  162 bits (410), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 70/109 (64%), Positives = 87/109 (79%)

Query: 86  GAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDL 145
            AV+V+ FYA IN R++ S+ +L AD+CVYEDL+FP PF GR+A LDFFKKF DS+ S+L
Sbjct: 3   AAVMVQEFYAAINRREITSIGDLFADNCVYEDLVFPTPFTGRQAILDFFKKFMDSVGSEL 62

Query: 146 QFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQIT 194
           +F IDDI+  D +A GV WHLEW+GKP PFSKGCSFY+ EV+NGKRQ  
Sbjct: 63  EFRIDDITTGDPNAAGVIWHLEWRGKPLPFSKGCSFYRCEVLNGKRQFV 111


>gi|226509360|ref|NP_001141296.1| uncharacterized protein LOC100273387 [Zea mays]
 gi|194703844|gb|ACF86006.1| unknown [Zea mays]
 gi|414876798|tpg|DAA53929.1| TPA: hypothetical protein ZEAMMB73_536788 [Zea mays]
          Length = 225

 Score =  162 bits (409), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 70/96 (72%), Positives = 85/96 (88%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           VVR+FY G+N RDLA+VE LIA+ CVYEDL+FPRPF+GR+  + FF +F  +IS DLQFV
Sbjct: 72  VVRQFYDGVNRRDLAAVEPLIAEGCVYEDLVFPRPFVGRERVVGFFGEFMGTISPDLQFV 131

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKL 184
           IDDISAEDS+AVGVTWHLEW+G+PFPFS+GCSFY+L
Sbjct: 132 IDDISAEDSAAVGVTWHLEWRGRPFPFSRGCSFYRL 167


>gi|115478242|ref|NP_001062716.1| Os09g0266000 [Oryza sativa Japonica Group]
 gi|51536247|dbj|BAD38416.1| nuclear transport factor 2 (NTF2)-like protein [Oryza sativa
           Japonica Group]
 gi|113630949|dbj|BAF24630.1| Os09g0266000 [Oryza sativa Japonica Group]
 gi|215692578|dbj|BAG87998.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 331

 Score =  155 bits (393), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 71/109 (65%), Positives = 88/109 (80%), Gaps = 1/109 (0%)

Query: 86  GAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDL 145
            A VVR FY G+N RDLA+VE LIA+ CVYEDL+FP PF+GR   L FF  F  S+SSDL
Sbjct: 67  AADVVRAFYDGVNRRDLAAVEPLIAEGCVYEDLVFPNPFVGRAEILGFFAGFMGSVSSDL 126

Query: 146 QFVIDDISA-EDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
           +FVIDDISA +DS AVGVTWHL+WKG+PFPFS+GCSFY+L++   ++Q+
Sbjct: 127 RFVIDDISAGDDSRAVGVTWHLDWKGRPFPFSRGCSFYRLQLDEKQQQL 175


>gi|51536248|dbj|BAD38417.1| nuclear transport factor 2 (NTF2)-like protein [Oryza sativa
           Japonica Group]
 gi|125604939|gb|EAZ43975.1| hypothetical protein OsJ_28594 [Oryza sativa Japonica Group]
 gi|215695383|dbj|BAG90574.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 215

 Score =  155 bits (391), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 70/106 (66%), Positives = 87/106 (82%), Gaps = 1/106 (0%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           VVR FY G+N RDLA+VE LIA+ CVYEDL+FP PF+GR   L FF  F  S+SSDL+FV
Sbjct: 70  VVRAFYDGVNRRDLAAVEPLIAEGCVYEDLVFPNPFVGRAEILGFFAGFMGSVSSDLRFV 129

Query: 149 IDDISA-EDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
           IDDISA +DS AVGVTWHL+WKG+PFPFS+GCSFY+L++   ++Q+
Sbjct: 130 IDDISAGDDSRAVGVTWHLDWKGRPFPFSRGCSFYRLQLDEKQQQL 175


>gi|125562981|gb|EAZ08361.1| hypothetical protein OsI_30617 [Oryza sativa Indica Group]
          Length = 215

 Score =  154 bits (390), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 70/106 (66%), Positives = 87/106 (82%), Gaps = 1/106 (0%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           VVR FY G+N RDLA+VE LIA+ CVYEDL+FP PF+GR   L FF  F  S+SSDL+FV
Sbjct: 70  VVRAFYDGLNRRDLAAVEPLIAEGCVYEDLVFPNPFVGRAEILGFFAGFMGSVSSDLRFV 129

Query: 149 IDDISA-EDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
           IDDISA +DS AVGVTWHL+WKG+PFPFS+GCSFY+L++   ++Q+
Sbjct: 130 IDDISAGDDSRAVGVTWHLDWKGRPFPFSRGCSFYRLQLDEKQQQL 175


>gi|357167107|ref|XP_003581007.1| PREDICTED: uncharacterized protein LOC100842516 [Brachypodium
           distachyon]
          Length = 196

 Score =  154 bits (388), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 70/104 (67%), Positives = 82/104 (78%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           VVR FY G+N RDLA+VE LIA+ CVYEDL+FPRP +GR   L FF +F  SIS DL+FV
Sbjct: 51  VVREFYDGVNRRDLAAVEPLIAEGCVYEDLVFPRPMVGRDQVLGFFGEFMGSISPDLRFV 110

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQ 192
           IDDIS  D SAVGVTWHLEWKG+PFPFS+GCSFY+ +    K+Q
Sbjct: 111 IDDISGLDPSAVGVTWHLEWKGRPFPFSRGCSFYRCQPDPQKQQ 154


>gi|147794044|emb|CAN71166.1| hypothetical protein VITISV_035107 [Vitis vinifera]
          Length = 203

 Score =  151 bits (382), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 88/170 (51%), Positives = 113/170 (66%), Gaps = 18/170 (10%)

Query: 5   SSISSSSLRTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRISSSENNR 64
           S +++S+    PS L P L         ++I++HF     T++R    + LR+SSS  N 
Sbjct: 2   SLLTNSTTLLRPSKLSPPL----LHPHTASIATHF-----TQRRSRRCNLLRVSSSPQNP 52

Query: 65  TAVDVASPLPTTSNVDGD-DGG-------GAVVVRRFYAGINGRDLASVEELIADDCVYE 116
           TA  +A P+ TT + + + DGG       G+ VVR FYAGIN  DL SVEELIA +CVYE
Sbjct: 53  TAT-IAEPVTTTQSDEANGDGGDESILLSGSSVVRAFYAGINSHDLDSVEELIAHNCVYE 111

Query: 117 DLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHL 166
           DLIFP+PF+GRKA +DFF KF D+IS +LQFVIDBIS  DSSAVGVTWHL
Sbjct: 112 DLIFPQPFVGRKAIMDFFNKFIDTISMELQFVIDBISEADSSAVGVTWHL 161


>gi|326522692|dbj|BAJ88392.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 203

 Score =  148 bits (374), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 63/95 (66%), Positives = 78/95 (82%)

Query: 91  RRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVID 150
           R FY+G+N RDLA+V  LIA+ CVYEDL+FPRP +GR   + FF +F  S+S DL+FVID
Sbjct: 60  REFYSGVNRRDLAAVAPLIAEGCVYEDLVFPRPMVGRDRVVGFFGEFMGSVSPDLRFVID 119

Query: 151 DISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLE 185
           DIS +D SAVGVTWHLEWKG+PFPFS+GCSFY+ +
Sbjct: 120 DISGDDPSAVGVTWHLEWKGRPFPFSRGCSFYRCQ 154


>gi|242042631|ref|XP_002468710.1| hypothetical protein SORBIDRAFT_01g050630 [Sorghum bicolor]
 gi|241922564|gb|EER95708.1| hypothetical protein SORBIDRAFT_01g050630 [Sorghum bicolor]
          Length = 98

 Score =  144 bits (363), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 63/88 (71%), Positives = 77/88 (87%)

Query: 97  INGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAED 156
           +N R++A+VE LIA+ CVYEDL+FPRPF+GR+  + FF KF  +IS DLQFVIDDISA+D
Sbjct: 1   MNRREVAAVEPLIAEGCVYEDLVFPRPFVGRERIIGFFGKFMGTISLDLQFVIDDISADD 60

Query: 157 SSAVGVTWHLEWKGKPFPFSKGCSFYKL 184
           S AVGVTWHLEW+GKPFPFS+GCSFY+L
Sbjct: 61  SVAVGVTWHLEWRGKPFPFSRGCSFYRL 88


>gi|302811542|ref|XP_002987460.1| hypothetical protein SELMODRAFT_446976 [Selaginella moellendorffii]
 gi|300144866|gb|EFJ11547.1| hypothetical protein SELMODRAFT_446976 [Selaginella moellendorffii]
          Length = 181

 Score =  127 bits (320), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 59/114 (51%), Positives = 81/114 (71%), Gaps = 1/114 (0%)

Query: 81  GDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDS 140
           G  G     VR+FY+ IN R L    +LIA DCVYEDL+F +PF+GRKA ++ F+ F+ S
Sbjct: 25  GRLGAAKDTVRQFYSAINHRQLEIAGDLIASDCVYEDLVFSKPFVGRKAIIELFETFTSS 84

Query: 141 ISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQIT 194
           +  ++ FVID+IS + + +VGVTWHL+W+G  FPFSKGCSFY+ E V+G  +I 
Sbjct: 85  VGPEVSFVIDEIS-DGNLSVGVTWHLDWRGNIFPFSKGCSFYRCEDVSGDYKIV 137


>gi|302796609|ref|XP_002980066.1| hypothetical protein SELMODRAFT_58583 [Selaginella moellendorffii]
 gi|300152293|gb|EFJ18936.1| hypothetical protein SELMODRAFT_58583 [Selaginella moellendorffii]
          Length = 143

 Score =  127 bits (318), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 57/105 (54%), Positives = 79/105 (75%), Gaps = 1/105 (0%)

Query: 90  VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVI 149
           VR+FY+ IN R L    +LIA DCVYEDL+F +PF+GRKA ++ F+ F+ S+  ++ FVI
Sbjct: 6   VRQFYSAINHRQLEIAGDLIASDCVYEDLVFSKPFVGRKAIIELFETFTSSVGPEVSFVI 65

Query: 150 DDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQIT 194
           D+IS + + +VGVTWHL+W+G  FPFSKGCSFY+ E V+G  +I 
Sbjct: 66  DEIS-DGNLSVGVTWHLDWRGNVFPFSKGCSFYRCEDVSGDYKIV 109


>gi|218246837|ref|YP_002372208.1| hypothetical protein PCC8801_2015 [Cyanothece sp. PCC 8801]
 gi|257059878|ref|YP_003137766.1| hypothetical protein Cyan8802_2040 [Cyanothece sp. PCC 8802]
 gi|218167315|gb|ACK66052.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
 gi|256590044|gb|ACV00931.1| conserved hypothetical protein [Cyanothece sp. PCC 8802]
          Length = 143

 Score =  112 bits (280), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 51/96 (53%), Positives = 65/96 (67%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           V+   Y  IN RD+ S  E I D C+YEDL FP+PF+G++   + F +   +I  DLQFV
Sbjct: 8   VIESIYGAINRRDIDSAMEWIDDQCIYEDLNFPQPFIGKEKVKELFAESMQNIPDDLQFV 67

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKL 184
           IDDI++EDS AVGV WHLE  G  FP S+G SFY+L
Sbjct: 68  IDDITSEDSLAVGVLWHLELNGMTFPNSRGASFYRL 103


>gi|302852119|ref|XP_002957581.1| hypothetical protein VOLCADRAFT_98696 [Volvox carteri f.
           nagariensis]
 gi|300257098|gb|EFJ41351.1| hypothetical protein VOLCADRAFT_98696 [Volvox carteri f.
           nagariensis]
          Length = 479

 Score =  105 bits (263), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 63/178 (35%), Positives = 93/178 (52%), Gaps = 12/178 (6%)

Query: 19  LLPCLNQTTFSYS-NSTISSHFISTPTTRKRLAPLSKLRISSSENNRTAVDVASPLPTTS 77
           +LPC +     Y      SS +  T     RL   +++   +++        A P     
Sbjct: 1   MLPCASYKQLLYRVKCKHSSSWTRTKEQSPRLWTYARIACYAAKTAE-----AVPPQAEG 55

Query: 78  NVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKF 137
            V+G  G   +V  R+Y   N  DL ++  L+A D  Y D+I+  PF+GR   L++ KK 
Sbjct: 56  TVNGLTGKDVIV--RYYEAYNSGDLDTIASLLAPDVSYHDMIYEDPFVGRDEVLNYLKKV 113

Query: 138 SDSISSDLQFVIDDISAEDSSAVGVTWHLEWK-GKPFPFSKGCSFYKLEVVNGKRQIT 194
             ++ +DLQFVI+D++  D  AVG+TWH+E   G  FPFS+GCSFY L   N + QIT
Sbjct: 114 RRTVPADLQFVIEDVTDGDPRAVGITWHVECGDGVVFPFSRGCSFYTL---NPRGQIT 168


>gi|254414879|ref|ZP_05028643.1| hypothetical protein MC7420_1164 [Coleofasciculus chthonoplastes
           PCC 7420]
 gi|196178368|gb|EDX73368.1| hypothetical protein MC7420_1164 [Coleofasciculus chthonoplastes
           PCC 7420]
          Length = 391

 Score =  103 bits (256), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 47/103 (45%), Positives = 66/103 (64%)

Query: 88  VVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQF 147
           ++++  Y  IN RD+++  + I D+CVYEDL FP+PF G+ A    F++    +  DL F
Sbjct: 7   LMIKSLYQAINRRDISAAIDCIDDECVYEDLNFPQPFRGKAAVQQLFEESYQGVPDDLLF 66

Query: 148 VIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
           VIDDI+  DSS+ GV WH+E  G PFP  +G SFY+L    GK
Sbjct: 67  VIDDITTGDSSSTGVLWHVELDGIPFPNGRGVSFYRLSETTGK 109


>gi|159487887|ref|XP_001701954.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158281173|gb|EDP06929.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1224

 Score =  101 bits (251), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 47/117 (40%), Positives = 73/117 (62%), Gaps = 1/117 (0%)

Query: 75  TTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFF 134
           +TS+ D     G  V+ R+Y   N  D+ ++  L+A D  Y D+I+  PF GR+  + + 
Sbjct: 76  STSSRDAPVKDGRDVIVRYYEAYNAGDIDTIAGLLAPDVSYHDMIYEEPFQGREEVVAYL 135

Query: 135 KKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEW-KGKPFPFSKGCSFYKLEVVNGK 190
           +K   ++ SDL+FVI+D+++ D  AVG+TWH+E   G  FPFS+GCSFY L+   G+
Sbjct: 136 RKVRKTVPSDLKFVIEDVTSGDPRAVGITWHVECGDGVVFPFSRGCSFYTLDPATGQ 192


>gi|332708365|ref|ZP_08428343.1| SnoaL-like polyketide cyclase [Moorea producens 3L]
 gi|332352858|gb|EGJ32420.1| SnoaL-like polyketide cyclase [Moorea producens 3L]
          Length = 394

 Score = 98.6 bits (244), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 46/102 (45%), Positives = 62/102 (60%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           +++  Y  IN RD+ +  E I D C+YEDL F +PF G++A     ++    I  DL+FV
Sbjct: 8   LIQSMYEAINRRDVNAAMEWIDDQCIYEDLNFSQPFKGKEAVRQLLEESCQGIPDDLKFV 67

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
           IDDI+  D  AVGV WH+E  G PFP  +G SFY+   V GK
Sbjct: 68  IDDITTGDPLAVGVLWHVELDGIPFPNGRGVSFYRFSEVTGK 109


>gi|359474296|ref|XP_002273128.2| PREDICTED: uncharacterized protein LOC100251848 [Vitis vinifera]
          Length = 228

 Score = 97.8 bits (242), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 37/94 (39%), Positives = 60/94 (63%)

Query: 90  VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVI 149
           +  FY  IN ++L  + +L++DDC + DL FP+PF G+K  L FF++ +  +  +++F I
Sbjct: 81  IELFYTCINDKNLKQLAKLVSDDCCFNDLSFPQPFKGKKEVLRFFEELTAVMGKNVKFRI 140

Query: 150 DDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYK 183
             +   D     + WHLEW+GK  PF++GCSFY+
Sbjct: 141 LHVCEGDGLTAAIDWHLEWQGKQIPFTRGCSFYE 174


>gi|145358721|ref|NP_198962.2| Nuclear transport factor 2 (NTF2) family protein [Arabidopsis
           thaliana]
 gi|332007299|gb|AED94682.1| Nuclear transport factor 2 (NTF2) family protein [Arabidopsis
           thaliana]
          Length = 277

 Score = 96.3 bits (238), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 73/134 (54%), Gaps = 5/134 (3%)

Query: 51  PLSKLRISSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIA 110
           P+  +   +S  N     +  P    + + G D      V +FY+ IN ++   +   I+
Sbjct: 45  PIKMINKYASRRNIVVSCLDRPNSRPNQISGYDA-----VMKFYSSINEKNQDQLSSCIS 99

Query: 111 DDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKG 170
            DC  +D  FP+PF G++  ++FF++   S+  +++F ++++   D  +  V WHLEWKG
Sbjct: 100 SDCFIDDFSFPKPFRGKQEAMEFFEELVKSMGQNVKFCVENVCEGDGHSAAVNWHLEWKG 159

Query: 171 KPFPFSKGCSFYKL 184
           +  PF++GCSFY+ 
Sbjct: 160 RKIPFTRGCSFYEF 173


>gi|255537559|ref|XP_002509846.1| conserved hypothetical protein [Ricinus communis]
 gi|223549745|gb|EEF51233.1| conserved hypothetical protein [Ricinus communis]
          Length = 326

 Score = 95.5 bits (236), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 67/115 (58%)

Query: 69  VASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRK 128
           V S      N+  +    A ++++FY  IN + L  +++ I+D+C +ED  F  P  G+K
Sbjct: 68  VMSSAADDGNIPLNHISAASMIKKFYTCINEKRLEEIDKYISDNCCFEDCSFYSPIQGKK 127

Query: 129 ATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYK 183
             + F+++ +  +  +++F I+ +  +D   VGV WHLEWK    PF++GCSFY+
Sbjct: 128 EVMHFYQQLTTGMVQNVKFSIEHVCEDDEFTVGVNWHLEWKRTHIPFTRGCSFYE 182


>gi|297805502|ref|XP_002870635.1| hypothetical protein ARALYDRAFT_493835 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316471|gb|EFH46894.1| hypothetical protein ARALYDRAFT_493835 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 277

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 63/108 (58%)

Query: 86  GAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDL 145
           G   V +FY+ IN ++   +   I+ DC  +D  F +PF GRK  + FF++   S+  ++
Sbjct: 75  GYDAVMKFYSSINEKNQDQLSSCISSDCFIDDFSFSKPFRGRKEAMKFFEELVKSMGQNV 134

Query: 146 QFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
           +F ++++   D  +  V WHLEWKG+  PF++GCSFY+     GK  I
Sbjct: 135 KFCVENVCEGDGHSAAVNWHLEWKGRKIPFTRGCSFYEFTDEGGKLVI 182


>gi|224071972|ref|XP_002303603.1| predicted protein [Populus trichocarpa]
 gi|222841035|gb|EEE78582.1| predicted protein [Populus trichocarpa]
          Length = 352

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 33/95 (34%), Positives = 61/95 (64%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           ++++FY  IN + L  ++  I++DC +ED  F +P  G++  + FF++ +  +  +++F+
Sbjct: 86  MIKQFYTCINDKKLKELDGYISEDCHFEDCSFLQPMQGKREVMHFFRQLTAGMGENMKFI 145

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYK 183
           I+ +  +D    GV WHLEWK    PF++GCSFY+
Sbjct: 146 IEHVCEDDEMTAGVNWHLEWKTIQIPFTRGCSFYE 180


>gi|67924420|ref|ZP_00517847.1| hypothetical protein CwatDRAFT_1831 [Crocosphaera watsonii WH 8501]
 gi|416403736|ref|ZP_11687588.1| hypothetical protein CWATWH0003_4349 [Crocosphaera watsonii WH
           0003]
 gi|67853732|gb|EAM49064.1| hypothetical protein CwatDRAFT_1831 [Crocosphaera watsonii WH 8501]
 gi|357261654|gb|EHJ10897.1| hypothetical protein CWATWH0003_4349 [Crocosphaera watsonii WH
           0003]
          Length = 386

 Score = 94.0 bits (232), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 62/102 (60%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           ++   Y  IN RD+    + + DDC+YED+ F +PF G++A    F++  D++  DL+F+
Sbjct: 9   IIESIYQAINNRDIEQAIQWVDDDCLYEDMNFSQPFQGKEAVKTLFQESCDNVPDDLKFI 68

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
           ID+I+ ED   VG+ WH+E    P P  +G SFY++    GK
Sbjct: 69  IDEITTEDPLKVGILWHVELDDIPLPNGRGVSFYRISETTGK 110


>gi|308804507|ref|XP_003079566.1| unnamed protein product [Ostreococcus tauri]
 gi|116058021|emb|CAL54224.1| unnamed protein product [Ostreococcus tauri]
          Length = 454

 Score = 93.6 bits (231), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 43/105 (40%), Positives = 64/105 (60%), Gaps = 3/105 (2%)

Query: 90  VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVI 149
           +   Y  IN RD+A   E +A+D VYED  FP PF+G+ A    F++    I  DL FV+
Sbjct: 48  IATMYDRINARDVAGALECVAEDVVYEDFNFPEPFVGKAAVRALFEESCSGIPDDLAFVV 107

Query: 150 DDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQIT 194
           D+ ++   ++VG TWH+E  G+ FP ++GCS Y+   VNG  ++T
Sbjct: 108 DEWTSGSGASVGCTWHVEIMGEAFPNARGCSLYR---VNGDGKLT 149


>gi|303276012|ref|XP_003057300.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461652|gb|EEH58945.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 399

 Score = 92.8 bits (229), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 42/102 (41%), Positives = 63/102 (61%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           VVR  Y  IN RD+A+  E + DD +YED  FP PF G+ A    F++  D I  D+ F+
Sbjct: 5   VVRGMYDAINRRDVAAALEFVDDDILYEDFNFPTPFKGKAAVKKLFEESCDGIPDDMLFI 64

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
           ID+ +     +VG+TW++E +G+PFP ++G S Y++    GK
Sbjct: 65  IDECTDGGGKSVGMTWYVELEGEPFPNARGASLYRISETTGK 106


>gi|60547917|gb|AAX23922.1| hypothetical protein At5g41470 [Arabidopsis thaliana]
 gi|71905583|gb|AAZ52769.1| hypothetical protein At5g41470 [Arabidopsis thaliana]
          Length = 198

 Score = 92.4 bits (228), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 60/93 (64%)

Query: 92  RFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDD 151
           +FY+ IN ++   +   I+ DC  +D  FP+PF G++  ++FF++   S+  +++F +++
Sbjct: 2   KFYSSINEKNQDQLSSCISSDCFIDDFSFPKPFRGKQEAMEFFEELVKSMGQNVKFCVEN 61

Query: 152 ISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKL 184
           +   D  +  V WHLEWKG+  PF++GCSFY+ 
Sbjct: 62  VCEGDGHSAAVNWHLEWKGRKIPFTRGCSFYEF 94


>gi|218194167|gb|EEC76594.1| hypothetical protein OsI_14445 [Oryza sativa Indica Group]
          Length = 612

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 41/107 (38%), Positives = 64/107 (59%), Gaps = 3/107 (2%)

Query: 78  NVDGDDGGGAV--VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFK 135
           N  G  GG  +  V+++FY+ +N +D   +E LIA DC+ +D  + +  L  K+T  +F+
Sbjct: 425 NSTGGRGGSPLPDVIQQFYSSLNEKDSKRLENLIAPDCIIDDNAYYK-LLDIKSTQTYFR 483

Query: 136 KFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFY 182
           +  D++  + +F ID++S        V WHLEW GK  PF+KGCSFY
Sbjct: 484 RLMDAMGKNFKFAIDEVSQGVEPTFAVMWHLEWNGKTIPFTKGCSFY 530


>gi|427420214|ref|ZP_18910397.1| hypothetical protein Lepto7375DRAFT_6094 [Leptolyngbya sp. PCC
           7375]
 gi|425762927|gb|EKV03780.1| hypothetical protein Lepto7375DRAFT_6094 [Leptolyngbya sp. PCC
           7375]
          Length = 146

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/101 (43%), Positives = 61/101 (60%), Gaps = 1/101 (0%)

Query: 90  VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVI 149
           ++  Y  IN RD++    LI D C YEDL F   F+G+ A    F +   ++ +DLQFVI
Sbjct: 8   IQSIYEAINHRDVSQAVALIDDQCCYEDLNFSATFMGKAAVQGLFTESCQAVPADLQFVI 67

Query: 150 DDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
           DD++ +D  AVGVTWH+E  G  FP  +G SFY+    +GK
Sbjct: 68  DDMAGDD-LAVGVTWHVELDGIAFPNGRGVSFYRFSPNSGK 107


>gi|357481563|ref|XP_003611067.1| hypothetical protein MTR_5g010050 [Medicago truncatula]
 gi|355512402|gb|AES94025.1| hypothetical protein MTR_5g010050 [Medicago truncatula]
          Length = 249

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 59/101 (58%)

Query: 87  AVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQ 146
           A +V  FY  IN ++L  ++E I+ D  + D  F  PF G+K  + F ++ +  +  +++
Sbjct: 84  AEIVDHFYTCINEKELKQLDEYISQDACFYDYTFINPFQGKKEVMHFLQQLTAGMGQNVK 143

Query: 147 FVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVV 187
           F++ +I   D   V   WHLEWK +  PF+ GCSF++LE V
Sbjct: 144 FIVKNICEGDDLTVAAKWHLEWKKEQIPFTTGCSFFQLEKV 184


>gi|145346791|ref|XP_001417866.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578094|gb|ABO96159.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 429

 Score = 90.1 bits (222), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 44/101 (43%), Positives = 60/101 (59%), Gaps = 1/101 (0%)

Query: 90  VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVI 149
           VR  Y  IN RD+      + ++ VYED  FP PF G+ A    F++  + I  DL FVI
Sbjct: 26  VREMYERINARDVDGALACVDENVVYEDFNFPAPFRGKAAVKKLFEESCEGIPDDLTFVI 85

Query: 150 DDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
            D S     AVG+TWH+E  G+ FP ++GCSFY+++  NGK
Sbjct: 86  -DASTCGGDAVGLTWHVELMGEAFPNARGCSFYRVDETNGK 125


>gi|359461040|ref|ZP_09249603.1| hypothetical protein ACCM5_20105 [Acaryochloris sp. CCMEE 5410]
          Length = 406

 Score = 90.1 bits (222), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 44/122 (36%), Positives = 68/122 (55%), Gaps = 9/122 (7%)

Query: 69  VASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRK 128
           V +P+P  +++         +++  Y  IN R++A   E +  DC Y+DL FP PF G+ 
Sbjct: 8   VLNPMPLPAHL---------LIQAMYEAINERNVAKALEYVDPDCCYQDLNFPTPFQGKA 58

Query: 129 ATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVN 188
           A  + F +    + +DL FV+DDI+  D+ AVG+ WH+E  G P P  +G SF +L    
Sbjct: 59  AVEELFTESCKGMPADLLFVVDDITEGDALAVGILWHVELGGIPLPNGRGVSFCRLSEKT 118

Query: 189 GK 190
           GK
Sbjct: 119 GK 120


>gi|307104859|gb|EFN53111.1| hypothetical protein CHLNCDRAFT_9491 [Chlorella variabilis]
          Length = 391

 Score = 89.7 bits (221), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 44/107 (41%), Positives = 63/107 (58%), Gaps = 8/107 (7%)

Query: 93  FYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDI 152
           +Y   N + + +V ELIA+DCVYEDLI+  PF+GR A   +F+K    +  D++F ++DI
Sbjct: 10  YYDSYNSKQMGAVLELIAEDCVYEDLIYQDPFVGRAAIAAYFRKIEALVPPDIKFCVEDI 69

Query: 153 SAEDSSAVGVTWHLE-----WKGKPFPFSKGCSFYKLEVVNGKRQIT 194
           +  D    GV WH+E      +   FPFS+G SFY+   VN   QI 
Sbjct: 70  TEGDPRRCGVRWHVEIADDSGRATEFPFSRGVSFYE---VNDLGQIV 113


>gi|158335683|ref|YP_001516855.1| hypothetical protein AM1_2533 [Acaryochloris marina MBIC11017]
 gi|158305924|gb|ABW27541.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 406

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 61/103 (59%)

Query: 88  VVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQF 147
           ++++  Y  IN R++A   E +  DC Y+DL FP PF G+ A  + F +    + +DL F
Sbjct: 18  LLIQAMYEAINERNVAKALEYVDPDCCYQDLNFPIPFQGKAAVGELFTESCKGMPADLLF 77

Query: 148 VIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
           V+DDI+  D+ AVG+ WH+E  G P P  +G SF +L    GK
Sbjct: 78  VVDDITEGDALAVGILWHVELGGIPLPNGRGVSFCRLSEKTGK 120


>gi|255083853|ref|XP_002508501.1| predicted protein [Micromonas sp. RCC299]
 gi|226523778|gb|ACO69759.1| predicted protein [Micromonas sp. RCC299]
          Length = 428

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 64/102 (62%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           VVR  Y  IN RD+ +    + DD +YED  FP PF G++     F++  D I  D+ F+
Sbjct: 5   VVRGMYDAINRRDVEAALAFVDDDILYEDFNFPVPFKGKERVRKLFEESCDGIPDDMLFI 64

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
           +D+ +     +VG+TW++E +G+PFP ++G SFY+++  +GK
Sbjct: 65  VDECTDGGGLSVGMTWYVELEGEPFPNARGASFYRIDPTSGK 106


>gi|357463753|ref|XP_003602158.1| hypothetical protein MTR_3g090440 [Medicago truncatula]
 gi|355491206|gb|AES72409.1| hypothetical protein MTR_3g090440 [Medicago truncatula]
          Length = 255

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/197 (27%), Positives = 98/197 (49%), Gaps = 30/197 (15%)

Query: 9   SSSLRTSPSSLLPCLNQTTFSYSNSTISSHFIST-------PTTRKRLAPLSKLRISSSE 61
           +SS + + +S  PC+ +    Y+N T   HFI+        P  +K+L  + K ++ SS+
Sbjct: 17  TSSNKNTTNSTRPCITRL---YNNKT---HFINLVAKPMLLPNIKKQLK-IGKWKVLSSD 69

Query: 62  NNRTAVDV----ASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYED 117
           +    V      ASP P              +V  FY   N +D  ++++L++ +CVY+D
Sbjct: 70  DGSGDVAPEFLPASPSPVH------------IVHEFYEAFNKKDTETLKQLLSPNCVYQD 117

Query: 118 LIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSK 177
           L+F   + G+++ + F++   D++  ++   ++D+   +   V    HL WK K  PF+ 
Sbjct: 118 LLFYTAYEGQESIIKFWQSVMDAMGPNIHVFVEDVKESNHVMVTAFMHLVWKEKKLPFTN 177

Query: 178 GCSFYKLEVVNGKRQIT 194
           GC F+  E V GK  I+
Sbjct: 178 GCRFFTFEEVEGKFLIS 194


>gi|172035034|ref|YP_001801535.1| hypothetical protein cce_0117 [Cyanothece sp. ATCC 51142]
 gi|354551958|ref|ZP_08971266.1| hypothetical protein Cy51472DRAFT_0062 [Cyanothece sp. ATCC 51472]
 gi|171696488|gb|ACB49469.1| hypothetical protein cce_0117 [Cyanothece sp. ATCC 51142]
 gi|353555280|gb|EHC24668.1| hypothetical protein Cy51472DRAFT_0062 [Cyanothece sp. ATCC 51472]
          Length = 386

 Score = 86.3 bits (212), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 59/102 (57%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           ++   Y  IN R++    + + DDC+YED+ F + F G+KA    F++  D++  D +FV
Sbjct: 9   IIELIYQAINNREIDQAMQWVDDDCIYEDVNFSKTFQGKKAVKSLFQESCDNVPDDFRFV 68

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
           +D+I+  D   VGV WH+E    P P  +G SFY++    GK
Sbjct: 69  VDEITTGDPLKVGVLWHVELDNIPIPNGRGVSFYRISETTGK 110


>gi|449518407|ref|XP_004166233.1| PREDICTED: uncharacterized protein LOC101231762 [Cucumis sativus]
          Length = 193

 Score = 86.3 bits (212), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 55/93 (59%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           ++ RFY  IN ++L  +   I++DC+ ED +F   F G+KA + F +K ++S+  D++F 
Sbjct: 18  MIERFYKCINEKNLKEMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFR 77

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSF 181
           I  +     S  G  WHLEW+    P +KGC+F
Sbjct: 78  IRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTF 110


>gi|357114599|ref|XP_003559086.1| PREDICTED: uncharacterized protein LOC100837010 [Brachypodium
           distachyon]
          Length = 270

 Score = 85.9 bits (211), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 61/97 (62%), Gaps = 4/97 (4%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLD---FFKKFSDSISSDL 145
           V++ FY+ +N +D A +++LI+ DC+ ED  + +P L  K  L+   +F +  +++  ++
Sbjct: 70  VIQEFYSSLNDKDSAQLKKLISPDCIIEDTAYYKP-LDIKVLLNTHTYFTRLMEAMGKNV 128

Query: 146 QFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFY 182
           +F ID++       V V WHLEW GK  PF+KGCSFY
Sbjct: 129 KFAIDEVCQGVEPTVAVMWHLEWNGKTIPFAKGCSFY 165


>gi|449452454|ref|XP_004143974.1| PREDICTED: uncharacterized protein LOC101214565 [Cucumis sativus]
          Length = 157

 Score = 85.9 bits (211), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 55/93 (59%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           ++ RFY  IN ++L  +   I++DC+ ED +F   F G+KA + F +K ++S+  D++F 
Sbjct: 18  MIERFYKCINEKNLKEMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFR 77

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSF 181
           I  +     S  G  WHLEW+    P +KGC+F
Sbjct: 78  IRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTF 110


>gi|414874058|tpg|DAA52615.1| TPA: hypothetical protein ZEAMMB73_891301 [Zea mays]
          Length = 270

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 61/101 (60%), Gaps = 2/101 (1%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           VV+ FY+ +N R+   +++L+A DC+ ED  + +P L  K T  +FK+  +S+   ++F 
Sbjct: 73  VVQEFYSSLNERNSKRLDKLMAPDCIIEDTAYYKP-LDAKCTHIYFKRLMESMGEKVKFA 131

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNG 189
           ID++         V WHLEW G   PFS+GCSFY +  VNG
Sbjct: 132 IDEVCQGAGRTAAVMWHLEWNGYIIPFSRGCSFY-ICSVNG 171


>gi|242032127|ref|XP_002463458.1| hypothetical protein SORBIDRAFT_01g000255 [Sorghum bicolor]
 gi|241917312|gb|EER90456.1| hypothetical protein SORBIDRAFT_01g000255 [Sorghum bicolor]
          Length = 226

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 57/94 (60%), Gaps = 1/94 (1%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           VV+ FY+ +N ++   +++L+A DC+ ED  + +P L  K T  +FK+  +S+   ++F 
Sbjct: 87  VVQEFYSSLNEKNSKRLDKLMAPDCIVEDTAYYKP-LDAKCTRIYFKRLMESMGKKVKFA 145

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFY 182
           ID++         V WHLEW G   PF+KGCSFY
Sbjct: 146 IDEVCQGAGRTAAVMWHLEWNGYIIPFTKGCSFY 179


>gi|126659484|ref|ZP_01730617.1| hypothetical protein CY0110_07154 [Cyanothece sp. CCY0110]
 gi|126619218|gb|EAZ89954.1| hypothetical protein CY0110_07154 [Cyanothece sp. CCY0110]
          Length = 386

 Score = 84.0 bits (206), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 59/102 (57%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           ++   Y  IN R++    + + DDC+YED+ F + F G++A    F++  D+  +D +FV
Sbjct: 9   IIESIYQAINNREIEQAMQWVDDDCMYEDVNFSKTFQGKEAVKHLFQESCDNAPNDFKFV 68

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
           +D+I+  D   VGV WH+E    P P  +G SFY++    GK
Sbjct: 69  VDEITTGDPLKVGVLWHVELDNIPIPNGRGVSFYRISETTGK 110


>gi|356495695|ref|XP_003516709.1| PREDICTED: uncharacterized protein LOC100779471 [Glycine max]
          Length = 218

 Score = 82.0 bits (201), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 56/95 (58%)

Query: 90  VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVI 149
           V ++Y  IN +DL  ++E I++D  ++D  F +PF G+K  + F ++ +  +  ++ F +
Sbjct: 20  VEQYYTSINDKDLRQLDECISEDACFDDYAFTKPFQGKKEVIRFLEQLTHCMGRNVTFRL 79

Query: 150 DDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKL 184
             I   D      +WHLEW  K  PF++GC+F+KL
Sbjct: 80  KHIYEGDDLTAVASWHLEWNEKQIPFTRGCTFFKL 114


>gi|31193920|gb|AAP44755.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108712240|gb|ABG00035.1| expressed protein [Oryza sativa Japonica Group]
          Length = 259

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 62/112 (55%), Gaps = 7/112 (6%)

Query: 78  NVDGDDGGGAV--VVRRFYAGINGRDLASVEELIADDCVYED-----LIFPRPFLGRKAT 130
           N  G  GG  +  V+++FY+ +N +D   +E LIA DC+ +D     L+  +  L   + 
Sbjct: 66  NSTGGRGGSPLPDVIQQFYSSLNEKDSKRLENLIAPDCIIDDNAYYKLLDIKVLLVLHSH 125

Query: 131 LDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFY 182
           +   ++  D++  + +F ID++S        V WHLEW GK  PF+KGCSFY
Sbjct: 126 ITDLQRLMDAMGKNFKFAIDEVSQGVEPTFAVMWHLEWNGKTIPFTKGCSFY 177


>gi|412993256|emb|CCO16789.1| predicted protein [Bathycoccus prasinos]
          Length = 478

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 41/111 (36%), Positives = 57/111 (51%), Gaps = 9/111 (8%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           VVR  Y  IN RD+      + DD VYED  FP PF G+      F++    I  DL F+
Sbjct: 32  VVRGMYDAINKRDVTKALTFVDDDIVYEDFNFPEPFRGKARVKKLFEESCTGIPDDLDFI 91

Query: 149 IDDISAE---------DSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
           I+  +           +S   G TWH+E  G+PFP ++G SFY++   +GK
Sbjct: 92  IERSTTSSSSEAGGLGESLTFGCTWHVEIAGEPFPNARGASFYEISKESGK 142


>gi|224058447|ref|XP_002299514.1| predicted protein [Populus trichocarpa]
 gi|222846772|gb|EEE84319.1| predicted protein [Populus trichocarpa]
          Length = 276

 Score = 72.4 bits (176), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 56/95 (58%), Gaps = 5/95 (5%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           ++++FY  IN + L  ++  I+DDC +ED  F +P  G+K  + FF + +  +  +++F+
Sbjct: 82  MIKQFYTCINEKKLKELDGYISDDCFFEDCSFLQPMQGKKEVMHFFGQLTAGMGQNVKFI 141

Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYK 183
           +     E S+ +    + EWK    PF+KGCSFY+
Sbjct: 142 L-----EHSNKLMRILYAEWKTIQIPFTKGCSFYE 171


>gi|397568380|gb|EJK46109.1| hypothetical protein THAOC_35242 [Thalassiosira oceanica]
          Length = 607

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 56/113 (49%), Gaps = 7/113 (6%)

Query: 83  DGGGAVV------VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKK 136
           DG  AVV      V R++   N RD+     L  +DC   DL +   F GR        +
Sbjct: 216 DGSSAVVDTNLSVVERYFEAWNKRDMKEAVSLFTEDCNMRDLQYDSEFKGRAEFERHLLR 275

Query: 137 FSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNG 189
             D +    +FV+DD++   + A GV WH+E  G P  F++GCSFY ++  +G
Sbjct: 276 VKDCLPGSFEFVVDDVALSPTKA-GVVWHVENDGSPLAFTRGCSFYTIDQRSG 327


>gi|219114851|ref|XP_002178221.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409956|gb|EEC49886.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 627

 Score = 70.1 bits (170), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 1/94 (1%)

Query: 92  RFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDD 151
           +++A  N RD++S   L AD   Y+D  FP PF G+        K S++  S   F +D 
Sbjct: 244 KYFAAWNRRDMSSAVALFADTVTYDDTAFPEPFSGKTNLSSHLYKCSNAFPSTFTFQVDK 303

Query: 152 ISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLE 185
           + A+    + V WH+E  G   PF++GCSFY ++
Sbjct: 304 V-ADAGDRISVLWHVENDGDDLPFTRGCSFYNVD 336



 Score = 45.8 bits (107), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 52/100 (52%), Gaps = 4/100 (4%)

Query: 87  AVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSD-- 144
           A  V+ F +  N RD  ++E ++  D  ++D  +P+P  G    L+   + + +   D  
Sbjct: 80  AQTVQNFISAYNERDYNTLENMVDPDIEFDDTAYPKPCRGL-PELERRWRLTRNAQGDKR 138

Query: 145 LQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKL 184
           +Q  +DDI A  ++ VG+ +HLE      P  +G +F++L
Sbjct: 139 IQVAVDDI-ASSTTTVGIRFHLENVEGEIPNGRGAAFFQL 177


>gi|449018192|dbj|BAM81594.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 271

 Score = 68.9 bits (167), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 37/99 (37%), Positives = 56/99 (56%), Gaps = 3/99 (3%)

Query: 87  AVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQ 146
           A VV R++   N R++A +  L A D  YED ++PR   GR+A  +   K +  +   L+
Sbjct: 124 ARVVFRYFDQWNKREIAGIIPLFAPDVFYEDALYPRALHGREALAEHLTKVARVLPPQLE 183

Query: 147 FVIDDISAE-DSSAVGVTWHLEWK--GKPFPFSKGCSFY 182
           FVIDD+  + +     V +HLE +  GK  PFS+G SF+
Sbjct: 184 FVIDDVCEDLERGKCAVRFHLEDRRTGKQVPFSRGTSFF 222


>gi|428180781|gb|EKX49647.1| hypothetical protein GUITHDRAFT_151421 [Guillardia theta CCMP2712]
          Length = 170

 Score = 68.2 bits (165), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 49/80 (61%), Gaps = 3/80 (3%)

Query: 107 ELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHL 166
           +  ADD VYED+++  PF+G+ A  +F  K  +    D  FV+D +S +   + G+TWH+
Sbjct: 51  DYFADDVVYEDMVYSEPFVGKDAVREFLLKTKEMAPPDFVFVVDRVS-DGVRSCGLTWHI 109

Query: 167 EWKGKP--FPFSKGCSFYKL 184
           E K +P    F+ GCSFY+L
Sbjct: 110 ELKTRPDAGKFANGCSFYEL 129


>gi|9758055|dbj|BAB08518.1| unnamed protein product [Arabidopsis thaliana]
          Length = 158

 Score = 67.0 bits (162), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 47/81 (58%)

Query: 86  GAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDL 145
           G   V +FY+ IN ++   +   I+ DC  +D  FP+PF G++  ++FF++   S+  ++
Sbjct: 75  GYDAVMKFYSSINEKNQDQLSSCISSDCFIDDFSFPKPFRGKQEAMEFFEELVKSMGQNV 134

Query: 146 QFVIDDISAEDSSAVGVTWHL 166
           +F ++++   D  +  V WHL
Sbjct: 135 KFCVENVCEGDGHSAAVNWHL 155


>gi|323451680|gb|EGB07556.1| hypothetical protein AURANDRAFT_64644 [Aureococcus anophagefferens]
          Length = 235

 Score = 66.6 bits (161), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 53/101 (52%), Gaps = 9/101 (8%)

Query: 90  VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFL--GRKA---TLDFFKKFSDSI--- 141
           +R  Y   N RD   V  L+A+D VYEDL+     +  GR++    L F   F  +    
Sbjct: 84  IRGLYKAFNDRDAPRVASLLAEDVVYEDLLLGASTICRGRESFSQALAFHPAFLSAQLGL 143

Query: 142 -SSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSF 181
               L+ V+DD++ +   +VGV WH+E  G+PFP S+G S 
Sbjct: 144 PMGRLELVVDDVACDGERSVGVEWHVELNGEPFPLSRGLSL 184


>gi|147816094|emb|CAN72893.1| hypothetical protein VITISV_022313 [Vitis vinifera]
          Length = 267

 Score = 65.9 bits (159), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 3/111 (2%)

Query: 82  DDGGGAVVVRRFYAGINGRDLASVEELIADDC-VYEDLI-FPRPFLGRKATLDFFKKFSD 139
           +D      V++ Y  I  +++  V ++I D+C  + + I   +PF G+K  L+FF     
Sbjct: 90  EDNRALEAVQKLYTAIKNKNVKEVSDVIGDECRCFCNFISASQPFHGKKQALEFFDHLM- 148

Query: 140 SISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
            I  D   +    +  D  AVGVTW LEWK  P P   G S+Y  +   G+
Sbjct: 149 KILGDHASIRFSFTMHDGMAVGVTWRLEWKNNPVPLGTGFSYYVCQEYRGR 199


>gi|255559074|ref|XP_002520559.1| conserved hypothetical protein [Ricinus communis]
 gi|223540219|gb|EEF41792.1| conserved hypothetical protein [Ricinus communis]
          Length = 254

 Score = 65.9 bits (159), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 54/114 (47%), Gaps = 3/114 (2%)

Query: 82  DDGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFP--RPFLGRKATLDFFKKFSD 139
           +D      V + Y  I  +++  V  +I D+C      F     F G++  LDFF     
Sbjct: 78  EDTPALETVLKLYTAIKNQNIHEVSNMIGDECRCVCNFFSSFESFQGKQQVLDFFNYVMQ 137

Query: 140 SISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
           ++ ++++FV+   +  D   VGV+W LEW     P  KG SFY  ++  GK  I
Sbjct: 138 TLGNNIEFVVQP-TKYDGMNVGVSWRLEWSKTHMPLGKGFSFYICQIYQGKVTI 190


>gi|224066735|ref|XP_002302189.1| predicted protein [Populus trichocarpa]
 gi|222843915|gb|EEE81462.1| predicted protein [Populus trichocarpa]
          Length = 203

 Score = 64.7 bits (156), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 61/106 (57%), Gaps = 4/106 (3%)

Query: 82  DDGGGAVVVRRFYAGINGR-DLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDS 140
           DD G  +++ RF+  IN R D   +E++++ DCV++D IF   F G ++ + F +K   +
Sbjct: 94  DDIG--ILIHRFFYAINSRNDEQLLEDVLSYDCVFKDFIFQIAFDGEQSIIQFLRKVMMA 151

Query: 141 ISSDLQFVIDDISAEDS-SAVGVTWHLEWKGKPFPFSKGCSFYKLE 185
           +  +++F I+ +  ++   A     HLEW  +  PF++ C+ ++ E
Sbjct: 152 MGPNIRFKIESVQCKNELQAATAFLHLEWDNQVIPFTRFCTDFECE 197


>gi|87241143|gb|ABD33001.1| hypothetical protein MtrDRAFT_AC150798g6v2 [Medicago truncatula]
          Length = 267

 Score = 60.5 bits (145), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 34/116 (29%), Positives = 54/116 (46%), Gaps = 4/116 (3%)

Query: 81  GDDGGGAV-VVRRFYAGINGRDLASVEELIADDC--VYEDLIFPRPFLGRKATLDFFKKF 137
           G+D   A+  V + Y+    +++  + E++AD+C  V   L F + F G+   L+FF   
Sbjct: 88  GEDNDQALDAVMKLYSAFKNKNIQELSEILADECRCVCNFLSFFQAFQGKTQVLEFFSHL 147

Query: 138 SDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
                 ++Q V+   +  D   VGV W  EW     P  KG SF+  +   GK  I
Sbjct: 148 IRLFGDNIQIVVKP-TLHDGMNVGVHWKFEWNTIHIPLGKGFSFHICQTYRGKAVI 202


>gi|357447893|ref|XP_003594222.1| hypothetical protein MTR_2g025760 [Medicago truncatula]
 gi|355483270|gb|AES64473.1| hypothetical protein MTR_2g025760 [Medicago truncatula]
          Length = 251

 Score = 60.5 bits (145), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 34/116 (29%), Positives = 54/116 (46%), Gaps = 4/116 (3%)

Query: 81  GDDGGGAV-VVRRFYAGINGRDLASVEELIADDC--VYEDLIFPRPFLGRKATLDFFKKF 137
           G+D   A+  V + Y+    +++  + E++AD+C  V   L F + F G+   L+FF   
Sbjct: 72  GEDNDQALDAVMKLYSAFKNKNIQELSEILADECRCVCNFLSFFQAFQGKTQVLEFFSHL 131

Query: 138 SDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
                 ++Q V+   +  D   VGV W  EW     P  KG SF+  +   GK  I
Sbjct: 132 IRLFGDNIQIVVKP-TLHDGMNVGVHWKFEWNTIHIPLGKGFSFHICQTYRGKAVI 186


>gi|159485316|ref|XP_001700691.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158272033|gb|EDO97840.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 213

 Score = 49.3 bits (116), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 44/91 (48%)

Query: 74  PTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDF 133
           P    V G   G A    RF + +N RDL+ + +L++DDC + DL        ++    F
Sbjct: 27  PPRRLVMGQPIGRAAACCRFISALNARDLSGMLQLVSDDCHHVDLSHEAEGHCKEDVARF 86

Query: 134 FKKFSDSISSDLQFVIDDISAEDSSAVGVTW 164
           +     S+   +Q V+DDI++ D S  G  +
Sbjct: 87  YADVVASMPEKVQVVVDDITSGDDSRAGAIY 117


>gi|323450797|gb|EGB06676.1| hypothetical protein AURANDRAFT_65293 [Aureococcus anophagefferens]
          Length = 1937

 Score = 44.3 bits (103), Expect = 0.024,   Method: Composition-based stats.
 Identities = 41/122 (33%), Positives = 56/122 (45%), Gaps = 19/122 (15%)

Query: 76   TSNVDGDDG---------GGAVVVRRFY---AGINGRDLASVEELIADDCVYEDLIFPRP 123
            T+NV+ +D           GAV V  +    A   G   A   EL A+D  YED  +P P
Sbjct: 1781 TANVEKEDTPATYEPRAPTGAVAVADYLWNEAYPKGATPAVALELFAEDIRYEDFNYPAP 1840

Query: 124  FLGRKATLDFFKKFSDSISSDLQFVIDDISAE--DSSAVGVTWHLEWKGKPFPFSKGCSF 181
            FLG+ A  +F + F       + FV  +IS    D  A   TW ++  G+  P  KG SF
Sbjct: 1841 FLGKPAVKEFVEAFD---IPGIDFVPLEISGGDVDGGACCFTWIVKVNGQDGP--KGISF 1895

Query: 182  YK 183
            Y+
Sbjct: 1896 YE 1897


>gi|42407773|dbj|BAD08919.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|42408132|dbj|BAD09271.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|125562081|gb|EAZ07529.1| hypothetical protein OsI_29785 [Oryza sativa Indica Group]
 gi|125603929|gb|EAZ43254.1| hypothetical protein OsJ_27852 [Oryza sativa Japonica Group]
          Length = 172

 Score = 44.3 bits (103), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 46/111 (41%), Gaps = 20/111 (18%)

Query: 90  VRRFY---AGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLD--------FFKKFS 138
           VRR     + +  R +  + EL  D+C+Y        F GR +++D        F    +
Sbjct: 66  VRRLVKLNSAVQNRSVRELLELAGDECLY--------FFGRISSIDVSQVSKNMFLLLHA 117

Query: 139 DSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNG 189
             +   + FV+     E    +GV W LEWKGK  P+   C+     V  G
Sbjct: 118 MMLRHHVSFVLKPTENEGFD-LGVKWSLEWKGKKLPWDLDCNITTNHVYRG 167


>gi|413925260|gb|AFW65192.1| hypothetical protein ZEAMMB73_842673 [Zea mays]
          Length = 335

 Score = 43.5 bits (101), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 19/108 (17%)

Query: 92  RFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDS--------ISS 143
           +  + I  R +  + EL+ D+C+Y        FLG   ++D  +   D         +  
Sbjct: 82  KLNSAIQNRSVRELLELLGDECLY--------FLGNLRSVDVPRLGKDMFLLLHALMVRH 133

Query: 144 DLQFVIDDISAEDSSA--VGVTWHLEWKGKPFPFSKGCSFYKLEVVNG 189
            + FV+   S +D +   +GV W LEWKGK  P+   C+     V  G
Sbjct: 134 HVSFVLKP-SPDDEAGFDLGVKWSLEWKGKKLPWDLDCNVSTTHVYRG 180


>gi|356532339|ref|XP_003534731.1| PREDICTED: uncharacterized protein LOC100812303 [Glycine max]
          Length = 399

 Score = 43.1 bits (100), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 32/119 (26%), Positives = 51/119 (42%), Gaps = 7/119 (5%)

Query: 75  TTSNVDGDDGG-GAVVVRRFYAGINGRDLASVEELIADDC--VYEDLIFPRPFLGRKATL 131
             S  +G+DG  G   V + Y+    +    + EL AD+   V   L F   F GR   L
Sbjct: 21  NNSESEGEDGHHGLDSVMKLYSAFKNK---KIHELSADERRRVSNFLSFFETFQGRTQVL 77

Query: 132 DFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
           +FF   +  + +++Q +      E  +A G+ W  EW     P  KG S +     +G+
Sbjct: 78  EFFSYLTSILGNNIQIIFKPTPHEGVNA-GLQWKFEWDKIHLPLWKGFSLHISHTYHGR 135


>gi|357021815|ref|ZP_09084046.1| hypothetical protein KEK_17413 [Mycobacterium thermoresistibile
           ATCC 19527]
 gi|356479563|gb|EHI12700.1| hypothetical protein KEK_17413 [Mycobacterium thermoresistibile
           ATCC 19527]
          Length = 146

 Score = 42.7 bits (99), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFP----RPFLGRKATLDFFKKFSDS-ISS 143
           VVR +  GIN  D   + EL+A D V+E L  P    + + GR A LDF ++ +++ I+ 
Sbjct: 12  VVRAYLNGINTWDFDGMRELMAPDFVFEQLFAPPGMQKRYEGRDALLDFQRRLAETVITE 71

Query: 144 DLQFVIDDISAEDSSAVGVTWHLEWK 169
           +L     D  A D   V VT+  + K
Sbjct: 72  NLHDFEFDTLASDPGVVIVTYRSDMK 97


>gi|242080039|ref|XP_002444788.1| hypothetical protein SORBIDRAFT_07g028065 [Sorghum bicolor]
 gi|241941138|gb|EES14283.1| hypothetical protein SORBIDRAFT_07g028065 [Sorghum bicolor]
          Length = 181

 Score = 42.7 bits (99), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 25/97 (25%), Positives = 41/97 (42%), Gaps = 16/97 (16%)

Query: 92  RFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLD--------FFKKFSDSISS 143
           +  + I  R +  + EL+ D+C+Y        F G   ++D        F    +  +  
Sbjct: 93  KLNSAIQNRSVRELLELLGDECLY--------FFGNLRSVDVPQLGKDMFLLLHALMVRH 144

Query: 144 DLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCS 180
            + FV+     E    +GV W LEWKGK  P+   C+
Sbjct: 145 HVSFVLKPSPDEAGFDLGVKWSLEWKGKKLPWDLDCN 181


>gi|403729559|ref|ZP_10948583.1| hypothetical protein GORHZ_197_00710 [Gordonia rhizosphera NBRC
           16068]
 gi|403202973|dbj|GAB92914.1| hypothetical protein GORHZ_197_00710 [Gordonia rhizosphera NBRC
           16068]
          Length = 117

 Score = 41.6 bits (96), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 46/100 (46%), Gaps = 4/100 (4%)

Query: 87  AVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPF--LGRKATLDFFKKFSDSISSD 144
           A VV RF    +  DL  + EL+ DD V+E    P     +GR A L  F +F  S ++D
Sbjct: 6   ASVVERFNDAFDRADLTRLAELLTDDVVFESTAPPAGIRHIGRPAVLAAFAEFFAS-TTD 64

Query: 145 LQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKL 184
             F  +++ A     V V W   W G+   F +G   + +
Sbjct: 65  PDFQTEEMIAAHDRVV-VRWRFTWGGEHPGFVRGVDIFTV 103


>gi|356557961|ref|XP_003547278.1| PREDICTED: uncharacterized protein LOC100808156 [Glycine max]
          Length = 262

 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 29/119 (24%), Positives = 48/119 (40%), Gaps = 6/119 (5%)

Query: 77  SNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDC--VYEDLIFPRPFLGRKATLDFF 134
           S  + +D      V + Y+    ++     EL AD+   V   L F   F GR   L+FF
Sbjct: 81  SESEAEDDHALDSVMKLYSAFKNKN---THELSADERRRVSNFLSFFETFQGRTQVLEFF 137

Query: 135 KKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
              +  + +++Q +      E  + VG+ W  EW     P  KG   +     +G+  I
Sbjct: 138 SYLTSILGNNIQIIFKPTPHEGVN-VGLQWKFEWNKIHLPLWKGFGLHISHTYHGRAVI 195


>gi|389681152|ref|ZP_10172497.1| hypothetical protein PchlO6_4193 [Pseudomonas chlororaphis O6]
 gi|388554688|gb|EIM17936.1| hypothetical protein PchlO6_4193 [Pseudomonas chlororaphis O6]
          Length = 159

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 34/64 (53%), Gaps = 1/64 (1%)

Query: 90  VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVI 149
           ++R+   +N  DL  V  L ADD V+     P P +GR A    +++  D I  D++F I
Sbjct: 38  LKRYETALNTSDLDQVMTLYADDAVFMPQNSP-PAVGRDAVHAAYRQVFDLIRLDVRFTI 96

Query: 150 DDIS 153
           D+I 
Sbjct: 97  DEIQ 100


>gi|406030550|ref|YP_006729442.1| RNA polymerase sigma factor [Mycobacterium indicus pranii MTCC
           9506]
 gi|405129097|gb|AFS14352.1| RNA polymerase sigma factor [Mycobacterium indicus pranii MTCC
           9506]
          Length = 323

 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 53/117 (45%), Gaps = 19/117 (16%)

Query: 2   LMTSSISSSSL----RTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRI 57
           L+  S+SS+ L    R SP+  +  +    F+ S  TI+     +P   K+LA  ++ R+
Sbjct: 106 LLAESVSSALLVVLDRLSPAQRVAFVLHDVFAMSFETIAELLHRSPDAAKKLASRARGRL 165

Query: 58  SSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCV 114
                   A   A P  T  +++        VV  F A   G D+AS+ EL+A D V
Sbjct: 166 Q-------AAPAAQPRSTAGHLE--------VVEAFLAASRGGDIASLLELLAPDVV 207


>gi|379761768|ref|YP_005348165.1| hypothetical protein OCQ_23320 [Mycobacterium intracellulare
           MOTT-64]
 gi|378809710|gb|AFC53844.1| hypothetical protein OCQ_23320 [Mycobacterium intracellulare
           MOTT-64]
          Length = 323

 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 53/117 (45%), Gaps = 19/117 (16%)

Query: 2   LMTSSISSSSL----RTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRI 57
           L+  S+SS+ L    R SP+  +  +    F+ S  TI+     +P   K+LA  ++ R+
Sbjct: 106 LLAESVSSALLVVLDRLSPAQRVAFVLHDVFAMSFETIAELLHRSPDAAKKLASRARGRL 165

Query: 58  SSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCV 114
                   A   A P  T  +++        VV  F A   G D+AS+ EL+A D V
Sbjct: 166 Q-------AAPAAQPRSTAGHLE--------VVEAFLAASRGGDIASLLELLAPDVV 207


>gi|379754489|ref|YP_005343161.1| hypothetical protein OCO_24770 [Mycobacterium intracellulare
           MOTT-02]
 gi|378804705|gb|AFC48840.1| hypothetical protein OCO_24770 [Mycobacterium intracellulare
           MOTT-02]
          Length = 323

 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 53/117 (45%), Gaps = 19/117 (16%)

Query: 2   LMTSSISSSSL----RTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRI 57
           L+  S+SS+ L    R SP+  +  +    F+ S  TI+     +P   K+LA  ++ R+
Sbjct: 106 LLAESVSSALLVVLDRLSPAQRVAFVLHDVFAMSFETIAELLHRSPDAAKKLASRARGRL 165

Query: 58  SSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCV 114
                   A   A P  T  +++        VV  F A   G D+AS+ EL+A D V
Sbjct: 166 Q-------AAPAAQPRSTAGHLE--------VVEAFLAASRGGDIASLLELLAPDVV 207


>gi|402077688|gb|EJT73037.1| hypothetical protein GGTG_09888 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 1271

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/104 (26%), Positives = 49/104 (47%), Gaps = 6/104 (5%)

Query: 20  LPCLNQTTFSYSNSTISSHFISTPTTRKRL-APLSKLRISSSENN-----RTAVDVASPL 73
            P + Q    Y  S I+S   ++P  ++ L +P+++L+  + E       RT ++ A   
Sbjct: 512 FPDIRQLVDGYRKSPITSISPASPALQRGLVSPITQLQQHAREEGDPSIMRTMIEDADEN 571

Query: 74  PTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYED 117
           P   + DG+D G   +     +GIN RD  S+ E+      Y+D
Sbjct: 572 PNFPDGDGEDIGSLTMEEELSSGINVRDSESLLEVDVQKHAYQD 615


>gi|221068339|ref|ZP_03544444.1| Steroid Delta-isomerase [Comamonas testosteroni KF-1]
 gi|220713362|gb|EED68730.1| Steroid Delta-isomerase [Comamonas testosteroni KF-1]
          Length = 125

 Score = 37.7 bits (86), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 27/113 (23%), Positives = 49/113 (43%), Gaps = 9/113 (7%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           VV+R+ A +N  DL  +  L ADD   ED +   P  GR+A   F+   S  +   ++  
Sbjct: 10  VVQRYVAALNAGDLDGIVALFADDATVEDPVGSEPRRGREAIRAFYAN-SLKLPLAVELT 68

Query: 149 IDDISAEDSSAVGVTWHLEWKGKP--------FPFSKGCSFYKLEVVNGKRQI 193
            +  +  D +A       E++G+         F F++      +  + G++ I
Sbjct: 69  QEVRAVADEAAFAFIVSFEYQGRKTVIAPIDHFRFNEAGKVVSMRALFGEKNI 121


>gi|427738310|ref|YP_007057854.1| SnoaL-like polyketide cyclase [Rivularia sp. PCC 7116]
 gi|427373351|gb|AFY57307.1| SnoaL-like polyketide cyclase [Rivularia sp. PCC 7116]
          Length = 166

 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 38/79 (48%), Gaps = 3/79 (3%)

Query: 91  RRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVID 150
           R+ Y G N R+L   +E+IA D      +   P +GR+A  ++  +F       L  V D
Sbjct: 18  RQIYKGFNERNLDLWDEVIASDVEVRSTVGTMPLIGREALKNWAAQFQSGFRPRLDLV-D 76

Query: 151 DISAEDSSAVGVTWHLEWK 169
           +I   + + + V  +L WK
Sbjct: 77  EIYGVNRATIAV--NLNWK 93


>gi|433646696|ref|YP_007291698.1| putative ester cyclase [Mycobacterium smegmatis JS623]
 gi|433296473|gb|AGB22293.1| putative ester cyclase [Mycobacterium smegmatis JS623]
          Length = 163

 Score = 37.4 bits (85), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 38/83 (45%), Gaps = 5/83 (6%)

Query: 92  RFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQF-VID 150
            ++A +N RD  ++  ++A D V+E      P  GR   +   + F  +   DL F V D
Sbjct: 33  EYFAAVNRRDFETLTNILAPDVVFEH---ESPIEGRDKLIAMLQAFCTAF-PDLTFSVTD 88

Query: 151 DISAEDSSAVGVTWHLEWKGKPF 173
            I+ ED   V       W+G+ F
Sbjct: 89  QIAHEDEVVVFFVGSGTWRGEFF 111


>gi|186682424|ref|YP_001865620.1| hypothetical protein Npun_R2054 [Nostoc punctiforme PCC 73102]
 gi|186464876|gb|ACC80677.1| protein of unknown function DUF1486 [Nostoc punctiforme PCC 73102]
          Length = 298

 Score = 37.0 bits (84), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 32/122 (26%), Positives = 50/122 (40%), Gaps = 17/122 (13%)

Query: 83  DGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSIS 142
           D   A+V R FY   N  ++ ++EE+   D ++       PF G +          DS  
Sbjct: 4   DANKAIVRRYFYEVFNEGNITTLEEICGPDFIFTLPTHAEPFRGVEGYKGLVNMLIDSFP 63

Query: 143 SDLQFVIDDISAE-DSSAVGVTWHLEWKGKPFP-------------FSKGCSFYKLEVVN 188
             + F ++D+ AE D      T      GKPFP               +G S+++  +VN
Sbjct: 64  G-IHFAMEDVLAEGDMVLTRWTARGNHTGKPFPTVIGDIPAQGKSFLLEGMSWHR--IVN 120

Query: 189 GK 190
           GK
Sbjct: 121 GK 122


>gi|387875778|ref|YP_006306082.1| hypothetical protein W7S_11950 [Mycobacterium sp. MOTT36Y]
 gi|443305484|ref|ZP_21035272.1| hypothetical protein W7U_07420 [Mycobacterium sp. H4Y]
 gi|386789236|gb|AFJ35355.1| hypothetical protein W7S_11950 [Mycobacterium sp. MOTT36Y]
 gi|442767048|gb|ELR85042.1| hypothetical protein W7U_07420 [Mycobacterium sp. H4Y]
          Length = 323

 Score = 37.0 bits (84), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 53/117 (45%), Gaps = 19/117 (16%)

Query: 2   LMTSSISSSSL----RTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRI 57
           L+  S+SS+ L    R SP+  +  +    F+    TI++    +P   K+LA  ++ R+
Sbjct: 106 LLAESVSSALLVVLDRLSPAQRVAFVLHDVFAMPFETIAALLHRSPDAAKKLASRARGRL 165

Query: 58  SSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCV 114
                   A   A P  T  +++        VV  F A   G D+AS+ EL+A D V
Sbjct: 166 Q-------AAPAAQPRSTAGHLE--------VVEAFLAASRGGDIASLLELLAPDVV 207


>gi|300790982|ref|YP_003771273.1| hypothetical protein AMED_9182 [Amycolatopsis mediterranei U32]
 gi|384154524|ref|YP_005537340.1| hypothetical protein RAM_47090 [Amycolatopsis mediterranei S699]
 gi|399542860|ref|YP_006555522.1| hypothetical protein AMES_9045 [Amycolatopsis mediterranei S699]
 gi|299800496|gb|ADJ50871.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
 gi|340532678|gb|AEK47883.1| hypothetical protein RAM_47090 [Amycolatopsis mediterranei S699]
 gi|398323630|gb|AFO82577.1| hypothetical protein AMES_9045 [Amycolatopsis mediterranei S699]
          Length = 130

 Score = 37.0 bits (84), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 3/79 (3%)

Query: 87  AVVVRRFYAGINGRDLASVEELIADDCVYE---DLIFPRPFLGRKATLDFFKKFSDSISS 143
           A ++RR Y   +  D+A++ EL+A D V       +F     GR A L  + + ++    
Sbjct: 7   ATLIRRGYEAFSAGDVATLSELLAPDAVQHMPGHNVFSGDHKGRDAILAMYGQLAERSGG 66

Query: 144 DLQFVIDDISAEDSSAVGV 162
            L+  ++++ A D   V V
Sbjct: 67  TLKIELEEVYANDEEVVTV 85


>gi|295836440|ref|ZP_06823373.1| membrane protein [Streptomyces sp. SPB74]
 gi|197698629|gb|EDY45562.1| membrane protein [Streptomyces sp. SPB74]
          Length = 133

 Score = 37.0 bits (84), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 16/43 (37%), Positives = 26/43 (60%)

Query: 88  VVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKAT 130
           +V+R F   +  RDL +VE L+A+D V+   +  +P+ GR  T
Sbjct: 2   IVMRAFREAVEARDLDAVEALLAEDVVFTSPVVHKPYRGRAIT 44


>gi|254822137|ref|ZP_05227138.1| hypothetical protein MintA_19539 [Mycobacterium intracellulare ATCC
           13950]
          Length = 317

 Score = 36.6 bits (83), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 52/117 (44%), Gaps = 19/117 (16%)

Query: 2   LMTSSISSSSL----RTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRI 57
           L+  S+SS+ L    R SP+  +  +    F+    TI+     +P   K+LA  ++ R+
Sbjct: 100 LLAESVSSALLVVLDRLSPAQRVAFVLHDVFAMPFETIAELLHRSPDAAKKLASRARGRL 159

Query: 58  SSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCV 114
                   A   A P  T  +++        VV  F A   G D+AS+ EL+A D V
Sbjct: 160 Q-------AAPAAQPRSTAGHLE--------VVEAFLAASRGGDIASLLELLAPDVV 201


>gi|379747183|ref|YP_005338004.1| hypothetical protein OCU_24640 [Mycobacterium intracellulare ATCC
           13950]
 gi|378799547|gb|AFC43683.1| hypothetical protein OCU_24640 [Mycobacterium intracellulare ATCC
           13950]
          Length = 323

 Score = 36.6 bits (83), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 52/117 (44%), Gaps = 19/117 (16%)

Query: 2   LMTSSISSSSL----RTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRI 57
           L+  S+SS+ L    R SP+  +  +    F+    TI+     +P   K+LA  ++ R+
Sbjct: 106 LLAESVSSALLVVLDRLSPAQRVAFVLHDVFAMPFETIAELLHRSPDAAKKLASRARGRL 165

Query: 58  SSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCV 114
                   A   A P  T  +++        VV  F A   G D+AS+ EL+A D V
Sbjct: 166 Q-------AAPAAQPRSTAGHLE--------VVEAFLAASRGGDIASLLELLAPDVV 207


>gi|358009461|pdb|3MHE|A Chain A, Crystal Structure Of Ketosteroid Isomerase P39a From
           Pseudomonas Testosteroni (Tksi)
 gi|358009462|pdb|3MHE|B Chain B, Crystal Structure Of Ketosteroid Isomerase P39a From
           Pseudomonas Testosteroni (Tksi)
          Length = 125

 Score = 36.6 bits (83), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 22/83 (26%), Positives = 39/83 (46%), Gaps = 1/83 (1%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           VV+R+ A +N  DL  +  L ADD   ED +   P  G  A  +F+   S  +   ++  
Sbjct: 10  VVQRYVAALNAGDLDGIVALFADDATVEDAVGSEPRSGTAAIREFYAN-SLKLPLAVELT 68

Query: 149 IDDISAEDSSAVGVTWHLEWKGK 171
            +  +  + +A   T   E++G+
Sbjct: 69  QEVRAVANEAAFAFTVSFEYQGR 91


>gi|6729912|pdb|1BUQ|A Chain A, Solution Structure Of Delta-5-3-Ketosteroid Isomerase
           Complexed With The Steroid
           19-Nortestosterone-Hemisuccinate
 gi|6729913|pdb|1BUQ|B Chain B, Solution Structure Of Delta-5-3-Ketosteroid Isomerase
           Complexed With The Steroid
           19-Nortestosterone-Hemisuccinate
          Length = 125

 Score = 36.6 bits (83), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 39/83 (46%), Gaps = 1/83 (1%)

Query: 89  VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
           VV+R+ A +N  DL  +  L ADD   ED +   P  G  A  +FF   S  +   ++  
Sbjct: 10  VVQRYVAALNAGDLDGIVALFADDATVEDPVGSEPRSGTAAIREFFAN-SLKLPLAVELT 68

Query: 149 IDDISAEDSSAVGVTWHLEWKGK 171
            +  +  + +A   T   E++G+
Sbjct: 69  QEVRAVANEAAFAFTVSFEFQGR 91


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.316    0.131    0.380 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,970,543,971
Number of Sequences: 23463169
Number of extensions: 121793611
Number of successful extensions: 360653
Number of sequences better than 100.0: 152
Number of HSP's better than 100.0 without gapping: 81
Number of HSP's successfully gapped in prelim test: 71
Number of HSP's that attempted gapping in prelim test: 360299
Number of HSP's gapped (non-prelim): 358
length of query: 194
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 59
effective length of database: 9,191,667,552
effective search space: 542308385568
effective search space used: 542308385568
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 72 (32.3 bits)