BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 029353
(194 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225460141|ref|XP_002275923.1| PREDICTED: uncharacterized protein LOC100243104 isoform 1 [Vitis
vinifera]
gi|359495234|ref|XP_003634940.1| PREDICTED: uncharacterized protein LOC100243104 isoform 2 [Vitis
vinifera]
gi|297741015|emb|CBI31327.3| unnamed protein product [Vitis vinifera]
Length = 226
Score = 209 bits (531), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 113/197 (57%), Positives = 140/197 (71%), Gaps = 18/197 (9%)
Query: 5 SSISSSSLRTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRISSSENNR 64
S +++S+ PS L P L ++I++HF T++R + LR+SSS N
Sbjct: 2 SLLTNSTTLLRPSKLSPPL----LHPHTASIATHF-----TQRRSRRCNLLRVSSSPQNP 52
Query: 65 TAVDVASPLPTTSNVDGD-DGG-------GAVVVRRFYAGINGRDLASVEELIADDCVYE 116
TA +A P+ TT + + + DGG G+ VVR FYAGIN DL SVEELIA +CVYE
Sbjct: 53 TAT-IAEPVTTTQSDEANGDGGDESILLSGSSVVRAFYAGINSHDLDSVEELIAHNCVYE 111
Query: 117 DLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFS 176
DLIFP+PF+GRKA +DFF KF D+IS +LQFVIDDIS DSSAVGVTWHLEWKG+PFPFS
Sbjct: 112 DLIFPQPFVGRKAIMDFFNKFIDTISMELQFVIDDISEADSSAVGVTWHLEWKGRPFPFS 171
Query: 177 KGCSFYKLEVVNGKRQI 193
KGCSFY+LEV+NGKRQI
Sbjct: 172 KGCSFYRLEVLNGKRQI 188
>gi|255574237|ref|XP_002528033.1| protein transporter, putative [Ricinus communis]
gi|223532563|gb|EEF34351.1| protein transporter, putative [Ricinus communis]
Length = 218
Score = 195 bits (496), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 94/150 (62%), Positives = 113/150 (75%), Gaps = 5/150 (3%)
Query: 44 TTRKRLAPLSKLRISSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLA 103
+ RK + + +SSS+N +P P T + D + +VRRFY GIN DLA
Sbjct: 35 SKRKAWTLRTPVSVSSSDNQ-----TVTPSPATKDDAADTQLASDIVRRFYEGINDHDLA 89
Query: 104 SVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVT 163
SVE+LIA++CVYEDLIFP PF+GRKA L+FFKKF DSIS DLQFVIDDIS EDS AVGVT
Sbjct: 90 SVEDLIAENCVYEDLIFPHPFVGRKAILEFFKKFIDSISKDLQFVIDDISTEDSLAVGVT 149
Query: 164 WHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
WHLEWKG+PFPFSKGCSFY+L+V+N K+QI
Sbjct: 150 WHLEWKGRPFPFSKGCSFYRLQVLNSKKQI 179
>gi|224146286|ref|XP_002325950.1| predicted protein [Populus trichocarpa]
gi|222862825|gb|EEF00332.1| predicted protein [Populus trichocarpa]
Length = 164
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 94/138 (68%), Positives = 106/138 (76%), Gaps = 12/138 (8%)
Query: 56 RISSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVY 115
+ SSSEN V VA + + S+V VR FY GING DL SVEELIA++CVY
Sbjct: 1 KASSSENQ--TVTVAPSVESASDV----------VRSFYEGINGHDLDSVEELIAENCVY 48
Query: 116 EDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPF 175
EDLIFPRPF+GRKA L+FF KF D++S DLQFVIDDIS EDS AVGVTWHLEWKGK FPF
Sbjct: 49 EDLIFPRPFVGRKAILEFFNKFIDTVSKDLQFVIDDISNEDSFAVGVTWHLEWKGKSFPF 108
Query: 176 SKGCSFYKLEVVNGKRQI 193
SKGCSFY+L+ VNGKRQI
Sbjct: 109 SKGCSFYRLDAVNGKRQI 126
>gi|449450034|ref|XP_004142769.1| PREDICTED: uncharacterized protein LOC101216887 [Cucumis sativus]
Length = 218
Score = 185 bits (469), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 101/157 (64%), Positives = 118/157 (75%), Gaps = 3/157 (1%)
Query: 37 SHFISTPTTRKRLAPLSKLRISSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAG 96
SH ++P+ K AP S LR+SSS ++ AV V SP PT + +D VV R FY G
Sbjct: 27 SHSFTSPSIPKP-APYSPLRVSSSSSDNPAVTVPSP-PTDAPLDTLRSASNVV-RDFYDG 83
Query: 97 INGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAED 156
+N DLASVE LIA++CVYEDLIF RPF+GRK L FFKKF+DSIS DLQFVIDDIS ED
Sbjct: 84 VNRHDLASVEPLIAENCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTED 143
Query: 157 SSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
SSA+GV WHLEWKGK FPFSKGCSFY+L V+GK+QI
Sbjct: 144 SSALGVLWHLEWKGKEFPFSKGCSFYRLADVDGKKQI 180
>gi|356495768|ref|XP_003516745.1| PREDICTED: uncharacterized protein LOC100802494 [Glycine max]
Length = 221
Score = 184 bits (467), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 88/144 (61%), Positives = 109/144 (75%), Gaps = 4/144 (2%)
Query: 53 SKLRISSSENNRTAVDVASPLPTTSNVDGDD--GGGAVVVRRFYAGINGRDLASVEELIA 110
SK R+ + +V + TT++ D +D + VVR FY GIN D+ SVE L+A
Sbjct: 43 SKWRVVDGVGEKQSVTTLAA--TTTSFDDNDVVDSNSAVVRNFYGGINAHDVDSVEYLMA 100
Query: 111 DDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKG 170
++CVYEDL+FPRPF+GRK L+FFKKF++S S LQFVIDD+S EDSS+VGV WHLEWKG
Sbjct: 101 ENCVYEDLVFPRPFVGRKEILEFFKKFTNSTSKGLQFVIDDLSTEDSSSVGVIWHLEWKG 160
Query: 171 KPFPFSKGCSFYKLEVVNGKRQIT 194
KPFPFSKGCSFY+LEV+NGKRQIT
Sbjct: 161 KPFPFSKGCSFYRLEVINGKRQIT 184
>gi|449483825|ref|XP_004156703.1| PREDICTED: uncharacterized LOC101216887 [Cucumis sativus]
Length = 218
Score = 183 bits (464), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 100/157 (63%), Positives = 117/157 (74%), Gaps = 3/157 (1%)
Query: 37 SHFISTPTTRKRLAPLSKLRISSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAG 96
SH ++P+ K AP S LR+SSS ++ AV V SP PT + +D VV R FY G
Sbjct: 27 SHSFTSPSIPKP-APYSPLRVSSSSSDNPAVTVPSP-PTDAPLDTLRSASNVV-RDFYDG 83
Query: 97 INGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAED 156
+N DLASVE LIA++CVYEDLIF RPF+GRK L FFKKF+DSIS DLQFVIDDIS ED
Sbjct: 84 VNRHDLASVEPLIAENCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTED 143
Query: 157 SSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
SSA+GV WHLEWKGK FPFSKGC FY+L V+GK+QI
Sbjct: 144 SSALGVLWHLEWKGKEFPFSKGCRFYRLADVDGKKQI 180
>gi|356539020|ref|XP_003537998.1| PREDICTED: uncharacterized protein LOC100813246 [Glycine max]
Length = 227
Score = 182 bits (463), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 80/109 (73%), Positives = 95/109 (87%)
Query: 86 GAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDL 145
+ V++ FY GIN D+ SVE LIA++CVYEDL+FPRPF+GRK L+FFKKF++S S DL
Sbjct: 82 ASAVLKNFYGGINAHDVDSVEYLIAENCVYEDLVFPRPFVGRKEILEFFKKFTNSTSKDL 141
Query: 146 QFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQIT 194
QFVIDD+S EDSS+VGV WHLEWKGKPFPFSKGCSFY+LEV+NGKRQIT
Sbjct: 142 QFVIDDLSTEDSSSVGVIWHLEWKGKPFPFSKGCSFYRLEVINGKRQIT 190
>gi|294461593|gb|ADE76357.1| unknown [Picea sitchensis]
Length = 233
Score = 174 bits (441), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 73/105 (69%), Positives = 94/105 (89%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
+VR FY+ IN R+L V ELIA++CVY+DL+FP+PF+GRK +DFFKKF+D+ S+DLQFV
Sbjct: 91 IVREFYSRINSRELEFVGELIAENCVYDDLVFPQPFVGRKDIIDFFKKFTDATSTDLQFV 150
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
IDDI++EDSSAVG+TWHLEW+GKPFPFS+GCSFY+ E+ NGK+QI
Sbjct: 151 IDDITSEDSSAVGITWHLEWRGKPFPFSRGCSFYRCEIFNGKQQI 195
>gi|15217513|ref|NP_177304.1| nuclear transport factor 2-like protein [Arabidopsis thaliana]
gi|12323719|gb|AAG51815.1|AC016163_4 unknown protein; 43994-42987 [Arabidopsis thaliana]
gi|37202096|gb|AAQ89663.1| At1g71480 [Arabidopsis thaliana]
gi|51969762|dbj|BAD43573.1| unknown protein [Arabidopsis thaliana]
gi|51969880|dbj|BAD43632.1| unknown protein [Arabidopsis thaliana]
gi|51971487|dbj|BAD44408.1| unknown protein [Arabidopsis thaliana]
gi|51971841|dbj|BAD44585.1| unknown protein [Arabidopsis thaliana]
gi|51971939|dbj|BAD44634.1| unknown protein [Arabidopsis thaliana]
gi|222423003|dbj|BAH19485.1| AT1G71480 [Arabidopsis thaliana]
gi|332197085|gb|AEE35206.1| nuclear transport factor 2-like protein [Arabidopsis thaliana]
Length = 216
Score = 174 bits (440), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 79/105 (75%), Positives = 90/105 (85%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
VV FYA +N DL+SV +LIA DCVYEDL+F PF+GRKA LDFF KF +S S+DLQFV
Sbjct: 74 VVSSFYAAVNVHDLSSVTDLIAQDCVYEDLVFSSPFVGRKAILDFFGKFIESTSTDLQFV 133
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
IDDIS EDSSAVGV+WHLEWKGK FPFSKGCSFY+LEV++GKRQI
Sbjct: 134 IDDISTEDSSAVGVSWHLEWKGKNFPFSKGCSFYRLEVIDGKRQI 178
>gi|224135761|ref|XP_002327297.1| predicted protein [Populus trichocarpa]
gi|222835667|gb|EEE74102.1| predicted protein [Populus trichocarpa]
Length = 150
Score = 170 bits (431), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 74/105 (70%), Positives = 87/105 (82%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
V+RRFY G+N DL +VEE+IA++CVYEDLIFP P +G K ++FF++F DS SSDL FV
Sbjct: 8 VIRRFYDGLNSHDLVAVEEIIAENCVYEDLIFPHPCVGHKEVIEFFQQFFDSTSSDLHFV 67
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
IDDIS EDS AVGV WH+EWKGKP P SKGCSFY+LE VNGKRQI
Sbjct: 68 IDDISGEDSQAVGVIWHIEWKGKPMPCSKGCSFYRLETVNGKRQI 112
>gi|168009022|ref|XP_001757205.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691703|gb|EDQ78064.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 148
Score = 162 bits (410), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 70/109 (64%), Positives = 87/109 (79%)
Query: 86 GAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDL 145
AV+V+ FYA IN R++ S+ +L AD+CVYEDL+FP PF GR+A LDFFKKF DS+ S+L
Sbjct: 3 AAVMVQEFYAAINRREITSIGDLFADNCVYEDLVFPTPFTGRQAILDFFKKFMDSVGSEL 62
Query: 146 QFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQIT 194
+F IDDI+ D +A GV WHLEW+GKP PFSKGCSFY+ EV+NGKRQ
Sbjct: 63 EFRIDDITTGDPNAAGVIWHLEWRGKPLPFSKGCSFYRCEVLNGKRQFV 111
>gi|226509360|ref|NP_001141296.1| uncharacterized protein LOC100273387 [Zea mays]
gi|194703844|gb|ACF86006.1| unknown [Zea mays]
gi|414876798|tpg|DAA53929.1| TPA: hypothetical protein ZEAMMB73_536788 [Zea mays]
Length = 225
Score = 162 bits (409), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 70/96 (72%), Positives = 85/96 (88%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
VVR+FY G+N RDLA+VE LIA+ CVYEDL+FPRPF+GR+ + FF +F +IS DLQFV
Sbjct: 72 VVRQFYDGVNRRDLAAVEPLIAEGCVYEDLVFPRPFVGRERVVGFFGEFMGTISPDLQFV 131
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKL 184
IDDISAEDS+AVGVTWHLEW+G+PFPFS+GCSFY+L
Sbjct: 132 IDDISAEDSAAVGVTWHLEWRGRPFPFSRGCSFYRL 167
>gi|115478242|ref|NP_001062716.1| Os09g0266000 [Oryza sativa Japonica Group]
gi|51536247|dbj|BAD38416.1| nuclear transport factor 2 (NTF2)-like protein [Oryza sativa
Japonica Group]
gi|113630949|dbj|BAF24630.1| Os09g0266000 [Oryza sativa Japonica Group]
gi|215692578|dbj|BAG87998.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 331
Score = 155 bits (393), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 71/109 (65%), Positives = 88/109 (80%), Gaps = 1/109 (0%)
Query: 86 GAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDL 145
A VVR FY G+N RDLA+VE LIA+ CVYEDL+FP PF+GR L FF F S+SSDL
Sbjct: 67 AADVVRAFYDGVNRRDLAAVEPLIAEGCVYEDLVFPNPFVGRAEILGFFAGFMGSVSSDL 126
Query: 146 QFVIDDISA-EDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
+FVIDDISA +DS AVGVTWHL+WKG+PFPFS+GCSFY+L++ ++Q+
Sbjct: 127 RFVIDDISAGDDSRAVGVTWHLDWKGRPFPFSRGCSFYRLQLDEKQQQL 175
>gi|51536248|dbj|BAD38417.1| nuclear transport factor 2 (NTF2)-like protein [Oryza sativa
Japonica Group]
gi|125604939|gb|EAZ43975.1| hypothetical protein OsJ_28594 [Oryza sativa Japonica Group]
gi|215695383|dbj|BAG90574.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 215
Score = 155 bits (391), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 70/106 (66%), Positives = 87/106 (82%), Gaps = 1/106 (0%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
VVR FY G+N RDLA+VE LIA+ CVYEDL+FP PF+GR L FF F S+SSDL+FV
Sbjct: 70 VVRAFYDGVNRRDLAAVEPLIAEGCVYEDLVFPNPFVGRAEILGFFAGFMGSVSSDLRFV 129
Query: 149 IDDISA-EDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
IDDISA +DS AVGVTWHL+WKG+PFPFS+GCSFY+L++ ++Q+
Sbjct: 130 IDDISAGDDSRAVGVTWHLDWKGRPFPFSRGCSFYRLQLDEKQQQL 175
>gi|125562981|gb|EAZ08361.1| hypothetical protein OsI_30617 [Oryza sativa Indica Group]
Length = 215
Score = 154 bits (390), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 70/106 (66%), Positives = 87/106 (82%), Gaps = 1/106 (0%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
VVR FY G+N RDLA+VE LIA+ CVYEDL+FP PF+GR L FF F S+SSDL+FV
Sbjct: 70 VVRAFYDGLNRRDLAAVEPLIAEGCVYEDLVFPNPFVGRAEILGFFAGFMGSVSSDLRFV 129
Query: 149 IDDISA-EDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
IDDISA +DS AVGVTWHL+WKG+PFPFS+GCSFY+L++ ++Q+
Sbjct: 130 IDDISAGDDSRAVGVTWHLDWKGRPFPFSRGCSFYRLQLDEKQQQL 175
>gi|357167107|ref|XP_003581007.1| PREDICTED: uncharacterized protein LOC100842516 [Brachypodium
distachyon]
Length = 196
Score = 154 bits (388), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 70/104 (67%), Positives = 82/104 (78%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
VVR FY G+N RDLA+VE LIA+ CVYEDL+FPRP +GR L FF +F SIS DL+FV
Sbjct: 51 VVREFYDGVNRRDLAAVEPLIAEGCVYEDLVFPRPMVGRDQVLGFFGEFMGSISPDLRFV 110
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQ 192
IDDIS D SAVGVTWHLEWKG+PFPFS+GCSFY+ + K+Q
Sbjct: 111 IDDISGLDPSAVGVTWHLEWKGRPFPFSRGCSFYRCQPDPQKQQ 154
>gi|147794044|emb|CAN71166.1| hypothetical protein VITISV_035107 [Vitis vinifera]
Length = 203
Score = 151 bits (382), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/170 (51%), Positives = 113/170 (66%), Gaps = 18/170 (10%)
Query: 5 SSISSSSLRTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRISSSENNR 64
S +++S+ PS L P L ++I++HF T++R + LR+SSS N
Sbjct: 2 SLLTNSTTLLRPSKLSPPL----LHPHTASIATHF-----TQRRSRRCNLLRVSSSPQNP 52
Query: 65 TAVDVASPLPTTSNVDGD-DGG-------GAVVVRRFYAGINGRDLASVEELIADDCVYE 116
TA +A P+ TT + + + DGG G+ VVR FYAGIN DL SVEELIA +CVYE
Sbjct: 53 TAT-IAEPVTTTQSDEANGDGGDESILLSGSSVVRAFYAGINSHDLDSVEELIAHNCVYE 111
Query: 117 DLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHL 166
DLIFP+PF+GRKA +DFF KF D+IS +LQFVIDBIS DSSAVGVTWHL
Sbjct: 112 DLIFPQPFVGRKAIMDFFNKFIDTISMELQFVIDBISEADSSAVGVTWHL 161
>gi|326522692|dbj|BAJ88392.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 203
Score = 148 bits (374), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 63/95 (66%), Positives = 78/95 (82%)
Query: 91 RRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVID 150
R FY+G+N RDLA+V LIA+ CVYEDL+FPRP +GR + FF +F S+S DL+FVID
Sbjct: 60 REFYSGVNRRDLAAVAPLIAEGCVYEDLVFPRPMVGRDRVVGFFGEFMGSVSPDLRFVID 119
Query: 151 DISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLE 185
DIS +D SAVGVTWHLEWKG+PFPFS+GCSFY+ +
Sbjct: 120 DISGDDPSAVGVTWHLEWKGRPFPFSRGCSFYRCQ 154
>gi|242042631|ref|XP_002468710.1| hypothetical protein SORBIDRAFT_01g050630 [Sorghum bicolor]
gi|241922564|gb|EER95708.1| hypothetical protein SORBIDRAFT_01g050630 [Sorghum bicolor]
Length = 98
Score = 144 bits (363), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 63/88 (71%), Positives = 77/88 (87%)
Query: 97 INGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAED 156
+N R++A+VE LIA+ CVYEDL+FPRPF+GR+ + FF KF +IS DLQFVIDDISA+D
Sbjct: 1 MNRREVAAVEPLIAEGCVYEDLVFPRPFVGRERIIGFFGKFMGTISLDLQFVIDDISADD 60
Query: 157 SSAVGVTWHLEWKGKPFPFSKGCSFYKL 184
S AVGVTWHLEW+GKPFPFS+GCSFY+L
Sbjct: 61 SVAVGVTWHLEWRGKPFPFSRGCSFYRL 88
>gi|302811542|ref|XP_002987460.1| hypothetical protein SELMODRAFT_446976 [Selaginella moellendorffii]
gi|300144866|gb|EFJ11547.1| hypothetical protein SELMODRAFT_446976 [Selaginella moellendorffii]
Length = 181
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 59/114 (51%), Positives = 81/114 (71%), Gaps = 1/114 (0%)
Query: 81 GDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDS 140
G G VR+FY+ IN R L +LIA DCVYEDL+F +PF+GRKA ++ F+ F+ S
Sbjct: 25 GRLGAAKDTVRQFYSAINHRQLEIAGDLIASDCVYEDLVFSKPFVGRKAIIELFETFTSS 84
Query: 141 ISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQIT 194
+ ++ FVID+IS + + +VGVTWHL+W+G FPFSKGCSFY+ E V+G +I
Sbjct: 85 VGPEVSFVIDEIS-DGNLSVGVTWHLDWRGNIFPFSKGCSFYRCEDVSGDYKIV 137
>gi|302796609|ref|XP_002980066.1| hypothetical protein SELMODRAFT_58583 [Selaginella moellendorffii]
gi|300152293|gb|EFJ18936.1| hypothetical protein SELMODRAFT_58583 [Selaginella moellendorffii]
Length = 143
Score = 127 bits (318), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 57/105 (54%), Positives = 79/105 (75%), Gaps = 1/105 (0%)
Query: 90 VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVI 149
VR+FY+ IN R L +LIA DCVYEDL+F +PF+GRKA ++ F+ F+ S+ ++ FVI
Sbjct: 6 VRQFYSAINHRQLEIAGDLIASDCVYEDLVFSKPFVGRKAIIELFETFTSSVGPEVSFVI 65
Query: 150 DDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQIT 194
D+IS + + +VGVTWHL+W+G FPFSKGCSFY+ E V+G +I
Sbjct: 66 DEIS-DGNLSVGVTWHLDWRGNVFPFSKGCSFYRCEDVSGDYKIV 109
>gi|218246837|ref|YP_002372208.1| hypothetical protein PCC8801_2015 [Cyanothece sp. PCC 8801]
gi|257059878|ref|YP_003137766.1| hypothetical protein Cyan8802_2040 [Cyanothece sp. PCC 8802]
gi|218167315|gb|ACK66052.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
gi|256590044|gb|ACV00931.1| conserved hypothetical protein [Cyanothece sp. PCC 8802]
Length = 143
Score = 112 bits (280), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 51/96 (53%), Positives = 65/96 (67%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
V+ Y IN RD+ S E I D C+YEDL FP+PF+G++ + F + +I DLQFV
Sbjct: 8 VIESIYGAINRRDIDSAMEWIDDQCIYEDLNFPQPFIGKEKVKELFAESMQNIPDDLQFV 67
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKL 184
IDDI++EDS AVGV WHLE G FP S+G SFY+L
Sbjct: 68 IDDITSEDSLAVGVLWHLELNGMTFPNSRGASFYRL 103
>gi|302852119|ref|XP_002957581.1| hypothetical protein VOLCADRAFT_98696 [Volvox carteri f.
nagariensis]
gi|300257098|gb|EFJ41351.1| hypothetical protein VOLCADRAFT_98696 [Volvox carteri f.
nagariensis]
Length = 479
Score = 105 bits (263), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 63/178 (35%), Positives = 93/178 (52%), Gaps = 12/178 (6%)
Query: 19 LLPCLNQTTFSYS-NSTISSHFISTPTTRKRLAPLSKLRISSSENNRTAVDVASPLPTTS 77
+LPC + Y SS + T RL +++ +++ A P
Sbjct: 1 MLPCASYKQLLYRVKCKHSSSWTRTKEQSPRLWTYARIACYAAKTAE-----AVPPQAEG 55
Query: 78 NVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKF 137
V+G G +V R+Y N DL ++ L+A D Y D+I+ PF+GR L++ KK
Sbjct: 56 TVNGLTGKDVIV--RYYEAYNSGDLDTIASLLAPDVSYHDMIYEDPFVGRDEVLNYLKKV 113
Query: 138 SDSISSDLQFVIDDISAEDSSAVGVTWHLEWK-GKPFPFSKGCSFYKLEVVNGKRQIT 194
++ +DLQFVI+D++ D AVG+TWH+E G FPFS+GCSFY L N + QIT
Sbjct: 114 RRTVPADLQFVIEDVTDGDPRAVGITWHVECGDGVVFPFSRGCSFYTL---NPRGQIT 168
>gi|254414879|ref|ZP_05028643.1| hypothetical protein MC7420_1164 [Coleofasciculus chthonoplastes
PCC 7420]
gi|196178368|gb|EDX73368.1| hypothetical protein MC7420_1164 [Coleofasciculus chthonoplastes
PCC 7420]
Length = 391
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 47/103 (45%), Positives = 66/103 (64%)
Query: 88 VVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQF 147
++++ Y IN RD+++ + I D+CVYEDL FP+PF G+ A F++ + DL F
Sbjct: 7 LMIKSLYQAINRRDISAAIDCIDDECVYEDLNFPQPFRGKAAVQQLFEESYQGVPDDLLF 66
Query: 148 VIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
VIDDI+ DSS+ GV WH+E G PFP +G SFY+L GK
Sbjct: 67 VIDDITTGDSSSTGVLWHVELDGIPFPNGRGVSFYRLSETTGK 109
>gi|159487887|ref|XP_001701954.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281173|gb|EDP06929.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1224
Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats.
Identities = 47/117 (40%), Positives = 73/117 (62%), Gaps = 1/117 (0%)
Query: 75 TTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFF 134
+TS+ D G V+ R+Y N D+ ++ L+A D Y D+I+ PF GR+ + +
Sbjct: 76 STSSRDAPVKDGRDVIVRYYEAYNAGDIDTIAGLLAPDVSYHDMIYEEPFQGREEVVAYL 135
Query: 135 KKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEW-KGKPFPFSKGCSFYKLEVVNGK 190
+K ++ SDL+FVI+D+++ D AVG+TWH+E G FPFS+GCSFY L+ G+
Sbjct: 136 RKVRKTVPSDLKFVIEDVTSGDPRAVGITWHVECGDGVVFPFSRGCSFYTLDPATGQ 192
>gi|332708365|ref|ZP_08428343.1| SnoaL-like polyketide cyclase [Moorea producens 3L]
gi|332352858|gb|EGJ32420.1| SnoaL-like polyketide cyclase [Moorea producens 3L]
Length = 394
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 62/102 (60%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
+++ Y IN RD+ + E I D C+YEDL F +PF G++A ++ I DL+FV
Sbjct: 8 LIQSMYEAINRRDVNAAMEWIDDQCIYEDLNFSQPFKGKEAVRQLLEESCQGIPDDLKFV 67
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
IDDI+ D AVGV WH+E G PFP +G SFY+ V GK
Sbjct: 68 IDDITTGDPLAVGVLWHVELDGIPFPNGRGVSFYRFSEVTGK 109
>gi|359474296|ref|XP_002273128.2| PREDICTED: uncharacterized protein LOC100251848 [Vitis vinifera]
Length = 228
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 60/94 (63%)
Query: 90 VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVI 149
+ FY IN ++L + +L++DDC + DL FP+PF G+K L FF++ + + +++F I
Sbjct: 81 IELFYTCINDKNLKQLAKLVSDDCCFNDLSFPQPFKGKKEVLRFFEELTAVMGKNVKFRI 140
Query: 150 DDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYK 183
+ D + WHLEW+GK PF++GCSFY+
Sbjct: 141 LHVCEGDGLTAAIDWHLEWQGKQIPFTRGCSFYE 174
>gi|145358721|ref|NP_198962.2| Nuclear transport factor 2 (NTF2) family protein [Arabidopsis
thaliana]
gi|332007299|gb|AED94682.1| Nuclear transport factor 2 (NTF2) family protein [Arabidopsis
thaliana]
Length = 277
Score = 96.3 bits (238), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 73/134 (54%), Gaps = 5/134 (3%)
Query: 51 PLSKLRISSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIA 110
P+ + +S N + P + + G D V +FY+ IN ++ + I+
Sbjct: 45 PIKMINKYASRRNIVVSCLDRPNSRPNQISGYDA-----VMKFYSSINEKNQDQLSSCIS 99
Query: 111 DDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKG 170
DC +D FP+PF G++ ++FF++ S+ +++F ++++ D + V WHLEWKG
Sbjct: 100 SDCFIDDFSFPKPFRGKQEAMEFFEELVKSMGQNVKFCVENVCEGDGHSAAVNWHLEWKG 159
Query: 171 KPFPFSKGCSFYKL 184
+ PF++GCSFY+
Sbjct: 160 RKIPFTRGCSFYEF 173
>gi|255537559|ref|XP_002509846.1| conserved hypothetical protein [Ricinus communis]
gi|223549745|gb|EEF51233.1| conserved hypothetical protein [Ricinus communis]
Length = 326
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 67/115 (58%)
Query: 69 VASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRK 128
V S N+ + A ++++FY IN + L +++ I+D+C +ED F P G+K
Sbjct: 68 VMSSAADDGNIPLNHISAASMIKKFYTCINEKRLEEIDKYISDNCCFEDCSFYSPIQGKK 127
Query: 129 ATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYK 183
+ F+++ + + +++F I+ + +D VGV WHLEWK PF++GCSFY+
Sbjct: 128 EVMHFYQQLTTGMVQNVKFSIEHVCEDDEFTVGVNWHLEWKRTHIPFTRGCSFYE 182
>gi|297805502|ref|XP_002870635.1| hypothetical protein ARALYDRAFT_493835 [Arabidopsis lyrata subsp.
lyrata]
gi|297316471|gb|EFH46894.1| hypothetical protein ARALYDRAFT_493835 [Arabidopsis lyrata subsp.
lyrata]
Length = 277
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 63/108 (58%)
Query: 86 GAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDL 145
G V +FY+ IN ++ + I+ DC +D F +PF GRK + FF++ S+ ++
Sbjct: 75 GYDAVMKFYSSINEKNQDQLSSCISSDCFIDDFSFSKPFRGRKEAMKFFEELVKSMGQNV 134
Query: 146 QFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
+F ++++ D + V WHLEWKG+ PF++GCSFY+ GK I
Sbjct: 135 KFCVENVCEGDGHSAAVNWHLEWKGRKIPFTRGCSFYEFTDEGGKLVI 182
>gi|224071972|ref|XP_002303603.1| predicted protein [Populus trichocarpa]
gi|222841035|gb|EEE78582.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 61/95 (64%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
++++FY IN + L ++ I++DC +ED F +P G++ + FF++ + + +++F+
Sbjct: 86 MIKQFYTCINDKKLKELDGYISEDCHFEDCSFLQPMQGKREVMHFFRQLTAGMGENMKFI 145
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYK 183
I+ + +D GV WHLEWK PF++GCSFY+
Sbjct: 146 IEHVCEDDEMTAGVNWHLEWKTIQIPFTRGCSFYE 180
>gi|67924420|ref|ZP_00517847.1| hypothetical protein CwatDRAFT_1831 [Crocosphaera watsonii WH 8501]
gi|416403736|ref|ZP_11687588.1| hypothetical protein CWATWH0003_4349 [Crocosphaera watsonii WH
0003]
gi|67853732|gb|EAM49064.1| hypothetical protein CwatDRAFT_1831 [Crocosphaera watsonii WH 8501]
gi|357261654|gb|EHJ10897.1| hypothetical protein CWATWH0003_4349 [Crocosphaera watsonii WH
0003]
Length = 386
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 62/102 (60%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
++ Y IN RD+ + + DDC+YED+ F +PF G++A F++ D++ DL+F+
Sbjct: 9 IIESIYQAINNRDIEQAIQWVDDDCLYEDMNFSQPFQGKEAVKTLFQESCDNVPDDLKFI 68
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
ID+I+ ED VG+ WH+E P P +G SFY++ GK
Sbjct: 69 IDEITTEDPLKVGILWHVELDDIPLPNGRGVSFYRISETTGK 110
>gi|308804507|ref|XP_003079566.1| unnamed protein product [Ostreococcus tauri]
gi|116058021|emb|CAL54224.1| unnamed protein product [Ostreococcus tauri]
Length = 454
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 64/105 (60%), Gaps = 3/105 (2%)
Query: 90 VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVI 149
+ Y IN RD+A E +A+D VYED FP PF+G+ A F++ I DL FV+
Sbjct: 48 IATMYDRINARDVAGALECVAEDVVYEDFNFPEPFVGKAAVRALFEESCSGIPDDLAFVV 107
Query: 150 DDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQIT 194
D+ ++ ++VG TWH+E G+ FP ++GCS Y+ VNG ++T
Sbjct: 108 DEWTSGSGASVGCTWHVEIMGEAFPNARGCSLYR---VNGDGKLT 149
>gi|303276012|ref|XP_003057300.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461652|gb|EEH58945.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 399
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 63/102 (61%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
VVR Y IN RD+A+ E + DD +YED FP PF G+ A F++ D I D+ F+
Sbjct: 5 VVRGMYDAINRRDVAAALEFVDDDILYEDFNFPTPFKGKAAVKKLFEESCDGIPDDMLFI 64
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
ID+ + +VG+TW++E +G+PFP ++G S Y++ GK
Sbjct: 65 IDECTDGGGKSVGMTWYVELEGEPFPNARGASLYRISETTGK 106
>gi|60547917|gb|AAX23922.1| hypothetical protein At5g41470 [Arabidopsis thaliana]
gi|71905583|gb|AAZ52769.1| hypothetical protein At5g41470 [Arabidopsis thaliana]
Length = 198
Score = 92.4 bits (228), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 60/93 (64%)
Query: 92 RFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDD 151
+FY+ IN ++ + I+ DC +D FP+PF G++ ++FF++ S+ +++F +++
Sbjct: 2 KFYSSINEKNQDQLSSCISSDCFIDDFSFPKPFRGKQEAMEFFEELVKSMGQNVKFCVEN 61
Query: 152 ISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKL 184
+ D + V WHLEWKG+ PF++GCSFY+
Sbjct: 62 VCEGDGHSAAVNWHLEWKGRKIPFTRGCSFYEF 94
>gi|218194167|gb|EEC76594.1| hypothetical protein OsI_14445 [Oryza sativa Indica Group]
Length = 612
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 64/107 (59%), Gaps = 3/107 (2%)
Query: 78 NVDGDDGGGAV--VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFK 135
N G GG + V+++FY+ +N +D +E LIA DC+ +D + + L K+T +F+
Sbjct: 425 NSTGGRGGSPLPDVIQQFYSSLNEKDSKRLENLIAPDCIIDDNAYYK-LLDIKSTQTYFR 483
Query: 136 KFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFY 182
+ D++ + +F ID++S V WHLEW GK PF+KGCSFY
Sbjct: 484 RLMDAMGKNFKFAIDEVSQGVEPTFAVMWHLEWNGKTIPFTKGCSFY 530
>gi|427420214|ref|ZP_18910397.1| hypothetical protein Lepto7375DRAFT_6094 [Leptolyngbya sp. PCC
7375]
gi|425762927|gb|EKV03780.1| hypothetical protein Lepto7375DRAFT_6094 [Leptolyngbya sp. PCC
7375]
Length = 146
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 61/101 (60%), Gaps = 1/101 (0%)
Query: 90 VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVI 149
++ Y IN RD++ LI D C YEDL F F+G+ A F + ++ +DLQFVI
Sbjct: 8 IQSIYEAINHRDVSQAVALIDDQCCYEDLNFSATFMGKAAVQGLFTESCQAVPADLQFVI 67
Query: 150 DDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
DD++ +D AVGVTWH+E G FP +G SFY+ +GK
Sbjct: 68 DDMAGDD-LAVGVTWHVELDGIAFPNGRGVSFYRFSPNSGK 107
>gi|357481563|ref|XP_003611067.1| hypothetical protein MTR_5g010050 [Medicago truncatula]
gi|355512402|gb|AES94025.1| hypothetical protein MTR_5g010050 [Medicago truncatula]
Length = 249
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 59/101 (58%)
Query: 87 AVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQ 146
A +V FY IN ++L ++E I+ D + D F PF G+K + F ++ + + +++
Sbjct: 84 AEIVDHFYTCINEKELKQLDEYISQDACFYDYTFINPFQGKKEVMHFLQQLTAGMGQNVK 143
Query: 147 FVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVV 187
F++ +I D V WHLEWK + PF+ GCSF++LE V
Sbjct: 144 FIVKNICEGDDLTVAAKWHLEWKKEQIPFTTGCSFFQLEKV 184
>gi|145346791|ref|XP_001417866.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578094|gb|ABO96159.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 429
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 60/101 (59%), Gaps = 1/101 (0%)
Query: 90 VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVI 149
VR Y IN RD+ + ++ VYED FP PF G+ A F++ + I DL FVI
Sbjct: 26 VREMYERINARDVDGALACVDENVVYEDFNFPAPFRGKAAVKKLFEESCEGIPDDLTFVI 85
Query: 150 DDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
D S AVG+TWH+E G+ FP ++GCSFY+++ NGK
Sbjct: 86 -DASTCGGDAVGLTWHVELMGEAFPNARGCSFYRVDETNGK 125
>gi|359461040|ref|ZP_09249603.1| hypothetical protein ACCM5_20105 [Acaryochloris sp. CCMEE 5410]
Length = 406
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 68/122 (55%), Gaps = 9/122 (7%)
Query: 69 VASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRK 128
V +P+P +++ +++ Y IN R++A E + DC Y+DL FP PF G+
Sbjct: 8 VLNPMPLPAHL---------LIQAMYEAINERNVAKALEYVDPDCCYQDLNFPTPFQGKA 58
Query: 129 ATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVN 188
A + F + + +DL FV+DDI+ D+ AVG+ WH+E G P P +G SF +L
Sbjct: 59 AVEELFTESCKGMPADLLFVVDDITEGDALAVGILWHVELGGIPLPNGRGVSFCRLSEKT 118
Query: 189 GK 190
GK
Sbjct: 119 GK 120
>gi|307104859|gb|EFN53111.1| hypothetical protein CHLNCDRAFT_9491 [Chlorella variabilis]
Length = 391
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 44/107 (41%), Positives = 63/107 (58%), Gaps = 8/107 (7%)
Query: 93 FYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDI 152
+Y N + + +V ELIA+DCVYEDLI+ PF+GR A +F+K + D++F ++DI
Sbjct: 10 YYDSYNSKQMGAVLELIAEDCVYEDLIYQDPFVGRAAIAAYFRKIEALVPPDIKFCVEDI 69
Query: 153 SAEDSSAVGVTWHLE-----WKGKPFPFSKGCSFYKLEVVNGKRQIT 194
+ D GV WH+E + FPFS+G SFY+ VN QI
Sbjct: 70 TEGDPRRCGVRWHVEIADDSGRATEFPFSRGVSFYE---VNDLGQIV 113
>gi|158335683|ref|YP_001516855.1| hypothetical protein AM1_2533 [Acaryochloris marina MBIC11017]
gi|158305924|gb|ABW27541.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 406
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 61/103 (59%)
Query: 88 VVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQF 147
++++ Y IN R++A E + DC Y+DL FP PF G+ A + F + + +DL F
Sbjct: 18 LLIQAMYEAINERNVAKALEYVDPDCCYQDLNFPIPFQGKAAVGELFTESCKGMPADLLF 77
Query: 148 VIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
V+DDI+ D+ AVG+ WH+E G P P +G SF +L GK
Sbjct: 78 VVDDITEGDALAVGILWHVELGGIPLPNGRGVSFCRLSEKTGK 120
>gi|255083853|ref|XP_002508501.1| predicted protein [Micromonas sp. RCC299]
gi|226523778|gb|ACO69759.1| predicted protein [Micromonas sp. RCC299]
Length = 428
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 64/102 (62%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
VVR Y IN RD+ + + DD +YED FP PF G++ F++ D I D+ F+
Sbjct: 5 VVRGMYDAINRRDVEAALAFVDDDILYEDFNFPVPFKGKERVRKLFEESCDGIPDDMLFI 64
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
+D+ + +VG+TW++E +G+PFP ++G SFY+++ +GK
Sbjct: 65 VDECTDGGGLSVGMTWYVELEGEPFPNARGASFYRIDPTSGK 106
>gi|357463753|ref|XP_003602158.1| hypothetical protein MTR_3g090440 [Medicago truncatula]
gi|355491206|gb|AES72409.1| hypothetical protein MTR_3g090440 [Medicago truncatula]
Length = 255
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/197 (27%), Positives = 98/197 (49%), Gaps = 30/197 (15%)
Query: 9 SSSLRTSPSSLLPCLNQTTFSYSNSTISSHFIST-------PTTRKRLAPLSKLRISSSE 61
+SS + + +S PC+ + Y+N T HFI+ P +K+L + K ++ SS+
Sbjct: 17 TSSNKNTTNSTRPCITRL---YNNKT---HFINLVAKPMLLPNIKKQLK-IGKWKVLSSD 69
Query: 62 NNRTAVDV----ASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYED 117
+ V ASP P +V FY N +D ++++L++ +CVY+D
Sbjct: 70 DGSGDVAPEFLPASPSPVH------------IVHEFYEAFNKKDTETLKQLLSPNCVYQD 117
Query: 118 LIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSK 177
L+F + G+++ + F++ D++ ++ ++D+ + V HL WK K PF+
Sbjct: 118 LLFYTAYEGQESIIKFWQSVMDAMGPNIHVFVEDVKESNHVMVTAFMHLVWKEKKLPFTN 177
Query: 178 GCSFYKLEVVNGKRQIT 194
GC F+ E V GK I+
Sbjct: 178 GCRFFTFEEVEGKFLIS 194
>gi|172035034|ref|YP_001801535.1| hypothetical protein cce_0117 [Cyanothece sp. ATCC 51142]
gi|354551958|ref|ZP_08971266.1| hypothetical protein Cy51472DRAFT_0062 [Cyanothece sp. ATCC 51472]
gi|171696488|gb|ACB49469.1| hypothetical protein cce_0117 [Cyanothece sp. ATCC 51142]
gi|353555280|gb|EHC24668.1| hypothetical protein Cy51472DRAFT_0062 [Cyanothece sp. ATCC 51472]
Length = 386
Score = 86.3 bits (212), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 59/102 (57%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
++ Y IN R++ + + DDC+YED+ F + F G+KA F++ D++ D +FV
Sbjct: 9 IIELIYQAINNREIDQAMQWVDDDCIYEDVNFSKTFQGKKAVKSLFQESCDNVPDDFRFV 68
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
+D+I+ D VGV WH+E P P +G SFY++ GK
Sbjct: 69 VDEITTGDPLKVGVLWHVELDNIPIPNGRGVSFYRISETTGK 110
>gi|449518407|ref|XP_004166233.1| PREDICTED: uncharacterized protein LOC101231762 [Cucumis sativus]
Length = 193
Score = 86.3 bits (212), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 55/93 (59%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
++ RFY IN ++L + I++DC+ ED +F F G+KA + F +K ++S+ D++F
Sbjct: 18 MIERFYKCINEKNLKEMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFR 77
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSF 181
I + S G WHLEW+ P +KGC+F
Sbjct: 78 IRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTF 110
>gi|357114599|ref|XP_003559086.1| PREDICTED: uncharacterized protein LOC100837010 [Brachypodium
distachyon]
Length = 270
Score = 85.9 bits (211), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 61/97 (62%), Gaps = 4/97 (4%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLD---FFKKFSDSISSDL 145
V++ FY+ +N +D A +++LI+ DC+ ED + +P L K L+ +F + +++ ++
Sbjct: 70 VIQEFYSSLNDKDSAQLKKLISPDCIIEDTAYYKP-LDIKVLLNTHTYFTRLMEAMGKNV 128
Query: 146 QFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFY 182
+F ID++ V V WHLEW GK PF+KGCSFY
Sbjct: 129 KFAIDEVCQGVEPTVAVMWHLEWNGKTIPFAKGCSFY 165
>gi|449452454|ref|XP_004143974.1| PREDICTED: uncharacterized protein LOC101214565 [Cucumis sativus]
Length = 157
Score = 85.9 bits (211), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 55/93 (59%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
++ RFY IN ++L + I++DC+ ED +F F G+KA + F +K ++S+ D++F
Sbjct: 18 MIERFYKCINEKNLKEMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFR 77
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSF 181
I + S G WHLEW+ P +KGC+F
Sbjct: 78 IRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTF 110
>gi|414874058|tpg|DAA52615.1| TPA: hypothetical protein ZEAMMB73_891301 [Zea mays]
Length = 270
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 61/101 (60%), Gaps = 2/101 (1%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
VV+ FY+ +N R+ +++L+A DC+ ED + +P L K T +FK+ +S+ ++F
Sbjct: 73 VVQEFYSSLNERNSKRLDKLMAPDCIIEDTAYYKP-LDAKCTHIYFKRLMESMGEKVKFA 131
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNG 189
ID++ V WHLEW G PFS+GCSFY + VNG
Sbjct: 132 IDEVCQGAGRTAAVMWHLEWNGYIIPFSRGCSFY-ICSVNG 171
>gi|242032127|ref|XP_002463458.1| hypothetical protein SORBIDRAFT_01g000255 [Sorghum bicolor]
gi|241917312|gb|EER90456.1| hypothetical protein SORBIDRAFT_01g000255 [Sorghum bicolor]
Length = 226
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 57/94 (60%), Gaps = 1/94 (1%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
VV+ FY+ +N ++ +++L+A DC+ ED + +P L K T +FK+ +S+ ++F
Sbjct: 87 VVQEFYSSLNEKNSKRLDKLMAPDCIVEDTAYYKP-LDAKCTRIYFKRLMESMGKKVKFA 145
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFY 182
ID++ V WHLEW G PF+KGCSFY
Sbjct: 146 IDEVCQGAGRTAAVMWHLEWNGYIIPFTKGCSFY 179
>gi|126659484|ref|ZP_01730617.1| hypothetical protein CY0110_07154 [Cyanothece sp. CCY0110]
gi|126619218|gb|EAZ89954.1| hypothetical protein CY0110_07154 [Cyanothece sp. CCY0110]
Length = 386
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 59/102 (57%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
++ Y IN R++ + + DDC+YED+ F + F G++A F++ D+ +D +FV
Sbjct: 9 IIESIYQAINNREIEQAMQWVDDDCMYEDVNFSKTFQGKEAVKHLFQESCDNAPNDFKFV 68
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
+D+I+ D VGV WH+E P P +G SFY++ GK
Sbjct: 69 VDEITTGDPLKVGVLWHVELDNIPIPNGRGVSFYRISETTGK 110
>gi|356495695|ref|XP_003516709.1| PREDICTED: uncharacterized protein LOC100779471 [Glycine max]
Length = 218
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 56/95 (58%)
Query: 90 VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVI 149
V ++Y IN +DL ++E I++D ++D F +PF G+K + F ++ + + ++ F +
Sbjct: 20 VEQYYTSINDKDLRQLDECISEDACFDDYAFTKPFQGKKEVIRFLEQLTHCMGRNVTFRL 79
Query: 150 DDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKL 184
I D +WHLEW K PF++GC+F+KL
Sbjct: 80 KHIYEGDDLTAVASWHLEWNEKQIPFTRGCTFFKL 114
>gi|31193920|gb|AAP44755.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108712240|gb|ABG00035.1| expressed protein [Oryza sativa Japonica Group]
Length = 259
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 62/112 (55%), Gaps = 7/112 (6%)
Query: 78 NVDGDDGGGAV--VVRRFYAGINGRDLASVEELIADDCVYED-----LIFPRPFLGRKAT 130
N G GG + V+++FY+ +N +D +E LIA DC+ +D L+ + L +
Sbjct: 66 NSTGGRGGSPLPDVIQQFYSSLNEKDSKRLENLIAPDCIIDDNAYYKLLDIKVLLVLHSH 125
Query: 131 LDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFY 182
+ ++ D++ + +F ID++S V WHLEW GK PF+KGCSFY
Sbjct: 126 ITDLQRLMDAMGKNFKFAIDEVSQGVEPTFAVMWHLEWNGKTIPFTKGCSFY 177
>gi|412993256|emb|CCO16789.1| predicted protein [Bathycoccus prasinos]
Length = 478
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 57/111 (51%), Gaps = 9/111 (8%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
VVR Y IN RD+ + DD VYED FP PF G+ F++ I DL F+
Sbjct: 32 VVRGMYDAINKRDVTKALTFVDDDIVYEDFNFPEPFRGKARVKKLFEESCTGIPDDLDFI 91
Query: 149 IDDISAE---------DSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
I+ + +S G TWH+E G+PFP ++G SFY++ +GK
Sbjct: 92 IERSTTSSSSEAGGLGESLTFGCTWHVEIAGEPFPNARGASFYEISKESGK 142
>gi|224058447|ref|XP_002299514.1| predicted protein [Populus trichocarpa]
gi|222846772|gb|EEE84319.1| predicted protein [Populus trichocarpa]
Length = 276
Score = 72.4 bits (176), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 56/95 (58%), Gaps = 5/95 (5%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
++++FY IN + L ++ I+DDC +ED F +P G+K + FF + + + +++F+
Sbjct: 82 MIKQFYTCINEKKLKELDGYISDDCFFEDCSFLQPMQGKKEVMHFFGQLTAGMGQNVKFI 141
Query: 149 IDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYK 183
+ E S+ + + EWK PF+KGCSFY+
Sbjct: 142 L-----EHSNKLMRILYAEWKTIQIPFTKGCSFYE 171
>gi|397568380|gb|EJK46109.1| hypothetical protein THAOC_35242 [Thalassiosira oceanica]
Length = 607
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 56/113 (49%), Gaps = 7/113 (6%)
Query: 83 DGGGAVV------VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKK 136
DG AVV V R++ N RD+ L +DC DL + F GR +
Sbjct: 216 DGSSAVVDTNLSVVERYFEAWNKRDMKEAVSLFTEDCNMRDLQYDSEFKGRAEFERHLLR 275
Query: 137 FSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNG 189
D + +FV+DD++ + A GV WH+E G P F++GCSFY ++ +G
Sbjct: 276 VKDCLPGSFEFVVDDVALSPTKA-GVVWHVENDGSPLAFTRGCSFYTIDQRSG 327
>gi|219114851|ref|XP_002178221.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409956|gb|EEC49886.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 627
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 1/94 (1%)
Query: 92 RFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDD 151
+++A N RD++S L AD Y+D FP PF G+ K S++ S F +D
Sbjct: 244 KYFAAWNRRDMSSAVALFADTVTYDDTAFPEPFSGKTNLSSHLYKCSNAFPSTFTFQVDK 303
Query: 152 ISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLE 185
+ A+ + V WH+E G PF++GCSFY ++
Sbjct: 304 V-ADAGDRISVLWHVENDGDDLPFTRGCSFYNVD 336
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 52/100 (52%), Gaps = 4/100 (4%)
Query: 87 AVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSD-- 144
A V+ F + N RD ++E ++ D ++D +P+P G L+ + + + D
Sbjct: 80 AQTVQNFISAYNERDYNTLENMVDPDIEFDDTAYPKPCRGL-PELERRWRLTRNAQGDKR 138
Query: 145 LQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKL 184
+Q +DDI A ++ VG+ +HLE P +G +F++L
Sbjct: 139 IQVAVDDI-ASSTTTVGIRFHLENVEGEIPNGRGAAFFQL 177
>gi|449018192|dbj|BAM81594.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 271
Score = 68.9 bits (167), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 56/99 (56%), Gaps = 3/99 (3%)
Query: 87 AVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQ 146
A VV R++ N R++A + L A D YED ++PR GR+A + K + + L+
Sbjct: 124 ARVVFRYFDQWNKREIAGIIPLFAPDVFYEDALYPRALHGREALAEHLTKVARVLPPQLE 183
Query: 147 FVIDDISAE-DSSAVGVTWHLEWK--GKPFPFSKGCSFY 182
FVIDD+ + + V +HLE + GK PFS+G SF+
Sbjct: 184 FVIDDVCEDLERGKCAVRFHLEDRRTGKQVPFSRGTSFF 222
>gi|428180781|gb|EKX49647.1| hypothetical protein GUITHDRAFT_151421 [Guillardia theta CCMP2712]
Length = 170
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 49/80 (61%), Gaps = 3/80 (3%)
Query: 107 ELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHL 166
+ ADD VYED+++ PF+G+ A +F K + D FV+D +S + + G+TWH+
Sbjct: 51 DYFADDVVYEDMVYSEPFVGKDAVREFLLKTKEMAPPDFVFVVDRVS-DGVRSCGLTWHI 109
Query: 167 EWKGKP--FPFSKGCSFYKL 184
E K +P F+ GCSFY+L
Sbjct: 110 ELKTRPDAGKFANGCSFYEL 129
>gi|9758055|dbj|BAB08518.1| unnamed protein product [Arabidopsis thaliana]
Length = 158
Score = 67.0 bits (162), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 47/81 (58%)
Query: 86 GAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDL 145
G V +FY+ IN ++ + I+ DC +D FP+PF G++ ++FF++ S+ ++
Sbjct: 75 GYDAVMKFYSSINEKNQDQLSSCISSDCFIDDFSFPKPFRGKQEAMEFFEELVKSMGQNV 134
Query: 146 QFVIDDISAEDSSAVGVTWHL 166
+F ++++ D + V WHL
Sbjct: 135 KFCVENVCEGDGHSAAVNWHL 155
>gi|323451680|gb|EGB07556.1| hypothetical protein AURANDRAFT_64644 [Aureococcus anophagefferens]
Length = 235
Score = 66.6 bits (161), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 53/101 (52%), Gaps = 9/101 (8%)
Query: 90 VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFL--GRKA---TLDFFKKFSDSI--- 141
+R Y N RD V L+A+D VYEDL+ + GR++ L F F +
Sbjct: 84 IRGLYKAFNDRDAPRVASLLAEDVVYEDLLLGASTICRGRESFSQALAFHPAFLSAQLGL 143
Query: 142 -SSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSF 181
L+ V+DD++ + +VGV WH+E G+PFP S+G S
Sbjct: 144 PMGRLELVVDDVACDGERSVGVEWHVELNGEPFPLSRGLSL 184
>gi|147816094|emb|CAN72893.1| hypothetical protein VITISV_022313 [Vitis vinifera]
Length = 267
Score = 65.9 bits (159), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 3/111 (2%)
Query: 82 DDGGGAVVVRRFYAGINGRDLASVEELIADDC-VYEDLI-FPRPFLGRKATLDFFKKFSD 139
+D V++ Y I +++ V ++I D+C + + I +PF G+K L+FF
Sbjct: 90 EDNRALEAVQKLYTAIKNKNVKEVSDVIGDECRCFCNFISASQPFHGKKQALEFFDHLM- 148
Query: 140 SISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
I D + + D AVGVTW LEWK P P G S+Y + G+
Sbjct: 149 KILGDHASIRFSFTMHDGMAVGVTWRLEWKNNPVPLGTGFSYYVCQEYRGR 199
>gi|255559074|ref|XP_002520559.1| conserved hypothetical protein [Ricinus communis]
gi|223540219|gb|EEF41792.1| conserved hypothetical protein [Ricinus communis]
Length = 254
Score = 65.9 bits (159), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 54/114 (47%), Gaps = 3/114 (2%)
Query: 82 DDGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFP--RPFLGRKATLDFFKKFSD 139
+D V + Y I +++ V +I D+C F F G++ LDFF
Sbjct: 78 EDTPALETVLKLYTAIKNQNIHEVSNMIGDECRCVCNFFSSFESFQGKQQVLDFFNYVMQ 137
Query: 140 SISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
++ ++++FV+ + D VGV+W LEW P KG SFY ++ GK I
Sbjct: 138 TLGNNIEFVVQP-TKYDGMNVGVSWRLEWSKTHMPLGKGFSFYICQIYQGKVTI 190
>gi|224066735|ref|XP_002302189.1| predicted protein [Populus trichocarpa]
gi|222843915|gb|EEE81462.1| predicted protein [Populus trichocarpa]
Length = 203
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 61/106 (57%), Gaps = 4/106 (3%)
Query: 82 DDGGGAVVVRRFYAGINGR-DLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDS 140
DD G +++ RF+ IN R D +E++++ DCV++D IF F G ++ + F +K +
Sbjct: 94 DDIG--ILIHRFFYAINSRNDEQLLEDVLSYDCVFKDFIFQIAFDGEQSIIQFLRKVMMA 151
Query: 141 ISSDLQFVIDDISAEDS-SAVGVTWHLEWKGKPFPFSKGCSFYKLE 185
+ +++F I+ + ++ A HLEW + PF++ C+ ++ E
Sbjct: 152 MGPNIRFKIESVQCKNELQAATAFLHLEWDNQVIPFTRFCTDFECE 197
>gi|87241143|gb|ABD33001.1| hypothetical protein MtrDRAFT_AC150798g6v2 [Medicago truncatula]
Length = 267
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 54/116 (46%), Gaps = 4/116 (3%)
Query: 81 GDDGGGAV-VVRRFYAGINGRDLASVEELIADDC--VYEDLIFPRPFLGRKATLDFFKKF 137
G+D A+ V + Y+ +++ + E++AD+C V L F + F G+ L+FF
Sbjct: 88 GEDNDQALDAVMKLYSAFKNKNIQELSEILADECRCVCNFLSFFQAFQGKTQVLEFFSHL 147
Query: 138 SDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
++Q V+ + D VGV W EW P KG SF+ + GK I
Sbjct: 148 IRLFGDNIQIVVKP-TLHDGMNVGVHWKFEWNTIHIPLGKGFSFHICQTYRGKAVI 202
>gi|357447893|ref|XP_003594222.1| hypothetical protein MTR_2g025760 [Medicago truncatula]
gi|355483270|gb|AES64473.1| hypothetical protein MTR_2g025760 [Medicago truncatula]
Length = 251
Score = 60.5 bits (145), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 54/116 (46%), Gaps = 4/116 (3%)
Query: 81 GDDGGGAV-VVRRFYAGINGRDLASVEELIADDC--VYEDLIFPRPFLGRKATLDFFKKF 137
G+D A+ V + Y+ +++ + E++AD+C V L F + F G+ L+FF
Sbjct: 72 GEDNDQALDAVMKLYSAFKNKNIQELSEILADECRCVCNFLSFFQAFQGKTQVLEFFSHL 131
Query: 138 SDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
++Q V+ + D VGV W EW P KG SF+ + GK I
Sbjct: 132 IRLFGDNIQIVVKP-TLHDGMNVGVHWKFEWNTIHIPLGKGFSFHICQTYRGKAVI 186
>gi|159485316|ref|XP_001700691.1| predicted protein [Chlamydomonas reinhardtii]
gi|158272033|gb|EDO97840.1| predicted protein [Chlamydomonas reinhardtii]
Length = 213
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 44/91 (48%)
Query: 74 PTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDF 133
P V G G A RF + +N RDL+ + +L++DDC + DL ++ F
Sbjct: 27 PPRRLVMGQPIGRAAACCRFISALNARDLSGMLQLVSDDCHHVDLSHEAEGHCKEDVARF 86
Query: 134 FKKFSDSISSDLQFVIDDISAEDSSAVGVTW 164
+ S+ +Q V+DDI++ D S G +
Sbjct: 87 YADVVASMPEKVQVVVDDITSGDDSRAGAIY 117
>gi|323450797|gb|EGB06676.1| hypothetical protein AURANDRAFT_65293 [Aureococcus anophagefferens]
Length = 1937
Score = 44.3 bits (103), Expect = 0.024, Method: Composition-based stats.
Identities = 41/122 (33%), Positives = 56/122 (45%), Gaps = 19/122 (15%)
Query: 76 TSNVDGDDG---------GGAVVVRRFY---AGINGRDLASVEELIADDCVYEDLIFPRP 123
T+NV+ +D GAV V + A G A EL A+D YED +P P
Sbjct: 1781 TANVEKEDTPATYEPRAPTGAVAVADYLWNEAYPKGATPAVALELFAEDIRYEDFNYPAP 1840
Query: 124 FLGRKATLDFFKKFSDSISSDLQFVIDDISAE--DSSAVGVTWHLEWKGKPFPFSKGCSF 181
FLG+ A +F + F + FV +IS D A TW ++ G+ P KG SF
Sbjct: 1841 FLGKPAVKEFVEAFD---IPGIDFVPLEISGGDVDGGACCFTWIVKVNGQDGP--KGISF 1895
Query: 182 YK 183
Y+
Sbjct: 1896 YE 1897
>gi|42407773|dbj|BAD08919.1| hypothetical protein [Oryza sativa Japonica Group]
gi|42408132|dbj|BAD09271.1| hypothetical protein [Oryza sativa Japonica Group]
gi|125562081|gb|EAZ07529.1| hypothetical protein OsI_29785 [Oryza sativa Indica Group]
gi|125603929|gb|EAZ43254.1| hypothetical protein OsJ_27852 [Oryza sativa Japonica Group]
Length = 172
Score = 44.3 bits (103), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 46/111 (41%), Gaps = 20/111 (18%)
Query: 90 VRRFY---AGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLD--------FFKKFS 138
VRR + + R + + EL D+C+Y F GR +++D F +
Sbjct: 66 VRRLVKLNSAVQNRSVRELLELAGDECLY--------FFGRISSIDVSQVSKNMFLLLHA 117
Query: 139 DSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNG 189
+ + FV+ E +GV W LEWKGK P+ C+ V G
Sbjct: 118 MMLRHHVSFVLKPTENEGFD-LGVKWSLEWKGKKLPWDLDCNITTNHVYRG 167
>gi|413925260|gb|AFW65192.1| hypothetical protein ZEAMMB73_842673 [Zea mays]
Length = 335
Score = 43.5 bits (101), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 19/108 (17%)
Query: 92 RFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDS--------ISS 143
+ + I R + + EL+ D+C+Y FLG ++D + D +
Sbjct: 82 KLNSAIQNRSVRELLELLGDECLY--------FLGNLRSVDVPRLGKDMFLLLHALMVRH 133
Query: 144 DLQFVIDDISAEDSSA--VGVTWHLEWKGKPFPFSKGCSFYKLEVVNG 189
+ FV+ S +D + +GV W LEWKGK P+ C+ V G
Sbjct: 134 HVSFVLKP-SPDDEAGFDLGVKWSLEWKGKKLPWDLDCNVSTTHVYRG 180
>gi|356532339|ref|XP_003534731.1| PREDICTED: uncharacterized protein LOC100812303 [Glycine max]
Length = 399
Score = 43.1 bits (100), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 51/119 (42%), Gaps = 7/119 (5%)
Query: 75 TTSNVDGDDGG-GAVVVRRFYAGINGRDLASVEELIADDC--VYEDLIFPRPFLGRKATL 131
S +G+DG G V + Y+ + + EL AD+ V L F F GR L
Sbjct: 21 NNSESEGEDGHHGLDSVMKLYSAFKNK---KIHELSADERRRVSNFLSFFETFQGRTQVL 77
Query: 132 DFFKKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGK 190
+FF + + +++Q + E +A G+ W EW P KG S + +G+
Sbjct: 78 EFFSYLTSILGNNIQIIFKPTPHEGVNA-GLQWKFEWDKIHLPLWKGFSLHISHTYHGR 135
>gi|357021815|ref|ZP_09084046.1| hypothetical protein KEK_17413 [Mycobacterium thermoresistibile
ATCC 19527]
gi|356479563|gb|EHI12700.1| hypothetical protein KEK_17413 [Mycobacterium thermoresistibile
ATCC 19527]
Length = 146
Score = 42.7 bits (99), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFP----RPFLGRKATLDFFKKFSDS-ISS 143
VVR + GIN D + EL+A D V+E L P + + GR A LDF ++ +++ I+
Sbjct: 12 VVRAYLNGINTWDFDGMRELMAPDFVFEQLFAPPGMQKRYEGRDALLDFQRRLAETVITE 71
Query: 144 DLQFVIDDISAEDSSAVGVTWHLEWK 169
+L D A D V VT+ + K
Sbjct: 72 NLHDFEFDTLASDPGVVIVTYRSDMK 97
>gi|242080039|ref|XP_002444788.1| hypothetical protein SORBIDRAFT_07g028065 [Sorghum bicolor]
gi|241941138|gb|EES14283.1| hypothetical protein SORBIDRAFT_07g028065 [Sorghum bicolor]
Length = 181
Score = 42.7 bits (99), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 25/97 (25%), Positives = 41/97 (42%), Gaps = 16/97 (16%)
Query: 92 RFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLD--------FFKKFSDSISS 143
+ + I R + + EL+ D+C+Y F G ++D F + +
Sbjct: 93 KLNSAIQNRSVRELLELLGDECLY--------FFGNLRSVDVPQLGKDMFLLLHALMVRH 144
Query: 144 DLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCS 180
+ FV+ E +GV W LEWKGK P+ C+
Sbjct: 145 HVSFVLKPSPDEAGFDLGVKWSLEWKGKKLPWDLDCN 181
>gi|403729559|ref|ZP_10948583.1| hypothetical protein GORHZ_197_00710 [Gordonia rhizosphera NBRC
16068]
gi|403202973|dbj|GAB92914.1| hypothetical protein GORHZ_197_00710 [Gordonia rhizosphera NBRC
16068]
Length = 117
Score = 41.6 bits (96), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 46/100 (46%), Gaps = 4/100 (4%)
Query: 87 AVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPF--LGRKATLDFFKKFSDSISSD 144
A VV RF + DL + EL+ DD V+E P +GR A L F +F S ++D
Sbjct: 6 ASVVERFNDAFDRADLTRLAELLTDDVVFESTAPPAGIRHIGRPAVLAAFAEFFAS-TTD 64
Query: 145 LQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKL 184
F +++ A V V W W G+ F +G + +
Sbjct: 65 PDFQTEEMIAAHDRVV-VRWRFTWGGEHPGFVRGVDIFTV 103
>gi|356557961|ref|XP_003547278.1| PREDICTED: uncharacterized protein LOC100808156 [Glycine max]
Length = 262
Score = 40.4 bits (93), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 48/119 (40%), Gaps = 6/119 (5%)
Query: 77 SNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDC--VYEDLIFPRPFLGRKATLDFF 134
S + +D V + Y+ ++ EL AD+ V L F F GR L+FF
Sbjct: 81 SESEAEDDHALDSVMKLYSAFKNKN---THELSADERRRVSNFLSFFETFQGRTQVLEFF 137
Query: 135 KKFSDSISSDLQFVIDDISAEDSSAVGVTWHLEWKGKPFPFSKGCSFYKLEVVNGKRQI 193
+ + +++Q + E + VG+ W EW P KG + +G+ I
Sbjct: 138 SYLTSILGNNIQIIFKPTPHEGVN-VGLQWKFEWNKIHLPLWKGFGLHISHTYHGRAVI 195
>gi|389681152|ref|ZP_10172497.1| hypothetical protein PchlO6_4193 [Pseudomonas chlororaphis O6]
gi|388554688|gb|EIM17936.1| hypothetical protein PchlO6_4193 [Pseudomonas chlororaphis O6]
Length = 159
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 34/64 (53%), Gaps = 1/64 (1%)
Query: 90 VRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVI 149
++R+ +N DL V L ADD V+ P P +GR A +++ D I D++F I
Sbjct: 38 LKRYETALNTSDLDQVMTLYADDAVFMPQNSP-PAVGRDAVHAAYRQVFDLIRLDVRFTI 96
Query: 150 DDIS 153
D+I
Sbjct: 97 DEIQ 100
>gi|406030550|ref|YP_006729442.1| RNA polymerase sigma factor [Mycobacterium indicus pranii MTCC
9506]
gi|405129097|gb|AFS14352.1| RNA polymerase sigma factor [Mycobacterium indicus pranii MTCC
9506]
Length = 323
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 53/117 (45%), Gaps = 19/117 (16%)
Query: 2 LMTSSISSSSL----RTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRI 57
L+ S+SS+ L R SP+ + + F+ S TI+ +P K+LA ++ R+
Sbjct: 106 LLAESVSSALLVVLDRLSPAQRVAFVLHDVFAMSFETIAELLHRSPDAAKKLASRARGRL 165
Query: 58 SSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCV 114
A A P T +++ VV F A G D+AS+ EL+A D V
Sbjct: 166 Q-------AAPAAQPRSTAGHLE--------VVEAFLAASRGGDIASLLELLAPDVV 207
>gi|379761768|ref|YP_005348165.1| hypothetical protein OCQ_23320 [Mycobacterium intracellulare
MOTT-64]
gi|378809710|gb|AFC53844.1| hypothetical protein OCQ_23320 [Mycobacterium intracellulare
MOTT-64]
Length = 323
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 53/117 (45%), Gaps = 19/117 (16%)
Query: 2 LMTSSISSSSL----RTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRI 57
L+ S+SS+ L R SP+ + + F+ S TI+ +P K+LA ++ R+
Sbjct: 106 LLAESVSSALLVVLDRLSPAQRVAFVLHDVFAMSFETIAELLHRSPDAAKKLASRARGRL 165
Query: 58 SSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCV 114
A A P T +++ VV F A G D+AS+ EL+A D V
Sbjct: 166 Q-------AAPAAQPRSTAGHLE--------VVEAFLAASRGGDIASLLELLAPDVV 207
>gi|379754489|ref|YP_005343161.1| hypothetical protein OCO_24770 [Mycobacterium intracellulare
MOTT-02]
gi|378804705|gb|AFC48840.1| hypothetical protein OCO_24770 [Mycobacterium intracellulare
MOTT-02]
Length = 323
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 53/117 (45%), Gaps = 19/117 (16%)
Query: 2 LMTSSISSSSL----RTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRI 57
L+ S+SS+ L R SP+ + + F+ S TI+ +P K+LA ++ R+
Sbjct: 106 LLAESVSSALLVVLDRLSPAQRVAFVLHDVFAMSFETIAELLHRSPDAAKKLASRARGRL 165
Query: 58 SSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCV 114
A A P T +++ VV F A G D+AS+ EL+A D V
Sbjct: 166 Q-------AAPAAQPRSTAGHLE--------VVEAFLAASRGGDIASLLELLAPDVV 207
>gi|402077688|gb|EJT73037.1| hypothetical protein GGTG_09888 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 1271
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 49/104 (47%), Gaps = 6/104 (5%)
Query: 20 LPCLNQTTFSYSNSTISSHFISTPTTRKRL-APLSKLRISSSENN-----RTAVDVASPL 73
P + Q Y S I+S ++P ++ L +P+++L+ + E RT ++ A
Sbjct: 512 FPDIRQLVDGYRKSPITSISPASPALQRGLVSPITQLQQHAREEGDPSIMRTMIEDADEN 571
Query: 74 PTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCVYED 117
P + DG+D G + +GIN RD S+ E+ Y+D
Sbjct: 572 PNFPDGDGEDIGSLTMEEELSSGINVRDSESLLEVDVQKHAYQD 615
>gi|221068339|ref|ZP_03544444.1| Steroid Delta-isomerase [Comamonas testosteroni KF-1]
gi|220713362|gb|EED68730.1| Steroid Delta-isomerase [Comamonas testosteroni KF-1]
Length = 125
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 27/113 (23%), Positives = 49/113 (43%), Gaps = 9/113 (7%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
VV+R+ A +N DL + L ADD ED + P GR+A F+ S + ++
Sbjct: 10 VVQRYVAALNAGDLDGIVALFADDATVEDPVGSEPRRGREAIRAFYAN-SLKLPLAVELT 68
Query: 149 IDDISAEDSSAVGVTWHLEWKGKP--------FPFSKGCSFYKLEVVNGKRQI 193
+ + D +A E++G+ F F++ + + G++ I
Sbjct: 69 QEVRAVADEAAFAFIVSFEYQGRKTVIAPIDHFRFNEAGKVVSMRALFGEKNI 121
>gi|427738310|ref|YP_007057854.1| SnoaL-like polyketide cyclase [Rivularia sp. PCC 7116]
gi|427373351|gb|AFY57307.1| SnoaL-like polyketide cyclase [Rivularia sp. PCC 7116]
Length = 166
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 38/79 (48%), Gaps = 3/79 (3%)
Query: 91 RRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFVID 150
R+ Y G N R+L +E+IA D + P +GR+A ++ +F L V D
Sbjct: 18 RQIYKGFNERNLDLWDEVIASDVEVRSTVGTMPLIGREALKNWAAQFQSGFRPRLDLV-D 76
Query: 151 DISAEDSSAVGVTWHLEWK 169
+I + + + V +L WK
Sbjct: 77 EIYGVNRATIAV--NLNWK 93
>gi|433646696|ref|YP_007291698.1| putative ester cyclase [Mycobacterium smegmatis JS623]
gi|433296473|gb|AGB22293.1| putative ester cyclase [Mycobacterium smegmatis JS623]
Length = 163
Score = 37.4 bits (85), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 38/83 (45%), Gaps = 5/83 (6%)
Query: 92 RFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQF-VID 150
++A +N RD ++ ++A D V+E P GR + + F + DL F V D
Sbjct: 33 EYFAAVNRRDFETLTNILAPDVVFEH---ESPIEGRDKLIAMLQAFCTAF-PDLTFSVTD 88
Query: 151 DISAEDSSAVGVTWHLEWKGKPF 173
I+ ED V W+G+ F
Sbjct: 89 QIAHEDEVVVFFVGSGTWRGEFF 111
>gi|186682424|ref|YP_001865620.1| hypothetical protein Npun_R2054 [Nostoc punctiforme PCC 73102]
gi|186464876|gb|ACC80677.1| protein of unknown function DUF1486 [Nostoc punctiforme PCC 73102]
Length = 298
Score = 37.0 bits (84), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 32/122 (26%), Positives = 50/122 (40%), Gaps = 17/122 (13%)
Query: 83 DGGGAVVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSIS 142
D A+V R FY N ++ ++EE+ D ++ PF G + DS
Sbjct: 4 DANKAIVRRYFYEVFNEGNITTLEEICGPDFIFTLPTHAEPFRGVEGYKGLVNMLIDSFP 63
Query: 143 SDLQFVIDDISAE-DSSAVGVTWHLEWKGKPFP-------------FSKGCSFYKLEVVN 188
+ F ++D+ AE D T GKPFP +G S+++ +VN
Sbjct: 64 G-IHFAMEDVLAEGDMVLTRWTARGNHTGKPFPTVIGDIPAQGKSFLLEGMSWHR--IVN 120
Query: 189 GK 190
GK
Sbjct: 121 GK 122
>gi|387875778|ref|YP_006306082.1| hypothetical protein W7S_11950 [Mycobacterium sp. MOTT36Y]
gi|443305484|ref|ZP_21035272.1| hypothetical protein W7U_07420 [Mycobacterium sp. H4Y]
gi|386789236|gb|AFJ35355.1| hypothetical protein W7S_11950 [Mycobacterium sp. MOTT36Y]
gi|442767048|gb|ELR85042.1| hypothetical protein W7U_07420 [Mycobacterium sp. H4Y]
Length = 323
Score = 37.0 bits (84), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 53/117 (45%), Gaps = 19/117 (16%)
Query: 2 LMTSSISSSSL----RTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRI 57
L+ S+SS+ L R SP+ + + F+ TI++ +P K+LA ++ R+
Sbjct: 106 LLAESVSSALLVVLDRLSPAQRVAFVLHDVFAMPFETIAALLHRSPDAAKKLASRARGRL 165
Query: 58 SSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCV 114
A A P T +++ VV F A G D+AS+ EL+A D V
Sbjct: 166 Q-------AAPAAQPRSTAGHLE--------VVEAFLAASRGGDIASLLELLAPDVV 207
>gi|300790982|ref|YP_003771273.1| hypothetical protein AMED_9182 [Amycolatopsis mediterranei U32]
gi|384154524|ref|YP_005537340.1| hypothetical protein RAM_47090 [Amycolatopsis mediterranei S699]
gi|399542860|ref|YP_006555522.1| hypothetical protein AMES_9045 [Amycolatopsis mediterranei S699]
gi|299800496|gb|ADJ50871.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340532678|gb|AEK47883.1| hypothetical protein RAM_47090 [Amycolatopsis mediterranei S699]
gi|398323630|gb|AFO82577.1| hypothetical protein AMES_9045 [Amycolatopsis mediterranei S699]
Length = 130
Score = 37.0 bits (84), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 3/79 (3%)
Query: 87 AVVVRRFYAGINGRDLASVEELIADDCVYE---DLIFPRPFLGRKATLDFFKKFSDSISS 143
A ++RR Y + D+A++ EL+A D V +F GR A L + + ++
Sbjct: 7 ATLIRRGYEAFSAGDVATLSELLAPDAVQHMPGHNVFSGDHKGRDAILAMYGQLAERSGG 66
Query: 144 DLQFVIDDISAEDSSAVGV 162
L+ ++++ A D V V
Sbjct: 67 TLKIELEEVYANDEEVVTV 85
>gi|295836440|ref|ZP_06823373.1| membrane protein [Streptomyces sp. SPB74]
gi|197698629|gb|EDY45562.1| membrane protein [Streptomyces sp. SPB74]
Length = 133
Score = 37.0 bits (84), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 16/43 (37%), Positives = 26/43 (60%)
Query: 88 VVVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKAT 130
+V+R F + RDL +VE L+A+D V+ + +P+ GR T
Sbjct: 2 IVMRAFREAVEARDLDAVEALLAEDVVFTSPVVHKPYRGRAIT 44
>gi|254822137|ref|ZP_05227138.1| hypothetical protein MintA_19539 [Mycobacterium intracellulare ATCC
13950]
Length = 317
Score = 36.6 bits (83), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 52/117 (44%), Gaps = 19/117 (16%)
Query: 2 LMTSSISSSSL----RTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRI 57
L+ S+SS+ L R SP+ + + F+ TI+ +P K+LA ++ R+
Sbjct: 100 LLAESVSSALLVVLDRLSPAQRVAFVLHDVFAMPFETIAELLHRSPDAAKKLASRARGRL 159
Query: 58 SSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCV 114
A A P T +++ VV F A G D+AS+ EL+A D V
Sbjct: 160 Q-------AAPAAQPRSTAGHLE--------VVEAFLAASRGGDIASLLELLAPDVV 201
>gi|379747183|ref|YP_005338004.1| hypothetical protein OCU_24640 [Mycobacterium intracellulare ATCC
13950]
gi|378799547|gb|AFC43683.1| hypothetical protein OCU_24640 [Mycobacterium intracellulare ATCC
13950]
Length = 323
Score = 36.6 bits (83), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 52/117 (44%), Gaps = 19/117 (16%)
Query: 2 LMTSSISSSSL----RTSPSSLLPCLNQTTFSYSNSTISSHFISTPTTRKRLAPLSKLRI 57
L+ S+SS+ L R SP+ + + F+ TI+ +P K+LA ++ R+
Sbjct: 106 LLAESVSSALLVVLDRLSPAQRVAFVLHDVFAMPFETIAELLHRSPDAAKKLASRARGRL 165
Query: 58 SSSENNRTAVDVASPLPTTSNVDGDDGGGAVVVRRFYAGINGRDLASVEELIADDCV 114
A A P T +++ VV F A G D+AS+ EL+A D V
Sbjct: 166 Q-------AAPAAQPRSTAGHLE--------VVEAFLAASRGGDIASLLELLAPDVV 207
>gi|358009461|pdb|3MHE|A Chain A, Crystal Structure Of Ketosteroid Isomerase P39a From
Pseudomonas Testosteroni (Tksi)
gi|358009462|pdb|3MHE|B Chain B, Crystal Structure Of Ketosteroid Isomerase P39a From
Pseudomonas Testosteroni (Tksi)
Length = 125
Score = 36.6 bits (83), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 22/83 (26%), Positives = 39/83 (46%), Gaps = 1/83 (1%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
VV+R+ A +N DL + L ADD ED + P G A +F+ S + ++
Sbjct: 10 VVQRYVAALNAGDLDGIVALFADDATVEDAVGSEPRSGTAAIREFYAN-SLKLPLAVELT 68
Query: 149 IDDISAEDSSAVGVTWHLEWKGK 171
+ + + +A T E++G+
Sbjct: 69 QEVRAVANEAAFAFTVSFEYQGR 91
>gi|6729912|pdb|1BUQ|A Chain A, Solution Structure Of Delta-5-3-Ketosteroid Isomerase
Complexed With The Steroid
19-Nortestosterone-Hemisuccinate
gi|6729913|pdb|1BUQ|B Chain B, Solution Structure Of Delta-5-3-Ketosteroid Isomerase
Complexed With The Steroid
19-Nortestosterone-Hemisuccinate
Length = 125
Score = 36.6 bits (83), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 39/83 (46%), Gaps = 1/83 (1%)
Query: 89 VVRRFYAGINGRDLASVEELIADDCVYEDLIFPRPFLGRKATLDFFKKFSDSISSDLQFV 148
VV+R+ A +N DL + L ADD ED + P G A +FF S + ++
Sbjct: 10 VVQRYVAALNAGDLDGIVALFADDATVEDPVGSEPRSGTAAIREFFAN-SLKLPLAVELT 68
Query: 149 IDDISAEDSSAVGVTWHLEWKGK 171
+ + + +A T E++G+
Sbjct: 69 QEVRAVANEAAFAFTVSFEFQGR 91
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.131 0.380
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,970,543,971
Number of Sequences: 23463169
Number of extensions: 121793611
Number of successful extensions: 360653
Number of sequences better than 100.0: 152
Number of HSP's better than 100.0 without gapping: 81
Number of HSP's successfully gapped in prelim test: 71
Number of HSP's that attempted gapping in prelim test: 360299
Number of HSP's gapped (non-prelim): 358
length of query: 194
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 59
effective length of database: 9,191,667,552
effective search space: 542308385568
effective search space used: 542308385568
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 72 (32.3 bits)