BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 031378
(160 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224085938|ref|XP_002307747.1| predicted protein [Populus trichocarpa]
gi|222857196|gb|EEE94743.1| predicted protein [Populus trichocarpa]
Length = 212
Score = 150 bits (379), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 75/123 (60%), Positives = 90/123 (73%), Gaps = 7/123 (5%)
Query: 36 LYIKRRLNC-----LLE--IVQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE 88
L+I LN LLE QEKFYNA T N+++T++V+EIVKFA +GNSYSSA E
Sbjct: 82 LHIANTLNSSFTFPLLEEFFKHQEKFYNAKTSNLSKTSIVEEIVKFATVAVGNSYSSAFE 141
Query: 89 SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
SGF+DR TDL TRVSFK+S +RGV+ TP FFVNGF L AGSPLDYNGWR +IDPL+ K
Sbjct: 142 SGFNDRQTDLKTRVSFKYSTSRGVFGTPFFFVNGFVLPDAGSPLDYNGWRSIIDPLVGAK 201
Query: 149 GKK 151
+
Sbjct: 202 SSQ 204
>gi|255539052|ref|XP_002510591.1| conserved hypothetical protein [Ricinus communis]
gi|223551292|gb|EEF52778.1| conserved hypothetical protein [Ricinus communis]
Length = 362
Score = 150 bits (379), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 69/105 (65%), Positives = 87/105 (82%), Gaps = 1/105 (0%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
QE+FYNA T+N++R +VVK+IV+FA+ IGNSY SA+ESGF+DR TDL TRVSFK+S +
Sbjct: 123 HQERFYNAQTRNLSRVSVVKDIVEFASVAIGNSYHSAIESGFNDRETDLQTRVSFKYSTS 182
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL-SEKGKKRE 153
RGV+ TPTF++NGF L AGS LDYNGWRK IDPL+ ++KGK E
Sbjct: 183 RGVFGTPTFYINGFVLPDAGSTLDYNGWRKFIDPLINAKKGKSEE 227
>gi|224061957|ref|XP_002300683.1| predicted protein [Populus trichocarpa]
gi|222842409|gb|EEE79956.1| predicted protein [Populus trichocarpa]
Length = 223
Score = 137 bits (344), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 71/120 (59%), Positives = 86/120 (71%), Gaps = 7/120 (5%)
Query: 36 LYIKRRLNC-----LLE--IVQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE 88
L+I LNC LLE QEKFY + T N+++ ++VKEIVKFA +G+SYSS L+
Sbjct: 104 LHIANILNCSFTFPLLEQFFKHQEKFYGSETSNLSKDSIVKEIVKFATVIVGDSYSSPLQ 163
Query: 89 SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
GF+D TDL TRVSFK+SA+RGVYATP FFVNGF L GAGS LDY WR +IDPL+ K
Sbjct: 164 FGFNDIQTDLKTRVSFKYSASRGVYATPFFFVNGFGLPGAGSALDYKVWRSIIDPLVGAK 223
>gi|8778978|gb|AAF79893.1|AC022472_2 Contains similarity to pigpen protein from Mus musculus gb|AF224264
and contains protein of unknown function DUF78 PF|01918
domain. ESTs gb|N38077, gb|BE037702, gb|AV442191,
gb|AV441368, gb|Z17998, gb|AV527266, gb|AV520794,
gb|AI997847, gb|AV543000 come from this gene
[Arabidopsis thaliana]
Length = 538
Score = 130 bits (327), Expect = 2e-28, Method: Composition-based stats.
Identities = 59/95 (62%), Positives = 71/95 (74%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
Q FYN+ TQ M+R AVV+E++K +GNSY S L+SGFS+ +DL TRVSFK+S +
Sbjct: 117 HQTLFYNSQTQLMSRPAVVEELIKLGTVTLGNSYHSPLKSGFSNSKSDLATRVSFKYSVS 176
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPL 144
RGV ATPTF+VNGF L GAGSP DY GWR IDPL
Sbjct: 177 RGVSATPTFYVNGFELPGAGSPKDYEGWRDTIDPL 211
>gi|22329686|ref|NP_683315.1| TRX domain-containing protein [Arabidopsis thaliana]
gi|17065544|gb|AAL32926.1| Unknown protein [Arabidopsis thaliana]
gi|24899723|gb|AAN65076.1| Unknown protein [Arabidopsis thaliana]
gi|332191831|gb|AEE29952.1| TRX domain-containing protein [Arabidopsis thaliana]
Length = 233
Score = 129 bits (323), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 66/120 (55%), Positives = 82/120 (68%), Gaps = 7/120 (5%)
Query: 36 LYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE 88
L+I LN LLE + Q FYN+ TQ M+R AVV+E++K +GNSY S L+
Sbjct: 97 LHIVDTLNANATFNLLEGIFKHQTLFYNSQTQLMSRPAVVEELIKLGTVTLGNSYHSPLK 156
Query: 89 SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
SGFS+ +DL TRVSFK+S +RGV ATPTF+VNGF L GAGSP DY GWR IDPL+ +
Sbjct: 157 SGFSNSKSDLATRVSFKYSVSRGVSATPTFYVNGFELPGAGSPKDYEGWRDTIDPLVKPQ 216
>gi|359492074|ref|XP_003634361.1| PREDICTED: uncharacterized protein LOC100854733 [Vitis vinifera]
Length = 234
Score = 128 bits (322), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 68/132 (51%), Positives = 89/132 (67%), Gaps = 9/132 (6%)
Query: 36 LYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE 88
L+I +LN LLE++ QE FYN T NM+RTA+V IVKF ++ +G S SA++
Sbjct: 105 LHIVNKLNSSATYHLLEMLFKHQEIFYNQITVNMSRTAIVDCIVKFVSKAVGESLFSAIK 164
Query: 89 SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
SGFSDR TDL TRVSFK+ +RGV TP FFVNGF L GS ++Y+ WR ++DPL+S+
Sbjct: 165 SGFSDRQTDLTTRVSFKYGCSRGVLGTPYFFVNGFPLPDPGSAINYSKWRSILDPLVSDS 224
Query: 149 GKKREVPLHLFL 160
K+ LH FL
Sbjct: 225 VKRET--LHFFL 234
>gi|297850426|ref|XP_002893094.1| hypothetical protein ARALYDRAFT_312946 [Arabidopsis lyrata subsp.
lyrata]
gi|297338936|gb|EFH69353.1| hypothetical protein ARALYDRAFT_312946 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 128 bits (321), Expect = 9e-28, Method: Composition-based stats.
Identities = 61/106 (57%), Positives = 75/106 (70%), Gaps = 1/106 (0%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
Q FYN+ TQ M+R AVV ++K +GNSY S L SGFS+ +DL TRVSFK+S +
Sbjct: 118 HQALFYNSQTQLMSRPAVVDNLIKLGTVTLGNSYHSPLISGFSNSKSDLATRVSFKYSVS 177
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKREVP 155
RGV ATPTF+VNGF L GAGSP DY GWR IDPL+ +K ++ E P
Sbjct: 178 RGVSATPTFYVNGFELPGAGSPKDYEGWRDTIDPLM-DKYQRVEKP 222
>gi|81076617|gb|ABB55396.1| unknown [Solanum tuberosum]
Length = 224
Score = 126 bits (317), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 65/132 (49%), Positives = 86/132 (65%), Gaps = 7/132 (5%)
Query: 36 LYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE 88
L+I +LN LLE QQ+KFYN T N+++ +VV E+VKF + IGNS +A++
Sbjct: 93 LHIVNKLNSSATFRLLEAFFDQQDKFYNQATFNLSKASVVDEVVKFTSNKIGNSNYAAVK 152
Query: 89 SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
+GF+D TD TR+SFK+ +GVY P FFVNGF L GSPLDY WR VIDPL+S +
Sbjct: 153 AGFTDPKTDQATRISFKYGCVKGVYGAPFFFVNGFPLPDGGSPLDYKTWRDVIDPLISPE 212
Query: 149 GKKREVPLHLFL 160
+ R L+ FL
Sbjct: 213 EQLRAENLNFFL 224
>gi|351726088|ref|NP_001235579.1| uncharacterized protein LOC100500267 precursor [Glycine max]
gi|255629877|gb|ACU15289.1| unknown [Glycine max]
Length = 236
Score = 126 bits (316), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 58/113 (51%), Positives = 82/113 (72%), Gaps = 3/113 (2%)
Query: 51 QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
QE FY A T+N++R ++++E+VK A + +G+SY +++GF+D +TD+ TRVSFK++A+R
Sbjct: 124 QENFYGAQTRNLSRASIIEEVVKSATQVVGSSYYKTIKNGFNDTTTDIQTRVSFKYAASR 183
Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKK---REVPLHLFL 160
GVY TP F+VNGF L G+ +DY WRKVIDPL+ K KK E LH FL
Sbjct: 184 GVYGTPFFYVNGFLLPDTGAAVDYKTWRKVIDPLVGAKNKKSIQNEESLHFFL 236
>gi|449450508|ref|XP_004143004.1| PREDICTED: uncharacterized protein LOC101216804 isoform 1 [Cucumis
sativus]
gi|449450510|ref|XP_004143005.1| PREDICTED: uncharacterized protein LOC101216804 isoform 2 [Cucumis
sativus]
gi|449521601|ref|XP_004167818.1| PREDICTED: uncharacterized protein LOC101231840 isoform 1 [Cucumis
sativus]
gi|449521603|ref|XP_004167819.1| PREDICTED: uncharacterized protein LOC101231840 isoform 2 [Cucumis
sativus]
Length = 236
Score = 125 bits (315), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 61/111 (54%), Positives = 78/111 (70%), Gaps = 2/111 (1%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
Q++FYNA T+ ++R A+V +VKF E +G+SY + L +GF+DR TDLLTRVSFKFS +
Sbjct: 128 DQKQFYNAETRYLSRAAIVDSMVKFGVEVLGDSYKNTLVTGFNDRETDLLTRVSFKFSTS 187
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKREVPLHLFL 160
RGVY TP FF+NGF GSPL+Y WR +IDPL+ K KR HL L
Sbjct: 188 RGVYGTPFFFINGFLAPDKGSPLNYTEWRNLIDPLI--KKNKRSGSQHLSL 236
>gi|351721154|ref|NP_001236688.1| uncharacterized protein LOC100527484 precursor [Glycine max]
gi|255632456|gb|ACU16578.1| unknown [Glycine max]
Length = 233
Score = 124 bits (312), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/133 (50%), Positives = 87/133 (65%), Gaps = 8/133 (6%)
Query: 36 LYIKRRLNC-----LLE--IVQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE 88
L+I LN LLE QEKFY A T+N++R ++V EIVK A E G+SY +A++
Sbjct: 101 LHIVNTLNASATFPLLEWFFKHQEKFYGAQTRNLSRASIVDEIVKSATEAAGSSYYNAIK 160
Query: 89 SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS-E 147
GF+D TD TRVSFK++A+RGVY TP+F+VNGF L G+ DY WRKVIDPL+ +
Sbjct: 161 HGFNDTKTDYQTRVSFKYAASRGVYGTPSFYVNGFLLPDTGATADYKTWRKVIDPLVGVK 220
Query: 148 KGKKREVPLHLFL 160
K + E L FL
Sbjct: 221 KSIQNEESLRFFL 233
>gi|15222928|ref|NP_177728.1| Thioredoxin superfamily protein [Arabidopsis thaliana]
gi|117168227|gb|ABK32196.1| At1g76020 [Arabidopsis thaliana]
gi|332197664|gb|AEE35785.1| Thioredoxin superfamily protein [Arabidopsis thaliana]
Length = 225
Score = 124 bits (310), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 56/96 (58%), Positives = 74/96 (77%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
Q FYNA TQ ++R AVV++IV+ +GNSY S L+SGFSD+ +D TRVSFK+SA+
Sbjct: 117 HQSLFYNAQTQLLSRPAVVEKIVELGTVSLGNSYQSVLKSGFSDKKSDRATRVSFKYSAS 176
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
RGVY TPTF+VNGF L+ A SP ++ GW+K+IDPL+
Sbjct: 177 RGVYGTPTFYVNGFVLSDAASPSNFGGWKKIIDPLV 212
>gi|409030182|gb|AFV07578.1| thioredoxin f5, partial [Solanum tuberosum]
Length = 206
Score = 123 bits (308), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 64/103 (62%), Positives = 75/103 (72%), Gaps = 7/103 (6%)
Query: 35 VLYIKRRLNC-----LLE--IVQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
L+I LN LLE QE+FYNA T N +R VVKEIVKFA+ +GNSY SA+
Sbjct: 104 TLHIANELNTSSTFPLLERFYKHQERFYNAQTHNKSRATVVKEIVKFASAAVGNSYCSAI 163
Query: 88 ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGS 130
ESG +DR TDL TRVSFK+SA+RGV+ATPTFFVNGF+L AGS
Sbjct: 164 ESGLNDRQTDLKTRVSFKYSASRGVFATPTFFVNGFALPDAGS 206
>gi|388493536|gb|AFK34834.1| unknown [Lotus japonicus]
Length = 239
Score = 123 bits (308), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/133 (47%), Positives = 91/133 (68%), Gaps = 8/133 (6%)
Query: 36 LYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE 88
L++ LN LLE++ QEKFY A T+N++R ++ +E VK A E IG+S+ ++++
Sbjct: 107 LHVVNALNSSATFPLLELLFKDQEKFYGAQTRNLSRASIQEEFVKSATEVIGSSFYTSVK 166
Query: 89 SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL-SE 147
+GF+D TD TR+SFK++A+RGVY TP F+VNGF L G +DY W+K+IDPL+ +E
Sbjct: 167 NGFNDSKTDHQTRISFKYAASRGVYGTPFFYVNGFLLPDVGDSIDYKTWKKIIDPLVGAE 226
Query: 148 KGKKREVPLHLFL 160
K K E LH FL
Sbjct: 227 KNVKNEESLHFFL 239
>gi|388504516|gb|AFK40324.1| unknown [Medicago truncatula]
Length = 230
Score = 122 bits (306), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 55/111 (49%), Positives = 83/111 (74%), Gaps = 1/111 (0%)
Query: 51 QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
Q F+ APT+N++R +V EIVKF + +GNSY +++++GF+D ++D LTR+SFK++A+R
Sbjct: 120 QAAFFGAPTRNLSRAFIVDEIVKFTSAIVGNSYHTSIKNGFNDTNSDHLTRISFKYAASR 179
Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL-SEKGKKREVPLHLFL 160
GV TP F+VNGF+L AG+ L+Y+ WR +IDPL+ ++K K E LH L
Sbjct: 180 GVSGTPFFYVNGFALPDAGNALNYSDWRSIIDPLIGAKKSTKNEESLHYLL 230
>gi|302141727|emb|CBI18930.3| unnamed protein product [Vitis vinifera]
Length = 514
Score = 119 bits (298), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 62/117 (52%), Positives = 80/117 (68%), Gaps = 7/117 (5%)
Query: 35 VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
L+I +LN LLE++ QE FYN T NM+RTA+V IVKF ++ +G S SA+
Sbjct: 104 ALHIVNKLNSSATYHLLEMLFKHQEIFYNQITVNMSRTAIVDCIVKFVSKAVGESLFSAI 163
Query: 88 ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPL 144
+SGFSDR TDL TRVSFK+ +RGV TP FFVNGF L GS ++Y+ WR ++DPL
Sbjct: 164 KSGFSDRQTDLTTRVSFKYGCSRGVLGTPYFFVNGFPLPDPGSAINYSKWRSILDPL 220
>gi|297842343|ref|XP_002889053.1| hypothetical protein ARALYDRAFT_895477 [Arabidopsis lyrata subsp.
lyrata]
gi|297334894|gb|EFH65312.1| hypothetical protein ARALYDRAFT_895477 [Arabidopsis lyrata subsp.
lyrata]
Length = 229
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 54/97 (55%), Positives = 71/97 (73%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
Q FYNA T +TR AVV++IV+ +G SY S L+SGFSD+ +D TRVSFK+S +
Sbjct: 118 HQPLFYNAQTNLLTRAAVVEKIVELGTVTLGKSYQSVLKSGFSDKKSDRATRVSFKYSGS 177
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
RGVY TPTF+VNGF L+ A SP ++ GW+K+IDPL+
Sbjct: 178 RGVYGTPTFYVNGFVLSDAASPSNFGGWKKIIDPLVQ 214
>gi|218202548|gb|EEC84975.1| hypothetical protein OsI_32232 [Oryza sativa Indica Group]
Length = 237
Score = 112 bits (281), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 61/127 (48%), Positives = 77/127 (60%), Gaps = 8/127 (6%)
Query: 35 VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
LYI +LN LLE+ Q KFYNA T +++ T + E+ K AA +GNS S
Sbjct: 108 ALYIANKLNSSSTYPLLELFFKSQGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-F 166
Query: 88 ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
+SGFSD TDL RVSFK+ TRGV P FFVNGF G GSP+DY+ W ++DPL +
Sbjct: 167 QSGFSDIRTDLAARVSFKYGCTRGVAGAPFFFVNGFLQPGGGSPIDYSTWVSILDPLAGQ 226
Query: 148 KGKKREV 154
G K E+
Sbjct: 227 HGDKLEM 233
>gi|115480453|ref|NP_001063820.1| Os09g0542200 [Oryza sativa Japonica Group]
gi|52076083|dbj|BAD46596.1| unknown protein [Oryza sativa Japonica Group]
gi|113632053|dbj|BAF25734.1| Os09g0542200 [Oryza sativa Japonica Group]
gi|215694060|dbj|BAG89259.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740808|dbj|BAG96964.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 237
Score = 112 bits (279), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 60/127 (47%), Positives = 77/127 (60%), Gaps = 8/127 (6%)
Query: 35 VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
LYI +LN LLE+ Q KFYNA T +++ T + E+ K AA +GNS S
Sbjct: 108 ALYIANKLNSSSTYPLLELFFKSQGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-F 166
Query: 88 ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
+SGFSD TDL RVSFK+ TRGV P FFVNGF G GSP+DY+ W ++DPL +
Sbjct: 167 QSGFSDIRTDLAARVSFKYGCTRGVAGAPFFFVNGFLQPGGGSPIDYSTWVSILDPLAGQ 226
Query: 148 KGKKREV 154
G + E+
Sbjct: 227 HGDRLEM 233
>gi|194697338|gb|ACF82753.1| unknown [Zea mays]
Length = 238
Score = 112 bits (279), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 60/127 (47%), Positives = 81/127 (63%), Gaps = 8/127 (6%)
Query: 35 VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
LYI +LN LLE+ QEKFYN+ T +++ +V + K AA+ +GNS S L
Sbjct: 109 ALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSGPSVALGMSKMAAQTVGNSVSEFL 168
Query: 88 ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
SGFSD TD RVSFK+ TRGV+ P FFVNGF G GSP+DY+ W ++DPL+S+
Sbjct: 169 -SGFSDGKTDSAARVSFKYGCTRGVFGAPFFFVNGFLQPGGGSPIDYSTWIGILDPLVSQ 227
Query: 148 KGKKREV 154
G++ E+
Sbjct: 228 NGERIEM 234
>gi|195655073|gb|ACG47004.1| hypothetical protein [Zea mays]
Length = 235
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/127 (47%), Positives = 80/127 (62%), Gaps = 8/127 (6%)
Query: 35 VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
LYI +LN LLE+ QEKFYN+ T +++ +V + K AA+ +GNS S L
Sbjct: 106 ALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSGPSVALGMSKMAAQTVGNSVSEFL 165
Query: 88 ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
SGFSD TD RVSFK+ TRGV+ P FFVNGF G GSP+DY+ W ++DPL S+
Sbjct: 166 -SGFSDGKTDSAARVSFKYGCTRGVFGAPFFFVNGFLQPGGGSPIDYSTWIGILDPLASQ 224
Query: 148 KGKKREV 154
G++ E+
Sbjct: 225 NGERIEM 231
>gi|242050028|ref|XP_002462758.1| hypothetical protein SORBIDRAFT_02g031470 [Sorghum bicolor]
gi|241926135|gb|EER99279.1| hypothetical protein SORBIDRAFT_02g031470 [Sorghum bicolor]
Length = 238
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 62/127 (48%), Positives = 82/127 (64%), Gaps = 8/127 (6%)
Query: 35 VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
LYI +LN LLE+ QEKFYN+ T +++ +V E+ K AA+ +GNS S L
Sbjct: 109 ALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSSPSVAVEMSKMAAQTVGNSVSEFL 168
Query: 88 ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
SGFSD TD RVSFK+ TRGVY P FFVNGF G GSP+DY+ W ++DPL+S+
Sbjct: 169 -SGFSDTKTDSAARVSFKYGCTRGVYGAPFFFVNGFLQPGGGSPIDYSTWIGILDPLVSQ 227
Query: 148 KGKKREV 154
G++ E+
Sbjct: 228 NGERIEM 234
>gi|326520974|dbj|BAJ92850.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 292
Score = 110 bits (274), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 58/127 (45%), Positives = 76/127 (59%), Gaps = 8/127 (6%)
Query: 35 VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
L+I +LN +LE+ Q KF N T +M+ TAV EI K AA+ +GNS S
Sbjct: 163 ALHIANKLNSSSTYPVLELFFKNQGKFSNRATSSMSSTAVTGEISKMAAQAVGNSVSD-F 221
Query: 88 ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
+SGFSD TD+ RVSFK+ TRGV P FFVNGF G GSP+DY W ++DPL++
Sbjct: 222 QSGFSDTRTDMAARVSFKYGCTRGVAGAPFFFVNGFLQPGGGSPIDYATWTSILDPLVAH 281
Query: 148 KGKKREV 154
+ E+
Sbjct: 282 HSQTIEM 288
>gi|226528876|ref|NP_001145190.1| uncharacterized protein LOC100278440 [Zea mays]
gi|195652481|gb|ACG45708.1| hypothetical protein [Zea mays]
Length = 238
Score = 108 bits (270), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 59/127 (46%), Positives = 80/127 (62%), Gaps = 8/127 (6%)
Query: 35 VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
LYI +LN LLE+ QEKFYN+ T +++ +V + K AA+ +GNS S L
Sbjct: 109 ALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSGPSVALGMSKMAAQTVGNSVSEFL 168
Query: 88 ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
SGFSD TD RVSFK+ TRGV+ P FFVNGF G GSP+DY+ W ++ PL+S+
Sbjct: 169 -SGFSDGKTDSAARVSFKYGCTRGVFGAPFFFVNGFLQPGGGSPIDYSTWIGILXPLVSQ 227
Query: 148 KGKKREV 154
G++ E+
Sbjct: 228 NGERIEM 234
>gi|357159750|ref|XP_003578547.1| PREDICTED: uncharacterized protein LOC100839255 [Brachypodium
distachyon]
Length = 238
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 55/127 (43%), Positives = 76/127 (59%), Gaps = 8/127 (6%)
Query: 35 VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
L+I +LN LLE+ Q KF N +++ TA+ EI K A + +GNS S
Sbjct: 109 ALHIANKLNSSSTYPLLELFFKNQGKFSNHALSSLSSTAITGEISKMAVQAVGNSVSE-F 167
Query: 88 ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
+SGFSD TD RVSFK+ TRGV P FFVNGF G GSP+DY W ++DP++++
Sbjct: 168 QSGFSDARTDSAARVSFKYGCTRGVAGAPFFFVNGFLQPGGGSPIDYTTWTNILDPIVAQ 227
Query: 148 KGKKREV 154
G+K ++
Sbjct: 228 HGEKTDM 234
>gi|222642012|gb|EEE70144.1| hypothetical protein OsJ_30190 [Oryza sativa Japonica Group]
Length = 216
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 51/104 (49%), Positives = 67/104 (64%), Gaps = 1/104 (0%)
Query: 51 QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
+ KFYNA T +++ T + E+ K AA +GNS S +SGFSD TDL RVSFK+ TR
Sbjct: 110 EGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-FQSGFSDIRTDLAARVSFKYGCTR 168
Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKREV 154
GV P FFVNGF G GSP+DY+ W ++DPL + G + E+
Sbjct: 169 GVAGAPFFFVNGFLQPGGGSPIDYSTWVSILDPLAGQHGDRLEM 212
>gi|116781410|gb|ABK22089.1| unknown [Picea sitchensis]
Length = 227
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 62/101 (61%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
Q +FYN T + +++ +I+ FA E GNS S+ ES F D +TD+ TR+SFK+ +
Sbjct: 122 HQARFYNKATLQVAPASIINQIIHFALEISGNSSSTMFESAFQDTTTDMATRISFKYGCS 181
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGK 150
R V TP FFVNG L +DY GW+ +IDPLL+ K +
Sbjct: 182 RAVTGTPYFFVNGIPLLNIEETIDYKGWKSIIDPLLAMKTR 222
>gi|8778813|gb|AAF79818.1|AC007396_19 T4O12.23 [Arabidopsis thaliana]
Length = 263
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/81 (56%), Positives = 58/81 (71%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
Q FYNA TQ ++R AVV++IV+ +GNSY S L+SGFSD+ +D TRVSFK+SA+
Sbjct: 117 HQSLFYNAQTQLLSRPAVVEKIVELGTVSLGNSYQSVLKSGFSDKKSDRATRVSFKYSAS 176
Query: 110 RGVYATPTFFVNGFSLAGAGS 130
RGVY TPTF+VNG S S
Sbjct: 177 RGVYGTPTFYVNGSSEPNESS 197
>gi|326491935|dbj|BAJ98192.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326493720|dbj|BAJ85321.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326504198|dbj|BAJ90931.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326511459|dbj|BAJ87743.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326520669|dbj|BAJ92698.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 226
Score = 95.5 bits (236), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 61/98 (62%), Gaps = 1/98 (1%)
Query: 51 QEKFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
QE +YN PT TR AVV EI A IG + +A ++GF+D +D TR+SFKF
Sbjct: 128 QEGYYNTPTYTKTRAAVVAEIANNLVAPVIGEANLAAYKAGFNDSQSDQATRISFKFGCA 187
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
RGV TP +FVNG L+ +GSP+DY W +DPL+ +
Sbjct: 188 RGVTGTPYYFVNGIPLSDSGSPMDYGKWISTLDPLVGK 225
>gi|409030172|gb|AFV07573.1| DsbA c3, partial [Solanum tuberosum]
Length = 194
Score = 95.1 bits (235), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 51/91 (56%), Positives = 62/91 (68%), Gaps = 7/91 (7%)
Query: 35 VLYIKRRLNC-----LLE--IVQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
L+I LN LLE QE+FYNA T N +R VVKEIV+FA+ +GNS SA+
Sbjct: 104 TLHIANELNTSSTFPLLERFYKHQERFYNAQTHNKSRATVVKEIVRFASAAVGNSCCSAI 163
Query: 88 ESGFSDRSTDLLTRVSFKFSATRGVYATPTF 118
ESG DR TDL TRVSF++SA+RGV+ TPTF
Sbjct: 164 ESGSYDRQTDLKTRVSFQYSASRGVFGTPTF 194
>gi|115457372|ref|NP_001052286.1| Os04g0227500 [Oryza sativa Japonica Group]
gi|38346025|emb|CAE01956.2| OSJNBb0071D01.2 [Oryza sativa Japonica Group]
gi|38346893|emb|CAE03918.2| OSJNBb0015G09.12 [Oryza sativa Japonica Group]
gi|113563857|dbj|BAF14200.1| Os04g0227500 [Oryza sativa Japonica Group]
gi|125546580|gb|EAY92719.1| hypothetical protein OsI_14471 [Oryza sativa Indica Group]
gi|125588738|gb|EAZ29402.1| hypothetical protein OsJ_13476 [Oryza sativa Japonica Group]
gi|215679384|dbj|BAG96524.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765592|dbj|BAG87289.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 226
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 63/99 (63%), Gaps = 2/99 (2%)
Query: 51 QEKFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
QE +YN PT TR V E+ K A IG + +A ++GF+D +D TR+SFK+
Sbjct: 127 QEGYYNQPTYGKTRATVDAEVAKNLVAPVIGEANLAAYKAGFNDSKSDQATRISFKYGCA 186
Query: 110 RGVYATPTFFVNGFSL-AGAGSPLDYNGWRKVIDPLLSE 147
RGV TP FFVNG L + +GSPL+YN W+ ++DPL+ +
Sbjct: 187 RGVTGTPYFFVNGIPLISDSGSPLEYNKWKSILDPLVGK 225
>gi|357168058|ref|XP_003581462.1| PREDICTED: uncharacterized protein LOC100834992 [Brachypodium
distachyon]
Length = 219
Score = 91.7 bits (226), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 60/98 (61%), Gaps = 1/98 (1%)
Query: 51 QEKFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
QE +YN PT +R VV EI A IG + +A ++GF+D +D TR+SFK+
Sbjct: 121 QEAYYNQPTYTKSRETVVAEITNNLVAPVIGEANLAAYKAGFNDSMSDQSTRISFKYGCA 180
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
RGV TP FFVNG L +GSP+DY+ W +DPL+ +
Sbjct: 181 RGVTGTPYFFVNGIPLGDSGSPVDYDKWISTLDPLVGK 218
>gi|242072508|ref|XP_002446190.1| hypothetical protein SORBIDRAFT_06g003190 [Sorghum bicolor]
gi|241937373|gb|EES10518.1| hypothetical protein SORBIDRAFT_06g003190 [Sorghum bicolor]
Length = 220
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 61/99 (61%), Gaps = 1/99 (1%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSA 108
QE +YN PT +R VV EI K A IG + +A ++GF+D +D+ TR+SFK
Sbjct: 121 DQEGYYNRPTYGKSRATVVDEITKNLVAPIIGETNLAAYKAGFNDSQSDMATRISFKNGC 180
Query: 109 TRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
RGV TP FFVNG + +GSPL+Y W ++DPL+ +
Sbjct: 181 ARGVTGTPYFFVNGIPINDSGSPLEYKYWISILDPLVGK 219
>gi|226491612|ref|NP_001151670.1| LOC100285305 precursor [Zea mays]
gi|195648623|gb|ACG43779.1| DSBA-like thioredoxin domain containing protein [Zea mays]
Length = 223
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 40/98 (40%), Positives = 57/98 (58%), Gaps = 1/98 (1%)
Query: 51 QEKFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
Q+++YN PT +R VV EI K IG + +A +GF+D +D TR+SFK
Sbjct: 125 QKRYYNQPTYEKSRATVVDEITKNLVVPIIGETNLAAFRAGFNDSHSDQATRISFKNGCA 184
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
RGV TP FFVNG + +GSPL+ W ++ PL+ +
Sbjct: 185 RGVTGTPYFFVNGIPINDSGSPLEXKDWISILRPLVGK 222
>gi|302791067|ref|XP_002977300.1| hypothetical protein SELMODRAFT_271243 [Selaginella moellendorffii]
gi|300154670|gb|EFJ21304.1| hypothetical protein SELMODRAFT_271243 [Selaginella moellendorffii]
Length = 226
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 47/121 (38%), Positives = 67/121 (55%), Gaps = 9/121 (7%)
Query: 35 VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVK-FAAE-GIGNSYSS 85
L+I LN LL+++ QE+F NA T + T + V+ E FA++ G GN S
Sbjct: 97 ALHIANHLNASLTYPLLDLIFEHQEEFSNANTADKTASTVIDEFSSLFASQFGSGNEAKS 156
Query: 86 ALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
+ GF D STD R+SFK+ +RGV TP FFVNG L+ + + W ++DPLL
Sbjct: 157 IFKQGFYDSSTDQAGRISFKYGCSRGVTGTPVFFVNGVPLSNVDASWGIDEWANILDPLL 216
Query: 146 S 146
+
Sbjct: 217 A 217
>gi|302780317|ref|XP_002971933.1| hypothetical protein SELMODRAFT_270941 [Selaginella moellendorffii]
gi|300160232|gb|EFJ26850.1| hypothetical protein SELMODRAFT_270941 [Selaginella moellendorffii]
Length = 226
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 47/121 (38%), Positives = 67/121 (55%), Gaps = 9/121 (7%)
Query: 35 VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVK-FAAE-GIGNSYSS 85
L+I LN LL+++ QE+F NA T + T + V+ E FA++ G GN S
Sbjct: 97 ALHIANHLNASLTYPLLDLIFEHQEEFSNANTGDKTASTVIDEFSSLFASQFGSGNEAKS 156
Query: 86 ALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
+ GF D STD R+SFK+ +RGV TP FFVNG L+ + + W ++DPLL
Sbjct: 157 IFKQGFYDSSTDQAGRISFKYGCSRGVTGTPVFFVNGVPLSNVDASWGIDEWANILDPLL 216
Query: 146 S 146
+
Sbjct: 217 A 217
>gi|32526672|dbj|BAC79195.1| unknown protein [Oryza sativa Japonica Group]
Length = 220
Score = 79.3 bits (194), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 65/127 (51%), Gaps = 28/127 (22%)
Query: 35 VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
LYI +LN LLE+ Q KFYNA T +++ T + E+ K AA +GNS S
Sbjct: 108 ALYIANKLNSSSTYPLLELFFKSQGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-F 166
Query: 88 ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
+SGFSD TDL RVSFK G GSP+DY+ W ++DPL +
Sbjct: 167 QSGFSDIRTDLAARVSFK--------------------PGGGSPIDYSTWVSILDPLAGQ 206
Query: 148 KGKKREV 154
G + E+
Sbjct: 207 HGDRLEM 213
>gi|413917905|gb|AFW57837.1| hypothetical protein ZEAMMB73_294031 [Zea mays]
Length = 212
Score = 72.0 bits (175), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 55/97 (56%), Gaps = 5/97 (5%)
Query: 52 EKFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
EKF+ ++ + + V EI K IG + +A +GF+D +D TR+SFK R
Sbjct: 119 EKFF----KDQSYSGTVDEITKNLVVPIIGETNLAAFRAGFNDSHSDQATRISFKNGCAR 174
Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
GV TP FFVNG + +GSPL+Y W ++ PL+ +
Sbjct: 175 GVTGTPYFFVNGIPINDSGSPLEYKDWISILGPLVGK 211
>gi|168037769|ref|XP_001771375.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677293|gb|EDQ63765.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 191
Score = 72.0 bits (175), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 57/102 (55%), Gaps = 4/102 (3%)
Query: 45 LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRV 102
LLE++ QE F N T + T VV ++ A + +G +S E F+D TD TR+
Sbjct: 92 LLELIFSNQESFLNPSTNHETPAHVVSRLISLA-DNLGFPTNS-FEVAFNDAVTDQATRI 149
Query: 103 SFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPL 144
SFK+ +RGV TPT+ VNG ++AGA W K+ DP+
Sbjct: 150 SFKYGCSRGVVGTPTYLVNGVAVAGADDSWSVEDWGKLFDPM 191
>gi|449450512|ref|XP_004143006.1| PREDICTED: uncharacterized protein LOC101216804 isoform 3 [Cucumis
sativus]
Length = 201
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 31/56 (55%), Positives = 43/56 (76%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 105
Q++FYNA T+ ++R A+V +VKF E +G+SY + L +GF+DR TDLLTRVSFK
Sbjct: 128 DQKQFYNAETRYLSRAAIVDSMVKFGVEVLGDSYKNTLVTGFNDRETDLLTRVSFK 183
>gi|196010617|ref|XP_002115173.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190582556|gb|EDV22629.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 217
Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 54/105 (51%), Gaps = 2/105 (1%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
Q +FYN PT M+ V++ K A + G S + + + D TR+S K +
Sbjct: 115 NQSEFYNKPTLKMSGLQVIQAYAKLAHKAAGIS-AMKFANALQNSGIDHTTRLSSKLAWE 173
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKREV 154
RGV TP+FF+NG + G + W+KVIDPLL K + E+
Sbjct: 174 RGVTGTPSFFINGIRVIGQSN-WTVKQWKKVIDPLLKTKFPRLEL 217
>gi|414886700|tpg|DAA62714.1| TPA: hypothetical protein ZEAMMB73_877337 [Zea mays]
Length = 280
Score = 66.2 bits (160), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 40/57 (70%)
Query: 98 LLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKREV 154
L +++ K+ TRGV+ P FFVNGF G GSP+DY+ W ++DPL+S+ G++ E+
Sbjct: 9 LAAKLTLKYGCTRGVFGAPFFFVNGFLEPGGGSPIDYSTWIGILDPLVSQNGERVEI 65
>gi|348690389|gb|EGZ30203.1| hypothetical protein PHYSODRAFT_553285 [Phytophthora sojae]
Length = 225
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 58/114 (50%), Gaps = 5/114 (4%)
Query: 46 LEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE---SGFSDRSTDLLT 100
LE+V QE+F+N T++ + V E+ + A + + E +G+ D T
Sbjct: 112 LEVVYEHQEEFWNKATKDKSAVQVTNELKQLAQKTFPDLTDQQWERGMTGYGGTEADQHT 171
Query: 101 RVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKREV 154
RV++K++ TR V TP + +NG A S WRKVIDPL+ ++ ++
Sbjct: 172 RVTWKYACTRTVTGTPQYTLNGVPFEDADSSWKLEDWRKVIDPLVKPNQERDDL 225
>gi|320169829|gb|EFW46728.1| hypothetical protein CAOG_04686 [Capsaspora owczarzaki ATCC 30864]
Length = 261
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 52/98 (53%), Gaps = 3/98 (3%)
Query: 51 QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
Q+ F + PT N+T V+ + + IG + L+ G TD R+++K+ TR
Sbjct: 117 QDSFEDDPTVNLTSNQVINMYAALS-QTIGVPAAVFLK-GMDSDDTDESARIAWKYGCTR 174
Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
GV TP FFVNG S+A A S + W V+DPLL+
Sbjct: 175 GVAGTPWFFVNGISVA-ASSAWSVSDWVSVLDPLLNSS 211
>gi|154420795|ref|XP_001583412.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121917653|gb|EAY22426.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 198
Score = 59.3 bits (142), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 4/110 (3%)
Query: 37 YIKRRLNCLLEIVQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRST 96
Y LNCL Q F + +++T + K+ +K+AAE +G + + L F +
Sbjct: 93 YAPILLNCLYADGNQSMFLGSEVKSVTAGDMQKKALKWAAEKLGITVND-LTKAFG--TQ 149
Query: 97 DLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
++ TR+ FK+SA + TPTF++NG + + S + W VIDPLL+
Sbjct: 150 EMNTRIEFKYSAVHNIDGTPTFYINGVA-SDLSSDSTIDDWSNVIDPLLN 198
>gi|390354820|ref|XP_003728415.1| PREDICTED: uncharacterized protein LOC100888511 [Strongylocentrotus
purpuratus]
Length = 219
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 52/99 (52%), Gaps = 2/99 (2%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
+Q YNA + +V + ++A++ +G S + + T L RV FK+ AT
Sbjct: 119 KQAALYNANITEQPDSYIVNALSEWASD-VGYSKAQFMMHLSPSDPTQSLARVEFKYGAT 177
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
RGV+ TP F+NG ++ + W++VIDPLL+E
Sbjct: 178 RGVFGTPQTFINGVVVSSKPN-WTLGDWKQVIDPLLAEN 215
>gi|118386030|ref|XP_001026137.1| hypothetical protein TTHERM_01145000 [Tetrahymena thermophila]
gi|89307904|gb|EAS05892.1| hypothetical protein TTHERM_01145000 [Tetrahymena thermophila
SB210]
Length = 230
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 53/106 (50%), Gaps = 3/106 (2%)
Query: 51 QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
QE+++N P N TR AV ++ + + + + ++R+ DL R S+K+ +R
Sbjct: 114 QEQYWNGPVTNYTRNAVFNKLAHQLHDFVPKISINETLAALTNRTYDLEARYSWKYGVSR 173
Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL--SEKGKKREV 154
V TP FF+NG L GA S + W + + P + GK + V
Sbjct: 174 EVAGTPYFFLNGVQLDGAES-FSSSTWAETLLPYIQADSNGKPKSV 218
>gi|440791659|gb|ELR12897.1| hypothetical protein ACA1_095270 [Acanthamoeba castellanii str.
Neff]
Length = 212
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 53/96 (55%), Gaps = 1/96 (1%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
QQ+ + + TQ + V+ ++ ++A ++ +++ + +LL RV++K +A+
Sbjct: 117 QQDNWGDEATQGIAPLQVITQMSRYAQACCKVDPAAFVKAMGWVSAQNLLARVTWKRAAS 176
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
G+Y TP F +NG S + A WRK++DPLL
Sbjct: 177 LGIYGTPQFMINGVS-SSADESWTMADWRKLLDPLL 211
>gi|307945890|ref|ZP_07661226.1| conserved hypothetical protein [Roseibium sp. TrichSKD4]
gi|307771763|gb|EFO30988.1| conserved hypothetical protein [Roseibium sp. TrichSKD4]
Length = 191
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 51/99 (51%), Gaps = 4/99 (4%)
Query: 49 VQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE-SGFSDRSTDLLTRVSFKFS 107
+Q +YN P + T ++ I FA + G + L+ G ++ TD R +F+
Sbjct: 95 ARQSDYYNGPFLHKTHQDLLNLIADFAHDFAGFNKDEMLKLIGTNEVYTD--ARTPIRFA 152
Query: 108 ATRGVYATPTFFVNGFSLAGAGS-PLDYNGWRKVIDPLL 145
+T+GV+ATPTFF+N L G S WR VIDP L
Sbjct: 153 STKGVWATPTFFLNSSDLVGKFSEEPSLEEWRAVIDPRL 191
>gi|301118869|ref|XP_002907162.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262105674|gb|EEY63726.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 298
Score = 53.1 bits (126), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 53/98 (54%), Gaps = 4/98 (4%)
Query: 51 QEKFYNAPTQNMTRTAVVKEIVKFAAE---GIGNSYSSALESGFSDRSTDLLTRVSFKFS 107
Q+ F+N T++++ VV+++ K A + + + +G+ D TR S+K++
Sbjct: 117 QDIFWNKATKDLSPIQVVEKLKKLAQKTFPSLTDKQWDEQMTGYGGTDVDDHTRESWKYT 176
Query: 108 ATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
+RG+ TP + +NG A A + + W KVIDPL+
Sbjct: 177 CSRGMSGTPMYTLNGVPFA-ADADWTFEQWYKVIDPLV 213
>gi|405953203|gb|EKC20909.1| Eppin [Crassostrea gigas]
Length = 403
Score = 53.1 bits (126), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 49/95 (51%), Gaps = 3/95 (3%)
Query: 52 EKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATRG 111
+ N T++M+ ++ + AA G+G S ++ +D D RV +K++ TRG
Sbjct: 119 DTLSNTATKSMSEIQIISMLGDIAA-GLGIQKSVFVQK-MADPLVDEDARVEWKYTCTRG 176
Query: 112 VYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
+ TP F +N +A S WRKVIDPLLS
Sbjct: 177 ISGTPMFTINDVIVAADAS-WSLEDWRKVIDPLLS 210
>gi|348690388|gb|EGZ30202.1| hypothetical protein PHYSODRAFT_477416 [Phytophthora sojae]
Length = 303
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 53/98 (54%), Gaps = 4/98 (4%)
Query: 51 QEKFYNAPTQNMTRTAVVKEIVKFAAE---GIGNSYSSALESGFSDRSTDLLTRVSFKFS 107
Q+ F+N T++++ VV ++ K A + + + L +G+ D TR S+K++
Sbjct: 121 QDIFWNKVTKDLSPVQVVAKLKKLAQKTFPSLTDEQWDKLMTGYGGTDVDDHTRESWKYT 180
Query: 108 ATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
+RG+ TP + +NG A + ++ W KVIDPL+
Sbjct: 181 CSRGMSGTPMYTLNGVPFE-ADADWTFDQWFKVIDPLV 217
>gi|156379454|ref|XP_001631472.1| predicted protein [Nematostella vectensis]
gi|156218513|gb|EDO39409.1| predicted protein [Nematostella vectensis]
Length = 214
Score = 52.4 bits (124), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 45/95 (47%), Gaps = 3/95 (3%)
Query: 51 QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
Q + YN T N R ++ + A + S +++G + +D R+ +K R
Sbjct: 122 QSELYNFQTMNKNRYDILNIVSSLAPKA--GVPSDIMKTGLTGTESDGAARIGWKHGCLR 179
Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
V TPTFF+NG + A S W+ VIDPLL
Sbjct: 180 TVAGTPTFFINGIPVE-ADSSWTVQQWKDVIDPLL 213
>gi|299115356|emb|CBN74181.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 262
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 46/98 (46%), Gaps = 1/98 (1%)
Query: 49 VQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSA 108
+ Q++ N T+ MT+ + + +A G S + G +D ++ TR FK+
Sbjct: 148 IGQDEISNDSTELMTQPQIEDILESWATSSSGMS-ADNFHMGMADAEVEIQTRNQFKYGC 206
Query: 109 TRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
GV TP ++ G +G + W+ ++DPLLS
Sbjct: 207 LHGVDGTPQVYIGGILASGLDGDATFQDWQDILDPLLS 244
>gi|390368255|ref|XP_790435.2| PREDICTED: uncharacterized protein LOC585517 isoform 2
[Strongylocentrotus purpuratus]
gi|390368257|ref|XP_003731415.1| PREDICTED: uncharacterized protein LOC585517 isoform 1
[Strongylocentrotus purpuratus]
Length = 286
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 53/98 (54%), Gaps = 2/98 (2%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
+Q +A + ++K + ++A+E +G S + + T L RV FK+ AT
Sbjct: 183 KQAALSDANISEQPDSYILKVLSEWASE-VGYSKAQFMMHLSRSDPTQSLARVEFKYGAT 241
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
RGV+ TP F+NG ++ + + W++VIDPLL++
Sbjct: 242 RGVFQTPQTFING-AVVSSEPRWTLSDWKQVIDPLLAK 278
>gi|254423276|ref|ZP_05036994.1| hypothetical protein S7335_3432 [Synechococcus sp. PCC 7335]
gi|196190765|gb|EDX85729.1| hypothetical protein S7335_3432 [Synechococcus sp. PCC 7335]
Length = 191
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 43/96 (44%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
+Q +F N ++ T+T + + AAE S + R+ +F+
Sbjct: 96 KQPEFANEAWKDKTQTEFHTFLAECAAEATAYKDKEQFLKLLSSKEIYAQARIPARFAIV 155
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
RGV++TPTFF+NG S WR VID LL
Sbjct: 156 RGVWSTPTFFINGAEATTLSSSSSVQDWRSVIDDLL 191
>gi|123446098|ref|XP_001311803.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121893626|gb|EAX98873.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 199
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 49/98 (50%), Gaps = 7/98 (7%)
Query: 51 QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
Q +F N P+ T V +K+AA+ I + + F+ +L RV FK+S +
Sbjct: 107 QGQFENDPS--FTENTVTDNCIKYAAK-ICQLSEDDIRNAFATIDINLGARVEFKYSTSH 163
Query: 111 GVYATPTFFVNGF--SLAGAGSPLDYNGWRKVIDPLLS 146
GV TP FFVNG + AGS +D W +D LL+
Sbjct: 164 GVDGTPYFFVNGVVTNDVDAGSSID--DWSLYLDNLLN 199
>gi|260803503|ref|XP_002596629.1| hypothetical protein BRAFLDRAFT_78475 [Branchiostoma floridae]
gi|229281888|gb|EEN52641.1| hypothetical protein BRAFLDRAFT_78475 [Branchiostoma floridae]
Length = 154
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 55/105 (52%), Gaps = 6/105 (5%)
Query: 48 IVQQEKFYNAP-TQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLL--TRVSF 104
I +Q+ Y P T +M+ T ++ ++ K A E IG S S S +++ + RV++
Sbjct: 9 IFEQQDDYKMPRTNDMSDTQIIDKLAKVA-ESIGVS-SKNFTSQVNNQEHMVYEDARVAW 66
Query: 105 KFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKG 149
K+ RGV TP + +NG + A W+++ID LL ++G
Sbjct: 67 KYGCIRGVAGTPWYLLNGVPV-NASPNWTVAQWKQIIDSLLKQQG 110
>gi|260820710|ref|XP_002605677.1| hypothetical protein BRAFLDRAFT_77925 [Branchiostoma floridae]
gi|229291012|gb|EEN61687.1| hypothetical protein BRAFLDRAFT_77925 [Branchiostoma floridae]
Length = 292
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 37/69 (53%), Gaps = 4/69 (5%)
Query: 89 SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSL----AGAGSPLDYNGWRKVIDPL 144
+G + S + RV FK++ RGV ATPT+FVNG + A G W++ +D +
Sbjct: 164 TGLDESSPNHRARVEFKYACHRGVAATPTYFVNGIMVNPVPATEGGEFGLEQWKETLDSM 223
Query: 145 LSEKGKKRE 153
L ++++
Sbjct: 224 LGTAEQRQQ 232
>gi|390343493|ref|XP_003725885.1| PREDICTED: uncharacterized protein LOC100891241 [Strongylocentrotus
purpuratus]
Length = 239
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 1/46 (2%)
Query: 101 RVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
RV FK+ ATRGV+ TP +VNG ++ + S WR +IDPL+S
Sbjct: 169 RVEFKYGATRGVFGTPQTYVNG-AVVYSDSSWTLADWRNIIDPLIS 213
>gi|340501301|gb|EGR28101.1| hypothetical protein IMG5_183230 [Ichthyophthirius multifiliis]
Length = 227
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 48/96 (50%), Gaps = 7/96 (7%)
Query: 61 NMTRTAVVKEIVKFAAEGIGN---SYS---SALESGFSDRSTDLLTRVSFKFSATRGVYA 114
N T+T + EI + A+ + YS S + +D++ + TR S+K GV
Sbjct: 124 NKTKTLNLNEIYQKIADDTQSFLKEYSIKSSDVIDALNDQNYNQETRQSWKIGCGNGVSG 183
Query: 115 TPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGK 150
TP F++NG + GA + D W+K ++ L +K K
Sbjct: 184 TPIFYLNGVRIDGAET-FDVQNWKKFLEEYLQQKEK 218
>gi|444379557|ref|ZP_21178734.1| hypothetical protein D515_3633 [Enterovibrio sp. AK16]
gi|443676286|gb|ELT82990.1| hypothetical protein D515_3633 [Enterovibrio sp. AK16]
Length = 191
Score = 45.8 bits (107), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 9/100 (9%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDR--STDLLT--RVSFK 105
+Q+ FYN + T + + +A E + F +R S+D+ R +
Sbjct: 96 RQDTFYNGQFLHKTHEDLRNLVADYAVE-----HGKVDREAFIERMDSSDVYEQGRTPIR 150
Query: 106 FSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
++ATR V+ATPT F+N L W+ IDPLL
Sbjct: 151 YAATRSVWATPTVFINNAGLVPVDHHSSLEDWQAAIDPLL 190
>gi|449017941|dbj|BAM81343.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 279
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 49/106 (46%), Gaps = 6/106 (5%)
Query: 49 VQQEKFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALE-----SGFSDRSTDLLTRV 102
QQE + + + TR V +E+ + A G+ +S S E SG S + +
Sbjct: 171 AQQEDWVDEKVLDKTRRQVYEELAQCLAKAGVVSSASLITERLLTASGNSGNAVGADVKF 230
Query: 103 SFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
+ ++ RGV+ TPT +NG A S W++ ID LLS +
Sbjct: 231 ACRYHRVRGVHVTPTVHINGIPELNASSSWTVQEWKERIDSLLSSE 276
>gi|260820750|ref|XP_002605697.1| hypothetical protein BRAFLDRAFT_77946 [Branchiostoma floridae]
gi|229291032|gb|EEN61707.1| hypothetical protein BRAFLDRAFT_77946 [Branchiostoma floridae]
Length = 292
Score = 45.4 bits (106), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 4/69 (5%)
Query: 89 SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSL----AGAGSPLDYNGWRKVIDPL 144
+G + S + RV FK++ R V ATPT+FVNG + A G W++ +D +
Sbjct: 164 TGLDESSPNHRARVEFKYACHRSVAATPTYFVNGIMVNPVPATVGGSFGLEQWKETLDSM 223
Query: 145 LSEKGKKRE 153
L +++E
Sbjct: 224 LGTAEQRQE 232
>gi|440795402|gb|ELR16524.1| hypothetical protein ACA1_146580 [Acanthamoeba castellanii str.
Neff]
Length = 194
Score = 44.3 bits (103), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 28/48 (58%)
Query: 99 LTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
L + K+ G++A+PTF VNG + A S WR+++DPLL+
Sbjct: 147 LIKWEQKYGRQNGIHASPTFLVNGLVVGQASSGWTLEQWRELLDPLLA 194
>gi|290978395|ref|XP_002671921.1| predicted protein [Naegleria gruberi]
gi|284085494|gb|EFC39177.1| predicted protein [Naegleria gruberi]
Length = 244
Score = 44.3 bits (103), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 42/92 (45%), Gaps = 9/92 (9%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIG----NSYSSALESGFSDRSTDLLTRVSFK 105
Q YN M ++ + K I + + N Y + + G S D+ TR FK
Sbjct: 147 NQSPLYNNIVSEMKQSEMYKLIYTNNIQSLNVISWNDYLNLMLDG----SIDMRTRTLFK 202
Query: 106 FSATRGVYATPTFFVNGFSLAGAGSPLDYNGW 137
S GV+ATP+F+VNG ++ G +N W
Sbjct: 203 LSTANGVFATPSFYVNGVAIQN-GENFTFNDW 233
>gi|172055207|ref|YP_001806534.1| hypothetical protein cce_5122 [Cyanothece sp. ATCC 51142]
gi|354556960|ref|ZP_08976239.1| DSBA oxidoreductase [Cyanothece sp. ATCC 51472]
gi|171701488|gb|ACB54468.1| unknown [Cyanothece sp. ATCC 51142]
gi|353551086|gb|EHC20503.1| DSBA oxidoreductase [Cyanothece sp. ATCC 51472]
Length = 191
Score = 43.5 bits (101), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 27/45 (60%)
Query: 101 RVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
R+ +F+A RGV++TPTFF+NG S + W+ I+ LL
Sbjct: 147 RIPARFAAIRGVWSTPTFFINGAQTTDLSSQSNLQDWQDKINSLL 191
>gi|422295502|gb|EKU22801.1| hypothetical protein NGA_0488300 [Nannochloropsis gaditana CCMP526]
Length = 228
Score = 43.1 bits (100), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 25/102 (24%), Positives = 44/102 (43%), Gaps = 1/102 (0%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
+Q + N TQ MT AV + + E G + + + R SFK++
Sbjct: 127 RQIQVSNELTQGMTYKAVEGVVAAWVQEATGLE-REVVRRKLQEGGVEEAVRFSFKYATI 185
Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKK 151
R ++ TP F +N + G D W + + PLL++ ++
Sbjct: 186 RSIFGTPMFVINQMLVPGLDGGSDVMDWEEYLAPLLTQMARE 227
>gi|422293098|gb|EKU20398.1| hypothetical protein NGA_0546800, partial [Nannochloropsis gaditana
CCMP526]
Length = 199
Score = 42.7 bits (99), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 43/99 (43%), Gaps = 11/99 (11%)
Query: 52 EKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE---------SGFSDRSTDLLTRV 102
E+F + T+ MTR V+ ++ E ALE SG +T +
Sbjct: 103 ERFSDVNTEEMTRKEVMAALIALGVEA--GVPRPALEERLALQGGSSGNKGTATTQAVKW 160
Query: 103 SFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVI 141
+ K+ RGV+ TPT FVNG + S W++++
Sbjct: 161 ACKYHRRRGVHVTPTVFVNGLEASEVSSAWGVEEWKRLL 199
>gi|291237332|ref|XP_002738589.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 270
Score = 42.7 bits (99), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 22/87 (25%), Positives = 36/87 (41%)
Query: 61 NMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFV 120
N T + ++ + + A N + F D T L ++ + RGVY TP F +
Sbjct: 133 NTTESDIINALAELATGLSSNITRDSFYDKFEDEKTGHLCVYEWQSAILRGVYQTPWFLI 192
Query: 121 NGFSLAGAGSPLDYNGWRKVIDPLLSE 147
N + + W +IDPLL +
Sbjct: 193 NDMPILDFRPDWTLSNWTAIIDPLLED 219
>gi|313227081|emb|CBY22228.1| unnamed protein product [Oikopleura dioica]
Length = 217
Score = 42.4 bits (98), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 20/60 (33%), Positives = 31/60 (51%)
Query: 89 SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
+D + D TRV +KF+ + GV TP F+N L + W KV+DP++ +K
Sbjct: 151 DNMADPNLDWATRVDWKFACSLGVSGTPMPFINRVFLDQGVAEFTLADWTKVLDPIVGQK 210
>gi|291237330|ref|XP_002738588.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 215
Score = 42.0 bits (97), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 36/76 (47%), Gaps = 2/76 (2%)
Query: 71 IVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGS 130
+ FA E +G + LE ++ T+ L R K + RGVY P F VNG ++
Sbjct: 142 LTDFAVE-VGFTEEEFLEE-YNHPKTNQLCRHEMKMANDRGVYGGPWFVVNGMTVLDYYP 199
Query: 131 PLDYNGWRKVIDPLLS 146
D W +IDPLL
Sbjct: 200 YWDVEDWISLIDPLLG 215
>gi|413936627|gb|AFW71178.1| hypothetical protein ZEAMMB73_653237 [Zea mays]
Length = 340
Score = 42.0 bits (97), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 37/69 (53%), Gaps = 15/69 (21%)
Query: 86 ALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
L SG DR+T++ ++ K A + L G GSP+DY+ W ++DPL+
Sbjct: 133 GLRSGSEDRATEIFKKI-IKLLA--------------YKLPGGGSPIDYSTWIGILDPLV 177
Query: 146 SEKGKKREV 154
S+ G++ E+
Sbjct: 178 SQNGERVEM 186
>gi|124088536|ref|XP_001347136.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|145474255|ref|XP_001423150.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|50057525|emb|CAH03509.1| Conserved hypothetical protein [Paramecium tetraurelia]
gi|124390210|emb|CAK55752.1| unnamed protein product [Paramecium tetraurelia]
Length = 214
Score = 42.0 bits (97), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 42/93 (45%), Gaps = 4/93 (4%)
Query: 52 EKFYNAPTQNMTRTAVVKEIVKFAAEGIGN---SYSSALESGFSDRSTDLLTRVSFKFSA 108
E+F + T N+ +T V ++I + +Y L S ++ TR S+K+
Sbjct: 116 EQFTDLATLNLKQTEVNQKIADLVKTQLAQYQINYDELLNSMKPGTPENIETRYSWKYGT 175
Query: 109 TRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVI 141
+R V TPT FVNG L G L W + I
Sbjct: 176 SRSVSGTPTIFVNGV-LFDKGEELSAQQWIQYI 207
>gi|452822422|gb|EME29441.1| thioredoxin-like protein [Galdieria sulphuraria]
Length = 244
Score = 41.6 bits (96), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 31/63 (49%)
Query: 89 SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
+G L + + K+S GV+ +PT+FVNG AGS + W + + PLL +
Sbjct: 182 TGNDGNQVQSLMKFAIKYSRKLGVHISPTYFVNGIEDTAAGSAWNKEEWLERLRPLLQQG 241
Query: 149 GKK 151
K
Sbjct: 242 HTK 244
>gi|444919615|ref|ZP_21239615.1| Periplasmic thiol disulfide interchange protein DsbA [Cystobacter
fuscus DSM 2262]
gi|444708167|gb|ELW49260.1| Periplasmic thiol disulfide interchange protein DsbA [Cystobacter
fuscus DSM 2262]
Length = 652
Score = 41.6 bits (96), Expect = 0.096, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 41/70 (58%), Gaps = 9/70 (12%)
Query: 81 NSYSSALESG-FSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRK 139
N + +AL+SG F+ + T + + GV TP+FF+NG SL GA P+D +++
Sbjct: 584 NKFKAALDSGKFTAQVTADMAE-----AGRVGVTGTPSFFINGRSLVGA-QPID--AFKR 635
Query: 140 VIDPLLSEKG 149
VID L +KG
Sbjct: 636 VIDEELKKKG 645
>gi|443723886|gb|ELU12105.1| hypothetical protein CAPTEDRAFT_202202 [Capitella teleta]
Length = 208
Score = 41.6 bits (96), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 17/44 (38%), Positives = 27/44 (61%)
Query: 103 SFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
++K++A RG TP F VN LA GS + W +++DPL++
Sbjct: 165 TWKYAALRGTAGTPWFHVNNVDLASGGSSPTFEEWVELLDPLIN 208
>gi|374606371|ref|ZP_09679248.1| DSBA oxidoreductase [Paenibacillus dendritiformis C454]
gi|374388016|gb|EHQ59461.1| DSBA oxidoreductase [Paenibacillus dendritiformis C454]
Length = 216
Score = 41.2 bits (95), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 41/95 (43%), Gaps = 2/95 (2%)
Query: 53 KFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATRGV 112
+ Y A + A +V FA + + ++ LE DR+ + +A V
Sbjct: 122 ELYRAQQEERKDWATAPFLVDFAQRTVPDLDTAQLEKALQDRTMRDEVMKDEQQAAQAQV 181
Query: 113 YATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
+ TPT FV+G G LDY G + +ID L +
Sbjct: 182 HGTPTVFVDGVEF--NGDYLDYEGLKAMIDKALEQ 214
>gi|383454942|ref|YP_005368931.1| DSBA-like thioredoxin domain-containing protein [Corallococcus
coralloides DSM 2259]
gi|380733594|gb|AFE09596.1| DSBA-like thioredoxin domain-containing protein [Corallococcus
coralloides DSM 2259]
Length = 659
Score = 41.2 bits (95), Expect = 0.16, Method: Composition-based stats.
Identities = 29/80 (36%), Positives = 42/80 (52%), Gaps = 11/80 (13%)
Query: 73 KFAAEGIG---NSYSSALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAG 129
K+A E IG N + + L+SG D+ + + +G TPTFF+NG +L GA
Sbjct: 580 KYAQE-IGLNVNKFKTDLDSGKYDKQIE----ADMADGSAKGANGTPTFFINGRTLVGA- 633
Query: 130 SPLDYNGWRKVIDPLLSEKG 149
P D ++KVID L + G
Sbjct: 634 QPFD--AFKKVIDEELKKAG 651
>gi|313220976|emb|CBY31809.1| unnamed protein product [Oikopleura dioica]
Length = 217
Score = 40.8 bits (94), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 30/57 (52%)
Query: 92 SDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
+D + D TRV +KF + GV TP F+N L + W KV+DP++ +K
Sbjct: 154 ADPNLDWATRVDWKFVCSLGVSGTPMPFINRVFLDQGVAEFTLADWTKVLDPIVGQK 210
>gi|126659892|ref|ZP_01731017.1| glutathione reductase [Cyanothece sp. CCY0110]
gi|126618855|gb|EAZ89599.1| glutathione reductase [Cyanothece sp. CCY0110]
Length = 191
Score = 40.0 bits (92), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 26/45 (57%)
Query: 101 RVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
R+ +F+A +GV++TPTFF+NG S W+ I+ LL
Sbjct: 147 RIPARFAALQGVWSTPTFFINGAQTTDLSSQSSLQDWQDKINSLL 191
>gi|406893247|gb|EKD38362.1| disulfide bond formation protein D, selenocysteine-containing
[uncultured bacterium]
Length = 142
Score = 39.7 bits (91), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 38/68 (55%), Gaps = 7/68 (10%)
Query: 83 YSSALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVID 142
Y+ +++SG + D K + + +Y TPTF++NG + G P DY ++K+ID
Sbjct: 82 YTESIDSGRQAKEIDR----DLKLAESMDLYNTPTFYINGRQVIGE-RPFDY--FKKIID 134
Query: 143 PLLSEKGK 150
L++ G+
Sbjct: 135 EELNQAGR 142
>gi|323452082|gb|EGB07957.1| hypothetical protein AURANDRAFT_6387 [Aureococcus anophagefferens]
Length = 173
Score = 38.9 bits (89), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 24/95 (25%), Positives = 44/95 (46%), Gaps = 7/95 (7%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFA-----AEGIGNSYSSALESGF--SDRSTDLLTRV 102
+Q +F++A +M+R + +V+ A AE + + + G + + L +
Sbjct: 79 RQVEFFDAHIWDMSRAQIYDALVEIASEFVDAEALRAKLARTIVEGSLNTGNAATLDLKW 138
Query: 103 SFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGW 137
+ K+ RGV+ TPT F+NG S D + W
Sbjct: 139 ATKYHRVRGVHVTPTVFLNGVEAPDVSSGWDADQW 173
>gi|390368260|ref|XP_003731416.1| PREDICTED: uncharacterized protein LOC100890543 [Strongylocentrotus
purpuratus]
Length = 509
Score = 38.5 bits (88), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 27/57 (47%)
Query: 66 AVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNG 122
A V+E + A +G + T L RV FK+ TRGV+ TP FVNG
Sbjct: 130 AYVQETLAAWAGEVGYDQTEFASRLARSDDTQWLARVEFKYGGTRGVFGTPQTFVNG 186
>gi|153003349|ref|YP_001377674.1| DSBA oxidoreductase [Anaeromyxobacter sp. Fw109-5]
gi|152026922|gb|ABS24690.1| DSBA oxidoreductase [Anaeromyxobacter sp. Fw109-5]
Length = 354
Score = 38.1 bits (87), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 27/46 (58%), Gaps = 3/46 (6%)
Query: 105 KFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGK 150
K GV TP FF+NG L+GA PLD ++KVID L+ GK
Sbjct: 312 KAGEEAGVSGTPAFFINGRMLSGA-QPLD--AFKKVIDQELASAGK 354
>gi|417862695|ref|ZP_12507745.1| DSBA oxidoreductase [Agrobacterium tumefaciens F2]
gi|338819957|gb|EGP53931.1| DSBA oxidoreductase [Agrobacterium tumefaciens F2]
Length = 206
Score = 38.1 bits (87), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 41/67 (61%), Gaps = 9/67 (13%)
Query: 87 LESGFSDRSTDLLTRVSFKFSATR---GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDP 143
LE+ +D++ LL V+ K +AT GV +TPTFF+NG G P+D +R +++P
Sbjct: 146 LEACLADQA--LLDNVN-KIAATGRELGVTSTPTFFINGNKYEGV-IPVDE--FRSIVEP 199
Query: 144 LLSEKGK 150
LL E GK
Sbjct: 200 LLKEAGK 206
>gi|358057096|dbj|GAA97003.1| hypothetical protein E5Q_03677 [Mixia osmundae IAM 14324]
Length = 198
Score = 38.1 bits (87), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/102 (21%), Positives = 42/102 (41%), Gaps = 5/102 (4%)
Query: 50 QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE-----SGFSDRSTDLLTRVSF 104
QE++++ +N T + + + K A + +G S+ L+ G S ++
Sbjct: 97 HQEEYFDEAVENETLAQIRRRLAKLAHDSVGVEESAFLDLVKTGQGNSGNKVGADLKLQV 156
Query: 105 KFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
K G++ TPT ++G S + W K + LS
Sbjct: 157 KLGRQNGIHVTPTVLLDGLVDPSVSSSFGKDEWAKYVKEKLS 198
>gi|418054002|ref|ZP_12692058.1| DSBA oxidoreductase [Hyphomicrobium denitrificans 1NES1]
gi|353211627|gb|EHB77027.1| DSBA oxidoreductase [Hyphomicrobium denitrificans 1NES1]
Length = 233
Score = 37.4 bits (85), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 29/52 (55%), Gaps = 3/52 (5%)
Query: 97 DLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
D +T + S T GV ATPTFF+NG L A + KVI+PLL+ K
Sbjct: 185 DQITAQRTRASDTFGVNATPTFFINGKKLQEAPT---LEALDKVIEPLLAAK 233
>gi|338740969|ref|YP_004677931.1| protein disulfide isomerase [Hyphomicrobium sp. MC1]
gi|337761532|emb|CCB67367.1| putative protein disulfide isomerase, putative protein precursor
(tat pathway signal) [Hyphomicrobium sp. MC1]
Length = 223
Score = 37.4 bits (85), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 3/52 (5%)
Query: 97 DLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
D +T + S T GV ATPTFF+NG L A + + KV++PLL+ K
Sbjct: 175 DAITAERSRASETFGVNATPTFFINGKKLQEAPT---MEAFDKVLEPLLAGK 223
>gi|406929335|gb|EKD64938.1| DsbA-like protein thioredoxin protein [uncultured bacterium]
Length = 256
Score = 37.0 bits (84), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 34/136 (25%), Positives = 63/136 (46%), Gaps = 17/136 (12%)
Query: 21 ASYFRDKFVCFFLVVLYIKRRLNCLLEIVQQEKFYNAPTQNM-TRTA------VVKEIVK 73
+S++ F L+ + L+ + +++ N P Q+ +TA + E +K
Sbjct: 5 SSFYMPILFGFLLIAAFFLGSLSTKVSYLEKGVSSNTPNQDSPAQTAPNQLPTITNEDMK 64
Query: 74 FAAEGIG---NSYSSALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGS 130
A+ +G N ++S L+S + D + +AT GV TPTFF+NG + GA
Sbjct: 65 QWAKDLGLNTNDFNSCLDSEKHKSNVD----KDLQDAATAGVSGTPTFFINGTMIVGAQP 120
Query: 131 PLDYNGWRKVIDPLLS 146
+ ++ +ID L+
Sbjct: 121 ---FENFKTIIDQELA 133
>gi|154245118|ref|YP_001416076.1| DSBA oxidoreductase [Xanthobacter autotrophicus Py2]
gi|154159203|gb|ABS66419.1| DSBA oxidoreductase [Xanthobacter autotrophicus Py2]
Length = 270
Score = 37.0 bits (84), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 24/38 (63%), Gaps = 3/38 (7%)
Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
GV ATPTFF+NG ++GA L W K + PLL+ K
Sbjct: 236 GVNATPTFFINGKKISGA---LGIAEWDKELAPLLAGK 270
>gi|300024387|ref|YP_003756998.1| DSBA oxidoreductase [Hyphomicrobium denitrificans ATCC 51888]
gi|299526208|gb|ADJ24677.1| DSBA oxidoreductase [Hyphomicrobium denitrificans ATCC 51888]
Length = 233
Score = 36.6 bits (83), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 3/52 (5%)
Query: 97 DLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
D +T + S T GV ATPTFF+NG L + + KVI+PLL+ K
Sbjct: 185 DQITAQRTRASDTFGVNATPTFFINGKKLPETPT---LEAFDKVIEPLLAAK 233
>gi|384439999|ref|YP_005654723.1| Thiol:disulfide interchange protein dsbA [Thermus sp. CCB_US3_UF1]
gi|359291132|gb|AEV16649.1| Thiol:disulfide interchange protein dsbA [Thermus sp. CCB_US3_UF1]
Length = 212
Score = 36.6 bits (83), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 3/46 (6%)
Query: 105 KFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGK 150
K + G+ TPTFFVNG G +D+ WR+ +D L+ GK
Sbjct: 170 KLATDLGLTGTPTFFVNGEKYGGY---MDFAKWREALDKALAGGGK 212
>gi|363752563|ref|XP_003646498.1| hypothetical protein Ecym_4660 [Eremothecium cymbalariae
DBVPG#7215]
gi|356890133|gb|AET39681.1| hypothetical protein Ecym_4660 [Eremothecium cymbalariae
DBVPG#7215]
Length = 202
Score = 35.8 bits (81), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 24/98 (24%), Positives = 45/98 (45%), Gaps = 4/98 (4%)
Query: 51 QEKFYNAPTQNMTRTAVVKEIVKFAAEGIG--NSYSSALESGFSDRSTDLLTRVSF--KF 106
Q++++++ T + +R + E+ FA E + L SD ++ V + ++
Sbjct: 101 QQRWFDSNTADKSRNEIYTELALFAQESVNLPADSLLPLLLVSSDDGNAVVKDVKYFTRY 160
Query: 107 SATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPL 144
GV+ TPT +NG + S + K+IDPL
Sbjct: 161 HRQNGVHVTPTLALNGIIIPAIESSTPTDRVWKIIDPL 198
>gi|386038198|ref|YP_005961074.1| putative protein-disulfide oxidoreductase [Paenibacillus polymyxa
M1]
gi|343098159|emb|CCC86367.1| putative protein-disulfide oxidoreductase [Paenibacillus polymyxa
M1]
Length = 228
Score = 35.8 bits (81), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 47/105 (44%), Gaps = 10/105 (9%)
Query: 52 EKFYN---APTQNMTRTAVVKEIVKFAAEGIG-NSYSSALESGFSDRSTDLLTRVSFKFS 107
E+ YN ++N A + EI K A + N +L++G D + FS
Sbjct: 130 EQIYNLQKDESENWATIAYLVEIAKIAKVKVDYNEMERSLKAG----EFDEMVNKDLNFS 185
Query: 108 ATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKR 152
G+ TPT VNG + G LDY R I+ ++++ +++
Sbjct: 186 RGLGIRQTPTLVVNGMVIETGG--LDYEQIRSAINDVINKTEERK 228
>gi|406994810|gb|EKE13735.1| hypothetical protein ACD_12C00848G0004 [uncultured bacterium]
Length = 275
Score = 35.4 bits (80), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 52/107 (48%), Gaps = 16/107 (14%)
Query: 52 EKFYNAPTQNMTRTAVVKEI------VKFAAEGIG---NSYSSALESGFSDRSTDLLTRV 102
EK N+ T TAV +E +K A+ +G ++ L+SG ++ T L
Sbjct: 47 EKKINSGTGGQAGTAVQQESPISVPKLKIYAKELGLNTGKFNKCLDSG--EKKT--LVEK 102
Query: 103 SFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKG 149
K+ + GV TP FF+NG L GA ++ ++++ID +S G
Sbjct: 103 DTKYGLSLGVQGTPGFFINGKFLGGA---FPFSAFKEIIDKEISGTG 146
>gi|383458665|ref|YP_005372654.1| amidohydrolase [Corallococcus coralloides DSM 2259]
gi|380731071|gb|AFE07073.1| amidohydrolase domain-containing protein [Corallococcus coralloides
DSM 2259]
Length = 426
Score = 35.4 bits (80), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 27/57 (47%), Gaps = 7/57 (12%)
Query: 97 DLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKRE 153
D T +K A RGV+ PT LA + L Y GW++ +DP KKRE
Sbjct: 276 DAGTPEVWKLMAQRGVFLCPT-------LAAGDAMLQYRGWKRGVDPEPESAKKKRE 325
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.326 0.139 0.410
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,335,065,012
Number of Sequences: 23463169
Number of extensions: 87943705
Number of successful extensions: 257578
Number of sequences better than 100.0: 99
Number of HSP's better than 100.0 without gapping: 66
Number of HSP's successfully gapped in prelim test: 33
Number of HSP's that attempted gapping in prelim test: 257471
Number of HSP's gapped (non-prelim): 104
length of query: 160
length of database: 8,064,228,071
effective HSP length: 122
effective length of query: 38
effective length of database: 9,496,688,749
effective search space: 360874172462
effective search space used: 360874172462
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 71 (32.0 bits)