BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 031378
         (160 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224085938|ref|XP_002307747.1| predicted protein [Populus trichocarpa]
 gi|222857196|gb|EEE94743.1| predicted protein [Populus trichocarpa]
          Length = 212

 Score =  150 bits (379), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 75/123 (60%), Positives = 90/123 (73%), Gaps = 7/123 (5%)

Query: 36  LYIKRRLNC-----LLE--IVQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE 88
           L+I   LN      LLE     QEKFYNA T N+++T++V+EIVKFA   +GNSYSSA E
Sbjct: 82  LHIANTLNSSFTFPLLEEFFKHQEKFYNAKTSNLSKTSIVEEIVKFATVAVGNSYSSAFE 141

Query: 89  SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
           SGF+DR TDL TRVSFK+S +RGV+ TP FFVNGF L  AGSPLDYNGWR +IDPL+  K
Sbjct: 142 SGFNDRQTDLKTRVSFKYSTSRGVFGTPFFFVNGFVLPDAGSPLDYNGWRSIIDPLVGAK 201

Query: 149 GKK 151
             +
Sbjct: 202 SSQ 204


>gi|255539052|ref|XP_002510591.1| conserved hypothetical protein [Ricinus communis]
 gi|223551292|gb|EEF52778.1| conserved hypothetical protein [Ricinus communis]
          Length = 362

 Score =  150 bits (379), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 69/105 (65%), Positives = 87/105 (82%), Gaps = 1/105 (0%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
            QE+FYNA T+N++R +VVK+IV+FA+  IGNSY SA+ESGF+DR TDL TRVSFK+S +
Sbjct: 123 HQERFYNAQTRNLSRVSVVKDIVEFASVAIGNSYHSAIESGFNDRETDLQTRVSFKYSTS 182

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL-SEKGKKRE 153
           RGV+ TPTF++NGF L  AGS LDYNGWRK IDPL+ ++KGK  E
Sbjct: 183 RGVFGTPTFYINGFVLPDAGSTLDYNGWRKFIDPLINAKKGKSEE 227


>gi|224061957|ref|XP_002300683.1| predicted protein [Populus trichocarpa]
 gi|222842409|gb|EEE79956.1| predicted protein [Populus trichocarpa]
          Length = 223

 Score =  137 bits (344), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 71/120 (59%), Positives = 86/120 (71%), Gaps = 7/120 (5%)

Query: 36  LYIKRRLNC-----LLE--IVQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE 88
           L+I   LNC     LLE     QEKFY + T N+++ ++VKEIVKFA   +G+SYSS L+
Sbjct: 104 LHIANILNCSFTFPLLEQFFKHQEKFYGSETSNLSKDSIVKEIVKFATVIVGDSYSSPLQ 163

Query: 89  SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
            GF+D  TDL TRVSFK+SA+RGVYATP FFVNGF L GAGS LDY  WR +IDPL+  K
Sbjct: 164 FGFNDIQTDLKTRVSFKYSASRGVYATPFFFVNGFGLPGAGSALDYKVWRSIIDPLVGAK 223


>gi|8778978|gb|AAF79893.1|AC022472_2 Contains similarity to pigpen protein from Mus musculus gb|AF224264
           and contains protein of unknown function DUF78 PF|01918
           domain. ESTs gb|N38077, gb|BE037702, gb|AV442191,
           gb|AV441368, gb|Z17998, gb|AV527266, gb|AV520794,
           gb|AI997847, gb|AV543000 come from this gene
           [Arabidopsis thaliana]
          Length = 538

 Score =  130 bits (327), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 59/95 (62%), Positives = 71/95 (74%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
            Q  FYN+ TQ M+R AVV+E++K     +GNSY S L+SGFS+  +DL TRVSFK+S +
Sbjct: 117 HQTLFYNSQTQLMSRPAVVEELIKLGTVTLGNSYHSPLKSGFSNSKSDLATRVSFKYSVS 176

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPL 144
           RGV ATPTF+VNGF L GAGSP DY GWR  IDPL
Sbjct: 177 RGVSATPTFYVNGFELPGAGSPKDYEGWRDTIDPL 211


>gi|22329686|ref|NP_683315.1| TRX domain-containing protein [Arabidopsis thaliana]
 gi|17065544|gb|AAL32926.1| Unknown protein [Arabidopsis thaliana]
 gi|24899723|gb|AAN65076.1| Unknown protein [Arabidopsis thaliana]
 gi|332191831|gb|AEE29952.1| TRX domain-containing protein [Arabidopsis thaliana]
          Length = 233

 Score =  129 bits (323), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 66/120 (55%), Positives = 82/120 (68%), Gaps = 7/120 (5%)

Query: 36  LYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE 88
           L+I   LN      LLE +   Q  FYN+ TQ M+R AVV+E++K     +GNSY S L+
Sbjct: 97  LHIVDTLNANATFNLLEGIFKHQTLFYNSQTQLMSRPAVVEELIKLGTVTLGNSYHSPLK 156

Query: 89  SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
           SGFS+  +DL TRVSFK+S +RGV ATPTF+VNGF L GAGSP DY GWR  IDPL+  +
Sbjct: 157 SGFSNSKSDLATRVSFKYSVSRGVSATPTFYVNGFELPGAGSPKDYEGWRDTIDPLVKPQ 216


>gi|359492074|ref|XP_003634361.1| PREDICTED: uncharacterized protein LOC100854733 [Vitis vinifera]
          Length = 234

 Score =  128 bits (322), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 68/132 (51%), Positives = 89/132 (67%), Gaps = 9/132 (6%)

Query: 36  LYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE 88
           L+I  +LN      LLE++   QE FYN  T NM+RTA+V  IVKF ++ +G S  SA++
Sbjct: 105 LHIVNKLNSSATYHLLEMLFKHQEIFYNQITVNMSRTAIVDCIVKFVSKAVGESLFSAIK 164

Query: 89  SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
           SGFSDR TDL TRVSFK+  +RGV  TP FFVNGF L   GS ++Y+ WR ++DPL+S+ 
Sbjct: 165 SGFSDRQTDLTTRVSFKYGCSRGVLGTPYFFVNGFPLPDPGSAINYSKWRSILDPLVSDS 224

Query: 149 GKKREVPLHLFL 160
            K+    LH FL
Sbjct: 225 VKRET--LHFFL 234


>gi|297850426|ref|XP_002893094.1| hypothetical protein ARALYDRAFT_312946 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338936|gb|EFH69353.1| hypothetical protein ARALYDRAFT_312946 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  128 bits (321), Expect = 9e-28,   Method: Composition-based stats.
 Identities = 61/106 (57%), Positives = 75/106 (70%), Gaps = 1/106 (0%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
            Q  FYN+ TQ M+R AVV  ++K     +GNSY S L SGFS+  +DL TRVSFK+S +
Sbjct: 118 HQALFYNSQTQLMSRPAVVDNLIKLGTVTLGNSYHSPLISGFSNSKSDLATRVSFKYSVS 177

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKREVP 155
           RGV ATPTF+VNGF L GAGSP DY GWR  IDPL+ +K ++ E P
Sbjct: 178 RGVSATPTFYVNGFELPGAGSPKDYEGWRDTIDPLM-DKYQRVEKP 222


>gi|81076617|gb|ABB55396.1| unknown [Solanum tuberosum]
          Length = 224

 Score =  126 bits (317), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 65/132 (49%), Positives = 86/132 (65%), Gaps = 7/132 (5%)

Query: 36  LYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE 88
           L+I  +LN      LLE    QQ+KFYN  T N+++ +VV E+VKF +  IGNS  +A++
Sbjct: 93  LHIVNKLNSSATFRLLEAFFDQQDKFYNQATFNLSKASVVDEVVKFTSNKIGNSNYAAVK 152

Query: 89  SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
           +GF+D  TD  TR+SFK+   +GVY  P FFVNGF L   GSPLDY  WR VIDPL+S +
Sbjct: 153 AGFTDPKTDQATRISFKYGCVKGVYGAPFFFVNGFPLPDGGSPLDYKTWRDVIDPLISPE 212

Query: 149 GKKREVPLHLFL 160
            + R   L+ FL
Sbjct: 213 EQLRAENLNFFL 224


>gi|351726088|ref|NP_001235579.1| uncharacterized protein LOC100500267 precursor [Glycine max]
 gi|255629877|gb|ACU15289.1| unknown [Glycine max]
          Length = 236

 Score =  126 bits (316), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 58/113 (51%), Positives = 82/113 (72%), Gaps = 3/113 (2%)

Query: 51  QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
           QE FY A T+N++R ++++E+VK A + +G+SY   +++GF+D +TD+ TRVSFK++A+R
Sbjct: 124 QENFYGAQTRNLSRASIIEEVVKSATQVVGSSYYKTIKNGFNDTTTDIQTRVSFKYAASR 183

Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKK---REVPLHLFL 160
           GVY TP F+VNGF L   G+ +DY  WRKVIDPL+  K KK    E  LH FL
Sbjct: 184 GVYGTPFFYVNGFLLPDTGAAVDYKTWRKVIDPLVGAKNKKSIQNEESLHFFL 236


>gi|449450508|ref|XP_004143004.1| PREDICTED: uncharacterized protein LOC101216804 isoform 1 [Cucumis
           sativus]
 gi|449450510|ref|XP_004143005.1| PREDICTED: uncharacterized protein LOC101216804 isoform 2 [Cucumis
           sativus]
 gi|449521601|ref|XP_004167818.1| PREDICTED: uncharacterized protein LOC101231840 isoform 1 [Cucumis
           sativus]
 gi|449521603|ref|XP_004167819.1| PREDICTED: uncharacterized protein LOC101231840 isoform 2 [Cucumis
           sativus]
          Length = 236

 Score =  125 bits (315), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 61/111 (54%), Positives = 78/111 (70%), Gaps = 2/111 (1%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
            Q++FYNA T+ ++R A+V  +VKF  E +G+SY + L +GF+DR TDLLTRVSFKFS +
Sbjct: 128 DQKQFYNAETRYLSRAAIVDSMVKFGVEVLGDSYKNTLVTGFNDRETDLLTRVSFKFSTS 187

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKREVPLHLFL 160
           RGVY TP FF+NGF     GSPL+Y  WR +IDPL+  K  KR    HL L
Sbjct: 188 RGVYGTPFFFINGFLAPDKGSPLNYTEWRNLIDPLI--KKNKRSGSQHLSL 236


>gi|351721154|ref|NP_001236688.1| uncharacterized protein LOC100527484 precursor [Glycine max]
 gi|255632456|gb|ACU16578.1| unknown [Glycine max]
          Length = 233

 Score =  124 bits (312), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 67/133 (50%), Positives = 87/133 (65%), Gaps = 8/133 (6%)

Query: 36  LYIKRRLNC-----LLE--IVQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE 88
           L+I   LN      LLE     QEKFY A T+N++R ++V EIVK A E  G+SY +A++
Sbjct: 101 LHIVNTLNASATFPLLEWFFKHQEKFYGAQTRNLSRASIVDEIVKSATEAAGSSYYNAIK 160

Query: 89  SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS-E 147
            GF+D  TD  TRVSFK++A+RGVY TP+F+VNGF L   G+  DY  WRKVIDPL+  +
Sbjct: 161 HGFNDTKTDYQTRVSFKYAASRGVYGTPSFYVNGFLLPDTGATADYKTWRKVIDPLVGVK 220

Query: 148 KGKKREVPLHLFL 160
           K  + E  L  FL
Sbjct: 221 KSIQNEESLRFFL 233


>gi|15222928|ref|NP_177728.1| Thioredoxin superfamily protein [Arabidopsis thaliana]
 gi|117168227|gb|ABK32196.1| At1g76020 [Arabidopsis thaliana]
 gi|332197664|gb|AEE35785.1| Thioredoxin superfamily protein [Arabidopsis thaliana]
          Length = 225

 Score =  124 bits (310), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 56/96 (58%), Positives = 74/96 (77%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
            Q  FYNA TQ ++R AVV++IV+     +GNSY S L+SGFSD+ +D  TRVSFK+SA+
Sbjct: 117 HQSLFYNAQTQLLSRPAVVEKIVELGTVSLGNSYQSVLKSGFSDKKSDRATRVSFKYSAS 176

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
           RGVY TPTF+VNGF L+ A SP ++ GW+K+IDPL+
Sbjct: 177 RGVYGTPTFYVNGFVLSDAASPSNFGGWKKIIDPLV 212


>gi|409030182|gb|AFV07578.1| thioredoxin f5, partial [Solanum tuberosum]
          Length = 206

 Score =  123 bits (308), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 64/103 (62%), Positives = 75/103 (72%), Gaps = 7/103 (6%)

Query: 35  VLYIKRRLNC-----LLE--IVQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
            L+I   LN      LLE     QE+FYNA T N +R  VVKEIVKFA+  +GNSY SA+
Sbjct: 104 TLHIANELNTSSTFPLLERFYKHQERFYNAQTHNKSRATVVKEIVKFASAAVGNSYCSAI 163

Query: 88  ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGS 130
           ESG +DR TDL TRVSFK+SA+RGV+ATPTFFVNGF+L  AGS
Sbjct: 164 ESGLNDRQTDLKTRVSFKYSASRGVFATPTFFVNGFALPDAGS 206


>gi|388493536|gb|AFK34834.1| unknown [Lotus japonicus]
          Length = 239

 Score =  123 bits (308), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 63/133 (47%), Positives = 91/133 (68%), Gaps = 8/133 (6%)

Query: 36  LYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE 88
           L++   LN      LLE++   QEKFY A T+N++R ++ +E VK A E IG+S+ ++++
Sbjct: 107 LHVVNALNSSATFPLLELLFKDQEKFYGAQTRNLSRASIQEEFVKSATEVIGSSFYTSVK 166

Query: 89  SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL-SE 147
           +GF+D  TD  TR+SFK++A+RGVY TP F+VNGF L   G  +DY  W+K+IDPL+ +E
Sbjct: 167 NGFNDSKTDHQTRISFKYAASRGVYGTPFFYVNGFLLPDVGDSIDYKTWKKIIDPLVGAE 226

Query: 148 KGKKREVPLHLFL 160
           K  K E  LH FL
Sbjct: 227 KNVKNEESLHFFL 239


>gi|388504516|gb|AFK40324.1| unknown [Medicago truncatula]
          Length = 230

 Score =  122 bits (306), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 55/111 (49%), Positives = 83/111 (74%), Gaps = 1/111 (0%)

Query: 51  QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
           Q  F+ APT+N++R  +V EIVKF +  +GNSY +++++GF+D ++D LTR+SFK++A+R
Sbjct: 120 QAAFFGAPTRNLSRAFIVDEIVKFTSAIVGNSYHTSIKNGFNDTNSDHLTRISFKYAASR 179

Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL-SEKGKKREVPLHLFL 160
           GV  TP F+VNGF+L  AG+ L+Y+ WR +IDPL+ ++K  K E  LH  L
Sbjct: 180 GVSGTPFFYVNGFALPDAGNALNYSDWRSIIDPLIGAKKSTKNEESLHYLL 230


>gi|302141727|emb|CBI18930.3| unnamed protein product [Vitis vinifera]
          Length = 514

 Score =  119 bits (298), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 62/117 (52%), Positives = 80/117 (68%), Gaps = 7/117 (5%)

Query: 35  VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
            L+I  +LN      LLE++   QE FYN  T NM+RTA+V  IVKF ++ +G S  SA+
Sbjct: 104 ALHIVNKLNSSATYHLLEMLFKHQEIFYNQITVNMSRTAIVDCIVKFVSKAVGESLFSAI 163

Query: 88  ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPL 144
           +SGFSDR TDL TRVSFK+  +RGV  TP FFVNGF L   GS ++Y+ WR ++DPL
Sbjct: 164 KSGFSDRQTDLTTRVSFKYGCSRGVLGTPYFFVNGFPLPDPGSAINYSKWRSILDPL 220


>gi|297842343|ref|XP_002889053.1| hypothetical protein ARALYDRAFT_895477 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334894|gb|EFH65312.1| hypothetical protein ARALYDRAFT_895477 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 229

 Score =  118 bits (295), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 54/97 (55%), Positives = 71/97 (73%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
            Q  FYNA T  +TR AVV++IV+     +G SY S L+SGFSD+ +D  TRVSFK+S +
Sbjct: 118 HQPLFYNAQTNLLTRAAVVEKIVELGTVTLGKSYQSVLKSGFSDKKSDRATRVSFKYSGS 177

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
           RGVY TPTF+VNGF L+ A SP ++ GW+K+IDPL+ 
Sbjct: 178 RGVYGTPTFYVNGFVLSDAASPSNFGGWKKIIDPLVQ 214


>gi|218202548|gb|EEC84975.1| hypothetical protein OsI_32232 [Oryza sativa Indica Group]
          Length = 237

 Score =  112 bits (281), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 61/127 (48%), Positives = 77/127 (60%), Gaps = 8/127 (6%)

Query: 35  VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
            LYI  +LN      LLE+    Q KFYNA T +++ T +  E+ K AA  +GNS S   
Sbjct: 108 ALYIANKLNSSSTYPLLELFFKSQGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-F 166

Query: 88  ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
           +SGFSD  TDL  RVSFK+  TRGV   P FFVNGF   G GSP+DY+ W  ++DPL  +
Sbjct: 167 QSGFSDIRTDLAARVSFKYGCTRGVAGAPFFFVNGFLQPGGGSPIDYSTWVSILDPLAGQ 226

Query: 148 KGKKREV 154
            G K E+
Sbjct: 227 HGDKLEM 233


>gi|115480453|ref|NP_001063820.1| Os09g0542200 [Oryza sativa Japonica Group]
 gi|52076083|dbj|BAD46596.1| unknown protein [Oryza sativa Japonica Group]
 gi|113632053|dbj|BAF25734.1| Os09g0542200 [Oryza sativa Japonica Group]
 gi|215694060|dbj|BAG89259.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215740808|dbj|BAG96964.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 237

 Score =  112 bits (279), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 60/127 (47%), Positives = 77/127 (60%), Gaps = 8/127 (6%)

Query: 35  VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
            LYI  +LN      LLE+    Q KFYNA T +++ T +  E+ K AA  +GNS S   
Sbjct: 108 ALYIANKLNSSSTYPLLELFFKSQGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-F 166

Query: 88  ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
           +SGFSD  TDL  RVSFK+  TRGV   P FFVNGF   G GSP+DY+ W  ++DPL  +
Sbjct: 167 QSGFSDIRTDLAARVSFKYGCTRGVAGAPFFFVNGFLQPGGGSPIDYSTWVSILDPLAGQ 226

Query: 148 KGKKREV 154
            G + E+
Sbjct: 227 HGDRLEM 233


>gi|194697338|gb|ACF82753.1| unknown [Zea mays]
          Length = 238

 Score =  112 bits (279), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 60/127 (47%), Positives = 81/127 (63%), Gaps = 8/127 (6%)

Query: 35  VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
            LYI  +LN      LLE+    QEKFYN+ T +++  +V   + K AA+ +GNS S  L
Sbjct: 109 ALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSGPSVALGMSKMAAQTVGNSVSEFL 168

Query: 88  ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
            SGFSD  TD   RVSFK+  TRGV+  P FFVNGF   G GSP+DY+ W  ++DPL+S+
Sbjct: 169 -SGFSDGKTDSAARVSFKYGCTRGVFGAPFFFVNGFLQPGGGSPIDYSTWIGILDPLVSQ 227

Query: 148 KGKKREV 154
            G++ E+
Sbjct: 228 NGERIEM 234


>gi|195655073|gb|ACG47004.1| hypothetical protein [Zea mays]
          Length = 235

 Score =  111 bits (277), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 60/127 (47%), Positives = 80/127 (62%), Gaps = 8/127 (6%)

Query: 35  VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
            LYI  +LN      LLE+    QEKFYN+ T +++  +V   + K AA+ +GNS S  L
Sbjct: 106 ALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSGPSVALGMSKMAAQTVGNSVSEFL 165

Query: 88  ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
            SGFSD  TD   RVSFK+  TRGV+  P FFVNGF   G GSP+DY+ W  ++DPL S+
Sbjct: 166 -SGFSDGKTDSAARVSFKYGCTRGVFGAPFFFVNGFLQPGGGSPIDYSTWIGILDPLASQ 224

Query: 148 KGKKREV 154
            G++ E+
Sbjct: 225 NGERIEM 231


>gi|242050028|ref|XP_002462758.1| hypothetical protein SORBIDRAFT_02g031470 [Sorghum bicolor]
 gi|241926135|gb|EER99279.1| hypothetical protein SORBIDRAFT_02g031470 [Sorghum bicolor]
          Length = 238

 Score =  110 bits (276), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 62/127 (48%), Positives = 82/127 (64%), Gaps = 8/127 (6%)

Query: 35  VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
            LYI  +LN      LLE+    QEKFYN+ T +++  +V  E+ K AA+ +GNS S  L
Sbjct: 109 ALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSSPSVAVEMSKMAAQTVGNSVSEFL 168

Query: 88  ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
            SGFSD  TD   RVSFK+  TRGVY  P FFVNGF   G GSP+DY+ W  ++DPL+S+
Sbjct: 169 -SGFSDTKTDSAARVSFKYGCTRGVYGAPFFFVNGFLQPGGGSPIDYSTWIGILDPLVSQ 227

Query: 148 KGKKREV 154
            G++ E+
Sbjct: 228 NGERIEM 234


>gi|326520974|dbj|BAJ92850.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 292

 Score =  110 bits (274), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 58/127 (45%), Positives = 76/127 (59%), Gaps = 8/127 (6%)

Query: 35  VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
            L+I  +LN      +LE+    Q KF N  T +M+ TAV  EI K AA+ +GNS S   
Sbjct: 163 ALHIANKLNSSSTYPVLELFFKNQGKFSNRATSSMSSTAVTGEISKMAAQAVGNSVSD-F 221

Query: 88  ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
           +SGFSD  TD+  RVSFK+  TRGV   P FFVNGF   G GSP+DY  W  ++DPL++ 
Sbjct: 222 QSGFSDTRTDMAARVSFKYGCTRGVAGAPFFFVNGFLQPGGGSPIDYATWTSILDPLVAH 281

Query: 148 KGKKREV 154
             +  E+
Sbjct: 282 HSQTIEM 288


>gi|226528876|ref|NP_001145190.1| uncharacterized protein LOC100278440 [Zea mays]
 gi|195652481|gb|ACG45708.1| hypothetical protein [Zea mays]
          Length = 238

 Score =  108 bits (270), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 59/127 (46%), Positives = 80/127 (62%), Gaps = 8/127 (6%)

Query: 35  VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
            LYI  +LN      LLE+    QEKFYN+ T +++  +V   + K AA+ +GNS S  L
Sbjct: 109 ALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSGPSVALGMSKMAAQTVGNSVSEFL 168

Query: 88  ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
            SGFSD  TD   RVSFK+  TRGV+  P FFVNGF   G GSP+DY+ W  ++ PL+S+
Sbjct: 169 -SGFSDGKTDSAARVSFKYGCTRGVFGAPFFFVNGFLQPGGGSPIDYSTWIGILXPLVSQ 227

Query: 148 KGKKREV 154
            G++ E+
Sbjct: 228 NGERIEM 234


>gi|357159750|ref|XP_003578547.1| PREDICTED: uncharacterized protein LOC100839255 [Brachypodium
           distachyon]
          Length = 238

 Score =  107 bits (266), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 55/127 (43%), Positives = 76/127 (59%), Gaps = 8/127 (6%)

Query: 35  VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
            L+I  +LN      LLE+    Q KF N    +++ TA+  EI K A + +GNS S   
Sbjct: 109 ALHIANKLNSSSTYPLLELFFKNQGKFSNHALSSLSSTAITGEISKMAVQAVGNSVSE-F 167

Query: 88  ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
           +SGFSD  TD   RVSFK+  TRGV   P FFVNGF   G GSP+DY  W  ++DP++++
Sbjct: 168 QSGFSDARTDSAARVSFKYGCTRGVAGAPFFFVNGFLQPGGGSPIDYTTWTNILDPIVAQ 227

Query: 148 KGKKREV 154
            G+K ++
Sbjct: 228 HGEKTDM 234


>gi|222642012|gb|EEE70144.1| hypothetical protein OsJ_30190 [Oryza sativa Japonica Group]
          Length = 216

 Score =  106 bits (264), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 51/104 (49%), Positives = 67/104 (64%), Gaps = 1/104 (0%)

Query: 51  QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
           + KFYNA T +++ T +  E+ K AA  +GNS S   +SGFSD  TDL  RVSFK+  TR
Sbjct: 110 EGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-FQSGFSDIRTDLAARVSFKYGCTR 168

Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKREV 154
           GV   P FFVNGF   G GSP+DY+ W  ++DPL  + G + E+
Sbjct: 169 GVAGAPFFFVNGFLQPGGGSPIDYSTWVSILDPLAGQHGDRLEM 212


>gi|116781410|gb|ABK22089.1| unknown [Picea sitchensis]
          Length = 227

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 44/101 (43%), Positives = 62/101 (61%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
            Q +FYN  T  +   +++ +I+ FA E  GNS S+  ES F D +TD+ TR+SFK+  +
Sbjct: 122 HQARFYNKATLQVAPASIINQIIHFALEISGNSSSTMFESAFQDTTTDMATRISFKYGCS 181

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGK 150
           R V  TP FFVNG  L      +DY GW+ +IDPLL+ K +
Sbjct: 182 RAVTGTPYFFVNGIPLLNIEETIDYKGWKSIIDPLLAMKTR 222


>gi|8778813|gb|AAF79818.1|AC007396_19 T4O12.23 [Arabidopsis thaliana]
          Length = 263

 Score = 95.9 bits (237), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 46/81 (56%), Positives = 58/81 (71%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
            Q  FYNA TQ ++R AVV++IV+     +GNSY S L+SGFSD+ +D  TRVSFK+SA+
Sbjct: 117 HQSLFYNAQTQLLSRPAVVEKIVELGTVSLGNSYQSVLKSGFSDKKSDRATRVSFKYSAS 176

Query: 110 RGVYATPTFFVNGFSLAGAGS 130
           RGVY TPTF+VNG S     S
Sbjct: 177 RGVYGTPTFYVNGSSEPNESS 197


>gi|326491935|dbj|BAJ98192.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326493720|dbj|BAJ85321.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326504198|dbj|BAJ90931.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326511459|dbj|BAJ87743.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326520669|dbj|BAJ92698.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 226

 Score = 95.5 bits (236), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 46/98 (46%), Positives = 61/98 (62%), Gaps = 1/98 (1%)

Query: 51  QEKFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
           QE +YN PT   TR AVV EI     A  IG +  +A ++GF+D  +D  TR+SFKF   
Sbjct: 128 QEGYYNTPTYTKTRAAVVAEIANNLVAPVIGEANLAAYKAGFNDSQSDQATRISFKFGCA 187

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
           RGV  TP +FVNG  L+ +GSP+DY  W   +DPL+ +
Sbjct: 188 RGVTGTPYYFVNGIPLSDSGSPMDYGKWISTLDPLVGK 225


>gi|409030172|gb|AFV07573.1| DsbA c3, partial [Solanum tuberosum]
          Length = 194

 Score = 95.1 bits (235), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 51/91 (56%), Positives = 62/91 (68%), Gaps = 7/91 (7%)

Query: 35  VLYIKRRLNC-----LLE--IVQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
            L+I   LN      LLE     QE+FYNA T N +R  VVKEIV+FA+  +GNS  SA+
Sbjct: 104 TLHIANELNTSSTFPLLERFYKHQERFYNAQTHNKSRATVVKEIVRFASAAVGNSCCSAI 163

Query: 88  ESGFSDRSTDLLTRVSFKFSATRGVYATPTF 118
           ESG  DR TDL TRVSF++SA+RGV+ TPTF
Sbjct: 164 ESGSYDRQTDLKTRVSFQYSASRGVFGTPTF 194


>gi|115457372|ref|NP_001052286.1| Os04g0227500 [Oryza sativa Japonica Group]
 gi|38346025|emb|CAE01956.2| OSJNBb0071D01.2 [Oryza sativa Japonica Group]
 gi|38346893|emb|CAE03918.2| OSJNBb0015G09.12 [Oryza sativa Japonica Group]
 gi|113563857|dbj|BAF14200.1| Os04g0227500 [Oryza sativa Japonica Group]
 gi|125546580|gb|EAY92719.1| hypothetical protein OsI_14471 [Oryza sativa Indica Group]
 gi|125588738|gb|EAZ29402.1| hypothetical protein OsJ_13476 [Oryza sativa Japonica Group]
 gi|215679384|dbj|BAG96524.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765592|dbj|BAG87289.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 226

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 45/99 (45%), Positives = 63/99 (63%), Gaps = 2/99 (2%)

Query: 51  QEKFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
           QE +YN PT   TR  V  E+ K   A  IG +  +A ++GF+D  +D  TR+SFK+   
Sbjct: 127 QEGYYNQPTYGKTRATVDAEVAKNLVAPVIGEANLAAYKAGFNDSKSDQATRISFKYGCA 186

Query: 110 RGVYATPTFFVNGFSL-AGAGSPLDYNGWRKVIDPLLSE 147
           RGV  TP FFVNG  L + +GSPL+YN W+ ++DPL+ +
Sbjct: 187 RGVTGTPYFFVNGIPLISDSGSPLEYNKWKSILDPLVGK 225


>gi|357168058|ref|XP_003581462.1| PREDICTED: uncharacterized protein LOC100834992 [Brachypodium
           distachyon]
          Length = 219

 Score = 91.7 bits (226), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 44/98 (44%), Positives = 60/98 (61%), Gaps = 1/98 (1%)

Query: 51  QEKFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
           QE +YN PT   +R  VV EI     A  IG +  +A ++GF+D  +D  TR+SFK+   
Sbjct: 121 QEAYYNQPTYTKSRETVVAEITNNLVAPVIGEANLAAYKAGFNDSMSDQSTRISFKYGCA 180

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
           RGV  TP FFVNG  L  +GSP+DY+ W   +DPL+ +
Sbjct: 181 RGVTGTPYFFVNGIPLGDSGSPVDYDKWISTLDPLVGK 218


>gi|242072508|ref|XP_002446190.1| hypothetical protein SORBIDRAFT_06g003190 [Sorghum bicolor]
 gi|241937373|gb|EES10518.1| hypothetical protein SORBIDRAFT_06g003190 [Sorghum bicolor]
          Length = 220

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/99 (44%), Positives = 61/99 (61%), Gaps = 1/99 (1%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSA 108
            QE +YN PT   +R  VV EI K   A  IG +  +A ++GF+D  +D+ TR+SFK   
Sbjct: 121 DQEGYYNRPTYGKSRATVVDEITKNLVAPIIGETNLAAYKAGFNDSQSDMATRISFKNGC 180

Query: 109 TRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
            RGV  TP FFVNG  +  +GSPL+Y  W  ++DPL+ +
Sbjct: 181 ARGVTGTPYFFVNGIPINDSGSPLEYKYWISILDPLVGK 219


>gi|226491612|ref|NP_001151670.1| LOC100285305 precursor [Zea mays]
 gi|195648623|gb|ACG43779.1| DSBA-like thioredoxin domain containing protein [Zea mays]
          Length = 223

 Score = 83.2 bits (204), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 40/98 (40%), Positives = 57/98 (58%), Gaps = 1/98 (1%)

Query: 51  QEKFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
           Q+++YN PT   +R  VV EI K      IG +  +A  +GF+D  +D  TR+SFK    
Sbjct: 125 QKRYYNQPTYEKSRATVVDEITKNLVVPIIGETNLAAFRAGFNDSHSDQATRISFKNGCA 184

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
           RGV  TP FFVNG  +  +GSPL+   W  ++ PL+ +
Sbjct: 185 RGVTGTPYFFVNGIPINDSGSPLEXKDWISILRPLVGK 222


>gi|302791067|ref|XP_002977300.1| hypothetical protein SELMODRAFT_271243 [Selaginella moellendorffii]
 gi|300154670|gb|EFJ21304.1| hypothetical protein SELMODRAFT_271243 [Selaginella moellendorffii]
          Length = 226

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 47/121 (38%), Positives = 67/121 (55%), Gaps = 9/121 (7%)

Query: 35  VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVK-FAAE-GIGNSYSS 85
            L+I   LN      LL+++   QE+F NA T + T + V+ E    FA++ G GN   S
Sbjct: 97  ALHIANHLNASLTYPLLDLIFEHQEEFSNANTADKTASTVIDEFSSLFASQFGSGNEAKS 156

Query: 86  ALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
             + GF D STD   R+SFK+  +RGV  TP FFVNG  L+   +    + W  ++DPLL
Sbjct: 157 IFKQGFYDSSTDQAGRISFKYGCSRGVTGTPVFFVNGVPLSNVDASWGIDEWANILDPLL 216

Query: 146 S 146
           +
Sbjct: 217 A 217


>gi|302780317|ref|XP_002971933.1| hypothetical protein SELMODRAFT_270941 [Selaginella moellendorffii]
 gi|300160232|gb|EFJ26850.1| hypothetical protein SELMODRAFT_270941 [Selaginella moellendorffii]
          Length = 226

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 47/121 (38%), Positives = 67/121 (55%), Gaps = 9/121 (7%)

Query: 35  VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVK-FAAE-GIGNSYSS 85
            L+I   LN      LL+++   QE+F NA T + T + V+ E    FA++ G GN   S
Sbjct: 97  ALHIANHLNASLTYPLLDLIFEHQEEFSNANTGDKTASTVIDEFSSLFASQFGSGNEAKS 156

Query: 86  ALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
             + GF D STD   R+SFK+  +RGV  TP FFVNG  L+   +    + W  ++DPLL
Sbjct: 157 IFKQGFYDSSTDQAGRISFKYGCSRGVTGTPVFFVNGVPLSNVDASWGIDEWANILDPLL 216

Query: 146 S 146
           +
Sbjct: 217 A 217


>gi|32526672|dbj|BAC79195.1| unknown protein [Oryza sativa Japonica Group]
          Length = 220

 Score = 79.3 bits (194), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 49/127 (38%), Positives = 65/127 (51%), Gaps = 28/127 (22%)

Query: 35  VLYIKRRLNC-----LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSAL 87
            LYI  +LN      LLE+    Q KFYNA T +++ T +  E+ K AA  +GNS S   
Sbjct: 108 ALYIANKLNSSSTYPLLELFFKSQGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-F 166

Query: 88  ESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
           +SGFSD  TDL  RVSFK                     G GSP+DY+ W  ++DPL  +
Sbjct: 167 QSGFSDIRTDLAARVSFK--------------------PGGGSPIDYSTWVSILDPLAGQ 206

Query: 148 KGKKREV 154
            G + E+
Sbjct: 207 HGDRLEM 213


>gi|413917905|gb|AFW57837.1| hypothetical protein ZEAMMB73_294031 [Zea mays]
          Length = 212

 Score = 72.0 bits (175), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 55/97 (56%), Gaps = 5/97 (5%)

Query: 52  EKFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
           EKF+    ++ + +  V EI K      IG +  +A  +GF+D  +D  TR+SFK    R
Sbjct: 119 EKFF----KDQSYSGTVDEITKNLVVPIIGETNLAAFRAGFNDSHSDQATRISFKNGCAR 174

Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
           GV  TP FFVNG  +  +GSPL+Y  W  ++ PL+ +
Sbjct: 175 GVTGTPYFFVNGIPINDSGSPLEYKDWISILGPLVGK 211


>gi|168037769|ref|XP_001771375.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677293|gb|EDQ63765.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 191

 Score = 72.0 bits (175), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 57/102 (55%), Gaps = 4/102 (3%)

Query: 45  LLEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRV 102
           LLE++   QE F N  T + T   VV  ++  A + +G   +S  E  F+D  TD  TR+
Sbjct: 92  LLELIFSNQESFLNPSTNHETPAHVVSRLISLA-DNLGFPTNS-FEVAFNDAVTDQATRI 149

Query: 103 SFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPL 144
           SFK+  +RGV  TPT+ VNG ++AGA        W K+ DP+
Sbjct: 150 SFKYGCSRGVVGTPTYLVNGVAVAGADDSWSVEDWGKLFDPM 191


>gi|449450512|ref|XP_004143006.1| PREDICTED: uncharacterized protein LOC101216804 isoform 3 [Cucumis
           sativus]
          Length = 201

 Score = 69.7 bits (169), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 31/56 (55%), Positives = 43/56 (76%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 105
            Q++FYNA T+ ++R A+V  +VKF  E +G+SY + L +GF+DR TDLLTRVSFK
Sbjct: 128 DQKQFYNAETRYLSRAAIVDSMVKFGVEVLGDSYKNTLVTGFNDRETDLLTRVSFK 183


>gi|196010617|ref|XP_002115173.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190582556|gb|EDV22629.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 217

 Score = 67.0 bits (162), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 54/105 (51%), Gaps = 2/105 (1%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
            Q +FYN PT  M+   V++   K A +  G S +    +   +   D  TR+S K +  
Sbjct: 115 NQSEFYNKPTLKMSGLQVIQAYAKLAHKAAGIS-AMKFANALQNSGIDHTTRLSSKLAWE 173

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKREV 154
           RGV  TP+FF+NG  + G  +      W+KVIDPLL  K  + E+
Sbjct: 174 RGVTGTPSFFINGIRVIGQSN-WTVKQWKKVIDPLLKTKFPRLEL 217


>gi|414886700|tpg|DAA62714.1| TPA: hypothetical protein ZEAMMB73_877337 [Zea mays]
          Length = 280

 Score = 66.2 bits (160), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 26/57 (45%), Positives = 40/57 (70%)

Query: 98  LLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKREV 154
           L  +++ K+  TRGV+  P FFVNGF   G GSP+DY+ W  ++DPL+S+ G++ E+
Sbjct: 9   LAAKLTLKYGCTRGVFGAPFFFVNGFLEPGGGSPIDYSTWIGILDPLVSQNGERVEI 65


>gi|348690389|gb|EGZ30203.1| hypothetical protein PHYSODRAFT_553285 [Phytophthora sojae]
          Length = 225

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 58/114 (50%), Gaps = 5/114 (4%)

Query: 46  LEIV--QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE---SGFSDRSTDLLT 100
           LE+V   QE+F+N  T++ +   V  E+ + A +   +      E   +G+     D  T
Sbjct: 112 LEVVYEHQEEFWNKATKDKSAVQVTNELKQLAQKTFPDLTDQQWERGMTGYGGTEADQHT 171

Query: 101 RVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKREV 154
           RV++K++ TR V  TP + +NG     A S      WRKVIDPL+    ++ ++
Sbjct: 172 RVTWKYACTRTVTGTPQYTLNGVPFEDADSSWKLEDWRKVIDPLVKPNQERDDL 225


>gi|320169829|gb|EFW46728.1| hypothetical protein CAOG_04686 [Capsaspora owczarzaki ATCC 30864]
          Length = 261

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 52/98 (53%), Gaps = 3/98 (3%)

Query: 51  QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
           Q+ F + PT N+T   V+      + + IG   +  L+ G     TD   R+++K+  TR
Sbjct: 117 QDSFEDDPTVNLTSNQVINMYAALS-QTIGVPAAVFLK-GMDSDDTDESARIAWKYGCTR 174

Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
           GV  TP FFVNG S+A A S    + W  V+DPLL+  
Sbjct: 175 GVAGTPWFFVNGISVA-ASSAWSVSDWVSVLDPLLNSS 211


>gi|154420795|ref|XP_001583412.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121917653|gb|EAY22426.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 198

 Score = 59.3 bits (142), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 4/110 (3%)

Query: 37  YIKRRLNCLLEIVQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRST 96
           Y    LNCL     Q  F  +  +++T   + K+ +K+AAE +G + +  L   F   + 
Sbjct: 93  YAPILLNCLYADGNQSMFLGSEVKSVTAGDMQKKALKWAAEKLGITVND-LTKAFG--TQ 149

Query: 97  DLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
           ++ TR+ FK+SA   +  TPTF++NG + +   S    + W  VIDPLL+
Sbjct: 150 EMNTRIEFKYSAVHNIDGTPTFYINGVA-SDLSSDSTIDDWSNVIDPLLN 198


>gi|390354820|ref|XP_003728415.1| PREDICTED: uncharacterized protein LOC100888511 [Strongylocentrotus
           purpuratus]
          Length = 219

 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 52/99 (52%), Gaps = 2/99 (2%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
           +Q   YNA       + +V  + ++A++ +G S +  +        T  L RV FK+ AT
Sbjct: 119 KQAALYNANITEQPDSYIVNALSEWASD-VGYSKAQFMMHLSPSDPTQSLARVEFKYGAT 177

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
           RGV+ TP  F+NG  ++   +      W++VIDPLL+E 
Sbjct: 178 RGVFGTPQTFINGVVVSSKPN-WTLGDWKQVIDPLLAEN 215


>gi|118386030|ref|XP_001026137.1| hypothetical protein TTHERM_01145000 [Tetrahymena thermophila]
 gi|89307904|gb|EAS05892.1| hypothetical protein TTHERM_01145000 [Tetrahymena thermophila
           SB210]
          Length = 230

 Score = 55.8 bits (133), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 53/106 (50%), Gaps = 3/106 (2%)

Query: 51  QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
           QE+++N P  N TR AV  ++     + +     +   +  ++R+ DL  R S+K+  +R
Sbjct: 114 QEQYWNGPVTNYTRNAVFNKLAHQLHDFVPKISINETLAALTNRTYDLEARYSWKYGVSR 173

Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL--SEKGKKREV 154
            V  TP FF+NG  L GA S    + W + + P +     GK + V
Sbjct: 174 EVAGTPYFFLNGVQLDGAES-FSSSTWAETLLPYIQADSNGKPKSV 218


>gi|440791659|gb|ELR12897.1| hypothetical protein ACA1_095270 [Acanthamoeba castellanii str.
           Neff]
          Length = 212

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 53/96 (55%), Gaps = 1/96 (1%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
           QQ+ + +  TQ +    V+ ++ ++A        ++ +++     + +LL RV++K +A+
Sbjct: 117 QQDNWGDEATQGIAPLQVITQMSRYAQACCKVDPAAFVKAMGWVSAQNLLARVTWKRAAS 176

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
            G+Y TP F +NG S + A        WRK++DPLL
Sbjct: 177 LGIYGTPQFMINGVS-SSADESWTMADWRKLLDPLL 211


>gi|307945890|ref|ZP_07661226.1| conserved hypothetical protein [Roseibium sp. TrichSKD4]
 gi|307771763|gb|EFO30988.1| conserved hypothetical protein [Roseibium sp. TrichSKD4]
          Length = 191

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 51/99 (51%), Gaps = 4/99 (4%)

Query: 49  VQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE-SGFSDRSTDLLTRVSFKFS 107
            +Q  +YN P  + T   ++  I  FA +  G +    L+  G ++  TD   R   +F+
Sbjct: 95  ARQSDYYNGPFLHKTHQDLLNLIADFAHDFAGFNKDEMLKLIGTNEVYTD--ARTPIRFA 152

Query: 108 ATRGVYATPTFFVNGFSLAGAGS-PLDYNGWRKVIDPLL 145
           +T+GV+ATPTFF+N   L G  S       WR VIDP L
Sbjct: 153 STKGVWATPTFFLNSSDLVGKFSEEPSLEEWRAVIDPRL 191


>gi|301118869|ref|XP_002907162.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262105674|gb|EEY63726.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 298

 Score = 53.1 bits (126), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 53/98 (54%), Gaps = 4/98 (4%)

Query: 51  QEKFYNAPTQNMTRTAVVKEIVKFAAE---GIGNSYSSALESGFSDRSTDLLTRVSFKFS 107
           Q+ F+N  T++++   VV+++ K A +    + +       +G+     D  TR S+K++
Sbjct: 117 QDIFWNKATKDLSPIQVVEKLKKLAQKTFPSLTDKQWDEQMTGYGGTDVDDHTRESWKYT 176

Query: 108 ATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
            +RG+  TP + +NG   A A +   +  W KVIDPL+
Sbjct: 177 CSRGMSGTPMYTLNGVPFA-ADADWTFEQWYKVIDPLV 213


>gi|405953203|gb|EKC20909.1| Eppin [Crassostrea gigas]
          Length = 403

 Score = 53.1 bits (126), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 49/95 (51%), Gaps = 3/95 (3%)

Query: 52  EKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATRG 111
           +   N  T++M+   ++  +   AA G+G   S  ++   +D   D   RV +K++ TRG
Sbjct: 119 DTLSNTATKSMSEIQIISMLGDIAA-GLGIQKSVFVQK-MADPLVDEDARVEWKYTCTRG 176

Query: 112 VYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
           +  TP F +N   +A   S      WRKVIDPLLS
Sbjct: 177 ISGTPMFTINDVIVAADAS-WSLEDWRKVIDPLLS 210


>gi|348690388|gb|EGZ30202.1| hypothetical protein PHYSODRAFT_477416 [Phytophthora sojae]
          Length = 303

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 53/98 (54%), Gaps = 4/98 (4%)

Query: 51  QEKFYNAPTQNMTRTAVVKEIVKFAAE---GIGNSYSSALESGFSDRSTDLLTRVSFKFS 107
           Q+ F+N  T++++   VV ++ K A +    + +     L +G+     D  TR S+K++
Sbjct: 121 QDIFWNKVTKDLSPVQVVAKLKKLAQKTFPSLTDEQWDKLMTGYGGTDVDDHTRESWKYT 180

Query: 108 ATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
            +RG+  TP + +NG     A +   ++ W KVIDPL+
Sbjct: 181 CSRGMSGTPMYTLNGVPFE-ADADWTFDQWFKVIDPLV 217


>gi|156379454|ref|XP_001631472.1| predicted protein [Nematostella vectensis]
 gi|156218513|gb|EDO39409.1| predicted protein [Nematostella vectensis]
          Length = 214

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 45/95 (47%), Gaps = 3/95 (3%)

Query: 51  QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
           Q + YN  T N  R  ++  +   A +      S  +++G +   +D   R+ +K    R
Sbjct: 122 QSELYNFQTMNKNRYDILNIVSSLAPKA--GVPSDIMKTGLTGTESDGAARIGWKHGCLR 179

Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
            V  TPTFF+NG  +  A S      W+ VIDPLL
Sbjct: 180 TVAGTPTFFINGIPVE-ADSSWTVQQWKDVIDPLL 213


>gi|299115356|emb|CBN74181.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 262

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/98 (26%), Positives = 46/98 (46%), Gaps = 1/98 (1%)

Query: 49  VQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSA 108
           + Q++  N  T+ MT+  +   +  +A    G S +     G +D   ++ TR  FK+  
Sbjct: 148 IGQDEISNDSTELMTQPQIEDILESWATSSSGMS-ADNFHMGMADAEVEIQTRNQFKYGC 206

Query: 109 TRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
             GV  TP  ++ G   +G      +  W+ ++DPLLS
Sbjct: 207 LHGVDGTPQVYIGGILASGLDGDATFQDWQDILDPLLS 244


>gi|390368255|ref|XP_790435.2| PREDICTED: uncharacterized protein LOC585517 isoform 2
           [Strongylocentrotus purpuratus]
 gi|390368257|ref|XP_003731415.1| PREDICTED: uncharacterized protein LOC585517 isoform 1
           [Strongylocentrotus purpuratus]
          Length = 286

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 53/98 (54%), Gaps = 2/98 (2%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
           +Q    +A       + ++K + ++A+E +G S +  +        T  L RV FK+ AT
Sbjct: 183 KQAALSDANISEQPDSYILKVLSEWASE-VGYSKAQFMMHLSRSDPTQSLARVEFKYGAT 241

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
           RGV+ TP  F+NG ++  +      + W++VIDPLL++
Sbjct: 242 RGVFQTPQTFING-AVVSSEPRWTLSDWKQVIDPLLAK 278


>gi|254423276|ref|ZP_05036994.1| hypothetical protein S7335_3432 [Synechococcus sp. PCC 7335]
 gi|196190765|gb|EDX85729.1| hypothetical protein S7335_3432 [Synechococcus sp. PCC 7335]
          Length = 191

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 43/96 (44%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
           +Q +F N   ++ T+T     + + AAE              S +      R+  +F+  
Sbjct: 96  KQPEFANEAWKDKTQTEFHTFLAECAAEATAYKDKEQFLKLLSSKEIYAQARIPARFAIV 155

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
           RGV++TPTFF+NG       S      WR VID LL
Sbjct: 156 RGVWSTPTFFINGAEATTLSSSSSVQDWRSVIDDLL 191


>gi|123446098|ref|XP_001311803.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121893626|gb|EAX98873.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 199

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 49/98 (50%), Gaps = 7/98 (7%)

Query: 51  QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATR 110
           Q +F N P+   T   V    +K+AA+ I       + + F+    +L  RV FK+S + 
Sbjct: 107 QGQFENDPS--FTENTVTDNCIKYAAK-ICQLSEDDIRNAFATIDINLGARVEFKYSTSH 163

Query: 111 GVYATPTFFVNGF--SLAGAGSPLDYNGWRKVIDPLLS 146
           GV  TP FFVNG   +   AGS +D   W   +D LL+
Sbjct: 164 GVDGTPYFFVNGVVTNDVDAGSSID--DWSLYLDNLLN 199


>gi|260803503|ref|XP_002596629.1| hypothetical protein BRAFLDRAFT_78475 [Branchiostoma floridae]
 gi|229281888|gb|EEN52641.1| hypothetical protein BRAFLDRAFT_78475 [Branchiostoma floridae]
          Length = 154

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 55/105 (52%), Gaps = 6/105 (5%)

Query: 48  IVQQEKFYNAP-TQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLL--TRVSF 104
           I +Q+  Y  P T +M+ T ++ ++ K A E IG S S    S  +++   +    RV++
Sbjct: 9   IFEQQDDYKMPRTNDMSDTQIIDKLAKVA-ESIGVS-SKNFTSQVNNQEHMVYEDARVAW 66

Query: 105 KFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKG 149
           K+   RGV  TP + +NG  +  A        W+++ID LL ++G
Sbjct: 67  KYGCIRGVAGTPWYLLNGVPV-NASPNWTVAQWKQIIDSLLKQQG 110


>gi|260820710|ref|XP_002605677.1| hypothetical protein BRAFLDRAFT_77925 [Branchiostoma floridae]
 gi|229291012|gb|EEN61687.1| hypothetical protein BRAFLDRAFT_77925 [Branchiostoma floridae]
          Length = 292

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 37/69 (53%), Gaps = 4/69 (5%)

Query: 89  SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSL----AGAGSPLDYNGWRKVIDPL 144
           +G  + S +   RV FK++  RGV ATPT+FVNG  +    A  G       W++ +D +
Sbjct: 164 TGLDESSPNHRARVEFKYACHRGVAATPTYFVNGIMVNPVPATEGGEFGLEQWKETLDSM 223

Query: 145 LSEKGKKRE 153
           L    ++++
Sbjct: 224 LGTAEQRQQ 232


>gi|390343493|ref|XP_003725885.1| PREDICTED: uncharacterized protein LOC100891241 [Strongylocentrotus
           purpuratus]
          Length = 239

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 1/46 (2%)

Query: 101 RVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
           RV FK+ ATRGV+ TP  +VNG ++  + S      WR +IDPL+S
Sbjct: 169 RVEFKYGATRGVFGTPQTYVNG-AVVYSDSSWTLADWRNIIDPLIS 213


>gi|340501301|gb|EGR28101.1| hypothetical protein IMG5_183230 [Ichthyophthirius multifiliis]
          Length = 227

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 48/96 (50%), Gaps = 7/96 (7%)

Query: 61  NMTRTAVVKEIVKFAAEGIGN---SYS---SALESGFSDRSTDLLTRVSFKFSATRGVYA 114
           N T+T  + EI +  A+   +    YS   S +    +D++ +  TR S+K     GV  
Sbjct: 124 NKTKTLNLNEIYQKIADDTQSFLKEYSIKSSDVIDALNDQNYNQETRQSWKIGCGNGVSG 183

Query: 115 TPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGK 150
           TP F++NG  + GA +  D   W+K ++  L +K K
Sbjct: 184 TPIFYLNGVRIDGAET-FDVQNWKKFLEEYLQQKEK 218


>gi|444379557|ref|ZP_21178734.1| hypothetical protein D515_3633 [Enterovibrio sp. AK16]
 gi|443676286|gb|ELT82990.1| hypothetical protein D515_3633 [Enterovibrio sp. AK16]
          Length = 191

 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 9/100 (9%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDR--STDLLT--RVSFK 105
           +Q+ FYN    + T   +   +  +A E     +       F +R  S+D+    R   +
Sbjct: 96  RQDTFYNGQFLHKTHEDLRNLVADYAVE-----HGKVDREAFIERMDSSDVYEQGRTPIR 150

Query: 106 FSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
           ++ATR V+ATPT F+N   L           W+  IDPLL
Sbjct: 151 YAATRSVWATPTVFINNAGLVPVDHHSSLEDWQAAIDPLL 190


>gi|449017941|dbj|BAM81343.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 279

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 49/106 (46%), Gaps = 6/106 (5%)

Query: 49  VQQEKFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALE-----SGFSDRSTDLLTRV 102
            QQE + +    + TR  V +E+ +  A  G+ +S S   E     SG S  +     + 
Sbjct: 171 AQQEDWVDEKVLDKTRRQVYEELAQCLAKAGVVSSASLITERLLTASGNSGNAVGADVKF 230

Query: 103 SFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
           + ++   RGV+ TPT  +NG     A S      W++ ID LLS +
Sbjct: 231 ACRYHRVRGVHVTPTVHINGIPELNASSSWTVQEWKERIDSLLSSE 276


>gi|260820750|ref|XP_002605697.1| hypothetical protein BRAFLDRAFT_77946 [Branchiostoma floridae]
 gi|229291032|gb|EEN61707.1| hypothetical protein BRAFLDRAFT_77946 [Branchiostoma floridae]
          Length = 292

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 4/69 (5%)

Query: 89  SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSL----AGAGSPLDYNGWRKVIDPL 144
           +G  + S +   RV FK++  R V ATPT+FVNG  +    A  G       W++ +D +
Sbjct: 164 TGLDESSPNHRARVEFKYACHRSVAATPTYFVNGIMVNPVPATVGGSFGLEQWKETLDSM 223

Query: 145 LSEKGKKRE 153
           L    +++E
Sbjct: 224 LGTAEQRQE 232


>gi|440795402|gb|ELR16524.1| hypothetical protein ACA1_146580 [Acanthamoeba castellanii str.
           Neff]
          Length = 194

 Score = 44.3 bits (103), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 28/48 (58%)

Query: 99  LTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
           L +   K+    G++A+PTF VNG  +  A S      WR+++DPLL+
Sbjct: 147 LIKWEQKYGRQNGIHASPTFLVNGLVVGQASSGWTLEQWRELLDPLLA 194


>gi|290978395|ref|XP_002671921.1| predicted protein [Naegleria gruberi]
 gi|284085494|gb|EFC39177.1| predicted protein [Naegleria gruberi]
          Length = 244

 Score = 44.3 bits (103), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 42/92 (45%), Gaps = 9/92 (9%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIG----NSYSSALESGFSDRSTDLLTRVSFK 105
            Q   YN     M ++ + K I     + +     N Y + +  G    S D+ TR  FK
Sbjct: 147 NQSPLYNNIVSEMKQSEMYKLIYTNNIQSLNVISWNDYLNLMLDG----SIDMRTRTLFK 202

Query: 106 FSATRGVYATPTFFVNGFSLAGAGSPLDYNGW 137
            S   GV+ATP+F+VNG ++   G    +N W
Sbjct: 203 LSTANGVFATPSFYVNGVAIQN-GENFTFNDW 233


>gi|172055207|ref|YP_001806534.1| hypothetical protein cce_5122 [Cyanothece sp. ATCC 51142]
 gi|354556960|ref|ZP_08976239.1| DSBA oxidoreductase [Cyanothece sp. ATCC 51472]
 gi|171701488|gb|ACB54468.1| unknown [Cyanothece sp. ATCC 51142]
 gi|353551086|gb|EHC20503.1| DSBA oxidoreductase [Cyanothece sp. ATCC 51472]
          Length = 191

 Score = 43.5 bits (101), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 27/45 (60%)

Query: 101 RVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
           R+  +F+A RGV++TPTFF+NG       S  +   W+  I+ LL
Sbjct: 147 RIPARFAAIRGVWSTPTFFINGAQTTDLSSQSNLQDWQDKINSLL 191


>gi|422295502|gb|EKU22801.1| hypothetical protein NGA_0488300 [Nannochloropsis gaditana CCMP526]
          Length = 228

 Score = 43.1 bits (100), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 25/102 (24%), Positives = 44/102 (43%), Gaps = 1/102 (0%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSAT 109
           +Q +  N  TQ MT  AV   +  +  E  G      +     +   +   R SFK++  
Sbjct: 127 RQIQVSNELTQGMTYKAVEGVVAAWVQEATGLE-REVVRRKLQEGGVEEAVRFSFKYATI 185

Query: 110 RGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKK 151
           R ++ TP F +N   + G     D   W + + PLL++  ++
Sbjct: 186 RSIFGTPMFVINQMLVPGLDGGSDVMDWEEYLAPLLTQMARE 227


>gi|422293098|gb|EKU20398.1| hypothetical protein NGA_0546800, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 199

 Score = 42.7 bits (99), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 27/99 (27%), Positives = 43/99 (43%), Gaps = 11/99 (11%)

Query: 52  EKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE---------SGFSDRSTDLLTRV 102
           E+F +  T+ MTR  V+  ++    E        ALE         SG    +T    + 
Sbjct: 103 ERFSDVNTEEMTRKEVMAALIALGVEA--GVPRPALEERLALQGGSSGNKGTATTQAVKW 160

Query: 103 SFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVI 141
           + K+   RGV+ TPT FVNG   +   S      W++++
Sbjct: 161 ACKYHRRRGVHVTPTVFVNGLEASEVSSAWGVEEWKRLL 199


>gi|291237332|ref|XP_002738589.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 270

 Score = 42.7 bits (99), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 22/87 (25%), Positives = 36/87 (41%)

Query: 61  NMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFV 120
           N T + ++  + + A     N    +    F D  T  L    ++ +  RGVY TP F +
Sbjct: 133 NTTESDIINALAELATGLSSNITRDSFYDKFEDEKTGHLCVYEWQSAILRGVYQTPWFLI 192

Query: 121 NGFSLAGAGSPLDYNGWRKVIDPLLSE 147
           N   +         + W  +IDPLL +
Sbjct: 193 NDMPILDFRPDWTLSNWTAIIDPLLED 219


>gi|313227081|emb|CBY22228.1| unnamed protein product [Oikopleura dioica]
          Length = 217

 Score = 42.4 bits (98), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 31/60 (51%)

Query: 89  SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
              +D + D  TRV +KF+ + GV  TP  F+N   L    +      W KV+DP++ +K
Sbjct: 151 DNMADPNLDWATRVDWKFACSLGVSGTPMPFINRVFLDQGVAEFTLADWTKVLDPIVGQK 210


>gi|291237330|ref|XP_002738588.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 215

 Score = 42.0 bits (97), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 36/76 (47%), Gaps = 2/76 (2%)

Query: 71  IVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGS 130
           +  FA E +G +    LE  ++   T+ L R   K +  RGVY  P F VNG ++     
Sbjct: 142 LTDFAVE-VGFTEEEFLEE-YNHPKTNQLCRHEMKMANDRGVYGGPWFVVNGMTVLDYYP 199

Query: 131 PLDYNGWRKVIDPLLS 146
             D   W  +IDPLL 
Sbjct: 200 YWDVEDWISLIDPLLG 215


>gi|413936627|gb|AFW71178.1| hypothetical protein ZEAMMB73_653237 [Zea mays]
          Length = 340

 Score = 42.0 bits (97), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 37/69 (53%), Gaps = 15/69 (21%)

Query: 86  ALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
            L SG  DR+T++  ++  K  A              + L G GSP+DY+ W  ++DPL+
Sbjct: 133 GLRSGSEDRATEIFKKI-IKLLA--------------YKLPGGGSPIDYSTWIGILDPLV 177

Query: 146 SEKGKKREV 154
           S+ G++ E+
Sbjct: 178 SQNGERVEM 186


>gi|124088536|ref|XP_001347136.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|145474255|ref|XP_001423150.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|50057525|emb|CAH03509.1| Conserved hypothetical protein [Paramecium tetraurelia]
 gi|124390210|emb|CAK55752.1| unnamed protein product [Paramecium tetraurelia]
          Length = 214

 Score = 42.0 bits (97), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 42/93 (45%), Gaps = 4/93 (4%)

Query: 52  EKFYNAPTQNMTRTAVVKEIVKFAAEGIGN---SYSSALESGFSDRSTDLLTRVSFKFSA 108
           E+F +  T N+ +T V ++I       +     +Y   L S       ++ TR S+K+  
Sbjct: 116 EQFTDLATLNLKQTEVNQKIADLVKTQLAQYQINYDELLNSMKPGTPENIETRYSWKYGT 175

Query: 109 TRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVI 141
           +R V  TPT FVNG  L   G  L    W + I
Sbjct: 176 SRSVSGTPTIFVNGV-LFDKGEELSAQQWIQYI 207


>gi|452822422|gb|EME29441.1| thioredoxin-like protein [Galdieria sulphuraria]
          Length = 244

 Score = 41.6 bits (96), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 31/63 (49%)

Query: 89  SGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
           +G        L + + K+S   GV+ +PT+FVNG     AGS  +   W + + PLL + 
Sbjct: 182 TGNDGNQVQSLMKFAIKYSRKLGVHISPTYFVNGIEDTAAGSAWNKEEWLERLRPLLQQG 241

Query: 149 GKK 151
             K
Sbjct: 242 HTK 244


>gi|444919615|ref|ZP_21239615.1| Periplasmic thiol disulfide interchange protein DsbA [Cystobacter
           fuscus DSM 2262]
 gi|444708167|gb|ELW49260.1| Periplasmic thiol disulfide interchange protein DsbA [Cystobacter
           fuscus DSM 2262]
          Length = 652

 Score = 41.6 bits (96), Expect = 0.096,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 41/70 (58%), Gaps = 9/70 (12%)

Query: 81  NSYSSALESG-FSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRK 139
           N + +AL+SG F+ + T  +       +   GV  TP+FF+NG SL GA  P+D   +++
Sbjct: 584 NKFKAALDSGKFTAQVTADMAE-----AGRVGVTGTPSFFINGRSLVGA-QPID--AFKR 635

Query: 140 VIDPLLSEKG 149
           VID  L +KG
Sbjct: 636 VIDEELKKKG 645


>gi|443723886|gb|ELU12105.1| hypothetical protein CAPTEDRAFT_202202 [Capitella teleta]
          Length = 208

 Score = 41.6 bits (96), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 17/44 (38%), Positives = 27/44 (61%)

Query: 103 SFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
           ++K++A RG   TP F VN   LA  GS   +  W +++DPL++
Sbjct: 165 TWKYAALRGTAGTPWFHVNNVDLASGGSSPTFEEWVELLDPLIN 208


>gi|374606371|ref|ZP_09679248.1| DSBA oxidoreductase [Paenibacillus dendritiformis C454]
 gi|374388016|gb|EHQ59461.1| DSBA oxidoreductase [Paenibacillus dendritiformis C454]
          Length = 216

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 41/95 (43%), Gaps = 2/95 (2%)

Query: 53  KFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATRGV 112
           + Y A  +     A    +V FA   + +  ++ LE    DR+         + +A   V
Sbjct: 122 ELYRAQQEERKDWATAPFLVDFAQRTVPDLDTAQLEKALQDRTMRDEVMKDEQQAAQAQV 181

Query: 113 YATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSE 147
           + TPT FV+G      G  LDY G + +ID  L +
Sbjct: 182 HGTPTVFVDGVEF--NGDYLDYEGLKAMIDKALEQ 214


>gi|383454942|ref|YP_005368931.1| DSBA-like thioredoxin domain-containing protein [Corallococcus
           coralloides DSM 2259]
 gi|380733594|gb|AFE09596.1| DSBA-like thioredoxin domain-containing protein [Corallococcus
           coralloides DSM 2259]
          Length = 659

 Score = 41.2 bits (95), Expect = 0.16,   Method: Composition-based stats.
 Identities = 29/80 (36%), Positives = 42/80 (52%), Gaps = 11/80 (13%)

Query: 73  KFAAEGIG---NSYSSALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAG 129
           K+A E IG   N + + L+SG  D+  +          + +G   TPTFF+NG +L GA 
Sbjct: 580 KYAQE-IGLNVNKFKTDLDSGKYDKQIE----ADMADGSAKGANGTPTFFINGRTLVGA- 633

Query: 130 SPLDYNGWRKVIDPLLSEKG 149
            P D   ++KVID  L + G
Sbjct: 634 QPFD--AFKKVIDEELKKAG 651


>gi|313220976|emb|CBY31809.1| unnamed protein product [Oikopleura dioica]
          Length = 217

 Score = 40.8 bits (94), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 30/57 (52%)

Query: 92  SDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
           +D + D  TRV +KF  + GV  TP  F+N   L    +      W KV+DP++ +K
Sbjct: 154 ADPNLDWATRVDWKFVCSLGVSGTPMPFINRVFLDQGVAEFTLADWTKVLDPIVGQK 210


>gi|126659892|ref|ZP_01731017.1| glutathione reductase [Cyanothece sp. CCY0110]
 gi|126618855|gb|EAZ89599.1| glutathione reductase [Cyanothece sp. CCY0110]
          Length = 191

 Score = 40.0 bits (92), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 17/45 (37%), Positives = 26/45 (57%)

Query: 101 RVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLL 145
           R+  +F+A +GV++TPTFF+NG       S      W+  I+ LL
Sbjct: 147 RIPARFAALQGVWSTPTFFINGAQTTDLSSQSSLQDWQDKINSLL 191


>gi|406893247|gb|EKD38362.1| disulfide bond formation protein D, selenocysteine-containing
           [uncultured bacterium]
          Length = 142

 Score = 39.7 bits (91), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 38/68 (55%), Gaps = 7/68 (10%)

Query: 83  YSSALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVID 142
           Y+ +++SG   +  D       K + +  +Y TPTF++NG  + G   P DY  ++K+ID
Sbjct: 82  YTESIDSGRQAKEIDR----DLKLAESMDLYNTPTFYINGRQVIGE-RPFDY--FKKIID 134

Query: 143 PLLSEKGK 150
             L++ G+
Sbjct: 135 EELNQAGR 142


>gi|323452082|gb|EGB07957.1| hypothetical protein AURANDRAFT_6387 [Aureococcus anophagefferens]
          Length = 173

 Score = 38.9 bits (89), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 24/95 (25%), Positives = 44/95 (46%), Gaps = 7/95 (7%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFA-----AEGIGNSYSSALESGF--SDRSTDLLTRV 102
           +Q +F++A   +M+R  +   +V+ A     AE +    +  +  G   +  +  L  + 
Sbjct: 79  RQVEFFDAHIWDMSRAQIYDALVEIASEFVDAEALRAKLARTIVEGSLNTGNAATLDLKW 138

Query: 103 SFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGW 137
           + K+   RGV+ TPT F+NG       S  D + W
Sbjct: 139 ATKYHRVRGVHVTPTVFLNGVEAPDVSSGWDADQW 173


>gi|390368260|ref|XP_003731416.1| PREDICTED: uncharacterized protein LOC100890543 [Strongylocentrotus
           purpuratus]
          Length = 509

 Score = 38.5 bits (88), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 27/57 (47%)

Query: 66  AVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNG 122
           A V+E +   A  +G   +           T  L RV FK+  TRGV+ TP  FVNG
Sbjct: 130 AYVQETLAAWAGEVGYDQTEFASRLARSDDTQWLARVEFKYGGTRGVFGTPQTFVNG 186


>gi|153003349|ref|YP_001377674.1| DSBA oxidoreductase [Anaeromyxobacter sp. Fw109-5]
 gi|152026922|gb|ABS24690.1| DSBA oxidoreductase [Anaeromyxobacter sp. Fw109-5]
          Length = 354

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 27/46 (58%), Gaps = 3/46 (6%)

Query: 105 KFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGK 150
           K     GV  TP FF+NG  L+GA  PLD   ++KVID  L+  GK
Sbjct: 312 KAGEEAGVSGTPAFFINGRMLSGA-QPLD--AFKKVIDQELASAGK 354


>gi|417862695|ref|ZP_12507745.1| DSBA oxidoreductase [Agrobacterium tumefaciens F2]
 gi|338819957|gb|EGP53931.1| DSBA oxidoreductase [Agrobacterium tumefaciens F2]
          Length = 206

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 41/67 (61%), Gaps = 9/67 (13%)

Query: 87  LESGFSDRSTDLLTRVSFKFSATR---GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDP 143
           LE+  +D++  LL  V+ K +AT    GV +TPTFF+NG    G   P+D   +R +++P
Sbjct: 146 LEACLADQA--LLDNVN-KIAATGRELGVTSTPTFFINGNKYEGV-IPVDE--FRSIVEP 199

Query: 144 LLSEKGK 150
           LL E GK
Sbjct: 200 LLKEAGK 206


>gi|358057096|dbj|GAA97003.1| hypothetical protein E5Q_03677 [Mixia osmundae IAM 14324]
          Length = 198

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 22/102 (21%), Positives = 42/102 (41%), Gaps = 5/102 (4%)

Query: 50  QQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE-----SGFSDRSTDLLTRVSF 104
            QE++++   +N T   + + + K A + +G   S+ L+      G S        ++  
Sbjct: 97  HQEEYFDEAVENETLAQIRRRLAKLAHDSVGVEESAFLDLVKTGQGNSGNKVGADLKLQV 156

Query: 105 KFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLS 146
           K     G++ TPT  ++G       S    + W K +   LS
Sbjct: 157 KLGRQNGIHVTPTVLLDGLVDPSVSSSFGKDEWAKYVKEKLS 198


>gi|418054002|ref|ZP_12692058.1| DSBA oxidoreductase [Hyphomicrobium denitrificans 1NES1]
 gi|353211627|gb|EHB77027.1| DSBA oxidoreductase [Hyphomicrobium denitrificans 1NES1]
          Length = 233

 Score = 37.4 bits (85), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 29/52 (55%), Gaps = 3/52 (5%)

Query: 97  DLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
           D +T    + S T GV ATPTFF+NG  L  A +        KVI+PLL+ K
Sbjct: 185 DQITAQRTRASDTFGVNATPTFFINGKKLQEAPT---LEALDKVIEPLLAAK 233


>gi|338740969|ref|YP_004677931.1| protein disulfide isomerase [Hyphomicrobium sp. MC1]
 gi|337761532|emb|CCB67367.1| putative protein disulfide isomerase, putative protein precursor
           (tat pathway signal) [Hyphomicrobium sp. MC1]
          Length = 223

 Score = 37.4 bits (85), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 3/52 (5%)

Query: 97  DLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
           D +T    + S T GV ATPTFF+NG  L  A +      + KV++PLL+ K
Sbjct: 175 DAITAERSRASETFGVNATPTFFINGKKLQEAPT---MEAFDKVLEPLLAGK 223


>gi|406929335|gb|EKD64938.1| DsbA-like protein thioredoxin protein [uncultured bacterium]
          Length = 256

 Score = 37.0 bits (84), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 34/136 (25%), Positives = 63/136 (46%), Gaps = 17/136 (12%)

Query: 21  ASYFRDKFVCFFLVVLYIKRRLNCLLEIVQQEKFYNAPTQNM-TRTA------VVKEIVK 73
           +S++      F L+  +    L+  +  +++    N P Q+   +TA      +  E +K
Sbjct: 5   SSFYMPILFGFLLIAAFFLGSLSTKVSYLEKGVSSNTPNQDSPAQTAPNQLPTITNEDMK 64

Query: 74  FAAEGIG---NSYSSALESGFSDRSTDLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGS 130
             A+ +G   N ++S L+S     + D       + +AT GV  TPTFF+NG  + GA  
Sbjct: 65  QWAKDLGLNTNDFNSCLDSEKHKSNVD----KDLQDAATAGVSGTPTFFINGTMIVGAQP 120

Query: 131 PLDYNGWRKVIDPLLS 146
              +  ++ +ID  L+
Sbjct: 121 ---FENFKTIIDQELA 133


>gi|154245118|ref|YP_001416076.1| DSBA oxidoreductase [Xanthobacter autotrophicus Py2]
 gi|154159203|gb|ABS66419.1| DSBA oxidoreductase [Xanthobacter autotrophicus Py2]
          Length = 270

 Score = 37.0 bits (84), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 24/38 (63%), Gaps = 3/38 (7%)

Query: 111 GVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
           GV ATPTFF+NG  ++GA   L    W K + PLL+ K
Sbjct: 236 GVNATPTFFINGKKISGA---LGIAEWDKELAPLLAGK 270


>gi|300024387|ref|YP_003756998.1| DSBA oxidoreductase [Hyphomicrobium denitrificans ATCC 51888]
 gi|299526208|gb|ADJ24677.1| DSBA oxidoreductase [Hyphomicrobium denitrificans ATCC 51888]
          Length = 233

 Score = 36.6 bits (83), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 3/52 (5%)

Query: 97  DLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEK 148
           D +T    + S T GV ATPTFF+NG  L    +      + KVI+PLL+ K
Sbjct: 185 DQITAQRTRASDTFGVNATPTFFINGKKLPETPT---LEAFDKVIEPLLAAK 233


>gi|384439999|ref|YP_005654723.1| Thiol:disulfide interchange protein dsbA [Thermus sp. CCB_US3_UF1]
 gi|359291132|gb|AEV16649.1| Thiol:disulfide interchange protein dsbA [Thermus sp. CCB_US3_UF1]
          Length = 212

 Score = 36.6 bits (83), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 3/46 (6%)

Query: 105 KFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGK 150
           K +   G+  TPTFFVNG    G    +D+  WR+ +D  L+  GK
Sbjct: 170 KLATDLGLTGTPTFFVNGEKYGGY---MDFAKWREALDKALAGGGK 212


>gi|363752563|ref|XP_003646498.1| hypothetical protein Ecym_4660 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356890133|gb|AET39681.1| hypothetical protein Ecym_4660 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 202

 Score = 35.8 bits (81), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 24/98 (24%), Positives = 45/98 (45%), Gaps = 4/98 (4%)

Query: 51  QEKFYNAPTQNMTRTAVVKEIVKFAAEGIG--NSYSSALESGFSDRSTDLLTRVSF--KF 106
           Q++++++ T + +R  +  E+  FA E +         L    SD    ++  V +  ++
Sbjct: 101 QQRWFDSNTADKSRNEIYTELALFAQESVNLPADSLLPLLLVSSDDGNAVVKDVKYFTRY 160

Query: 107 SATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPL 144
               GV+ TPT  +NG  +    S    +   K+IDPL
Sbjct: 161 HRQNGVHVTPTLALNGIIIPAIESSTPTDRVWKIIDPL 198


>gi|386038198|ref|YP_005961074.1| putative protein-disulfide oxidoreductase [Paenibacillus polymyxa
           M1]
 gi|343098159|emb|CCC86367.1| putative protein-disulfide oxidoreductase [Paenibacillus polymyxa
           M1]
          Length = 228

 Score = 35.8 bits (81), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 47/105 (44%), Gaps = 10/105 (9%)

Query: 52  EKFYN---APTQNMTRTAVVKEIVKFAAEGIG-NSYSSALESGFSDRSTDLLTRVSFKFS 107
           E+ YN     ++N    A + EI K A   +  N    +L++G      D +      FS
Sbjct: 130 EQIYNLQKDESENWATIAYLVEIAKIAKVKVDYNEMERSLKAG----EFDEMVNKDLNFS 185

Query: 108 ATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKR 152
              G+  TPT  VNG  +   G  LDY   R  I+ ++++  +++
Sbjct: 186 RGLGIRQTPTLVVNGMVIETGG--LDYEQIRSAINDVINKTEERK 228


>gi|406994810|gb|EKE13735.1| hypothetical protein ACD_12C00848G0004 [uncultured bacterium]
          Length = 275

 Score = 35.4 bits (80), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 52/107 (48%), Gaps = 16/107 (14%)

Query: 52  EKFYNAPTQNMTRTAVVKEI------VKFAAEGIG---NSYSSALESGFSDRSTDLLTRV 102
           EK  N+ T     TAV +E       +K  A+ +G     ++  L+SG  ++ T  L   
Sbjct: 47  EKKINSGTGGQAGTAVQQESPISVPKLKIYAKELGLNTGKFNKCLDSG--EKKT--LVEK 102

Query: 103 SFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKG 149
             K+  + GV  TP FF+NG  L GA     ++ ++++ID  +S  G
Sbjct: 103 DTKYGLSLGVQGTPGFFINGKFLGGA---FPFSAFKEIIDKEISGTG 146


>gi|383458665|ref|YP_005372654.1| amidohydrolase [Corallococcus coralloides DSM 2259]
 gi|380731071|gb|AFE07073.1| amidohydrolase domain-containing protein [Corallococcus coralloides
           DSM 2259]
          Length = 426

 Score = 35.4 bits (80), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 27/57 (47%), Gaps = 7/57 (12%)

Query: 97  DLLTRVSFKFSATRGVYATPTFFVNGFSLAGAGSPLDYNGWRKVIDPLLSEKGKKRE 153
           D  T   +K  A RGV+  PT       LA   + L Y GW++ +DP      KKRE
Sbjct: 276 DAGTPEVWKLMAQRGVFLCPT-------LAAGDAMLQYRGWKRGVDPEPESAKKKRE 325


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.326    0.139    0.410 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,335,065,012
Number of Sequences: 23463169
Number of extensions: 87943705
Number of successful extensions: 257578
Number of sequences better than 100.0: 99
Number of HSP's better than 100.0 without gapping: 66
Number of HSP's successfully gapped in prelim test: 33
Number of HSP's that attempted gapping in prelim test: 257471
Number of HSP's gapped (non-prelim): 104
length of query: 160
length of database: 8,064,228,071
effective HSP length: 122
effective length of query: 38
effective length of database: 9,496,688,749
effective search space: 360874172462
effective search space used: 360874172462
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 71 (32.0 bits)