BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 029265
         (196 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224085938|ref|XP_002307747.1| predicted protein [Populus trichocarpa]
 gi|222857196|gb|EEE94743.1| predicted protein [Populus trichocarpa]
          Length = 212

 Score =  233 bits (595), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 111/167 (66%), Positives = 132/167 (79%)

Query: 30  KSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVV 89
           +SQ+ PPA+YDGF Y N   D ++I++EAFFDPVCPDSRD+WPPLK+AL+HYG  V LVV
Sbjct: 5   ESQNLPPARYDGFVYENRQGDLNSILVEAFFDPVCPDSRDSWPPLKEALKHYGSRVWLVV 64

Query: 90  HLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEI 149
           HLLPLPYHDNA+ + RALHI N  NSS TF LLE FFK QEKFYNA T N+++T++V+EI
Sbjct: 65  HLLPLPYHDNAFVSCRALHIANTLNSSFTFPLLEEFFKHQEKFYNAKTSNLSKTSIVEEI 124

Query: 150 VKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
           VKFA   +GNSYSSA ESGF+DR TDL TRVSFK +     F T FF
Sbjct: 125 VKFATVAVGNSYSSAFESGFNDRQTDLKTRVSFKYSTSRGVFGTPFF 171


>gi|409030182|gb|AFV07578.1| thioredoxin f5, partial [Solanum tuberosum]
          Length = 206

 Score =  223 bits (568), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 113/185 (61%), Positives = 132/185 (71%), Gaps = 5/185 (2%)

Query: 17  SALLCFF--VFNSCSK---SQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAW 71
           S +L FF  V N   +   +QS PPA+YDGF Y     DS+ I+IEAF+DPVCPDSRD+W
Sbjct: 10  STVLLFFLSVINVSVRDVDAQSLPPARYDGFPYQKQLDDSNTILIEAFYDPVCPDSRDSW 69

Query: 72  PPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEK 131
           PPLKQAL  YG   SLVVHLLPLPYHDNA+  SR LHI N  N+S+TF LLE F+K QE+
Sbjct: 70  PPLKQALHFYGSRTSLVVHLLPLPYHDNAFVASRTLHIANELNTSSTFPLLERFYKHQER 129

Query: 132 FYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSF 191
           FYNA T N +R  VVKEIVKFA+  +GNSY SA+ESG +DR TDL TRVSFK +     F
Sbjct: 130 FYNAQTHNKSRATVVKEIVKFASAAVGNSYCSAIESGLNDRQTDLKTRVSFKYSASRGVF 189

Query: 192 NTSFF 196
            T  F
Sbjct: 190 ATPTF 194


>gi|449450508|ref|XP_004143004.1| PREDICTED: uncharacterized protein LOC101216804 isoform 1 [Cucumis
           sativus]
 gi|449450510|ref|XP_004143005.1| PREDICTED: uncharacterized protein LOC101216804 isoform 2 [Cucumis
           sativus]
 gi|449521601|ref|XP_004167818.1| PREDICTED: uncharacterized protein LOC101231840 isoform 1 [Cucumis
           sativus]
 gi|449521603|ref|XP_004167819.1| PREDICTED: uncharacterized protein LOC101231840 isoform 2 [Cucumis
           sativus]
          Length = 236

 Score =  223 bits (567), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 107/196 (54%), Positives = 137/196 (69%)

Query: 1   MQSPSPNKNHATLILQSALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFF 60
           M+  +   +H   +L  +     +F+  + +QS PP K+DGF Y NH +D + I IEAFF
Sbjct: 1   MEKTNSTSSHLQSLLAFSFSFLALFHLTAHAQSLPPPKFDGFVYGNHSLDFNTIHIEAFF 60

Query: 61  DPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFC 120
           DPVCPDSRD+WPPLK+AL HYG  V LV+HLLPLPYHDNAYA SRALHIV+  N S TF 
Sbjct: 61  DPVCPDSRDSWPPLKKALDHYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSDTFK 120

Query: 121 LLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRV 180
           LLE FF  Q++FYNA T+ ++R A+V  +VKF  E +G+SY + L +GF+DR TDLLTRV
Sbjct: 121 LLEAFFGDQKQFYNAETRYLSRAAIVDSMVKFGVEVLGDSYKNTLVTGFNDRETDLLTRV 180

Query: 181 SFKVTQKFFSFNTSFF 196
           SFK +     + T FF
Sbjct: 181 SFKFSTSRGVYGTPFF 196


>gi|449450512|ref|XP_004143006.1| PREDICTED: uncharacterized protein LOC101216804 isoform 3 [Cucumis
           sativus]
          Length = 201

 Score =  221 bits (564), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 105/184 (57%), Positives = 133/184 (72%)

Query: 1   MQSPSPNKNHATLILQSALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFF 60
           M+  +   +H   +L  +     +F+  + +QS PP K+DGF Y NH +D + I IEAFF
Sbjct: 1   MEKTNSTSSHLQSLLAFSFSFLALFHLTAHAQSLPPPKFDGFVYGNHSLDFNTIHIEAFF 60

Query: 61  DPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFC 120
           DPVCPDSRD+WPPLK+AL HYG  V LV+HLLPLPYHDNAYA SRALHIV+  N S TF 
Sbjct: 61  DPVCPDSRDSWPPLKKALDHYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSDTFK 120

Query: 121 LLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRV 180
           LLE FF  Q++FYNA T+ ++R A+V  +VKF  E +G+SY + L +GF+DR TDLLTRV
Sbjct: 121 LLEAFFGDQKQFYNAETRYLSRAAIVDSMVKFGVEVLGDSYKNTLVTGFNDRETDLLTRV 180

Query: 181 SFKV 184
           SFKV
Sbjct: 181 SFKV 184


>gi|224061957|ref|XP_002300683.1| predicted protein [Populus trichocarpa]
 gi|222842409|gb|EEE79956.1| predicted protein [Populus trichocarpa]
          Length = 223

 Score =  215 bits (548), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 108/192 (56%), Positives = 135/192 (70%), Gaps = 2/192 (1%)

Query: 7   NKNHATLILQSALLCFFVFNSCSKS--QSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVC 64
            + H   ++Q + +  F+ +S      Q+ PP +YDGF Y N    SD+I+IEAFFDPVC
Sbjct: 2   KRQHHFAVIQFSYIFLFITSSQWNVLPQNLPPLRYDGFVYENRKGGSDSILIEAFFDPVC 61

Query: 65  PDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEW 124
           PDSRD WPPL++AL+HYG  VSLVVHLLPLPYHDNA+  SRALHI N  N S TF LLE 
Sbjct: 62  PDSRDTWPPLQKALKHYGSRVSLVVHLLPLPYHDNAFVASRALHIANILNCSFTFPLLEQ 121

Query: 125 FFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKV 184
           FFK QEKFY + T N+++ ++VKEIVKFA   +G+SYSS L+ GF+D  TDL TRVSFK 
Sbjct: 122 FFKHQEKFYGSETSNLSKDSIVKEIVKFATVIVGDSYSSPLQFGFNDIQTDLKTRVSFKY 181

Query: 185 TQKFFSFNTSFF 196
           +     + T FF
Sbjct: 182 SASRGVYATPFF 193


>gi|409030172|gb|AFV07573.1| DsbA c3, partial [Solanum tuberosum]
          Length = 194

 Score =  215 bits (548), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 110/185 (59%), Positives = 130/185 (70%), Gaps = 5/185 (2%)

Query: 17  SALLCFF--VFNSCSK---SQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAW 71
           S +L FF  V N   +   +QS PPA+YDGF Y     DS+ I+IEAF+DPVCPDSRD+W
Sbjct: 10  STVLLFFLSVINVSVRDVDAQSLPPARYDGFPYQKQLDDSNTILIEAFYDPVCPDSRDSW 69

Query: 72  PPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEK 131
           PPLKQAL  YG   SLVVHLLPLPYHDNA+  SR LHI N  N+S+TF LLE F+K QE+
Sbjct: 70  PPLKQALHFYGSRTSLVVHLLPLPYHDNAFVASRTLHIANELNTSSTFPLLERFYKHQER 129

Query: 132 FYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSF 191
           FYNA T N +R  VVKEIV+FA+  +GNS  SA+ESG  DR TDL TRVSF+ +     F
Sbjct: 130 FYNAQTHNKSRATVVKEIVRFASAAVGNSCCSAIESGSYDRQTDLKTRVSFQYSASRGVF 189

Query: 192 NTSFF 196
            T  F
Sbjct: 190 GTPTF 194


>gi|302141727|emb|CBI18930.3| unnamed protein product [Vitis vinifera]
          Length = 514

 Score =  209 bits (532), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 106/176 (60%), Positives = 126/176 (71%), Gaps = 1/176 (0%)

Query: 22  FFVFNSCSKSQSTP-PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQH 80
           F+VF   S  ++   PAK+DGF Y       D+IIIEAFFDPVCPDSRDAWPPLK+A+ +
Sbjct: 19  FYVFEHESIVRAVSIPAKFDGFVYNYTGTGIDSIIIEAFFDPVCPDSRDAWPPLKRAIAY 78

Query: 81  YGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNM 140
           Y P VSL+VH   LPYHDNA+ATSRALHIVN+ NSSAT+ LLE  FK QE FYN  T NM
Sbjct: 79  YAPRVSLIVHPFALPYHDNAFATSRALHIVNKLNSSATYHLLEMLFKHQEIFYNQITVNM 138

Query: 141 TRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
           +RTA+V  IVKF ++ +G S  SA++SGFSDR TDL TRVSFK         T +F
Sbjct: 139 SRTAIVDCIVKFVSKAVGESLFSAIKSGFSDRQTDLTTRVSFKYGCSRGVLGTPYF 194


>gi|359492074|ref|XP_003634361.1| PREDICTED: uncharacterized protein LOC100854733 [Vitis vinifera]
          Length = 234

 Score =  208 bits (530), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 104/163 (63%), Positives = 123/163 (75%), Gaps = 1/163 (0%)

Query: 22  FFVFNSCSKSQSTP-PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQH 80
           F+VF   S  ++   PAK+DGF Y       D+IIIEAFFDPVCPDSRDAWPPLK+A+ +
Sbjct: 19  FYVFEHESIVRAVSIPAKFDGFVYNYTGTGIDSIIIEAFFDPVCPDSRDAWPPLKRAIAY 78

Query: 81  YGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNM 140
           Y P VSL+VH   LPYHDNA+ATSRALHIVN+ NSSAT+ LLE  FK QE FYN  T NM
Sbjct: 79  YAPRVSLIVHPFALPYHDNAFATSRALHIVNKLNSSATYHLLEMLFKHQEIFYNQITVNM 138

Query: 141 TRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
           +RTA+V  IVKF ++ +G S  SA++SGFSDR TDL TRVSFK
Sbjct: 139 SRTAIVDCIVKFVSKAVGESLFSAIKSGFSDRQTDLTTRVSFK 181


>gi|255539052|ref|XP_002510591.1| conserved hypothetical protein [Ricinus communis]
 gi|223551292|gb|EEF52778.1| conserved hypothetical protein [Ricinus communis]
          Length = 362

 Score =  207 bits (526), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 113/184 (61%), Positives = 133/184 (72%), Gaps = 6/184 (3%)

Query: 19  LLCFFVFNSCS---KSQSTPPAKYDGFFYANHPV---DSDAIIIEAFFDPVCPDSRDAWP 72
           LL  FV N       +Q  PPA+YDGF Y  H      +D I+IEAFFDPVCPDSRD+WP
Sbjct: 8   LLFAFVINVSKWGLDAQKLPPARYDGFLYEKHGPCRHHADTILIEAFFDPVCPDSRDSWP 67

Query: 73  PLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKF 132
           PLKQAL +YG   SL++HLLPLPYHDNAY  SRALHI N  N S+TF LLE FFK QE+F
Sbjct: 68  PLKQALHYYGSRTSLLLHLLPLPYHDNAYVASRALHIANNLNHSSTFPLLERFFKHQERF 127

Query: 133 YNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFN 192
           YNA T+N++R +VVK+IV+FA+  IGNSY SA+ESGF+DR TDL TRVSFK +     F 
Sbjct: 128 YNAQTRNLSRVSVVKDIVEFASVAIGNSYHSAIESGFNDRETDLQTRVSFKYSTSRGVFG 187

Query: 193 TSFF 196
           T  F
Sbjct: 188 TPTF 191


>gi|388493536|gb|AFK34834.1| unknown [Lotus japonicus]
          Length = 239

 Score =  201 bits (510), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 99/171 (57%), Positives = 124/171 (72%), Gaps = 1/171 (0%)

Query: 27  SCSKSQSTPPAKYDGFFYANH-PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHV 85
           S + +   PPAK DGF Y N    + D+I+IEAF+DP+CPDSRD+WPPLKQAL  YG  V
Sbjct: 26  SPAGADYIPPAKTDGFVYLNRRSFNFDSILIEAFYDPLCPDSRDSWPPLKQALHDYGSGV 85

Query: 86  SLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAV 145
           SLVVHLLPLPYHDNAY  SRALH+VN  NSSATF LLE  FK QEKFY A T+N++R ++
Sbjct: 86  SLVVHLLPLPYHDNAYVASRALHVVNALNSSATFPLLELLFKDQEKFYGAQTRNLSRASI 145

Query: 146 VKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
            +E VK A E IG+S+ +++++GF+D  TD  TR+SFK       + T FF
Sbjct: 146 QEEFVKSATEVIGSSFYTSVKNGFNDSKTDHQTRISFKYAASRGVYGTPFF 196


>gi|388504516|gb|AFK40324.1| unknown [Medicago truncatula]
          Length = 230

 Score =  200 bits (509), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 97/180 (53%), Positives = 122/180 (67%), Gaps = 1/180 (0%)

Query: 18  ALLCFFVFNSCSKSQSTPPAKYDGFFYAN-HPVDSDAIIIEAFFDPVCPDSRDAWPPLKQ 76
           ++L      S       PP KYDGF Y N H +  D I IEAF+DP+CPDS D+WPPLK+
Sbjct: 8   SVLILLALQSLVGGDYIPPKKYDGFVYKNRHHLSYDTIQIEAFYDPLCPDSADSWPPLKK 67

Query: 77  ALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAP 136
           AL HY   VS VVHLLPLPYHDNA+  SRALHIVN  N +ATF LLE FFK Q  F+ AP
Sbjct: 68  ALHHYSSRVSFVVHLLPLPYHDNAFVASRALHIVNSLNRTATFPLLEAFFKYQAAFFGAP 127

Query: 137 TQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
           T+N++R  +V EIVKF +  +GNSY +++++GF+D ++D LTR+SFK         T FF
Sbjct: 128 TRNLSRAFIVDEIVKFTSAIVGNSYHTSIKNGFNDTNSDHLTRISFKYAASRGVSGTPFF 187


>gi|297850426|ref|XP_002893094.1| hypothetical protein ARALYDRAFT_312946 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338936|gb|EFH69353.1| hypothetical protein ARALYDRAFT_312946 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  194 bits (494), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 95/173 (54%), Positives = 122/173 (70%), Gaps = 2/173 (1%)

Query: 13  LILQSAL-LCFFVFNSCSKSQSTPPAKYDGFFYA-NHPVDSDAIIIEAFFDPVCPDSRDA 70
           ++++SAL L  F   +  ++Q  PPA+ DGF Y     +D D I+IEA+FDPVCPD RDA
Sbjct: 1   MMIRSALVLVVFFVGTVVQAQLIPPARRDGFVYPPGRKIDPDTILIEAYFDPVCPDCRDA 60

Query: 71  WPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQE 130
           W PLK A+ HYG  V+LV+HL+PLP+HDNA+  SRALHIV+  N++ATF LLE  FK Q 
Sbjct: 61  WEPLKLAIDHYGSRVALVLHLIPLPFHDNAFLVSRALHIVDTLNANATFNLLEGIFKHQA 120

Query: 131 KFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
            FYN+ TQ M+R AVV  ++K     +GNSY S L SGFS+  +DL TRVSFK
Sbjct: 121 LFYNSQTQLMSRPAVVDNLIKLGTVTLGNSYHSPLISGFSNSKSDLATRVSFK 173


>gi|8778813|gb|AAF79818.1|AC007396_19 T4O12.23 [Arabidopsis thaliana]
          Length = 263

 Score =  194 bits (493), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 102/172 (59%), Positives = 125/172 (72%), Gaps = 1/172 (0%)

Query: 13  LILQSALLCFFVFNSCSKSQSTPPAKYDGFFYA-NHPVDSDAIIIEAFFDPVCPDSRDAW 71
           +I    L   FV  +  ++Q  PP + DGF Y   H  D D I+IEA+FDPVCPDSRD+W
Sbjct: 1   MIRAVLLFLVFVVETRVQAQLVPPVRQDGFVYPPGHRFDPDTILIEAYFDPVCPDSRDSW 60

Query: 72  PPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEK 131
           PPLKQAL HYG  V+L++HLLPLPYHDNAY TSRALHIVN  +++ATF LLE FFK Q  
Sbjct: 61  PPLKQALHHYGSRVALLLHLLPLPYHDNAYVTSRALHIVNTVHANATFSLLEGFFKHQSL 120

Query: 132 FYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
           FYNA TQ ++R AVV++IV+     +GNSY S L+SGFSD+ +D  TRVSFK
Sbjct: 121 FYNAQTQLLSRPAVVEKIVELGTVSLGNSYQSVLKSGFSDKKSDRATRVSFK 172


>gi|15222928|ref|NP_177728.1| Thioredoxin superfamily protein [Arabidopsis thaliana]
 gi|117168227|gb|ABK32196.1| At1g76020 [Arabidopsis thaliana]
 gi|332197664|gb|AEE35785.1| Thioredoxin superfamily protein [Arabidopsis thaliana]
          Length = 225

 Score =  193 bits (490), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 104/185 (56%), Positives = 129/185 (69%), Gaps = 1/185 (0%)

Query: 13  LILQSALLCFFVFNSCSKSQSTPPAKYDGFFYA-NHPVDSDAIIIEAFFDPVCPDSRDAW 71
           +I    L   FV  +  ++Q  PP + DGF Y   H  D D I+IEA+FDPVCPDSRD+W
Sbjct: 1   MIRAVLLFLVFVVETRVQAQLVPPVRQDGFVYPPGHRFDPDTILIEAYFDPVCPDSRDSW 60

Query: 72  PPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEK 131
           PPLKQAL HYG  V+L++HLLPLPYHDNAY TSRALHIVN  +++ATF LLE FFK Q  
Sbjct: 61  PPLKQALHHYGSRVALLLHLLPLPYHDNAYVTSRALHIVNTVHANATFSLLEGFFKHQSL 120

Query: 132 FYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSF 191
           FYNA TQ ++R AVV++IV+     +GNSY S L+SGFSD+ +D  TRVSFK +     +
Sbjct: 121 FYNAQTQLLSRPAVVEKIVELGTVSLGNSYQSVLKSGFSDKKSDRATRVSFKYSASRGVY 180

Query: 192 NTSFF 196
            T  F
Sbjct: 181 GTPTF 185


>gi|22329686|ref|NP_683315.1| TRX domain-containing protein [Arabidopsis thaliana]
 gi|17065544|gb|AAL32926.1| Unknown protein [Arabidopsis thaliana]
 gi|24899723|gb|AAN65076.1| Unknown protein [Arabidopsis thaliana]
 gi|332191831|gb|AEE29952.1| TRX domain-containing protein [Arabidopsis thaliana]
          Length = 233

 Score =  192 bits (487), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 90/155 (58%), Positives = 114/155 (73%), Gaps = 1/155 (0%)

Query: 30  KSQSTPPAKYDGFFYA-NHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLV 88
           ++Q  PPA+ DGF Y     +D D I+IEA+ DPVCPD RDAW PLK A+ HYG  V+LV
Sbjct: 19  QAQLIPPARRDGFLYPPGRKIDRDTILIEAYIDPVCPDCRDAWEPLKLAIDHYGSRVALV 78

Query: 89  VHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKE 148
           +HL+PLP+HDNA+  SRALHIV+  N++ATF LLE  FK Q  FYN+ TQ M+R AVV+E
Sbjct: 79  LHLIPLPFHDNAFVASRALHIVDTLNANATFNLLEGIFKHQTLFYNSQTQLMSRPAVVEE 138

Query: 149 IVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
           ++K     +GNSY S L+SGFS+  +DL TRVSFK
Sbjct: 139 LIKLGTVTLGNSYHSPLKSGFSNSKSDLATRVSFK 173


>gi|8778978|gb|AAF79893.1|AC022472_2 Contains similarity to pigpen protein from Mus musculus gb|AF224264
           and contains protein of unknown function DUF78 PF|01918
           domain. ESTs gb|N38077, gb|BE037702, gb|AV442191,
           gb|AV441368, gb|Z17998, gb|AV527266, gb|AV520794,
           gb|AI997847, gb|AV543000 come from this gene
           [Arabidopsis thaliana]
          Length = 538

 Score =  191 bits (486), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 90/155 (58%), Positives = 114/155 (73%), Gaps = 1/155 (0%)

Query: 30  KSQSTPPAKYDGFFYA-NHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLV 88
           ++Q  PPA+ DGF Y     +D D I+IEA+ DPVCPD RDAW PLK A+ HYG  V+LV
Sbjct: 18  QAQLIPPARRDGFLYPPGRKIDRDTILIEAYIDPVCPDCRDAWEPLKLAIDHYGSRVALV 77

Query: 89  VHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKE 148
           +HL+PLP+HDNA+  SRALHIV+  N++ATF LLE  FK Q  FYN+ TQ M+R AVV+E
Sbjct: 78  LHLIPLPFHDNAFVASRALHIVDTLNANATFNLLEGIFKHQTLFYNSQTQLMSRPAVVEE 137

Query: 149 IVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
           ++K     +GNSY S L+SGFS+  +DL TRVSFK
Sbjct: 138 LIKLGTVTLGNSYHSPLKSGFSNSKSDLATRVSFK 172


>gi|351726088|ref|NP_001235579.1| uncharacterized protein LOC100500267 precursor [Glycine max]
 gi|255629877|gb|ACU15289.1| unknown [Glycine max]
          Length = 236

 Score =  191 bits (484), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 100/187 (53%), Positives = 135/187 (72%), Gaps = 3/187 (1%)

Query: 12  TLILQ-SALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDS-DAIIIEAFFDPVCPDSRD 69
           T ILQ +AL   F+      +   PP+++DGF Y    ++  D ++IEAF+DPVCPDSRD
Sbjct: 6   TTILQFAALQVLFILLGVG-ADYIPPSRFDGFVYGKGNLNLLDTVLIEAFYDPVCPDSRD 64

Query: 70  AWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQ 129
           +WPPLKQAL HY   VSL++HLLPLPYHDNA+ TSRALHIVN  N+SATF LLEWFF+ Q
Sbjct: 65  SWPPLKQALHHYASRVSLLLHLLPLPYHDNAFVTSRALHIVNNLNASATFPLLEWFFRYQ 124

Query: 130 EKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFF 189
           E FY A T+N++R ++++E+VK A + +G+SY   +++GF+D +TD+ TRVSFK      
Sbjct: 125 ENFYGAQTRNLSRASIIEEVVKSATQVVGSSYYKTIKNGFNDTTTDIQTRVSFKYAASRG 184

Query: 190 SFNTSFF 196
            + T FF
Sbjct: 185 VYGTPFF 191


>gi|81076617|gb|ABB55396.1| unknown [Solanum tuberosum]
          Length = 224

 Score =  187 bits (474), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 91/146 (62%), Positives = 112/146 (76%), Gaps = 2/146 (1%)

Query: 40  DGF-FYANHPVD-SDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYH 97
           DGF F     +D + +I+IEAF DPVC DSRD+WPPLK+AL HYG  VSLVVH  PLPYH
Sbjct: 24  DGFLFGGGGKIDITKSIVIEAFLDPVCSDSRDSWPPLKKALHHYGSRVSLVVHTFPLPYH 83

Query: 98  DNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGI 157
           DNA+ TSRALHIVN+ NSSATF LLE FF QQ+KFYN  T N+++ +VV E+VKF +  I
Sbjct: 84  DNAFTTSRALHIVNKLNSSATFRLLEAFFDQQDKFYNQATFNLSKASVVDEVVKFTSNKI 143

Query: 158 GNSYSSALESGFSDRSTDLLTRVSFK 183
           GNS  +A+++GF+D  TD  TR+SFK
Sbjct: 144 GNSNYAAVKAGFTDPKTDQATRISFK 169


>gi|351721154|ref|NP_001236688.1| uncharacterized protein LOC100527484 precursor [Glycine max]
 gi|255632456|gb|ACU16578.1| unknown [Glycine max]
          Length = 233

 Score =  186 bits (473), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 99/170 (58%), Positives = 124/170 (72%)

Query: 14  ILQSALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPP 73
           I+Q A L  FV      +   PP++ DGF Y N  +D D ++IEAF+DPVCP SRD+WPP
Sbjct: 8   IVQLAALQVFVIFLSGGADYIPPSRLDGFVYQNGYLDLDTVLIEAFYDPVCPYSRDSWPP 67

Query: 74  LKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFY 133
           LKQAL H+   VS ++HLLPLPYHDNA+  +RALHIVN  N+SATF LLEWFFK QEKFY
Sbjct: 68  LKQALHHHSSRVSFLLHLLPLPYHDNAFVATRALHIVNTLNASATFPLLEWFFKHQEKFY 127

Query: 134 NAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
            A T+N++R ++V EIVK A E  G+SY +A++ GF+D  TD  TRVSFK
Sbjct: 128 GAQTRNLSRASIVDEIVKSATEAAGSSYYNAIKHGFNDTKTDYQTRVSFK 177


>gi|297842343|ref|XP_002889053.1| hypothetical protein ARALYDRAFT_895477 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334894|gb|EFH65312.1| hypothetical protein ARALYDRAFT_895477 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 229

 Score =  186 bits (473), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 101/165 (61%), Positives = 121/165 (73%), Gaps = 1/165 (0%)

Query: 20  LCFFVFNSCSKSQSTPPAKYDGFFYA-NHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQAL 78
           L  FV  +  ++Q  PP K DGF Y   H  D D I+IEA+FDPVCPDSRD+WPPLKQAL
Sbjct: 9   LVVFVTETRVQAQLVPPVKQDGFVYPPGHRFDPDTILIEAYFDPVCPDSRDSWPPLKQAL 68

Query: 79  QHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQ 138
           +HYG  V+ ++HLLPLPYHDNAY TSRALHIVN  N++ATF LLE FFK Q  FYNA T 
Sbjct: 69  RHYGSRVAFLLHLLPLPYHDNAYVTSRALHIVNTVNANATFSLLEGFFKHQPLFYNAQTN 128

Query: 139 NMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
            +TR AVV++IV+     +G SY S L+SGFSD+ +D  TRVSFK
Sbjct: 129 LLTRAAVVEKIVELGTVTLGKSYQSVLKSGFSDKKSDRATRVSFK 173


>gi|32526672|dbj|BAC79195.1| unknown protein [Oryza sativa Japonica Group]
          Length = 220

 Score =  173 bits (438), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 91/184 (49%), Positives = 119/184 (64%), Gaps = 10/184 (5%)

Query: 8   KNHATLILQSALLCFFVFNSCS------KSQSTPPAKYDGFFYANHPVDS--DAIIIEAF 59
           +  A L+ +  LLC  V  SCS       +Q++ P +YDGF Y +    +  D I++EAF
Sbjct: 3   RPRAPLLRRMLLLCA-VAASCSYYLLVLHAQASVPPRYDGFAYGDAATAAWKDTILVEAF 61

Query: 60  FDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATF 119
            DP+CPDSRDAW PL+ A+  Y P VSL+VH  PLPYH N++   RAL+I N+ NSS+T+
Sbjct: 62  LDPLCPDSRDAWAPLRLAVDRYAPRVSLIVHPFPLPYHTNSFLACRALYIANKLNSSSTY 121

Query: 120 CLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTR 179
            LLE FFK Q KFYNA T +++ T +  E+ K AA  +GNS S   +SGFSD  TDL  R
Sbjct: 122 PLLELFFKSQGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-FQSGFSDIRTDLAAR 180

Query: 180 VSFK 183
           VSFK
Sbjct: 181 VSFK 184


>gi|115480453|ref|NP_001063820.1| Os09g0542200 [Oryza sativa Japonica Group]
 gi|52076083|dbj|BAD46596.1| unknown protein [Oryza sativa Japonica Group]
 gi|113632053|dbj|BAF25734.1| Os09g0542200 [Oryza sativa Japonica Group]
 gi|215694060|dbj|BAG89259.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215740808|dbj|BAG96964.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 237

 Score =  172 bits (437), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 91/184 (49%), Positives = 119/184 (64%), Gaps = 10/184 (5%)

Query: 8   KNHATLILQSALLCFFVFNSCS------KSQSTPPAKYDGFFYANHPVDS--DAIIIEAF 59
           +  A L+ +  LLC  V  SCS       +Q++ P +YDGF Y +    +  D I++EAF
Sbjct: 3   RPRAPLLRRMLLLCA-VAASCSYYLLVLHAQASVPPRYDGFAYGDAATAAWKDTILVEAF 61

Query: 60  FDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATF 119
            DP+CPDSRDAW PL+ A+  Y P VSL+VH  PLPYH N++   RAL+I N+ NSS+T+
Sbjct: 62  LDPLCPDSRDAWAPLRLAVDRYAPRVSLIVHPFPLPYHTNSFLACRALYIANKLNSSSTY 121

Query: 120 CLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTR 179
            LLE FFK Q KFYNA T +++ T +  E+ K AA  +GNS S   +SGFSD  TDL  R
Sbjct: 122 PLLELFFKSQGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-FQSGFSDIRTDLAAR 180

Query: 180 VSFK 183
           VSFK
Sbjct: 181 VSFK 184


>gi|218202548|gb|EEC84975.1| hypothetical protein OsI_32232 [Oryza sativa Indica Group]
          Length = 237

 Score =  172 bits (437), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 91/184 (49%), Positives = 119/184 (64%), Gaps = 10/184 (5%)

Query: 8   KNHATLILQSALLCFFVFNSCS------KSQSTPPAKYDGFFYANHPVDS--DAIIIEAF 59
           +  A L+ +  LLC  V  SCS       +Q++ P +YDGF Y +    +  D I++EAF
Sbjct: 3   RPRAPLLRRMLLLCA-VAASCSYYLLVLHAQASVPPRYDGFAYGDAATAAWKDTILVEAF 61

Query: 60  FDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATF 119
            DP+CPDSRDAW PL+ A+  Y P VSL+VH  PLPYH N++   RAL+I N+ NSS+T+
Sbjct: 62  LDPLCPDSRDAWAPLRLAVDRYAPRVSLIVHPFPLPYHTNSFLACRALYIANKLNSSSTY 121

Query: 120 CLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTR 179
            LLE FFK Q KFYNA T +++ T +  E+ K AA  +GNS S   +SGFSD  TDL  R
Sbjct: 122 PLLELFFKSQGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-FQSGFSDIRTDLAAR 180

Query: 180 VSFK 183
           VSFK
Sbjct: 181 VSFK 184


>gi|326520974|dbj|BAJ92850.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 292

 Score =  172 bits (436), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 88/177 (49%), Positives = 116/177 (65%), Gaps = 8/177 (4%)

Query: 14  ILQSALLCFFVFNSCS------KSQSTPPAKYDGFFYA-NHPVDSDAIIIEAFFDPVCPD 66
           +L+  L+   +  SC+      ++Q++ P +YDGF Y        DA++IEAF DP+CPD
Sbjct: 64  LLRRLLMAGALAASCTYFLLVIQAQASAPRRYDGFAYGVGAAAWKDAVLIEAFLDPLCPD 123

Query: 67  SRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFF 126
           SRDAW PL+ A++ Y P VSL+VH  PLPYH  +Y   RALHI N+ NSS+T+ +LE FF
Sbjct: 124 SRDAWHPLRLAVERYSPLVSLIVHPFPLPYHTYSYHACRALHIANKLNSSSTYPVLELFF 183

Query: 127 KQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
           K Q KF N  T +M+ TAV  EI K AA+ +GNS S   +SGFSD  TD+  RVSFK
Sbjct: 184 KNQGKFSNRATSSMSSTAVTGEISKMAAQAVGNSVSD-FQSGFSDTRTDMAARVSFK 239


>gi|226528876|ref|NP_001145190.1| uncharacterized protein LOC100278440 [Zea mays]
 gi|195652481|gb|ACG45708.1| hypothetical protein [Zea mays]
          Length = 238

 Score =  167 bits (422), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 85/169 (50%), Positives = 111/169 (65%), Gaps = 3/169 (1%)

Query: 30  KSQSTPPAKYDGFFYANHPVDS--DAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSL 87
           ++Q++ P +YDGF Y      +  DA+++EAF DP+CPDSRDAW PLK A++ Y P VSL
Sbjct: 31  QAQASVPPRYDGFAYGGGAATAWKDAVLVEAFLDPLCPDSRDAWQPLKLAVERYAPRVSL 90

Query: 88  VVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVK 147
           +VH  PLPYH  A+   RAL+I N+ NSS+T+ LLE FFK QEKFYN+ T +++  +V  
Sbjct: 91  IVHPFPLPYHTYAFYACRALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSGPSVAL 150

Query: 148 EIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
            + K AA+ +GNS S  L SGFSD  TD   RVSFK       F   FF
Sbjct: 151 GMSKMAAQTVGNSVSEFL-SGFSDGKTDSAARVSFKYGCTRGVFGAPFF 198


>gi|194697338|gb|ACF82753.1| unknown [Zea mays]
          Length = 238

 Score =  167 bits (422), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 85/169 (50%), Positives = 111/169 (65%), Gaps = 3/169 (1%)

Query: 30  KSQSTPPAKYDGFFYANHPVDS--DAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSL 87
           ++Q++ P +YDGF Y      +  DA+++EAF DP+CPDSRDAW PLK A++ Y P VSL
Sbjct: 31  QAQASVPPRYDGFAYGGGAATAWKDAVLVEAFLDPLCPDSRDAWQPLKLAVERYAPRVSL 90

Query: 88  VVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVK 147
           +VH  PLPYH  A+   RAL+I N+ NSS+T+ LLE FFK QEKFYN+ T +++  +V  
Sbjct: 91  IVHPFPLPYHTYAFYACRALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSGPSVAL 150

Query: 148 EIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
            + K AA+ +GNS S  L SGFSD  TD   RVSFK       F   FF
Sbjct: 151 GMSKMAAQTVGNSVSEFL-SGFSDGKTDSAARVSFKYGCTRGVFGAPFF 198


>gi|357168058|ref|XP_003581462.1| PREDICTED: uncharacterized protein LOC100834992 [Brachypodium
           distachyon]
          Length = 219

 Score =  165 bits (418), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 80/155 (51%), Positives = 106/155 (68%), Gaps = 2/155 (1%)

Query: 31  SQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VSLVV 89
           +Q   PA+ DGF Y       +A+++EA+FDPVCPDSRDAWPPL++A+QHYG   VS+VV
Sbjct: 22  AQMPIPARTDGFAYGKATAWGEAVVVEAYFDPVCPDSRDAWPPLQKAVQHYGAQRVSVVV 81

Query: 90  HLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEI 149
           HL PLPYH NA+   R++H VN+ N+SA + LLE FFK QE +YN PT   +R  VV EI
Sbjct: 82  HLFPLPYHSNAFIACRSIHTVNKLNASAVYPLLEKFFKYQEAYYNQPTYTKSRETVVAEI 141

Query: 150 V-KFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
                A  IG +  +A ++GF+D  +D  TR+SFK
Sbjct: 142 TNNLVAPVIGEANLAAYKAGFNDSMSDQSTRISFK 176


>gi|326491935|dbj|BAJ98192.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326493720|dbj|BAJ85321.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326504198|dbj|BAJ90931.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326511459|dbj|BAJ87743.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326520669|dbj|BAJ92698.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 226

 Score =  164 bits (414), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 81/158 (51%), Positives = 107/158 (67%), Gaps = 5/158 (3%)

Query: 31  SQSTPPAKYDGFFYANH---PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VS 86
           +Q   PA+ DGF Y  H   P   DA+++EAFFDPVCPDSRDAWPPL++A  H+G   V+
Sbjct: 26  AQVPIPARTDGFVYGGHAAAPAWGDAVVVEAFFDPVCPDSRDAWPPLQRAADHFGARRVA 85

Query: 87  LVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVV 146
           +VVHL PLPYH +A+   R++H V++ N+SA + LLE FFK QE +YN PT   TR AVV
Sbjct: 86  VVVHLFPLPYHSSAFIACRSIHTVHKLNASAVYPLLEKFFKYQEGYYNTPTYTKTRAAVV 145

Query: 147 KEIV-KFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
            EI     A  IG +  +A ++GF+D  +D  TR+SFK
Sbjct: 146 AEIANNLVAPVIGEANLAAYKAGFNDSQSDQATRISFK 183


>gi|115457372|ref|NP_001052286.1| Os04g0227500 [Oryza sativa Japonica Group]
 gi|38346025|emb|CAE01956.2| OSJNBb0071D01.2 [Oryza sativa Japonica Group]
 gi|38346893|emb|CAE03918.2| OSJNBb0015G09.12 [Oryza sativa Japonica Group]
 gi|113563857|dbj|BAF14200.1| Os04g0227500 [Oryza sativa Japonica Group]
 gi|125546580|gb|EAY92719.1| hypothetical protein OsI_14471 [Oryza sativa Indica Group]
 gi|125588738|gb|EAZ29402.1| hypothetical protein OsJ_13476 [Oryza sativa Japonica Group]
 gi|215679384|dbj|BAG96524.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765592|dbj|BAG87289.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 226

 Score =  161 bits (408), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 80/174 (45%), Positives = 109/174 (62%), Gaps = 8/174 (4%)

Query: 18  ALLCFFVFNSCS------KSQSTPPAKYDGFFYANH-PVDSDAIIIEAFFDPVCPDSRDA 70
           +LL   V   C+       +Q   PA+ DGF Y    P   + +++EA+FDPVCPDSRDA
Sbjct: 9   SLLLVLVIGCCAWGCRPGAAQVPVPARTDGFVYGGKAPALGETVVVEAYFDPVCPDSRDA 68

Query: 71  WPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQE 130
           WP LK+A++HY   V++VVHL PLPYH NA+ + R++H VN+ N S  + LLE FFK QE
Sbjct: 69  WPELKKAVEHYASRVTVVVHLFPLPYHSNAFISCRSIHAVNKINPSFVYPLLERFFKYQE 128

Query: 131 KFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
            +YN PT   TR  V  E+ K   A  IG +  +A ++GF+D  +D  TR+SFK
Sbjct: 129 GYYNQPTYGKTRATVDAEVAKNLVAPVIGEANLAAYKAGFNDSKSDQATRISFK 182


>gi|357159750|ref|XP_003578547.1| PREDICTED: uncharacterized protein LOC100839255 [Brachypodium
           distachyon]
          Length = 238

 Score =  160 bits (404), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 85/177 (48%), Positives = 113/177 (63%), Gaps = 13/177 (7%)

Query: 13  LILQSALLCFFVFNSCS------KSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPD 66
           ++L SAL       SCS      ++Q++ P +YDGF Y       D++++EAF DP+CPD
Sbjct: 16  ILLASALAA-----SCSYYLLVLQAQASVPPRYDGFAY-RAAAWKDSVLVEAFLDPLCPD 69

Query: 67  SRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFF 126
           SRDAW PL+ A++ Y P VSL+VH  PLPYH  ++   RALHI N+ NSS+T+ LLE FF
Sbjct: 70  SRDAWHPLRLAVERYSPRVSLIVHPFPLPYHTYSFHACRALHIANKLNSSSTYPLLELFF 129

Query: 127 KQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
           K Q KF N    +++ TA+  EI K A + +GNS S   +SGFSD  TD   RVSFK
Sbjct: 130 KNQGKFSNHALSSLSSTAITGEISKMAVQAVGNSVSE-FQSGFSDARTDSAARVSFK 185


>gi|116781410|gb|ABK22089.1| unknown [Picea sitchensis]
          Length = 227

 Score =  158 bits (400), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 73/165 (44%), Positives = 108/165 (65%), Gaps = 2/165 (1%)

Query: 20  LCFFVFNSCSKSQSTP-PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQAL 78
           L    F +   +   P P +YDGF Y +  V ++++++EAFFDP+CPDSRD+W PLK+ L
Sbjct: 14  LLMLAFTAARITAQLPIPPRYDGFVYKDR-VTANSVLVEAFFDPLCPDSRDSWWPLKEVL 72

Query: 79  QHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQ 138
           + YG +++ +VH   LPYH NA+   R+LHI NR  ++ T+ LL+ FFK Q +FYN  T 
Sbjct: 73  RFYGDNITFIVHPFALPYHHNAFIACRSLHIANRIKTAYTYPLLDHFFKHQARFYNKATL 132

Query: 139 NMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
            +   +++ +I+ FA E  GNS S+  ES F D +TD+ TR+SFK
Sbjct: 133 QVAPASIINQIIHFALEISGNSSSTMFESAFQDTTTDMATRISFK 177


>gi|242072508|ref|XP_002446190.1| hypothetical protein SORBIDRAFT_06g003190 [Sorghum bicolor]
 gi|241937373|gb|EES10518.1| hypothetical protein SORBIDRAFT_06g003190 [Sorghum bicolor]
          Length = 220

 Score =  156 bits (394), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 78/173 (45%), Positives = 107/173 (61%), Gaps = 7/173 (4%)

Query: 18  ALLCFFVFNSCS----KSQSTPPAKYDGFFYANHPVDS--DAIIIEAFFDPVCPDSRDAW 71
           +LL   V   C+    ++Q   PA+ DGF Y   P  +  + +++EAF DPVCPDSRDAW
Sbjct: 5   SLLVVIVVGCCACWGCRAQIPIPARMDGFVYGGKPPPAWGETVVVEAFLDPVCPDSRDAW 64

Query: 72  PPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEK 131
           P LK+ + HY   VS+V HL PLPYH  A+   +++H VN+ N S  + LLE FFK QE 
Sbjct: 65  PALKKVVDHYSNRVSVVAHLFPLPYHSYAFIACQSIHAVNKLNPSFVYPLLEKFFKDQEG 124

Query: 132 FYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
           +YN PT   +R  VV EI K   A  IG +  +A ++GF+D  +D+ TR+SFK
Sbjct: 125 YYNRPTYGKSRATVVDEITKNLVAPIIGETNLAAYKAGFNDSQSDMATRISFK 177


>gi|226491612|ref|NP_001151670.1| LOC100285305 precursor [Zea mays]
 gi|195648623|gb|ACG43779.1| DSBA-like thioredoxin domain containing protein [Zea mays]
          Length = 223

 Score =  155 bits (391), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 76/167 (45%), Positives = 102/167 (61%), Gaps = 2/167 (1%)

Query: 19  LLCFFVFNSCSKSQSTPPAKYDGFFYANH-PVDSDAIIIEAFFDPVCPDSRDAWPPLKQA 77
           LL   +     ++Q   PA+ DGF Y    P   + +++EAF DPVCPDSRDAWP LK  
Sbjct: 14  LLVVIIGCCACRAQIPIPARTDGFVYGGKPPAWGETVVVEAFLDPVCPDSRDAWPALKMV 73

Query: 78  LQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPT 137
           ++HY   VS+VVHL PLPYH  A+   R++H VN+ N S  + LLE FFK Q+++YN PT
Sbjct: 74  VEHYSSRVSVVVHLFPLPYHSYAFIACRSIHAVNKLNPSFVYPLLEKFFKYQKRYYNQPT 133

Query: 138 QNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
              +R  VV EI K      IG +  +A  +GF+D  +D  TR+SFK
Sbjct: 134 YEKSRATVVDEITKNLVVPIIGETNLAAFRAGFNDSHSDQATRISFK 180


>gi|195655073|gb|ACG47004.1| hypothetical protein [Zea mays]
          Length = 235

 Score =  154 bits (390), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 83/169 (49%), Positives = 108/169 (63%), Gaps = 6/169 (3%)

Query: 30  KSQSTPPAKYDGFFYANHPVDS--DAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSL 87
           ++Q++ P +YDGF Y      +  DA+++EAF DP   DSRDAW PLK A++ Y P VSL
Sbjct: 31  QAQASVPPRYDGFAYGGGAATAWKDAVLVEAFLDP---DSRDAWQPLKLAVERYAPRVSL 87

Query: 88  VVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVK 147
           +VH  PLPYH  A+   RAL+I N+ NSS+T+ LLE FFK QEKFYN+ T +++  +V  
Sbjct: 88  IVHPFPLPYHTYAFYACRALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSGPSVAL 147

Query: 148 EIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
            + K AA+ +GNS S  L SGFSD  TD   RVSFK       F   FF
Sbjct: 148 GMSKMAAQTVGNSVSEFL-SGFSDGKTDSAARVSFKYGCTRGVFGAPFF 195


>gi|242050028|ref|XP_002462758.1| hypothetical protein SORBIDRAFT_02g031470 [Sorghum bicolor]
 gi|241926135|gb|EER99279.1| hypothetical protein SORBIDRAFT_02g031470 [Sorghum bicolor]
          Length = 238

 Score =  149 bits (377), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 84/169 (49%), Positives = 111/169 (65%), Gaps = 3/169 (1%)

Query: 30  KSQSTPPAKYDGFFYANHPVD--SDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSL 87
           ++Q++ P +YDGF Y         DA+++EAF DP+CPDSRD+W PLK A++ Y P VSL
Sbjct: 31  QAQASVPPRYDGFAYGGGAAAAWKDAVLVEAFLDPLCPDSRDSWHPLKLAIERYAPRVSL 90

Query: 88  VVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVK 147
           +VH  PLPYH  A+   RAL+I N+ NSS+T+ LLE FFK QEKFYN+ T +++  +V  
Sbjct: 91  IVHPFPLPYHTYAFHACRALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSSPSVAV 150

Query: 148 EIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
           E+ K AA+ +GNS S  L SGFSD  TD   RVSFK       +   FF
Sbjct: 151 EMSKMAAQTVGNSVSEFL-SGFSDTKTDSAARVSFKYGCTRGVYGAPFF 198


>gi|302791067|ref|XP_002977300.1| hypothetical protein SELMODRAFT_271243 [Selaginella moellendorffii]
 gi|300154670|gb|EFJ21304.1| hypothetical protein SELMODRAFT_271243 [Selaginella moellendorffii]
          Length = 226

 Score =  139 bits (349), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 67/150 (44%), Positives = 95/150 (63%), Gaps = 2/150 (1%)

Query: 36  PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLP 95
           P +YDGF + ++    + I++EAFFDP+CPDS+D+WP +KQ  + YG  V  +VH   LP
Sbjct: 27  PRRYDGFAFGSYGGVQEPILVEAFFDPLCPDSKDSWPAIKQVAEEYGSEVKFIVHPFALP 86

Query: 96  YHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVK-FAA 154
           YH  ++ + RALHI N  N+S T+ LL+  F+ QE+F NA T + T + V+ E    FA+
Sbjct: 87  YHQQSFLSVRALHIANHLNASLTYPLLDLIFEHQEEFSNANTADKTASTVIDEFSSLFAS 146

Query: 155 E-GIGNSYSSALESGFSDRSTDLLTRVSFK 183
           + G GN   S  + GF D STD   R+SFK
Sbjct: 147 QFGSGNEAKSIFKQGFYDSSTDQAGRISFK 176


>gi|302780317|ref|XP_002971933.1| hypothetical protein SELMODRAFT_270941 [Selaginella moellendorffii]
 gi|300160232|gb|EFJ26850.1| hypothetical protein SELMODRAFT_270941 [Selaginella moellendorffii]
          Length = 226

 Score =  138 bits (347), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 67/150 (44%), Positives = 95/150 (63%), Gaps = 2/150 (1%)

Query: 36  PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLP 95
           P +YDGF + ++    + I++EAFFDP+CPDS+D+WP +KQ  + YG  V  +VH   LP
Sbjct: 27  PRRYDGFAFGSYGGVQEPILVEAFFDPLCPDSKDSWPAIKQVAEEYGSEVKFIVHPFALP 86

Query: 96  YHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVK-FAA 154
           YH  ++ + RALHI N  N+S T+ LL+  F+ QE+F NA T + T + V+ E    FA+
Sbjct: 87  YHHQSFLSVRALHIANHLNASLTYPLLDLIFEHQEEFSNANTGDKTASTVIDEFSSLFAS 146

Query: 155 E-GIGNSYSSALESGFSDRSTDLLTRVSFK 183
           + G GN   S  + GF D STD   R+SFK
Sbjct: 147 QFGSGNEAKSIFKQGFYDSSTDQAGRISFK 176


>gi|413917904|gb|AFW57836.1| DSBA-like thioredoxin domain containing protein [Zea mays]
          Length = 146

 Score =  136 bits (342), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 62/130 (47%), Positives = 84/130 (64%), Gaps = 2/130 (1%)

Query: 20  LCFFVFNSCS-KSQSTPPAKYDGFFYANHP-VDSDAIIIEAFFDPVCPDSRDAWPPLKQA 77
           L   +   C+ ++Q   PA+ DGF Y   P    + +++EAF DPVCPDSRDAWP LK+ 
Sbjct: 14  LLLVIIGCCACRAQIPIPARTDGFVYGGKPPAWGETVVVEAFLDPVCPDSRDAWPALKKV 73

Query: 78  LQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPT 137
           ++HY   VS+VVHL PLPYH  A+   R++H VN+ N S  + LLE FFK QE++YN PT
Sbjct: 74  VEHYSSRVSIVVHLFPLPYHSYAFIACRSIHAVNKLNPSFVYRLLEKFFKDQERYYNQPT 133

Query: 138 QNMTRTAVVK 147
              +R  VV+
Sbjct: 134 YEKSRATVVR 143


>gi|413917905|gb|AFW57837.1| hypothetical protein ZEAMMB73_294031 [Zea mays]
          Length = 212

 Score =  135 bits (341), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 70/166 (42%), Positives = 97/166 (58%), Gaps = 12/166 (7%)

Query: 20  LCFFVFNSCS-KSQSTPPAKYDGFFYANH-PVDSDAIIIEAFFDPVCPDSRDAWPPLKQA 77
           L   +   C+ ++Q   PA+ DGF Y    P   + +++EAF DPVCPDSRDAWP LK+ 
Sbjct: 14  LLLVIIGCCACRAQIPIPARTDGFVYGGKPPAWGETVVVEAFLDPVCPDSRDAWPALKKV 73

Query: 78  LQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPT 137
           ++HY   VS+VVHL PLPYH  A+   R++H VN+ N S  + LLE FFK Q   Y+   
Sbjct: 74  VEHYSSRVSIVVHLFPLPYHSYAFIACRSIHAVNKLNPSFVYRLLEKFFKDQS--YSGTV 131

Query: 138 QNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
             +T+  VV          IG +  +A  +GF+D  +D  TR+SFK
Sbjct: 132 DEITKNLVVPI--------IGETNLAAFRAGFNDSHSDQATRISFK 169


>gi|222642012|gb|EEE70144.1| hypothetical protein OsJ_30190 [Oryza sativa Japonica Group]
          Length = 216

 Score =  125 bits (315), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 75/184 (40%), Positives = 99/184 (53%), Gaps = 31/184 (16%)

Query: 8   KNHATLILQSALLCFFVFNSCS------KSQSTPPAKYDGFFYANHPVDS--DAIIIEAF 59
           +  A L+ +  LLC  V  SCS       +Q++ P +YDGF Y +    +  D I++EAF
Sbjct: 3   RPRAPLLRRMLLLCA-VAASCSYYLLVLHAQASVPPRYDGFAYGDAATAAWKDTILVEAF 61

Query: 60  FDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATF 119
            DP+CPDSRDAW PL+ A+  Y P VSL+VH  PLP     Y                  
Sbjct: 62  LDPLCPDSRDAWAPLRLAVDRYAPRVSLIVHPFPLPQQMVGY------------------ 103

Query: 120 CLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTR 179
           C + W    + KFYNA T +++ T +  E+ K AA  +GNS S   +SGFSD  TDL  R
Sbjct: 104 CFMCWV---EGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-FQSGFSDIRTDLAAR 159

Query: 180 VSFK 183
           VSFK
Sbjct: 160 VSFK 163


>gi|168037769|ref|XP_001771375.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677293|gb|EDQ63765.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 191

 Score =  111 bits (278), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 60/154 (38%), Positives = 87/154 (56%), Gaps = 3/154 (1%)

Query: 31  SQSTPPAKYDGFFY-ANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVV 89
           +Q   P +YDGF Y  +       ++ E F DP+C D ++AWP +KQ ++ YG  + L+V
Sbjct: 1   AQFAIPERYDGFVYNVSTEQGRGPVMWEVFVDPLCIDCKNAWPVVKQVVEKYGSALLLIV 60

Query: 90  HLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEI 149
           H  P P+H NA+  SR LH+   TNSS  + LLE  F  QE F N  T + T   VV  +
Sbjct: 61  HPFPAPFHHNAFFASRGLHVAQMTNSSLVYPLLELIFSNQESFLNPSTNHETPAHVVSRL 120

Query: 150 VKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
           +   A+ +G   +++ E  F+D  TD  TR+SFK
Sbjct: 121 ISL-ADNLGFP-TNSFEVAFNDAVTDQATRISFK 152


>gi|156379454|ref|XP_001631472.1| predicted protein [Nematostella vectensis]
 gi|156218513|gb|EDO39409.1| predicted protein [Nematostella vectensis]
          Length = 214

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 55/166 (33%), Positives = 83/166 (50%), Gaps = 5/166 (3%)

Query: 20  LCFFVFNSCSKSQSTP-PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQAL 78
           LC     S + S+  P P K  G+ Y N    +  I +EAF D  CPD + AWP +KQ  
Sbjct: 12  LCLLFTTSSTFSKGVPIPKKPLGYVYGNGK-PTAPIHLEAFVDLTCPDCQQAWPTIKQVA 70

Query: 79  QHYGPHVSLV-VHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPT 137
           + YGP   LV V   PLPYH NA+  ++++ +V   NSS  F  ++  FK Q + YN  T
Sbjct: 71  KLYGPDTILVLVQPFPLPYHTNAFIAAQSVPVVASYNSSLVFTWIDVLFKFQSELYNFQT 130

Query: 138 QNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
            N  R  ++  +   A +      S  +++G +   +D   R+ +K
Sbjct: 131 MNKNRYDILNIVSSLAPK--AGVPSDIMKTGLTGTESDGAARIGWK 174


>gi|125546585|gb|EAY92724.1| hypothetical protein OsI_14474 [Oryza sativa Indica Group]
          Length = 98

 Score = 87.4 bits (215), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 54/85 (63%), Gaps = 7/85 (8%)

Query: 18 ALLCFFVFNSCS------KSQSTPPAKYDGFFYANH-PVDSDAIIIEAFFDPVCPDSRDA 70
          +LL   V   C+       +Q   PA+ DGF Y    P   + +++EA+FDPVCPDSRDA
Sbjct: 9  SLLLVLVIGCCAWGCRPGAAQVPVPARTDGFVYGGKAPALGETVVVEAYFDPVCPDSRDA 68

Query: 71 WPPLKQALQHYGPHVSLVVHLLPLP 95
          WP LK+A++HY   V++VVHL PLP
Sbjct: 69 WPELKKAVEHYASRVTVVVHLFPLP 93


>gi|320169829|gb|EFW46728.1| hypothetical protein CAOG_04686 [Capsaspora owczarzaki ATCC 30864]
          Length = 261

 Score = 84.7 bits (208), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 54/149 (36%), Positives = 80/149 (53%), Gaps = 4/149 (2%)

Query: 36  PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYG-PHVSLVVHLLPL 94
           P++ DGF YAN    +  + I+ F D +CPDS+  WP LKQ   HYG P + LV H+ PL
Sbjct: 24  PSRPDGFAYANGS-PAAPLQIDLFVDLLCPDSQAIWPTLKQVADHYGAPTLRLVTHIFPL 82

Query: 95  PYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAA 154
           PYH NAY  ++   +V   N +A +  L+  F  Q+ F + PT N+T   V+       +
Sbjct: 83  PYHHNAYYAAQGTQVVAAANVNAVYKWLDAVFAAQDSFEDDPTVNLTSNQVINMYAAL-S 141

Query: 155 EGIGNSYSSALESGFSDRSTDLLTRVSFK 183
           + IG   +  L+ G     TD   R+++K
Sbjct: 142 QTIGVPAAVFLK-GMDSDDTDESARIAWK 169


>gi|196010617|ref|XP_002115173.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190582556|gb|EDV22629.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 217

 Score = 84.0 bits (206), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 54/172 (31%), Positives = 88/172 (51%), Gaps = 7/172 (4%)

Query: 18  ALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQA 77
           A L  F       SQ+    + DG+ Y N   ++  I +EAF D  CPD + AWP LK+ 
Sbjct: 3   ACLVLFALIPVIYSQAPILPRPDGYVYHNGSPEAP-IHMEAFIDLACPDCKTAWPILKKL 61

Query: 78  LQHYGPH-VSLVVHLLPLPYHDNAYATSRALHIVNRTNSSA--TFCLLEWFFKQQEKFYN 134
              YGP+ + +  H+ PL YH  AY  +  +  +   N S    F  ++  FK Q +FYN
Sbjct: 62  ADSYGPNKLRMTFHINPLVYHRQAYYAAMGVRALVAANVSVHDLFNYMDAVFKNQSEFYN 121

Query: 135 APTQNMTRTAVVKEIVKFAAEGIGNS---YSSALESGFSDRSTDLLTRVSFK 183
            PT  M+   V++   K A +  G S   +++AL++   D +T L ++++++
Sbjct: 122 KPTLKMSGLQVIQAYAKLAHKAAGISAMKFANALQNSGIDHTTRLSSKLAWE 173


>gi|154420795|ref|XP_001583412.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121917653|gb|EAY22426.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 198

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 50/167 (29%), Positives = 84/167 (50%), Gaps = 11/167 (6%)

Query: 19  LLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQAL 78
           +L FFVF + S      P++  G  Y  +P D+  II+E + DP+C D  D+W  +  A+
Sbjct: 1   MLSFFVFQALS---CPIPSRPPGIVY--NP-DAKDIIVEMYADPLCSDCLDSWSAVSAAI 54

Query: 79  QHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQ--QEKFYNAP 136
             Y  +   ++HLLPLPYH   +  S+ +      N+S    LL   +    Q  F  + 
Sbjct: 55  TKYKDNARFIIHLLPLPYHTWTFMVSKVIMAAKSINTSYAPILLNCLYADGNQSMFLGSE 114

Query: 137 TQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
            +++T   + K+ +K+AAE +G + +  L   F  +  +  TR+ FK
Sbjct: 115 VKSVTAGDMQKKALKWAAEKLGITVND-LTKAFGTQEMN--TRIEFK 158


>gi|405953203|gb|EKC20909.1| Eppin [Crassostrea gigas]
          Length = 403

 Score = 76.6 bits (187), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 45/134 (33%), Positives = 72/134 (53%), Gaps = 7/134 (5%)

Query: 56  IEAFFDPVCPDSRDAWPPLKQALQHYGPHV-SLVVHLLPLPYHDNAYATSRALHIVNRTN 114
           ++AF  PVCPDS++ +P L Q   HYGP V +L +H+ PLPYH N++  +    +VNR  
Sbjct: 42  VDAFLGPVCPDSKEVFPTLLQLANHYGPDVLTLRMHMFPLPYHRNSFLVAMGAQVVNRMI 101

Query: 115 SSATFCLLEW---FFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSD 171
           SSA   +  W    + + +   N  T++M+   ++  +   AA G+G   S  ++   +D
Sbjct: 102 SSAQG-VYNWTGSVYDKIDTLSNTATKSMSEIQIISMLGDIAA-GLGIQKSVFVQK-MAD 158

Query: 172 RSTDLLTRVSFKVT 185
              D   RV +K T
Sbjct: 159 PLVDEDARVEWKYT 172


>gi|118386030|ref|XP_001026137.1| hypothetical protein TTHERM_01145000 [Tetrahymena thermophila]
 gi|89307904|gb|EAS05892.1| hypothetical protein TTHERM_01145000 [Tetrahymena thermophila
           SB210]
          Length = 230

 Score = 75.9 bits (185), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 76/161 (47%), Gaps = 18/161 (11%)

Query: 31  SQSTP-PAKYDGFFYA--NHPVDSDAIIIEAFFDPVCPDSRDAWPPLK-----QALQHYG 82
           +Q+ P P  YDGF     N PV      +EAF+D +CP S  A   +           Y 
Sbjct: 18  TQNVPIPNTYDGFVIGDPNAPV-----TVEAFYDLLCPASEAANEQINLLFTNATFAQYL 72

Query: 83  PHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTR 142
           P +  + H+ PLPYHD A+      H+ +  ++  TF  + W F  QE+++N P  N TR
Sbjct: 73  PKIKFIYHIFPLPYHDYAFR-----HVFDNHDAKTTFDFINWSFDIQEQYWNGPVTNYTR 127

Query: 143 TAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
            AV  ++     + +     +   +  ++R+ DL  R S+K
Sbjct: 128 NAVFNKLAHQLHDFVPKISINETLAALTNRTYDLEARYSWK 168


>gi|440791659|gb|ELR12897.1| hypothetical protein ACA1_095270 [Acanthamoeba castellanii str.
           Neff]
          Length = 212

 Score = 74.7 bits (182), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 55/186 (29%), Positives = 90/186 (48%), Gaps = 9/186 (4%)

Query: 14  ILQSALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPP 73
           IL   L  F VF +     + PP       +A    D+ AI +EAF D  CPDS+  WP 
Sbjct: 6   ILALVLALFTVFVA-----AQPPIPLASSVFALGDADA-AIKLEAFVDLQCPDSKAVWPI 59

Query: 74  LKQALQHYGPH-VSLVVHLLPLPYHDNAY--ATSRALHIVNRTNSSATFCLLEWFFKQQE 130
           L Q  +HYGPH + LV  L PLPYH NA+  A +R        N++  +   ++ + QQ+
Sbjct: 60  LLQLAKHYGPHKLQLVATLFPLPYHHNAFFAAQARPPFPELARNNTVFWDYAQYLYDQQD 119

Query: 131 KFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFS 190
            + +  TQ +    V+ ++ ++A        ++ +++     + +LL RV++K       
Sbjct: 120 NWGDEATQGIAPLQVITQMSRYAQACCKVDPAAFVKAMGWVSAQNLLARVTWKRAASLGI 179

Query: 191 FNTSFF 196
           + T  F
Sbjct: 180 YGTPQF 185


>gi|390354820|ref|XP_003728415.1| PREDICTED: uncharacterized protein LOC100888511 [Strongylocentrotus
           purpuratus]
          Length = 219

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 50/175 (28%), Positives = 84/175 (48%), Gaps = 4/175 (2%)

Query: 10  HATLILQSALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRD 69
           H T+++ S  L      S   +Q+  P +  GFFY+N   D+  I +++F D +CPDS+ 
Sbjct: 2   HRTVVV-SLTLVLGGLISQGTAQAPIPLRPPGFFYSNGSCDA-PIQLQSFVDLMCPDSKA 59

Query: 70  AWPPLKQALQHYGPH-VSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQ 128
           AW   ++   +YGP  V     + PLPYH  A   ++   +  R +S+ T+      F +
Sbjct: 60  AWRTYQKVADYYGPSTVRFSALIFPLPYHRAAMVAAQGAFVAARLDSNKTYSWFNTVFDK 119

Query: 129 QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
           Q   YNA       + +V  + ++A++ +G S +  +        T  L RV FK
Sbjct: 120 QAALYNANITEQPDSYIVNALSEWASD-VGYSKAQFMMHLSPSDPTQSLARVEFK 173


>gi|124088534|ref|XP_001347135.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|145474257|ref|XP_001423151.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|50057524|emb|CAH03508.1| Conserved hypothetical protein [Paramecium tetraurelia]
 gi|124390211|emb|CAK55753.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 72.4 bits (176), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 59/114 (51%), Gaps = 1/114 (0%)

Query: 50  DSDAIIIEAFFDPVCPDSRDAWPPLKQAL-QHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
           DS  +IIEAFFD  CPDSR+++  L+Q L Q YG      +H+ PLPYH  A+  ++A  
Sbjct: 33  DSKKLIIEAFFDLQCPDSRNSFRILEQVLAQGYGTQFKYTIHMFPLPYHRAAFPEAQAFA 92

Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYS 162
            ++  ++ A +   +  F  QE    + T   T   ++ ++   A E +   Y 
Sbjct: 93  FLSEIDAEAAYLFAKIIFDNQETLAESATLEWTWQQILNKVADLAKEHVYPKYQ 146


>gi|313227081|emb|CBY22228.1| unnamed protein product [Oikopleura dioica]
          Length = 217

 Score = 68.2 bits (165), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 73/156 (46%), Gaps = 6/156 (3%)

Query: 31  SQSTPPAKYDGFFY---ANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSL 87
            Q+  P +  GF Y    N  V++  + ++ F D  CP+   AWP LK    HYGP+V L
Sbjct: 15  GQAPIPNRPVGFVYNPTGNQFVEN-PVEVQVFIDLQCPNCLSAWPGLKAMGDHYGPNVRL 73

Query: 88  VVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVK 147
            V   PLPYH  A+  +  L  VN+ N    +  ++  F  Q+   N     +    +V+
Sbjct: 74  SVVAFPLPYHRAAFKMAWGLQAVNKMNPQLAYDYMDNIFANQDDIANY-AAGVNDKDLVQ 132

Query: 148 EIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
            I     + +  +  + L++  +D + D  TRV +K
Sbjct: 133 YIANKTVDKLNINKDTFLDN-MADPNLDWATRVDWK 167


>gi|390368255|ref|XP_790435.2| PREDICTED: uncharacterized protein LOC585517 isoform 2
           [Strongylocentrotus purpuratus]
 gi|390368257|ref|XP_003731415.1| PREDICTED: uncharacterized protein LOC585517 isoform 1
           [Strongylocentrotus purpuratus]
          Length = 286

 Score = 67.8 bits (164), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 49/178 (27%), Positives = 84/178 (47%), Gaps = 3/178 (1%)

Query: 7   NKNHATLILQSALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPD 66
           + N    ++ S  L      S   +Q+  P +  GFFY+    D+  I +E+F D +CPD
Sbjct: 62  SDNMHRTVVASLALVLGALISQGTAQAPIPLRPPGFFYSTGSCDA-PIQLESFLDLICPD 120

Query: 67  SRDAWPPLKQALQHYGPH-VSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWF 125
           S+ AW   ++   +YGP  V     + PLPYH  A A ++   + +  +S+ T+  L   
Sbjct: 121 SKAAWRTYQKVADYYGPDTVRFSALMFPLPYHRAAMAAAQGAFVADMLDSNKTYSWLNTA 180

Query: 126 FKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
           F +Q    +A       + ++K + ++A+E +G S +  +        T  L RV FK
Sbjct: 181 FDKQAALSDANISEQPDSYILKVLSEWASE-VGYSKAQFMMHLSRSDPTQSLARVEFK 237


>gi|390368260|ref|XP_003731416.1| PREDICTED: uncharacterized protein LOC100890543 [Strongylocentrotus
           purpuratus]
          Length = 509

 Score = 67.8 bits (164), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/154 (29%), Positives = 71/154 (46%), Gaps = 4/154 (2%)

Query: 31  SQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VSLVV 89
           +Q+  P +  GF Y   P     +++EA+ D +CPDS++AW   +    HYGP  V    
Sbjct: 19  AQAPIPDRPPGFAY--KPAPEAPVVLEAWVDLICPDSKNAWLTYQAVADHYGPEKVRFSA 76

Query: 90  HLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEI 149
            + PLPYH +A   ++A  + N  N   T+   +  F +Q    +    + T+  V + +
Sbjct: 77  LMFPLPYHRSAMMAAQAAFVANELNPELTYAFFDNAFSKQPALSDDNIFDQTQAYVQETL 136

Query: 150 VKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
             +A E   +    A     SD  T  L RV FK
Sbjct: 137 AAWAGEVGYDQTEFASRLARSD-DTQWLARVEFK 169


>gi|313220976|emb|CBY31809.1| unnamed protein product [Oikopleura dioica]
          Length = 217

 Score = 67.4 bits (163), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 72/156 (46%), Gaps = 6/156 (3%)

Query: 31  SQSTPPAKYDGFFY---ANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSL 87
            Q+  P +  GF Y    N  V++  + ++ F D  CP+   AWP LK    HYGP+V L
Sbjct: 15  GQAPIPNRPVGFVYNPTGNQFVEN-PVEVQVFIDLQCPNCLSAWPGLKAMGDHYGPNVRL 73

Query: 88  VVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVK 147
            V   PLPYH  A+  +  L  VN+ N    +  ++  F  Q+   N     +    +V+
Sbjct: 74  SVVAFPLPYHRAAFKMAWGLQAVNKMNPQLAYDYMDNIFANQDDIANY-AAGVNDKDLVQ 132

Query: 148 EIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
            I     + +  +  + L+   +D + D  TRV +K
Sbjct: 133 YIANKTVDKLNINKDTFLDK-MADPNLDWATRVDWK 167


>gi|307945890|ref|ZP_07661226.1| conserved hypothetical protein [Roseibium sp. TrichSKD4]
 gi|307771763|gb|EFO30988.1| conserved hypothetical protein [Roseibium sp. TrichSKD4]
          Length = 191

 Score = 67.4 bits (163), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/163 (27%), Positives = 75/163 (46%), Gaps = 5/163 (3%)

Query: 36  PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGP-HVSLVVHLLPL 94
           P K  G+FY     +   + +EAF D  CP S+ AWP L    +HY    + L VHL+ L
Sbjct: 6   PRKPSGYFYGLRDAN---VQVEAFIDIQCPHSKLAWPSLVALKEHYAKGEIGLAVHLITL 62

Query: 95  PYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAA 154
             H  A+  S+ +  +   ++   F    + + +Q  +YN P  + T   ++  I  FA 
Sbjct: 63  SNHRQAWDVSKGIFALAGDDTEKFFGFASYLYARQSDYYNGPFLHKTHQDLLNLIADFAH 122

Query: 155 EGIGNSYSSALE-SGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
           +  G +    L+  G ++  TD  T + F  T+  ++  T F 
Sbjct: 123 DFAGFNKDEMLKLIGTNEVYTDARTPIRFASTKGVWATPTFFL 165


>gi|422295502|gb|EKU22801.1| hypothetical protein NGA_0488300 [Nannochloropsis gaditana CCMP526]
          Length = 228

 Score = 67.0 bits (162), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 62/146 (42%), Gaps = 3/146 (2%)

Query: 51  SDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIV 110
           S  + ++ F D VC D++ AWP +KQ +Q Y   V    H  PLPYH NA+    A+   
Sbjct: 52  SSGVNVDVFLDLVCIDTKKAWPVIKQVIQTYDRAVQWRFHQFPLPYHTNAFIG--AVVAA 109

Query: 111 NRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFS 170
              N +A    ++  F +Q +  N  TQ MT  AV   +  +  E  G      +     
Sbjct: 110 TYGNDTAALRYMDIAFARQIQVSNELTQGMTYKAVEGVVAAWVQEATGLE-REVVRRKLQ 168

Query: 171 DRSTDLLTRVSFKVTQKFFSFNTSFF 196
           +   +   R SFK       F T  F
Sbjct: 169 EGGVEEAVRFSFKYATIRSIFGTPMF 194


>gi|260820710|ref|XP_002605677.1| hypothetical protein BRAFLDRAFT_77925 [Branchiostoma floridae]
 gi|229291012|gb|EEN61687.1| hypothetical protein BRAFLDRAFT_77925 [Branchiostoma floridae]
          Length = 292

 Score = 66.2 bits (160), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/146 (34%), Positives = 73/146 (50%), Gaps = 8/146 (5%)

Query: 41  GFFYAN-HPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VSLVVHLLPLPYHD 98
           G+ Y N +P+    I IE F D VCPD R A+P LK+    YGP+ V L +HL PLPYH 
Sbjct: 40  GYVYKNGNPLAP--IQIETFGDLVCPDFRAAFPILKEVADWYGPNLVCLKIHLFPLPYHK 97

Query: 99  NAYATSRALHIVNR-TNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGI 157
            A+ T + LHI+      + TF  ++      E F  A   N T   V  +++    + +
Sbjct: 98  YAFLTHQVLHIIEPIIGINGTFDYMDRVLADLEAFSGA-VMNRTEGQVYSKLLGI-VQTL 155

Query: 158 GNSYSSALESGFSDRSTDLLTRVSFK 183
           G + +    +G  + S +   RV FK
Sbjct: 156 GVT-ADQYWTGLDESSPNHRARVEFK 180


>gi|260820750|ref|XP_002605697.1| hypothetical protein BRAFLDRAFT_77946 [Branchiostoma floridae]
 gi|229291032|gb|EEN61707.1| hypothetical protein BRAFLDRAFT_77946 [Branchiostoma floridae]
          Length = 292

 Score = 66.2 bits (160), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/146 (34%), Positives = 73/146 (50%), Gaps = 8/146 (5%)

Query: 41  GFFYAN-HPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VSLVVHLLPLPYHD 98
           G+ Y N +P+    I IE F D VCPD R A+P LK+    YGP+ V L +HL PLPYH 
Sbjct: 40  GYVYKNGNPLAP--IQIETFGDLVCPDFRAAFPILKEVADWYGPNLVCLKIHLFPLPYHK 97

Query: 99  NAYATSRALHIVN-RTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGI 157
            A+ T + LHI+      + TF  ++      E F  A   N T   V  +++    + +
Sbjct: 98  YAFLTHQVLHIIEPIIGINGTFDYMDRVLADLEAFSGA-VMNRTEGQVYSKLLGI-VQTL 155

Query: 158 GNSYSSALESGFSDRSTDLLTRVSFK 183
           G + +    +G  + S +   RV FK
Sbjct: 156 GVT-ADQYWTGLDESSPNHRARVEFK 180


>gi|348690388|gb|EGZ30202.1| hypothetical protein PHYSODRAFT_477416 [Phytophthora sojae]
          Length = 303

 Score = 65.9 bits (159), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 49/155 (31%), Positives = 81/155 (52%), Gaps = 6/155 (3%)

Query: 36  PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGP-HVSLVVHLLPL 94
           P +  GF       D+  + +E++ D +CPDS+ A+P LK+  QHY P  + +   L PL
Sbjct: 27  PRRPPGFTLGGGSADA-GVQLESYIDLLCPDSKSAYPGLKKLAQHYEPDELRVRFVLFPL 85

Query: 95  PYHDNAYATSRALHIVNRTNSSATFC-LLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFA 153
           PYH +A+AT+ A   +  T   A+F   LE  +  Q+ F+N  T++++   VV ++ K A
Sbjct: 86  PYHQHAFATAEAAFAITTTLGDASFTKWLETIYANQDIFWNKVTKDLSPVQVVAKLKKLA 145

Query: 154 AE---GIGNSYSSALESGFSDRSTDLLTRVSFKVT 185
            +    + +     L +G+     D  TR S+K T
Sbjct: 146 QKTFPSLTDEQWDKLMTGYGGTDVDDHTRESWKYT 180


>gi|124088536|ref|XP_001347136.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|145474255|ref|XP_001423150.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|50057525|emb|CAH03509.1| Conserved hypothetical protein [Paramecium tetraurelia]
 gi|124390210|emb|CAK55752.1| unnamed protein product [Paramecium tetraurelia]
          Length = 214

 Score = 64.7 bits (156), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 46/171 (26%), Positives = 76/171 (44%), Gaps = 10/171 (5%)

Query: 19  LLCFFVFNSCSKSQSTP-PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQA 77
           +L   +   C  SQ TP P   DG    ++P     ++IEA++D  CP SR+++   K  
Sbjct: 6   VLLSLILGVCVGSQYTPIPNIPDGLIIGDNP----NLVIEAYYDIFCPGSRESYNIFKTV 61

Query: 78  LQHYGPH-VSLVVHLLPLPYHDNAYATSRALHIVNRTNS-SATFCLLEWFFKQQEKFYNA 135
           +        + ++H  PLPYH NA+A S     + +T S  A +          E+F + 
Sbjct: 62  IDSLEKDSFTFIIHQFPLPYHKNAFAASAGFKYIWKTVSQEAAYKFEGLLLNNLEQFTDL 121

Query: 136 PTQNMTRTAVVKEIVKFAAEGIGN---SYSSALESGFSDRSTDLLTRVSFK 183
            T N+ +T V ++I       +     +Y   L S       ++ TR S+K
Sbjct: 122 ATLNLKQTEVNQKIADLVKTQLAQYQINYDELLNSMKPGTPENIETRYSWK 172


>gi|123446098|ref|XP_001311803.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121893626|gb|EAX98873.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 199

 Score = 63.5 bits (153), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 45/163 (27%), Positives = 78/163 (47%), Gaps = 8/163 (4%)

Query: 36  PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAW-PPLKQALQHYGPHVSLVVHLLPL 94
           PA+  G  +  +P +++ I++E + DP+CP   +AW   + + +Q Y   V  V+H LPL
Sbjct: 15  PARQTGLTW--NP-NANKILVEMYGDPLCPVCLNAWMNGVSKMIQKYQNDVKFVLHFLPL 71

Query: 95  PYHDNAYATSRALHIVNRTNSSATFCLLEWFF-KQQEKFYNAPTQNMTRTAVVKEIVKFA 153
           PYH  A+  +R +  V + +      L+   +   Q +F N P+   T   V    +K+A
Sbjct: 72  PYHTWAFVVTRTILAVKQLSEPKAQILVNALYTGGQGQFENDPS--FTENTVTDNCIKYA 129

Query: 154 AEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
           A+ I       + + F+    +L  RV FK +       T +F
Sbjct: 130 AK-ICQLSEDDIRNAFATIDINLGARVEFKYSTSHGVDGTPYF 171


>gi|301118869|ref|XP_002907162.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262105674|gb|EEY63726.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 298

 Score = 60.8 bits (146), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 45/155 (29%), Positives = 77/155 (49%), Gaps = 6/155 (3%)

Query: 36  PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHY-GPHVSLVVHLLPL 94
           P +  GF       D+  + +E++ D +CPDS+ A+P LK   +HY    + +   L PL
Sbjct: 23  PRRPPGFTMGGGSADA-GVQLESYIDLLCPDSKSAYPGLKNLAEHYEADELRVRFVLFPL 81

Query: 95  PYHDNAYATSRALHIVNRTNSSATF-CLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFA 153
           PYH +A+AT+ A   V       +F   LE  +  Q+ F+N  T++++   VV+++ K A
Sbjct: 82  PYHQHAFATAEAAFTVTTALGDKSFTTWLETIYANQDIFWNKATKDLSPIQVVEKLKKLA 141

Query: 154 AE---GIGNSYSSALESGFSDRSTDLLTRVSFKVT 185
            +    + +       +G+     D  TR S+K T
Sbjct: 142 QKTFPSLTDKQWDEQMTGYGGTDVDDHTRESWKYT 176


>gi|118378878|ref|XP_001022613.1| conserved hypothetical protein [Tetrahymena thermophila]
 gi|89304380|gb|EAS02368.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
          Length = 225

 Score = 60.5 bits (145), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 46/177 (25%), Positives = 83/177 (46%), Gaps = 14/177 (7%)

Query: 17  SALLCFFVFNSCSKSQSTP-PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPL- 74
           S  +   +    S +Q  P P K DG  + N    +  + I+A++D  CPDS   +  L 
Sbjct: 4   SVFVILSILALASATQYVPIPDKPDGVGFGNV---NATLTIDAWYDLTCPDSLYTYQELI 60

Query: 75  ----KQALQHYGPHVSLVVHLLPLPYHDNAYATSRAL-HIVNRTNSSATFCLLEWFFKQQ 129
                Q L  Y   + +  H++PLPYH NA+  ++   ++++   + AT   ++  F+ Q
Sbjct: 61  SALNNQTLAAYVNQIRVNYHIMPLPYHYNAFYLAQVFKYVLDVKGAEATLVYVDQIFENQ 120

Query: 130 EKFYNAPTQNMTRTAVVKEIVKFAAEGIG--NSYSSALESGFSDRSTDLLTRVSFKV 184
             +YNA   N+T T + + I    ++ +   N  S  +   F+D   +   R S+K+
Sbjct: 121 PDYYNA--LNLTTTQIYQLIADNTSQYLSAYNVTSQEILDSFNDYDYNGGARASYKL 175


>gi|154420474|ref|XP_001583252.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121917492|gb|EAY22266.1| hypothetical protein TVAG_094410 [Trichomonas vaginalis G3]
          Length = 197

 Score = 58.2 bits (139), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 58/123 (47%), Gaps = 3/123 (2%)

Query: 33  STPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLL 92
           S+ P +Y G  Y N   +   I+I+ + DP+C D   +WP +K+ LQ+Y   + L  H +
Sbjct: 10  SSVPKRYPGMLYGNE--NEYKILIDMYCDPLCSDCAYSWPTIKKVLQYYPKDLLLRFHTI 67

Query: 93  PLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFK-QQEKFYNAPTQNMTRTAVVKEIVK 151
           PL  H  +Y + +A+  +   +      +L+  +   Q  F N    N +    +++   
Sbjct: 68  PLDIHTWSYHSVKAVQALRLMDEGKAKQMLDKLYDGDQIYFLNTEMFNTSENQAIQKFCS 127

Query: 152 FAA 154
           + A
Sbjct: 128 YVA 130


>gi|299115356|emb|CBN74181.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 262

 Score = 57.0 bits (136), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 40/158 (25%), Positives = 69/158 (43%), Gaps = 7/158 (4%)

Query: 32  QSTPPAKYDGFFYA--NHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVV 89
           QS P   +DG+ +          A+ +E + D  C D   AWP + +  + YG     + 
Sbjct: 47  QSEPAIPFDGYGHVILRQGTLEAALKVEVYLDLACSDCALAWPVMNRVAEAYGDRTEFLY 106

Query: 90  HLLPLPYHDN---AYATSRALHIVNRTNSSATFC-LLEWFFKQQEKFYNAPTQNMTRTAV 145
            L PLPYH+N   A   ++ + + +R +  A+        F  Q++  N  T+ MT+  +
Sbjct: 107 RLFPLPYHNNAFKAAKAAKTIQLYSRGSEDASVASFFTEVFIGQDEISNDSTELMTQPQI 166

Query: 146 VKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
              +  +A    G S +     G +D   ++ TR  FK
Sbjct: 167 EDILESWATSSSGMS-ADNFHMGMADAEVEIQTRNQFK 203


>gi|348690389|gb|EGZ30203.1| hypothetical protein PHYSODRAFT_553285 [Phytophthora sojae]
          Length = 225

 Score = 56.6 bits (135), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 65/138 (47%), Gaps = 5/138 (3%)

Query: 51  SDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGP-HVSLVVHLLPLPYHDNAYATSRALHI 109
           S  + +E + D +CP S+ A+  +K    HY P  V +   L PLP+H + +  + ++  
Sbjct: 39  SAKVQLETYIDLLCPFSKAAYDGVKALADHYAPEQVRVKAVLFPLPFHQHGFTAAESVFT 98

Query: 110 VNRTNSSATFC-LLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE-- 166
           V        F   LE  ++ QE+F+N  T++ +   V  E+ + A +   +      E  
Sbjct: 99  VTSALGDDHFTNWLEVVYEHQEEFWNKATKDKSAVQVTNELKQLAQKTFPDLTDQQWERG 158

Query: 167 -SGFSDRSTDLLTRVSFK 183
            +G+     D  TRV++K
Sbjct: 159 MTGYGGTEADQHTRVTWK 176


>gi|290978395|ref|XP_002671921.1| predicted protein [Naegleria gruberi]
 gi|284085494|gb|EFC39177.1| predicted protein [Naegleria gruberi]
          Length = 244

 Score = 56.2 bits (134), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 49/176 (27%), Positives = 80/176 (45%), Gaps = 21/176 (11%)

Query: 27  SCSKSQSTPP-AKYDGFFYANHPVDSDAII--IEAFFDPVCPDSRDAWPPLKQALQHYGP 83
           +C  S   PP ++    +    P +SD+I   I+ + D  C DS + W  + ++L  Y  
Sbjct: 33  NCKISPIGPPISRLKDSWSTTTPKNSDSIKVQIDLYVDLACSDSAEWWTSVFKSLDSYYF 92

Query: 84  H-------VSLVVHLLPLPYHDNAYATSRALHIVNR---TNSSATFCLLEWFFKQQEKFY 133
                   + L +HLLPLPYH ++YATS +   +N+    NS     L+  F+  Q   Y
Sbjct: 93  SNSSSSAPIKLNLHLLPLPYHISSYATSMSYIAINKYTNYNSKVLKELINSFYTNQSPLY 152

Query: 134 NAPTQNMTRTAVVKEIVKFAAEGIG----NSYSSALESGFSDRSTDLLTRVSFKVT 185
           N     M ++ + K I     + +     N Y + +  G    S D+ TR  FK++
Sbjct: 153 NNIVSEMKQSEMYKLIYTNNIQSLNVISWNDYLNLMLDG----SIDMRTRTLFKLS 204


>gi|444379557|ref|ZP_21178734.1| hypothetical protein D515_3633 [Enterovibrio sp. AK16]
 gi|443676286|gb|ELT82990.1| hypothetical protein D515_3633 [Enterovibrio sp. AK16]
          Length = 191

 Score = 55.8 bits (133), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 1/104 (0%)

Query: 53  AIIIEAFFDPVCPDSRDAWPPLKQALQHY-GPHVSLVVHLLPLPYHDNAYATSRALHIVN 111
           ++ I+ F D  CP SR  WP L   ++HY G  VSL  HL+ L  H  A+  S  L  + 
Sbjct: 20  SVTIDVFIDIQCPHSRAIWPNLMAVMEHYKGQSVSLKTHLITLSNHRQAWDMSLGLFALA 79

Query: 112 RTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAE 155
             ++   +    + +++Q+ FYN    + T   +   +  +A E
Sbjct: 80  EGDAEKFYRFATFLYERQDTFYNGQFLHKTHEDLRNLVADYAVE 123


>gi|340501301|gb|EGR28101.1| hypothetical protein IMG5_183230 [Ichthyophthirius multifiliis]
          Length = 227

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/172 (26%), Positives = 80/172 (46%), Gaps = 11/172 (6%)

Query: 19  LLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQAL 78
           +   F+F +CS+  S P  K DG    N    +  I +EA++D  CPDS+ +   L    
Sbjct: 9   ITILFLFINCSQYISIPK-KPDGVTIGNK---NSQIKLEAYYDLQCPDSKHSNEQLNILF 64

Query: 79  QH--YGPHVSLVVHLLPLPYHDNAYATSRALHIV-NRTNSSATFCLLEWFFKQQEKFYNA 135
           +   Y  +++   H  PLPYH N+Y  S+ L  V ++  + A        F+ Q+ + N 
Sbjct: 65  KEKTYLQNLAFTYHYFPLPYHFNSYHLSQTLRFVYDKYGTQAAVEFTNKVFENQQNYDNN 124

Query: 136 PTQNMTRTAVVKEIVKFAAEGIGNSY---SSALESGFSDRSTDLLTRVSFKV 184
            T+ +    + ++I     +     Y   SS +    +D++ +  TR S+K+
Sbjct: 125 KTKTLNLNEIYQKIAD-DTQSFLKEYSIKSSDVIDALNDQNYNQETRQSWKI 175


>gi|291237330|ref|XP_002738588.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 215

 Score = 53.9 bits (128), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 42/155 (27%), Positives = 74/155 (47%), Gaps = 5/155 (3%)

Query: 44  YANHPVDSDAII-IEAFFDPVCPDSRDAWPPLKQALQHYGPHVS-LVVHLLPLPYHDNAY 101
           Y  H  ++DA + I  +FD  C DS+  +  +  A  +YGP V  LV+H L LPY   ++
Sbjct: 36  YVYHGGNADAPVHITMYFDSQCSDSKMGYQGVFDAADYYGPDVVYLVLHGLSLPYFRGSF 95

Query: 102 ATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSY 161
            + +AL  V++ N + T   L+  F  Q+    +P   ++   ++  +  FA E +G + 
Sbjct: 96  ISLQALRAVDQLNKTKTVEYLQILFDNQDLISGSP-DTVSDADMIVILTDFAVE-VGFTE 153

Query: 162 SSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
              LE  ++   T+ L R   K+      +   +F
Sbjct: 154 EEFLEE-YNHPKTNQLCRHEMKMANDRGVYGGPWF 187


>gi|443723886|gb|ELU12105.1| hypothetical protein CAPTEDRAFT_202202 [Capitella teleta]
          Length = 208

 Score = 53.1 bits (126), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 54/112 (48%), Gaps = 1/112 (0%)

Query: 48  PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGP-HVSLVVHLLPLPYHDNAYATSRA 106
           P  +  + IE F + +C DS  +W  +    QHYGP ++++VVH+  LPYH NA+  ++ 
Sbjct: 34  PSSTGDVTIEVFLELLCFDSDISWGIINDLQQHYGPENLNIVVHIYSLPYHRNAFLFAQG 93

Query: 107 LHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIG 158
           L+ V           ++  +  +  F +  T + T T +V  +   A    G
Sbjct: 94  LYQVADLAPEFVHDYVDMAYANRPSFGDDATVDETETEIVGRLSVLAESVTG 145


>gi|390343493|ref|XP_003725885.1| PREDICTED: uncharacterized protein LOC100891241 [Strongylocentrotus
           purpuratus]
          Length = 239

 Score = 53.1 bits (126), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 32/128 (25%), Positives = 57/128 (44%), Gaps = 2/128 (1%)

Query: 29  SKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VSL 87
           + +Q+  P    GF Y +   D   I++EA+ D +CPD++ AW   ++   HYG   V  
Sbjct: 20  ASAQAPIPESPPGFVYKSTSPDV-PILLEAWVDLICPDAKGAWLTYQEVADHYGTDTVRF 78

Query: 88  VVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVK 147
              + PLPY   A   S+   +    N +  +      F +Q   Y+    + T   + +
Sbjct: 79  SALMFPLPYDRAAMKASQGSFVARDMNPALVYAWFNNVFDKQAALYDNAIYDKTTEFIDE 138

Query: 148 EIVKFAAE 155
            +  +A+E
Sbjct: 139 TLATWASE 146


>gi|134110460|ref|XP_776057.1| hypothetical protein CNBD1050 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50258725|gb|EAL21410.1| hypothetical protein CNBD1050 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 210

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 51/109 (46%), Gaps = 7/109 (6%)

Query: 56  IEAFFDPVCPDSRDAWP-------PLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
           +E + DPVCP SR           P+      Y   V+LVV L P P+H  +     AL+
Sbjct: 21  LEVYIDPVCPFSRKITESIDKNVLPMITNGGKYDGKVNLVVRLYPQPFHYYSAPIIEALY 80

Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGI 157
           +  +TN    +  L      +  FYN P  ++T +++  ++V+ A E +
Sbjct: 81  VFGQTNPRLFWQYLLAVHSTETTFYNRPAASLTLSSLRDKLVEIAVEQV 129


>gi|440801380|gb|ELR22400.1| glutathione reductase [Acanthamoeba castellanii str. Neff]
          Length = 456

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 7/124 (5%)

Query: 19  LLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQAL 78
           LL   V N     Q  P  K     +  +P  S  I I  F D  CP   D++P ++Q L
Sbjct: 12  LLALVVVNG----QQLPIPKTAPGMHRGNP--SAPIHITEFADYQCPYCADSYPIVEQVL 65

Query: 79  QHYGP-HVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPT 137
           + YGP  +   +H+ PL  H  A+  + A  +V        +    + F  Q +FYN+  
Sbjct: 66  KLYGPDKIYYTLHIFPLWLHRQAFIVAEAAGVVALNAPDRYWEAASFLFANQAQFYNSAF 125

Query: 138 QNMT 141
           QN T
Sbjct: 126 QNKT 129


>gi|325188255|emb|CCA22794.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 237

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 55/113 (48%), Gaps = 2/113 (1%)

Query: 43  FYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGP-HVSLVVHLLPLPYHDNAY 101
           F  N+   S  I +EAF DP+C  S+ + P LK    HY P  V + +   PL +H + Y
Sbjct: 50  FRMNNCSLSARIQLEAFIDPLCRSSKFSHPILKNLSTHYKPDEVCIRIVPFPLIFHVHTY 109

Query: 102 ATSRALH-IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFA 153
             + AL+ +VN    S     LE  ++ Q+ F      N TR  V +++ + A
Sbjct: 110 EVTTALYQVVNALEDSVFVEWLEAIYEVQDVFLYTNGPNFTRLQVREKVHEVA 162


>gi|321262693|ref|XP_003196065.1| hypothetical protein CGB_I1550W [Cryptococcus gattii WM276]
 gi|317462540|gb|ADV24278.1| Hypothetical protein CGB_I1550W [Cryptococcus gattii WM276]
          Length = 210

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 51/109 (46%), Gaps = 7/109 (6%)

Query: 56  IEAFFDPVCPDSRDAWPPLKQAL-------QHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
           +E + DPVCP SR     + + +         Y   V+LVV L P P+H  +     AL+
Sbjct: 21  LEVYIDPVCPFSRKITESIDRNILPMITNGGKYDGTVNLVVRLYPQPFHYYSAPIIEALY 80

Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGI 157
           +  +TN    +  L         FYN P  ++T +++  ++V+ A E +
Sbjct: 81  VFGQTNPRLFWQYLLAVHSTGTTFYNRPAASLTLSSLRDKLVEIAVEQV 129


>gi|325188256|emb|CCA22795.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 239

 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 57/119 (47%), Gaps = 2/119 (1%)

Query: 43  FYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGP-HVSLVVHLLPLPYHDNAY 101
           F  N+   S  I +EAF DP+C  S+ + P LK    HY P  V + +   PL +H + Y
Sbjct: 52  FRMNNCSLSARIQLEAFIDPLCRSSKFSHPILKNLSTHYKPDEVCIRIVPFPLIFHVHTY 111

Query: 102 ATSRALH-IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGN 159
             + AL+ +VN    S     LE  ++ Q+ F      N TR  V +++ + A +   N
Sbjct: 112 EVTTALYQVVNALEDSVFVEWLEAIYEVQDVFLYTNGPNFTRLQVREKVHEVANKTFPN 170


>gi|440801397|gb|ELR22417.1| hypothetical protein ACA1_255150 [Acanthamoeba castellanii str.
           Neff]
          Length = 185

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 45/89 (50%), Gaps = 1/89 (1%)

Query: 54  IIIEAFFDPVCPDSRDAWPPLKQALQHYGP-HVSLVVHLLPLPYHDNAYATSRALHIVNR 112
           I +  F D  CPD   ++P +++ LQHYGP  +   +H+ PL  H  A+  + A  +V  
Sbjct: 10  IHLTEFADYQCPDCGVSYPVVEKVLQHYGPDKIYYTLHVFPLWLHRQAFILAEAAGVVAL 69

Query: 113 TNSSATFCLLEWFFKQQEKFYNAPTQNMT 141
                 +   ++ F  Q +F+N+  +N T
Sbjct: 70  NAPEHYWEAAQFLFANQAQFFNSAYRNKT 98


>gi|323448010|gb|EGB03914.1| hypothetical protein AURANDRAFT_67594 [Aureococcus anophagefferens]
          Length = 739

 Score = 45.4 bits (106), Expect = 0.013,   Method: Composition-based stats.
 Identities = 26/65 (40%), Positives = 35/65 (53%), Gaps = 2/65 (3%)

Query: 56  IEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVN-RTN 114
           ++ F D  C DS+  WP LK AL    P V + + L PLPY+   +  ++A       TN
Sbjct: 548 MDVFLDMFCEDSKATWPTLK-ALAAKRPDVDIRIQLFPLPYNFGTWIPAQACTAAALLTN 606

Query: 115 SSATF 119
           SSATF
Sbjct: 607 SSATF 611


>gi|452002015|gb|EMD94474.1| hypothetical protein COCHEDRAFT_1094426 [Cochliobolus
           heterostrophus C5]
          Length = 211

 Score = 45.1 bits (105), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 55/124 (44%), Gaps = 7/124 (5%)

Query: 38  KYDGFFYANHPVDSDAI-IIEAFFDPVCPDSRDAW-----PPLKQAL-QHYGPHVSLVVH 90
           K+ G   A+  +   A+  +E + D VCP S   +     PPL+  L   YG  +  +  
Sbjct: 6   KFAGQKLASSSIQPKAVHTLELYLDYVCPFSAKMYKTLFTPPLRNTLLSTYGTSLVTIFR 65

Query: 91  LLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIV 150
               P+H ++     A + V R ++S  +   E  F+QQ  F++    N TR    + + 
Sbjct: 66  QQIQPWHPSSTLVHEAAYAVQRVDASKFYDFSEKLFEQQNDFFDVSLVNETRNQTYRRLA 125

Query: 151 KFAA 154
           K AA
Sbjct: 126 KIAA 129


>gi|405119955|gb|AFR94726.1| hypothetical protein CNAG_01417 [Cryptococcus neoformans var.
           grubii H99]
          Length = 212

 Score = 44.3 bits (103), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 49/107 (45%), Gaps = 7/107 (6%)

Query: 56  IEAFFDPVCPDSRDAWP-------PLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
           +E + DPVCP SR           P+      Y   V+L++ L P P+H  +     AL+
Sbjct: 21  LEVYVDPVCPFSRKITESIDKNVLPMITNGGKYDGKVNLIMRLYPQPFHYYSAPIIEALY 80

Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAE 155
           +  +TN    +  L         FYN P  ++T +++  ++V+ A +
Sbjct: 81  VFGQTNPRLFWQYLLAVHSTGTTFYNRPAASLTLSSLRDKLVEIAVQ 127


>gi|451853644|gb|EMD66937.1| hypothetical protein COCSADRAFT_83566 [Cochliobolus sativus ND90Pr]
          Length = 211

 Score = 43.9 bits (102), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 55/124 (44%), Gaps = 7/124 (5%)

Query: 38  KYDGFFYANHPVDSDAI-IIEAFFDPVCPDSRDAW-----PPLKQAL-QHYGPHVSLVVH 90
           K+ G   A+  +   A+  +E + D VCP S   +     PPL+  L   YG  +  +  
Sbjct: 6   KFAGQKLASSSIQPKAVHTLEFYLDYVCPFSAKMYKTLFTPPLRNTLLSTYGTSLVTIFR 65

Query: 91  LLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIV 150
               P+H ++     A + V R ++   +   E  F+QQ+ F++    N TR    + + 
Sbjct: 66  QQIQPWHPSSTLVHEAAYAVQRVDAGKFYDFSEKLFEQQKDFFDVSLVNETRNQTYRRLA 125

Query: 151 KFAA 154
           K AA
Sbjct: 126 KIAA 129


>gi|443734729|gb|ELU18601.1| hypothetical protein CAPTEDRAFT_216673 [Capitella teleta]
          Length = 256

 Score = 43.5 bits (101), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 26/112 (23%), Positives = 50/112 (44%), Gaps = 2/112 (1%)

Query: 51  SDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIV 110
           S  + +E F D  C  S+ +W  +      +G  + +V+   P PYH  A+  ++   +V
Sbjct: 37  SHPLCLEVFLDLNCEYSKKSWMTMMDLSVVFGERLEIVIQQFPQPYHTFAFILTQGYFLV 96

Query: 111 NRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEI--VKFAAEGIGNS 160
           ++    A     +  +     F N  T N+T   V++ +  ++F A GI  +
Sbjct: 97  HKEAPEAILNYTDLVYDHLASFNNENTFNLTTKEVIEILADLQFEATGINQT 148


>gi|325284363|ref|YP_004256903.1| DSBA oxidoreductase [Deinococcus proteolyticus MRP]
 gi|324316427|gb|ADY27540.1| DSBA oxidoreductase [Deinococcus proteolyticus MRP]
          Length = 216

 Score = 43.5 bits (101), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 34/132 (25%), Positives = 57/132 (43%), Gaps = 10/132 (7%)

Query: 48  PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRAL 107
           P D+   I+E FFDP C       P L   +Q Y   V LV    PL  H N+   +  +
Sbjct: 54  PADAKVTIVE-FFDPECESCAAVEPALMDVMQKYNGEVRLVARYFPL--HSNSTLAAGLI 110

Query: 108 HIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALES 167
               + ++   + + ++ F++Q ++         +TA   + + + AE +G   S A  +
Sbjct: 111 EAAAQDSADKRWRMRDYLFQKQREW------GEQQTAQTDKFLDY-AEDMGLDRSKAQAT 163

Query: 168 GFSDRSTDLLTR 179
             S    DLL R
Sbjct: 164 MESAAVRDLLAR 175


>gi|407977028|ref|ZP_11157921.1| DSBA oxidoreductase [Nitratireductor indicus C115]
 gi|407427548|gb|EKF40239.1| DSBA oxidoreductase [Nitratireductor indicus C115]
          Length = 219

 Score = 43.5 bits (101), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 49/110 (44%), Gaps = 16/110 (14%)

Query: 12  TLILQSALLCFFVFNSCS---------KSQSTPPAKYDGFFYANHPV----DSDAIIIEA 58
           +++L + L+   VF   +          +Q   PA+ D    A+ PV    D+   I+E 
Sbjct: 5   SIVLLTGLVALIVFAGGAFIYDRYDGLSAQPVAPAQGDALVRAHSPVIGPADAPVTIVE- 63

Query: 59  FFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
           FFDP C   R  +P +KQ + ++     LV+     P+HD +    R L 
Sbjct: 64  FFDPSCEACRAFYPAVKQIMANFPAETRLVIRY--APFHDGSDEAVRILE 111


>gi|396471125|ref|XP_003838796.1| hypothetical protein LEMA_P024690.1 [Leptosphaeria maculans JN3]
 gi|312215365|emb|CBX95317.1| hypothetical protein LEMA_P024690.1 [Leptosphaeria maculans JN3]
          Length = 210

 Score = 43.1 bits (100), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 54/124 (43%), Gaps = 7/124 (5%)

Query: 38  KYDGFFYANHPVDSDAI-IIEAFFDPVCPDSRDAW------PPLKQALQHYGPHVSLVVH 90
           K+ G   A+  +   A+  IE + D VCP S   +      P  K  L+ Y   V  +  
Sbjct: 6   KFAGQKLASSAIQPKAVHTIEIYLDYVCPFSAKIFKTLYCSPLRKTLLEKYNDRVVTIFR 65

Query: 91  LLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIV 150
               P+H ++     A + V + + +  +   E  F+QQ+ F++A   N TR A  + + 
Sbjct: 66  QQIQPWHPSSTLVHEAAYAVQKVDPAKFYPFSEKLFEQQKDFFDASVVNETRNATYRRLA 125

Query: 151 KFAA 154
           K A 
Sbjct: 126 KIAG 129


>gi|126659892|ref|ZP_01731017.1| glutathione reductase [Cyanothece sp. CCY0110]
 gi|126618855|gb|EAZ89599.1| glutathione reductase [Cyanothece sp. CCY0110]
          Length = 191

 Score = 43.1 bits (100), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 44/91 (48%), Gaps = 1/91 (1%)

Query: 54  IIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VSLVVHLLPLPYHDNAYATSRALHIVNR 112
           I IE FFD  CP SR  W  + +  + Y    + L++  + L  H  ++  ++A  +V +
Sbjct: 21  IQIEMFFDLECPFSRKGWQTILKVFKAYDAQTIYLILQPMTLGNHRQSWDATKAAIVVAQ 80

Query: 113 TNSSATFCLLEWFFKQQEKFYNAPTQNMTRT 143
            N+      + + F  Q +F N   ++ T+T
Sbjct: 81  DNTEKFVDFVSYLFDHQPEFANEAFKDKTQT 111


>gi|443722837|gb|ELU11539.1| hypothetical protein CAPTEDRAFT_228382 [Capitella teleta]
          Length = 253

 Score = 42.4 bits (98), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 25/97 (25%), Positives = 42/97 (43%), Gaps = 3/97 (3%)

Query: 36  PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLP 95
           P +  G   +N    S  + +E F D  C DS  AW  +KQ    +   + +++H  PLP
Sbjct: 48  PEQTPGIRISN---GSSQVRLEVFLDLNCADSAHAWGVIKQLRDTFPGRLEVLLHSYPLP 104

Query: 96  YHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKF 132
           YH  A+  ++ L  V           ++  F   ++F
Sbjct: 105 YHPYAFLCAQGLFAVQHMAPERVESYIDTVFDNMDQF 141


>gi|255077788|ref|XP_002502476.1| predicted protein [Micromonas sp. RCC299]
 gi|226517741|gb|ACO63734.1| predicted protein [Micromonas sp. RCC299]
          Length = 203

 Score = 41.6 bits (96), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 31/118 (26%), Positives = 53/118 (44%), Gaps = 3/118 (2%)

Query: 45  ANHPVDSDAIIIEAFFDPVCPDSRDAWPPLK-QALQHYGPHVSLVVHLLPLPYH-DNAYA 102
           A  P+D+  + + A+ D  CP S   +  +  Q L HYG  V  V +  P P+H  ++  
Sbjct: 14  AKGPMDA-PVTLSAWLDYACPFSAKLFKTVTTQVLPHYGDKVRFVFYHQPQPWHPQSSML 72

Query: 103 TSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNS 160
              A+ + +     A +      F     FY+A T + +R+ + +E+   AA   G S
Sbjct: 73  HEAAIGVYDLGGVDAFWKFSAALFDAATDFYDANTYDKSRSKIYEELAALAARSAGVS 130


>gi|172055207|ref|YP_001806534.1| hypothetical protein cce_5122 [Cyanothece sp. ATCC 51142]
 gi|354556960|ref|ZP_08976239.1| DSBA oxidoreductase [Cyanothece sp. ATCC 51472]
 gi|171701488|gb|ACB54468.1| unknown [Cyanothece sp. ATCC 51142]
 gi|353551086|gb|EHC20503.1| DSBA oxidoreductase [Cyanothece sp. ATCC 51472]
          Length = 191

 Score = 41.2 bits (95), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 23/91 (25%), Positives = 43/91 (47%), Gaps = 1/91 (1%)

Query: 54  IIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VSLVVHLLPLPYHDNAYATSRALHIVNR 112
           I IE FFD  CP S+  W  + + L+ Y    + L++  + L  H  ++  ++A   V +
Sbjct: 21  IQIEMFFDLECPFSKKGWQTILKVLKAYDAETIYLILQPMTLSNHRQSWDATKAAMTVAQ 80

Query: 113 TNSSATFCLLEWFFKQQEKFYNAPTQNMTRT 143
            N+      + + F  Q +F     ++ T+T
Sbjct: 81  DNAEKFVDFVSYLFDHQSEFNEKAFKDKTQT 111


>gi|262202544|ref|YP_003273752.1| DSBA oxidoreductase [Gordonia bronchialis DSM 43247]
 gi|262085891|gb|ACY21859.1| DSBA oxidoreductase [Gordonia bronchialis DSM 43247]
          Length = 214

 Score = 40.8 bits (94), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 27/52 (51%), Gaps = 1/52 (1%)

Query: 48  PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDN 99
           P +S A  +E F D  C     A+P ++Q  Q YG  V+ VV   PLP H N
Sbjct: 52  PAESKATFVE-FLDFECEGCGAAYPAVEQLRQTYGDQVTFVVRYFPLPGHFN 102


>gi|330934811|ref|XP_003304718.1| hypothetical protein PTT_17367 [Pyrenophora teres f. teres 0-1]
 gi|311318600|gb|EFQ87219.1| hypothetical protein PTT_17367 [Pyrenophora teres f. teres 0-1]
          Length = 214

 Score = 40.4 bits (93), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 55/125 (44%), Gaps = 7/125 (5%)

Query: 38  KYDGFFYANHPVDSDAI-IIEAFFDPVCPDSRDAW-----PPLKQAL-QHYGPHVSLVVH 90
           K+ G   A+  +   A+  +E + D VCP S   +      PL+Q L   Y P ++ +  
Sbjct: 6   KFAGQKLASSAISPKAVHTLEIYLDYVCPFSAKIFNTIYNTPLRQTLLTTYSPTLTTIFR 65

Query: 91  LLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIV 150
               P+H ++     A + V + + +A +      F  Q  F++A   N TR A  K + 
Sbjct: 66  QQIQPWHPSSTLVHEAAYAVQKLSPAAFWPYSALLFTHQAAFFDANVVNETRNATYKRLA 125

Query: 151 KFAAE 155
           K A E
Sbjct: 126 KLAGE 130


>gi|262202577|ref|YP_003273785.1| DSBA oxidoreductase [Gordonia bronchialis DSM 43247]
 gi|262085924|gb|ACY21892.1| DSBA oxidoreductase [Gordonia bronchialis DSM 43247]
          Length = 214

 Score = 40.4 bits (93), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 34/133 (25%), Positives = 51/133 (38%), Gaps = 24/133 (18%)

Query: 48  PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATS--- 104
           P +S A  +E F D  C     A+P ++Q  Q YG  V+ VV   PLP H NA   +   
Sbjct: 52  PSESKATFVE-FLDFECEGCGAAYPAVEQLRQTYGDQVTFVVRYFPLPGHFNADRAARAV 110

Query: 105 -------------RALHIVNRTNSSATFCLLEWFFK-------QQEKFYNAPTQNMTRTA 144
                        R + +  R+       L + FF          E+F  A     TR  
Sbjct: 111 AAAAEQGQFEPMYRKMFVTQRSWGEQRVPLDDLFFSYAQELGLNMERFAAAYDSQATREL 170

Query: 145 VVKEIVKFAAEGI 157
           + +++    A G+
Sbjct: 171 IDRDVADGKALGV 183


>gi|409392046|ref|ZP_11243682.1| putative oxidoreductase [Gordonia rubripertincta NBRC 101908]
 gi|403198122|dbj|GAB86916.1| putative oxidoreductase [Gordonia rubripertincta NBRC 101908]
          Length = 214

 Score = 40.0 bits (92), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 27/52 (51%), Gaps = 1/52 (1%)

Query: 48  PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDN 99
           P +S A  +E F D  C     A+P ++Q  Q YG  V+ VV   PLP H N
Sbjct: 52  PTESKATFVE-FLDFECEGCGAAYPAVEQLRQTYGDQVTFVVRYFPLPGHFN 102


>gi|149911666|ref|ZP_01900275.1| dsbA-like thioredoxin domain protein [Moritella sp. PE36]
 gi|149805247|gb|EDM65263.1| dsbA-like thioredoxin domain protein [Moritella sp. PE36]
          Length = 222

 Score = 40.0 bits (92), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 27/99 (27%), Positives = 41/99 (41%), Gaps = 6/99 (6%)

Query: 14  ILQSALLCFFVFNSCSKSQSTPPAKYDGFFYANHPV----DSDAIIIEAFFDPVCPDSRD 69
           I+  A+LC  V  +     S      D   Y+ + +        + I  FFDP C   R 
Sbjct: 18  IISIAVLCIAVVITALSFTSKESINIDQHLYSQNDLVFGNKYAKVTIVEFFDPACESCRA 77

Query: 70  AWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
            +P +K  L+ Y   V+L+V   P+ +H N      AL 
Sbjct: 78  FYPLVKSQLKKYKGKVNLIVR--PVAFHRNVGPVVAALE 114


>gi|40445317|ref|NP_954777.1| hypothetical protein [Gordonia westfalica]
 gi|40217347|emb|CAE09098.1| hypothetical protein [Gordonia westfalica]
          Length = 214

 Score = 40.0 bits (92), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 27/52 (51%), Gaps = 1/52 (1%)

Query: 48  PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDN 99
           P +S A  +E F D  C     A+P ++Q  Q YG  V+ VV   PLP H N
Sbjct: 52  PTESKATFVE-FLDFECEGCGAAYPAVEQLRQTYGDQVTFVVRYFPLPGHFN 102


>gi|119854999|ref|YP_935604.1| DSBA oxidoreductase [Mycobacterium sp. KMS]
 gi|145225999|ref|YP_001136653.1| DSBA oxidoreductase [Mycobacterium gilvum PYR-GCK]
 gi|119697717|gb|ABL94789.1| DSBA oxidoreductase [Mycobacterium sp. KMS]
 gi|145218462|gb|ABP47865.1| DSBA oxidoreductase [Mycobacterium gilvum PYR-GCK]
          Length = 221

 Score = 40.0 bits (92), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 25/51 (49%), Gaps = 1/51 (1%)

Query: 50  DSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNA 100
           DSD   +E F D  C   R A+P ++Q    YG  V+ V    P+P H N 
Sbjct: 59  DSDVTFVE-FLDFECEGCRAAFPAVEQLRAQYGQQVTFVARYFPMPGHFNG 108


>gi|268316621|ref|YP_003290340.1| DSBA oxidoreductase [Rhodothermus marinus DSM 4252]
 gi|262334155|gb|ACY47952.1| DSBA oxidoreductase [Rhodothermus marinus DSM 4252]
          Length = 409

 Score = 39.7 bits (91), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 58/149 (38%), Gaps = 22/149 (14%)

Query: 26  NSCSKSQSTPPAKY-DGFFYANHPV----DSDAIIIEAFFDPVCPDSRDAWPPLKQALQH 80
            +CS     PP +Y       N P     D+  ++IE + DP CP  +   P +KQ ++ 
Sbjct: 223 QTCSFDTERPPVRYFRDLIMMNDPTAGNPDAKVVVIE-YLDPNCPHCKHLHPIMKQVVES 281

Query: 81  YGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQ--------EKF 132
           YG     V    P+P    +     AL+   R        +LE  F++Q        E  
Sbjct: 282 YGLQAYFVFK--PIPLWQFSIPQVAALYAAAREGKFEA--MLEAQFERQRSGGLTLEEIL 337

Query: 133 YNAPTQNMTRTAVVKEIVKFAAEGIGNSY 161
             A    M R  + ++I     EG+ N Y
Sbjct: 338 DIAEAIGMDRNELARQI----NEGVFNEY 362


>gi|189203275|ref|XP_001937973.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187985072|gb|EDU50560.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 214

 Score = 39.7 bits (91), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 55/125 (44%), Gaps = 7/125 (5%)

Query: 38  KYDGFFYANHPVDSDAI-IIEAFFDPVCPDSRDAW-----PPLKQAL-QHYGPHVSLVVH 90
           K+ G   A+  +   A+  +E + D VCP S   +      PL+Q L   Y P ++ +  
Sbjct: 6   KFAGQKLASSAISPKAVHTLEIYLDYVCPFSAKLFNTIYNTPLRQTLLSTYSPTLNTIFR 65

Query: 91  LLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIV 150
               P+H ++     A + V + + +A +      F  Q  F++A   N TR A  K + 
Sbjct: 66  QQIQPWHPSSTLVHEAAYAVQKLSPAAFWPYSALLFAHQATFFDANVVNETRNATYKRLA 125

Query: 151 KFAAE 155
           K A E
Sbjct: 126 KLAGE 130


>gi|254423276|ref|ZP_05036994.1| hypothetical protein S7335_3432 [Synechococcus sp. PCC 7335]
 gi|196190765|gb|EDX85729.1| hypothetical protein S7335_3432 [Synechococcus sp. PCC 7335]
          Length = 191

 Score = 39.7 bits (91), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 28/122 (22%), Positives = 54/122 (44%), Gaps = 4/122 (3%)

Query: 36  PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHY-GPHVSLVVHLLPL 94
           P++  G+   N    +  I++E FFD  CP S+  W  + Q    Y    +  V   + L
Sbjct: 6   PSRRSGYRIGNA---NAPIMVEVFFDLECPFSKKCWDTVMQVKAAYTAEQLYWVFQPMSL 62

Query: 95  PYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAA 154
             H  ++  ++A   V+  ++      + + F +Q +F N   ++ T+T     + + AA
Sbjct: 63  GNHRQSWDATKAAIAVSDADTQKFIDFVSYLFGKQPEFANEAWKDKTQTEFHTFLAECAA 122

Query: 155 EG 156
           E 
Sbjct: 123 EA 124


>gi|169603159|ref|XP_001795001.1| hypothetical protein SNOG_04588 [Phaeosphaeria nodorum SN15]
 gi|111067228|gb|EAT88348.1| hypothetical protein SNOG_04588 [Phaeosphaeria nodorum SN15]
          Length = 210

 Score = 39.7 bits (91), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 55/124 (44%), Gaps = 7/124 (5%)

Query: 38  KYDGFFYANHPVDSDAI-IIEAFFDPVCPDSRDAWPP-----LKQAL-QHYGPHVSLVVH 90
           K+ G   A+  +   A+  +E + D VCP S+  +       L+Q L + YG  V  +  
Sbjct: 6   KFAGQKLASSSIQPKAVHTLELYLDYVCPFSQKMFKTVYNTNLRQTLLEKYGDRVVTIFR 65

Query: 91  LLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIV 150
               P+H ++     A + V + + +  +      F QQ++F++    N TR A  K + 
Sbjct: 66  QQIQPWHPSSTLVHEAGYAVQKVDPTRFYDFSAALFHQQKEFFDVNVVNETRNATYKRLA 125

Query: 151 KFAA 154
           K A 
Sbjct: 126 KIAG 129


>gi|290979057|ref|XP_002672251.1| predicted protein [Naegleria gruberi]
 gi|284085826|gb|EFC39507.1| predicted protein [Naegleria gruberi]
          Length = 244

 Score = 39.7 bits (91), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 27/114 (23%), Positives = 47/114 (41%), Gaps = 12/114 (10%)

Query: 57  EAFFDPVCPDSRDAWPPLKQALQHY---------GPHVSLVVHLLPLPYHDNAYATSRAL 107
           E F D  CPD    +  + + L  Y            V   +HL+PLP H   + +++  
Sbjct: 64  EMFLDLTCPDCATHFQSITKPLVQYYFNNGTSGNQDKVVFSLHLMPLPMHIAGFYSAQTY 123

Query: 108 HIV---NRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIG 158
            IV   ++ + +  +  L+ FF  Q    NA  +NM +  +   I     + +G
Sbjct: 124 SIVSKFSKADRNVCWKFLDLFFSNQSPASNANLKNMNQAQIFNLIYATYVQPLG 177


>gi|145223389|ref|YP_001134067.1| DSBA oxidoreductase [Mycobacterium gilvum PYR-GCK]
 gi|145215875|gb|ABP45279.1| DSBA oxidoreductase [Mycobacterium gilvum PYR-GCK]
          Length = 219

 Score = 39.7 bits (91), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 25/51 (49%), Gaps = 1/51 (1%)

Query: 50  DSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNA 100
           DSD   +E F D  C   R A+P ++Q    YG  V+ V    P+P H N 
Sbjct: 59  DSDVTFVE-FLDFECEGCRAAFPAVEQLRAQYGQQVTFVARYFPMPGHFNG 108


>gi|260221388|emb|CBA29900.1| hypothetical protein Csp_A14470 [Curvibacter putative symbiont of
           Hydra magnipapillata]
          Length = 218

 Score = 39.3 bits (90), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 3/61 (4%)

Query: 48  PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRAL 107
           PV++   I+E FFDP C   R+ +P +K+ L+ Y   V LVV     P+H N+    + L
Sbjct: 55  PVNAPVTIVE-FFDPACETCREFYPIVKELLKKYPNDVRLVVRY--APFHRNSDLVVKML 111

Query: 108 H 108
            
Sbjct: 112 E 112


>gi|452957998|gb|EME63354.1| DSBA oxidoreductase [Amycolatopsis decaplanina DSM 44594]
          Length = 222

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 24/85 (28%), Positives = 37/85 (43%), Gaps = 2/85 (2%)

Query: 46  NHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSR 105
           + P  S   ++E F D  C     A+P +++    YG  V+ V+   P+P H NA   +R
Sbjct: 57  SDPAGSKVTVVE-FLDLECEACGAAFPGVERLRAEYGDRVTFVMRYFPIPSHRNAELAAR 115

Query: 106 ALHIVNRTNSSATFCLLEWFFKQQE 130
           A+    +         L  F KQ E
Sbjct: 116 AVEAAGKQGKLEPMYRL-MFEKQPE 139


>gi|449017941|dbj|BAM81343.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 279

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 54/129 (41%), Gaps = 15/129 (11%)

Query: 43  FYANHPVDSDAIIIEAFFDPVCPDSRDAWPPL-KQALQHY--GPHVSLVVHLLPLPYHDN 99
           FY   P     +++E F D  CP S+ A+  L ++ L HY     V     L+P P+H  
Sbjct: 85  FYGERPYPQ--VLVEVFLDYNCPYSKKAFLTLWQEVLPHYRADSRVQFAFGLVPQPWHAQ 142

Query: 100 AYATSRALHIVNRTNSSATFCLLEW-----FFKQQEKFYNAPTQNMTRTAVVKEIVK-FA 153
                 A     + +  A F    W      F QQE + +    + TR  V +E+ +  A
Sbjct: 143 TCYMHEAALAARQVSGDAGF----WKASAALFAQQEDWVDEKVLDKTRRQVYEELAQCLA 198

Query: 154 AEGIGNSYS 162
             G+ +S S
Sbjct: 199 KAGVVSSAS 207


>gi|353238665|emb|CCA70604.1| hypothetical protein PIIN_04541 [Piriformospora indica DSM 11827]
          Length = 207

 Score = 38.9 bits (89), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 5/108 (4%)

Query: 56  IEAFFDPVCP----DSRDAWPPLKQALQ-HYGPHVSLVVHLLPLPYHDNAYATSRALHIV 110
           +E F+D VCP     +R     +K  LQ  Y   V ++V   P P+H ++     A   V
Sbjct: 22  LEFFWDYVCPYSAKSARAVDSIVKPLLQDKYPGQVKIIVRPHPQPWHSSSTLVHEAGLAV 81

Query: 111 NRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIG 158
            +    A +      F  QE++Y+ PT   T   +  ++V+ A    G
Sbjct: 82  AKVAPDAYWSFSLALFNAQEEYYDVPTSTQTPNEIRTKLVQLAKTSAG 129


>gi|169861885|ref|XP_001837576.1| hypothetical protein CC1G_08130 [Coprinopsis cinerea okayama7#130]
 gi|116501305|gb|EAU84200.1| hypothetical protein CC1G_08130 [Coprinopsis cinerea okayama7#130]
          Length = 205

 Score = 38.1 bits (87), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 28/109 (25%), Positives = 47/109 (43%), Gaps = 7/109 (6%)

Query: 56  IEAFFDPVCPDS-------RDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
           ++ F D VCP S        +   PL +    Y   V ++  L   P+H  +  T  A  
Sbjct: 22  LDIFLDYVCPFSAKMALAIENILKPLVEQGGQYEGKVKVIFRLQVQPWHSASTLTHEAAL 81

Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGI 157
            V R +    +      FK Q  +++ PT N+T   + +++V  AA+ I
Sbjct: 82  AVLRASPENFWTYSLALFKNQSDYFDIPTANLTPLQIREKLVALAAQVI 130


>gi|58266834|ref|XP_570573.1| hypothetical protein CND05280 [Cryptococcus neoformans var.
           neoformans JEC21]
 gi|57226806|gb|AAW43266.1| hypothetical protein CND05280 [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 200

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 39/77 (50%)

Query: 81  YGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNM 140
           Y   V+LVV L P P+H  +     AL++  +TN    +  L      +  FYN P  ++
Sbjct: 8   YDGKVNLVVRLYPQPFHYYSAPIIEALYVFGQTNPRLFWQYLLAVHSTETTFYNRPAASL 67

Query: 141 TRTAVVKEIVKFAAEGI 157
           T +++  ++V+ A E +
Sbjct: 68  TLSSLRDKLVEIAVEQV 84


>gi|358450068|ref|ZP_09160540.1| outer membrane protein [Marinobacter manganoxydans MnI7-9]
 gi|357225729|gb|EHJ04222.1| outer membrane protein [Marinobacter manganoxydans MnI7-9]
          Length = 225

 Score = 37.7 bits (86), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 30/61 (49%), Gaps = 3/61 (4%)

Query: 48  PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRAL 107
           P D+  II+E FFDP C   R  +P +KQ L  Y   V LV+      +H  +   SR L
Sbjct: 64  PADAPVIIVE-FFDPSCEACRAFYPIVKQILAEYPGQVRLVLRY--TMFHQGSEEVSRIL 120

Query: 108 H 108
            
Sbjct: 121 E 121


>gi|340030885|ref|ZP_08666948.1| DSBA oxidoreductase [Paracoccus sp. TRP]
          Length = 219

 Score = 37.4 bits (85), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 27/110 (24%), Positives = 48/110 (43%), Gaps = 16/110 (14%)

Query: 12  TLILQSALLCFFVFNS---------CSKSQSTPPAKYDGFFYANHPV----DSDAIIIEA 58
           +++L + L+   VF            + +Q  P  + D    A+ P+    D+   I+E 
Sbjct: 5   SVVLFTGLVALIVFAGGAFLYDRYGSTSAQPVPAPQGDAMTRAHSPIIGPADAPVTIVE- 63

Query: 59  FFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
           FFDP C   R  +P +KQ + ++     LV+     P+H+ +    R L 
Sbjct: 64  FFDPSCEACRAFYPAVKQIMANFPGETRLVIRY--APFHEGSDEAVRILE 111


>gi|449549563|gb|EMD40528.1| hypothetical protein CERSUDRAFT_44584 [Ceriporiopsis subvermispora
           B]
          Length = 205

 Score = 37.4 bits (85), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 30/126 (23%), Positives = 51/126 (40%), Gaps = 7/126 (5%)

Query: 56  IEAFFDPVCPDSRD-------AWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
           ++ F D VCP S+           PL      Y   V ++      P+H ++  T  A  
Sbjct: 22  LDVFLDFVCPYSKKLVFAIDAVLRPLLGPGGKYDGKVHIIFRPQVQPWHSSSTFTHEAAL 81

Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESG 168
              R      +      FK+QE+F++ PT  +T   + +++   AA  IG     A +  
Sbjct: 82  AALRVAPDKFWAFTLALFKRQEEFFDIPTSTLTPLQIREKLAHIAAGVIGEQNVPAFKDL 141

Query: 169 FSDRST 174
            + +ST
Sbjct: 142 LALKST 147


>gi|326331477|ref|ZP_08197767.1| putative thioredoxin domain protein (DSBA) [Nocardioidaceae
           bacterium Broad-1]
 gi|325950733|gb|EGD42783.1| putative thioredoxin domain protein (DSBA) [Nocardioidaceae
           bacterium Broad-1]
          Length = 223

 Score = 37.0 bits (84), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 19/64 (29%), Positives = 31/64 (48%), Gaps = 1/64 (1%)

Query: 51  SDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIV 110
           SD +++E F D  C     A+P ++   + YG  V+ V    PLP H N+   +R++   
Sbjct: 64  SDVVLVE-FLDFECEACGAAYPIVEDLREKYGDQVTFVARYFPLPGHFNSERAARSVESA 122

Query: 111 NRTN 114
            R  
Sbjct: 123 ARQG 126


>gi|116751066|ref|YP_847753.1| DSBA oxidoreductase [Syntrophobacter fumaroxidans MPOB]
 gi|116700130|gb|ABK19318.1| DSBA oxidoreductase [Syntrophobacter fumaroxidans MPOB]
          Length = 339

 Score = 37.0 bits (84), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 20/59 (33%), Positives = 29/59 (49%), Gaps = 1/59 (1%)

Query: 48  PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRA 106
           P D+   ++E F D  CP  R     +K+   H+G  V LV    PL  H NA+  ++A
Sbjct: 177 PADAPVTLVE-FSDYQCPACRATQEGVKKVKSHFGDRVRLVFKDYPLKRHKNAHLAAQA 234


>gi|303321632|ref|XP_003070810.1| hypothetical protein CPC735_039290 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240110507|gb|EER28665.1| hypothetical protein CPC735_039290 [Coccidioides posadasii C735
           delta SOWgp]
 gi|320040291|gb|EFW22224.1| conserved hypothetical protein [Coccidioides posadasii str.
           Silveira]
          Length = 209

 Score = 36.6 bits (83), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 28/128 (21%), Positives = 54/128 (42%), Gaps = 5/128 (3%)

Query: 36  PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQAL-----QHYGPHVSLVVH 90
           P KY G    +  +      +E + D VCP S   +     ++     + Y  ++ ++  
Sbjct: 4   PPKYAGLKLFSSGITEQRHTLEIYLDYVCPYSAKFFDTFYNSVIPIIRKKYRSYLQVIFR 63

Query: 91  LLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIV 150
               P+H ++  T  A  +V +   S  +   E  FK Q+++++A   N TR    K + 
Sbjct: 64  PQVQPWHPSSTLTQEAALVVLKLEPSKFWDFSEALFKAQKEYFDANVVNETRNHTYKRLA 123

Query: 151 KFAAEGIG 158
             A++  G
Sbjct: 124 ALASKVTG 131


>gi|384566815|ref|ZP_10013919.1| protein-disulfide isomerase [Saccharomonospora glauca K62]
 gi|384522669|gb|EIE99864.1| protein-disulfide isomerase [Saccharomonospora glauca K62]
          Length = 223

 Score = 36.6 bits (83), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 15/86 (17%)

Query: 28  CSKSQSTPPAKYDGFFYANHPV-------------DSDAIIIEAFFDPVCPDSRDAWPPL 74
            ++  + PPAK DG  +AN  +             D  A ++E F D  C     A+P +
Sbjct: 28  VTRPDTAPPAKADGK-HANAELLVRPDSHRLSSASDDKATLVE-FLDFECEACGAAYPAV 85

Query: 75  KQALQHYGPHVSLVVHLLPLPYHDNA 100
           +Q  + Y   ++ VV   P+P H NA
Sbjct: 86  EQLREEYAGRLTYVVRYFPIPSHPNA 111


>gi|406900080|gb|EKD43163.1| DsbA oxidoreductase [uncultured bacterium]
          Length = 251

 Score = 36.6 bits (83), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 36/77 (46%)

Query: 54  IIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRT 113
           + I  F D  CP ++   P L++ +Q YG  V L++   P   + +  A   A   +   
Sbjct: 95  VTIVEFVDFKCPYTKAEAPILREVMQKYGNKVKLIIRNFPPSDNFHPGANQFANIAMCAY 154

Query: 114 NSSATFCLLEWFFKQQE 130
                + L +WFF+QQE
Sbjct: 155 QQGYYWPLHDWFFEQQE 171


>gi|389744505|gb|EIM85688.1| hypothetical protein STEHIDRAFT_81225 [Stereum hirsutum FP-91666
           SS1]
          Length = 205

 Score = 36.6 bits (83), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 29/126 (23%), Positives = 51/126 (40%), Gaps = 7/126 (5%)

Query: 56  IEAFFDPVCPDSR-------DAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
           ++ F D VCP S        +   PL      Y  HV +++     P+H  +  T  A  
Sbjct: 22  LDIFLDYVCPFSAKLARSIDNVLKPLLGPGGKYDGHVKVIIRPQVQPWHATSTYTHEAGL 81

Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESG 168
            V R      +      F++QE+F++ P   +T   + +++   AAE I      A +  
Sbjct: 82  AVARVAPEFYWPFSVRLFERQEEFFDIPASTLTPLQIREKLAALAAEVIPADKVDAFKEA 141

Query: 169 FSDRST 174
              +S+
Sbjct: 142 LKLKSS 147


>gi|336374051|gb|EGO02389.1| hypothetical protein SERLA73DRAFT_86683 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336386970|gb|EGO28116.1| hypothetical protein SERLADRAFT_355115 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 207

 Score = 36.2 bits (82), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 27/109 (24%), Positives = 45/109 (41%), Gaps = 7/109 (6%)

Query: 56  IEAFFDPVCPDS-------RDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
           ++ F D VCP S        +   PL  A   Y   V  ++ L   P+H ++  T  A  
Sbjct: 22  LDIFLDYVCPYSAKLSLAINNVLKPLLDAGGKYDGKVKAIIRLQVQPWHASSTFTHEAGL 81

Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGI 157
            V R +    +      FK Q  F++ P+  +T   +  ++   AA+ I
Sbjct: 82  AVARVSPENFWPFSVLLFKSQTSFFDVPSSTLTPLQIRAQLASLAAQVI 130


>gi|145221502|ref|YP_001132180.1| DSBA oxidoreductase [Mycobacterium gilvum PYR-GCK]
 gi|145213988|gb|ABP43392.1| DSBA oxidoreductase [Mycobacterium gilvum PYR-GCK]
          Length = 234

 Score = 36.2 bits (82), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 32/76 (42%), Gaps = 10/76 (13%)

Query: 33  STPPAKYDGF---------FYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGP 83
           S PP + DG             N  +DS    +E F D  C   R  +P ++Q    YG 
Sbjct: 47  SEPPIELDGSAGQAVRENSHRLNAVLDSPVYFVE-FLDFECEGCRAVYPEIEQLRAEYGD 105

Query: 84  HVSLVVHLLPLPYHDN 99
            V+ V+   PLP H N
Sbjct: 106 RVNFVLRYFPLPGHFN 121


>gi|162457290|ref|YP_001619657.1| hypothetical protein sce9005 [Sorangium cellulosum So ce56]
 gi|161167872|emb|CAN99177.1| putative membrane protein [Sorangium cellulosum So ce56]
          Length = 403

 Score = 36.2 bits (82), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 33/77 (42%), Gaps = 12/77 (15%)

Query: 42  FFYANHPV------------DSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVV 89
           F + +HPV            +   I I AF D  CP  R   P L +  + YG  V  V 
Sbjct: 179 FLWGSHPVIPPLPPELAALQEPGKITIVAFTDFECPFCRRLHPELSKIEEPYGDRVRHVR 238

Query: 90  HLLPLPYHDNAYATSRA 106
            ++PLP H  A   ++A
Sbjct: 239 KMVPLPNHPGALPAAKA 255


>gi|325000049|ref|ZP_08121161.1| DSBA oxidoreductase [Pseudonocardia sp. P1]
          Length = 216

 Score = 36.2 bits (82), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 17/46 (36%), Positives = 21/46 (45%)

Query: 54  IIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDN 99
           + +  F D  CP      P + Q +Q YG  V  VV   PLP H N
Sbjct: 60  VTLVEFLDFQCPGCGQLQPIMSQLVQQYGDRVEFVVRDFPLPIHPN 105


>gi|422293098|gb|EKU20398.1| hypothetical protein NGA_0546800, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 199

 Score = 36.2 bits (82), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 26/105 (24%), Positives = 44/105 (41%), Gaps = 3/105 (2%)

Query: 54  IIIEAFFDPVCPDSRDAWPPLKQA--LQHYGPH-VSLVVHLLPLPYHDNAYATSRALHIV 110
           +++E F D  CP SR  +  L +   ++ Y    V ++ H +  P+H        A   V
Sbjct: 24  VVVEVFLDLNCPPSRRCFKTLYEGGIIEAYKDKAVQILFHNVIQPWHAVGAWMHEAALAV 83

Query: 111 NRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAE 155
                SA +      F   E+F +  T+ MTR  V+  ++    E
Sbjct: 84  KDLQPSAFYPYANKVFDAMERFSDVNTEEMTRKEVMAALIALGVE 128


>gi|350296986|gb|EGZ77963.1| hypothetical protein NEUTE2DRAFT_101434 [Neurospora tetrasperma
           FGSC 2509]
          Length = 219

 Score = 35.8 bits (81), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 27/111 (24%), Positives = 43/111 (38%), Gaps = 8/111 (7%)

Query: 56  IEAFFDPVCPDSRDAW--------PPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRAL 107
           +E F D VCP S   +        P L+      G  V  +      P+H ++  T  A 
Sbjct: 31  VEIFLDYVCPFSAKIYNTLYTTLLPSLRSEHADLGSKVQFIFRHQIQPWHPSSTLTHEAG 90

Query: 108 HIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIG 158
             V R   +  +      FK Q+ +++    N TR    K + K A++  G
Sbjct: 91  LAVQRLAPTKFWNFSAALFKDQKAYFDVSLVNETRNETYKRLAKLASQSAG 141


>gi|85119630|ref|XP_965678.1| hypothetical protein NCU02547 [Neurospora crassa OR74A]
 gi|28927490|gb|EAA36442.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 219

 Score = 35.8 bits (81), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 27/111 (24%), Positives = 43/111 (38%), Gaps = 8/111 (7%)

Query: 56  IEAFFDPVCPDSRDAW--------PPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRAL 107
           +E F D VCP S   +        P L+      G  V  +      P+H ++  T  A 
Sbjct: 31  VEIFLDYVCPFSAKIYNTLYTTLLPSLRSEHADLGSKVQFIFRHQIQPWHPSSTLTHEAG 90

Query: 108 HIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIG 158
             V R   +  +      FK Q+ +++    N TR    K + K A++  G
Sbjct: 91  LAVQRLAPTKFWDFSAALFKDQKAYFDVSLVNETRNETYKRLAKLASQSAG 141


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.133    0.409 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,971,901,255
Number of Sequences: 23463169
Number of extensions: 113617061
Number of successful extensions: 369952
Number of sequences better than 100.0: 165
Number of HSP's better than 100.0 without gapping: 92
Number of HSP's successfully gapped in prelim test: 73
Number of HSP's that attempted gapping in prelim test: 369758
Number of HSP's gapped (non-prelim): 177
length of query: 196
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 61
effective length of database: 9,191,667,552
effective search space: 560691720672
effective search space used: 560691720672
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 73 (32.7 bits)