BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 029265
(196 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224085938|ref|XP_002307747.1| predicted protein [Populus trichocarpa]
gi|222857196|gb|EEE94743.1| predicted protein [Populus trichocarpa]
Length = 212
Score = 233 bits (595), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 111/167 (66%), Positives = 132/167 (79%)
Query: 30 KSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVV 89
+SQ+ PPA+YDGF Y N D ++I++EAFFDPVCPDSRD+WPPLK+AL+HYG V LVV
Sbjct: 5 ESQNLPPARYDGFVYENRQGDLNSILVEAFFDPVCPDSRDSWPPLKEALKHYGSRVWLVV 64
Query: 90 HLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEI 149
HLLPLPYHDNA+ + RALHI N NSS TF LLE FFK QEKFYNA T N+++T++V+EI
Sbjct: 65 HLLPLPYHDNAFVSCRALHIANTLNSSFTFPLLEEFFKHQEKFYNAKTSNLSKTSIVEEI 124
Query: 150 VKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
VKFA +GNSYSSA ESGF+DR TDL TRVSFK + F T FF
Sbjct: 125 VKFATVAVGNSYSSAFESGFNDRQTDLKTRVSFKYSTSRGVFGTPFF 171
>gi|409030182|gb|AFV07578.1| thioredoxin f5, partial [Solanum tuberosum]
Length = 206
Score = 223 bits (568), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 113/185 (61%), Positives = 132/185 (71%), Gaps = 5/185 (2%)
Query: 17 SALLCFF--VFNSCSK---SQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAW 71
S +L FF V N + +QS PPA+YDGF Y DS+ I+IEAF+DPVCPDSRD+W
Sbjct: 10 STVLLFFLSVINVSVRDVDAQSLPPARYDGFPYQKQLDDSNTILIEAFYDPVCPDSRDSW 69
Query: 72 PPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEK 131
PPLKQAL YG SLVVHLLPLPYHDNA+ SR LHI N N+S+TF LLE F+K QE+
Sbjct: 70 PPLKQALHFYGSRTSLVVHLLPLPYHDNAFVASRTLHIANELNTSSTFPLLERFYKHQER 129
Query: 132 FYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSF 191
FYNA T N +R VVKEIVKFA+ +GNSY SA+ESG +DR TDL TRVSFK + F
Sbjct: 130 FYNAQTHNKSRATVVKEIVKFASAAVGNSYCSAIESGLNDRQTDLKTRVSFKYSASRGVF 189
Query: 192 NTSFF 196
T F
Sbjct: 190 ATPTF 194
>gi|449450508|ref|XP_004143004.1| PREDICTED: uncharacterized protein LOC101216804 isoform 1 [Cucumis
sativus]
gi|449450510|ref|XP_004143005.1| PREDICTED: uncharacterized protein LOC101216804 isoform 2 [Cucumis
sativus]
gi|449521601|ref|XP_004167818.1| PREDICTED: uncharacterized protein LOC101231840 isoform 1 [Cucumis
sativus]
gi|449521603|ref|XP_004167819.1| PREDICTED: uncharacterized protein LOC101231840 isoform 2 [Cucumis
sativus]
Length = 236
Score = 223 bits (567), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 107/196 (54%), Positives = 137/196 (69%)
Query: 1 MQSPSPNKNHATLILQSALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFF 60
M+ + +H +L + +F+ + +QS PP K+DGF Y NH +D + I IEAFF
Sbjct: 1 MEKTNSTSSHLQSLLAFSFSFLALFHLTAHAQSLPPPKFDGFVYGNHSLDFNTIHIEAFF 60
Query: 61 DPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFC 120
DPVCPDSRD+WPPLK+AL HYG V LV+HLLPLPYHDNAYA SRALHIV+ N S TF
Sbjct: 61 DPVCPDSRDSWPPLKKALDHYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSDTFK 120
Query: 121 LLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRV 180
LLE FF Q++FYNA T+ ++R A+V +VKF E +G+SY + L +GF+DR TDLLTRV
Sbjct: 121 LLEAFFGDQKQFYNAETRYLSRAAIVDSMVKFGVEVLGDSYKNTLVTGFNDRETDLLTRV 180
Query: 181 SFKVTQKFFSFNTSFF 196
SFK + + T FF
Sbjct: 181 SFKFSTSRGVYGTPFF 196
>gi|449450512|ref|XP_004143006.1| PREDICTED: uncharacterized protein LOC101216804 isoform 3 [Cucumis
sativus]
Length = 201
Score = 221 bits (564), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 105/184 (57%), Positives = 133/184 (72%)
Query: 1 MQSPSPNKNHATLILQSALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFF 60
M+ + +H +L + +F+ + +QS PP K+DGF Y NH +D + I IEAFF
Sbjct: 1 MEKTNSTSSHLQSLLAFSFSFLALFHLTAHAQSLPPPKFDGFVYGNHSLDFNTIHIEAFF 60
Query: 61 DPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFC 120
DPVCPDSRD+WPPLK+AL HYG V LV+HLLPLPYHDNAYA SRALHIV+ N S TF
Sbjct: 61 DPVCPDSRDSWPPLKKALDHYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSDTFK 120
Query: 121 LLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRV 180
LLE FF Q++FYNA T+ ++R A+V +VKF E +G+SY + L +GF+DR TDLLTRV
Sbjct: 121 LLEAFFGDQKQFYNAETRYLSRAAIVDSMVKFGVEVLGDSYKNTLVTGFNDRETDLLTRV 180
Query: 181 SFKV 184
SFKV
Sbjct: 181 SFKV 184
>gi|224061957|ref|XP_002300683.1| predicted protein [Populus trichocarpa]
gi|222842409|gb|EEE79956.1| predicted protein [Populus trichocarpa]
Length = 223
Score = 215 bits (548), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 108/192 (56%), Positives = 135/192 (70%), Gaps = 2/192 (1%)
Query: 7 NKNHATLILQSALLCFFVFNSCSKS--QSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVC 64
+ H ++Q + + F+ +S Q+ PP +YDGF Y N SD+I+IEAFFDPVC
Sbjct: 2 KRQHHFAVIQFSYIFLFITSSQWNVLPQNLPPLRYDGFVYENRKGGSDSILIEAFFDPVC 61
Query: 65 PDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEW 124
PDSRD WPPL++AL+HYG VSLVVHLLPLPYHDNA+ SRALHI N N S TF LLE
Sbjct: 62 PDSRDTWPPLQKALKHYGSRVSLVVHLLPLPYHDNAFVASRALHIANILNCSFTFPLLEQ 121
Query: 125 FFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKV 184
FFK QEKFY + T N+++ ++VKEIVKFA +G+SYSS L+ GF+D TDL TRVSFK
Sbjct: 122 FFKHQEKFYGSETSNLSKDSIVKEIVKFATVIVGDSYSSPLQFGFNDIQTDLKTRVSFKY 181
Query: 185 TQKFFSFNTSFF 196
+ + T FF
Sbjct: 182 SASRGVYATPFF 193
>gi|409030172|gb|AFV07573.1| DsbA c3, partial [Solanum tuberosum]
Length = 194
Score = 215 bits (548), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 110/185 (59%), Positives = 130/185 (70%), Gaps = 5/185 (2%)
Query: 17 SALLCFF--VFNSCSK---SQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAW 71
S +L FF V N + +QS PPA+YDGF Y DS+ I+IEAF+DPVCPDSRD+W
Sbjct: 10 STVLLFFLSVINVSVRDVDAQSLPPARYDGFPYQKQLDDSNTILIEAFYDPVCPDSRDSW 69
Query: 72 PPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEK 131
PPLKQAL YG SLVVHLLPLPYHDNA+ SR LHI N N+S+TF LLE F+K QE+
Sbjct: 70 PPLKQALHFYGSRTSLVVHLLPLPYHDNAFVASRTLHIANELNTSSTFPLLERFYKHQER 129
Query: 132 FYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSF 191
FYNA T N +R VVKEIV+FA+ +GNS SA+ESG DR TDL TRVSF+ + F
Sbjct: 130 FYNAQTHNKSRATVVKEIVRFASAAVGNSCCSAIESGSYDRQTDLKTRVSFQYSASRGVF 189
Query: 192 NTSFF 196
T F
Sbjct: 190 GTPTF 194
>gi|302141727|emb|CBI18930.3| unnamed protein product [Vitis vinifera]
Length = 514
Score = 209 bits (532), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 106/176 (60%), Positives = 126/176 (71%), Gaps = 1/176 (0%)
Query: 22 FFVFNSCSKSQSTP-PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQH 80
F+VF S ++ PAK+DGF Y D+IIIEAFFDPVCPDSRDAWPPLK+A+ +
Sbjct: 19 FYVFEHESIVRAVSIPAKFDGFVYNYTGTGIDSIIIEAFFDPVCPDSRDAWPPLKRAIAY 78
Query: 81 YGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNM 140
Y P VSL+VH LPYHDNA+ATSRALHIVN+ NSSAT+ LLE FK QE FYN T NM
Sbjct: 79 YAPRVSLIVHPFALPYHDNAFATSRALHIVNKLNSSATYHLLEMLFKHQEIFYNQITVNM 138
Query: 141 TRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
+RTA+V IVKF ++ +G S SA++SGFSDR TDL TRVSFK T +F
Sbjct: 139 SRTAIVDCIVKFVSKAVGESLFSAIKSGFSDRQTDLTTRVSFKYGCSRGVLGTPYF 194
>gi|359492074|ref|XP_003634361.1| PREDICTED: uncharacterized protein LOC100854733 [Vitis vinifera]
Length = 234
Score = 208 bits (530), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 104/163 (63%), Positives = 123/163 (75%), Gaps = 1/163 (0%)
Query: 22 FFVFNSCSKSQSTP-PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQH 80
F+VF S ++ PAK+DGF Y D+IIIEAFFDPVCPDSRDAWPPLK+A+ +
Sbjct: 19 FYVFEHESIVRAVSIPAKFDGFVYNYTGTGIDSIIIEAFFDPVCPDSRDAWPPLKRAIAY 78
Query: 81 YGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNM 140
Y P VSL+VH LPYHDNA+ATSRALHIVN+ NSSAT+ LLE FK QE FYN T NM
Sbjct: 79 YAPRVSLIVHPFALPYHDNAFATSRALHIVNKLNSSATYHLLEMLFKHQEIFYNQITVNM 138
Query: 141 TRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
+RTA+V IVKF ++ +G S SA++SGFSDR TDL TRVSFK
Sbjct: 139 SRTAIVDCIVKFVSKAVGESLFSAIKSGFSDRQTDLTTRVSFK 181
>gi|255539052|ref|XP_002510591.1| conserved hypothetical protein [Ricinus communis]
gi|223551292|gb|EEF52778.1| conserved hypothetical protein [Ricinus communis]
Length = 362
Score = 207 bits (526), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 113/184 (61%), Positives = 133/184 (72%), Gaps = 6/184 (3%)
Query: 19 LLCFFVFNSCS---KSQSTPPAKYDGFFYANHPV---DSDAIIIEAFFDPVCPDSRDAWP 72
LL FV N +Q PPA+YDGF Y H +D I+IEAFFDPVCPDSRD+WP
Sbjct: 8 LLFAFVINVSKWGLDAQKLPPARYDGFLYEKHGPCRHHADTILIEAFFDPVCPDSRDSWP 67
Query: 73 PLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKF 132
PLKQAL +YG SL++HLLPLPYHDNAY SRALHI N N S+TF LLE FFK QE+F
Sbjct: 68 PLKQALHYYGSRTSLLLHLLPLPYHDNAYVASRALHIANNLNHSSTFPLLERFFKHQERF 127
Query: 133 YNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFN 192
YNA T+N++R +VVK+IV+FA+ IGNSY SA+ESGF+DR TDL TRVSFK + F
Sbjct: 128 YNAQTRNLSRVSVVKDIVEFASVAIGNSYHSAIESGFNDRETDLQTRVSFKYSTSRGVFG 187
Query: 193 TSFF 196
T F
Sbjct: 188 TPTF 191
>gi|388493536|gb|AFK34834.1| unknown [Lotus japonicus]
Length = 239
Score = 201 bits (510), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 99/171 (57%), Positives = 124/171 (72%), Gaps = 1/171 (0%)
Query: 27 SCSKSQSTPPAKYDGFFYANH-PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHV 85
S + + PPAK DGF Y N + D+I+IEAF+DP+CPDSRD+WPPLKQAL YG V
Sbjct: 26 SPAGADYIPPAKTDGFVYLNRRSFNFDSILIEAFYDPLCPDSRDSWPPLKQALHDYGSGV 85
Query: 86 SLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAV 145
SLVVHLLPLPYHDNAY SRALH+VN NSSATF LLE FK QEKFY A T+N++R ++
Sbjct: 86 SLVVHLLPLPYHDNAYVASRALHVVNALNSSATFPLLELLFKDQEKFYGAQTRNLSRASI 145
Query: 146 VKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
+E VK A E IG+S+ +++++GF+D TD TR+SFK + T FF
Sbjct: 146 QEEFVKSATEVIGSSFYTSVKNGFNDSKTDHQTRISFKYAASRGVYGTPFF 196
>gi|388504516|gb|AFK40324.1| unknown [Medicago truncatula]
Length = 230
Score = 200 bits (509), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 97/180 (53%), Positives = 122/180 (67%), Gaps = 1/180 (0%)
Query: 18 ALLCFFVFNSCSKSQSTPPAKYDGFFYAN-HPVDSDAIIIEAFFDPVCPDSRDAWPPLKQ 76
++L S PP KYDGF Y N H + D I IEAF+DP+CPDS D+WPPLK+
Sbjct: 8 SVLILLALQSLVGGDYIPPKKYDGFVYKNRHHLSYDTIQIEAFYDPLCPDSADSWPPLKK 67
Query: 77 ALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAP 136
AL HY VS VVHLLPLPYHDNA+ SRALHIVN N +ATF LLE FFK Q F+ AP
Sbjct: 68 ALHHYSSRVSFVVHLLPLPYHDNAFVASRALHIVNSLNRTATFPLLEAFFKYQAAFFGAP 127
Query: 137 TQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
T+N++R +V EIVKF + +GNSY +++++GF+D ++D LTR+SFK T FF
Sbjct: 128 TRNLSRAFIVDEIVKFTSAIVGNSYHTSIKNGFNDTNSDHLTRISFKYAASRGVSGTPFF 187
>gi|297850426|ref|XP_002893094.1| hypothetical protein ARALYDRAFT_312946 [Arabidopsis lyrata subsp.
lyrata]
gi|297338936|gb|EFH69353.1| hypothetical protein ARALYDRAFT_312946 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 194 bits (494), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 95/173 (54%), Positives = 122/173 (70%), Gaps = 2/173 (1%)
Query: 13 LILQSAL-LCFFVFNSCSKSQSTPPAKYDGFFYA-NHPVDSDAIIIEAFFDPVCPDSRDA 70
++++SAL L F + ++Q PPA+ DGF Y +D D I+IEA+FDPVCPD RDA
Sbjct: 1 MMIRSALVLVVFFVGTVVQAQLIPPARRDGFVYPPGRKIDPDTILIEAYFDPVCPDCRDA 60
Query: 71 WPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQE 130
W PLK A+ HYG V+LV+HL+PLP+HDNA+ SRALHIV+ N++ATF LLE FK Q
Sbjct: 61 WEPLKLAIDHYGSRVALVLHLIPLPFHDNAFLVSRALHIVDTLNANATFNLLEGIFKHQA 120
Query: 131 KFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
FYN+ TQ M+R AVV ++K +GNSY S L SGFS+ +DL TRVSFK
Sbjct: 121 LFYNSQTQLMSRPAVVDNLIKLGTVTLGNSYHSPLISGFSNSKSDLATRVSFK 173
>gi|8778813|gb|AAF79818.1|AC007396_19 T4O12.23 [Arabidopsis thaliana]
Length = 263
Score = 194 bits (493), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 102/172 (59%), Positives = 125/172 (72%), Gaps = 1/172 (0%)
Query: 13 LILQSALLCFFVFNSCSKSQSTPPAKYDGFFYA-NHPVDSDAIIIEAFFDPVCPDSRDAW 71
+I L FV + ++Q PP + DGF Y H D D I+IEA+FDPVCPDSRD+W
Sbjct: 1 MIRAVLLFLVFVVETRVQAQLVPPVRQDGFVYPPGHRFDPDTILIEAYFDPVCPDSRDSW 60
Query: 72 PPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEK 131
PPLKQAL HYG V+L++HLLPLPYHDNAY TSRALHIVN +++ATF LLE FFK Q
Sbjct: 61 PPLKQALHHYGSRVALLLHLLPLPYHDNAYVTSRALHIVNTVHANATFSLLEGFFKHQSL 120
Query: 132 FYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
FYNA TQ ++R AVV++IV+ +GNSY S L+SGFSD+ +D TRVSFK
Sbjct: 121 FYNAQTQLLSRPAVVEKIVELGTVSLGNSYQSVLKSGFSDKKSDRATRVSFK 172
>gi|15222928|ref|NP_177728.1| Thioredoxin superfamily protein [Arabidopsis thaliana]
gi|117168227|gb|ABK32196.1| At1g76020 [Arabidopsis thaliana]
gi|332197664|gb|AEE35785.1| Thioredoxin superfamily protein [Arabidopsis thaliana]
Length = 225
Score = 193 bits (490), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 104/185 (56%), Positives = 129/185 (69%), Gaps = 1/185 (0%)
Query: 13 LILQSALLCFFVFNSCSKSQSTPPAKYDGFFYA-NHPVDSDAIIIEAFFDPVCPDSRDAW 71
+I L FV + ++Q PP + DGF Y H D D I+IEA+FDPVCPDSRD+W
Sbjct: 1 MIRAVLLFLVFVVETRVQAQLVPPVRQDGFVYPPGHRFDPDTILIEAYFDPVCPDSRDSW 60
Query: 72 PPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEK 131
PPLKQAL HYG V+L++HLLPLPYHDNAY TSRALHIVN +++ATF LLE FFK Q
Sbjct: 61 PPLKQALHHYGSRVALLLHLLPLPYHDNAYVTSRALHIVNTVHANATFSLLEGFFKHQSL 120
Query: 132 FYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSF 191
FYNA TQ ++R AVV++IV+ +GNSY S L+SGFSD+ +D TRVSFK + +
Sbjct: 121 FYNAQTQLLSRPAVVEKIVELGTVSLGNSYQSVLKSGFSDKKSDRATRVSFKYSASRGVY 180
Query: 192 NTSFF 196
T F
Sbjct: 181 GTPTF 185
>gi|22329686|ref|NP_683315.1| TRX domain-containing protein [Arabidopsis thaliana]
gi|17065544|gb|AAL32926.1| Unknown protein [Arabidopsis thaliana]
gi|24899723|gb|AAN65076.1| Unknown protein [Arabidopsis thaliana]
gi|332191831|gb|AEE29952.1| TRX domain-containing protein [Arabidopsis thaliana]
Length = 233
Score = 192 bits (487), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 90/155 (58%), Positives = 114/155 (73%), Gaps = 1/155 (0%)
Query: 30 KSQSTPPAKYDGFFYA-NHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLV 88
++Q PPA+ DGF Y +D D I+IEA+ DPVCPD RDAW PLK A+ HYG V+LV
Sbjct: 19 QAQLIPPARRDGFLYPPGRKIDRDTILIEAYIDPVCPDCRDAWEPLKLAIDHYGSRVALV 78
Query: 89 VHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKE 148
+HL+PLP+HDNA+ SRALHIV+ N++ATF LLE FK Q FYN+ TQ M+R AVV+E
Sbjct: 79 LHLIPLPFHDNAFVASRALHIVDTLNANATFNLLEGIFKHQTLFYNSQTQLMSRPAVVEE 138
Query: 149 IVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
++K +GNSY S L+SGFS+ +DL TRVSFK
Sbjct: 139 LIKLGTVTLGNSYHSPLKSGFSNSKSDLATRVSFK 173
>gi|8778978|gb|AAF79893.1|AC022472_2 Contains similarity to pigpen protein from Mus musculus gb|AF224264
and contains protein of unknown function DUF78 PF|01918
domain. ESTs gb|N38077, gb|BE037702, gb|AV442191,
gb|AV441368, gb|Z17998, gb|AV527266, gb|AV520794,
gb|AI997847, gb|AV543000 come from this gene
[Arabidopsis thaliana]
Length = 538
Score = 191 bits (486), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 90/155 (58%), Positives = 114/155 (73%), Gaps = 1/155 (0%)
Query: 30 KSQSTPPAKYDGFFYA-NHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLV 88
++Q PPA+ DGF Y +D D I+IEA+ DPVCPD RDAW PLK A+ HYG V+LV
Sbjct: 18 QAQLIPPARRDGFLYPPGRKIDRDTILIEAYIDPVCPDCRDAWEPLKLAIDHYGSRVALV 77
Query: 89 VHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKE 148
+HL+PLP+HDNA+ SRALHIV+ N++ATF LLE FK Q FYN+ TQ M+R AVV+E
Sbjct: 78 LHLIPLPFHDNAFVASRALHIVDTLNANATFNLLEGIFKHQTLFYNSQTQLMSRPAVVEE 137
Query: 149 IVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
++K +GNSY S L+SGFS+ +DL TRVSFK
Sbjct: 138 LIKLGTVTLGNSYHSPLKSGFSNSKSDLATRVSFK 172
>gi|351726088|ref|NP_001235579.1| uncharacterized protein LOC100500267 precursor [Glycine max]
gi|255629877|gb|ACU15289.1| unknown [Glycine max]
Length = 236
Score = 191 bits (484), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 100/187 (53%), Positives = 135/187 (72%), Gaps = 3/187 (1%)
Query: 12 TLILQ-SALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDS-DAIIIEAFFDPVCPDSRD 69
T ILQ +AL F+ + PP+++DGF Y ++ D ++IEAF+DPVCPDSRD
Sbjct: 6 TTILQFAALQVLFILLGVG-ADYIPPSRFDGFVYGKGNLNLLDTVLIEAFYDPVCPDSRD 64
Query: 70 AWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQ 129
+WPPLKQAL HY VSL++HLLPLPYHDNA+ TSRALHIVN N+SATF LLEWFF+ Q
Sbjct: 65 SWPPLKQALHHYASRVSLLLHLLPLPYHDNAFVTSRALHIVNNLNASATFPLLEWFFRYQ 124
Query: 130 EKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFF 189
E FY A T+N++R ++++E+VK A + +G+SY +++GF+D +TD+ TRVSFK
Sbjct: 125 ENFYGAQTRNLSRASIIEEVVKSATQVVGSSYYKTIKNGFNDTTTDIQTRVSFKYAASRG 184
Query: 190 SFNTSFF 196
+ T FF
Sbjct: 185 VYGTPFF 191
>gi|81076617|gb|ABB55396.1| unknown [Solanum tuberosum]
Length = 224
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 91/146 (62%), Positives = 112/146 (76%), Gaps = 2/146 (1%)
Query: 40 DGF-FYANHPVD-SDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYH 97
DGF F +D + +I+IEAF DPVC DSRD+WPPLK+AL HYG VSLVVH PLPYH
Sbjct: 24 DGFLFGGGGKIDITKSIVIEAFLDPVCSDSRDSWPPLKKALHHYGSRVSLVVHTFPLPYH 83
Query: 98 DNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGI 157
DNA+ TSRALHIVN+ NSSATF LLE FF QQ+KFYN T N+++ +VV E+VKF + I
Sbjct: 84 DNAFTTSRALHIVNKLNSSATFRLLEAFFDQQDKFYNQATFNLSKASVVDEVVKFTSNKI 143
Query: 158 GNSYSSALESGFSDRSTDLLTRVSFK 183
GNS +A+++GF+D TD TR+SFK
Sbjct: 144 GNSNYAAVKAGFTDPKTDQATRISFK 169
>gi|351721154|ref|NP_001236688.1| uncharacterized protein LOC100527484 precursor [Glycine max]
gi|255632456|gb|ACU16578.1| unknown [Glycine max]
Length = 233
Score = 186 bits (473), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 99/170 (58%), Positives = 124/170 (72%)
Query: 14 ILQSALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPP 73
I+Q A L FV + PP++ DGF Y N +D D ++IEAF+DPVCP SRD+WPP
Sbjct: 8 IVQLAALQVFVIFLSGGADYIPPSRLDGFVYQNGYLDLDTVLIEAFYDPVCPYSRDSWPP 67
Query: 74 LKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFY 133
LKQAL H+ VS ++HLLPLPYHDNA+ +RALHIVN N+SATF LLEWFFK QEKFY
Sbjct: 68 LKQALHHHSSRVSFLLHLLPLPYHDNAFVATRALHIVNTLNASATFPLLEWFFKHQEKFY 127
Query: 134 NAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
A T+N++R ++V EIVK A E G+SY +A++ GF+D TD TRVSFK
Sbjct: 128 GAQTRNLSRASIVDEIVKSATEAAGSSYYNAIKHGFNDTKTDYQTRVSFK 177
>gi|297842343|ref|XP_002889053.1| hypothetical protein ARALYDRAFT_895477 [Arabidopsis lyrata subsp.
lyrata]
gi|297334894|gb|EFH65312.1| hypothetical protein ARALYDRAFT_895477 [Arabidopsis lyrata subsp.
lyrata]
Length = 229
Score = 186 bits (473), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 101/165 (61%), Positives = 121/165 (73%), Gaps = 1/165 (0%)
Query: 20 LCFFVFNSCSKSQSTPPAKYDGFFYA-NHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQAL 78
L FV + ++Q PP K DGF Y H D D I+IEA+FDPVCPDSRD+WPPLKQAL
Sbjct: 9 LVVFVTETRVQAQLVPPVKQDGFVYPPGHRFDPDTILIEAYFDPVCPDSRDSWPPLKQAL 68
Query: 79 QHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQ 138
+HYG V+ ++HLLPLPYHDNAY TSRALHIVN N++ATF LLE FFK Q FYNA T
Sbjct: 69 RHYGSRVAFLLHLLPLPYHDNAYVTSRALHIVNTVNANATFSLLEGFFKHQPLFYNAQTN 128
Query: 139 NMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
+TR AVV++IV+ +G SY S L+SGFSD+ +D TRVSFK
Sbjct: 129 LLTRAAVVEKIVELGTVTLGKSYQSVLKSGFSDKKSDRATRVSFK 173
>gi|32526672|dbj|BAC79195.1| unknown protein [Oryza sativa Japonica Group]
Length = 220
Score = 173 bits (438), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 91/184 (49%), Positives = 119/184 (64%), Gaps = 10/184 (5%)
Query: 8 KNHATLILQSALLCFFVFNSCS------KSQSTPPAKYDGFFYANHPVDS--DAIIIEAF 59
+ A L+ + LLC V SCS +Q++ P +YDGF Y + + D I++EAF
Sbjct: 3 RPRAPLLRRMLLLCA-VAASCSYYLLVLHAQASVPPRYDGFAYGDAATAAWKDTILVEAF 61
Query: 60 FDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATF 119
DP+CPDSRDAW PL+ A+ Y P VSL+VH PLPYH N++ RAL+I N+ NSS+T+
Sbjct: 62 LDPLCPDSRDAWAPLRLAVDRYAPRVSLIVHPFPLPYHTNSFLACRALYIANKLNSSSTY 121
Query: 120 CLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTR 179
LLE FFK Q KFYNA T +++ T + E+ K AA +GNS S +SGFSD TDL R
Sbjct: 122 PLLELFFKSQGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-FQSGFSDIRTDLAAR 180
Query: 180 VSFK 183
VSFK
Sbjct: 181 VSFK 184
>gi|115480453|ref|NP_001063820.1| Os09g0542200 [Oryza sativa Japonica Group]
gi|52076083|dbj|BAD46596.1| unknown protein [Oryza sativa Japonica Group]
gi|113632053|dbj|BAF25734.1| Os09g0542200 [Oryza sativa Japonica Group]
gi|215694060|dbj|BAG89259.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740808|dbj|BAG96964.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 237
Score = 172 bits (437), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 91/184 (49%), Positives = 119/184 (64%), Gaps = 10/184 (5%)
Query: 8 KNHATLILQSALLCFFVFNSCS------KSQSTPPAKYDGFFYANHPVDS--DAIIIEAF 59
+ A L+ + LLC V SCS +Q++ P +YDGF Y + + D I++EAF
Sbjct: 3 RPRAPLLRRMLLLCA-VAASCSYYLLVLHAQASVPPRYDGFAYGDAATAAWKDTILVEAF 61
Query: 60 FDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATF 119
DP+CPDSRDAW PL+ A+ Y P VSL+VH PLPYH N++ RAL+I N+ NSS+T+
Sbjct: 62 LDPLCPDSRDAWAPLRLAVDRYAPRVSLIVHPFPLPYHTNSFLACRALYIANKLNSSSTY 121
Query: 120 CLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTR 179
LLE FFK Q KFYNA T +++ T + E+ K AA +GNS S +SGFSD TDL R
Sbjct: 122 PLLELFFKSQGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-FQSGFSDIRTDLAAR 180
Query: 180 VSFK 183
VSFK
Sbjct: 181 VSFK 184
>gi|218202548|gb|EEC84975.1| hypothetical protein OsI_32232 [Oryza sativa Indica Group]
Length = 237
Score = 172 bits (437), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 91/184 (49%), Positives = 119/184 (64%), Gaps = 10/184 (5%)
Query: 8 KNHATLILQSALLCFFVFNSCS------KSQSTPPAKYDGFFYANHPVDS--DAIIIEAF 59
+ A L+ + LLC V SCS +Q++ P +YDGF Y + + D I++EAF
Sbjct: 3 RPRAPLLRRMLLLCA-VAASCSYYLLVLHAQASVPPRYDGFAYGDAATAAWKDTILVEAF 61
Query: 60 FDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATF 119
DP+CPDSRDAW PL+ A+ Y P VSL+VH PLPYH N++ RAL+I N+ NSS+T+
Sbjct: 62 LDPLCPDSRDAWAPLRLAVDRYAPRVSLIVHPFPLPYHTNSFLACRALYIANKLNSSSTY 121
Query: 120 CLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTR 179
LLE FFK Q KFYNA T +++ T + E+ K AA +GNS S +SGFSD TDL R
Sbjct: 122 PLLELFFKSQGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-FQSGFSDIRTDLAAR 180
Query: 180 VSFK 183
VSFK
Sbjct: 181 VSFK 184
>gi|326520974|dbj|BAJ92850.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 292
Score = 172 bits (436), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 88/177 (49%), Positives = 116/177 (65%), Gaps = 8/177 (4%)
Query: 14 ILQSALLCFFVFNSCS------KSQSTPPAKYDGFFYA-NHPVDSDAIIIEAFFDPVCPD 66
+L+ L+ + SC+ ++Q++ P +YDGF Y DA++IEAF DP+CPD
Sbjct: 64 LLRRLLMAGALAASCTYFLLVIQAQASAPRRYDGFAYGVGAAAWKDAVLIEAFLDPLCPD 123
Query: 67 SRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFF 126
SRDAW PL+ A++ Y P VSL+VH PLPYH +Y RALHI N+ NSS+T+ +LE FF
Sbjct: 124 SRDAWHPLRLAVERYSPLVSLIVHPFPLPYHTYSYHACRALHIANKLNSSSTYPVLELFF 183
Query: 127 KQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
K Q KF N T +M+ TAV EI K AA+ +GNS S +SGFSD TD+ RVSFK
Sbjct: 184 KNQGKFSNRATSSMSSTAVTGEISKMAAQAVGNSVSD-FQSGFSDTRTDMAARVSFK 239
>gi|226528876|ref|NP_001145190.1| uncharacterized protein LOC100278440 [Zea mays]
gi|195652481|gb|ACG45708.1| hypothetical protein [Zea mays]
Length = 238
Score = 167 bits (422), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 85/169 (50%), Positives = 111/169 (65%), Gaps = 3/169 (1%)
Query: 30 KSQSTPPAKYDGFFYANHPVDS--DAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSL 87
++Q++ P +YDGF Y + DA+++EAF DP+CPDSRDAW PLK A++ Y P VSL
Sbjct: 31 QAQASVPPRYDGFAYGGGAATAWKDAVLVEAFLDPLCPDSRDAWQPLKLAVERYAPRVSL 90
Query: 88 VVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVK 147
+VH PLPYH A+ RAL+I N+ NSS+T+ LLE FFK QEKFYN+ T +++ +V
Sbjct: 91 IVHPFPLPYHTYAFYACRALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSGPSVAL 150
Query: 148 EIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
+ K AA+ +GNS S L SGFSD TD RVSFK F FF
Sbjct: 151 GMSKMAAQTVGNSVSEFL-SGFSDGKTDSAARVSFKYGCTRGVFGAPFF 198
>gi|194697338|gb|ACF82753.1| unknown [Zea mays]
Length = 238
Score = 167 bits (422), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 85/169 (50%), Positives = 111/169 (65%), Gaps = 3/169 (1%)
Query: 30 KSQSTPPAKYDGFFYANHPVDS--DAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSL 87
++Q++ P +YDGF Y + DA+++EAF DP+CPDSRDAW PLK A++ Y P VSL
Sbjct: 31 QAQASVPPRYDGFAYGGGAATAWKDAVLVEAFLDPLCPDSRDAWQPLKLAVERYAPRVSL 90
Query: 88 VVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVK 147
+VH PLPYH A+ RAL+I N+ NSS+T+ LLE FFK QEKFYN+ T +++ +V
Sbjct: 91 IVHPFPLPYHTYAFYACRALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSGPSVAL 150
Query: 148 EIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
+ K AA+ +GNS S L SGFSD TD RVSFK F FF
Sbjct: 151 GMSKMAAQTVGNSVSEFL-SGFSDGKTDSAARVSFKYGCTRGVFGAPFF 198
>gi|357168058|ref|XP_003581462.1| PREDICTED: uncharacterized protein LOC100834992 [Brachypodium
distachyon]
Length = 219
Score = 165 bits (418), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 80/155 (51%), Positives = 106/155 (68%), Gaps = 2/155 (1%)
Query: 31 SQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VSLVV 89
+Q PA+ DGF Y +A+++EA+FDPVCPDSRDAWPPL++A+QHYG VS+VV
Sbjct: 22 AQMPIPARTDGFAYGKATAWGEAVVVEAYFDPVCPDSRDAWPPLQKAVQHYGAQRVSVVV 81
Query: 90 HLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEI 149
HL PLPYH NA+ R++H VN+ N+SA + LLE FFK QE +YN PT +R VV EI
Sbjct: 82 HLFPLPYHSNAFIACRSIHTVNKLNASAVYPLLEKFFKYQEAYYNQPTYTKSRETVVAEI 141
Query: 150 V-KFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
A IG + +A ++GF+D +D TR+SFK
Sbjct: 142 TNNLVAPVIGEANLAAYKAGFNDSMSDQSTRISFK 176
>gi|326491935|dbj|BAJ98192.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326493720|dbj|BAJ85321.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326504198|dbj|BAJ90931.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326511459|dbj|BAJ87743.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326520669|dbj|BAJ92698.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 226
Score = 164 bits (414), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 81/158 (51%), Positives = 107/158 (67%), Gaps = 5/158 (3%)
Query: 31 SQSTPPAKYDGFFYANH---PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VS 86
+Q PA+ DGF Y H P DA+++EAFFDPVCPDSRDAWPPL++A H+G V+
Sbjct: 26 AQVPIPARTDGFVYGGHAAAPAWGDAVVVEAFFDPVCPDSRDAWPPLQRAADHFGARRVA 85
Query: 87 LVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVV 146
+VVHL PLPYH +A+ R++H V++ N+SA + LLE FFK QE +YN PT TR AVV
Sbjct: 86 VVVHLFPLPYHSSAFIACRSIHTVHKLNASAVYPLLEKFFKYQEGYYNTPTYTKTRAAVV 145
Query: 147 KEIV-KFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
EI A IG + +A ++GF+D +D TR+SFK
Sbjct: 146 AEIANNLVAPVIGEANLAAYKAGFNDSQSDQATRISFK 183
>gi|115457372|ref|NP_001052286.1| Os04g0227500 [Oryza sativa Japonica Group]
gi|38346025|emb|CAE01956.2| OSJNBb0071D01.2 [Oryza sativa Japonica Group]
gi|38346893|emb|CAE03918.2| OSJNBb0015G09.12 [Oryza sativa Japonica Group]
gi|113563857|dbj|BAF14200.1| Os04g0227500 [Oryza sativa Japonica Group]
gi|125546580|gb|EAY92719.1| hypothetical protein OsI_14471 [Oryza sativa Indica Group]
gi|125588738|gb|EAZ29402.1| hypothetical protein OsJ_13476 [Oryza sativa Japonica Group]
gi|215679384|dbj|BAG96524.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765592|dbj|BAG87289.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 226
Score = 161 bits (408), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 80/174 (45%), Positives = 109/174 (62%), Gaps = 8/174 (4%)
Query: 18 ALLCFFVFNSCS------KSQSTPPAKYDGFFYANH-PVDSDAIIIEAFFDPVCPDSRDA 70
+LL V C+ +Q PA+ DGF Y P + +++EA+FDPVCPDSRDA
Sbjct: 9 SLLLVLVIGCCAWGCRPGAAQVPVPARTDGFVYGGKAPALGETVVVEAYFDPVCPDSRDA 68
Query: 71 WPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQE 130
WP LK+A++HY V++VVHL PLPYH NA+ + R++H VN+ N S + LLE FFK QE
Sbjct: 69 WPELKKAVEHYASRVTVVVHLFPLPYHSNAFISCRSIHAVNKINPSFVYPLLERFFKYQE 128
Query: 131 KFYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
+YN PT TR V E+ K A IG + +A ++GF+D +D TR+SFK
Sbjct: 129 GYYNQPTYGKTRATVDAEVAKNLVAPVIGEANLAAYKAGFNDSKSDQATRISFK 182
>gi|357159750|ref|XP_003578547.1| PREDICTED: uncharacterized protein LOC100839255 [Brachypodium
distachyon]
Length = 238
Score = 160 bits (404), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 85/177 (48%), Positives = 113/177 (63%), Gaps = 13/177 (7%)
Query: 13 LILQSALLCFFVFNSCS------KSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPD 66
++L SAL SCS ++Q++ P +YDGF Y D++++EAF DP+CPD
Sbjct: 16 ILLASALAA-----SCSYYLLVLQAQASVPPRYDGFAY-RAAAWKDSVLVEAFLDPLCPD 69
Query: 67 SRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFF 126
SRDAW PL+ A++ Y P VSL+VH PLPYH ++ RALHI N+ NSS+T+ LLE FF
Sbjct: 70 SRDAWHPLRLAVERYSPRVSLIVHPFPLPYHTYSFHACRALHIANKLNSSSTYPLLELFF 129
Query: 127 KQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
K Q KF N +++ TA+ EI K A + +GNS S +SGFSD TD RVSFK
Sbjct: 130 KNQGKFSNHALSSLSSTAITGEISKMAVQAVGNSVSE-FQSGFSDARTDSAARVSFK 185
>gi|116781410|gb|ABK22089.1| unknown [Picea sitchensis]
Length = 227
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 73/165 (44%), Positives = 108/165 (65%), Gaps = 2/165 (1%)
Query: 20 LCFFVFNSCSKSQSTP-PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQAL 78
L F + + P P +YDGF Y + V ++++++EAFFDP+CPDSRD+W PLK+ L
Sbjct: 14 LLMLAFTAARITAQLPIPPRYDGFVYKDR-VTANSVLVEAFFDPLCPDSRDSWWPLKEVL 72
Query: 79 QHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQ 138
+ YG +++ +VH LPYH NA+ R+LHI NR ++ T+ LL+ FFK Q +FYN T
Sbjct: 73 RFYGDNITFIVHPFALPYHHNAFIACRSLHIANRIKTAYTYPLLDHFFKHQARFYNKATL 132
Query: 139 NMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
+ +++ +I+ FA E GNS S+ ES F D +TD+ TR+SFK
Sbjct: 133 QVAPASIINQIIHFALEISGNSSSTMFESAFQDTTTDMATRISFK 177
>gi|242072508|ref|XP_002446190.1| hypothetical protein SORBIDRAFT_06g003190 [Sorghum bicolor]
gi|241937373|gb|EES10518.1| hypothetical protein SORBIDRAFT_06g003190 [Sorghum bicolor]
Length = 220
Score = 156 bits (394), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 78/173 (45%), Positives = 107/173 (61%), Gaps = 7/173 (4%)
Query: 18 ALLCFFVFNSCS----KSQSTPPAKYDGFFYANHPVDS--DAIIIEAFFDPVCPDSRDAW 71
+LL V C+ ++Q PA+ DGF Y P + + +++EAF DPVCPDSRDAW
Sbjct: 5 SLLVVIVVGCCACWGCRAQIPIPARMDGFVYGGKPPPAWGETVVVEAFLDPVCPDSRDAW 64
Query: 72 PPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEK 131
P LK+ + HY VS+V HL PLPYH A+ +++H VN+ N S + LLE FFK QE
Sbjct: 65 PALKKVVDHYSNRVSVVAHLFPLPYHSYAFIACQSIHAVNKLNPSFVYPLLEKFFKDQEG 124
Query: 132 FYNAPTQNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
+YN PT +R VV EI K A IG + +A ++GF+D +D+ TR+SFK
Sbjct: 125 YYNRPTYGKSRATVVDEITKNLVAPIIGETNLAAYKAGFNDSQSDMATRISFK 177
>gi|226491612|ref|NP_001151670.1| LOC100285305 precursor [Zea mays]
gi|195648623|gb|ACG43779.1| DSBA-like thioredoxin domain containing protein [Zea mays]
Length = 223
Score = 155 bits (391), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 76/167 (45%), Positives = 102/167 (61%), Gaps = 2/167 (1%)
Query: 19 LLCFFVFNSCSKSQSTPPAKYDGFFYANH-PVDSDAIIIEAFFDPVCPDSRDAWPPLKQA 77
LL + ++Q PA+ DGF Y P + +++EAF DPVCPDSRDAWP LK
Sbjct: 14 LLVVIIGCCACRAQIPIPARTDGFVYGGKPPAWGETVVVEAFLDPVCPDSRDAWPALKMV 73
Query: 78 LQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPT 137
++HY VS+VVHL PLPYH A+ R++H VN+ N S + LLE FFK Q+++YN PT
Sbjct: 74 VEHYSSRVSVVVHLFPLPYHSYAFIACRSIHAVNKLNPSFVYPLLEKFFKYQKRYYNQPT 133
Query: 138 QNMTRTAVVKEIVK-FAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
+R VV EI K IG + +A +GF+D +D TR+SFK
Sbjct: 134 YEKSRATVVDEITKNLVVPIIGETNLAAFRAGFNDSHSDQATRISFK 180
>gi|195655073|gb|ACG47004.1| hypothetical protein [Zea mays]
Length = 235
Score = 154 bits (390), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 83/169 (49%), Positives = 108/169 (63%), Gaps = 6/169 (3%)
Query: 30 KSQSTPPAKYDGFFYANHPVDS--DAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSL 87
++Q++ P +YDGF Y + DA+++EAF DP DSRDAW PLK A++ Y P VSL
Sbjct: 31 QAQASVPPRYDGFAYGGGAATAWKDAVLVEAFLDP---DSRDAWQPLKLAVERYAPRVSL 87
Query: 88 VVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVK 147
+VH PLPYH A+ RAL+I N+ NSS+T+ LLE FFK QEKFYN+ T +++ +V
Sbjct: 88 IVHPFPLPYHTYAFYACRALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSGPSVAL 147
Query: 148 EIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
+ K AA+ +GNS S L SGFSD TD RVSFK F FF
Sbjct: 148 GMSKMAAQTVGNSVSEFL-SGFSDGKTDSAARVSFKYGCTRGVFGAPFF 195
>gi|242050028|ref|XP_002462758.1| hypothetical protein SORBIDRAFT_02g031470 [Sorghum bicolor]
gi|241926135|gb|EER99279.1| hypothetical protein SORBIDRAFT_02g031470 [Sorghum bicolor]
Length = 238
Score = 149 bits (377), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 84/169 (49%), Positives = 111/169 (65%), Gaps = 3/169 (1%)
Query: 30 KSQSTPPAKYDGFFYANHPVD--SDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSL 87
++Q++ P +YDGF Y DA+++EAF DP+CPDSRD+W PLK A++ Y P VSL
Sbjct: 31 QAQASVPPRYDGFAYGGGAAAAWKDAVLVEAFLDPLCPDSRDSWHPLKLAIERYAPRVSL 90
Query: 88 VVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVK 147
+VH PLPYH A+ RAL+I N+ NSS+T+ LLE FFK QEKFYN+ T +++ +V
Sbjct: 91 IVHPFPLPYHTYAFHACRALYIANKLNSSSTYPLLELFFKNQEKFYNSATSSLSSPSVAV 150
Query: 148 EIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
E+ K AA+ +GNS S L SGFSD TD RVSFK + FF
Sbjct: 151 EMSKMAAQTVGNSVSEFL-SGFSDTKTDSAARVSFKYGCTRGVYGAPFF 198
>gi|302791067|ref|XP_002977300.1| hypothetical protein SELMODRAFT_271243 [Selaginella moellendorffii]
gi|300154670|gb|EFJ21304.1| hypothetical protein SELMODRAFT_271243 [Selaginella moellendorffii]
Length = 226
Score = 139 bits (349), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 67/150 (44%), Positives = 95/150 (63%), Gaps = 2/150 (1%)
Query: 36 PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLP 95
P +YDGF + ++ + I++EAFFDP+CPDS+D+WP +KQ + YG V +VH LP
Sbjct: 27 PRRYDGFAFGSYGGVQEPILVEAFFDPLCPDSKDSWPAIKQVAEEYGSEVKFIVHPFALP 86
Query: 96 YHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVK-FAA 154
YH ++ + RALHI N N+S T+ LL+ F+ QE+F NA T + T + V+ E FA+
Sbjct: 87 YHQQSFLSVRALHIANHLNASLTYPLLDLIFEHQEEFSNANTADKTASTVIDEFSSLFAS 146
Query: 155 E-GIGNSYSSALESGFSDRSTDLLTRVSFK 183
+ G GN S + GF D STD R+SFK
Sbjct: 147 QFGSGNEAKSIFKQGFYDSSTDQAGRISFK 176
>gi|302780317|ref|XP_002971933.1| hypothetical protein SELMODRAFT_270941 [Selaginella moellendorffii]
gi|300160232|gb|EFJ26850.1| hypothetical protein SELMODRAFT_270941 [Selaginella moellendorffii]
Length = 226
Score = 138 bits (347), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 67/150 (44%), Positives = 95/150 (63%), Gaps = 2/150 (1%)
Query: 36 PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLP 95
P +YDGF + ++ + I++EAFFDP+CPDS+D+WP +KQ + YG V +VH LP
Sbjct: 27 PRRYDGFAFGSYGGVQEPILVEAFFDPLCPDSKDSWPAIKQVAEEYGSEVKFIVHPFALP 86
Query: 96 YHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVK-FAA 154
YH ++ + RALHI N N+S T+ LL+ F+ QE+F NA T + T + V+ E FA+
Sbjct: 87 YHHQSFLSVRALHIANHLNASLTYPLLDLIFEHQEEFSNANTGDKTASTVIDEFSSLFAS 146
Query: 155 E-GIGNSYSSALESGFSDRSTDLLTRVSFK 183
+ G GN S + GF D STD R+SFK
Sbjct: 147 QFGSGNEAKSIFKQGFYDSSTDQAGRISFK 176
>gi|413917904|gb|AFW57836.1| DSBA-like thioredoxin domain containing protein [Zea mays]
Length = 146
Score = 136 bits (342), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 62/130 (47%), Positives = 84/130 (64%), Gaps = 2/130 (1%)
Query: 20 LCFFVFNSCS-KSQSTPPAKYDGFFYANHP-VDSDAIIIEAFFDPVCPDSRDAWPPLKQA 77
L + C+ ++Q PA+ DGF Y P + +++EAF DPVCPDSRDAWP LK+
Sbjct: 14 LLLVIIGCCACRAQIPIPARTDGFVYGGKPPAWGETVVVEAFLDPVCPDSRDAWPALKKV 73
Query: 78 LQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPT 137
++HY VS+VVHL PLPYH A+ R++H VN+ N S + LLE FFK QE++YN PT
Sbjct: 74 VEHYSSRVSIVVHLFPLPYHSYAFIACRSIHAVNKLNPSFVYRLLEKFFKDQERYYNQPT 133
Query: 138 QNMTRTAVVK 147
+R VV+
Sbjct: 134 YEKSRATVVR 143
>gi|413917905|gb|AFW57837.1| hypothetical protein ZEAMMB73_294031 [Zea mays]
Length = 212
Score = 135 bits (341), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 70/166 (42%), Positives = 97/166 (58%), Gaps = 12/166 (7%)
Query: 20 LCFFVFNSCS-KSQSTPPAKYDGFFYANH-PVDSDAIIIEAFFDPVCPDSRDAWPPLKQA 77
L + C+ ++Q PA+ DGF Y P + +++EAF DPVCPDSRDAWP LK+
Sbjct: 14 LLLVIIGCCACRAQIPIPARTDGFVYGGKPPAWGETVVVEAFLDPVCPDSRDAWPALKKV 73
Query: 78 LQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPT 137
++HY VS+VVHL PLPYH A+ R++H VN+ N S + LLE FFK Q Y+
Sbjct: 74 VEHYSSRVSIVVHLFPLPYHSYAFIACRSIHAVNKLNPSFVYRLLEKFFKDQS--YSGTV 131
Query: 138 QNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
+T+ VV IG + +A +GF+D +D TR+SFK
Sbjct: 132 DEITKNLVVPI--------IGETNLAAFRAGFNDSHSDQATRISFK 169
>gi|222642012|gb|EEE70144.1| hypothetical protein OsJ_30190 [Oryza sativa Japonica Group]
Length = 216
Score = 125 bits (315), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 75/184 (40%), Positives = 99/184 (53%), Gaps = 31/184 (16%)
Query: 8 KNHATLILQSALLCFFVFNSCS------KSQSTPPAKYDGFFYANHPVDS--DAIIIEAF 59
+ A L+ + LLC V SCS +Q++ P +YDGF Y + + D I++EAF
Sbjct: 3 RPRAPLLRRMLLLCA-VAASCSYYLLVLHAQASVPPRYDGFAYGDAATAAWKDTILVEAF 61
Query: 60 FDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATF 119
DP+CPDSRDAW PL+ A+ Y P VSL+VH PLP Y
Sbjct: 62 LDPLCPDSRDAWAPLRLAVDRYAPRVSLIVHPFPLPQQMVGY------------------ 103
Query: 120 CLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTR 179
C + W + KFYNA T +++ T + E+ K AA +GNS S +SGFSD TDL R
Sbjct: 104 CFMCWV---EGKFYNAATSSLSSTVISGEMSKLAARVVGNSVSE-FQSGFSDIRTDLAAR 159
Query: 180 VSFK 183
VSFK
Sbjct: 160 VSFK 163
>gi|168037769|ref|XP_001771375.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677293|gb|EDQ63765.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 191
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/154 (38%), Positives = 87/154 (56%), Gaps = 3/154 (1%)
Query: 31 SQSTPPAKYDGFFY-ANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVV 89
+Q P +YDGF Y + ++ E F DP+C D ++AWP +KQ ++ YG + L+V
Sbjct: 1 AQFAIPERYDGFVYNVSTEQGRGPVMWEVFVDPLCIDCKNAWPVVKQVVEKYGSALLLIV 60
Query: 90 HLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEI 149
H P P+H NA+ SR LH+ TNSS + LLE F QE F N T + T VV +
Sbjct: 61 HPFPAPFHHNAFFASRGLHVAQMTNSSLVYPLLELIFSNQESFLNPSTNHETPAHVVSRL 120
Query: 150 VKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
+ A+ +G +++ E F+D TD TR+SFK
Sbjct: 121 ISL-ADNLGFP-TNSFEVAFNDAVTDQATRISFK 152
>gi|156379454|ref|XP_001631472.1| predicted protein [Nematostella vectensis]
gi|156218513|gb|EDO39409.1| predicted protein [Nematostella vectensis]
Length = 214
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/166 (33%), Positives = 83/166 (50%), Gaps = 5/166 (3%)
Query: 20 LCFFVFNSCSKSQSTP-PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQAL 78
LC S + S+ P P K G+ Y N + I +EAF D CPD + AWP +KQ
Sbjct: 12 LCLLFTTSSTFSKGVPIPKKPLGYVYGNGK-PTAPIHLEAFVDLTCPDCQQAWPTIKQVA 70
Query: 79 QHYGPHVSLV-VHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPT 137
+ YGP LV V PLPYH NA+ ++++ +V NSS F ++ FK Q + YN T
Sbjct: 71 KLYGPDTILVLVQPFPLPYHTNAFIAAQSVPVVASYNSSLVFTWIDVLFKFQSELYNFQT 130
Query: 138 QNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
N R ++ + A + S +++G + +D R+ +K
Sbjct: 131 MNKNRYDILNIVSSLAPK--AGVPSDIMKTGLTGTESDGAARIGWK 174
>gi|125546585|gb|EAY92724.1| hypothetical protein OsI_14474 [Oryza sativa Indica Group]
Length = 98
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 54/85 (63%), Gaps = 7/85 (8%)
Query: 18 ALLCFFVFNSCS------KSQSTPPAKYDGFFYANH-PVDSDAIIIEAFFDPVCPDSRDA 70
+LL V C+ +Q PA+ DGF Y P + +++EA+FDPVCPDSRDA
Sbjct: 9 SLLLVLVIGCCAWGCRPGAAQVPVPARTDGFVYGGKAPALGETVVVEAYFDPVCPDSRDA 68
Query: 71 WPPLKQALQHYGPHVSLVVHLLPLP 95
WP LK+A++HY V++VVHL PLP
Sbjct: 69 WPELKKAVEHYASRVTVVVHLFPLP 93
>gi|320169829|gb|EFW46728.1| hypothetical protein CAOG_04686 [Capsaspora owczarzaki ATCC 30864]
Length = 261
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 80/149 (53%), Gaps = 4/149 (2%)
Query: 36 PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYG-PHVSLVVHLLPL 94
P++ DGF YAN + + I+ F D +CPDS+ WP LKQ HYG P + LV H+ PL
Sbjct: 24 PSRPDGFAYANGS-PAAPLQIDLFVDLLCPDSQAIWPTLKQVADHYGAPTLRLVTHIFPL 82
Query: 95 PYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAA 154
PYH NAY ++ +V N +A + L+ F Q+ F + PT N+T V+ +
Sbjct: 83 PYHHNAYYAAQGTQVVAAANVNAVYKWLDAVFAAQDSFEDDPTVNLTSNQVINMYAAL-S 141
Query: 155 EGIGNSYSSALESGFSDRSTDLLTRVSFK 183
+ IG + L+ G TD R+++K
Sbjct: 142 QTIGVPAAVFLK-GMDSDDTDESARIAWK 169
>gi|196010617|ref|XP_002115173.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190582556|gb|EDV22629.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 217
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 88/172 (51%), Gaps = 7/172 (4%)
Query: 18 ALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQA 77
A L F SQ+ + DG+ Y N ++ I +EAF D CPD + AWP LK+
Sbjct: 3 ACLVLFALIPVIYSQAPILPRPDGYVYHNGSPEAP-IHMEAFIDLACPDCKTAWPILKKL 61
Query: 78 LQHYGPH-VSLVVHLLPLPYHDNAYATSRALHIVNRTNSSA--TFCLLEWFFKQQEKFYN 134
YGP+ + + H+ PL YH AY + + + N S F ++ FK Q +FYN
Sbjct: 62 ADSYGPNKLRMTFHINPLVYHRQAYYAAMGVRALVAANVSVHDLFNYMDAVFKNQSEFYN 121
Query: 135 APTQNMTRTAVVKEIVKFAAEGIGNS---YSSALESGFSDRSTDLLTRVSFK 183
PT M+ V++ K A + G S +++AL++ D +T L ++++++
Sbjct: 122 KPTLKMSGLQVIQAYAKLAHKAAGISAMKFANALQNSGIDHTTRLSSKLAWE 173
>gi|154420795|ref|XP_001583412.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121917653|gb|EAY22426.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 198
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 84/167 (50%), Gaps = 11/167 (6%)
Query: 19 LLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQAL 78
+L FFVF + S P++ G Y +P D+ II+E + DP+C D D+W + A+
Sbjct: 1 MLSFFVFQALS---CPIPSRPPGIVY--NP-DAKDIIVEMYADPLCSDCLDSWSAVSAAI 54
Query: 79 QHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQ--QEKFYNAP 136
Y + ++HLLPLPYH + S+ + N+S LL + Q F +
Sbjct: 55 TKYKDNARFIIHLLPLPYHTWTFMVSKVIMAAKSINTSYAPILLNCLYADGNQSMFLGSE 114
Query: 137 TQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
+++T + K+ +K+AAE +G + + L F + + TR+ FK
Sbjct: 115 VKSVTAGDMQKKALKWAAEKLGITVND-LTKAFGTQEMN--TRIEFK 158
>gi|405953203|gb|EKC20909.1| Eppin [Crassostrea gigas]
Length = 403
Score = 76.6 bits (187), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 72/134 (53%), Gaps = 7/134 (5%)
Query: 56 IEAFFDPVCPDSRDAWPPLKQALQHYGPHV-SLVVHLLPLPYHDNAYATSRALHIVNRTN 114
++AF PVCPDS++ +P L Q HYGP V +L +H+ PLPYH N++ + +VNR
Sbjct: 42 VDAFLGPVCPDSKEVFPTLLQLANHYGPDVLTLRMHMFPLPYHRNSFLVAMGAQVVNRMI 101
Query: 115 SSATFCLLEW---FFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSD 171
SSA + W + + + N T++M+ ++ + AA G+G S ++ +D
Sbjct: 102 SSAQG-VYNWTGSVYDKIDTLSNTATKSMSEIQIISMLGDIAA-GLGIQKSVFVQK-MAD 158
Query: 172 RSTDLLTRVSFKVT 185
D RV +K T
Sbjct: 159 PLVDEDARVEWKYT 172
>gi|118386030|ref|XP_001026137.1| hypothetical protein TTHERM_01145000 [Tetrahymena thermophila]
gi|89307904|gb|EAS05892.1| hypothetical protein TTHERM_01145000 [Tetrahymena thermophila
SB210]
Length = 230
Score = 75.9 bits (185), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 76/161 (47%), Gaps = 18/161 (11%)
Query: 31 SQSTP-PAKYDGFFYA--NHPVDSDAIIIEAFFDPVCPDSRDAWPPLK-----QALQHYG 82
+Q+ P P YDGF N PV +EAF+D +CP S A + Y
Sbjct: 18 TQNVPIPNTYDGFVIGDPNAPV-----TVEAFYDLLCPASEAANEQINLLFTNATFAQYL 72
Query: 83 PHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTR 142
P + + H+ PLPYHD A+ H+ + ++ TF + W F QE+++N P N TR
Sbjct: 73 PKIKFIYHIFPLPYHDYAFR-----HVFDNHDAKTTFDFINWSFDIQEQYWNGPVTNYTR 127
Query: 143 TAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
AV ++ + + + + ++R+ DL R S+K
Sbjct: 128 NAVFNKLAHQLHDFVPKISINETLAALTNRTYDLEARYSWK 168
>gi|440791659|gb|ELR12897.1| hypothetical protein ACA1_095270 [Acanthamoeba castellanii str.
Neff]
Length = 212
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/186 (29%), Positives = 90/186 (48%), Gaps = 9/186 (4%)
Query: 14 ILQSALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPP 73
IL L F VF + + PP +A D+ AI +EAF D CPDS+ WP
Sbjct: 6 ILALVLALFTVFVA-----AQPPIPLASSVFALGDADA-AIKLEAFVDLQCPDSKAVWPI 59
Query: 74 LKQALQHYGPH-VSLVVHLLPLPYHDNAY--ATSRALHIVNRTNSSATFCLLEWFFKQQE 130
L Q +HYGPH + LV L PLPYH NA+ A +R N++ + ++ + QQ+
Sbjct: 60 LLQLAKHYGPHKLQLVATLFPLPYHHNAFFAAQARPPFPELARNNTVFWDYAQYLYDQQD 119
Query: 131 KFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFS 190
+ + TQ + V+ ++ ++A ++ +++ + +LL RV++K
Sbjct: 120 NWGDEATQGIAPLQVITQMSRYAQACCKVDPAAFVKAMGWVSAQNLLARVTWKRAASLGI 179
Query: 191 FNTSFF 196
+ T F
Sbjct: 180 YGTPQF 185
>gi|390354820|ref|XP_003728415.1| PREDICTED: uncharacterized protein LOC100888511 [Strongylocentrotus
purpuratus]
Length = 219
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/175 (28%), Positives = 84/175 (48%), Gaps = 4/175 (2%)
Query: 10 HATLILQSALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRD 69
H T+++ S L S +Q+ P + GFFY+N D+ I +++F D +CPDS+
Sbjct: 2 HRTVVV-SLTLVLGGLISQGTAQAPIPLRPPGFFYSNGSCDA-PIQLQSFVDLMCPDSKA 59
Query: 70 AWPPLKQALQHYGPH-VSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQ 128
AW ++ +YGP V + PLPYH A ++ + R +S+ T+ F +
Sbjct: 60 AWRTYQKVADYYGPSTVRFSALIFPLPYHRAAMVAAQGAFVAARLDSNKTYSWFNTVFDK 119
Query: 129 QEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
Q YNA + +V + ++A++ +G S + + T L RV FK
Sbjct: 120 QAALYNANITEQPDSYIVNALSEWASD-VGYSKAQFMMHLSPSDPTQSLARVEFK 173
>gi|124088534|ref|XP_001347135.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|145474257|ref|XP_001423151.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|50057524|emb|CAH03508.1| Conserved hypothetical protein [Paramecium tetraurelia]
gi|124390211|emb|CAK55753.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 72.4 bits (176), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 59/114 (51%), Gaps = 1/114 (0%)
Query: 50 DSDAIIIEAFFDPVCPDSRDAWPPLKQAL-QHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
DS +IIEAFFD CPDSR+++ L+Q L Q YG +H+ PLPYH A+ ++A
Sbjct: 33 DSKKLIIEAFFDLQCPDSRNSFRILEQVLAQGYGTQFKYTIHMFPLPYHRAAFPEAQAFA 92
Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYS 162
++ ++ A + + F QE + T T ++ ++ A E + Y
Sbjct: 93 FLSEIDAEAAYLFAKIIFDNQETLAESATLEWTWQQILNKVADLAKEHVYPKYQ 146
>gi|313227081|emb|CBY22228.1| unnamed protein product [Oikopleura dioica]
Length = 217
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 73/156 (46%), Gaps = 6/156 (3%)
Query: 31 SQSTPPAKYDGFFY---ANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSL 87
Q+ P + GF Y N V++ + ++ F D CP+ AWP LK HYGP+V L
Sbjct: 15 GQAPIPNRPVGFVYNPTGNQFVEN-PVEVQVFIDLQCPNCLSAWPGLKAMGDHYGPNVRL 73
Query: 88 VVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVK 147
V PLPYH A+ + L VN+ N + ++ F Q+ N + +V+
Sbjct: 74 SVVAFPLPYHRAAFKMAWGLQAVNKMNPQLAYDYMDNIFANQDDIANY-AAGVNDKDLVQ 132
Query: 148 EIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
I + + + + L++ +D + D TRV +K
Sbjct: 133 YIANKTVDKLNINKDTFLDN-MADPNLDWATRVDWK 167
>gi|390368255|ref|XP_790435.2| PREDICTED: uncharacterized protein LOC585517 isoform 2
[Strongylocentrotus purpuratus]
gi|390368257|ref|XP_003731415.1| PREDICTED: uncharacterized protein LOC585517 isoform 1
[Strongylocentrotus purpuratus]
Length = 286
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 84/178 (47%), Gaps = 3/178 (1%)
Query: 7 NKNHATLILQSALLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPD 66
+ N ++ S L S +Q+ P + GFFY+ D+ I +E+F D +CPD
Sbjct: 62 SDNMHRTVVASLALVLGALISQGTAQAPIPLRPPGFFYSTGSCDA-PIQLESFLDLICPD 120
Query: 67 SRDAWPPLKQALQHYGPH-VSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWF 125
S+ AW ++ +YGP V + PLPYH A A ++ + + +S+ T+ L
Sbjct: 121 SKAAWRTYQKVADYYGPDTVRFSALMFPLPYHRAAMAAAQGAFVADMLDSNKTYSWLNTA 180
Query: 126 FKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
F +Q +A + ++K + ++A+E +G S + + T L RV FK
Sbjct: 181 FDKQAALSDANISEQPDSYILKVLSEWASE-VGYSKAQFMMHLSRSDPTQSLARVEFK 237
>gi|390368260|ref|XP_003731416.1| PREDICTED: uncharacterized protein LOC100890543 [Strongylocentrotus
purpuratus]
Length = 509
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 71/154 (46%), Gaps = 4/154 (2%)
Query: 31 SQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VSLVV 89
+Q+ P + GF Y P +++EA+ D +CPDS++AW + HYGP V
Sbjct: 19 AQAPIPDRPPGFAY--KPAPEAPVVLEAWVDLICPDSKNAWLTYQAVADHYGPEKVRFSA 76
Query: 90 HLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEI 149
+ PLPYH +A ++A + N N T+ + F +Q + + T+ V + +
Sbjct: 77 LMFPLPYHRSAMMAAQAAFVANELNPELTYAFFDNAFSKQPALSDDNIFDQTQAYVQETL 136
Query: 150 VKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
+A E + A SD T L RV FK
Sbjct: 137 AAWAGEVGYDQTEFASRLARSD-DTQWLARVEFK 169
>gi|313220976|emb|CBY31809.1| unnamed protein product [Oikopleura dioica]
Length = 217
Score = 67.4 bits (163), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 72/156 (46%), Gaps = 6/156 (3%)
Query: 31 SQSTPPAKYDGFFY---ANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSL 87
Q+ P + GF Y N V++ + ++ F D CP+ AWP LK HYGP+V L
Sbjct: 15 GQAPIPNRPVGFVYNPTGNQFVEN-PVEVQVFIDLQCPNCLSAWPGLKAMGDHYGPNVRL 73
Query: 88 VVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVK 147
V PLPYH A+ + L VN+ N + ++ F Q+ N + +V+
Sbjct: 74 SVVAFPLPYHRAAFKMAWGLQAVNKMNPQLAYDYMDNIFANQDDIANY-AAGVNDKDLVQ 132
Query: 148 EIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
I + + + + L+ +D + D TRV +K
Sbjct: 133 YIANKTVDKLNINKDTFLDK-MADPNLDWATRVDWK 167
>gi|307945890|ref|ZP_07661226.1| conserved hypothetical protein [Roseibium sp. TrichSKD4]
gi|307771763|gb|EFO30988.1| conserved hypothetical protein [Roseibium sp. TrichSKD4]
Length = 191
Score = 67.4 bits (163), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 75/163 (46%), Gaps = 5/163 (3%)
Query: 36 PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGP-HVSLVVHLLPL 94
P K G+FY + + +EAF D CP S+ AWP L +HY + L VHL+ L
Sbjct: 6 PRKPSGYFYGLRDAN---VQVEAFIDIQCPHSKLAWPSLVALKEHYAKGEIGLAVHLITL 62
Query: 95 PYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAA 154
H A+ S+ + + ++ F + + +Q +YN P + T ++ I FA
Sbjct: 63 SNHRQAWDVSKGIFALAGDDTEKFFGFASYLYARQSDYYNGPFLHKTHQDLLNLIADFAH 122
Query: 155 EGIGNSYSSALE-SGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
+ G + L+ G ++ TD T + F T+ ++ T F
Sbjct: 123 DFAGFNKDEMLKLIGTNEVYTDARTPIRFASTKGVWATPTFFL 165
>gi|422295502|gb|EKU22801.1| hypothetical protein NGA_0488300 [Nannochloropsis gaditana CCMP526]
Length = 228
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 62/146 (42%), Gaps = 3/146 (2%)
Query: 51 SDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIV 110
S + ++ F D VC D++ AWP +KQ +Q Y V H PLPYH NA+ A+
Sbjct: 52 SSGVNVDVFLDLVCIDTKKAWPVIKQVIQTYDRAVQWRFHQFPLPYHTNAFIG--AVVAA 109
Query: 111 NRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESGFS 170
N +A ++ F +Q + N TQ MT AV + + E G +
Sbjct: 110 TYGNDTAALRYMDIAFARQIQVSNELTQGMTYKAVEGVVAAWVQEATGLE-REVVRRKLQ 168
Query: 171 DRSTDLLTRVSFKVTQKFFSFNTSFF 196
+ + R SFK F T F
Sbjct: 169 EGGVEEAVRFSFKYATIRSIFGTPMF 194
>gi|260820710|ref|XP_002605677.1| hypothetical protein BRAFLDRAFT_77925 [Branchiostoma floridae]
gi|229291012|gb|EEN61687.1| hypothetical protein BRAFLDRAFT_77925 [Branchiostoma floridae]
Length = 292
Score = 66.2 bits (160), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 73/146 (50%), Gaps = 8/146 (5%)
Query: 41 GFFYAN-HPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VSLVVHLLPLPYHD 98
G+ Y N +P+ I IE F D VCPD R A+P LK+ YGP+ V L +HL PLPYH
Sbjct: 40 GYVYKNGNPLAP--IQIETFGDLVCPDFRAAFPILKEVADWYGPNLVCLKIHLFPLPYHK 97
Query: 99 NAYATSRALHIVNR-TNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGI 157
A+ T + LHI+ + TF ++ E F A N T V +++ + +
Sbjct: 98 YAFLTHQVLHIIEPIIGINGTFDYMDRVLADLEAFSGA-VMNRTEGQVYSKLLGI-VQTL 155
Query: 158 GNSYSSALESGFSDRSTDLLTRVSFK 183
G + + +G + S + RV FK
Sbjct: 156 GVT-ADQYWTGLDESSPNHRARVEFK 180
>gi|260820750|ref|XP_002605697.1| hypothetical protein BRAFLDRAFT_77946 [Branchiostoma floridae]
gi|229291032|gb|EEN61707.1| hypothetical protein BRAFLDRAFT_77946 [Branchiostoma floridae]
Length = 292
Score = 66.2 bits (160), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 73/146 (50%), Gaps = 8/146 (5%)
Query: 41 GFFYAN-HPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VSLVVHLLPLPYHD 98
G+ Y N +P+ I IE F D VCPD R A+P LK+ YGP+ V L +HL PLPYH
Sbjct: 40 GYVYKNGNPLAP--IQIETFGDLVCPDFRAAFPILKEVADWYGPNLVCLKIHLFPLPYHK 97
Query: 99 NAYATSRALHIVN-RTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGI 157
A+ T + LHI+ + TF ++ E F A N T V +++ + +
Sbjct: 98 YAFLTHQVLHIIEPIIGINGTFDYMDRVLADLEAFSGA-VMNRTEGQVYSKLLGI-VQTL 155
Query: 158 GNSYSSALESGFSDRSTDLLTRVSFK 183
G + + +G + S + RV FK
Sbjct: 156 GVT-ADQYWTGLDESSPNHRARVEFK 180
>gi|348690388|gb|EGZ30202.1| hypothetical protein PHYSODRAFT_477416 [Phytophthora sojae]
Length = 303
Score = 65.9 bits (159), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 81/155 (52%), Gaps = 6/155 (3%)
Query: 36 PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGP-HVSLVVHLLPL 94
P + GF D+ + +E++ D +CPDS+ A+P LK+ QHY P + + L PL
Sbjct: 27 PRRPPGFTLGGGSADA-GVQLESYIDLLCPDSKSAYPGLKKLAQHYEPDELRVRFVLFPL 85
Query: 95 PYHDNAYATSRALHIVNRTNSSATFC-LLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFA 153
PYH +A+AT+ A + T A+F LE + Q+ F+N T++++ VV ++ K A
Sbjct: 86 PYHQHAFATAEAAFAITTTLGDASFTKWLETIYANQDIFWNKVTKDLSPVQVVAKLKKLA 145
Query: 154 AE---GIGNSYSSALESGFSDRSTDLLTRVSFKVT 185
+ + + L +G+ D TR S+K T
Sbjct: 146 QKTFPSLTDEQWDKLMTGYGGTDVDDHTRESWKYT 180
>gi|124088536|ref|XP_001347136.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|145474255|ref|XP_001423150.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|50057525|emb|CAH03509.1| Conserved hypothetical protein [Paramecium tetraurelia]
gi|124390210|emb|CAK55752.1| unnamed protein product [Paramecium tetraurelia]
Length = 214
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/171 (26%), Positives = 76/171 (44%), Gaps = 10/171 (5%)
Query: 19 LLCFFVFNSCSKSQSTP-PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQA 77
+L + C SQ TP P DG ++P ++IEA++D CP SR+++ K
Sbjct: 6 VLLSLILGVCVGSQYTPIPNIPDGLIIGDNP----NLVIEAYYDIFCPGSRESYNIFKTV 61
Query: 78 LQHYGPH-VSLVVHLLPLPYHDNAYATSRALHIVNRTNS-SATFCLLEWFFKQQEKFYNA 135
+ + ++H PLPYH NA+A S + +T S A + E+F +
Sbjct: 62 IDSLEKDSFTFIIHQFPLPYHKNAFAASAGFKYIWKTVSQEAAYKFEGLLLNNLEQFTDL 121
Query: 136 PTQNMTRTAVVKEIVKFAAEGIGN---SYSSALESGFSDRSTDLLTRVSFK 183
T N+ +T V ++I + +Y L S ++ TR S+K
Sbjct: 122 ATLNLKQTEVNQKIADLVKTQLAQYQINYDELLNSMKPGTPENIETRYSWK 172
>gi|123446098|ref|XP_001311803.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121893626|gb|EAX98873.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 199
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 78/163 (47%), Gaps = 8/163 (4%)
Query: 36 PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAW-PPLKQALQHYGPHVSLVVHLLPL 94
PA+ G + +P +++ I++E + DP+CP +AW + + +Q Y V V+H LPL
Sbjct: 15 PARQTGLTW--NP-NANKILVEMYGDPLCPVCLNAWMNGVSKMIQKYQNDVKFVLHFLPL 71
Query: 95 PYHDNAYATSRALHIVNRTNSSATFCLLEWFF-KQQEKFYNAPTQNMTRTAVVKEIVKFA 153
PYH A+ +R + V + + L+ + Q +F N P+ T V +K+A
Sbjct: 72 PYHTWAFVVTRTILAVKQLSEPKAQILVNALYTGGQGQFENDPS--FTENTVTDNCIKYA 129
Query: 154 AEGIGNSYSSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
A+ I + + F+ +L RV FK + T +F
Sbjct: 130 AK-ICQLSEDDIRNAFATIDINLGARVEFKYSTSHGVDGTPYF 171
>gi|301118869|ref|XP_002907162.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262105674|gb|EEY63726.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 298
Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 77/155 (49%), Gaps = 6/155 (3%)
Query: 36 PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHY-GPHVSLVVHLLPL 94
P + GF D+ + +E++ D +CPDS+ A+P LK +HY + + L PL
Sbjct: 23 PRRPPGFTMGGGSADA-GVQLESYIDLLCPDSKSAYPGLKNLAEHYEADELRVRFVLFPL 81
Query: 95 PYHDNAYATSRALHIVNRTNSSATF-CLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFA 153
PYH +A+AT+ A V +F LE + Q+ F+N T++++ VV+++ K A
Sbjct: 82 PYHQHAFATAEAAFTVTTALGDKSFTTWLETIYANQDIFWNKATKDLSPIQVVEKLKKLA 141
Query: 154 AE---GIGNSYSSALESGFSDRSTDLLTRVSFKVT 185
+ + + +G+ D TR S+K T
Sbjct: 142 QKTFPSLTDKQWDEQMTGYGGTDVDDHTRESWKYT 176
>gi|118378878|ref|XP_001022613.1| conserved hypothetical protein [Tetrahymena thermophila]
gi|89304380|gb|EAS02368.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
Length = 225
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 83/177 (46%), Gaps = 14/177 (7%)
Query: 17 SALLCFFVFNSCSKSQSTP-PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPL- 74
S + + S +Q P P K DG + N + + I+A++D CPDS + L
Sbjct: 4 SVFVILSILALASATQYVPIPDKPDGVGFGNV---NATLTIDAWYDLTCPDSLYTYQELI 60
Query: 75 ----KQALQHYGPHVSLVVHLLPLPYHDNAYATSRAL-HIVNRTNSSATFCLLEWFFKQQ 129
Q L Y + + H++PLPYH NA+ ++ ++++ + AT ++ F+ Q
Sbjct: 61 SALNNQTLAAYVNQIRVNYHIMPLPYHYNAFYLAQVFKYVLDVKGAEATLVYVDQIFENQ 120
Query: 130 EKFYNAPTQNMTRTAVVKEIVKFAAEGIG--NSYSSALESGFSDRSTDLLTRVSFKV 184
+YNA N+T T + + I ++ + N S + F+D + R S+K+
Sbjct: 121 PDYYNA--LNLTTTQIYQLIADNTSQYLSAYNVTSQEILDSFNDYDYNGGARASYKL 175
>gi|154420474|ref|XP_001583252.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121917492|gb|EAY22266.1| hypothetical protein TVAG_094410 [Trichomonas vaginalis G3]
Length = 197
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 58/123 (47%), Gaps = 3/123 (2%)
Query: 33 STPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLL 92
S+ P +Y G Y N + I+I+ + DP+C D +WP +K+ LQ+Y + L H +
Sbjct: 10 SSVPKRYPGMLYGNE--NEYKILIDMYCDPLCSDCAYSWPTIKKVLQYYPKDLLLRFHTI 67
Query: 93 PLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFK-QQEKFYNAPTQNMTRTAVVKEIVK 151
PL H +Y + +A+ + + +L+ + Q F N N + +++
Sbjct: 68 PLDIHTWSYHSVKAVQALRLMDEGKAKQMLDKLYDGDQIYFLNTEMFNTSENQAIQKFCS 127
Query: 152 FAA 154
+ A
Sbjct: 128 YVA 130
>gi|299115356|emb|CBN74181.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 262
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/158 (25%), Positives = 69/158 (43%), Gaps = 7/158 (4%)
Query: 32 QSTPPAKYDGFFYA--NHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVV 89
QS P +DG+ + A+ +E + D C D AWP + + + YG +
Sbjct: 47 QSEPAIPFDGYGHVILRQGTLEAALKVEVYLDLACSDCALAWPVMNRVAEAYGDRTEFLY 106
Query: 90 HLLPLPYHDN---AYATSRALHIVNRTNSSATFC-LLEWFFKQQEKFYNAPTQNMTRTAV 145
L PLPYH+N A ++ + + +R + A+ F Q++ N T+ MT+ +
Sbjct: 107 RLFPLPYHNNAFKAAKAAKTIQLYSRGSEDASVASFFTEVFIGQDEISNDSTELMTQPQI 166
Query: 146 VKEIVKFAAEGIGNSYSSALESGFSDRSTDLLTRVSFK 183
+ +A G S + G +D ++ TR FK
Sbjct: 167 EDILESWATSSSGMS-ADNFHMGMADAEVEIQTRNQFK 203
>gi|348690389|gb|EGZ30203.1| hypothetical protein PHYSODRAFT_553285 [Phytophthora sojae]
Length = 225
Score = 56.6 bits (135), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 65/138 (47%), Gaps = 5/138 (3%)
Query: 51 SDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGP-HVSLVVHLLPLPYHDNAYATSRALHI 109
S + +E + D +CP S+ A+ +K HY P V + L PLP+H + + + ++
Sbjct: 39 SAKVQLETYIDLLCPFSKAAYDGVKALADHYAPEQVRVKAVLFPLPFHQHGFTAAESVFT 98
Query: 110 VNRTNSSATFC-LLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALE-- 166
V F LE ++ QE+F+N T++ + V E+ + A + + E
Sbjct: 99 VTSALGDDHFTNWLEVVYEHQEEFWNKATKDKSAVQVTNELKQLAQKTFPDLTDQQWERG 158
Query: 167 -SGFSDRSTDLLTRVSFK 183
+G+ D TRV++K
Sbjct: 159 MTGYGGTEADQHTRVTWK 176
>gi|290978395|ref|XP_002671921.1| predicted protein [Naegleria gruberi]
gi|284085494|gb|EFC39177.1| predicted protein [Naegleria gruberi]
Length = 244
Score = 56.2 bits (134), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 80/176 (45%), Gaps = 21/176 (11%)
Query: 27 SCSKSQSTPP-AKYDGFFYANHPVDSDAII--IEAFFDPVCPDSRDAWPPLKQALQHYGP 83
+C S PP ++ + P +SD+I I+ + D C DS + W + ++L Y
Sbjct: 33 NCKISPIGPPISRLKDSWSTTTPKNSDSIKVQIDLYVDLACSDSAEWWTSVFKSLDSYYF 92
Query: 84 H-------VSLVVHLLPLPYHDNAYATSRALHIVNR---TNSSATFCLLEWFFKQQEKFY 133
+ L +HLLPLPYH ++YATS + +N+ NS L+ F+ Q Y
Sbjct: 93 SNSSSSAPIKLNLHLLPLPYHISSYATSMSYIAINKYTNYNSKVLKELINSFYTNQSPLY 152
Query: 134 NAPTQNMTRTAVVKEIVKFAAEGIG----NSYSSALESGFSDRSTDLLTRVSFKVT 185
N M ++ + K I + + N Y + + G S D+ TR FK++
Sbjct: 153 NNIVSEMKQSEMYKLIYTNNIQSLNVISWNDYLNLMLDG----SIDMRTRTLFKLS 204
>gi|444379557|ref|ZP_21178734.1| hypothetical protein D515_3633 [Enterovibrio sp. AK16]
gi|443676286|gb|ELT82990.1| hypothetical protein D515_3633 [Enterovibrio sp. AK16]
Length = 191
Score = 55.8 bits (133), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 1/104 (0%)
Query: 53 AIIIEAFFDPVCPDSRDAWPPLKQALQHY-GPHVSLVVHLLPLPYHDNAYATSRALHIVN 111
++ I+ F D CP SR WP L ++HY G VSL HL+ L H A+ S L +
Sbjct: 20 SVTIDVFIDIQCPHSRAIWPNLMAVMEHYKGQSVSLKTHLITLSNHRQAWDMSLGLFALA 79
Query: 112 RTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAE 155
++ + + +++Q+ FYN + T + + +A E
Sbjct: 80 EGDAEKFYRFATFLYERQDTFYNGQFLHKTHEDLRNLVADYAVE 123
>gi|340501301|gb|EGR28101.1| hypothetical protein IMG5_183230 [Ichthyophthirius multifiliis]
Length = 227
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 80/172 (46%), Gaps = 11/172 (6%)
Query: 19 LLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQAL 78
+ F+F +CS+ S P K DG N + I +EA++D CPDS+ + L
Sbjct: 9 ITILFLFINCSQYISIPK-KPDGVTIGNK---NSQIKLEAYYDLQCPDSKHSNEQLNILF 64
Query: 79 QH--YGPHVSLVVHLLPLPYHDNAYATSRALHIV-NRTNSSATFCLLEWFFKQQEKFYNA 135
+ Y +++ H PLPYH N+Y S+ L V ++ + A F+ Q+ + N
Sbjct: 65 KEKTYLQNLAFTYHYFPLPYHFNSYHLSQTLRFVYDKYGTQAAVEFTNKVFENQQNYDNN 124
Query: 136 PTQNMTRTAVVKEIVKFAAEGIGNSY---SSALESGFSDRSTDLLTRVSFKV 184
T+ + + ++I + Y SS + +D++ + TR S+K+
Sbjct: 125 KTKTLNLNEIYQKIAD-DTQSFLKEYSIKSSDVIDALNDQNYNQETRQSWKI 175
>gi|291237330|ref|XP_002738588.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 215
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 74/155 (47%), Gaps = 5/155 (3%)
Query: 44 YANHPVDSDAII-IEAFFDPVCPDSRDAWPPLKQALQHYGPHVS-LVVHLLPLPYHDNAY 101
Y H ++DA + I +FD C DS+ + + A +YGP V LV+H L LPY ++
Sbjct: 36 YVYHGGNADAPVHITMYFDSQCSDSKMGYQGVFDAADYYGPDVVYLVLHGLSLPYFRGSF 95
Query: 102 ATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSY 161
+ +AL V++ N + T L+ F Q+ +P ++ ++ + FA E +G +
Sbjct: 96 ISLQALRAVDQLNKTKTVEYLQILFDNQDLISGSP-DTVSDADMIVILTDFAVE-VGFTE 153
Query: 162 SSALESGFSDRSTDLLTRVSFKVTQKFFSFNTSFF 196
LE ++ T+ L R K+ + +F
Sbjct: 154 EEFLEE-YNHPKTNQLCRHEMKMANDRGVYGGPWF 187
>gi|443723886|gb|ELU12105.1| hypothetical protein CAPTEDRAFT_202202 [Capitella teleta]
Length = 208
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 54/112 (48%), Gaps = 1/112 (0%)
Query: 48 PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGP-HVSLVVHLLPLPYHDNAYATSRA 106
P + + IE F + +C DS +W + QHYGP ++++VVH+ LPYH NA+ ++
Sbjct: 34 PSSTGDVTIEVFLELLCFDSDISWGIINDLQQHYGPENLNIVVHIYSLPYHRNAFLFAQG 93
Query: 107 LHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIG 158
L+ V ++ + + F + T + T T +V + A G
Sbjct: 94 LYQVADLAPEFVHDYVDMAYANRPSFGDDATVDETETEIVGRLSVLAESVTG 145
>gi|390343493|ref|XP_003725885.1| PREDICTED: uncharacterized protein LOC100891241 [Strongylocentrotus
purpuratus]
Length = 239
Score = 53.1 bits (126), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/128 (25%), Positives = 57/128 (44%), Gaps = 2/128 (1%)
Query: 29 SKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VSL 87
+ +Q+ P GF Y + D I++EA+ D +CPD++ AW ++ HYG V
Sbjct: 20 ASAQAPIPESPPGFVYKSTSPDV-PILLEAWVDLICPDAKGAWLTYQEVADHYGTDTVRF 78
Query: 88 VVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVK 147
+ PLPY A S+ + N + + F +Q Y+ + T + +
Sbjct: 79 SALMFPLPYDRAAMKASQGSFVARDMNPALVYAWFNNVFDKQAALYDNAIYDKTTEFIDE 138
Query: 148 EIVKFAAE 155
+ +A+E
Sbjct: 139 TLATWASE 146
>gi|134110460|ref|XP_776057.1| hypothetical protein CNBD1050 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50258725|gb|EAL21410.1| hypothetical protein CNBD1050 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 210
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 51/109 (46%), Gaps = 7/109 (6%)
Query: 56 IEAFFDPVCPDSRDAWP-------PLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
+E + DPVCP SR P+ Y V+LVV L P P+H + AL+
Sbjct: 21 LEVYIDPVCPFSRKITESIDKNVLPMITNGGKYDGKVNLVVRLYPQPFHYYSAPIIEALY 80
Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGI 157
+ +TN + L + FYN P ++T +++ ++V+ A E +
Sbjct: 81 VFGQTNPRLFWQYLLAVHSTETTFYNRPAASLTLSSLRDKLVEIAVEQV 129
>gi|440801380|gb|ELR22400.1| glutathione reductase [Acanthamoeba castellanii str. Neff]
Length = 456
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 7/124 (5%)
Query: 19 LLCFFVFNSCSKSQSTPPAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQAL 78
LL V N Q P K + +P S I I F D CP D++P ++Q L
Sbjct: 12 LLALVVVNG----QQLPIPKTAPGMHRGNP--SAPIHITEFADYQCPYCADSYPIVEQVL 65
Query: 79 QHYGP-HVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPT 137
+ YGP + +H+ PL H A+ + A +V + + F Q +FYN+
Sbjct: 66 KLYGPDKIYYTLHIFPLWLHRQAFIVAEAAGVVALNAPDRYWEAASFLFANQAQFYNSAF 125
Query: 138 QNMT 141
QN T
Sbjct: 126 QNKT 129
>gi|325188255|emb|CCA22794.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 237
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 55/113 (48%), Gaps = 2/113 (1%)
Query: 43 FYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGP-HVSLVVHLLPLPYHDNAY 101
F N+ S I +EAF DP+C S+ + P LK HY P V + + PL +H + Y
Sbjct: 50 FRMNNCSLSARIQLEAFIDPLCRSSKFSHPILKNLSTHYKPDEVCIRIVPFPLIFHVHTY 109
Query: 102 ATSRALH-IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFA 153
+ AL+ +VN S LE ++ Q+ F N TR V +++ + A
Sbjct: 110 EVTTALYQVVNALEDSVFVEWLEAIYEVQDVFLYTNGPNFTRLQVREKVHEVA 162
>gi|321262693|ref|XP_003196065.1| hypothetical protein CGB_I1550W [Cryptococcus gattii WM276]
gi|317462540|gb|ADV24278.1| Hypothetical protein CGB_I1550W [Cryptococcus gattii WM276]
Length = 210
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 51/109 (46%), Gaps = 7/109 (6%)
Query: 56 IEAFFDPVCPDSRDAWPPLKQAL-------QHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
+E + DPVCP SR + + + Y V+LVV L P P+H + AL+
Sbjct: 21 LEVYIDPVCPFSRKITESIDRNILPMITNGGKYDGTVNLVVRLYPQPFHYYSAPIIEALY 80
Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGI 157
+ +TN + L FYN P ++T +++ ++V+ A E +
Sbjct: 81 VFGQTNPRLFWQYLLAVHSTGTTFYNRPAASLTLSSLRDKLVEIAVEQV 129
>gi|325188256|emb|CCA22795.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 239
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 57/119 (47%), Gaps = 2/119 (1%)
Query: 43 FYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGP-HVSLVVHLLPLPYHDNAY 101
F N+ S I +EAF DP+C S+ + P LK HY P V + + PL +H + Y
Sbjct: 52 FRMNNCSLSARIQLEAFIDPLCRSSKFSHPILKNLSTHYKPDEVCIRIVPFPLIFHVHTY 111
Query: 102 ATSRALH-IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGN 159
+ AL+ +VN S LE ++ Q+ F N TR V +++ + A + N
Sbjct: 112 EVTTALYQVVNALEDSVFVEWLEAIYEVQDVFLYTNGPNFTRLQVREKVHEVANKTFPN 170
>gi|440801397|gb|ELR22417.1| hypothetical protein ACA1_255150 [Acanthamoeba castellanii str.
Neff]
Length = 185
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 45/89 (50%), Gaps = 1/89 (1%)
Query: 54 IIIEAFFDPVCPDSRDAWPPLKQALQHYGP-HVSLVVHLLPLPYHDNAYATSRALHIVNR 112
I + F D CPD ++P +++ LQHYGP + +H+ PL H A+ + A +V
Sbjct: 10 IHLTEFADYQCPDCGVSYPVVEKVLQHYGPDKIYYTLHVFPLWLHRQAFILAEAAGVVAL 69
Query: 113 TNSSATFCLLEWFFKQQEKFYNAPTQNMT 141
+ ++ F Q +F+N+ +N T
Sbjct: 70 NAPEHYWEAAQFLFANQAQFFNSAYRNKT 98
>gi|323448010|gb|EGB03914.1| hypothetical protein AURANDRAFT_67594 [Aureococcus anophagefferens]
Length = 739
Score = 45.4 bits (106), Expect = 0.013, Method: Composition-based stats.
Identities = 26/65 (40%), Positives = 35/65 (53%), Gaps = 2/65 (3%)
Query: 56 IEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVN-RTN 114
++ F D C DS+ WP LK AL P V + + L PLPY+ + ++A TN
Sbjct: 548 MDVFLDMFCEDSKATWPTLK-ALAAKRPDVDIRIQLFPLPYNFGTWIPAQACTAAALLTN 606
Query: 115 SSATF 119
SSATF
Sbjct: 607 SSATF 611
>gi|452002015|gb|EMD94474.1| hypothetical protein COCHEDRAFT_1094426 [Cochliobolus
heterostrophus C5]
Length = 211
Score = 45.1 bits (105), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 55/124 (44%), Gaps = 7/124 (5%)
Query: 38 KYDGFFYANHPVDSDAI-IIEAFFDPVCPDSRDAW-----PPLKQAL-QHYGPHVSLVVH 90
K+ G A+ + A+ +E + D VCP S + PPL+ L YG + +
Sbjct: 6 KFAGQKLASSSIQPKAVHTLELYLDYVCPFSAKMYKTLFTPPLRNTLLSTYGTSLVTIFR 65
Query: 91 LLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIV 150
P+H ++ A + V R ++S + E F+QQ F++ N TR + +
Sbjct: 66 QQIQPWHPSSTLVHEAAYAVQRVDASKFYDFSEKLFEQQNDFFDVSLVNETRNQTYRRLA 125
Query: 151 KFAA 154
K AA
Sbjct: 126 KIAA 129
>gi|405119955|gb|AFR94726.1| hypothetical protein CNAG_01417 [Cryptococcus neoformans var.
grubii H99]
Length = 212
Score = 44.3 bits (103), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 49/107 (45%), Gaps = 7/107 (6%)
Query: 56 IEAFFDPVCPDSRDAWP-------PLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
+E + DPVCP SR P+ Y V+L++ L P P+H + AL+
Sbjct: 21 LEVYVDPVCPFSRKITESIDKNVLPMITNGGKYDGKVNLIMRLYPQPFHYYSAPIIEALY 80
Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAE 155
+ +TN + L FYN P ++T +++ ++V+ A +
Sbjct: 81 VFGQTNPRLFWQYLLAVHSTGTTFYNRPAASLTLSSLRDKLVEIAVQ 127
>gi|451853644|gb|EMD66937.1| hypothetical protein COCSADRAFT_83566 [Cochliobolus sativus ND90Pr]
Length = 211
Score = 43.9 bits (102), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 55/124 (44%), Gaps = 7/124 (5%)
Query: 38 KYDGFFYANHPVDSDAI-IIEAFFDPVCPDSRDAW-----PPLKQAL-QHYGPHVSLVVH 90
K+ G A+ + A+ +E + D VCP S + PPL+ L YG + +
Sbjct: 6 KFAGQKLASSSIQPKAVHTLEFYLDYVCPFSAKMYKTLFTPPLRNTLLSTYGTSLVTIFR 65
Query: 91 LLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIV 150
P+H ++ A + V R ++ + E F+QQ+ F++ N TR + +
Sbjct: 66 QQIQPWHPSSTLVHEAAYAVQRVDAGKFYDFSEKLFEQQKDFFDVSLVNETRNQTYRRLA 125
Query: 151 KFAA 154
K AA
Sbjct: 126 KIAA 129
>gi|443734729|gb|ELU18601.1| hypothetical protein CAPTEDRAFT_216673 [Capitella teleta]
Length = 256
Score = 43.5 bits (101), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 26/112 (23%), Positives = 50/112 (44%), Gaps = 2/112 (1%)
Query: 51 SDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIV 110
S + +E F D C S+ +W + +G + +V+ P PYH A+ ++ +V
Sbjct: 37 SHPLCLEVFLDLNCEYSKKSWMTMMDLSVVFGERLEIVIQQFPQPYHTFAFILTQGYFLV 96
Query: 111 NRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEI--VKFAAEGIGNS 160
++ A + + F N T N+T V++ + ++F A GI +
Sbjct: 97 HKEAPEAILNYTDLVYDHLASFNNENTFNLTTKEVIEILADLQFEATGINQT 148
>gi|325284363|ref|YP_004256903.1| DSBA oxidoreductase [Deinococcus proteolyticus MRP]
gi|324316427|gb|ADY27540.1| DSBA oxidoreductase [Deinococcus proteolyticus MRP]
Length = 216
Score = 43.5 bits (101), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 34/132 (25%), Positives = 57/132 (43%), Gaps = 10/132 (7%)
Query: 48 PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRAL 107
P D+ I+E FFDP C P L +Q Y V LV PL H N+ + +
Sbjct: 54 PADAKVTIVE-FFDPECESCAAVEPALMDVMQKYNGEVRLVARYFPL--HSNSTLAAGLI 110
Query: 108 HIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALES 167
+ ++ + + ++ F++Q ++ +TA + + + AE +G S A +
Sbjct: 111 EAAAQDSADKRWRMRDYLFQKQREW------GEQQTAQTDKFLDY-AEDMGLDRSKAQAT 163
Query: 168 GFSDRSTDLLTR 179
S DLL R
Sbjct: 164 MESAAVRDLLAR 175
>gi|407977028|ref|ZP_11157921.1| DSBA oxidoreductase [Nitratireductor indicus C115]
gi|407427548|gb|EKF40239.1| DSBA oxidoreductase [Nitratireductor indicus C115]
Length = 219
Score = 43.5 bits (101), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 49/110 (44%), Gaps = 16/110 (14%)
Query: 12 TLILQSALLCFFVFNSCS---------KSQSTPPAKYDGFFYANHPV----DSDAIIIEA 58
+++L + L+ VF + +Q PA+ D A+ PV D+ I+E
Sbjct: 5 SIVLLTGLVALIVFAGGAFIYDRYDGLSAQPVAPAQGDALVRAHSPVIGPADAPVTIVE- 63
Query: 59 FFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
FFDP C R +P +KQ + ++ LV+ P+HD + R L
Sbjct: 64 FFDPSCEACRAFYPAVKQIMANFPAETRLVIRY--APFHDGSDEAVRILE 111
>gi|396471125|ref|XP_003838796.1| hypothetical protein LEMA_P024690.1 [Leptosphaeria maculans JN3]
gi|312215365|emb|CBX95317.1| hypothetical protein LEMA_P024690.1 [Leptosphaeria maculans JN3]
Length = 210
Score = 43.1 bits (100), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 54/124 (43%), Gaps = 7/124 (5%)
Query: 38 KYDGFFYANHPVDSDAI-IIEAFFDPVCPDSRDAW------PPLKQALQHYGPHVSLVVH 90
K+ G A+ + A+ IE + D VCP S + P K L+ Y V +
Sbjct: 6 KFAGQKLASSAIQPKAVHTIEIYLDYVCPFSAKIFKTLYCSPLRKTLLEKYNDRVVTIFR 65
Query: 91 LLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIV 150
P+H ++ A + V + + + + E F+QQ+ F++A N TR A + +
Sbjct: 66 QQIQPWHPSSTLVHEAAYAVQKVDPAKFYPFSEKLFEQQKDFFDASVVNETRNATYRRLA 125
Query: 151 KFAA 154
K A
Sbjct: 126 KIAG 129
>gi|126659892|ref|ZP_01731017.1| glutathione reductase [Cyanothece sp. CCY0110]
gi|126618855|gb|EAZ89599.1| glutathione reductase [Cyanothece sp. CCY0110]
Length = 191
Score = 43.1 bits (100), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 44/91 (48%), Gaps = 1/91 (1%)
Query: 54 IIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VSLVVHLLPLPYHDNAYATSRALHIVNR 112
I IE FFD CP SR W + + + Y + L++ + L H ++ ++A +V +
Sbjct: 21 IQIEMFFDLECPFSRKGWQTILKVFKAYDAQTIYLILQPMTLGNHRQSWDATKAAIVVAQ 80
Query: 113 TNSSATFCLLEWFFKQQEKFYNAPTQNMTRT 143
N+ + + F Q +F N ++ T+T
Sbjct: 81 DNTEKFVDFVSYLFDHQPEFANEAFKDKTQT 111
>gi|443722837|gb|ELU11539.1| hypothetical protein CAPTEDRAFT_228382 [Capitella teleta]
Length = 253
Score = 42.4 bits (98), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 25/97 (25%), Positives = 42/97 (43%), Gaps = 3/97 (3%)
Query: 36 PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLP 95
P + G +N S + +E F D C DS AW +KQ + + +++H PLP
Sbjct: 48 PEQTPGIRISN---GSSQVRLEVFLDLNCADSAHAWGVIKQLRDTFPGRLEVLLHSYPLP 104
Query: 96 YHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKF 132
YH A+ ++ L V ++ F ++F
Sbjct: 105 YHPYAFLCAQGLFAVQHMAPERVESYIDTVFDNMDQF 141
>gi|255077788|ref|XP_002502476.1| predicted protein [Micromonas sp. RCC299]
gi|226517741|gb|ACO63734.1| predicted protein [Micromonas sp. RCC299]
Length = 203
Score = 41.6 bits (96), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 53/118 (44%), Gaps = 3/118 (2%)
Query: 45 ANHPVDSDAIIIEAFFDPVCPDSRDAWPPLK-QALQHYGPHVSLVVHLLPLPYH-DNAYA 102
A P+D+ + + A+ D CP S + + Q L HYG V V + P P+H ++
Sbjct: 14 AKGPMDA-PVTLSAWLDYACPFSAKLFKTVTTQVLPHYGDKVRFVFYHQPQPWHPQSSML 72
Query: 103 TSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNS 160
A+ + + A + F FY+A T + +R+ + +E+ AA G S
Sbjct: 73 HEAAIGVYDLGGVDAFWKFSAALFDAATDFYDANTYDKSRSKIYEELAALAARSAGVS 130
>gi|172055207|ref|YP_001806534.1| hypothetical protein cce_5122 [Cyanothece sp. ATCC 51142]
gi|354556960|ref|ZP_08976239.1| DSBA oxidoreductase [Cyanothece sp. ATCC 51472]
gi|171701488|gb|ACB54468.1| unknown [Cyanothece sp. ATCC 51142]
gi|353551086|gb|EHC20503.1| DSBA oxidoreductase [Cyanothece sp. ATCC 51472]
Length = 191
Score = 41.2 bits (95), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 23/91 (25%), Positives = 43/91 (47%), Gaps = 1/91 (1%)
Query: 54 IIIEAFFDPVCPDSRDAWPPLKQALQHYGPH-VSLVVHLLPLPYHDNAYATSRALHIVNR 112
I IE FFD CP S+ W + + L+ Y + L++ + L H ++ ++A V +
Sbjct: 21 IQIEMFFDLECPFSKKGWQTILKVLKAYDAETIYLILQPMTLSNHRQSWDATKAAMTVAQ 80
Query: 113 TNSSATFCLLEWFFKQQEKFYNAPTQNMTRT 143
N+ + + F Q +F ++ T+T
Sbjct: 81 DNAEKFVDFVSYLFDHQSEFNEKAFKDKTQT 111
>gi|262202544|ref|YP_003273752.1| DSBA oxidoreductase [Gordonia bronchialis DSM 43247]
gi|262085891|gb|ACY21859.1| DSBA oxidoreductase [Gordonia bronchialis DSM 43247]
Length = 214
Score = 40.8 bits (94), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 27/52 (51%), Gaps = 1/52 (1%)
Query: 48 PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDN 99
P +S A +E F D C A+P ++Q Q YG V+ VV PLP H N
Sbjct: 52 PAESKATFVE-FLDFECEGCGAAYPAVEQLRQTYGDQVTFVVRYFPLPGHFN 102
>gi|330934811|ref|XP_003304718.1| hypothetical protein PTT_17367 [Pyrenophora teres f. teres 0-1]
gi|311318600|gb|EFQ87219.1| hypothetical protein PTT_17367 [Pyrenophora teres f. teres 0-1]
Length = 214
Score = 40.4 bits (93), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 55/125 (44%), Gaps = 7/125 (5%)
Query: 38 KYDGFFYANHPVDSDAI-IIEAFFDPVCPDSRDAW-----PPLKQAL-QHYGPHVSLVVH 90
K+ G A+ + A+ +E + D VCP S + PL+Q L Y P ++ +
Sbjct: 6 KFAGQKLASSAISPKAVHTLEIYLDYVCPFSAKIFNTIYNTPLRQTLLTTYSPTLTTIFR 65
Query: 91 LLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIV 150
P+H ++ A + V + + +A + F Q F++A N TR A K +
Sbjct: 66 QQIQPWHPSSTLVHEAAYAVQKLSPAAFWPYSALLFTHQAAFFDANVVNETRNATYKRLA 125
Query: 151 KFAAE 155
K A E
Sbjct: 126 KLAGE 130
>gi|262202577|ref|YP_003273785.1| DSBA oxidoreductase [Gordonia bronchialis DSM 43247]
gi|262085924|gb|ACY21892.1| DSBA oxidoreductase [Gordonia bronchialis DSM 43247]
Length = 214
Score = 40.4 bits (93), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 34/133 (25%), Positives = 51/133 (38%), Gaps = 24/133 (18%)
Query: 48 PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATS--- 104
P +S A +E F D C A+P ++Q Q YG V+ VV PLP H NA +
Sbjct: 52 PSESKATFVE-FLDFECEGCGAAYPAVEQLRQTYGDQVTFVVRYFPLPGHFNADRAARAV 110
Query: 105 -------------RALHIVNRTNSSATFCLLEWFFK-------QQEKFYNAPTQNMTRTA 144
R + + R+ L + FF E+F A TR
Sbjct: 111 AAAAEQGQFEPMYRKMFVTQRSWGEQRVPLDDLFFSYAQELGLNMERFAAAYDSQATREL 170
Query: 145 VVKEIVKFAAEGI 157
+ +++ A G+
Sbjct: 171 IDRDVADGKALGV 183
>gi|409392046|ref|ZP_11243682.1| putative oxidoreductase [Gordonia rubripertincta NBRC 101908]
gi|403198122|dbj|GAB86916.1| putative oxidoreductase [Gordonia rubripertincta NBRC 101908]
Length = 214
Score = 40.0 bits (92), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 27/52 (51%), Gaps = 1/52 (1%)
Query: 48 PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDN 99
P +S A +E F D C A+P ++Q Q YG V+ VV PLP H N
Sbjct: 52 PTESKATFVE-FLDFECEGCGAAYPAVEQLRQTYGDQVTFVVRYFPLPGHFN 102
>gi|149911666|ref|ZP_01900275.1| dsbA-like thioredoxin domain protein [Moritella sp. PE36]
gi|149805247|gb|EDM65263.1| dsbA-like thioredoxin domain protein [Moritella sp. PE36]
Length = 222
Score = 40.0 bits (92), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 41/99 (41%), Gaps = 6/99 (6%)
Query: 14 ILQSALLCFFVFNSCSKSQSTPPAKYDGFFYANHPV----DSDAIIIEAFFDPVCPDSRD 69
I+ A+LC V + S D Y+ + + + I FFDP C R
Sbjct: 18 IISIAVLCIAVVITALSFTSKESINIDQHLYSQNDLVFGNKYAKVTIVEFFDPACESCRA 77
Query: 70 AWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
+P +K L+ Y V+L+V P+ +H N AL
Sbjct: 78 FYPLVKSQLKKYKGKVNLIVR--PVAFHRNVGPVVAALE 114
>gi|40445317|ref|NP_954777.1| hypothetical protein [Gordonia westfalica]
gi|40217347|emb|CAE09098.1| hypothetical protein [Gordonia westfalica]
Length = 214
Score = 40.0 bits (92), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 27/52 (51%), Gaps = 1/52 (1%)
Query: 48 PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDN 99
P +S A +E F D C A+P ++Q Q YG V+ VV PLP H N
Sbjct: 52 PTESKATFVE-FLDFECEGCGAAYPAVEQLRQTYGDQVTFVVRYFPLPGHFN 102
>gi|119854999|ref|YP_935604.1| DSBA oxidoreductase [Mycobacterium sp. KMS]
gi|145225999|ref|YP_001136653.1| DSBA oxidoreductase [Mycobacterium gilvum PYR-GCK]
gi|119697717|gb|ABL94789.1| DSBA oxidoreductase [Mycobacterium sp. KMS]
gi|145218462|gb|ABP47865.1| DSBA oxidoreductase [Mycobacterium gilvum PYR-GCK]
Length = 221
Score = 40.0 bits (92), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 25/51 (49%), Gaps = 1/51 (1%)
Query: 50 DSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNA 100
DSD +E F D C R A+P ++Q YG V+ V P+P H N
Sbjct: 59 DSDVTFVE-FLDFECEGCRAAFPAVEQLRAQYGQQVTFVARYFPMPGHFNG 108
>gi|268316621|ref|YP_003290340.1| DSBA oxidoreductase [Rhodothermus marinus DSM 4252]
gi|262334155|gb|ACY47952.1| DSBA oxidoreductase [Rhodothermus marinus DSM 4252]
Length = 409
Score = 39.7 bits (91), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 58/149 (38%), Gaps = 22/149 (14%)
Query: 26 NSCSKSQSTPPAKY-DGFFYANHPV----DSDAIIIEAFFDPVCPDSRDAWPPLKQALQH 80
+CS PP +Y N P D+ ++IE + DP CP + P +KQ ++
Sbjct: 223 QTCSFDTERPPVRYFRDLIMMNDPTAGNPDAKVVVIE-YLDPNCPHCKHLHPIMKQVVES 281
Query: 81 YGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQ--------EKF 132
YG V P+P + AL+ R +LE F++Q E
Sbjct: 282 YGLQAYFVFK--PIPLWQFSIPQVAALYAAAREGKFEA--MLEAQFERQRSGGLTLEEIL 337
Query: 133 YNAPTQNMTRTAVVKEIVKFAAEGIGNSY 161
A M R + ++I EG+ N Y
Sbjct: 338 DIAEAIGMDRNELARQI----NEGVFNEY 362
>gi|189203275|ref|XP_001937973.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187985072|gb|EDU50560.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 214
Score = 39.7 bits (91), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 55/125 (44%), Gaps = 7/125 (5%)
Query: 38 KYDGFFYANHPVDSDAI-IIEAFFDPVCPDSRDAW-----PPLKQAL-QHYGPHVSLVVH 90
K+ G A+ + A+ +E + D VCP S + PL+Q L Y P ++ +
Sbjct: 6 KFAGQKLASSAISPKAVHTLEIYLDYVCPFSAKLFNTIYNTPLRQTLLSTYSPTLNTIFR 65
Query: 91 LLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIV 150
P+H ++ A + V + + +A + F Q F++A N TR A K +
Sbjct: 66 QQIQPWHPSSTLVHEAAYAVQKLSPAAFWPYSALLFAHQATFFDANVVNETRNATYKRLA 125
Query: 151 KFAAE 155
K A E
Sbjct: 126 KLAGE 130
>gi|254423276|ref|ZP_05036994.1| hypothetical protein S7335_3432 [Synechococcus sp. PCC 7335]
gi|196190765|gb|EDX85729.1| hypothetical protein S7335_3432 [Synechococcus sp. PCC 7335]
Length = 191
Score = 39.7 bits (91), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 28/122 (22%), Positives = 54/122 (44%), Gaps = 4/122 (3%)
Query: 36 PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHY-GPHVSLVVHLLPL 94
P++ G+ N + I++E FFD CP S+ W + Q Y + V + L
Sbjct: 6 PSRRSGYRIGNA---NAPIMVEVFFDLECPFSKKCWDTVMQVKAAYTAEQLYWVFQPMSL 62
Query: 95 PYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAA 154
H ++ ++A V+ ++ + + F +Q +F N ++ T+T + + AA
Sbjct: 63 GNHRQSWDATKAAIAVSDADTQKFIDFVSYLFGKQPEFANEAWKDKTQTEFHTFLAECAA 122
Query: 155 EG 156
E
Sbjct: 123 EA 124
>gi|169603159|ref|XP_001795001.1| hypothetical protein SNOG_04588 [Phaeosphaeria nodorum SN15]
gi|111067228|gb|EAT88348.1| hypothetical protein SNOG_04588 [Phaeosphaeria nodorum SN15]
Length = 210
Score = 39.7 bits (91), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 55/124 (44%), Gaps = 7/124 (5%)
Query: 38 KYDGFFYANHPVDSDAI-IIEAFFDPVCPDSRDAWPP-----LKQAL-QHYGPHVSLVVH 90
K+ G A+ + A+ +E + D VCP S+ + L+Q L + YG V +
Sbjct: 6 KFAGQKLASSSIQPKAVHTLELYLDYVCPFSQKMFKTVYNTNLRQTLLEKYGDRVVTIFR 65
Query: 91 LLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIV 150
P+H ++ A + V + + + + F QQ++F++ N TR A K +
Sbjct: 66 QQIQPWHPSSTLVHEAGYAVQKVDPTRFYDFSAALFHQQKEFFDVNVVNETRNATYKRLA 125
Query: 151 KFAA 154
K A
Sbjct: 126 KIAG 129
>gi|290979057|ref|XP_002672251.1| predicted protein [Naegleria gruberi]
gi|284085826|gb|EFC39507.1| predicted protein [Naegleria gruberi]
Length = 244
Score = 39.7 bits (91), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 27/114 (23%), Positives = 47/114 (41%), Gaps = 12/114 (10%)
Query: 57 EAFFDPVCPDSRDAWPPLKQALQHY---------GPHVSLVVHLLPLPYHDNAYATSRAL 107
E F D CPD + + + L Y V +HL+PLP H + +++
Sbjct: 64 EMFLDLTCPDCATHFQSITKPLVQYYFNNGTSGNQDKVVFSLHLMPLPMHIAGFYSAQTY 123
Query: 108 HIV---NRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIG 158
IV ++ + + + L+ FF Q NA +NM + + I + +G
Sbjct: 124 SIVSKFSKADRNVCWKFLDLFFSNQSPASNANLKNMNQAQIFNLIYATYVQPLG 177
>gi|145223389|ref|YP_001134067.1| DSBA oxidoreductase [Mycobacterium gilvum PYR-GCK]
gi|145215875|gb|ABP45279.1| DSBA oxidoreductase [Mycobacterium gilvum PYR-GCK]
Length = 219
Score = 39.7 bits (91), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 25/51 (49%), Gaps = 1/51 (1%)
Query: 50 DSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNA 100
DSD +E F D C R A+P ++Q YG V+ V P+P H N
Sbjct: 59 DSDVTFVE-FLDFECEGCRAAFPAVEQLRAQYGQQVTFVARYFPMPGHFNG 108
>gi|260221388|emb|CBA29900.1| hypothetical protein Csp_A14470 [Curvibacter putative symbiont of
Hydra magnipapillata]
Length = 218
Score = 39.3 bits (90), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 3/61 (4%)
Query: 48 PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRAL 107
PV++ I+E FFDP C R+ +P +K+ L+ Y V LVV P+H N+ + L
Sbjct: 55 PVNAPVTIVE-FFDPACETCREFYPIVKELLKKYPNDVRLVVRY--APFHRNSDLVVKML 111
Query: 108 H 108
Sbjct: 112 E 112
>gi|452957998|gb|EME63354.1| DSBA oxidoreductase [Amycolatopsis decaplanina DSM 44594]
Length = 222
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 37/85 (43%), Gaps = 2/85 (2%)
Query: 46 NHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSR 105
+ P S ++E F D C A+P +++ YG V+ V+ P+P H NA +R
Sbjct: 57 SDPAGSKVTVVE-FLDLECEACGAAFPGVERLRAEYGDRVTFVMRYFPIPSHRNAELAAR 115
Query: 106 ALHIVNRTNSSATFCLLEWFFKQQE 130
A+ + L F KQ E
Sbjct: 116 AVEAAGKQGKLEPMYRL-MFEKQPE 139
>gi|449017941|dbj|BAM81343.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 279
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 54/129 (41%), Gaps = 15/129 (11%)
Query: 43 FYANHPVDSDAIIIEAFFDPVCPDSRDAWPPL-KQALQHY--GPHVSLVVHLLPLPYHDN 99
FY P +++E F D CP S+ A+ L ++ L HY V L+P P+H
Sbjct: 85 FYGERPYPQ--VLVEVFLDYNCPYSKKAFLTLWQEVLPHYRADSRVQFAFGLVPQPWHAQ 142
Query: 100 AYATSRALHIVNRTNSSATFCLLEW-----FFKQQEKFYNAPTQNMTRTAVVKEIVK-FA 153
A + + A F W F QQE + + + TR V +E+ + A
Sbjct: 143 TCYMHEAALAARQVSGDAGF----WKASAALFAQQEDWVDEKVLDKTRRQVYEELAQCLA 198
Query: 154 AEGIGNSYS 162
G+ +S S
Sbjct: 199 KAGVVSSAS 207
>gi|353238665|emb|CCA70604.1| hypothetical protein PIIN_04541 [Piriformospora indica DSM 11827]
Length = 207
Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 5/108 (4%)
Query: 56 IEAFFDPVCP----DSRDAWPPLKQALQ-HYGPHVSLVVHLLPLPYHDNAYATSRALHIV 110
+E F+D VCP +R +K LQ Y V ++V P P+H ++ A V
Sbjct: 22 LEFFWDYVCPYSAKSARAVDSIVKPLLQDKYPGQVKIIVRPHPQPWHSSSTLVHEAGLAV 81
Query: 111 NRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIG 158
+ A + F QE++Y+ PT T + ++V+ A G
Sbjct: 82 AKVAPDAYWSFSLALFNAQEEYYDVPTSTQTPNEIRTKLVQLAKTSAG 129
>gi|169861885|ref|XP_001837576.1| hypothetical protein CC1G_08130 [Coprinopsis cinerea okayama7#130]
gi|116501305|gb|EAU84200.1| hypothetical protein CC1G_08130 [Coprinopsis cinerea okayama7#130]
Length = 205
Score = 38.1 bits (87), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 28/109 (25%), Positives = 47/109 (43%), Gaps = 7/109 (6%)
Query: 56 IEAFFDPVCPDS-------RDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
++ F D VCP S + PL + Y V ++ L P+H + T A
Sbjct: 22 LDIFLDYVCPFSAKMALAIENILKPLVEQGGQYEGKVKVIFRLQVQPWHSASTLTHEAAL 81
Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGI 157
V R + + FK Q +++ PT N+T + +++V AA+ I
Sbjct: 82 AVLRASPENFWTYSLALFKNQSDYFDIPTANLTPLQIREKLVALAAQVI 130
>gi|58266834|ref|XP_570573.1| hypothetical protein CND05280 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57226806|gb|AAW43266.1| hypothetical protein CND05280 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 200
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 39/77 (50%)
Query: 81 YGPHVSLVVHLLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNM 140
Y V+LVV L P P+H + AL++ +TN + L + FYN P ++
Sbjct: 8 YDGKVNLVVRLYPQPFHYYSAPIIEALYVFGQTNPRLFWQYLLAVHSTETTFYNRPAASL 67
Query: 141 TRTAVVKEIVKFAAEGI 157
T +++ ++V+ A E +
Sbjct: 68 TLSSLRDKLVEIAVEQV 84
>gi|358450068|ref|ZP_09160540.1| outer membrane protein [Marinobacter manganoxydans MnI7-9]
gi|357225729|gb|EHJ04222.1| outer membrane protein [Marinobacter manganoxydans MnI7-9]
Length = 225
Score = 37.7 bits (86), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 30/61 (49%), Gaps = 3/61 (4%)
Query: 48 PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRAL 107
P D+ II+E FFDP C R +P +KQ L Y V LV+ +H + SR L
Sbjct: 64 PADAPVIIVE-FFDPSCEACRAFYPIVKQILAEYPGQVRLVLRY--TMFHQGSEEVSRIL 120
Query: 108 H 108
Sbjct: 121 E 121
>gi|340030885|ref|ZP_08666948.1| DSBA oxidoreductase [Paracoccus sp. TRP]
Length = 219
Score = 37.4 bits (85), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 27/110 (24%), Positives = 48/110 (43%), Gaps = 16/110 (14%)
Query: 12 TLILQSALLCFFVFNS---------CSKSQSTPPAKYDGFFYANHPV----DSDAIIIEA 58
+++L + L+ VF + +Q P + D A+ P+ D+ I+E
Sbjct: 5 SVVLFTGLVALIVFAGGAFLYDRYGSTSAQPVPAPQGDAMTRAHSPIIGPADAPVTIVE- 63
Query: 59 FFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
FFDP C R +P +KQ + ++ LV+ P+H+ + R L
Sbjct: 64 FFDPSCEACRAFYPAVKQIMANFPGETRLVIRY--APFHEGSDEAVRILE 111
>gi|449549563|gb|EMD40528.1| hypothetical protein CERSUDRAFT_44584 [Ceriporiopsis subvermispora
B]
Length = 205
Score = 37.4 bits (85), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 30/126 (23%), Positives = 51/126 (40%), Gaps = 7/126 (5%)
Query: 56 IEAFFDPVCPDSRD-------AWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
++ F D VCP S+ PL Y V ++ P+H ++ T A
Sbjct: 22 LDVFLDFVCPYSKKLVFAIDAVLRPLLGPGGKYDGKVHIIFRPQVQPWHSSSTFTHEAAL 81
Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESG 168
R + FK+QE+F++ PT +T + +++ AA IG A +
Sbjct: 82 AALRVAPDKFWAFTLALFKRQEEFFDIPTSTLTPLQIREKLAHIAAGVIGEQNVPAFKDL 141
Query: 169 FSDRST 174
+ +ST
Sbjct: 142 LALKST 147
>gi|326331477|ref|ZP_08197767.1| putative thioredoxin domain protein (DSBA) [Nocardioidaceae
bacterium Broad-1]
gi|325950733|gb|EGD42783.1| putative thioredoxin domain protein (DSBA) [Nocardioidaceae
bacterium Broad-1]
Length = 223
Score = 37.0 bits (84), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 19/64 (29%), Positives = 31/64 (48%), Gaps = 1/64 (1%)
Query: 51 SDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIV 110
SD +++E F D C A+P ++ + YG V+ V PLP H N+ +R++
Sbjct: 64 SDVVLVE-FLDFECEACGAAYPIVEDLREKYGDQVTFVARYFPLPGHFNSERAARSVESA 122
Query: 111 NRTN 114
R
Sbjct: 123 ARQG 126
>gi|116751066|ref|YP_847753.1| DSBA oxidoreductase [Syntrophobacter fumaroxidans MPOB]
gi|116700130|gb|ABK19318.1| DSBA oxidoreductase [Syntrophobacter fumaroxidans MPOB]
Length = 339
Score = 37.0 bits (84), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 29/59 (49%), Gaps = 1/59 (1%)
Query: 48 PVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRA 106
P D+ ++E F D CP R +K+ H+G V LV PL H NA+ ++A
Sbjct: 177 PADAPVTLVE-FSDYQCPACRATQEGVKKVKSHFGDRVRLVFKDYPLKRHKNAHLAAQA 234
>gi|303321632|ref|XP_003070810.1| hypothetical protein CPC735_039290 [Coccidioides posadasii C735
delta SOWgp]
gi|240110507|gb|EER28665.1| hypothetical protein CPC735_039290 [Coccidioides posadasii C735
delta SOWgp]
gi|320040291|gb|EFW22224.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 209
Score = 36.6 bits (83), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 28/128 (21%), Positives = 54/128 (42%), Gaps = 5/128 (3%)
Query: 36 PAKYDGFFYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQAL-----QHYGPHVSLVVH 90
P KY G + + +E + D VCP S + ++ + Y ++ ++
Sbjct: 4 PPKYAGLKLFSSGITEQRHTLEIYLDYVCPYSAKFFDTFYNSVIPIIRKKYRSYLQVIFR 63
Query: 91 LLPLPYHDNAYATSRALHIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIV 150
P+H ++ T A +V + S + E FK Q+++++A N TR K +
Sbjct: 64 PQVQPWHPSSTLTQEAALVVLKLEPSKFWDFSEALFKAQKEYFDANVVNETRNHTYKRLA 123
Query: 151 KFAAEGIG 158
A++ G
Sbjct: 124 ALASKVTG 131
>gi|384566815|ref|ZP_10013919.1| protein-disulfide isomerase [Saccharomonospora glauca K62]
gi|384522669|gb|EIE99864.1| protein-disulfide isomerase [Saccharomonospora glauca K62]
Length = 223
Score = 36.6 bits (83), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 15/86 (17%)
Query: 28 CSKSQSTPPAKYDGFFYANHPV-------------DSDAIIIEAFFDPVCPDSRDAWPPL 74
++ + PPAK DG +AN + D A ++E F D C A+P +
Sbjct: 28 VTRPDTAPPAKADGK-HANAELLVRPDSHRLSSASDDKATLVE-FLDFECEACGAAYPAV 85
Query: 75 KQALQHYGPHVSLVVHLLPLPYHDNA 100
+Q + Y ++ VV P+P H NA
Sbjct: 86 EQLREEYAGRLTYVVRYFPIPSHPNA 111
>gi|406900080|gb|EKD43163.1| DsbA oxidoreductase [uncultured bacterium]
Length = 251
Score = 36.6 bits (83), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 36/77 (46%)
Query: 54 IIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALHIVNRT 113
+ I F D CP ++ P L++ +Q YG V L++ P + + A A +
Sbjct: 95 VTIVEFVDFKCPYTKAEAPILREVMQKYGNKVKLIIRNFPPSDNFHPGANQFANIAMCAY 154
Query: 114 NSSATFCLLEWFFKQQE 130
+ L +WFF+QQE
Sbjct: 155 QQGYYWPLHDWFFEQQE 171
>gi|389744505|gb|EIM85688.1| hypothetical protein STEHIDRAFT_81225 [Stereum hirsutum FP-91666
SS1]
Length = 205
Score = 36.6 bits (83), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 29/126 (23%), Positives = 51/126 (40%), Gaps = 7/126 (5%)
Query: 56 IEAFFDPVCPDSR-------DAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
++ F D VCP S + PL Y HV +++ P+H + T A
Sbjct: 22 LDIFLDYVCPFSAKLARSIDNVLKPLLGPGGKYDGHVKVIIRPQVQPWHATSTYTHEAGL 81
Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIGNSYSSALESG 168
V R + F++QE+F++ P +T + +++ AAE I A +
Sbjct: 82 AVARVAPEFYWPFSVRLFERQEEFFDIPASTLTPLQIREKLAALAAEVIPADKVDAFKEA 141
Query: 169 FSDRST 174
+S+
Sbjct: 142 LKLKSS 147
>gi|336374051|gb|EGO02389.1| hypothetical protein SERLA73DRAFT_86683 [Serpula lacrymans var.
lacrymans S7.3]
gi|336386970|gb|EGO28116.1| hypothetical protein SERLADRAFT_355115 [Serpula lacrymans var.
lacrymans S7.9]
Length = 207
Score = 36.2 bits (82), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 27/109 (24%), Positives = 45/109 (41%), Gaps = 7/109 (6%)
Query: 56 IEAFFDPVCPDS-------RDAWPPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRALH 108
++ F D VCP S + PL A Y V ++ L P+H ++ T A
Sbjct: 22 LDIFLDYVCPYSAKLSLAINNVLKPLLDAGGKYDGKVKAIIRLQVQPWHASSTFTHEAGL 81
Query: 109 IVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGI 157
V R + + FK Q F++ P+ +T + ++ AA+ I
Sbjct: 82 AVARVSPENFWPFSVLLFKSQTSFFDVPSSTLTPLQIRAQLASLAAQVI 130
>gi|145221502|ref|YP_001132180.1| DSBA oxidoreductase [Mycobacterium gilvum PYR-GCK]
gi|145213988|gb|ABP43392.1| DSBA oxidoreductase [Mycobacterium gilvum PYR-GCK]
Length = 234
Score = 36.2 bits (82), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 32/76 (42%), Gaps = 10/76 (13%)
Query: 33 STPPAKYDGF---------FYANHPVDSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGP 83
S PP + DG N +DS +E F D C R +P ++Q YG
Sbjct: 47 SEPPIELDGSAGQAVRENSHRLNAVLDSPVYFVE-FLDFECEGCRAVYPEIEQLRAEYGD 105
Query: 84 HVSLVVHLLPLPYHDN 99
V+ V+ PLP H N
Sbjct: 106 RVNFVLRYFPLPGHFN 121
>gi|162457290|ref|YP_001619657.1| hypothetical protein sce9005 [Sorangium cellulosum So ce56]
gi|161167872|emb|CAN99177.1| putative membrane protein [Sorangium cellulosum So ce56]
Length = 403
Score = 36.2 bits (82), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 33/77 (42%), Gaps = 12/77 (15%)
Query: 42 FFYANHPV------------DSDAIIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVV 89
F + +HPV + I I AF D CP R P L + + YG V V
Sbjct: 179 FLWGSHPVIPPLPPELAALQEPGKITIVAFTDFECPFCRRLHPELSKIEEPYGDRVRHVR 238
Query: 90 HLLPLPYHDNAYATSRA 106
++PLP H A ++A
Sbjct: 239 KMVPLPNHPGALPAAKA 255
>gi|325000049|ref|ZP_08121161.1| DSBA oxidoreductase [Pseudonocardia sp. P1]
Length = 216
Score = 36.2 bits (82), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 21/46 (45%)
Query: 54 IIIEAFFDPVCPDSRDAWPPLKQALQHYGPHVSLVVHLLPLPYHDN 99
+ + F D CP P + Q +Q YG V VV PLP H N
Sbjct: 60 VTLVEFLDFQCPGCGQLQPIMSQLVQQYGDRVEFVVRDFPLPIHPN 105
>gi|422293098|gb|EKU20398.1| hypothetical protein NGA_0546800, partial [Nannochloropsis gaditana
CCMP526]
Length = 199
Score = 36.2 bits (82), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 26/105 (24%), Positives = 44/105 (41%), Gaps = 3/105 (2%)
Query: 54 IIIEAFFDPVCPDSRDAWPPLKQA--LQHYGPH-VSLVVHLLPLPYHDNAYATSRALHIV 110
+++E F D CP SR + L + ++ Y V ++ H + P+H A V
Sbjct: 24 VVVEVFLDLNCPPSRRCFKTLYEGGIIEAYKDKAVQILFHNVIQPWHAVGAWMHEAALAV 83
Query: 111 NRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAE 155
SA + F E+F + T+ MTR V+ ++ E
Sbjct: 84 KDLQPSAFYPYANKVFDAMERFSDVNTEEMTRKEVMAALIALGVE 128
>gi|350296986|gb|EGZ77963.1| hypothetical protein NEUTE2DRAFT_101434 [Neurospora tetrasperma
FGSC 2509]
Length = 219
Score = 35.8 bits (81), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 27/111 (24%), Positives = 43/111 (38%), Gaps = 8/111 (7%)
Query: 56 IEAFFDPVCPDSRDAW--------PPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRAL 107
+E F D VCP S + P L+ G V + P+H ++ T A
Sbjct: 31 VEIFLDYVCPFSAKIYNTLYTTLLPSLRSEHADLGSKVQFIFRHQIQPWHPSSTLTHEAG 90
Query: 108 HIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIG 158
V R + + FK Q+ +++ N TR K + K A++ G
Sbjct: 91 LAVQRLAPTKFWNFSAALFKDQKAYFDVSLVNETRNETYKRLAKLASQSAG 141
>gi|85119630|ref|XP_965678.1| hypothetical protein NCU02547 [Neurospora crassa OR74A]
gi|28927490|gb|EAA36442.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 219
Score = 35.8 bits (81), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 27/111 (24%), Positives = 43/111 (38%), Gaps = 8/111 (7%)
Query: 56 IEAFFDPVCPDSRDAW--------PPLKQALQHYGPHVSLVVHLLPLPYHDNAYATSRAL 107
+E F D VCP S + P L+ G V + P+H ++ T A
Sbjct: 31 VEIFLDYVCPFSAKIYNTLYTTLLPSLRSEHADLGSKVQFIFRHQIQPWHPSSTLTHEAG 90
Query: 108 HIVNRTNSSATFCLLEWFFKQQEKFYNAPTQNMTRTAVVKEIVKFAAEGIG 158
V R + + FK Q+ +++ N TR K + K A++ G
Sbjct: 91 LAVQRLAPTKFWDFSAALFKDQKAYFDVSLVNETRNETYKRLAKLASQSAG 141
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.133 0.409
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,971,901,255
Number of Sequences: 23463169
Number of extensions: 113617061
Number of successful extensions: 369952
Number of sequences better than 100.0: 165
Number of HSP's better than 100.0 without gapping: 92
Number of HSP's successfully gapped in prelim test: 73
Number of HSP's that attempted gapping in prelim test: 369758
Number of HSP's gapped (non-prelim): 177
length of query: 196
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 61
effective length of database: 9,191,667,552
effective search space: 560691720672
effective search space used: 560691720672
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 73 (32.7 bits)