BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 030060
(183 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|426021772|sp|D9J034.1|WHY2_SOLTU RecName: Full=Single-stranded DNA-bindig protein WHY2,
mitochondrial; AltName: Full=Protein WHIRLY 2;
Short=StWHY2; Flags: Precursor
gi|298359665|gb|ADI77438.1| Why2 protein [Solanum tuberosum]
Length = 238
Score = 204 bits (518), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 107/179 (59%), Positives = 134/179 (74%), Gaps = 6/179 (3%)
Query: 1 MMKLSRSLLSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAK-GSLGGRI 59
M+K+SR LL R+QL K L GE V+ + H + + AG ST +V A G GR+
Sbjct: 1 MLKVSR-LLHPRNQLLHKKLPGEC--VKGSIWQHAINTFAGFSTVRQNVVADAGKREGRV 57
Query: 60 FAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFA 119
FAPY V+KGKAA S +P LPTF +LDSG +K+ R+GVI+LTF P++GERKYDW K+Q FA
Sbjct: 58 FAPYSVFKGKAALSAEPRLPTFNRLDSGGVKLNRRGVIMLTFWPSVGERKYDWEKRQLFA 117
Query: 120 LSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISLICIASS 176
LS TEVGSL++MG RDSSEFFHDP+MLSSNAGQ+RKSLSIK NAD G+FISL + ++
Sbjct: 118 LSATEVGSLISMGTRDSSEFFHDPSMLSSNAGQVRKSLSIKPNADGSGYFISLSVVNNN 176
>gi|449447529|ref|XP_004141520.1| PREDICTED: single-stranded DNA-bindig protein WHY2,
mitochondrial-like [Cucumis sativus]
Length = 241
Score = 197 bits (501), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 99/177 (55%), Positives = 132/177 (74%), Gaps = 6/177 (3%)
Query: 1 MMKLSRSLLSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVS--AKGSLGGR 58
MMKL+R L SR+QL E+++ +A YV L SH S AG+S + + + + GGR
Sbjct: 1 MMKLTR--LFSRNQLFEQIVWKKAGYVGHPLGSHPFSSNAGISDSTQNFTRTVTKNAGGR 58
Query: 59 IFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHF 118
+FA Y+VYKGKAA S++P +PTF K++SG+ + R+G I+LTFAPA+GERKYDW +KQ F
Sbjct: 59 VFASYHVYKGKAALSMEPCMPTFTKVESGNFIMDRRGSIMLTFAPAVGERKYDWTRKQLF 118
Query: 119 ALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISLICI 173
ALS TE+GSL+++GPRDS EFFHDP MLSS AGQ+RKSL+IKA+ D G+F SL +
Sbjct: 119 ALSATEIGSLISLGPRDSCEFFHDPGMLSSTAGQVRKSLAIKAHTDGNGYFFSLNVV 175
>gi|118484514|gb|ABK94132.1| unknown [Populus trichocarpa]
Length = 229
Score = 192 bits (488), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 102/175 (58%), Positives = 128/175 (73%), Gaps = 13/175 (7%)
Query: 1 MMKLSRSLLSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGGRIF 60
MMKLSR L +R+ + G+ VRDG H L QA +ST G K S+ R+F
Sbjct: 1 MMKLSRFLNFNRNAV------GKPTDVRDGSALHALTFQASISTGG-----KSSIQDRVF 49
Query: 61 APYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFAL 120
APY V+KGKAA SV+PVLPTF K SG+L+V R+G ++LTF PAIGERKYD+ K+Q FAL
Sbjct: 50 APYSVFKGKAALSVEPVLPTFSKFGSGNLRVDRRGSMMLTFLPAIGERKYDYEKRQKFAL 109
Query: 121 SPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISLICI 173
S TEVGSL++ GP+DS EFFHDP+MLSSNAGQ+RK+LSIK +AD G+F+SL +
Sbjct: 110 SATEVGSLISTGPKDSCEFFHDPSMLSSNAGQVRKNLSIKPHADGSGYFVSLSVV 164
>gi|356573153|ref|XP_003554728.1| PREDICTED: uncharacterized protein LOC100817863 [Glycine max]
Length = 235
Score = 190 bits (482), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 103/176 (58%), Positives = 129/176 (73%), Gaps = 9/176 (5%)
Query: 1 MMKLSRSL---LSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGG 57
M KLSR L +SR +L E +L+ V D L SH AG+ST ++ +AKG
Sbjct: 1 MFKLSRMLPLTSTSRHRLLE-VLSSRKVEVGDRL-SH----SAGISTATNNYAAKGYASD 54
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
RIFAPY VYKGKAAFS+ P LPTF KL+SG + V R+G I++TF +IGERKYDW K+Q
Sbjct: 55 RIFAPYTVYKGKAAFSLIPCLPTFTKLNSGTVVVDRRGSIMMTFMHSIGERKYDWEKRQR 114
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADGFFISLICI 173
FALS TEVGSL+TMG +DS +FFHDP+MLSSNAGQ+RKSLSIK +A+G+F+SL +
Sbjct: 115 FALSATEVGSLITMGAQDSCDFFHDPSMLSSNAGQVRKSLSIKPHANGYFVSLTVV 170
>gi|255637711|gb|ACU19178.1| unknown [Glycine max]
Length = 235
Score = 189 bits (481), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 103/176 (58%), Positives = 129/176 (73%), Gaps = 9/176 (5%)
Query: 1 MMKLSRSL---LSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGG 57
M KLSR L +SR +L E +L+ V D L SH AG+ST ++ +AKG
Sbjct: 1 MFKLSRMLPLTSTSRHRLLE-VLSSRKVEVGDRL-SH----SAGISTVTNNYAAKGYASD 54
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
RIFAPY VYKGKAAFS+ P LPTF KL+SG + V R+G I++TF +IGERKYDW K+Q
Sbjct: 55 RIFAPYTVYKGKAAFSLIPCLPTFTKLNSGTVVVDRRGSIMMTFMHSIGERKYDWEKRQR 114
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADGFFISLICI 173
FALS TEVGSL+TMG +DS +FFHDP+MLSSNAGQ+RKSLSIK +A+G+F+SL +
Sbjct: 115 FALSATEVGSLITMGAQDSCDFFHDPSMLSSNAGQVRKSLSIKPHANGYFVSLTVV 170
>gi|22330568|ref|NP_177282.2| protein WHIRLY 2 [Arabidopsis thaliana]
gi|75161474|sp|Q8VYF7.1|WHY2_ARATH RecName: Full=Single-stranded DNA-binding protein WHY2,
mitochondrial; AltName: Full=Protein WHIRLY 2;
Short=AtWHY2; Flags: Precursor
gi|18175814|gb|AAL59932.1| unknown protein [Arabidopsis thaliana]
gi|21689867|gb|AAM67494.1| unknown protein [Arabidopsis thaliana]
gi|225898076|dbj|BAH30370.1| hypothetical protein [Arabidopsis thaliana]
gi|332197060|gb|AEE35181.1| protein WHIRLY 2 [Arabidopsis thaliana]
Length = 238
Score = 188 bits (478), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 103/172 (59%), Positives = 130/172 (75%), Gaps = 7/172 (4%)
Query: 1 MMKLSRSLLSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGGRIF 60
MMK +RSLLS RS + EA+ +R G S S G G D +AK S GR+F
Sbjct: 1 MMKQARSLLS-RSLCDQSKSLFEASTLR-GFASWSNSSTPGRGFPGKD-AAKPS--GRLF 55
Query: 61 APYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFAL 120
APY ++KGKAA SV+PVLP+F ++DSG+L++ R+G +++TF PAIGERKYDW KKQ FAL
Sbjct: 56 APYSIFKGKAALSVEPVLPSFTEIDSGNLRIDRRGSLMMTFMPAIGERKYDWEKKQKFAL 115
Query: 121 SPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISL 170
SPTEVGSL++MG +DSSEFFHDP+M SSNAGQ+RKSLS+K +AD G+FISL
Sbjct: 116 SPTEVGSLISMGSKDSSEFFHDPSMKSSNAGQVRKSLSVKPHADGSGYFISL 167
>gi|357512363|ref|XP_003626470.1| hypothetical protein MTR_7g116270 [Medicago truncatula]
gi|355501485|gb|AES82688.1| hypothetical protein MTR_7g116270 [Medicago truncatula]
gi|388497124|gb|AFK36628.1| unknown [Medicago truncatula]
Length = 226
Score = 188 bits (477), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 103/176 (58%), Positives = 123/176 (69%), Gaps = 18/176 (10%)
Query: 1 MMKLSRSLLSS-RSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGGRI 59
M+K SR L SS R+ L E L Y RD S A T ++ SAKG RI
Sbjct: 5 MLKFSRMLHSSSRNHLLEVL------YARD-------FSTA----TNNNYSAKGYTSDRI 47
Query: 60 FAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFA 119
FAPY VYKGKAAFS+ P LPTF KLDSG L V R G I+++F PAIGERKYDW K+Q FA
Sbjct: 48 FAPYSVYKGKAAFSLSPCLPTFTKLDSGALVVDRHGSIMMSFMPAIGERKYDWEKRQIFA 107
Query: 120 LSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADGFFISLICIAS 175
LS TEVGSL+ +GP+DS EFFHDP+M SSNAGQ+RKSLSIK +++G+F+SL + S
Sbjct: 108 LSATEVGSLIAIGPQDSCEFFHDPSMKSSNAGQVRKSLSIKPHSNGYFVSLSVVNS 163
>gi|12323827|gb|AAG51881.1|AC016162_2 unknown protein; 79476-81015 [Arabidopsis thaliana]
Length = 237
Score = 186 bits (472), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 102/171 (59%), Positives = 129/171 (75%), Gaps = 7/171 (4%)
Query: 2 MKLSRSLLSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGGRIFA 61
MK +RSLLS RS + EA+ +R G S S G G D +AK S GR+FA
Sbjct: 1 MKQARSLLS-RSLCDQSKSLFEASTLR-GFASWSNSSTPGRGFPGKD-AAKPS--GRLFA 55
Query: 62 PYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFALS 121
PY ++KGKAA SV+PVLP+F ++DSG+L++ R+G +++TF PAIGERKYDW KKQ FALS
Sbjct: 56 PYSIFKGKAALSVEPVLPSFTEIDSGNLRIDRRGSLMMTFMPAIGERKYDWEKKQKFALS 115
Query: 122 PTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISL 170
PTEVGSL++MG +DSSEFFHDP+M SSNAGQ+RKSLS+K +AD G+FISL
Sbjct: 116 PTEVGSLISMGSKDSSEFFHDPSMKSSNAGQVRKSLSVKPHADGSGYFISL 166
>gi|388498336|gb|AFK37234.1| unknown [Lotus japonicus]
Length = 235
Score = 185 bits (470), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 104/178 (58%), Positives = 125/178 (70%), Gaps = 16/178 (8%)
Query: 1 MMKLSRSLLS-SRSQLSE-KLLAGEAN-YVRDGLKSHVLISQAGMSTTGHDVSAKGSLGG 57
M+KL R L S SR+ + E ++L+ N VRD S A T AKG
Sbjct: 1 MLKLLRVLPSTSRNWVLEMEVLSSSRNPQVRD-------FSTAARET----YVAKGYTTD 49
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
R+FAPYYVYKGKAA S+ PVLPTF KLDSG L V+R+G I++ F PAIGERKYDW K+Q
Sbjct: 50 RVFAPYYVYKGKAAMSLSPVLPTFTKLDSGALVVERRGSIMMVFTPAIGERKYDWEKRQK 109
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIK--ANADGFFISLICI 173
FALS TEVGSL+ MGP+DS EFFHDP+M SSNAGQ+RKSLSIK AN+ G+F+SL +
Sbjct: 110 FALSATEVGSLIAMGPQDSCEFFHDPSMSSSNAGQVRKSLSIKPHANSSGYFVSLTVV 167
>gi|297841891|ref|XP_002888827.1| ATWHY2 [Arabidopsis lyrata subsp. lyrata]
gi|297334668|gb|EFH65086.1| ATWHY2 [Arabidopsis lyrata subsp. lyrata]
Length = 242
Score = 185 bits (470), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 100/178 (56%), Positives = 130/178 (73%), Gaps = 15/178 (8%)
Query: 1 MMKLSRSLLS------SRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGS 54
MMK +R+LLS S+S E++ +A+ +R G S S G +G D
Sbjct: 1 MMKQARTLLSRSLCDHSKSLFEERV---KASTLR-GFASWSSSSTPGRGFSGKDAPKPS- 55
Query: 55 LGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAK 114
GR+FAPY ++KGKAA SV+PVLP+F ++DSG+L++ R+G +++TF PAIGERKYDW K
Sbjct: 56 --GRLFAPYSIFKGKAALSVEPVLPSFTEIDSGNLRIDRRGSLMMTFMPAIGERKYDWEK 113
Query: 115 KQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISL 170
KQ FALSPTEVGSL++MG +DSSEFFHDP+M SSNAGQ+RKSLSIK +AD G+FISL
Sbjct: 114 KQKFALSPTEVGSLISMGSKDSSEFFHDPSMKSSNAGQVRKSLSIKPHADGSGYFISL 171
>gi|225459963|ref|XP_002267315.1| PREDICTED: uncharacterized protein LOC100258449 [Vitis vinifera]
gi|297734756|emb|CBI16990.3| unnamed protein product [Vitis vinifera]
Length = 231
Score = 182 bits (463), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 94/174 (54%), Positives = 122/174 (70%), Gaps = 2/174 (1%)
Query: 1 MMKLSRSLLSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGGRIF 60
MMKL + LL SR+ LSE LL G+ +R+ H S+ +ST + KG+ R++
Sbjct: 1 MMKL-KQLLQSRTHLSENLLHGKPGDIRNPSWLHAFTSRVSLSTATDHFADKGNYPDRVY 59
Query: 61 APYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFAL 120
APY VYKGKA+ +V PVLP F +LDSG LKV R GV++L F+PA+GERKYDW KKQ FAL
Sbjct: 60 APYCVYKGKASLTVYPVLPKFSRLDSGGLKVDRHGVMMLQFSPAVGERKYDWEKKQFFAL 119
Query: 121 SPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG-FFISLICI 173
S EVGSLL++ P EFFHDP+M +SNAGQ+RKSLS+K+ G +F+SL +
Sbjct: 120 SAVEVGSLLSLSPGGGCEFFHDPSMKTSNAGQVRKSLSVKSMDGGSYFLSLSVV 173
>gi|302566179|pdb|3N1H|A Chain A, Crystal Structure Of Stwhy2
gi|302566180|pdb|3N1I|A Chain A, Crystal Structure Of A Stwhy2-Ere32 Complex
gi|302566182|pdb|3N1J|A Chain A, Crystal Structure Of A Stwhy2-Dt32 Complex
gi|302566184|pdb|3N1K|A Chain A, Crystal Structure Of A Stwhy2-Cere32 Complex
gi|302566186|pdb|3N1L|A Chain A, Crystal Structure Of A Stwhy2-Rcere32 Complex
Length = 178
Score = 181 bits (459), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 86/126 (68%), Positives = 105/126 (83%), Gaps = 2/126 (1%)
Query: 53 GSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDW 112
G GR+FAPY V+KGKAA S +P LPTF +LDSG +K+ R+GVI+LTF P++GERKYDW
Sbjct: 5 GKREGRVFAPYSVFKGKAALSAEPRLPTFNRLDSGGVKLNRRGVIMLTFWPSVGERKYDW 64
Query: 113 AKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISL 170
K+Q FALS TEVGSL++MG RDSSEFFHDP+MLSSNAGQ+RKSLSIK NAD G+FISL
Sbjct: 65 EKRQLFALSATEVGSLISMGTRDSSEFFHDPSMLSSNAGQVRKSLSIKPNADGSGYFISL 124
Query: 171 ICIASS 176
+ ++
Sbjct: 125 SVVNNN 130
>gi|255632067|gb|ACU16386.1| unknown [Glycine max]
Length = 264
Score = 180 bits (457), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 85/134 (63%), Positives = 107/134 (79%)
Query: 40 AGMSTTGHDVSAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILL 99
A +ST ++ +AKG RIFAPY VYKGKAAFS+ P LPTF KLDSG + V R+G I++
Sbjct: 44 AAISTATNNYAAKGHASDRIFAPYTVYKGKAAFSLIPCLPTFTKLDSGTVVVDRRGSIMM 103
Query: 100 TFAPAIGERKYDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSI 159
+F +IGERKYDW K+Q FALS TEVGSL+TM +DS +FFHDP+MLSSNAGQ+RKSLSI
Sbjct: 104 SFMHSIGERKYDWDKRQKFALSATEVGSLITMDAQDSCDFFHDPSMLSSNAGQVRKSLSI 163
Query: 160 KANADGFFISLICI 173
K +A+G+F+SL +
Sbjct: 164 KPHANGYFVSLTVV 177
>gi|347948612|pdb|3R9Y|A Chain A, Crystal Structure Of Stwhy2 K67a (Form I)
gi|347948613|pdb|3R9Z|A Chain A, Crystal Structure Of Stwhy2 K67a (Form Ii)
gi|347948614|pdb|3RA0|A Chain A, Crystal Structure Of A Stwhy2 K67a-Dt32 Complex
Length = 178
Score = 179 bits (454), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 85/126 (67%), Positives = 104/126 (82%), Gaps = 2/126 (1%)
Query: 53 GSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDW 112
G GR+FAPY V+KG AA S +P LPTF +LDSG +K+ R+GVI+LTF P++GERKYDW
Sbjct: 5 GKREGRVFAPYSVFKGAAALSAEPRLPTFNRLDSGGVKLNRRGVIMLTFWPSVGERKYDW 64
Query: 113 AKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISL 170
K+Q FALS TEVGSL++MG RDSSEFFHDP+MLSSNAGQ+RKSLSIK NAD G+FISL
Sbjct: 65 EKRQLFALSATEVGSLISMGTRDSSEFFHDPSMLSSNAGQVRKSLSIKPNADGSGYFISL 124
Query: 171 ICIASS 176
+ ++
Sbjct: 125 SVVNNN 130
>gi|449533266|ref|XP_004173597.1| PREDICTED: single-stranded DNA-bindig protein WHY2,
mitochondrial-like, partial [Cucumis sativus]
Length = 156
Score = 176 bits (446), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 80/124 (64%), Positives = 102/124 (82%), Gaps = 2/124 (1%)
Query: 49 VSAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGER 108
+S + GGR+FA YYVYKGKAA S++P +PTF K++SG+ + R+G I+LTFAPA+GER
Sbjct: 3 LSVTKNAGGRVFASYYVYKGKAALSMEPCMPTFTKVESGNFIMDRRGSIMLTFAPAVGER 62
Query: 109 KYDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GF 166
KYDW +KQ FALS TE+GSL+++GPRDS EFFHDP MLSS AGQ+RKSL+IKA+ D G+
Sbjct: 63 KYDWTRKQLFALSATEIGSLISLGPRDSCEFFHDPGMLSSTAGQVRKSLAIKAHTDGNGY 122
Query: 167 FISL 170
F SL
Sbjct: 123 FFSL 126
>gi|224070977|ref|XP_002303313.1| predicted protein [Populus trichocarpa]
gi|222840745|gb|EEE78292.1| predicted protein [Populus trichocarpa]
Length = 183
Score = 170 bits (430), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 80/118 (67%), Positives = 100/118 (84%), Gaps = 2/118 (1%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
R+FAPY V+KGKAA SV+PVLPTF K SG+L+V R+G ++LTF PAIGERKYD+ K+Q
Sbjct: 1 RVFAPYSVFKGKAALSVEPVLPTFSKFGSGNLRVDRRGSMMLTFLPAIGERKYDYEKRQK 60
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISLICI 173
FALS TEVGSL++ GP+DS EFFHDP+MLSSNAGQ+RK+LSIK +AD G+F+SL +
Sbjct: 61 FALSATEVGSLISTGPKDSCEFFHDPSMLSSNAGQVRKNLSIKPHADGSGYFVSLSVV 118
>gi|302399107|gb|ADL36848.1| WHY domain class transcription factor [Malus x domestica]
Length = 237
Score = 164 bits (414), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 85/175 (48%), Positives = 112/175 (64%), Gaps = 8/175 (4%)
Query: 1 MMKLSRSLLSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGGRIF 60
M+K+ R L SS ++ RD + ST S KG +++
Sbjct: 1 MLKVLRVLSSSTTKFRSHFC------TRDASSMYAYTHITRFSTATQKFSVKGPSSHQVY 54
Query: 61 APYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFAL 120
A + ++KGKAA S+ PVLPTF KL+SG L V R+G ++L F PAIGERKYDW K+Q FAL
Sbjct: 55 ASFDIFKGKAALSLTPVLPTFTKLESGSLVVDRRGSVMLKFTPAIGERKYDWEKRQMFAL 114
Query: 121 SPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISLICI 173
S TEVG+L+++G DS E FHDP+M SSNAGQ+RKSLSIK +AD G+F+SL +
Sbjct: 115 SATEVGALISLGSNDSCELFHDPSMKSSNAGQVRKSLSIKPHADGSGYFVSLTVV 169
>gi|147819709|emb|CAN74120.1| hypothetical protein VITISV_034895 [Vitis vinifera]
Length = 185
Score = 157 bits (398), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 79/145 (54%), Positives = 100/145 (68%), Gaps = 1/145 (0%)
Query: 1 MMKLSRSLLSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGGRIF 60
MMKL + LL SR+ LSE LL G+ +R+ H S+ +ST + KG+ R++
Sbjct: 1 MMKL-KQLLQSRTHLSENLLHGKPGDIRNPSWLHAFTSRVSLSTATDHFADKGNYPDRVY 59
Query: 61 APYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFAL 120
APY VYKGKA+ +V PVLP F +LDSG LKV R GV++L F+PA+GERKYDW KKQ FAL
Sbjct: 60 APYCVYKGKASLTVYPVLPKFSRLDSGGLKVDRHGVMMLQFSPAVGERKYDWEKKQFFAL 119
Query: 121 SPTEVGSLLTMGPRDSSEFFHDPAM 145
S EVGSLL++ P EFFHDP+M
Sbjct: 120 SAVEVGSLLSLSPGGGCEFFHDPSM 144
>gi|115444353|ref|NP_001045956.1| Os02g0158400 [Oryza sativa Japonica Group]
gi|50251252|dbj|BAD28032.1| putative Chain C, Structure Of The Plant Transcriptional Regulator
Pbf-2 [Oryza sativa Japonica Group]
gi|50252182|dbj|BAD28177.1| putative Chain C, Structure Of The Plant Transcriptional Regulator
Pbf-2 [Oryza sativa Japonica Group]
gi|113535487|dbj|BAF07870.1| Os02g0158400 [Oryza sativa Japonica Group]
gi|215692593|dbj|BAG88013.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704516|dbj|BAG94149.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740785|dbj|BAG96941.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218190103|gb|EEC72530.1| hypothetical protein OsI_05924 [Oryza sativa Indica Group]
gi|222622212|gb|EEE56344.1| hypothetical protein OsJ_05450 [Oryza sativa Japonica Group]
Length = 228
Score = 153 bits (387), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 81/175 (46%), Positives = 119/175 (68%), Gaps = 15/175 (8%)
Query: 1 MMKLSRSLLSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGGRIF 60
M +LSR + SS ++++ ++D L S L Q +ST +A + GR F
Sbjct: 1 MQRLSRFVPSSSRRVTD---------LKDALWSGSLTFQHALST----FAADENTSGRKF 47
Query: 61 APYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFAL 120
A Y V+KGKAA S+ P+LP+F KL+SG +V + G ++LTF PA+G+RKYD++KKQ FAL
Sbjct: 48 ASYTVFKGKAALSMQPILPSFSKLESGGSRVNKNGSVMLTFFPAVGQRKYDYSKKQLFAL 107
Query: 121 SPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIK--ANADGFFISLICI 173
SPTEVGSL+++GP +S EFFHDP+M SS+ GQ++KSLS+ N G+F+++ +
Sbjct: 108 SPTEVGSLISLGPAESCEFFHDPSMKSSHEGQVKKSLSVTPLGNDSGYFLNITVL 162
>gi|357148896|ref|XP_003574931.1| PREDICTED: uncharacterized protein LOC100825843 [Brachypodium
distachyon]
Length = 231
Score = 153 bits (386), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 79/175 (45%), Positives = 120/175 (68%), Gaps = 13/175 (7%)
Query: 1 MMKLSRSLLSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGGRIF 60
M++LSR L S+ ++S+ ++D L S + + +ST+ +V S R F
Sbjct: 1 MLRLSRFLPSTSRKVSD---------LKDVLWSGSVTFEHALSTSAANVDENAS--ARKF 49
Query: 61 APYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFAL 120
A Y V+KGKAA S+ P+LP F K++SG +VK+ G ++LTF PA+G+R+YD++KKQ FAL
Sbjct: 50 ASYTVFKGKAALSISPILPNFTKIESGGSRVKKNGSVMLTFFPAVGQRQYDYSKKQLFAL 109
Query: 121 SPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIK--ANADGFFISLICI 173
SPTEVGSL+++G +S EFFHDP+M SS+ GQ++KSLSI N +G+F+++ +
Sbjct: 110 SPTEVGSLISLGSAESCEFFHDPSMKSSHEGQVKKSLSITPLGNDNGYFVNITVL 164
>gi|194703090|gb|ACF85629.1| unknown [Zea mays]
gi|323388661|gb|ADX60135.1| PBF-2 like transcription factor [Zea mays]
gi|323388771|gb|ADX60190.1| PBF-2 like transcription factor [Zea mays]
Length = 232
Score = 152 bits (384), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 109/153 (71%), Gaps = 4/153 (2%)
Query: 27 VRDGLKSHVLISQAGMSTTGHDVSAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDS 86
+++ L S L Q +ST ++ G+L G+ FA Y V+KGKAA S+ P+LP+F KL+S
Sbjct: 18 LKESLWSGSLTFQQAVSTAATNL--DGNLSGKKFASYTVFKGKAALSIHPILPSFSKLES 75
Query: 87 GDLKVKRKGVILLTFAPAIGERKYDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAML 146
G +V + G ++LTF PA+G+RKYD+ KKQ FALSPTEVGSL+++GP +S EFFHDP+M
Sbjct: 76 GGSRVSKNGSVMLTFFPAVGQRKYDYTKKQLFALSPTEVGSLISLGPAESCEFFHDPSMK 135
Query: 147 SSNAGQMRKSLSIK--ANADGFFISLICIASSH 177
SSN G ++KSLSI + G+F+++ + S+
Sbjct: 136 SSNEGTVKKSLSITPLGSDSGYFVNITVVNSAE 168
>gi|195627490|gb|ACG35575.1| DNA binding protein [Zea mays]
Length = 230
Score = 152 bits (383), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 82/182 (45%), Positives = 118/182 (64%), Gaps = 19/182 (10%)
Query: 1 MMKLSRSLLSSRSQ---LSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGG 57
M++LSR L S+ + L E L G + Q +ST ++ G+L G
Sbjct: 1 MLRLSRFLPSACRRGFDLKESLWCGSLTF------------QQAVSTAATNL--DGNLSG 46
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
+ FA Y V+KGKAA S+ P+LP+F KL+SG +V + G ++LTF PA+G+RKYD+ KKQ
Sbjct: 47 KKFASYTVFKGKAALSIHPILPSFSKLESGGSRVSKNGSVMLTFFPAVGQRKYDYTKKQL 106
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIK--ANADGFFISLICIAS 175
FALSPTEVGSL+++GP +S EFFHDP+M SSN G ++KSLSI + G+F+++ + S
Sbjct: 107 FALSPTEVGSLISLGPAESCEFFHDPSMKSSNEGTVKKSLSITPLGSDSGYFVNITVVNS 166
Query: 176 SH 177
+
Sbjct: 167 AE 168
>gi|413926543|gb|AFW66475.1| DNA binding protein isoform 1 [Zea mays]
gi|413926544|gb|AFW66476.1| DNA binding protein isoform 2 [Zea mays]
Length = 274
Score = 151 bits (382), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 82/182 (45%), Positives = 119/182 (65%), Gaps = 19/182 (10%)
Query: 1 MMKLSRSLLSSRSQ---LSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGG 57
M++LSR L S+ + L E L +G + Q +ST ++ G+L G
Sbjct: 43 MLRLSRFLPSACRRGFDLKESLWSGSLTF------------QQAVSTAATNL--DGNLSG 88
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
+ FA Y V+KGKAA S+ P+LP+F KL+SG +V + G ++LTF PA+G+RKYD+ KKQ
Sbjct: 89 KKFASYTVFKGKAALSIHPILPSFSKLESGGSRVSKNGSVMLTFFPAVGQRKYDYTKKQL 148
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIK--ANADGFFISLICIAS 175
FALSPTEVGSL+++GP +S EFFHDP+M SSN G ++KSLSI + G+F+++ + S
Sbjct: 149 FALSPTEVGSLISLGPAESCEFFHDPSMKSSNEGTVKKSLSITPLGSDSGYFVNITVVNS 208
Query: 176 SH 177
+
Sbjct: 209 AE 210
>gi|242064094|ref|XP_002453336.1| hypothetical protein SORBIDRAFT_04g004060 [Sorghum bicolor]
gi|241933167|gb|EES06312.1| hypothetical protein SORBIDRAFT_04g004060 [Sorghum bicolor]
Length = 230
Score = 150 bits (379), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 82/180 (45%), Positives = 117/180 (65%), Gaps = 19/180 (10%)
Query: 1 MMKLSRSLLSSRSQ---LSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGG 57
M++LSR L S+ + L E LL+G + Q +ST+ ++ +L
Sbjct: 1 MLRLSRFLPSASRRGFDLKESLLSGSLTF------------QQAVSTSAANI--DDNLSS 46
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
+ +A Y V+KGKAA S+ P+LP+F KL+SG +V R G I+LTF PA+G RKYD+ KKQ
Sbjct: 47 KKYASYTVFKGKAALSIQPILPSFSKLESGGSRVSRNGSIMLTFFPAVGPRKYDFTKKQL 106
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIK--ANADGFFISLICIAS 175
FALSPTEVGSL+++GP +S EFFHDP+M SSN G ++KSLSI + G+F+++ + S
Sbjct: 107 FALSPTEVGSLISLGPAESCEFFHDPSMKSSNEGMVKKSLSITPLGSDSGYFVNITVVNS 166
>gi|226506170|ref|NP_001152589.1| LOC100286229 [Zea mays]
gi|195657845|gb|ACG48390.1| DNA binding protein [Zea mays]
Length = 232
Score = 148 bits (374), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 74/153 (48%), Positives = 108/153 (70%), Gaps = 4/153 (2%)
Query: 27 VRDGLKSHVLISQAGMSTTGHDVSAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDS 86
+++ L S L Q +ST ++ G+L G+ FA Y V+KGKAA S+ P+L +F KL+S
Sbjct: 18 LKESLWSGSLTFQQAVSTAATNL--DGNLSGKKFASYTVFKGKAALSIHPILXSFSKLES 75
Query: 87 GDLKVKRKGVILLTFAPAIGERKYDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAML 146
G +V + G ++LTF PA+G+RKYD+ KKQ FALSPTEVGSL+++GP +S EFFHDP+M
Sbjct: 76 GGSRVSKNGSVMLTFFPAVGQRKYDYTKKQLFALSPTEVGSLISLGPAESCEFFHDPSMK 135
Query: 147 SSNAGQMRKSLSIK--ANADGFFISLICIASSH 177
SSN G ++KSLSI + G+F+++ + S+
Sbjct: 136 SSNEGTVKKSLSITPLGSDSGYFVNITVVNSAE 168
>gi|448278892|gb|AGE44298.1| whirly transcription factor domain containing protein [Musa AB
Group]
Length = 245
Score = 148 bits (373), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 74/135 (54%), Positives = 95/135 (70%), Gaps = 2/135 (1%)
Query: 41 GMSTTGHDVSAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLT 100
G S+ S GS R + Y V+KGKAA SV P+LPTF ++DSG +V +KG ++LT
Sbjct: 43 GSSSVRPPFSPTGSSSVRRYVEYTVFKGKAALSVSPILPTFREVDSGVSRVHKKGCVILT 102
Query: 101 FAPAIGERKYDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIK 160
F PAIG+RKYDW KKQ FALSPTEVGSL+ +GP +S EFFHDP+M SS GQ++KSLSI
Sbjct: 103 FWPAIGQRKYDWQKKQAFALSPTEVGSLIGLGPAESCEFFHDPSMKSSLEGQVKKSLSIS 162
Query: 161 ANAD--GFFISLICI 173
D G+ ++L +
Sbjct: 163 PLNDKAGYLLNLSVV 177
>gi|255561490|ref|XP_002521755.1| conserved hypothetical protein [Ricinus communis]
gi|223538968|gb|EEF40565.1| conserved hypothetical protein [Ricinus communis]
Length = 173
Score = 147 bits (372), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 89/174 (51%), Positives = 115/174 (66%), Gaps = 13/174 (7%)
Query: 1 MMKLSRSLLSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGGR-- 58
MMKLSR L SR+ L + + A RD H L +AG+ST D + KGSL
Sbjct: 1 MMKLSRLLQQSRNSLGKSIDA------RDASALHDLTFRAGISTFRQDFTIKGSLSNEYM 54
Query: 59 IFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHF 118
+ P+ K + +V T + SG LKV+R+GVILLTF PAIGERKYD+ K+Q F
Sbjct: 55 LLIPFTREKXXGSNNVSLHSHT---VQSGHLKVERRGVILLTFLPAIGERKYDYEKRQSF 111
Query: 119 ALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISL 170
ALS TEVGSL+++GP+DS + FHDP MLSSNAG++RKSLS+K +A+ G+FISL
Sbjct: 112 ALSTTEVGSLISLGPKDSFDCFHDPGMLSSNAGEVRKSLSLKPHAEGGGYFISL 165
>gi|55296373|dbj|BAD68418.1| putative DNA-binding protein p24 [Oryza sativa Japonica Group]
gi|55297130|dbj|BAD68773.1| putative DNA-binding protein p24 [Oryza sativa Japonica Group]
gi|222634946|gb|EEE65078.1| hypothetical protein OsJ_20114 [Oryza sativa Japonica Group]
Length = 272
Score = 132 bits (331), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 61/113 (53%), Positives = 80/113 (70%)
Query: 53 GSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDW 112
G GR+F+ Y +YKGKAA S+DP P F+ LDSG KV ++G +LL FAPA+ R+YDW
Sbjct: 86 GQQNGRVFSTYSIYKGKAAMSLDPRPPQFVPLDSGAYKVVKEGFVLLQFAPAVATRQYDW 145
Query: 113 AKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
+KQ F+LS E+GSLLT+GP DS EFFHDP S+ G++RK L ++ DG
Sbjct: 146 TRKQVFSLSVWEMGSLLTLGPTDSCEFFHDPFKGRSDEGKVRKVLKVEPTPDG 198
>gi|218197563|gb|EEC79990.1| hypothetical protein OsI_21637 [Oryza sativa Indica Group]
Length = 274
Score = 132 bits (331), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 61/113 (53%), Positives = 80/113 (70%)
Query: 53 GSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDW 112
G GR+F+ Y +YKGKAA S+DP P F+ LDSG KV ++G +LL FAPA+ R+YDW
Sbjct: 88 GQQNGRVFSTYSIYKGKAAMSLDPRPPQFVPLDSGAYKVVKEGFVLLQFAPAVATRQYDW 147
Query: 113 AKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
+KQ F+LS E+GSLLT+GP DS EFFHDP S+ G++RK L ++ DG
Sbjct: 148 TRKQVFSLSVWEMGSLLTLGPTDSCEFFHDPFKGRSDEGKVRKVLKVEPTPDG 200
>gi|119638471|gb|ABL85062.1| expressed protein [Brachypodium sylvaticum]
Length = 266
Score = 132 bits (331), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 61/114 (53%), Positives = 80/114 (70%)
Query: 52 KGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYD 111
+G GR+FA Y +YKGKAA S DP P F+ LDSG KV ++G +LL FAPA+ R+YD
Sbjct: 79 QGGQSGRVFASYSIYKGKAALSFDPRPPQFVPLDSGAYKVAKEGFVLLQFAPAVAARQYD 138
Query: 112 WAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
W +KQ F+LS E+G+LLT+GP DS EFFHDP S+ G++RK L ++ DG
Sbjct: 139 WTRKQVFSLSVWEMGTLLTLGPTDSCEFFHDPFKGRSDEGKVRKVLKVEPTPDG 192
>gi|194306593|ref|NP_001123589.1| LOC100170235 [Zea mays]
gi|426021717|sp|B2LXS7.1|WHY1_MAIZE RecName: Full=Single-stranded DNA-bindig protein WHY1,
chloroplastic; AltName: Full=Protein WHIRLY 1;
Short=ZmWHY1; Flags: Precursor
gi|183229934|gb|ACC60344.1| Whirly family nucleic acid binding protein [Zea mays]
gi|194708562|gb|ACF88365.1| unknown [Zea mays]
gi|195612298|gb|ACG27979.1| DNA-binding protein p24 [Zea mays]
gi|408690350|gb|AFU81635.1| WHIRLY-type transcription factor, partial [Zea mays subsp. mays]
gi|413942843|gb|AFW75492.1| whirly1 [Zea mays]
Length = 266
Score = 132 bits (331), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 62/120 (51%), Positives = 80/120 (66%)
Query: 46 GHDVSAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAI 105
G+ G+ GR+F Y +YKGKAA S DP P F+ LDSG KV ++G +LL FAPA+
Sbjct: 73 GYGRPPNGAQDGRVFTSYSIYKGKAALSFDPRPPLFVPLDSGAYKVAKEGFVLLQFAPAV 132
Query: 106 GERKYDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
R+YDW +KQ F+LS E+G+LLT+GP DS EFFHDP S G++RK L I+ DG
Sbjct: 133 ATRQYDWTRKQVFSLSVWEIGTLLTLGPTDSCEFFHDPFKGRSEEGKVRKVLKIEPTPDG 192
>gi|413942842|gb|AFW75491.1| whirly1 [Zea mays]
Length = 205
Score = 131 bits (329), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 62/120 (51%), Positives = 80/120 (66%)
Query: 46 GHDVSAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAI 105
G+ G+ GR+F Y +YKGKAA S DP P F+ LDSG KV ++G +LL FAPA+
Sbjct: 73 GYGRPPNGAQDGRVFTSYSIYKGKAALSFDPRPPLFVPLDSGAYKVAKEGFVLLQFAPAV 132
Query: 106 GERKYDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
R+YDW +KQ F+LS E+G+LLT+GP DS EFFHDP S G++RK L I+ DG
Sbjct: 133 ATRQYDWTRKQVFSLSVWEIGTLLTLGPTDSCEFFHDPFKGRSEEGKVRKVLKIEPTPDG 192
>gi|242091954|ref|XP_002436467.1| hypothetical protein SORBIDRAFT_10g003170 [Sorghum bicolor]
gi|241914690|gb|EER87834.1| hypothetical protein SORBIDRAFT_10g003170 [Sorghum bicolor]
Length = 266
Score = 130 bits (328), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 60/116 (51%), Positives = 79/116 (68%)
Query: 50 SAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERK 109
+ G+ GR+F Y +YKGKAA S DP P F+ LDSG KV ++G +LL FAPA+ R+
Sbjct: 77 APNGAQDGRVFTSYSIYKGKAALSFDPRPPQFVPLDSGAYKVAKEGFVLLQFAPAVATRQ 136
Query: 110 YDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
YDW +KQ F+LS E+G+LLT+GP DS EFFHDP S G++RK L ++ DG
Sbjct: 137 YDWTRKQVFSLSVWEIGTLLTLGPTDSCEFFHDPFKGRSEEGKVRKVLKVEPTPDG 192
>gi|357110788|ref|XP_003557198.1| PREDICTED: uncharacterized protein LOC100824321 [Brachypodium
distachyon]
Length = 274
Score = 129 bits (324), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 59/109 (54%), Positives = 78/109 (71%)
Query: 57 GRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQ 116
GR++A Y +YKGKAA S DP P F+ LDSG KV ++G +LL FAPA+ R+YDW +KQ
Sbjct: 92 GRVYASYSIYKGKAALSFDPRPPQFVPLDSGAYKVAKEGFVLLQFAPAVAARQYDWTRKQ 151
Query: 117 HFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
F+LS E+G+LLT+GP DS EFFHDP S+ G++RK L ++ DG
Sbjct: 152 VFSLSVWEMGTLLTLGPTDSCEFFHDPFKGRSDEGKVRKVLKVEPTPDG 200
>gi|326493106|dbj|BAJ85014.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519312|dbj|BAJ96655.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 267
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 61/123 (49%), Positives = 82/123 (66%)
Query: 43 STTGHDVSAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFA 102
S + G GR+FA Y +YKGKAA + DP P F+ L+SG KV ++G +LL FA
Sbjct: 71 SPLEREPPVPGGQAGRVFASYSIYKGKAALAFDPRPPQFVPLESGAYKVAKEGFVLLQFA 130
Query: 103 PAIGERKYDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKAN 162
PA+G R+YDWA+KQ F+LS E+G+LLT+G DS EFFHDP S+ G++RK L ++
Sbjct: 131 PAVGPRQYDWARKQVFSLSVWEMGTLLTLGLTDSCEFFHDPFKGRSDEGKVRKVLKVEPT 190
Query: 163 ADG 165
DG
Sbjct: 191 PDG 193
>gi|225424922|ref|XP_002277278.1| PREDICTED: uncharacterized protein LOC100253653 [Vitis vinifera]
gi|296086421|emb|CBI32010.3| unnamed protein product [Vitis vinifera]
Length = 268
Score = 126 bits (316), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 60/119 (50%), Positives = 82/119 (68%)
Query: 47 HDVSAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIG 106
+D S G+L R+F + +YKGKAA +V+P P F LDSG KV ++G +LL FAPA G
Sbjct: 75 NDSSFGGALQPRVFVGHSIYKGKAALTVEPKAPEFTPLDSGAFKVSKEGFVLLQFAPAAG 134
Query: 107 ERKYDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
R+YDW +KQ F+LS TE+GSL+++G R+S EFFHDP S G++RK L ++ DG
Sbjct: 135 VRQYDWGRKQVFSLSVTEIGSLISLGARESCEFFHDPFKGRSEEGKVRKVLKVEPLPDG 193
>gi|449520335|ref|XP_004167189.1| PREDICTED: single-stranded DNA-binding protein WHY1,
chloroplastic-like [Cucumis sativus]
Length = 276
Score = 123 bits (309), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 56/111 (50%), Positives = 79/111 (71%)
Query: 55 LGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAK 114
L R F + +YKGKAA +V+P P F LDSG K+ R+G+++L FAPA G R+YDW++
Sbjct: 89 LPPRFFVGHSIYKGKAALTVEPRPPEFTPLDSGAFKISREGLVMLQFAPAAGVRQYDWSR 148
Query: 115 KQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
KQ F+LS TE+GSL+ +GPR++ EFFHDP S+ G++RK L ++ DG
Sbjct: 149 KQVFSLSVTELGSLIALGPREACEFFHDPYKGKSDEGKVRKILKVEPLPDG 199
>gi|255558202|ref|XP_002520128.1| conserved hypothetical protein [Ricinus communis]
gi|223540620|gb|EEF42183.1| conserved hypothetical protein [Ricinus communis]
Length = 271
Score = 123 bits (308), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 55/108 (50%), Positives = 78/108 (72%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
R++ + +YKGKAA +V+P P F LDSG KV R+G +LL FAPA G R+YDW++KQ
Sbjct: 88 RVYVGHSIYKGKAALTVEPRAPEFAALDSGAFKVAREGFVLLQFAPAAGVRQYDWSRKQV 147
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
F+LS TE+G+++++G RDS EFFHDP S+ G++RK L ++ DG
Sbjct: 148 FSLSVTEIGTIISLGARDSCEFFHDPNKGKSDEGKIRKVLKVEPLPDG 195
>gi|15223748|ref|NP_172893.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
gi|75191428|sp|Q9M9S3.1|WHY1_ARATH RecName: Full=Single-stranded DNA-binding protein WHY1,
chloroplastic; AltName: Full=Protein PLASTID
TRANSCRIPTIONALLY ACTIVE 1; AltName: Full=Protein WHIRLY
1; Short=AtWHY1; Flags: Precursor
gi|7262683|gb|AAF43941.1|AC012188_18 Contains similarity to a hypothetical protein from Arabidopsis
thaliana gb|AC002521.2. EST gb|AI995686 comes from this
gene [Arabidopsis thaliana]
gi|12083312|gb|AAG48815.1|AF332452_1 putative DNA-binding protein p24 [Arabidopsis thaliana]
gi|13877787|gb|AAK43971.1|AF370156_1 putative DNA-binding protein p24 [Arabidopsis thaliana]
gi|16323418|gb|AAL15203.1| putative DNA-binding protein p24 [Arabidopsis thaliana]
gi|332191039|gb|AEE29160.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
Length = 263
Score = 122 bits (307), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 56/111 (50%), Positives = 79/111 (71%)
Query: 55 LGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAK 114
L R + + +YKGKAA +VDP P F+ LDSG K+ + G +LL FAP+ G R+YDW+K
Sbjct: 77 LPARFYVGHSIYKGKAALTVDPRAPEFVALDSGAFKLSKDGFLLLQFAPSAGVRQYDWSK 136
Query: 115 KQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
KQ F+LS TE+G+L+++GPR+S EFFHDP S+ G++RK L ++ DG
Sbjct: 137 KQVFSLSVTEIGTLVSLGPRESCEFFHDPFKGKSDEGKVRKVLKVEPLPDG 187
>gi|356567550|ref|XP_003551981.1| PREDICTED: uncharacterized protein LOC100804480 [Glycine max]
Length = 235
Score = 122 bits (306), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 59/133 (44%), Positives = 86/133 (64%), Gaps = 3/133 (2%)
Query: 36 LISQAGMSTTGHDVSAKGSLGG---RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVK 92
L Q +++T S+G R++ Y +YKGKAA ++ P P FM LDSG K+
Sbjct: 59 LFDQNTLASTPRPTRPSASVGALPPRVYVGYSIYKGKAALTLTPRPPEFMPLDSGAYKIS 118
Query: 93 RKGVILLTFAPAIGERKYDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQ 152
++G +LL FAPA+G R+YDW +KQ F+LS E+GS++++G RDS EFFHDP S+ G+
Sbjct: 119 KEGYVLLQFAPAVGTRQYDWNRKQVFSLSVGEMGSVISLGARDSYEFFHDPFKGKSDEGK 178
Query: 153 MRKSLSIKANADG 165
+RK L ++ DG
Sbjct: 179 VRKILKVEPLPDG 191
>gi|75174555|sp|Q9LL85.1|WHY1_SOLTU RecName: Full=Single-stranded DNA-bindig protein WHY1,
chloroplastic; AltName: Full=DNA-binding protein p24;
AltName: Full=PR-10a binding factor 2; Short=PBF-2;
AltName: Full=Protein WHIRLY 1; Short=StWhy1; Flags:
Precursor
gi|9651810|gb|AAF91282.1|AF233342_1 DNA-binding protein p24 [Solanum tuberosum]
Length = 274
Score = 122 bits (306), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 55/113 (48%), Positives = 81/113 (71%)
Query: 53 GSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDW 112
G+ ++F Y +YKGKAA +V+P P F LDSG K+ R+G+++L FAPA G R+YDW
Sbjct: 86 GASTPKVFVGYSIYKGKAALTVEPRSPEFSPLDSGAFKLSREGMVMLQFAPAAGVRQYDW 145
Query: 113 AKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
++KQ F+LS TE+GS++++G +DS EFFHDP S+ G++RK L ++ DG
Sbjct: 146 SRKQVFSLSVTEIGSIISLGAKDSCEFFHDPNKGRSDEGRVRKVLKVEPLPDG 198
>gi|157878742|pdb|1L3A|A Chain A, Structure Of The Plant Transcriptional Regulator Pbf-2
gi|157878743|pdb|1L3A|B Chain B, Structure Of The Plant Transcriptional Regulator Pbf-2
gi|157878744|pdb|1L3A|C Chain C, Structure Of The Plant Transcriptional Regulator Pbf-2
gi|157878745|pdb|1L3A|D Chain D, Structure Of The Plant Transcriptional Regulator Pbf-2
Length = 227
Score = 122 bits (306), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 79/108 (73%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
++F Y +YKGKAA +V+P P F LDSG K+ R+G+++L FAPA G R+YDW++KQ
Sbjct: 37 KVFVGYSIYKGKAALTVEPRSPEFSPLDSGAFKLSREGMVMLQFAPAAGVRQYDWSRKQV 96
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
F+LS TE+GS++++G +DS EFFHDP S+ G++RK L ++ DG
Sbjct: 97 FSLSVTEIGSIISLGTKDSCEFFHDPNKGRSDEGRVRKVLKVEPLPDG 144
>gi|217071924|gb|ACJ84322.1| unknown [Medicago truncatula]
Length = 239
Score = 120 bits (301), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 57/120 (47%), Positives = 82/120 (68%), Gaps = 2/120 (1%)
Query: 53 GSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDW 112
G+L R++ + +YKGKAA ++ P P F+ LDSG K+ R G +LL FAP++G R+YDW
Sbjct: 75 GALPPRVYVGHSIYKGKAALTITPTPPKFVTLDSGAYKISRDGCLLLQFAPSVGPRQYDW 134
Query: 113 AKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISL 170
+KQ F LS E+GS++++G R+S EFFHDP S+ G++RK L I+ D GFF +L
Sbjct: 135 NRKQLFMLSVDEMGSVISLGARESCEFFHDPFKGGSDEGKVRKVLKIEPFPDGSGFFFNL 194
>gi|297605166|ref|NP_001056790.2| Os06g0145800 [Oryza sativa Japonica Group]
gi|255676711|dbj|BAF18704.2| Os06g0145800 [Oryza sativa Japonica Group]
Length = 190
Score = 120 bits (300), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 54/91 (59%), Positives = 68/91 (74%)
Query: 53 GSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDW 112
G GR+F+ Y +YKGKAA S+DP P F+ LDSG KV ++G +LL FAPA+ R+YDW
Sbjct: 86 GQQNGRVFSTYSIYKGKAAMSLDPRPPQFVPLDSGAYKVVKEGFVLLQFAPAVATRQYDW 145
Query: 113 AKKQHFALSPTEVGSLLTMGPRDSSEFFHDP 143
+KQ F+LS E+GSLLT+GP DS EFFHDP
Sbjct: 146 TRKQVFSLSVWEMGSLLTLGPTDSCEFFHDP 176
>gi|295913588|gb|ADG58040.1| transcription factor [Lycoris longituba]
Length = 246
Score = 119 bits (299), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 55/108 (50%), Positives = 76/108 (70%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
R+F Y +YKGKAA +V+P P F LDSG KV ++G ILL FAPA+G R+YDW++KQ
Sbjct: 86 RVFVGYSIYKGKAALTVEPRAPEFAPLDSGAFKVAKEGFILLQFAPAVGMRQYDWSRKQV 145
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
F+LS E+G+L+++G ++S EFFHDP S G++RK L + DG
Sbjct: 146 FSLSVVEIGTLMSLGAKESCEFFHDPFKGRSEEGKVRKLLKAEPLPDG 193
>gi|388509172|gb|AFK42652.1| unknown [Lotus japonicus]
Length = 261
Score = 119 bits (298), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 55/113 (48%), Positives = 78/113 (69%)
Query: 53 GSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDW 112
G+L R++ + +YKGKAA +V P P F LDSG K+ R+G +LL FAPAI R+YDW
Sbjct: 73 GALPPRVYVGHSIYKGKAALTVTPRPPEFAPLDSGAFKISREGYVLLQFAPAIASRQYDW 132
Query: 113 AKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
+KQ F+LS E+GS++++G R+S EFFHDP S+ G++RK L ++ DG
Sbjct: 133 NRKQVFSLSVVEMGSVISLGTRESCEFFHDPLKGKSDEGKVRKVLKLEPLPDG 185
>gi|297849846|ref|XP_002892804.1| ATWHY1/PTAC1 [Arabidopsis lyrata subsp. lyrata]
gi|297338646|gb|EFH69063.1| ATWHY1/PTAC1 [Arabidopsis lyrata subsp. lyrata]
Length = 264
Score = 119 bits (298), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 54/111 (48%), Positives = 79/111 (71%)
Query: 55 LGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAK 114
L R + + +YKGKAA +++P P F+ LDSG K+ + G +LL FAP+ G R+YDW+K
Sbjct: 78 LPARFYVGHSIYKGKAALTMEPRAPEFVALDSGAFKLSKDGFLLLQFAPSAGVRQYDWSK 137
Query: 115 KQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
KQ F+LS TE+G+L+++GPR+S EFFHDP S+ G++RK L ++ DG
Sbjct: 138 KQVFSLSVTEIGTLVSLGPRESCEFFHDPFKGKSDEGKVRKVLKVEPLPDG 188
>gi|223945821|gb|ACN26994.1| unknown [Zea mays]
gi|413942841|gb|AFW75490.1| whirly1 [Zea mays]
Length = 177
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 54/98 (55%), Positives = 69/98 (70%)
Query: 46 GHDVSAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAI 105
G+ G+ GR+F Y +YKGKAA S DP P F+ LDSG KV ++G +LL FAPA+
Sbjct: 73 GYGRPPNGAQDGRVFTSYSIYKGKAALSFDPRPPLFVPLDSGAYKVAKEGFVLLQFAPAV 132
Query: 106 GERKYDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDP 143
R+YDW +KQ F+LS E+G+LLT+GP DS EFFHDP
Sbjct: 133 ATRQYDWTRKQVFSLSVWEIGTLLTLGPTDSCEFFHDP 170
>gi|145328252|ref|NP_001077872.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
gi|330250525|gb|AEC05619.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
Length = 267
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 78/108 (72%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
R + + +YKGKAA +++P P F+ L+SG K+ ++G +LL FAPA G R+YDW++KQ
Sbjct: 84 RFYVGHSIYKGKAALTIEPRAPEFVALESGAFKLTKEGFLLLQFAPAAGVRQYDWSRKQV 143
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
F+LS TE+G+L+++GPR+S EFFHDP + G++RK L ++ DG
Sbjct: 144 FSLSVTEIGNLVSLGPRESCEFFHDPFKGKGDEGKVRKVLKVEPLPDG 191
>gi|302399111|gb|ADL36850.1| WHY domain class transcription factor [Malus x domestica]
Length = 276
Score = 115 bits (289), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 54/111 (48%), Positives = 77/111 (69%)
Query: 55 LGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAK 114
+ R + + +YKGKAA +V+P P F LDSG K+ R+G +LL FAPA G R YDW++
Sbjct: 90 MAPRFYVGHSIYKGKAALTVEPKAPEFTPLDSGAFKLSREGFVLLQFAPAAGVRVYDWSR 149
Query: 115 KQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
KQ F+LS TE+GSL+++G ++S EFFHDP S+ G++RK L ++ DG
Sbjct: 150 KQVFSLSVTEIGSLVSLGSKESLEFFHDPFKGKSDEGKVRKVLKVEPLPDG 200
>gi|302764906|ref|XP_002965874.1| hypothetical protein SELMODRAFT_84708 [Selaginella moellendorffii]
gi|302802736|ref|XP_002983122.1| hypothetical protein SELMODRAFT_117665 [Selaginella moellendorffii]
gi|300149275|gb|EFJ15931.1| hypothetical protein SELMODRAFT_117665 [Selaginella moellendorffii]
gi|300166688|gb|EFJ33294.1| hypothetical protein SELMODRAFT_84708 [Selaginella moellendorffii]
Length = 226
Score = 115 bits (288), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 58/118 (49%), Positives = 78/118 (66%), Gaps = 3/118 (2%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQ- 116
R+FA + YKGK A ++ + PTF DSGD + R+G ++L FAP+I +R+YDW KKQ
Sbjct: 31 RVFADHVFYKGKCALNMRLIKPTFKISDSGDAILSREGTVMLEFAPSISQRQYDWGKKQV 90
Query: 117 HFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISLIC 172
FALS +E+G +L + P +S EFFHDP M S+AG +RKSL I+ D GFF L
Sbjct: 91 LFALSVSELGQILALTPSESLEFFHDPNMGKSDAGMVRKSLKIEPTTDRNGFFFGLTV 148
>gi|42568881|ref|NP_178377.2| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
gi|75115367|sp|Q66GR6.1|WHY3_ARATH RecName: Full=Single-stranded DNA-binding protein WHY3,
chloroplastic; AltName: Full=Protein PLASTID
TRANSCRIPTIONALLY ACTIVE 11; AltName: Full=Protein
WHIRLY 3; Short=AtWHY3; Flags: Precursor
gi|51536442|gb|AAU05459.1| At2g02740 [Arabidopsis thaliana]
gi|51972072|gb|AAU15140.1| At2g02740 [Arabidopsis thaliana]
gi|330250524|gb|AEC05618.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
Length = 268
Score = 115 bits (287), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 52/109 (47%), Positives = 79/109 (72%), Gaps = 1/109 (0%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
R + + +YKGKAA +++P P F+ L+SG K+ ++G +LL FAPA G R+YDW++KQ
Sbjct: 84 RFYVGHSIYKGKAALTIEPRAPEFVALESGAFKLTKEGFLLLQFAPAAGVRQYDWSRKQV 143
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAM-LSSNAGQMRKSLSIKANADG 165
F+LS TE+G+L+++GPR+S EFFHDP S+ G++RK L ++ DG
Sbjct: 144 FSLSVTEIGNLVSLGPRESCEFFHDPFKGKGSDEGKVRKVLKVEPLPDG 192
>gi|297814520|ref|XP_002875143.1| ATWHY3/PTAC11 [Arabidopsis lyrata subsp. lyrata]
gi|297320981|gb|EFH51402.1| ATWHY3/PTAC11 [Arabidopsis lyrata subsp. lyrata]
Length = 268
Score = 114 bits (286), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 55/117 (47%), Positives = 84/117 (71%), Gaps = 2/117 (1%)
Query: 50 SAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERK 109
+A+GS R + + +YKGKAA +++P P F+ L+SG K+ ++G +LL FAPA G R+
Sbjct: 77 NAEGS-SPRFYVGHSIYKGKAALTIEPRAPEFVALESGAFKLTKEGFLLLQFAPAAGVRQ 135
Query: 110 YDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAM-LSSNAGQMRKSLSIKANADG 165
YDW++KQ F+LS TE+G+L+++GPR+S EFFHDP S+ G++RK L ++ DG
Sbjct: 136 YDWSRKQVFSLSVTEIGNLVSLGPRESCEFFHDPFKGKGSDEGKVRKVLKVEPLPDG 192
>gi|110740230|dbj|BAF02013.1| hypothetical protein [Arabidopsis thaliana]
Length = 268
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 52/109 (47%), Positives = 79/109 (72%), Gaps = 1/109 (0%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
R + + +YKGKAA +++P P F+ L+SG K+ ++G +LL FAPA G R+YDW+KK+
Sbjct: 84 RFYVGHSIYKGKAALTIEPRAPEFVALESGAFKLTKEGFLLLQFAPAAGVRQYDWSKKRV 143
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAM-LSSNAGQMRKSLSIKANADG 165
F+LS TE+G+L+++GPR+S EFFHDP S+ G++RK L ++ DG
Sbjct: 144 FSLSVTEIGNLVSLGPRESCEFFHDPFKGKGSDEGKVRKVLKVEPLPDG 192
>gi|357486629|ref|XP_003613602.1| DNA-binding protein p24 [Medicago truncatula]
gi|355514937|gb|AES96560.1| DNA-binding protein p24 [Medicago truncatula]
Length = 261
Score = 112 bits (280), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 53/113 (46%), Positives = 73/113 (64%)
Query: 53 GSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDW 112
G L R+ VYKGKA V PVLP F DSG K+ ++G++LL F P+ G R+YDW
Sbjct: 76 GELPARVHVSRSVYKGKAVLVVSPVLPKFTSSDSGTFKISKEGLMLLQFVPSAGFRQYDW 135
Query: 113 AKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
+KQ F+LS E+G+L+ +G R+S E FHDP M S+ G++RK L ++ DG
Sbjct: 136 NRKQVFSLSVDEMGNLINLGARESCEIFHDPFMGRSDEGKVRKVLKVEPLHDG 188
>gi|224111254|ref|XP_002315793.1| predicted protein [Populus trichocarpa]
gi|222864833|gb|EEF01964.1| predicted protein [Populus trichocarpa]
Length = 191
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 73/108 (67%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
++F + +YKGKAA +++P P F L+SG K+ ++G +L FAPA R+YDW +KQ
Sbjct: 6 KVFVGHSIYKGKAALTIEPRAPEFSPLESGAYKLVKEGFVLSQFAPASSARQYDWTRKQV 65
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
F+LS TE+G L+++G RDS EFFHDP S G++RK L ++ DG
Sbjct: 66 FSLSVTEIGHLVSLGARDSCEFFHDPNKGRSEEGKVRKVLKVEPLPDG 113
>gi|116779826|gb|ABK21442.1| unknown [Picea sitchensis]
Length = 257
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 51/115 (44%), Positives = 77/115 (66%), Gaps = 2/115 (1%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
+I+ + VYKG+ A ++ P LP ++ L+ G + V ++G + L FAPA+G R+YDW+KK+
Sbjct: 72 KIYVKHTVYKGEGALTMKPKLPDYITLNMGGVTVAKEGCMFLEFAPAVGPRQYDWSKKKI 131
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISL 170
ALS EVG+LL++GP +S EF HDP M S AG++ K L + D G+F +L
Sbjct: 132 IALSVVEVGTLLSLGPDESCEFTHDPFMGKSEAGKIMKVLKVGNLQDTGGYFFNL 186
>gi|116783258|gb|ABK22859.1| unknown [Picea sitchensis]
Length = 259
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 51/115 (44%), Positives = 77/115 (66%), Gaps = 2/115 (1%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
+I+ + VYKG+ A ++ P LP ++ L+ G + V ++G + L FAPA+G R+YDW+KK+
Sbjct: 72 KIYVKHTVYKGEGALTMKPKLPDYITLNMGGVTVAKEGCMFLEFAPAVGPRQYDWSKKKI 131
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISL 170
ALS EVG+LL++GP +S EF HDP M S AG++ K L + D G+F +L
Sbjct: 132 IALSVVEVGTLLSLGPDESCEFTHDPFMGKSEAGKIMKVLKVGNLQDTGGYFFNL 186
>gi|224099743|ref|XP_002311601.1| predicted protein [Populus trichocarpa]
gi|118485247|gb|ABK94483.1| unknown [Populus trichocarpa]
gi|222851421|gb|EEE88968.1| predicted protein [Populus trichocarpa]
Length = 265
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 73/108 (67%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
++F + +YKGKAA +V+P P F LDSG K+ ++G +LL FAPA R+YDW +KQ
Sbjct: 82 KVFVGHSIYKGKAALTVEPRSPEFSPLDSGAYKLVKEGFVLLQFAPAASVRQYDWTRKQV 141
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
F+LS TE+G L+++ + S EFFHDP S+ G++RK L ++ DG
Sbjct: 142 FSLSVTEIGHLVSLDAKGSCEFFHDPNKGKSDEGKVRKLLKVEPLPDG 189
>gi|412985999|emb|CCO17199.1| predicted protein [Bathycoccus prasinos]
Length = 419
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 53/114 (46%), Positives = 64/114 (56%), Gaps = 6/114 (5%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
R F + +YK K+A SV V PTF G +KR G ILL FAP+IG RKYDW +K
Sbjct: 238 RTFVDFSIYKSKSALSVKLVKPTFETDHQGRTIMKRSGGILLEFAPSIGTRKYDWTRKGS 297
Query: 118 FALSPTEVGSLLT-MGPRDSS-----EFFHDPAMLSSNAGQMRKSLSIKANADG 165
F LSP E L + P S EFFHDP M S+ G + KSL ++A DG
Sbjct: 298 FMLSPIEAAELANRLNPSRQSIAQKVEFFHDPGMGGSSQGSVTKSLKMEAMPDG 351
>gi|356497854|ref|XP_003517771.1| PREDICTED: uncharacterized protein LOC100797370 [Glycine max]
Length = 263
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 71/117 (60%), Gaps = 4/117 (3%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIG--ERKYDWAKK 115
R++ Y VY K +V P P F SG KV ++G ++L FAP++G E YDW +K
Sbjct: 75 RVYVGYSVYTRKGVLTVTPRPPEFESKSSGAFKVSKEGYVVLQFAPSVGADEPIYDWNQK 134
Query: 116 QHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKA--NADGFFISL 170
Q F+LS +E+G+L+T+G RDS EF H+ L SN ++RK L ++ +A G SL
Sbjct: 135 QIFSLSVSEMGTLITLGARDSWEFSHETVKLKSNETEVRKVLKVEPLLDATGHLFSL 191
>gi|255084617|ref|XP_002508883.1| predicted protein [Micromonas sp. RCC299]
gi|226524160|gb|ACO70141.1| predicted protein [Micromonas sp. RCC299]
Length = 347
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 62/119 (52%), Gaps = 5/119 (4%)
Query: 50 SAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERK 109
SA G R++ Y VYK KAA + PTF G KR G +LL APA+G R+
Sbjct: 174 SANDDAGSRVYCDYAVYKSKAAAKFQVIKPTFEVKPDGSRAKKRDGGVLLEMAPAVGPRQ 233
Query: 110 YDWAKKQHFALSPTEVGSL---LTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADG 165
YDWA+KQ LSP E+ L L G FFHDP M ++ G M KSL + DG
Sbjct: 234 YDWAQKQTIMLSPLELVELTESLHFG--RGVNFFHDPGMGTNRQGAMTKSLKAEPMPDG 290
>gi|224286043|gb|ACN40733.1| unknown [Picea sitchensis]
Length = 184
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 58/86 (67%), Gaps = 2/86 (2%)
Query: 87 GDLKVKRKGVILLTFAPAIGERKYDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAML 146
G + V ++G + L FAPA+G R+YDW+KK+ ALS EVG+LL++GP +S EF HDP M
Sbjct: 28 GGVTVAKEGCMFLEFAPAVGPRQYDWSKKKIIALSVVEVGTLLSLGPDESCEFTHDPFMG 87
Query: 147 SSNAGQMRKSLSIKANAD--GFFISL 170
S AG++ K L + D G+F +L
Sbjct: 88 KSEAGKIMKVLKVGNLQDTGGYFFNL 113
>gi|356500463|ref|XP_003519051.1| PREDICTED: uncharacterized protein LOC100775220 [Glycine max]
Length = 263
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/117 (40%), Positives = 68/117 (58%), Gaps = 4/117 (3%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIG--ERKYDWAKK 115
R++ Y VY K +V P P F SG KV ++G ++L FAP++G E YDW K
Sbjct: 75 RVYVGYSVYTKKGMLTVIPRPPEFESKSSGAFKVSKEGYVVLQFAPSVGADEPIYDWNHK 134
Query: 116 QHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKA--NADGFFISL 170
Q F+LS +E+G+L+ +G RDS EF H+ L S+ +RK L ++ +A G SL
Sbjct: 135 QTFSLSVSEMGTLIILGARDSWEFSHETVKLKSSKIDVRKVLKVEPLLDATGHLFSL 191
>gi|223947295|gb|ACN27731.1| unknown [Zea mays]
gi|413942840|gb|AFW75489.1| whirly1 [Zea mays]
Length = 153
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 36/71 (50%), Positives = 47/71 (66%)
Query: 46 GHDVSAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAI 105
G+ G+ GR+F Y +YKGKAA S DP P F+ LDSG KV ++G +LL FAPA+
Sbjct: 73 GYGRPPNGAQDGRVFTSYSIYKGKAALSFDPRPPLFVPLDSGAYKVAKEGFVLLQFAPAV 132
Query: 106 GERKYDWAKKQ 116
R+YDW +KQ
Sbjct: 133 ATRQYDWTRKQ 143
>gi|307106759|gb|EFN55004.1| hypothetical protein CHLNCDRAFT_134820 [Chlorella variabilis]
Length = 1274
Score = 80.9 bits (198), Expect = 2e-13, Method: Composition-based stats.
Identities = 44/119 (36%), Positives = 69/119 (57%), Gaps = 3/119 (2%)
Query: 59 IFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHF 118
+++ Y +YKGKAA ++ + PT+ + SG LK+ R+G +LL FA ++G ++YDW KK+ F
Sbjct: 1097 VYSDYTIYKGKAAMAIKVIKPTWESIGSG-LKISREGTLLLEFAASVGPQQYDWTKKETF 1155
Query: 119 ALSPTEVGSLLTMG-PRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD-GFFISLICIAS 175
LS E ++L R S E HDP G + K +++ D G+F S+ AS
Sbjct: 1156 GLSALECAAVLEAADARQSFEALHDPNKGRGGEGTVYKKFNMRPAPDKGWFFSIASTAS 1214
>gi|384244785|gb|EIE18283.1| ssDNA-binding transcriptional regulator [Coccomyxa subellipsoidea
C-169]
Length = 250
Score = 75.9 bits (185), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 64/112 (57%), Gaps = 11/112 (9%)
Query: 59 IFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPA-------IGERKYD 111
++A Y +YKGKAA S P +++ G + + R G +++ FAP +G R Y
Sbjct: 69 LYANYAIYKGKAAASFRVRKPRWVEAQDGSISLDRAGSLIVEFAPVAPGSGTNVGNRSYQ 128
Query: 112 WAKKQHFALSPTEVGSLL---TMGPRDSSEFFHDPAMLSSNAGQMRKSLSIK 160
W KKQ FALSP E+ L+ T G + E FHDP +++G++ K+LS++
Sbjct: 129 WDKKQTFALSPVELAGLVESCTTG-KSMKELFHDPNKGGTDSGKIAKTLSLQ 179
>gi|303289333|ref|XP_003063954.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454270|gb|EEH51576.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 302
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 68/128 (53%), Gaps = 8/128 (6%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAP---AIGERKYDWAK 114
+++ + ++K K A + + PTF L +G + KR G + L FAP G+++YDW++
Sbjct: 139 KVYCNFAIHKSKTAVQMSAIKPTFELLPNGSKQKKRDGAMFLEFAPVAAGAGQKQYDWSR 198
Query: 115 KQHFALSPTEVGSL-LTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD---GFFISL 170
KQ +LSP E L + FFHDP M +S G+ KSL + D G F++L
Sbjct: 199 KQSISLSPLEFMELSEALAANRGVNFFHDPWMGTSRQGETTKSLKAEPMPDGSGGIFLNL 258
Query: 171 ICIASSHG 178
+AS G
Sbjct: 259 -TVASGGG 265
>gi|145355417|ref|XP_001421958.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582197|gb|ABP00252.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 152
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 54/115 (46%), Gaps = 4/115 (3%)
Query: 60 FAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFA 119
F Y VYK K A + V TF SG +KR G +LL A A R+YDW K F
Sbjct: 1 FVDYGVYKTKGALKLKAVRATFESDASGRRIMKRAGGVLLELANATAPRQYDWGNKGSFM 60
Query: 120 LSPTEVGSLL-TMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD---GFFISL 170
LS TE L M FFHDP +N G + K+ ++ D G F++L
Sbjct: 61 LSATEAAELADRMASNAPCSFFHDPGAGGANRGNVNKAFKVEPMPDGSGGLFVNL 115
>gi|124484361|dbj|BAF46291.1| transcription regulator Pbf-2 [Chlamydomonas reinhardtii]
Length = 238
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 65/131 (49%), Gaps = 18/131 (13%)
Query: 51 AKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIG---- 106
A S R++ YYVYK +AA + + PTF K +G + ++R G +LL FA A
Sbjct: 66 AATSSSDRVYTNYYVYKTRAAMCLRLLPPTFAKAQAGKV-LERDGTMLLEFATANAAAPG 124
Query: 107 ---------ERKYDWAKKQHFALSPTEVGSLLTMGPRDSSE---FFHDPAMLSSNAGQMR 154
R Y+W K FALSP E+G++L S + +HDPA L G+
Sbjct: 125 AGNGPAGNVNRTYNWGNKVTFALSPVELGNILAGDAVASDKGLVLWHDPAKLGKT-GEPI 183
Query: 155 KSLSIKANADG 165
K LS+K DG
Sbjct: 184 KKLSLKQLPDG 194
>gi|159487549|ref|XP_001701785.1| transcription factor [Chlamydomonas reinhardtii]
gi|158281004|gb|EDP06760.1| transcription factor [Chlamydomonas reinhardtii]
Length = 238
Score = 63.2 bits (152), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 65/131 (49%), Gaps = 18/131 (13%)
Query: 51 AKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIG---- 106
A S R++ YYVYK +AA + + PTF K +G + ++R G +LL FA A
Sbjct: 66 AATSSSDRVYTNYYVYKTRAAMCLRLLPPTFAKAQAGKV-LERDGTMLLEFATANAAAPG 124
Query: 107 ---------ERKYDWAKKQHFALSPTEVGSLLTMGPRDSSE---FFHDPAMLSSNAGQMR 154
R Y+W K FALSP E+G++L S + +HDPA L G+
Sbjct: 125 AGSGPAGNVNRTYNWGNKVTFALSPVELGNILAGDAVASDKGLVLWHDPAKLGKT-GEPI 183
Query: 155 KSLSIKANADG 165
K LS+K DG
Sbjct: 184 KKLSLKQLPDG 194
>gi|302829905|ref|XP_002946519.1| hypothetical protein VOLCADRAFT_86539 [Volvox carteri f. nagariensis]
gi|300268265|gb|EFJ52446.1| hypothetical protein VOLCADRAFT_86539 [Volvox carteri f. nagariensis]
Length = 3754
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 66/124 (53%), Gaps = 18/124 (14%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFA-----------PAIG 106
R++ +++YK +AA +V +LP ++G ++R GV+LL FA PA G
Sbjct: 3589 RVYVNFHIYKTRAAMAVR-LLPPSFTTENGYKTLERDGVMLLEFANANPGQPSGTAPAAG 3647
Query: 107 --ERKYDWAKKQHFALSPTEVGSLLTMGPRDSSE---FFHDPAMLSSNAGQMRKSLSIKA 161
R Y+W+ K FALSP+E+G++L S + +HDP L G+ K L++K
Sbjct: 3648 GINRTYNWSNKISFALSPSELGTMLAGDAIASDKGLVMYHDPTKL-GKVGEPMKRLTMKQ 3706
Query: 162 NADG 165
DG
Sbjct: 3707 MPDG 3710
>gi|348676880|gb|EGZ16697.1| hypothetical protein PHYSODRAFT_500116 [Phytophthora sojae]
Length = 224
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 63/120 (52%), Gaps = 6/120 (5%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
R+F + VY +AF V P P + S +K+KR G I+L++A A Y++ K
Sbjct: 29 RVFPNFSVYGSDSAFQVAPSAPQYTSAGS-YMKMKRVGAIMLSWAKATNS-GYNYQNKTF 86
Query: 118 FALSPTEVGSLLTMGPRDSSE--FFHDPAMLSSNAGQMRKSLSIK--ANADGFFISLICI 173
F+LSPTEVG +L + E F H P M +S + K+L I + +DG + LI +
Sbjct: 87 FSLSPTEVGLVLELLDSRIPELSFTHSPNMNASEEDKNSKTLHITRASGSDGNPLVLIKV 146
>gi|301101636|ref|XP_002899906.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262102481|gb|EEY60533.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 226
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 61/123 (49%), Gaps = 6/123 (4%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGD-LKVKRKGVILLTFAPAIGERKYDWAKKQ 116
R+F + +Y +AF V P P + + G+ LK KR G I+L++A A Y++ K
Sbjct: 29 RVFPNFSIYGSDSAFQVSPSPPQYT--NGGNYLKTKRVGAIMLSWAKATNS-GYNYQNKT 85
Query: 117 HFALSPTEVGSLLTMGPRDSSE--FFHDPAMLSSNAGQMRKSLSIKANADGFFISLICIA 174
F+LSP+EVG +L + E H P M +S + KSL I + LI I
Sbjct: 86 FFSLSPSEVGLVLELLDSRIPELSLTHSPNMNASEEDKNTKSLHITRASGSDGNPLILIK 145
Query: 175 SSH 177
+H
Sbjct: 146 VNH 148
>gi|308812987|ref|XP_003083800.1| DNA-binding protein p24 (ISS) [Ostreococcus tauri]
gi|116055682|emb|CAL57767.1| DNA-binding protein p24 (ISS) [Ostreococcus tauri]
Length = 240
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 53/124 (42%), Gaps = 6/124 (4%)
Query: 59 IFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHF 118
F + VYK + A + V PT L + V+R G +LL A A R YDW K F
Sbjct: 81 TFVDFGVYKTRGAMKMKAVRPTLGALGENAV-VRRPGGVLLELANATAPRTYDWQNKGSF 139
Query: 119 ALSPTEVGSLL-TMGPRDSSEFFHDPAMLSSNA----GQMRKSLSIKANADGFFISLICI 173
LS TE L M S FFHD A + G+ K + + G F++L
Sbjct: 140 MLSGTEAAELADRMAANGSCSFFHDSAKANGGGGGTLGKAFKVEPMPDGSGGMFVNLTVT 199
Query: 174 ASSH 177
S +
Sbjct: 200 LSEN 203
>gi|325185050|emb|CCA19542.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 211
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 47/77 (61%), Gaps = 2/77 (2%)
Query: 55 LGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAK 114
+I+ + VY+ A F V P+ P +++ S LK KR G +LL++A + YD+ K
Sbjct: 19 FSSKIYPSFTVYESDAVFQVSPIQPEYVQ-QSNYLKTKRVGSLLLSWAKQ-RDGSYDYTK 76
Query: 115 KQHFALSPTEVGSLLTM 131
K +FAL+P+E+G +L +
Sbjct: 77 KLYFALTPSEIGLVLEV 93
>gi|223994305|ref|XP_002286836.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220978151|gb|EED96477.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 288
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 46/93 (49%), Gaps = 1/93 (1%)
Query: 50 SAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGE-R 108
S+ G + R F Y V+ +A SV VLP F + + + V R+G I+L F P
Sbjct: 72 SSMGMMPRRGFPQYTVFGPDSALSVRAVLPNFKRAGTDGISVDRRGKIVLEFVPRNPSGA 131
Query: 109 KYDWAKKQHFALSPTEVGSLLTMGPRDSSEFFH 141
+ WA K F++S EVG ++ P+ E H
Sbjct: 132 GFQWADKTTFSMSVEEVGLFVSQLPQSGIELSH 164
>gi|253761700|ref|XP_002489225.1| hypothetical protein SORBIDRAFT_0012s017770 [Sorghum bicolor]
gi|241947085|gb|EES20230.1| hypothetical protein SORBIDRAFT_0012s017770 [Sorghum bicolor]
Length = 91
Score = 45.1 bits (105), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 23/32 (71%)
Query: 86 SGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
SG KV ++G +LL FAPA+ R+YDW +KQ
Sbjct: 5 SGAYKVAKEGYVLLQFAPAVATRQYDWTRKQE 36
>gi|397620242|gb|EJK65621.1| hypothetical protein THAOC_13502 [Thalassiosira oceanica]
Length = 317
Score = 44.3 bits (103), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 1/85 (1%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGE-RKYDWAKKQ 116
R + Y ++ +A S+ +P F K + V+R+G ++L F P + W K
Sbjct: 7 RGYPQYTMFSSDSALSMKAAMPVFKKAGMDGVAVERRGKMMLEFVPRNASGSGFAWNDKT 66
Query: 117 HFALSPTEVGSLLTMGPRDSSEFFH 141
F+L+ EVG LL+ P ++ E H
Sbjct: 67 IFSLTVEEVGLLLSQLPGNAVELSH 91
>gi|297836056|ref|XP_002885910.1| hypothetical protein ARALYDRAFT_899639 [Arabidopsis lyrata subsp.
lyrata]
gi|297331750|gb|EFH62169.1| hypothetical protein ARALYDRAFT_899639 [Arabidopsis lyrata subsp.
lyrata]
Length = 74
Score = 40.8 bits (94), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 26/44 (59%)
Query: 85 DSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFALSPTEVGSL 128
DS K+ + G +LL FAP+ G R+Y+W KKQ + T G L
Sbjct: 3 DSEAFKLSKDGFLLLQFAPSAGVRQYNWGKKQVWFYLLTSYGPL 46
>gi|189240430|ref|XP_971358.2| PREDICTED: similar to AGAP007300-PA [Tribolium castaneum]
Length = 508
Score = 39.7 bits (91), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 47/106 (44%), Gaps = 10/106 (9%)
Query: 68 GKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAI--GERKYDWAKKQHFALSPTEV 125
G+ AF LP K + GD + GV L+ + G+ KY W ++Q L P ++
Sbjct: 213 GRYAF-----LPNDTKGEEGDYGYRSDGVNLIEGWKKLRKGKSKYIWNREQLLGLDPQDI 267
Query: 126 GSLLTMGPRDSSEFF--HDPAMLSSNAGQMRKSLSI-KANADGFFI 168
+L + D F DPAM + ++ I N DGFF+
Sbjct: 268 DYVLGIFESDHCPFHLERDPAMDPTLTEMTESAIKILSKNPDGFFL 313
>gi|270012502|gb|EFA08950.1| hypothetical protein TcasGA2_TC006657 [Tribolium castaneum]
Length = 529
Score = 39.7 bits (91), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 47/106 (44%), Gaps = 10/106 (9%)
Query: 68 GKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAI--GERKYDWAKKQHFALSPTEV 125
G+ AF LP K + GD + GV L+ + G+ KY W ++Q L P ++
Sbjct: 237 GRYAF-----LPNDTKGEEGDYGYRSDGVNLIEGWKKLRKGKSKYIWNREQLLGLDPQDI 291
Query: 126 GSLLTMGPRDSSEFF--HDPAMLSSNAGQMRKSLSI-KANADGFFI 168
+L + D F DPAM + ++ I N DGFF+
Sbjct: 292 DYVLGIFESDHCPFHLERDPAMDPTLTEMTESAIKILSKNPDGFFL 337
>gi|145496067|ref|XP_001434025.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401147|emb|CAK66628.1| unnamed protein product [Paramecium tetraurelia]
Length = 156
Score = 37.4 bits (85), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 25/103 (24%), Positives = 48/103 (46%), Gaps = 6/103 (5%)
Query: 72 FSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAI----GERKYDWAKKQHFALSPTEVGS 127
FS+ P+ T+++ ++ +++ G IL P I G +W KK+H++L ++G
Sbjct: 2 FSMTPIKATYVEKNNY-FAIQKPGWILFELVPVIRQKEGNHHLEWDKKKHYSLGVKQIGQ 60
Query: 128 LLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADGFFISL 170
LL + E + + A +K L I A ++S+
Sbjct: 61 LLVTKNKAYDEASNFLITYQAQAND-KKLLKINKTAQDIYLSI 102
>gi|158422352|ref|YP_001523644.1| amino acid ABC transporter permease [Azorhizobium caulinodans ORS
571]
gi|158329241|dbj|BAF86726.1| amino acid ABC transporter permease protein [Azorhizobium
caulinodans ORS 571]
Length = 538
Score = 37.0 bits (84), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 67/155 (43%), Gaps = 17/155 (10%)
Query: 19 LLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGGRIFAPYYVYKGKAAFSVDPVL 78
L+ EA +R L S + ++ G DV+A G G PYY G+ D
Sbjct: 107 LVQSEAETLRPSLVSGLY----ELAVGGLDVAAPGQAGVEFTIPYYATFGQIVIRRDQ-- 160
Query: 79 PTFMKLDSGDLKVKRKGVI-------LLTFAPAIGERKYDWAKKQHFALSPTEVGSLLTM 131
P ++D+ LK GV+ L P + R +D A + + AL + +++T
Sbjct: 161 PAINRIDA--LKGMTIGVLKGSRLRDTLAAVPDVVLRVFDTAPEAYAALKAGSLEAVVTD 218
Query: 132 GPRDSSEFFHDPAMLSSN--AGQMRKSLSIKANAD 164
P HDPA++ + G++ L++KA A
Sbjct: 219 YPVALYHAQHDPAVMFTGPPVGRIEYGLALKAGAQ 253
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.133 0.381
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,635,134,105
Number of Sequences: 23463169
Number of extensions: 102324407
Number of successful extensions: 200153
Number of sequences better than 100.0: 84
Number of HSP's better than 100.0 without gapping: 73
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 200054
Number of HSP's gapped (non-prelim): 84
length of query: 183
length of database: 8,064,228,071
effective HSP length: 133
effective length of query: 50
effective length of database: 9,238,593,890
effective search space: 461929694500
effective search space used: 461929694500
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 72 (32.3 bits)