BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 027308
(225 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|302399111|gb|ADL36850.1| WHY domain class transcription factor [Malus x domestica]
Length = 276
Score = 248 bits (632), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 135/208 (64%), Positives = 155/208 (74%), Gaps = 10/208 (4%)
Query: 2 MLQLQCLSS----QTLNPKLCPFHS---LSNSKGNGFGSISVTESTSIKKKKLYVKCRQS 54
ML+L LSS Q N F S LS ++ + S + +K+L +KCRQS
Sbjct: 1 MLRLHLLSSPATAQKPNQNPSQFLSSQLLSRARVFSTNTFGFAPSPILSRKRLSLKCRQS 60
Query: 55 EYYEQK--SFSASPSNSSYAPTDVAVGT-LPTRVYVGHSIYKGKAALTVEPRGPEFVSLD 111
EY++Q+ S +ASP+ S AP A T + R YVGHSIYKGKAALTVEP+ PEF LD
Sbjct: 61 EYFDQQRTSTAASPNKPSPAPPTPAGATGMAPRFYVGHSIYKGKAALTVEPKAPEFTPLD 120
Query: 112 SGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFK 171
SGA KLSREGFV+LQFAPAAGVR YDWSRKQVFSLSVTEIGSLV+LG++ES EFFHDPFK
Sbjct: 121 SGAFKLSREGFVLLQFAPAAGVRVYDWSRKQVFSLSVTEIGSLVSLGSKESLEFFHDPFK 180
Query: 172 GKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
GKS+EGKVRKVLKVEPLPDGSGHFFNLS
Sbjct: 181 GKSDEGKVRKVLKVEPLPDGSGHFFNLS 208
>gi|225424922|ref|XP_002277278.1| PREDICTED: uncharacterized protein LOC100253653 [Vitis vinifera]
gi|296086421|emb|CBI32010.3| unnamed protein product [Vitis vinifera]
Length = 268
Score = 245 bits (626), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 132/207 (63%), Positives = 158/207 (76%), Gaps = 15/207 (7%)
Query: 2 MLQLQCLSSQ--TLNPKLCPFHSLSNSKGNGFGSIS-----VTESTSIKKKKLYVKCRQS 54
M L LSS NP+LCP HSLS+ + S + + +T +KK ++CRQS
Sbjct: 1 MHHLHLLSSSFTIQNPRLCPNHSLSSLHSSSPLSFTSRTPLLLSTTRHFRKKRSLQCRQS 60
Query: 55 EYYEQKSFS--ASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDS 112
+Y++Q++ + P++SS+ G L RV+VGHSIYKGKAALTVEP+ PEF LDS
Sbjct: 61 DYFQQQNITRRQPPNDSSFG------GALQPRVFVGHSIYKGKAALTVEPKAPEFTPLDS 114
Query: 113 GAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKG 172
GA K+S+EGFV+LQFAPAAGVRQYDW RKQVFSLSVTEIGSL++LGARESCEFFHDPFKG
Sbjct: 115 GAFKVSKEGFVLLQFAPAAGVRQYDWGRKQVFSLSVTEIGSLISLGARESCEFFHDPFKG 174
Query: 173 KSEEGKVRKVLKVEPLPDGSGHFFNLS 199
+SEEGKVRKVLKVEPLPDGSGHFFNLS
Sbjct: 175 RSEEGKVRKVLKVEPLPDGSGHFFNLS 201
>gi|255558202|ref|XP_002520128.1| conserved hypothetical protein [Ricinus communis]
gi|223540620|gb|EEF42183.1| conserved hypothetical protein [Ricinus communis]
Length = 271
Score = 241 bits (616), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 125/200 (62%), Positives = 152/200 (76%), Gaps = 13/200 (6%)
Query: 10 SQTLNPKLCPFHSLSNSKGNGFGSISVTESTSIKKK-----KLYVKCRQSEYYEQKSFSA 64
+ T NPKL ++ S+ + S + + S ++ VKCRQSE+Y+Q+
Sbjct: 7 TSTQNPKLWTPNTFSSLQSFRTTSTTFIHTISAPRRLSTNTNRAVKCRQSEFYDQQQQKY 66
Query: 65 SPS-----NSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSR 119
+PS +SS+A A +P RVYVGHSIYKGKAALTVEPR PEF +LDSGA K++R
Sbjct: 67 NPSRPSSNDSSFASQSPAA--VP-RVYVGHSIYKGKAALTVEPRAPEFAALDSGAFKVAR 123
Query: 120 EGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKV 179
EGFV+LQFAPAAGVRQYDWSRKQVFSLSVTEIG++++LGAR+SCEFFHDP KGKS+EGK+
Sbjct: 124 EGFVLLQFAPAAGVRQYDWSRKQVFSLSVTEIGTIISLGARDSCEFFHDPNKGKSDEGKI 183
Query: 180 RKVLKVEPLPDGSGHFFNLS 199
RKVLKVEPLPDGSGHFFNLS
Sbjct: 184 RKVLKVEPLPDGSGHFFNLS 203
>gi|449520335|ref|XP_004167189.1| PREDICTED: single-stranded DNA-binding protein WHY1,
chloroplastic-like [Cucumis sativus]
Length = 276
Score = 241 bits (614), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 129/209 (61%), Positives = 151/209 (72%), Gaps = 13/209 (6%)
Query: 2 MLQLQCLSSQ---TLNPKLCPFHSLSNSKGNGFGSISVTESTSIKKKKLYVKCRQS---E 55
ML+LQ LSS T NP+L P +LSN + + S T +K KC+ S +
Sbjct: 1 MLRLQWLSSSFLPTFNPELSP-ETLSNPRLCTL-NFSRPLPTLTTTRKPTPKCQYSWNTQ 58
Query: 56 YYEQKSFSASPSNSSYAPTD-----VAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSL 110
++Q +F +P S +P A LP R +VGHSIYKGKAALTVEPR PEF L
Sbjct: 59 QHQQSAFEPAPHTDSLSPQSRAGAAAAAAALPPRFFVGHSIYKGKAALTVEPRPPEFTPL 118
Query: 111 DSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPF 170
DSGA K+SREG VMLQFAPAAGVRQYDWSRKQVFSLSVTE+GSL+ALG RE+CEFFHDP+
Sbjct: 119 DSGAFKISREGLVMLQFAPAAGVRQYDWSRKQVFSLSVTELGSLIALGPREACEFFHDPY 178
Query: 171 KGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
KGKS+EGKVRK+LKVEPLPDGSGHFFNL+
Sbjct: 179 KGKSDEGKVRKILKVEPLPDGSGHFFNLT 207
>gi|15223748|ref|NP_172893.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
gi|75191428|sp|Q9M9S3.1|WHY1_ARATH RecName: Full=Single-stranded DNA-binding protein WHY1,
chloroplastic; AltName: Full=Protein PLASTID
TRANSCRIPTIONALLY ACTIVE 1; AltName: Full=Protein WHIRLY
1; Short=AtWHY1; Flags: Precursor
gi|7262683|gb|AAF43941.1|AC012188_18 Contains similarity to a hypothetical protein from Arabidopsis
thaliana gb|AC002521.2. EST gb|AI995686 comes from this
gene [Arabidopsis thaliana]
gi|12083312|gb|AAG48815.1|AF332452_1 putative DNA-binding protein p24 [Arabidopsis thaliana]
gi|13877787|gb|AAK43971.1|AF370156_1 putative DNA-binding protein p24 [Arabidopsis thaliana]
gi|16323418|gb|AAL15203.1| putative DNA-binding protein p24 [Arabidopsis thaliana]
gi|332191039|gb|AEE29160.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
Length = 263
Score = 237 bits (605), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 117/160 (73%), Positives = 136/160 (85%), Gaps = 7/160 (4%)
Query: 41 SIKKKKLY-VKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALT 99
+ K KL+ VK RQ++Y+E++ F S S+ S A LP R YVGHSIYKGKAALT
Sbjct: 42 TTKTVKLFSVKSRQTDYFEKQRFGDSSSSPSPAEG------LPARFYVGHSIYKGKAALT 95
Query: 100 VEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGA 159
V+PR PEFV+LDSGA KLS++GF++LQFAP+AGVRQYDWS+KQVFSLSVTEIG+LV+LG
Sbjct: 96 VDPRAPEFVALDSGAFKLSKDGFLLLQFAPSAGVRQYDWSKKQVFSLSVTEIGTLVSLGP 155
Query: 160 RESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
RESCEFFHDPFKGKS+EGKVRKVLKVEPLPDGSGHFFNLS
Sbjct: 156 RESCEFFHDPFKGKSDEGKVRKVLKVEPLPDGSGHFFNLS 195
>gi|388509172|gb|AFK42652.1| unknown [Lotus japonicus]
Length = 261
Score = 235 bits (600), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 111/154 (72%), Positives = 130/154 (84%), Gaps = 7/154 (4%)
Query: 49 VKCRQSEYYEQKSFSAS---PSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGP 105
++CR S+ ++QK+FS+S P+N P V+VG LP RVYVGHSIYKGKAALTV PR P
Sbjct: 44 IRCRHSDLFDQKTFSSSTPQPAN----PAAVSVGALPPRVYVGHSIYKGKAALTVTPRPP 99
Query: 106 EFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEF 165
EF LDSGA K+SREG+V+LQFAPA RQYDW+RKQVFSLSV E+GS+++LG RESCEF
Sbjct: 100 EFAPLDSGAFKISREGYVLLQFAPAIASRQYDWNRKQVFSLSVVEMGSVISLGTRESCEF 159
Query: 166 FHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
FHDP KGKS+EGKVRKVLK+EPLPDGSGHFFNLS
Sbjct: 160 FHDPLKGKSDEGKVRKVLKLEPLPDGSGHFFNLS 193
>gi|297849846|ref|XP_002892804.1| ATWHY1/PTAC1 [Arabidopsis lyrata subsp. lyrata]
gi|297338646|gb|EFH69063.1| ATWHY1/PTAC1 [Arabidopsis lyrata subsp. lyrata]
Length = 264
Score = 231 bits (590), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 117/160 (73%), Positives = 138/160 (86%), Gaps = 6/160 (3%)
Query: 41 SIKKKKLY-VKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALT 99
+ K KL+ VK RQ++Y+E++ F +SS + + A G LP R YVGHSIYKGKAALT
Sbjct: 42 TTKTVKLFSVKSRQTDYFEKQRFG----DSSSSSSQNAEG-LPARFYVGHSIYKGKAALT 96
Query: 100 VEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGA 159
+EPR PEFV+LDSGA KLS++GF++LQFAP+AGVRQYDWS+KQVFSLSVTEIG+LV+LG
Sbjct: 97 MEPRAPEFVALDSGAFKLSKDGFLLLQFAPSAGVRQYDWSKKQVFSLSVTEIGTLVSLGP 156
Query: 160 RESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
RESCEFFHDPFKGKS+EGKVRKVLKVEPLPDGSGHFFNLS
Sbjct: 157 RESCEFFHDPFKGKSDEGKVRKVLKVEPLPDGSGHFFNLS 196
>gi|145328252|ref|NP_001077872.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
gi|330250525|gb|AEC05619.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
Length = 267
Score = 230 bits (587), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 112/154 (72%), Positives = 129/154 (83%), Gaps = 7/154 (4%)
Query: 46 KLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGP 105
KL VK RQS+Y+E++ F S S+ + + R YVGHSIYKGKAALT+EPR P
Sbjct: 53 KLTVKSRQSDYFEKQRFGDSSSSQNAEVSS-------PRFYVGHSIYKGKAALTIEPRAP 105
Query: 106 EFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEF 165
EFV+L+SGA KL++EGF++LQFAPAAGVRQYDWSRKQVFSLSVTEIG+LV+LG RESCEF
Sbjct: 106 EFVALESGAFKLTKEGFLLLQFAPAAGVRQYDWSRKQVFSLSVTEIGNLVSLGPRESCEF 165
Query: 166 FHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
FHDPFKGK +EGKVRKVLKVEPLPDGSG FFNLS
Sbjct: 166 FHDPFKGKGDEGKVRKVLKVEPLPDGSGRFFNLS 199
>gi|224099743|ref|XP_002311601.1| predicted protein [Populus trichocarpa]
gi|118485247|gb|ABK94483.1| unknown [Populus trichocarpa]
gi|222851421|gb|EEE88968.1| predicted protein [Populus trichocarpa]
Length = 265
Score = 229 bits (584), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 130/205 (63%), Positives = 152/205 (74%), Gaps = 15/205 (7%)
Query: 2 MLQLQCLS---SQTLNPKLC--PFHSLSNSKGNGFGSISVTESTSI--KKKKLYVKCRQS 54
MLQL +S S NPKL ++SL ++K SIS+ TS KKK L VKC
Sbjct: 1 MLQLNSVSRVSSSPQNPKLWLPQYNSLCSTK-----SISLNSKTSSTEKKKTLGVKC--- 52
Query: 55 EYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGA 114
+YY+Q+ + + S+ S + VG P +V+VGHSIYKGKAALTVEPR PEF LDSGA
Sbjct: 53 QYYDQQHKTFTTSSRSSPSSAPPVGESPPKVFVGHSIYKGKAALTVEPRSPEFSPLDSGA 112
Query: 115 VKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKS 174
KL +EGFV+LQFAPAA VRQYDW+RKQVFSLSVTEIG LV+L A+ SCEFFHDP KGKS
Sbjct: 113 YKLVKEGFVLLQFAPAASVRQYDWTRKQVFSLSVTEIGHLVSLDAKGSCEFFHDPNKGKS 172
Query: 175 EEGKVRKVLKVEPLPDGSGHFFNLS 199
+EGKVRK+LKVEPLPDGSGHFFNLS
Sbjct: 173 DEGKVRKLLKVEPLPDGSGHFFNLS 197
>gi|297814520|ref|XP_002875143.1| ATWHY3/PTAC11 [Arabidopsis lyrata subsp. lyrata]
gi|297320981|gb|EFH51402.1| ATWHY3/PTAC11 [Arabidopsis lyrata subsp. lyrata]
Length = 268
Score = 229 bits (583), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 115/159 (72%), Positives = 134/159 (84%), Gaps = 8/159 (5%)
Query: 42 IKKKKLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVE 101
+ + KL VK RQ +Y+E++ F S S+ + A G+ P R YVGHSIYKGKAALT+E
Sbjct: 49 MARLKLSVKSRQDDYFEKQRFGDSSSSQN------AEGSSP-RFYVGHSIYKGKAALTIE 101
Query: 102 PRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARE 161
PR PEFV+L+SGA KL++EGF++LQFAPAAGVRQYDWSRKQVFSLSVTEIG+LV+LG RE
Sbjct: 102 PRAPEFVALESGAFKLTKEGFLLLQFAPAAGVRQYDWSRKQVFSLSVTEIGNLVSLGPRE 161
Query: 162 SCEFFHDPFKGK-SEEGKVRKVLKVEPLPDGSGHFFNLS 199
SCEFFHDPFKGK S+EGKVRKVLKVEPLPDGSG FFNLS
Sbjct: 162 SCEFFHDPFKGKGSDEGKVRKVLKVEPLPDGSGRFFNLS 200
>gi|42568881|ref|NP_178377.2| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
gi|75115367|sp|Q66GR6.1|WHY3_ARATH RecName: Full=Single-stranded DNA-binding protein WHY3,
chloroplastic; AltName: Full=Protein PLASTID
TRANSCRIPTIONALLY ACTIVE 11; AltName: Full=Protein
WHIRLY 3; Short=AtWHY3; Flags: Precursor
gi|51536442|gb|AAU05459.1| At2g02740 [Arabidopsis thaliana]
gi|51972072|gb|AAU15140.1| At2g02740 [Arabidopsis thaliana]
gi|330250524|gb|AEC05618.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
Length = 268
Score = 227 bits (579), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 113/155 (72%), Positives = 130/155 (83%), Gaps = 8/155 (5%)
Query: 46 KLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGP 105
KL VK RQS+Y+E++ F S S+ + + R YVGHSIYKGKAALT+EPR P
Sbjct: 53 KLTVKSRQSDYFEKQRFGDSSSSQNAEVSS-------PRFYVGHSIYKGKAALTIEPRAP 105
Query: 106 EFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEF 165
EFV+L+SGA KL++EGF++LQFAPAAGVRQYDWSRKQVFSLSVTEIG+LV+LG RESCEF
Sbjct: 106 EFVALESGAFKLTKEGFLLLQFAPAAGVRQYDWSRKQVFSLSVTEIGNLVSLGPRESCEF 165
Query: 166 FHDPFKGK-SEEGKVRKVLKVEPLPDGSGHFFNLS 199
FHDPFKGK S+EGKVRKVLKVEPLPDGSG FFNLS
Sbjct: 166 FHDPFKGKGSDEGKVRKVLKVEPLPDGSGRFFNLS 200
>gi|356567550|ref|XP_003551981.1| PREDICTED: uncharacterized protein LOC100804480 [Glycine max]
Length = 235
Score = 224 bits (572), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 103/155 (66%), Positives = 129/155 (83%), Gaps = 3/155 (1%)
Query: 45 KKLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRG 104
K ++CR S+ ++Q + +++P + +VG LP RVYVG+SIYKGKAALT+ PR
Sbjct: 48 KPFSLRCRHSDLFDQNTLASTPRPTR---PSASVGALPPRVYVGYSIYKGKAALTLTPRP 104
Query: 105 PEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCE 164
PEF+ LDSGA K+S+EG+V+LQFAPA G RQYDW+RKQVFSLSV E+GS+++LGAR+S E
Sbjct: 105 PEFMPLDSGAYKISKEGYVLLQFAPAVGTRQYDWNRKQVFSLSVGEMGSVISLGARDSYE 164
Query: 165 FFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
FFHDPFKGKS+EGKVRK+LKVEPLPDGSGHFFNLS
Sbjct: 165 FFHDPFKGKSDEGKVRKILKVEPLPDGSGHFFNLS 199
>gi|110740230|dbj|BAF02013.1| hypothetical protein [Arabidopsis thaliana]
Length = 268
Score = 224 bits (570), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 111/155 (71%), Positives = 130/155 (83%), Gaps = 8/155 (5%)
Query: 46 KLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGP 105
KL VK RQS+Y+E++ F S S+ + + R YVGHSIYKGKAALT+EPR P
Sbjct: 53 KLTVKSRQSDYFEKQRFGDSSSSQNAEVSS-------PRFYVGHSIYKGKAALTIEPRAP 105
Query: 106 EFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEF 165
EFV+L+SGA KL++EGF++LQFAPAAGVRQYDWS+K+VFSLSVTEIG+LV+LG RESCEF
Sbjct: 106 EFVALESGAFKLTKEGFLLLQFAPAAGVRQYDWSKKRVFSLSVTEIGNLVSLGPRESCEF 165
Query: 166 FHDPFKGK-SEEGKVRKVLKVEPLPDGSGHFFNLS 199
FHDPFKGK S+EGKVRKVLKVEPLPDGSG FFNLS
Sbjct: 166 FHDPFKGKGSDEGKVRKVLKVEPLPDGSGRFFNLS 200
>gi|75174555|sp|Q9LL85.1|WHY1_SOLTU RecName: Full=Single-stranded DNA-bindig protein WHY1,
chloroplastic; AltName: Full=DNA-binding protein p24;
AltName: Full=PR-10a binding factor 2; Short=PBF-2;
AltName: Full=Protein WHIRLY 1; Short=StWhy1; Flags:
Precursor
gi|9651810|gb|AAF91282.1|AF233342_1 DNA-binding protein p24 [Solanum tuberosum]
Length = 274
Score = 221 bits (564), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 114/194 (58%), Positives = 134/194 (69%), Gaps = 15/194 (7%)
Query: 14 NPKLCPFHSLSNSKGNGFGSISVTESTSIK--------KKKLYVKCRQSEYYEQKSFSAS 65
NP + S S+S F +S + S + L + CR S+Y+E +
Sbjct: 20 NPTKTSYLSFSSSINTIFAPLSSNTTKSFSGLTHKAALPRNLSLTCRHSDYFEPQQ---- 75
Query: 66 PSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVML 125
G +V+VG+SIYKGKAALTVEPR PEF LDSGA KLSREG VML
Sbjct: 76 ---QQQQQQQQPQGASTPKVFVGYSIYKGKAALTVEPRSPEFSPLDSGAFKLSREGMVML 132
Query: 126 QFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKV 185
QFAPAAGVRQYDWSRKQVFSLSVTEIGS+++LGA++SCEFFHDP KG+S+EG+VRKVLKV
Sbjct: 133 QFAPAAGVRQYDWSRKQVFSLSVTEIGSIISLGAKDSCEFFHDPNKGRSDEGRVRKVLKV 192
Query: 186 EPLPDGSGHFFNLS 199
EPLPDGSGHFFNLS
Sbjct: 193 EPLPDGSGHFFNLS 206
>gi|295913588|gb|ADG58040.1| transcription factor [Lycoris longituba]
Length = 246
Score = 218 bits (555), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 107/157 (68%), Positives = 132/157 (84%), Gaps = 3/157 (1%)
Query: 43 KKKKLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEP 102
KK+ L + CR S Y++Q+ S+S SS P+ + + +RV+VG+SIYKGKAALTVEP
Sbjct: 48 KKQNLPISCRSSNYFDQQRLSSS---SSTPPSPSSQPSSQSRVFVGYSIYKGKAALTVEP 104
Query: 103 RGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARES 162
R PEF LDSGA K+++EGF++LQFAPA G+RQYDWSRKQVFSLSV EIG+L++LGA+ES
Sbjct: 105 RAPEFAPLDSGAFKVAKEGFILLQFAPAVGMRQYDWSRKQVFSLSVVEIGTLMSLGAKES 164
Query: 163 CEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
CEFFHDPFKG+SEEGKVRK+LK EPLPDG+GHFFNLS
Sbjct: 165 CEFFHDPFKGRSEEGKVRKLLKAEPLPDGTGHFFNLS 201
>gi|157878742|pdb|1L3A|A Chain A, Structure Of The Plant Transcriptional Regulator Pbf-2
gi|157878743|pdb|1L3A|B Chain B, Structure Of The Plant Transcriptional Regulator Pbf-2
gi|157878744|pdb|1L3A|C Chain C, Structure Of The Plant Transcriptional Regulator Pbf-2
gi|157878745|pdb|1L3A|D Chain D, Structure Of The Plant Transcriptional Regulator Pbf-2
Length = 227
Score = 212 bits (539), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 99/121 (81%), Positives = 110/121 (90%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G +V+VG+SIYKGKAALTVEPR PEF LDSGA KLSREG VMLQFAPAAGVRQYDW
Sbjct: 32 GASTPKVFVGYSIYKGKAALTVEPRSPEFSPLDSGAFKLSREGMVMLQFAPAAGVRQYDW 91
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
SRKQVFSLSVTEIGS+++LG ++SCEFFHDP KG+S+EG+VRKVLKVEPLPDGSGHFFNL
Sbjct: 92 SRKQVFSLSVTEIGSIISLGTKDSCEFFHDPNKGRSDEGRVRKVLKVEPLPDGSGHFFNL 151
Query: 199 S 199
S
Sbjct: 152 S 152
>gi|224111254|ref|XP_002315793.1| predicted protein [Populus trichocarpa]
gi|222864833|gb|EEF01964.1| predicted protein [Populus trichocarpa]
Length = 191
Score = 208 bits (530), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 97/121 (80%), Positives = 108/121 (89%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G P +V+VGHSIYKGKAALT+EPR PEF L+SGA KL +EGFV+ QFAPA+ RQYDW
Sbjct: 1 GESPPKVFVGHSIYKGKAALTIEPRAPEFSPLESGAYKLVKEGFVLSQFAPASSARQYDW 60
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
+RKQVFSLSVTEIG LV+LGAR+SCEFFHDP KG+SEEGKVRKVLKVEPLPDGSGHFFNL
Sbjct: 61 TRKQVFSLSVTEIGHLVSLGARDSCEFFHDPNKGRSEEGKVRKVLKVEPLPDGSGHFFNL 120
Query: 199 S 199
S
Sbjct: 121 S 121
>gi|217071924|gb|ACJ84322.1| unknown [Medicago truncatula]
Length = 239
Score = 205 bits (521), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 99/161 (61%), Positives = 123/161 (76%), Gaps = 1/161 (0%)
Query: 39 STSIKKKKLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAAL 98
S K K ++CR S+ + + P ++ + VG LP RVYVGHSIYKGKAAL
Sbjct: 36 SLPFKFKPFTIRCRHSDVFNPSPSNPPPPATTPPNNPL-VGALPPRVYVGHSIYKGKAAL 94
Query: 99 TVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALG 158
T+ P P+FV+LDSGA K+SR+G ++LQFAP+ G RQYDW+RKQ+F LSV E+GS+++LG
Sbjct: 95 TITPTPPKFVTLDSGAYKISRDGCLLLQFAPSVGPRQYDWNRKQLFMLSVDEMGSVISLG 154
Query: 159 ARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
ARESCEFFHDPFKG S+EGKVRKVLK+EP PDGSG FFNLS
Sbjct: 155 ARESCEFFHDPFKGGSDEGKVRKVLKIEPFPDGSGFFFNLS 195
>gi|242091954|ref|XP_002436467.1| hypothetical protein SORBIDRAFT_10g003170 [Sorghum bicolor]
gi|241914690|gb|EER87834.1| hypothetical protein SORBIDRAFT_10g003170 [Sorghum bicolor]
Length = 266
Score = 190 bits (482), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 86/116 (74%), Positives = 101/116 (87%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ +SIYKGKAAL+ +PR P+FV LDSGA K+++EGFV+LQFAPA RQYDW+RKQV
Sbjct: 85 RVFTSYSIYKGKAALSFDPRPPQFVPLDSGAYKVAKEGFVLLQFAPAVATRQYDWTRKQV 144
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
FSLSV EIG+L+ LG +SCEFFHDPFKG+SEEGKVRKVLKVEP PDG+G FFNLS
Sbjct: 145 FSLSVWEIGTLLTLGPTDSCEFFHDPFKGRSEEGKVRKVLKVEPTPDGNGRFFNLS 200
>gi|413942842|gb|AFW75491.1| whirly1 [Zea mays]
Length = 205
Score = 189 bits (481), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 90/150 (60%), Positives = 112/150 (74%), Gaps = 2/150 (1%)
Query: 52 RQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLD 111
R S+Y++ ++ + Y G RV+ +SIYKGKAAL+ +PR P FV LD
Sbjct: 55 RHSDYFDPRAPPPPRGDGGYG--RPPNGAQDGRVFTSYSIYKGKAALSFDPRPPLFVPLD 112
Query: 112 SGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFK 171
SGA K+++EGFV+LQFAPA RQYDW+RKQVFSLSV EIG+L+ LG +SCEFFHDPFK
Sbjct: 113 SGAYKVAKEGFVLLQFAPAVATRQYDWTRKQVFSLSVWEIGTLLTLGPTDSCEFFHDPFK 172
Query: 172 GKSEEGKVRKVLKVEPLPDGSGHFFNLSNA 201
G+SEEGKVRKVLK+EP PDG+G FFNL +A
Sbjct: 173 GRSEEGKVRKVLKIEPTPDGNGRFFNLKSA 202
>gi|194306593|ref|NP_001123589.1| LOC100170235 [Zea mays]
gi|426021717|sp|B2LXS7.1|WHY1_MAIZE RecName: Full=Single-stranded DNA-bindig protein WHY1,
chloroplastic; AltName: Full=Protein WHIRLY 1;
Short=ZmWHY1; Flags: Precursor
gi|183229934|gb|ACC60344.1| Whirly family nucleic acid binding protein [Zea mays]
gi|194708562|gb|ACF88365.1| unknown [Zea mays]
gi|195612298|gb|ACG27979.1| DNA-binding protein p24 [Zea mays]
gi|408690350|gb|AFU81635.1| WHIRLY-type transcription factor, partial [Zea mays subsp. mays]
gi|413942843|gb|AFW75492.1| whirly1 [Zea mays]
Length = 266
Score = 189 bits (480), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 90/148 (60%), Positives = 111/148 (75%), Gaps = 2/148 (1%)
Query: 52 RQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLD 111
R S+Y++ ++ + Y G RV+ +SIYKGKAAL+ +PR P FV LD
Sbjct: 55 RHSDYFDPRAPPPPRGDGGYG--RPPNGAQDGRVFTSYSIYKGKAALSFDPRPPLFVPLD 112
Query: 112 SGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFK 171
SGA K+++EGFV+LQFAPA RQYDW+RKQVFSLSV EIG+L+ LG +SCEFFHDPFK
Sbjct: 113 SGAYKVAKEGFVLLQFAPAVATRQYDWTRKQVFSLSVWEIGTLLTLGPTDSCEFFHDPFK 172
Query: 172 GKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
G+SEEGKVRKVLK+EP PDG+G FFNLS
Sbjct: 173 GRSEEGKVRKVLKIEPTPDGNGRFFNLS 200
>gi|119638471|gb|ABL85062.1| expressed protein [Brachypodium sylvaticum]
Length = 266
Score = 189 bits (480), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 92/155 (59%), Positives = 116/155 (74%), Gaps = 4/155 (2%)
Query: 49 VKCRQSEYYEQKSFSASPSNS----SYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRG 104
V R S+Y++ ++ + P + P A G RV+ +SIYKGKAAL+ +PR
Sbjct: 46 VPARHSDYFDPRARTPPPRDEYGEPPPPPLAPAQGGQSGRVFASYSIYKGKAALSFDPRP 105
Query: 105 PEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCE 164
P+FV LDSGA K+++EGFV+LQFAPA RQYDW+RKQVFSLSV E+G+L+ LG +SCE
Sbjct: 106 PQFVPLDSGAYKVAKEGFVLLQFAPAVAARQYDWTRKQVFSLSVWEMGTLLTLGPTDSCE 165
Query: 165 FFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
FFHDPFKG+S+EGKVRKVLKVEP PDG+G FFNLS
Sbjct: 166 FFHDPFKGRSDEGKVRKVLKVEPTPDGNGRFFNLS 200
>gi|357110788|ref|XP_003557198.1| PREDICTED: uncharacterized protein LOC100824321 [Brachypodium
distachyon]
Length = 274
Score = 189 bits (479), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 85/116 (73%), Positives = 101/116 (87%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RVY +SIYKGKAAL+ +PR P+FV LDSGA K+++EGFV+LQFAPA RQYDW+RKQV
Sbjct: 93 RVYASYSIYKGKAALSFDPRPPQFVPLDSGAYKVAKEGFVLLQFAPAVAARQYDWTRKQV 152
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
FSLSV E+G+L+ LG +SCEFFHDPFKG+S+EGKVRKVLKVEP PDG+G FFNLS
Sbjct: 153 FSLSVWEMGTLLTLGPTDSCEFFHDPFKGRSDEGKVRKVLKVEPTPDGNGRFFNLS 208
>gi|326493106|dbj|BAJ85014.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519312|dbj|BAJ96655.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 267
Score = 186 bits (473), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 84/116 (72%), Positives = 101/116 (87%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ +SIYKGKAAL +PR P+FV L+SGA K+++EGFV+LQFAPA G RQYDW+RKQV
Sbjct: 86 RVFASYSIYKGKAALAFDPRPPQFVPLESGAYKVAKEGFVLLQFAPAVGPRQYDWARKQV 145
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
FSLSV E+G+L+ LG +SCEFFHDPFKG+S+EGKVRKVLKVEP PDG+G FFNLS
Sbjct: 146 FSLSVWEMGTLLTLGLTDSCEFFHDPFKGRSDEGKVRKVLKVEPTPDGNGRFFNLS 201
>gi|55296373|dbj|BAD68418.1| putative DNA-binding protein p24 [Oryza sativa Japonica Group]
gi|55297130|dbj|BAD68773.1| putative DNA-binding protein p24 [Oryza sativa Japonica Group]
gi|222634946|gb|EEE65078.1| hypothetical protein OsJ_20114 [Oryza sativa Japonica Group]
Length = 272
Score = 186 bits (472), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 86/130 (66%), Positives = 105/130 (80%)
Query: 70 SYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAP 129
+Y+P G RV+ +SIYKGKAA++++PR P+FV LDSGA K+ +EGFV+LQFAP
Sbjct: 77 AYSPPAAQGGQQNGRVFSTYSIYKGKAAMSLDPRPPQFVPLDSGAYKVVKEGFVLLQFAP 136
Query: 130 AAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLP 189
A RQYDW+RKQVFSLSV E+GSL+ LG +SCEFFHDPFKG+S+EGKVRKVLKVEP P
Sbjct: 137 AVATRQYDWTRKQVFSLSVWEMGSLLTLGPTDSCEFFHDPFKGRSDEGKVRKVLKVEPTP 196
Query: 190 DGSGHFFNLS 199
DG+ FFNLS
Sbjct: 197 DGNSRFFNLS 206
>gi|218197563|gb|EEC79990.1| hypothetical protein OsI_21637 [Oryza sativa Indica Group]
Length = 274
Score = 186 bits (472), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 86/130 (66%), Positives = 105/130 (80%)
Query: 70 SYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAP 129
+Y+P G RV+ +SIYKGKAA++++PR P+FV LDSGA K+ +EGFV+LQFAP
Sbjct: 79 AYSPPAAQGGQQNGRVFSTYSIYKGKAAMSLDPRPPQFVPLDSGAYKVVKEGFVLLQFAP 138
Query: 130 AAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLP 189
A RQYDW+RKQVFSLSV E+GSL+ LG +SCEFFHDPFKG+S+EGKVRKVLKVEP P
Sbjct: 139 AVATRQYDWTRKQVFSLSVWEMGSLLTLGPTDSCEFFHDPFKGRSDEGKVRKVLKVEPTP 198
Query: 190 DGSGHFFNLS 199
DG+ FFNLS
Sbjct: 199 DGNSRFFNLS 208
>gi|357486629|ref|XP_003613602.1| DNA-binding protein p24 [Medicago truncatula]
gi|355514937|gb|AES96560.1| DNA-binding protein p24 [Medicago truncatula]
Length = 261
Score = 184 bits (466), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 105/204 (51%), Positives = 128/204 (62%), Gaps = 16/204 (7%)
Query: 2 MLQLQCLSS-QTLNPKLCPFHSLSNSKGNGFGSISVTESTSIKKKKL----YVKCRQSE- 55
MLQLQ S T NP P HS I T SI +++ + C E
Sbjct: 3 MLQLQPPQSYTTTNPFSVPTHSF----------IINTPKKSIFLRRVGPTFSLTCHHPEL 52
Query: 56 YYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAV 115
++ + SS + +VG LP RV+V S+YKGKA L V P P+F S DSG
Sbjct: 53 FHPKPFPPPQRPQSSSSSFSSSVGELPARVHVSRSVYKGKAVLVVSPVLPKFTSSDSGTF 112
Query: 116 KLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSE 175
K+S+EG ++LQF P+AG RQYDW+RKQVFSLSV E+G+L+ LGARESCE FHDPF G+S+
Sbjct: 113 KISKEGLMLLQFVPSAGFRQYDWNRKQVFSLSVDEMGNLINLGARESCEIFHDPFMGRSD 172
Query: 176 EGKVRKVLKVEPLPDGSGHFFNLS 199
EGKVRKVLKVEPL DGSGH F LS
Sbjct: 173 EGKVRKVLKVEPLHDGSGHMFKLS 196
>gi|116783258|gb|ABK22859.1| unknown [Picea sitchensis]
Length = 259
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 65/117 (55%), Positives = 93/117 (79%)
Query: 83 TRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQ 142
++YV H++YKG+ ALT++P+ P++++L+ G V +++EG + L+FAPA G RQYDWS+K+
Sbjct: 71 NKIYVKHTVYKGEGALTMKPKLPDYITLNMGGVTVAKEGCMFLEFAPAVGPRQYDWSKKK 130
Query: 143 VFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
+ +LSV E+G+L++LG ESCEF HDPF GKSE GK+ KVLKV L D G+FFNLS
Sbjct: 131 IIALSVVEVGTLLSLGPDESCEFTHDPFMGKSEAGKIMKVLKVGNLQDTGGYFFNLS 187
>gi|116779826|gb|ABK21442.1| unknown [Picea sitchensis]
Length = 257
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 65/117 (55%), Positives = 93/117 (79%)
Query: 83 TRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQ 142
++YV H++YKG+ ALT++P+ P++++L+ G V +++EG + L+FAPA G RQYDWS+K+
Sbjct: 71 NKIYVKHTVYKGEGALTMKPKLPDYITLNMGGVTVAKEGCMFLEFAPAVGPRQYDWSKKK 130
Query: 143 VFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
+ +LSV E+G+L++LG ESCEF HDPF GKSE GK+ KVLKV L D G+FFNLS
Sbjct: 131 IIALSVVEVGTLLSLGPDESCEFTHDPFMGKSEAGKIMKVLKVGNLQDTGGYFFNLS 187
>gi|356497854|ref|XP_003517771.1| PREDICTED: uncharacterized protein LOC100797370 [Glycine max]
Length = 263
Score = 150 bits (378), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 75/125 (60%), Positives = 95/125 (76%), Gaps = 3/125 (2%)
Query: 78 VGTLPT-RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQ- 135
V LP RVYVG+S+Y K LTV PR PEF S SGA K+S+EG+V+LQFAP+ G +
Sbjct: 68 VAELPQQRVYVGYSVYTRKGVLTVTPRPPEFESKSSGAFKVSKEGYVVLQFAPSVGADEP 127
Query: 136 -YDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGH 194
YDW++KQ+FSLSV+E+G+L+ LGAR+S EF H+ K KS E +VRKVLKVEPL D +GH
Sbjct: 128 IYDWNQKQIFSLSVSEMGTLITLGARDSWEFSHETVKLKSNETEVRKVLKVEPLLDATGH 187
Query: 195 FFNLS 199
F+LS
Sbjct: 188 LFSLS 192
>gi|223945821|gb|ACN26994.1| unknown [Zea mays]
gi|413942841|gb|AFW75490.1| whirly1 [Zea mays]
Length = 177
Score = 144 bits (362), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 68/122 (55%), Positives = 87/122 (71%), Gaps = 2/122 (1%)
Query: 52 RQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLD 111
R S+Y++ ++ + Y G RV+ +SIYKGKAAL+ +PR P FV LD
Sbjct: 55 RHSDYFDPRAPPPPRGDGGYG--RPPNGAQDGRVFTSYSIYKGKAALSFDPRPPLFVPLD 112
Query: 112 SGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFK 171
SGA K+++EGFV+LQFAPA RQYDW+RKQVFSLSV EIG+L+ LG +SCEFFHDPFK
Sbjct: 113 SGAYKVAKEGFVLLQFAPAVATRQYDWTRKQVFSLSVWEIGTLLTLGPTDSCEFFHDPFK 172
Query: 172 GK 173
G+
Sbjct: 173 GR 174
>gi|302566179|pdb|3N1H|A Chain A, Crystal Structure Of Stwhy2
gi|302566180|pdb|3N1I|A Chain A, Crystal Structure Of A Stwhy2-Ere32 Complex
gi|302566182|pdb|3N1J|A Chain A, Crystal Structure Of A Stwhy2-Dt32 Complex
gi|302566184|pdb|3N1K|A Chain A, Crystal Structure Of A Stwhy2-Cere32 Complex
gi|302566186|pdb|3N1L|A Chain A, Crystal Structure Of A Stwhy2-Rcere32 Complex
Length = 178
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 64/116 (55%), Positives = 87/116 (75%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ +S++KGKAAL+ EPR P F LDSG VKL+R G +ML F P+ G R+YDW ++Q+
Sbjct: 10 RVFAPYSVFKGKAALSAEPRLPTFNRLDSGGVKLNRRGVIMLTFWPSVGERKYDWEKRQL 69
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
F+LS TE+GSL+++G R+S EFFHDP S G+VRK L ++P DGSG+F +LS
Sbjct: 70 FALSATEVGSLISMGTRDSSEFFHDPSMLSSNAGQVRKSLSIKPNADGSGYFISLS 125
>gi|426021772|sp|D9J034.1|WHY2_SOLTU RecName: Full=Single-stranded DNA-bindig protein WHY2,
mitochondrial; AltName: Full=Protein WHIRLY 2;
Short=StWHY2; Flags: Precursor
gi|298359665|gb|ADI77438.1| Why2 protein [Solanum tuberosum]
Length = 238
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 64/116 (55%), Positives = 87/116 (75%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ +S++KGKAAL+ EPR P F LDSG VKL+R G +ML F P+ G R+YDW ++Q+
Sbjct: 56 RVFAPYSVFKGKAALSAEPRLPTFNRLDSGGVKLNRRGVIMLTFWPSVGERKYDWEKRQL 115
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
F+LS TE+GSL+++G R+S EFFHDP S G+VRK L ++P DGSG+F +LS
Sbjct: 116 FALSATEVGSLISMGTRDSSEFFHDPSMLSSNAGQVRKSLSIKPNADGSGYFISLS 171
>gi|297605166|ref|NP_001056790.2| Os06g0145800 [Oryza sativa Japonica Group]
gi|255676711|dbj|BAF18704.2| Os06g0145800 [Oryza sativa Japonica Group]
Length = 190
Score = 142 bits (359), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 65/104 (62%), Positives = 82/104 (78%)
Query: 70 SYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAP 129
+Y+P G RV+ +SIYKGKAA++++PR P+FV LDSGA K+ +EGFV+LQFAP
Sbjct: 77 AYSPPAAQGGQQNGRVFSTYSIYKGKAAMSLDPRPPQFVPLDSGAYKVVKEGFVLLQFAP 136
Query: 130 AAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGK 173
A RQYDW+RKQVFSLSV E+GSL+ LG +SCEFFHDPFKG+
Sbjct: 137 AVATRQYDWTRKQVFSLSVWEMGSLLTLGPTDSCEFFHDPFKGR 180
>gi|347948612|pdb|3R9Y|A Chain A, Crystal Structure Of Stwhy2 K67a (Form I)
gi|347948613|pdb|3R9Z|A Chain A, Crystal Structure Of Stwhy2 K67a (Form Ii)
gi|347948614|pdb|3RA0|A Chain A, Crystal Structure Of A Stwhy2 K67a-Dt32 Complex
Length = 178
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 63/116 (54%), Positives = 86/116 (74%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ +S++KG AAL+ EPR P F LDSG VKL+R G +ML F P+ G R+YDW ++Q+
Sbjct: 10 RVFAPYSVFKGAAALSAEPRLPTFNRLDSGGVKLNRRGVIMLTFWPSVGERKYDWEKRQL 69
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
F+LS TE+GSL+++G R+S EFFHDP S G+VRK L ++P DGSG+F +LS
Sbjct: 70 FALSATEVGSLISMGTRDSSEFFHDPSMLSSNAGQVRKSLSIKPNADGSGYFISLS 125
>gi|356500463|ref|XP_003519051.1| PREDICTED: uncharacterized protein LOC100775220 [Glycine max]
Length = 263
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/117 (59%), Positives = 88/117 (75%), Gaps = 2/117 (1%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQ--YDWSRK 141
RVYVG+S+Y K LTV PR PEF S SGA K+S+EG+V+LQFAP+ G + YDW+ K
Sbjct: 75 RVYVGYSVYTKKGMLTVIPRPPEFESKSSGAFKVSKEGYVVLQFAPSVGADEPIYDWNHK 134
Query: 142 QVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
Q FSLSV+E+G+L+ LGAR+S EF H+ K KS + VRKVLKVEPL D +GH F+L
Sbjct: 135 QTFSLSVSEMGTLIILGARDSWEFSHETVKLKSSKIDVRKVLKVEPLLDATGHLFSL 191
>gi|449533266|ref|XP_004173597.1| PREDICTED: single-stranded DNA-bindig protein WHY2,
mitochondrial-like, partial [Cucumis sativus]
Length = 156
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 63/116 (54%), Positives = 86/116 (74%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ + +YKGKAAL++EP P F ++SG + R G +ML FAPA G R+YDW+RKQ+
Sbjct: 12 RVFASYYVYKGKAALSMEPCMPTFTKVESGNFIMDRRGSIMLTFAPAVGERKYDWTRKQL 71
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
F+LS TEIGSL++LG R+SCEFFHDP S G+VRK L ++ DG+G+FF+L+
Sbjct: 72 FALSATEIGSLISLGPRDSCEFFHDPGMLSSTAGQVRKSLAIKAHTDGNGYFFSLN 127
>gi|449447529|ref|XP_004141520.1| PREDICTED: single-stranded DNA-bindig protein WHY2,
mitochondrial-like [Cucumis sativus]
Length = 241
Score = 139 bits (351), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 68/139 (48%), Positives = 94/139 (67%)
Query: 61 SFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSRE 120
S +A S+S+ T RV+ + +YKGKAAL++EP P F ++SG + R
Sbjct: 35 SSNAGISDSTQNFTRTVTKNAGGRVFASYHVYKGKAALSMEPCMPTFTKVESGNFIMDRR 94
Query: 121 GFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVR 180
G +ML FAPA G R+YDW+RKQ+F+LS TEIGSL++LG R+SCEFFHDP S G+VR
Sbjct: 95 GSIMLTFAPAVGERKYDWTRKQLFALSATEIGSLISLGPRDSCEFFHDPGMLSSTAGQVR 154
Query: 181 KVLKVEPLPDGSGHFFNLS 199
K L ++ DG+G+FF+L+
Sbjct: 155 KSLAIKAHTDGNGYFFSLN 173
>gi|22330568|ref|NP_177282.2| protein WHIRLY 2 [Arabidopsis thaliana]
gi|75161474|sp|Q8VYF7.1|WHY2_ARATH RecName: Full=Single-stranded DNA-binding protein WHY2,
mitochondrial; AltName: Full=Protein WHIRLY 2;
Short=AtWHY2; Flags: Precursor
gi|18175814|gb|AAL59932.1| unknown protein [Arabidopsis thaliana]
gi|21689867|gb|AAM67494.1| unknown protein [Arabidopsis thaliana]
gi|225898076|dbj|BAH30370.1| hypothetical protein [Arabidopsis thaliana]
gi|332197060|gb|AEE35181.1| protein WHIRLY 2 [Arabidopsis thaliana]
Length = 238
Score = 135 bits (341), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 61/116 (52%), Positives = 86/116 (74%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
R++ +SI+KGKAAL+VEP P F +DSG +++ R G +M+ F PA G R+YDW +KQ
Sbjct: 53 RLFAPYSIFKGKAALSVEPVLPSFTEIDSGNLRIDRRGSLMMTFMPAIGERKYDWEKKQK 112
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
F+LS TE+GSL+++G+++S EFFHDP S G+VRK L V+P DGSG+F +LS
Sbjct: 113 FALSPTEVGSLISMGSKDSSEFFHDPSMKSSNAGQVRKSLSVKPHADGSGYFISLS 168
>gi|12323827|gb|AAG51881.1|AC016162_2 unknown protein; 79476-81015 [Arabidopsis thaliana]
Length = 237
Score = 135 bits (341), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 61/116 (52%), Positives = 86/116 (74%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
R++ +SI+KGKAAL+VEP P F +DSG +++ R G +M+ F PA G R+YDW +KQ
Sbjct: 52 RLFAPYSIFKGKAALSVEPVLPSFTEIDSGNLRIDRRGSLMMTFMPAIGERKYDWEKKQK 111
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
F+LS TE+GSL+++G+++S EFFHDP S G+VRK L V+P DGSG+F +LS
Sbjct: 112 FALSPTEVGSLISMGSKDSSEFFHDPSMKSSNAGQVRKSLSVKPHADGSGYFISLS 167
>gi|297841891|ref|XP_002888827.1| ATWHY2 [Arabidopsis lyrata subsp. lyrata]
gi|297334668|gb|EFH65086.1| ATWHY2 [Arabidopsis lyrata subsp. lyrata]
Length = 242
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 60/116 (51%), Positives = 86/116 (74%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
R++ +SI+KGKAAL+VEP P F +DSG +++ R G +M+ F PA G R+YDW +KQ
Sbjct: 57 RLFAPYSIFKGKAALSVEPVLPSFTEIDSGNLRIDRRGSLMMTFMPAIGERKYDWEKKQK 116
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
F+LS TE+GSL+++G+++S EFFHDP S G+VRK L ++P DGSG+F +LS
Sbjct: 117 FALSPTEVGSLISMGSKDSSEFFHDPSMKSSNAGQVRKSLSIKPHADGSGYFISLS 172
>gi|388498336|gb|AFK37234.1| unknown [Lotus japonicus]
Length = 235
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 61/126 (48%), Positives = 87/126 (69%)
Query: 74 TDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGV 133
T VA G RV+ + +YKGKAA+++ P P F LDSGA+ + R G +M+ F PA G
Sbjct: 40 TYVAKGYTTDRVFAPYYVYKGKAAMSLSPVLPTFTKLDSGALVVERRGSIMMVFTPAIGE 99
Query: 134 RQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSG 193
R+YDW ++Q F+LS TE+GSL+A+G ++SCEFFHDP S G+VRK L ++P + SG
Sbjct: 100 RKYDWEKRQKFALSATEVGSLIAMGPQDSCEFFHDPSMSSSNAGQVRKSLSIKPHANSSG 159
Query: 194 HFFNLS 199
+F +L+
Sbjct: 160 YFVSLT 165
>gi|448278892|gb|AGE44298.1| whirly transcription factor domain containing protein [Musa AB
Group]
Length = 245
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 65/132 (49%), Positives = 88/132 (66%)
Query: 68 NSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQF 127
+SS P G+ R YV ++++KGKAAL+V P P F +DSG ++ ++G V+L F
Sbjct: 44 SSSVRPPFSPTGSSSVRRYVEYTVFKGKAALSVSPILPTFREVDSGVSRVHKKGCVILTF 103
Query: 128 APAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEP 187
PA G R+YDW +KQ F+LS TE+GSL+ LG ESCEFFHDP S EG+V+K L + P
Sbjct: 104 WPAIGQRKYDWQKKQAFALSPTEVGSLIGLGPAESCEFFHDPSMKSSLEGQVKKSLSISP 163
Query: 188 LPDGSGHFFNLS 199
L D +G+ NLS
Sbjct: 164 LNDKAGYLLNLS 175
>gi|302399107|gb|ADL36848.1| WHY domain class transcription factor [Malus x domestica]
Length = 237
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 84/116 (72%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
+VY I+KGKAAL++ P P F L+SG++ + R G VML+F PA G R+YDW ++Q+
Sbjct: 52 QVYASFDIFKGKAALSLTPVLPTFTKLESGSLVVDRRGSVMLKFTPAIGERKYDWEKRQM 111
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
F+LS TE+G+L++LG+ +SCE FHDP S G+VRK L ++P DGSG+F +L+
Sbjct: 112 FALSATEVGALISLGSNDSCELFHDPSMKSSNAGQVRKSLSIKPHADGSGYFVSLT 167
>gi|357512363|ref|XP_003626470.1| hypothetical protein MTR_7g116270 [Medicago truncatula]
gi|355501485|gb|AES82688.1| hypothetical protein MTR_7g116270 [Medicago truncatula]
gi|388497124|gb|AFK36628.1| unknown [Medicago truncatula]
Length = 226
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 61/136 (44%), Positives = 93/136 (68%), Gaps = 6/136 (4%)
Query: 64 ASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFV 123
++ +N++Y+ A G R++ +S+YKGKAA ++ P P F LDSGA+ + R G +
Sbjct: 30 STATNNNYS----AKGYTSDRIFAPYSVYKGKAAFSLSPCLPTFTKLDSGALVVDRHGSI 85
Query: 124 MLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVL 183
M+ F PA G R+YDW ++Q+F+LS TE+GSL+A+G ++SCEFFHDP S G+VRK L
Sbjct: 86 MMSFMPAIGERKYDWEKRQIFALSATEVGSLIAIGPQDSCEFFHDPSMKSSNAGQVRKSL 145
Query: 184 KVEPLPDGSGHFFNLS 199
++ P +G+F +LS
Sbjct: 146 SIK--PHSNGYFVSLS 159
>gi|302764906|ref|XP_002965874.1| hypothetical protein SELMODRAFT_84708 [Selaginella moellendorffii]
gi|302802736|ref|XP_002983122.1| hypothetical protein SELMODRAFT_117665 [Selaginella moellendorffii]
gi|300149275|gb|EFJ15931.1| hypothetical protein SELMODRAFT_117665 [Selaginella moellendorffii]
gi|300166688|gb|EFJ33294.1| hypothetical protein SELMODRAFT_84708 [Selaginella moellendorffii]
Length = 226
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 68/123 (55%), Positives = 82/123 (66%), Gaps = 1/123 (0%)
Query: 80 TLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWS 139
T P RV+ H YKGK AL + P F DSG LSREG VML+FAP+ RQYDW
Sbjct: 27 TRPRRVFADHVFYKGKCALNMRLIKPTFKISDSGDAILSREGTVMLEFAPSISQRQYDWG 86
Query: 140 RKQV-FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
+KQV F+LSV+E+G ++AL ES EFFHDP GKS+ G VRK LK+EP D +G FF L
Sbjct: 87 KKQVLFALSVSELGQILALTPSESLEFFHDPNMGKSDAGMVRKSLKIEPTTDRNGFFFGL 146
Query: 199 SNA 201
+ A
Sbjct: 147 TVA 149
>gi|225459963|ref|XP_002267315.1| PREDICTED: uncharacterized protein LOC100258449 [Vitis vinifera]
gi|297734756|emb|CBI16990.3| unnamed protein product [Vitis vinifera]
Length = 231
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 63/121 (52%), Positives = 82/121 (67%), Gaps = 1/121 (0%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G P RVY + +YKGKA+LTV P P+F LDSG +K+ R G +MLQF+PA G R+YDW
Sbjct: 52 GNYPDRVYAPYCVYKGKASLTVYPVLPKFSRLDSGGLKVDRHGVMMLQFSPAVGERKYDW 111
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
+KQ F+LS E+GSL++L CEFFHDP S G+VRK L V+ + DG +F +L
Sbjct: 112 EKKQFFALSAVEVGSLLSLSPGGGCEFFHDPSMKTSNAGQVRKSLSVKSM-DGGSYFLSL 170
Query: 199 S 199
S
Sbjct: 171 S 171
>gi|242064094|ref|XP_002453336.1| hypothetical protein SORBIDRAFT_04g004060 [Sorghum bicolor]
gi|241933167|gb|EES06312.1| hypothetical protein SORBIDRAFT_04g004060 [Sorghum bicolor]
Length = 230
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 59/119 (49%), Positives = 86/119 (72%)
Query: 81 LPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSR 140
L ++ Y ++++KGKAAL+++P P F L+SG ++SR G +ML F PA G R+YD+++
Sbjct: 44 LSSKKYASYTVFKGKAALSIQPILPSFSKLESGGSRVSRNGSIMLTFFPAVGPRKYDFTK 103
Query: 141 KQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
KQ+F+LS TE+GSL++LG ESCEFFHDP S EG V+K L + PL SG+F N++
Sbjct: 104 KQLFALSPTEVGSLISLGPAESCEFFHDPSMKSSNEGMVKKSLSITPLGSDSGYFVNIT 162
>gi|357148896|ref|XP_003574931.1| PREDICTED: uncharacterized protein LOC100825843 [Brachypodium
distachyon]
Length = 231
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 57/116 (49%), Positives = 85/116 (73%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
R + ++++KGKAAL++ P P F ++SG ++ + G VML F PA G RQYD+S+KQ+
Sbjct: 47 RKFASYTVFKGKAALSISPILPNFTKIESGGSRVKKNGSVMLTFFPAVGQRQYDYSKKQL 106
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
F+LS TE+GSL++LG+ ESCEFFHDP S EG+V+K L + PL + +G+F N++
Sbjct: 107 FALSPTEVGSLISLGSAESCEFFHDPSMKSSHEGQVKKSLSITPLGNDNGYFVNIT 162
>gi|115444353|ref|NP_001045956.1| Os02g0158400 [Oryza sativa Japonica Group]
gi|50251252|dbj|BAD28032.1| putative Chain C, Structure Of The Plant Transcriptional Regulator
Pbf-2 [Oryza sativa Japonica Group]
gi|50252182|dbj|BAD28177.1| putative Chain C, Structure Of The Plant Transcriptional Regulator
Pbf-2 [Oryza sativa Japonica Group]
gi|113535487|dbj|BAF07870.1| Os02g0158400 [Oryza sativa Japonica Group]
gi|215692593|dbj|BAG88013.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704516|dbj|BAG94149.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740785|dbj|BAG96941.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218190103|gb|EEC72530.1| hypothetical protein OsI_05924 [Oryza sativa Indica Group]
gi|222622212|gb|EEE56344.1| hypothetical protein OsJ_05450 [Oryza sativa Japonica Group]
Length = 228
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 59/116 (50%), Positives = 86/116 (74%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
R + ++++KGKAAL+++P P F L+SG ++++ G VML F PA G R+YD+S+KQ+
Sbjct: 45 RKFASYTVFKGKAALSMQPILPSFSKLESGGSRVNKNGSVMLTFFPAVGQRKYDYSKKQL 104
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
F+LS TE+GSL++LG ESCEFFHDP S EG+V+K L V PL + SG+F N++
Sbjct: 105 FALSPTEVGSLISLGPAESCEFFHDPSMKSSHEGQVKKSLSVTPLGNDSGYFLNIT 160
>gi|195627490|gb|ACG35575.1| DNA binding protein [Zea mays]
Length = 230
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 59/121 (48%), Positives = 85/121 (70%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G L + + ++++KGKAAL++ P P F L+SG ++S+ G VML F PA G R+YD+
Sbjct: 42 GNLSGKKFASYTVFKGKAALSIHPILPSFSKLESGGSRVSKNGSVMLTFFPAVGQRKYDY 101
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
++KQ+F+LS TE+GSL++LG ESCEFFHDP S EG V+K L + PL SG+F N+
Sbjct: 102 TKKQLFALSPTEVGSLISLGPAESCEFFHDPSMKSSNEGTVKKSLSITPLGSDSGYFVNI 161
Query: 199 S 199
+
Sbjct: 162 T 162
>gi|194703090|gb|ACF85629.1| unknown [Zea mays]
gi|323388661|gb|ADX60135.1| PBF-2 like transcription factor [Zea mays]
gi|323388771|gb|ADX60190.1| PBF-2 like transcription factor [Zea mays]
Length = 232
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 59/121 (48%), Positives = 85/121 (70%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G L + + ++++KGKAAL++ P P F L+SG ++S+ G VML F PA G R+YD+
Sbjct: 42 GNLSGKKFASYTVFKGKAALSIHPILPSFSKLESGGSRVSKNGSVMLTFFPAVGQRKYDY 101
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
++KQ+F+LS TE+GSL++LG ESCEFFHDP S EG V+K L + PL SG+F N+
Sbjct: 102 TKKQLFALSPTEVGSLISLGPAESCEFFHDPSMKSSNEGTVKKSLSITPLGSDSGYFVNI 161
Query: 199 S 199
+
Sbjct: 162 T 162
>gi|118484514|gb|ABK94132.1| unknown [Populus trichocarpa]
Length = 229
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 59/116 (50%), Positives = 83/116 (71%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ +S++KGKAAL+VEP P F SG +++ R G +ML F PA G R+YD+ ++Q
Sbjct: 47 RVFAPYSVFKGKAALSVEPVLPTFSKFGSGNLRVDRRGSMMLTFLPAIGERKYDYEKRQK 106
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
F+LS TE+GSL++ G ++SCEFFHDP S G+VRK L ++P DGSG+F +LS
Sbjct: 107 FALSATEVGSLISTGPKDSCEFFHDPSMLSSNAGQVRKNLSIKPHADGSGYFVSLS 162
>gi|413926543|gb|AFW66475.1| DNA binding protein isoform 1 [Zea mays]
gi|413926544|gb|AFW66476.1| DNA binding protein isoform 2 [Zea mays]
Length = 274
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 59/121 (48%), Positives = 85/121 (70%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G L + + ++++KGKAAL++ P P F L+SG ++S+ G VML F PA G R+YD+
Sbjct: 84 GNLSGKKFASYTVFKGKAALSIHPILPSFSKLESGGSRVSKNGSVMLTFFPAVGQRKYDY 143
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
++KQ+F+LS TE+GSL++LG ESCEFFHDP S EG V+K L + PL SG+F N+
Sbjct: 144 TKKQLFALSPTEVGSLISLGPAESCEFFHDPSMKSSNEGTVKKSLSITPLGSDSGYFVNI 203
Query: 199 S 199
+
Sbjct: 204 T 204
>gi|224070977|ref|XP_002303313.1| predicted protein [Populus trichocarpa]
gi|222840745|gb|EEE78292.1| predicted protein [Populus trichocarpa]
Length = 183
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 59/116 (50%), Positives = 83/116 (71%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ +S++KGKAAL+VEP P F SG +++ R G +ML F PA G R+YD+ ++Q
Sbjct: 1 RVFAPYSVFKGKAALSVEPVLPTFSKFGSGNLRVDRRGSMMLTFLPAIGERKYDYEKRQK 60
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
F+LS TE+GSL++ G ++SCEFFHDP S G+VRK L ++P DGSG+F +LS
Sbjct: 61 FALSATEVGSLISTGPKDSCEFFHDPSMLSSNAGQVRKNLSIKPHADGSGYFVSLS 116
>gi|226506170|ref|NP_001152589.1| LOC100286229 [Zea mays]
gi|195657845|gb|ACG48390.1| DNA binding protein [Zea mays]
Length = 232
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 58/121 (47%), Positives = 84/121 (69%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G L + + ++++KGKAAL++ P F L+SG ++S+ G VML F PA G R+YD+
Sbjct: 42 GNLSGKKFASYTVFKGKAALSIHPILXSFSKLESGGSRVSKNGSVMLTFFPAVGQRKYDY 101
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
++KQ+F+LS TE+GSL++LG ESCEFFHDP S EG V+K L + PL SG+F N+
Sbjct: 102 TKKQLFALSPTEVGSLISLGPAESCEFFHDPSMKSSNEGTVKKSLSITPLGSDSGYFVNI 161
Query: 199 S 199
+
Sbjct: 162 T 162
>gi|255632067|gb|ACU16386.1| unknown [Glycine max]
Length = 264
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 66/178 (37%), Positives = 104/178 (58%), Gaps = 10/178 (5%)
Query: 38 ESTSIKKKKLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAA 97
E S +K ++ + RQ E + F+ S + S+ A G R++ +++YKGKAA
Sbjct: 20 EVLSSRKVEVGDRLRQ----EGRDFTYSAAISTATNNYAAKGHASDRIFAPYTVYKGKAA 75
Query: 98 LTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVAL 157
++ P P F LDSG V + R G +M+ F + G R+YDW ++Q F+LS TE+GSL+ +
Sbjct: 76 FSLIPCLPTFTKLDSGTVVVDRRGSIMMSFMHSIGERKYDWDKRQKFALSATEVGSLITM 135
Query: 158 GARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSNALFLELLAVNLEDYL 215
A++SC+FFHDP S G+VRK L ++ P +G+F +L+ L +N +DY
Sbjct: 136 DAQDSCDFFHDPSMLSSNAGQVRKSLSIK--PHANGYFVSLTVVNNL----LNTKDYF 187
>gi|356573153|ref|XP_003554728.1| PREDICTED: uncharacterized protein LOC100817863 [Glycine max]
Length = 235
Score = 119 bits (299), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 64/164 (39%), Positives = 102/164 (62%), Gaps = 10/164 (6%)
Query: 40 TSIKKKKL--YVKCRQSEYYEQKSFSA--SPSNSSYAPTDVAVGTLPTRVYVGHSIYKGK 95
TS + +L + R+ E ++ S SA S + ++YA A G R++ +++YKGK
Sbjct: 11 TSTSRHRLLEVLSSRKVEVGDRLSHSAGISTATNNYA----AKGYASDRIFAPYTVYKGK 66
Query: 96 AALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLV 155
AA ++ P P F L+SG V + R G +M+ F + G R+YDW ++Q F+LS TE+GSL+
Sbjct: 67 AAFSLIPCLPTFTKLNSGTVVVDRRGSIMMTFMHSIGERKYDWEKRQRFALSATEVGSLI 126
Query: 156 ALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
+GA++SC+FFHDP S G+VRK L ++ P +G+F +L+
Sbjct: 127 TMGAQDSCDFFHDPSMLSSNAGQVRKSLSIK--PHANGYFVSLT 168
>gi|255637711|gb|ACU19178.1| unknown [Glycine max]
Length = 235
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/164 (39%), Positives = 101/164 (61%), Gaps = 10/164 (6%)
Query: 40 TSIKKKKL--YVKCRQSEYYEQKSFSA--SPSNSSYAPTDVAVGTLPTRVYVGHSIYKGK 95
TS + +L + R+ E ++ S SA S ++YA A G R++ +++YKGK
Sbjct: 11 TSTSRHRLLEVLSSRKVEVGDRLSHSAGISTVTNNYA----AKGYASDRIFAPYTVYKGK 66
Query: 96 AALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLV 155
AA ++ P P F L+SG V + R G +M+ F + G R+YDW ++Q F+LS TE+GSL+
Sbjct: 67 AAFSLIPCLPTFTKLNSGTVVVDRRGSIMMTFMHSIGERKYDWEKRQRFALSATEVGSLI 126
Query: 156 ALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
+GA++SC+FFHDP S G+VRK L ++ P +G+F +L+
Sbjct: 127 TMGAQDSCDFFHDPSMLSSNAGQVRKSLSIK--PHANGYFVSLT 168
>gi|224286043|gb|ACN40733.1| unknown [Picea sitchensis]
Length = 184
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 53/87 (60%), Positives = 68/87 (78%)
Query: 113 GAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKG 172
G V +++EG + L+FAPA G RQYDWS+K++ +LSV E+G+L++LG ESCEF HDPF G
Sbjct: 28 GGVTVAKEGCMFLEFAPAVGPRQYDWSKKKIIALSVVEVGTLLSLGPDESCEFTHDPFMG 87
Query: 173 KSEEGKVRKVLKVEPLPDGSGHFFNLS 199
KSE GK+ KVLKV L D G+FFNLS
Sbjct: 88 KSEAGKIMKVLKVGNLQDTGGYFFNLS 114
>gi|147819709|emb|CAN74120.1| hypothetical protein VITISV_034895 [Vitis vinifera]
Length = 185
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 51/91 (56%), Positives = 65/91 (71%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G P RVY + +YKGKA+LTV P P+F LDSG +K+ R G +MLQF+PA G R+YDW
Sbjct: 52 GNYPDRVYAPYCVYKGKASLTVYPVLPKFSRLDSGGLKVDRHGVMMLQFSPAVGERKYDW 111
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDP 169
+KQ F+LS E+GSL++L CEFFHDP
Sbjct: 112 EKKQFFALSAVEVGSLLSLSPGGGCEFFHDP 142
>gi|412985999|emb|CCO17199.1| predicted protein [Bathycoccus prasinos]
Length = 419
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 58/135 (42%), Positives = 78/135 (57%), Gaps = 7/135 (5%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
R +V SIYK K+AL+V+ P F + G + R G ++L+FAP+ G R+YDW+RK
Sbjct: 238 RTFVDFSIYKSKSALSVKLVKPTFETDHQGRTIMKRSGGILLEFAPSIGTRKYDWTRKGS 297
Query: 144 FSLSVTEIGSLVAL--GARES----CEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGH-FF 196
F LS E L +R+S EFFHDP G S +G V K LK+E +PDG+G F
Sbjct: 298 FMLSPIEAAELANRLNPSRQSIAQKVEFFHDPGMGGSSQGSVTKSLKMEAMPDGTGGVFL 357
Query: 197 NLSNALFLELLAVNL 211
N + F E L VN+
Sbjct: 358 NYNQTKFGEKLNVNI 372
>gi|303289333|ref|XP_003063954.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454270|gb|EEH51576.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 302
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 61/146 (41%), Positives = 84/146 (57%), Gaps = 10/146 (6%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAP---AAGVRQYDWSR 140
+VY +I+K K A+ + P F L +G+ + R+G + L+FAP AG +QYDWSR
Sbjct: 139 KVYCNFAIHKSKTAVQMSAIKPTFELLPNGSKQKKRDGAMFLEFAPVAAGAGQKQYDWSR 198
Query: 141 KQVFSLSVTEIGSLV-ALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGH-FFNL 198
KQ SLS E L AL A FFHDP+ G S +G+ K LK EP+PDGSG F NL
Sbjct: 199 KQSISLSPLEFMELSEALAANRGVNFFHDPWMGTSRQGETTKSLKAEPMPDGSGGIFLNL 258
Query: 199 ---SNALFLELL--AVNLEDYLLFDD 219
S +E L AV+ +++ +F +
Sbjct: 259 TVASGGGRVEKLNIAVSFQEFAVFRE 284
>gi|307106759|gb|EFN55004.1| hypothetical protein CHLNCDRAFT_134820 [Chlorella variabilis]
Length = 1274
Score = 97.1 bits (240), Expect = 4e-18, Method: Composition-based stats.
Identities = 51/117 (43%), Positives = 78/117 (66%), Gaps = 3/117 (2%)
Query: 85 VYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVF 144
VY ++IYKGKAA+ ++ P + S+ SG +K+SREG ++L+FA + G +QYDW++K+ F
Sbjct: 1097 VYSDYTIYKGKAAMAIKVIKPTWESIGSG-LKISREGTLLLEFAASVGPQQYDWTKKETF 1155
Query: 145 SLSVTEIGSLV-ALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSN 200
LS E +++ A AR+S E HDP KG+ EG V K + P PD G FF++++
Sbjct: 1156 GLSALECAAVLEAADARQSFEALHDPNKGRGGEGTVYKKFNMRPAPD-KGWFFSIAS 1211
>gi|223947295|gb|ACN27731.1| unknown [Zea mays]
gi|413942840|gb|AFW75489.1| whirly1 [Zea mays]
Length = 153
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/92 (50%), Positives = 61/92 (66%), Gaps = 2/92 (2%)
Query: 52 RQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLD 111
R S+Y++ ++ + Y G RV+ +SIYKGKAAL+ +PR P FV LD
Sbjct: 55 RHSDYFDPRAPPPPRGDGGYG--RPPNGAQDGRVFTSYSIYKGKAALSFDPRPPLFVPLD 112
Query: 112 SGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
SGA K+++EGFV+LQFAPA RQYDW+RKQV
Sbjct: 113 SGAYKVAKEGFVLLQFAPAVATRQYDWTRKQV 144
>gi|255084617|ref|XP_002508883.1| predicted protein [Micromonas sp. RCC299]
gi|226524160|gb|ACO70141.1| predicted protein [Micromonas sp. RCC299]
Length = 347
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 54/138 (39%), Positives = 75/138 (54%), Gaps = 6/138 (4%)
Query: 60 KSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSR 119
+ F+A + +S A D +RVY +++YK KAA + P F G+ R
Sbjct: 163 RQFTAGNNKASSANDDAG-----SRVYCDYAVYKSKAAAKFQVIKPTFEVKPDGSRAKKR 217
Query: 120 EGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLV-ALGARESCEFFHDPFKGKSEEGK 178
+G V+L+ APA G RQYDW++KQ LS E+ L +L FFHDP G + +G
Sbjct: 218 DGGVLLEMAPAVGPRQYDWAQKQTIMLSPLELVELTESLHFGRGVNFFHDPGMGTNRQGA 277
Query: 179 VRKVLKVEPLPDGSGHFF 196
+ K LK EP+PDGSG F
Sbjct: 278 MTKSLKAEPMPDGSGGIF 295
>gi|255561490|ref|XP_002521755.1| conserved hypothetical protein [Ricinus communis]
gi|223538968|gb|EEF40565.1| conserved hypothetical protein [Ricinus communis]
Length = 173
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 41/92 (44%), Positives = 65/92 (70%)
Query: 109 SLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHD 168
++ SG +K+ R G ++L F PA G R+YD+ ++Q F+LS TE+GSL++LG ++S + FHD
Sbjct: 76 TVQSGHLKVERRGVILLTFLPAIGERKYDYEKRQSFALSTTEVGSLISLGPKDSFDCFHD 135
Query: 169 PFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSN 200
P S G+VRK L ++P +G G+F +LS+
Sbjct: 136 PGMLSSNAGEVRKSLSLKPHAEGGGYFISLSS 167
>gi|145355417|ref|XP_001421958.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582197|gb|ABP00252.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 152
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 60/112 (53%), Gaps = 1/112 (0%)
Query: 86 YVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFS 145
+V + +YK K AL ++ F S SG + R G V+L+ A A RQYDW K F
Sbjct: 1 FVDYGVYKTKGALKLKAVRATFESDASGRRIMKRAGGVLLELANATAPRQYDWGNKGSFM 60
Query: 146 LSVTEIGSLV-ALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFF 196
LS TE L + + C FFHDP G + G V K KVEP+PDGSG F
Sbjct: 61 LSATEAAELADRMASNAPCSFFHDPGAGGANRGNVNKAFKVEPMPDGSGGLF 112
>gi|384244785|gb|EIE18283.1| ssDNA-binding transcriptional regulator [Coccomyxa subellipsoidea
C-169]
Length = 250
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 68/129 (52%), Gaps = 19/129 (14%)
Query: 85 VYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAA-------GVRQYD 137
+Y ++IYKGKAA + R P +V G++ L R G ++++FAP A G R Y
Sbjct: 69 LYANYAIYKGKAAASFRVRKPRWVEAQDGSISLDRAGSLIVEFAPVAPGSGTNVGNRSYQ 128
Query: 138 WSRKQVFSLSVTEIGSLVALGARESC-------EFFHDPFKGKSEEGKVRKVLKVEPLPD 190
W +KQ F+LS E+ LV ESC E FHDP KG ++ GK+ K L ++
Sbjct: 129 WDKKQTFALSPVELAGLV-----ESCTTGKSMKELFHDPNKGGTDSGKIAKTLSLQRFDQ 183
Query: 191 GSGHFFNLS 199
G + L+
Sbjct: 184 GQDWYLQLA 192
>gi|308812987|ref|XP_003083800.1| DNA-binding protein p24 (ISS) [Ostreococcus tauri]
gi|116055682|emb|CAL57767.1| DNA-binding protein p24 (ISS) [Ostreococcus tauri]
Length = 240
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 49/124 (39%), Positives = 63/124 (50%), Gaps = 4/124 (3%)
Query: 82 PTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRK 141
P +V +YK + A+ ++ P +L AV + R G V+L+ A A R YDW K
Sbjct: 78 PPTTFVDFGVYKTRGAMKMKAVRPTLGALGENAV-VRRPGGVLLELANATAPRTYDWQNK 136
Query: 142 QVFSLSVTEIGSLV-ALGARESCEFFHDPFKGK-SEEGKVRKVLKVEPLPDGSGHFF-NL 198
F LS TE L + A SC FFHD K G + K KVEP+PDGSG F NL
Sbjct: 137 GSFMLSGTEAAELADRMAANGSCSFFHDSAKANGGGGGTLGKAFKVEPMPDGSGGMFVNL 196
Query: 199 SNAL 202
+ L
Sbjct: 197 TVTL 200
>gi|2947067|gb|AAC05348.1| hypothetical protein [Arabidopsis thaliana]
Length = 192
Score = 69.7 bits (169), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 7/76 (9%)
Query: 46 KLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGP 105
KL VK RQS+Y+E++ F S S+ + + R YVGHSIYKGKAALT+EPR P
Sbjct: 53 KLTVKSRQSDYFEKQRFGDSSSSQNAEVSS-------PRFYVGHSIYKGKAALTIEPRAP 105
Query: 106 EFVSLDSGAVKLSREG 121
EFV+L+ +K++ G
Sbjct: 106 EFVALEIVMMKINWFG 121
>gi|302829905|ref|XP_002946519.1| hypothetical protein VOLCADRAFT_86539 [Volvox carteri f. nagariensis]
gi|300268265|gb|EFJ52446.1| hypothetical protein VOLCADRAFT_86539 [Volvox carteri f. nagariensis]
Length = 3754
Score = 64.7 bits (156), Expect = 3e-08, Method: Composition-based stats.
Identities = 47/129 (36%), Positives = 72/129 (55%), Gaps = 20/129 (15%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFA-----------PAAG 132
RVYV IYK +AA+ V P F + ++G L R+G ++L+FA PAAG
Sbjct: 3589 RVYVNFHIYKTRAAMAVRLLPPSFTT-ENGYKTLERDGVMLLEFANANPGQPSGTAPAAG 3647
Query: 133 --VRQYDWSRKQVFSLSVTEIGSLV---ALGARESCEFFHDPFK-GKSEEGKVRKVLKVE 186
R Y+WS K F+LS +E+G+++ A+ + + +HDP K GK G+ K L ++
Sbjct: 3648 GINRTYNWSNKISFALSPSELGTMLAGDAIASDKGLVMYHDPTKLGKV--GEPMKRLTMK 3705
Query: 187 PLPDGSGHF 195
+PDG+ F
Sbjct: 3706 QMPDGAISF 3714
>gi|124484361|dbj|BAF46291.1| transcription regulator Pbf-2 [Chlamydomonas reinhardtii]
Length = 238
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 78/156 (50%), Gaps = 21/156 (13%)
Query: 57 YEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVK 116
Y S + P N + AP D A + RVY + +YK +AA+ + P F +G V
Sbjct: 47 YSNGSAAPVPPNFA-APNDRAATSSSDRVYTNYYVYKTRAAMCLRLLPPTFAKAQAGKV- 104
Query: 117 LSREGFVMLQFAPAAGV-------------RQYDWSRKQVFSLSVTEIGSLV---ALGAR 160
L R+G ++L+FA A R Y+W K F+LS E+G+++ A+ +
Sbjct: 105 LERDGTMLLEFATANAAAPGAGNGPAGNVNRTYNWGNKVTFALSPVELGNILAGDAVASD 164
Query: 161 ESCEFFHDPFK-GKSEEGKVRKVLKVEPLPDGSGHF 195
+ +HDP K GK+ G+ K L ++ LPDG+ F
Sbjct: 165 KGLVLWHDPAKLGKT--GEPIKKLSLKQLPDGNISF 198
>gi|159487549|ref|XP_001701785.1| transcription factor [Chlamydomonas reinhardtii]
gi|158281004|gb|EDP06760.1| transcription factor [Chlamydomonas reinhardtii]
Length = 238
Score = 62.8 bits (151), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 78/156 (50%), Gaps = 21/156 (13%)
Query: 57 YEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVK 116
Y S + P N + AP D A + RVY + +YK +AA+ + P F +G V
Sbjct: 47 YSNGSAAPVPPNFA-APNDRAATSSSDRVYTNYYVYKTRAAMCLRLLPPTFAKAQAGKV- 104
Query: 117 LSREGFVMLQFAPAAGV-------------RQYDWSRKQVFSLSVTEIGSLV---ALGAR 160
L R+G ++L+FA A R Y+W K F+LS E+G+++ A+ +
Sbjct: 105 LERDGTMLLEFATANAAAPGAGSGPAGNVNRTYNWGNKVTFALSPVELGNILAGDAVASD 164
Query: 161 ESCEFFHDPFK-GKSEEGKVRKVLKVEPLPDGSGHF 195
+ +HDP K GK+ G+ K L ++ LPDG+ F
Sbjct: 165 KGLVLWHDPAKLGKT--GEPIKKLSLKQLPDGNISF 198
>gi|297836056|ref|XP_002885910.1| hypothetical protein ARALYDRAFT_899639 [Arabidopsis lyrata subsp.
lyrata]
gi|297331750|gb|EFH62169.1| hypothetical protein ARALYDRAFT_899639 [Arabidopsis lyrata subsp.
lyrata]
Length = 74
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/44 (59%), Positives = 35/44 (79%)
Query: 111 DSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSL 154
DS A KLS++GF++LQFAP+AGVRQY+W +KQV+ +T G L
Sbjct: 3 DSEAFKLSKDGFLLLQFAPSAGVRQYNWGKKQVWFYLLTSYGPL 46
>gi|348676880|gb|EGZ16697.1| hypothetical protein PHYSODRAFT_500116 [Phytophthora sojae]
Length = 224
Score = 53.9 bits (128), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 4/104 (3%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ S+Y +A V P P++ S S +K+ R G +ML +A A Y++ K
Sbjct: 29 RVFPNFSVYGSDSAFQVAPSAPQYTSAGS-YMKMKRVGAIMLSWAKATN-SGYNYQNKTF 86
Query: 144 FSLSVTEIGSLVALGARESCE--FFHDPFKGKSEEGKVRKVLKV 185
FSLS TE+G ++ L E F H P SEE K K L +
Sbjct: 87 FSLSPTEVGLVLELLDSRIPELSFTHSPNMNASEEDKNSKTLHI 130
>gi|253761700|ref|XP_002489225.1| hypothetical protein SORBIDRAFT_0012s017770 [Sorghum bicolor]
gi|241947085|gb|EES20230.1| hypothetical protein SORBIDRAFT_0012s017770 [Sorghum bicolor]
Length = 91
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 21/31 (67%), Positives = 27/31 (87%)
Query: 112 SGAVKLSREGFVMLQFAPAAGVRQYDWSRKQ 142
SGA K+++EG+V+LQFAPA RQYDW+RKQ
Sbjct: 5 SGAYKVAKEGYVLLQFAPAVATRQYDWTRKQ 35
>gi|449445768|ref|XP_004140644.1| PREDICTED: single-stranded DNA-bindig protein WHY1,
chloroplastic-like [Cucumis sativus]
Length = 108
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/25 (88%), Positives = 25/25 (100%)
Query: 175 EEGKVRKVLKVEPLPDGSGHFFNLS 199
+EGKVRK+LKVEPLPDGSGHFFNL+
Sbjct: 19 DEGKVRKILKVEPLPDGSGHFFNLT 43
>gi|397620242|gb|EJK65621.1| hypothetical protein THAOC_13502 [Thalassiosira oceanica]
Length = 317
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/133 (28%), Positives = 63/133 (47%), Gaps = 18/133 (13%)
Query: 81 LPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPA-AGVRQYDWS 139
+P R Y ++++ +AL+++ P F V + R G +ML+F P A + W+
Sbjct: 4 MPRRGYPQYTMFSSDSALSMKAAMPVFKKAGMDGVAVERRGKMMLEFVPRNASGSGFAWN 63
Query: 140 RKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGK--------------VRKVLKV 185
K +FSL+V E+G L++ + E H F S++G V KVL V
Sbjct: 64 DKTIFSLTVEEVGLLLSQLPGNAVELSHPTF--SSDDGAFGQESQVTQVSGDIVEKVLTV 121
Query: 186 EPLPDGSGHFFNL 198
+P DG+ F +
Sbjct: 122 DP-GDGATMTFKV 133
>gi|301101636|ref|XP_002899906.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262102481|gb|EEY60533.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 226
Score = 47.0 bits (110), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 49/104 (47%), Gaps = 4/104 (3%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ SIY +A V P P++ + +K R G +ML +A A Y++ K
Sbjct: 29 RVFPNFSIYGSDSAFQVSPSPPQYTN-GGNYLKTKRVGAIMLSWAKATN-SGYNYQNKTF 86
Query: 144 FSLSVTEIGSLVALGARESCE--FFHDPFKGKSEEGKVRKVLKV 185
FSLS +E+G ++ L E H P SEE K K L +
Sbjct: 87 FSLSPSEVGLVLELLDSRIPELSLTHSPNMNASEEDKNTKSLHI 130
>gi|223994305|ref|XP_002286836.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220978151|gb|EED96477.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 288
Score = 47.0 bits (110), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 48/95 (50%), Gaps = 7/95 (7%)
Query: 77 AVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAP----AAG 132
++G +P R + ++++ +AL+V P F + + + R G ++L+F P AG
Sbjct: 73 SMGMMPRRGFPQYTVFGPDSALSVRAVLPNFKRAGTDGISVDRRGKIVLEFVPRNPSGAG 132
Query: 133 VRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFH 167
+ W+ K FS+SV E+G V+ + E H
Sbjct: 133 ---FQWADKTTFSMSVEEVGLFVSQLPQSGIELSH 164
>gi|194334549|ref|YP_002016409.1| nucleotidyl transferase [Prosthecochloris aestuarii DSM 271]
gi|194312367|gb|ACF46762.1| Nucleotidyl transferase [Prosthecochloris aestuarii DSM 271]
Length = 237
Score = 41.6 bits (96), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 42/89 (47%), Gaps = 3/89 (3%)
Query: 127 FAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVE 186
AP AG +W K +F + + GA + E H P+ ++E + V++ E
Sbjct: 27 LAPVAGRPFLEWQMKSLFRRGIHHFVLALGYGADKIIEVLHQPW---AKEMSIDYVIEKE 83
Query: 187 PLPDGSGHFFNLSNALFLELLAVNLEDYL 215
PL G F +++ L E+L VN + +L
Sbjct: 84 PLGTGGAIRFAMTDKLIDEVLVVNGDTFL 112
>gi|124262628|ref|YP_001023098.1| hypothetical protein Mpe_B0084 [Methylibium petroleiphilum PM1]
gi|124261874|gb|ABM96863.1| hypothetical protein Mpe_B0084 [Methylibium petroleiphilum PM1]
Length = 421
Score = 40.4 bits (93), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 48/108 (44%), Gaps = 24/108 (22%)
Query: 52 RQSEYYEQK--SFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVS 109
RQ + +EQ+ + A P+ +S + Y+ IY GKAAL S
Sbjct: 239 RQDQRHEQRQQARQAEPAEASA-----------DQEYLSQHIYGGKAALCF--------S 279
Query: 110 LDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVAL 157
D K+ V L+ A A+ RQYDW RK LS E+ ++A+
Sbjct: 280 ADQTRAKVH---TVRLEAAEASATRQYDWQRKIAIQLSQRELPLVLAV 324
>gi|406976196|gb|EKD98717.1| hypothetical protein ACD_23C00300G0005 [uncultured bacterium]
Length = 290
Score = 39.7 bits (91), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 33/63 (52%), Gaps = 5/63 (7%)
Query: 89 HSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSV 148
H IY KAALTVE E ++ +G L V+L+ A A G R YDWSRK F
Sbjct: 130 HHIYGLKAALTVEM--DELRTMGNGGGLLQ---TVILEAATATGHRSYDWSRKIAFQFMR 184
Query: 149 TEI 151
E+
Sbjct: 185 REL 187
>gi|325185050|emb|CCA19542.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 211
Score = 38.9 bits (89), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 51/105 (48%), Gaps = 4/105 (3%)
Query: 83 TRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQ 142
+++Y ++Y+ A V P PE+V S +K R G ++L +A YD+++K
Sbjct: 21 SKIYPSFTVYESDAVFQVSPIQPEYVQ-QSNYLKTKRVGSLLLSWAKQRD-GSYDYTKKL 78
Query: 143 VFSLSVTEIGSLVALGARESCEFF--HDPFKGKSEEGKVRKVLKV 185
F+L+ +EIG ++ + + E H P + + LKV
Sbjct: 79 YFALTPSEIGLVLEVLDSKVGEINIKHTPNRNDPAVQTTERTLKV 123
>gi|290243159|ref|YP_003494829.1| hypothetical protein TK90_2878 [Thioalkalivibrio sp. K90mix]
gi|288945664|gb|ADC73362.1| hypothetical protein TK90_2878 [Thioalkalivibrio sp. K90mix]
Length = 259
Score = 38.5 bits (88), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 24/45 (53%), Gaps = 2/45 (4%)
Query: 123 VMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVAL--GARESCEF 165
V + A A G R YDW K + L+ TE+ + A+ G R CEF
Sbjct: 138 VQIDAAKAKGERAYDWGAKTIIQLTPTELAVVTAVFTGVRAKCEF 182
>gi|170077972|ref|YP_001734610.1| hypothetical protein SYNPCC7002_A1358 [Synechococcus sp. PCC 7002]
gi|169885641|gb|ACA99354.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
Length = 168
Score = 37.7 bits (86), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 60/123 (48%), Gaps = 9/123 (7%)
Query: 42 IKKKKLYVKCRQSEYYEQKSFSASPSNSSYA-PTDVAVGTLPTRVYV-GHSIYKGKAALT 99
++KK+L + RQ YY +SF ++ DV T+ + + G SI++G++ +
Sbjct: 4 LRKKQLQQRLRQDPYYRLRSFEEVALAAAIGIKIDVNQATVDDWLRLPGISIHQGRSLVQ 63
Query: 100 VEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV----FSLSVTEIGSLV 155
+ +G +F SLD A L G + P A + + + ++ +L+ ++G+LV
Sbjct: 64 LTSQGVQFYSLDDVAAAL---GVPRQRLQPLAAILSFAYYAPELAPVQINLNRADLGALV 120
Query: 156 ALG 158
G
Sbjct: 121 DFG 123
>gi|260881286|ref|ZP_05404060.2| putative 4Fe-4S binding domain protein [Mitsuokella multacida DSM
20544]
gi|260849026|gb|EEX69033.1| putative 4Fe-4S binding domain protein [Mitsuokella multacida DSM
20544]
Length = 206
Score = 37.4 bits (85), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 40/85 (47%), Gaps = 13/85 (15%)
Query: 116 KLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSE 175
+L ++G+V L A + D + + + EI F H+P+ GK
Sbjct: 58 QLMQQGYVSLSGTKAKRAVRVDGKVRNIGHKKLAEI-------------FAHNPYMGKIY 104
Query: 176 EGKVRKVLKVEPLPDGSGHFFNLSN 200
G +VL+V L +G+G FF++SN
Sbjct: 105 PGDTCEVLEVFQLYEGTGEFFDISN 129
>gi|148237294|ref|NP_001088381.1| perforin 1 (pore forming protein) precursor [Xenopus laevis]
gi|80476913|gb|AAI08843.1| LOC495232 protein [Xenopus laevis]
Length = 532
Score = 36.6 bits (83), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 48/109 (44%), Gaps = 21/109 (19%)
Query: 13 LNPKLCPFHSLSNSKGNGFGSISVTESTSIKKKKLYVKCRQS--EYYEQK-SFSASPSNS 69
LN L P H+L NS+ S+ + S IKK+ L ++C S Y Q+ S S S S
Sbjct: 331 LNYSLMPIHTLINSRSPKRESLRIALSEYIKKRSLKIECPCSGNTYPSQREDCSCSCSPS 390
Query: 70 SYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPR------GPEFVSLDS 112
Y +D PTR KG A L +E R G F S DS
Sbjct: 391 KYTNSDCC----PTR--------KGLAKLIIEIRSATGLNGDYFSSTDS 427
>gi|54038326|gb|AAH84651.1| LOC495232 protein, partial [Xenopus laevis]
Length = 513
Score = 36.2 bits (82), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 48/109 (44%), Gaps = 21/109 (19%)
Query: 13 LNPKLCPFHSLSNSKGNGFGSISVTESTSIKKKKLYVKCRQS--EYYEQK-SFSASPSNS 69
LN L P H+L NS+ S+ + S IKK+ L ++C S Y Q+ S S S S
Sbjct: 312 LNYSLMPIHTLINSRSPKRESLRIALSEYIKKRSLKIECPCSGNTYPSQREDCSCSCSPS 371
Query: 70 SYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPR------GPEFVSLDS 112
Y +D PTR KG A L +E R G F S DS
Sbjct: 372 KYTNSDCC----PTR--------KGLAKLIIEIRSATGLNGDYFSSTDS 408
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.134 0.386
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,458,285,222
Number of Sequences: 23463169
Number of extensions: 138773043
Number of successful extensions: 299073
Number of sequences better than 100.0: 93
Number of HSP's better than 100.0 without gapping: 73
Number of HSP's successfully gapped in prelim test: 20
Number of HSP's that attempted gapping in prelim test: 298959
Number of HSP's gapped (non-prelim): 95
length of query: 225
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 88
effective length of database: 9,144,741,214
effective search space: 804737226832
effective search space used: 804737226832
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 74 (33.1 bits)