BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 027008
(229 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|302399111|gb|ADL36850.1| WHY domain class transcription factor [Malus x domestica]
Length = 276
Score = 300 bits (768), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 160/237 (67%), Positives = 182/237 (76%), Gaps = 10/237 (4%)
Query: 2 MLQLQCLSS----QTLNPKLCPFHS---LSNSKGNGFGSISVTESTSIKKKKLYVKCRQS 54
ML+L LSS Q N F S LS ++ + S + +K+L +KCRQS
Sbjct: 1 MLRLHLLSSPATAQKPNQNPSQFLSSQLLSRARVFSTNTFGFAPSPILSRKRLSLKCRQS 60
Query: 55 EYYEQK--SFSASPSNSSYAPTDVAVGT-LPTRVYVGHSIYKGKAALTVEPRGPEFVSLD 111
EY++Q+ S +ASP+ S AP A T + R YVGHSIYKGKAALTVEP+ PEF LD
Sbjct: 61 EYFDQQRTSTAASPNKPSPAPPTPAGATGMAPRFYVGHSIYKGKAALTVEPKAPEFTPLD 120
Query: 112 SGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFK 171
SGA KLSREGFV+LQFAPAAGVR YDWSRKQVFSLSVTEIGSLV+LG++ES EFFHDPFK
Sbjct: 121 SGAFKLSREGFVLLQFAPAAGVRVYDWSRKQVFSLSVTEIGSLVSLGSKESLEFFHDPFK 180
Query: 172 GKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVSAFN 228
GKS+EGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIP+TRAE+ VL SAFN
Sbjct: 181 GKSDEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPITRAEFAVLKSAFN 237
>gi|225424922|ref|XP_002277278.1| PREDICTED: uncharacterized protein LOC100253653 [Vitis vinifera]
gi|296086421|emb|CBI32010.3| unnamed protein product [Vitis vinifera]
Length = 268
Score = 297 bits (760), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 155/236 (65%), Positives = 186/236 (78%), Gaps = 15/236 (6%)
Query: 2 MLQLQCLSSQ--TLNPKLCPFHSLSNSKGNGFGSIS-----VTESTSIKKKKLYVKCRQS 54
M L LSS NP+LCP HSLS+ + S + + +T +KK ++CRQS
Sbjct: 1 MHHLHLLSSSFTIQNPRLCPNHSLSSLHSSSPLSFTSRTPLLLSTTRHFRKKRSLQCRQS 60
Query: 55 EYYEQKSFSAS--PSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDS 112
+Y++Q++ + P++SS+ G L RV+VGHSIYKGKAALTVEP+ PEF LDS
Sbjct: 61 DYFQQQNITRRQPPNDSSFG------GALQPRVFVGHSIYKGKAALTVEPKAPEFTPLDS 114
Query: 113 GAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKG 172
GA K+S+EGFV+LQFAPAAGVRQYDW RKQVFSLSVTEIGSL++LGARESCEFFHDPFKG
Sbjct: 115 GAFKVSKEGFVLLQFAPAAGVRQYDWGRKQVFSLSVTEIGSLISLGARESCEFFHDPFKG 174
Query: 173 KSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVSAFN 228
+SEEGKVRKVLKVEPLPDGSGHFFNLSVQNKL+N+DE+IYIPVTRAE+ VL+SAFN
Sbjct: 175 RSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLLNMDENIYIPVTRAEFAVLISAFN 230
>gi|449520335|ref|XP_004167189.1| PREDICTED: single-stranded DNA-binding protein WHY1,
chloroplastic-like [Cucumis sativus]
Length = 276
Score = 291 bits (746), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 153/237 (64%), Positives = 178/237 (75%), Gaps = 13/237 (5%)
Query: 2 MLQLQCLSSQ---TLNPKLCPFHSLSNSKGNGFGSISVTESTSIKKKKLYVKCRQS---E 55
ML+LQ LSS T NP+L P +LSN + + S T +K KC+ S +
Sbjct: 1 MLRLQWLSSSFLPTFNPELSP-ETLSNPRLCTL-NFSRPLPTLTTTRKPTPKCQYSWNTQ 58
Query: 56 YYEQKSFSASPSNSSYAPTD-----VAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSL 110
++Q +F +P S +P A LP R +VGHSIYKGKAALTVEPR PEF L
Sbjct: 59 QHQQSAFEPAPHTDSLSPQSRAGAAAAAAALPPRFFVGHSIYKGKAALTVEPRPPEFTPL 118
Query: 111 DSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPF 170
DSGA K+SREG VMLQFAPAAGVRQYDWSRKQVFSLSVTE+GSL+ALG RE+CEFFHDP+
Sbjct: 119 DSGAFKISREGLVMLQFAPAAGVRQYDWSRKQVFSLSVTELGSLIALGPREACEFFHDPY 178
Query: 171 KGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVSAF 227
KGKS+EGKVRK+LKVEPLPDGSGHFFNL+VQNKLIN+DESIYIP+T+AEYTVLV AF
Sbjct: 179 KGKSDEGKVRKILKVEPLPDGSGHFFNLTVQNKLINVDESIYIPITKAEYTVLVEAF 235
>gi|15223748|ref|NP_172893.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
gi|75191428|sp|Q9M9S3.1|WHY1_ARATH RecName: Full=Single-stranded DNA-binding protein WHY1,
chloroplastic; AltName: Full=Protein PLASTID
TRANSCRIPTIONALLY ACTIVE 1; AltName: Full=Protein WHIRLY
1; Short=AtWHY1; Flags: Precursor
gi|7262683|gb|AAF43941.1|AC012188_18 Contains similarity to a hypothetical protein from Arabidopsis
thaliana gb|AC002521.2. EST gb|AI995686 comes from this
gene [Arabidopsis thaliana]
gi|12083312|gb|AAG48815.1|AF332452_1 putative DNA-binding protein p24 [Arabidopsis thaliana]
gi|13877787|gb|AAK43971.1|AF370156_1 putative DNA-binding protein p24 [Arabidopsis thaliana]
gi|16323418|gb|AAL15203.1| putative DNA-binding protein p24 [Arabidopsis thaliana]
gi|332191039|gb|AEE29160.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
Length = 263
Score = 290 bits (742), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 140/189 (74%), Positives = 164/189 (86%), Gaps = 7/189 (3%)
Query: 41 SIKKKKLY-VKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALT 99
+ K KL+ VK RQ++Y+E++ F S S+ S A LP R YVGHSIYKGKAALT
Sbjct: 42 TTKTVKLFSVKSRQTDYFEKQRFGDSSSSPSPAEG------LPARFYVGHSIYKGKAALT 95
Query: 100 VEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGA 159
V+PR PEFV+LDSGA KLS++GF++LQFAP+AGVRQYDWS+KQVFSLSVTEIG+LV+LG
Sbjct: 96 VDPRAPEFVALDSGAFKLSKDGFLLLQFAPSAGVRQYDWSKKQVFSLSVTEIGTLVSLGP 155
Query: 160 RESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAE 219
RESCEFFHDPFKGKS+EGKVRKVLKVEPLPDGSGHFFNLSVQNKL+N+DESIYIP+TRAE
Sbjct: 156 RESCEFFHDPFKGKSDEGKVRKVLKVEPLPDGSGHFFNLSVQNKLVNVDESIYIPITRAE 215
Query: 220 YTVLVSAFN 228
+ VL+SAFN
Sbjct: 216 FAVLISAFN 224
>gi|255558202|ref|XP_002520128.1| conserved hypothetical protein [Ricinus communis]
gi|223540620|gb|EEF42183.1| conserved hypothetical protein [Ricinus communis]
Length = 271
Score = 289 bits (740), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 147/229 (64%), Positives = 179/229 (78%), Gaps = 13/229 (5%)
Query: 10 SQTLNPKLCPFHSLSNSKGNGFGSISVTESTSIKKK-----KLYVKCRQSEYYEQKSFSA 64
+ T NPKL ++ S+ + S + + S ++ VKCRQSE+Y+Q+
Sbjct: 7 TSTQNPKLWTPNTFSSLQSFRTTSTTFIHTISAPRRLSTNTNRAVKCRQSEFYDQQQQKY 66
Query: 65 SPS-----NSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSR 119
+PS +SS+A A +P RVYVGHSIYKGKAALTVEPR PEF +LDSGA K++R
Sbjct: 67 NPSRPSSNDSSFASQSPAA--VP-RVYVGHSIYKGKAALTVEPRAPEFAALDSGAFKVAR 123
Query: 120 EGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKV 179
EGFV+LQFAPAAGVRQYDWSRKQVFSLSVTEIG++++LGAR+SCEFFHDP KGKS+EGK+
Sbjct: 124 EGFVLLQFAPAAGVRQYDWSRKQVFSLSVTEIGTIISLGARDSCEFFHDPNKGKSDEGKI 183
Query: 180 RKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVSAFN 228
RKVLKVEPLPDGSGHFFNLSVQNK +N+DESIYIPVT+AE+ VL+SAFN
Sbjct: 184 RKVLKVEPLPDGSGHFFNLSVQNKPMNMDESIYIPVTKAEFAVLISAFN 232
>gi|297849846|ref|XP_002892804.1| ATWHY1/PTAC1 [Arabidopsis lyrata subsp. lyrata]
gi|297338646|gb|EFH69063.1| ATWHY1/PTAC1 [Arabidopsis lyrata subsp. lyrata]
Length = 264
Score = 284 bits (726), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 140/189 (74%), Positives = 166/189 (87%), Gaps = 6/189 (3%)
Query: 41 SIKKKKLY-VKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALT 99
+ K KL+ VK RQ++Y+E++ F +SS + + A G LP R YVGHSIYKGKAALT
Sbjct: 42 TTKTVKLFSVKSRQTDYFEKQRFG----DSSSSSSQNAEG-LPARFYVGHSIYKGKAALT 96
Query: 100 VEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGA 159
+EPR PEFV+LDSGA KLS++GF++LQFAP+AGVRQYDWS+KQVFSLSVTEIG+LV+LG
Sbjct: 97 MEPRAPEFVALDSGAFKLSKDGFLLLQFAPSAGVRQYDWSKKQVFSLSVTEIGTLVSLGP 156
Query: 160 RESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAE 219
RESCEFFHDPFKGKS+EGKVRKVLKVEPLPDGSGHFFNLSVQNKL+N+DESIYIP+TRAE
Sbjct: 157 RESCEFFHDPFKGKSDEGKVRKVLKVEPLPDGSGHFFNLSVQNKLVNVDESIYIPITRAE 216
Query: 220 YTVLVSAFN 228
+ VL+SAFN
Sbjct: 217 FAVLISAFN 225
>gi|145328252|ref|NP_001077872.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
gi|330250525|gb|AEC05619.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
Length = 267
Score = 280 bits (716), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 133/183 (72%), Positives = 157/183 (85%), Gaps = 7/183 (3%)
Query: 46 KLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGP 105
KL VK RQS+Y+E++ F S S+ + + R YVGHSIYKGKAALT+EPR P
Sbjct: 53 KLTVKSRQSDYFEKQRFGDSSSSQNAEVSS-------PRFYVGHSIYKGKAALTIEPRAP 105
Query: 106 EFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEF 165
EFV+L+SGA KL++EGF++LQFAPAAGVRQYDWSRKQVFSLSVTEIG+LV+LG RESCEF
Sbjct: 106 EFVALESGAFKLTKEGFLLLQFAPAAGVRQYDWSRKQVFSLSVTEIGNLVSLGPRESCEF 165
Query: 166 FHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVS 225
FHDPFKGK +EGKVRKVLKVEPLPDGSG FFNLSVQNKL+N+DES+YIP+T+AE+ VL+S
Sbjct: 166 FHDPFKGKGDEGKVRKVLKVEPLPDGSGRFFNLSVQNKLLNVDESVYIPITKAEFAVLIS 225
Query: 226 AFN 228
AFN
Sbjct: 226 AFN 228
>gi|224099743|ref|XP_002311601.1| predicted protein [Populus trichocarpa]
gi|118485247|gb|ABK94483.1| unknown [Populus trichocarpa]
gi|222851421|gb|EEE88968.1| predicted protein [Populus trichocarpa]
Length = 265
Score = 280 bits (716), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 153/234 (65%), Positives = 180/234 (76%), Gaps = 15/234 (6%)
Query: 2 MLQLQCLS---SQTLNPKLC--PFHSLSNSKGNGFGSISVTESTSI--KKKKLYVKCRQS 54
MLQL +S S NPKL ++SL ++K SIS+ TS KKK L VKC
Sbjct: 1 MLQLNSVSRVSSSPQNPKLWLPQYNSLCSTK-----SISLNSKTSSTEKKKTLGVKC--- 52
Query: 55 EYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGA 114
+YY+Q+ + + S+ S + VG P +V+VGHSIYKGKAALTVEPR PEF LDSGA
Sbjct: 53 QYYDQQHKTFTTSSRSSPSSAPPVGESPPKVFVGHSIYKGKAALTVEPRSPEFSPLDSGA 112
Query: 115 VKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKS 174
KL +EGFV+LQFAPAA VRQYDW+RKQVFSLSVTEIG LV+L A+ SCEFFHDP KGKS
Sbjct: 113 YKLVKEGFVLLQFAPAASVRQYDWTRKQVFSLSVTEIGHLVSLDAKGSCEFFHDPNKGKS 172
Query: 175 EEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVSAFN 228
+EGKVRK+LKVEPLPDGSGHFFNLSVQNK++N+DE+IYIPVT+AEYTVL SAFN
Sbjct: 173 DEGKVRKLLKVEPLPDGSGHFFNLSVQNKVLNIDENIYIPVTKAEYTVLTSAFN 226
>gi|388509172|gb|AFK42652.1| unknown [Lotus japonicus]
Length = 261
Score = 279 bits (713), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 131/183 (71%), Positives = 155/183 (84%), Gaps = 7/183 (3%)
Query: 49 VKCRQSEYYEQKSFSAS---PSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGP 105
++CR S+ ++QK+FS+S P+N P V+VG LP RVYVGHSIYKGKAALTV PR P
Sbjct: 44 IRCRHSDLFDQKTFSSSTPQPAN----PAAVSVGALPPRVYVGHSIYKGKAALTVTPRPP 99
Query: 106 EFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEF 165
EF LDSGA K+SREG+V+LQFAPA RQYDW+RKQVFSLSV E+GS+++LG RESCEF
Sbjct: 100 EFAPLDSGAFKISREGYVLLQFAPAIASRQYDWNRKQVFSLSVVEMGSVISLGTRESCEF 159
Query: 166 FHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVS 225
FHDP KGKS+EGKVRKVLK+EPLPDGSGHFFNLSVQNK++N+DE+IYIPVT+AE VL S
Sbjct: 160 FHDPLKGKSDEGKVRKVLKLEPLPDGSGHFFNLSVQNKIVNIDENIYIPVTKAELAVLSS 219
Query: 226 AFN 228
FN
Sbjct: 220 IFN 222
>gi|297814520|ref|XP_002875143.1| ATWHY3/PTAC11 [Arabidopsis lyrata subsp. lyrata]
gi|297320981|gb|EFH51402.1| ATWHY3/PTAC11 [Arabidopsis lyrata subsp. lyrata]
Length = 268
Score = 279 bits (713), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 136/188 (72%), Positives = 162/188 (86%), Gaps = 8/188 (4%)
Query: 42 IKKKKLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVE 101
+ + KL VK RQ +Y+E++ F S S+ + A G+ P R YVGHSIYKGKAALT+E
Sbjct: 49 MARLKLSVKSRQDDYFEKQRFGDSSSSQN------AEGSSP-RFYVGHSIYKGKAALTIE 101
Query: 102 PRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARE 161
PR PEFV+L+SGA KL++EGF++LQFAPAAGVRQYDWSRKQVFSLSVTEIG+LV+LG RE
Sbjct: 102 PRAPEFVALESGAFKLTKEGFLLLQFAPAAGVRQYDWSRKQVFSLSVTEIGNLVSLGPRE 161
Query: 162 SCEFFHDPFKGK-SEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEY 220
SCEFFHDPFKGK S+EGKVRKVLKVEPLPDGSG FFNLSVQNKL+N+DES+YIP+T+AE+
Sbjct: 162 SCEFFHDPFKGKGSDEGKVRKVLKVEPLPDGSGRFFNLSVQNKLLNVDESVYIPITKAEF 221
Query: 221 TVLVSAFN 228
VL+SAFN
Sbjct: 222 AVLISAFN 229
>gi|42568881|ref|NP_178377.2| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
gi|75115367|sp|Q66GR6.1|WHY3_ARATH RecName: Full=Single-stranded DNA-binding protein WHY3,
chloroplastic; AltName: Full=Protein PLASTID
TRANSCRIPTIONALLY ACTIVE 11; AltName: Full=Protein
WHIRLY 3; Short=AtWHY3; Flags: Precursor
gi|51536442|gb|AAU05459.1| At2g02740 [Arabidopsis thaliana]
gi|51972072|gb|AAU15140.1| At2g02740 [Arabidopsis thaliana]
gi|330250524|gb|AEC05618.1| ssDNA-binding transcriptional regulator [Arabidopsis thaliana]
Length = 268
Score = 277 bits (708), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 134/184 (72%), Positives = 158/184 (85%), Gaps = 8/184 (4%)
Query: 46 KLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGP 105
KL VK RQS+Y+E++ F S S+ + + R YVGHSIYKGKAALT+EPR P
Sbjct: 53 KLTVKSRQSDYFEKQRFGDSSSSQNAEVSS-------PRFYVGHSIYKGKAALTIEPRAP 105
Query: 106 EFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEF 165
EFV+L+SGA KL++EGF++LQFAPAAGVRQYDWSRKQVFSLSVTEIG+LV+LG RESCEF
Sbjct: 106 EFVALESGAFKLTKEGFLLLQFAPAAGVRQYDWSRKQVFSLSVTEIGNLVSLGPRESCEF 165
Query: 166 FHDPFKGK-SEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLV 224
FHDPFKGK S+EGKVRKVLKVEPLPDGSG FFNLSVQNKL+N+DES+YIP+T+AE+ VL+
Sbjct: 166 FHDPFKGKGSDEGKVRKVLKVEPLPDGSGRFFNLSVQNKLLNVDESVYIPITKAEFAVLI 225
Query: 225 SAFN 228
SAFN
Sbjct: 226 SAFN 229
>gi|75174555|sp|Q9LL85.1|WHY1_SOLTU RecName: Full=Single-stranded DNA-bindig protein WHY1,
chloroplastic; AltName: Full=DNA-binding protein p24;
AltName: Full=PR-10a binding factor 2; Short=PBF-2;
AltName: Full=Protein WHIRLY 1; Short=StWhy1; Flags:
Precursor
gi|9651810|gb|AAF91282.1|AF233342_1 DNA-binding protein p24 [Solanum tuberosum]
Length = 274
Score = 274 bits (700), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 139/223 (62%), Positives = 162/223 (72%), Gaps = 15/223 (6%)
Query: 14 NPKLCPFHSLSNSKGNGFGSISVTESTSIK--------KKKLYVKCRQSEYYEQKSFSAS 65
NP + S S+S F +S + S + L + CR S+Y+E +
Sbjct: 20 NPTKTSYLSFSSSINTIFAPLSSNTTKSFSGLTHKAALPRNLSLTCRHSDYFEPQQ---- 75
Query: 66 PSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVML 125
G +V+VG+SIYKGKAALTVEPR PEF LDSGA KLSREG VML
Sbjct: 76 ---QQQQQQQQPQGASTPKVFVGYSIYKGKAALTVEPRSPEFSPLDSGAFKLSREGMVML 132
Query: 126 QFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKV 185
QFAPAAGVRQYDWSRKQVFSLSVTEIGS+++LGA++SCEFFHDP KG+S+EG+VRKVLKV
Sbjct: 133 QFAPAAGVRQYDWSRKQVFSLSVTEIGSIISLGAKDSCEFFHDPNKGRSDEGRVRKVLKV 192
Query: 186 EPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVSAFN 228
EPLPDGSGHFFNLSVQNKLINLDE+IYIPVT+AE+ VLVSAFN
Sbjct: 193 EPLPDGSGHFFNLSVQNKLINLDENIYIPVTKAEFAVLVSAFN 235
>gi|110740230|dbj|BAF02013.1| hypothetical protein [Arabidopsis thaliana]
Length = 268
Score = 274 bits (700), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 132/184 (71%), Positives = 158/184 (85%), Gaps = 8/184 (4%)
Query: 46 KLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGP 105
KL VK RQS+Y+E++ F S S+ + + R YVGHSIYKGKAALT+EPR P
Sbjct: 53 KLTVKSRQSDYFEKQRFGDSSSSQNAEVSS-------PRFYVGHSIYKGKAALTIEPRAP 105
Query: 106 EFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEF 165
EFV+L+SGA KL++EGF++LQFAPAAGVRQYDWS+K+VFSLSVTEIG+LV+LG RESCEF
Sbjct: 106 EFVALESGAFKLTKEGFLLLQFAPAAGVRQYDWSKKRVFSLSVTEIGNLVSLGPRESCEF 165
Query: 166 FHDPFKGK-SEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLV 224
FHDPFKGK S+EGKVRKVLKVEPLPDGSG FFNLSVQNKL+N+DES+YIP+T+AE+ VL+
Sbjct: 166 FHDPFKGKGSDEGKVRKVLKVEPLPDGSGRFFNLSVQNKLLNVDESVYIPITKAEFAVLI 225
Query: 225 SAFN 228
SAFN
Sbjct: 226 SAFN 229
>gi|356567550|ref|XP_003551981.1| PREDICTED: uncharacterized protein LOC100804480 [Glycine max]
Length = 235
Score = 270 bits (689), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 124/185 (67%), Positives = 154/185 (83%), Gaps = 3/185 (1%)
Query: 45 KKLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRG 104
K ++CR S+ ++Q + +++P + +VG LP RVYVG+SIYKGKAALT+ PR
Sbjct: 48 KPFSLRCRHSDLFDQNTLASTPRPTR---PSASVGALPPRVYVGYSIYKGKAALTLTPRP 104
Query: 105 PEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCE 164
PEF+ LDSGA K+S+EG+V+LQFAPA G RQYDW+RKQVFSLSV E+GS+++LGAR+S E
Sbjct: 105 PEFMPLDSGAYKISKEGYVLLQFAPAVGTRQYDWNRKQVFSLSVGEMGSVISLGARDSYE 164
Query: 165 FFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLV 224
FFHDPFKGKS+EGKVRK+LKVEPLPDGSGHFFNLSVQNKL+N+DESIYIPVT+AE VL
Sbjct: 165 FFHDPFKGKSDEGKVRKILKVEPLPDGSGHFFNLSVQNKLVNVDESIYIPVTKAELAVLT 224
Query: 225 SAFNV 229
S F +
Sbjct: 225 STFKI 229
>gi|157878742|pdb|1L3A|A Chain A, Structure Of The Plant Transcriptional Regulator Pbf-2
gi|157878743|pdb|1L3A|B Chain B, Structure Of The Plant Transcriptional Regulator Pbf-2
gi|157878744|pdb|1L3A|C Chain C, Structure Of The Plant Transcriptional Regulator Pbf-2
gi|157878745|pdb|1L3A|D Chain D, Structure Of The Plant Transcriptional Regulator Pbf-2
Length = 227
Score = 264 bits (675), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 124/150 (82%), Positives = 138/150 (92%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G +V+VG+SIYKGKAALTVEPR PEF LDSGA KLSREG VMLQFAPAAGVRQYDW
Sbjct: 32 GASTPKVFVGYSIYKGKAALTVEPRSPEFSPLDSGAFKLSREGMVMLQFAPAAGVRQYDW 91
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
SRKQVFSLSVTEIGS+++LG ++SCEFFHDP KG+S+EG+VRKVLKVEPLPDGSGHFFNL
Sbjct: 92 SRKQVFSLSVTEIGSIISLGTKDSCEFFHDPNKGRSDEGRVRKVLKVEPLPDGSGHFFNL 151
Query: 199 SVQNKLINLDESIYIPVTRAEYTVLVSAFN 228
SVQNKLINLDE+IYIPVT+AE+ VLVSAFN
Sbjct: 152 SVQNKLINLDENIYIPVTKAEFAVLVSAFN 181
>gi|295913588|gb|ADG58040.1| transcription factor [Lycoris longituba]
Length = 246
Score = 262 bits (669), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 124/186 (66%), Positives = 155/186 (83%), Gaps = 3/186 (1%)
Query: 43 KKKKLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEP 102
KK+ L + CR S Y++Q+ S+S S + + + RV+VG+SIYKGKAALTVEP
Sbjct: 48 KKQNLPISCRSSNYFDQQRLSSSSSTPPSPSSQPSSQS---RVFVGYSIYKGKAALTVEP 104
Query: 103 RGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARES 162
R PEF LDSGA K+++EGF++LQFAPA G+RQYDWSRKQVFSLSV EIG+L++LGA+ES
Sbjct: 105 RAPEFAPLDSGAFKVAKEGFILLQFAPAVGMRQYDWSRKQVFSLSVVEIGTLMSLGAKES 164
Query: 163 CEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTV 222
CEFFHDPFKG+SEEGKVRK+LK EPLPDG+GHFFNLSVQN+L+N+DESIYIP+++AE+ V
Sbjct: 165 CEFFHDPFKGRSEEGKVRKLLKAEPLPDGTGHFFNLSVQNRLLNVDESIYIPISKAEFAV 224
Query: 223 LVSAFN 228
L S FN
Sbjct: 225 LNSTFN 230
>gi|224111254|ref|XP_002315793.1| predicted protein [Populus trichocarpa]
gi|222864833|gb|EEF01964.1| predicted protein [Populus trichocarpa]
Length = 191
Score = 261 bits (667), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 122/150 (81%), Positives = 136/150 (90%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G P +V+VGHSIYKGKAALT+EPR PEF L+SGA KL +EGFV+ QFAPA+ RQYDW
Sbjct: 1 GESPPKVFVGHSIYKGKAALTIEPRAPEFSPLESGAYKLVKEGFVLSQFAPASSARQYDW 60
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
+RKQVFSLSVTEIG LV+LGAR+SCEFFHDP KG+SEEGKVRKVLKVEPLPDGSGHFFNL
Sbjct: 61 TRKQVFSLSVTEIGHLVSLGARDSCEFFHDPNKGRSEEGKVRKVLKVEPLPDGSGHFFNL 120
Query: 199 SVQNKLINLDESIYIPVTRAEYTVLVSAFN 228
SVQNK +N+DESIYIPVTRAEYTVL+SAFN
Sbjct: 121 SVQNKALNIDESIYIPVTRAEYTVLISAFN 150
>gi|242091954|ref|XP_002436467.1| hypothetical protein SORBIDRAFT_10g003170 [Sorghum bicolor]
gi|241914690|gb|EER87834.1| hypothetical protein SORBIDRAFT_10g003170 [Sorghum bicolor]
Length = 266
Score = 236 bits (603), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 106/145 (73%), Positives = 127/145 (87%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ +SIYKGKAAL+ +PR P+FV LDSGA K+++EGFV+LQFAPA RQYDW+RKQV
Sbjct: 85 RVFTSYSIYKGKAALSFDPRPPQFVPLDSGAYKVAKEGFVLLQFAPAVATRQYDWTRKQV 144
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
FSLSV EIG+L+ LG +SCEFFHDPFKG+SEEGKVRKVLKVEP PDG+G FFNLSVQN+
Sbjct: 145 FSLSVWEIGTLLTLGPTDSCEFFHDPFKGRSEEGKVRKVLKVEPTPDGNGRFFNLSVQNR 204
Query: 204 LINLDESIYIPVTRAEYTVLVSAFN 228
LIN+DESIYIP+T+ E+ V+VS FN
Sbjct: 205 LINVDESIYIPITKGEFAVIVSTFN 229
>gi|194306593|ref|NP_001123589.1| LOC100170235 [Zea mays]
gi|426021717|sp|B2LXS7.1|WHY1_MAIZE RecName: Full=Single-stranded DNA-bindig protein WHY1,
chloroplastic; AltName: Full=Protein WHIRLY 1;
Short=ZmWHY1; Flags: Precursor
gi|183229934|gb|ACC60344.1| Whirly family nucleic acid binding protein [Zea mays]
gi|194708562|gb|ACF88365.1| unknown [Zea mays]
gi|195612298|gb|ACG27979.1| DNA-binding protein p24 [Zea mays]
gi|408690350|gb|AFU81635.1| WHIRLY-type transcription factor, partial [Zea mays subsp. mays]
gi|413942843|gb|AFW75492.1| whirly1 [Zea mays]
Length = 266
Score = 236 bits (602), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 110/177 (62%), Positives = 137/177 (77%), Gaps = 2/177 (1%)
Query: 52 RQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLD 111
R S+Y++ ++ + Y G RV+ +SIYKGKAAL+ +PR P FV LD
Sbjct: 55 RHSDYFDPRAPPPPRGDGGYG--RPPNGAQDGRVFTSYSIYKGKAALSFDPRPPLFVPLD 112
Query: 112 SGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFK 171
SGA K+++EGFV+LQFAPA RQYDW+RKQVFSLSV EIG+L+ LG +SCEFFHDPFK
Sbjct: 113 SGAYKVAKEGFVLLQFAPAVATRQYDWTRKQVFSLSVWEIGTLLTLGPTDSCEFFHDPFK 172
Query: 172 GKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVSAFN 228
G+SEEGKVRKVLK+EP PDG+G FFNLSVQN+LIN+DESIYIP+T+ E+ V+VS FN
Sbjct: 173 GRSEEGKVRKVLKIEPTPDGNGRFFNLSVQNRLINVDESIYIPITKGEFAVIVSTFN 229
>gi|119638471|gb|ABL85062.1| expressed protein [Brachypodium sylvaticum]
Length = 266
Score = 236 bits (602), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 111/184 (60%), Positives = 142/184 (77%), Gaps = 4/184 (2%)
Query: 49 VKCRQSEYYEQKSFSASPSNS----SYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRG 104
V R S+Y++ ++ + P + P A G RV+ +SIYKGKAAL+ +PR
Sbjct: 46 VPARHSDYFDPRARTPPPRDEYGEPPPPPLAPAQGGQSGRVFASYSIYKGKAALSFDPRP 105
Query: 105 PEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCE 164
P+FV LDSGA K+++EGFV+LQFAPA RQYDW+RKQVFSLSV E+G+L+ LG +SCE
Sbjct: 106 PQFVPLDSGAYKVAKEGFVLLQFAPAVAARQYDWTRKQVFSLSVWEMGTLLTLGPTDSCE 165
Query: 165 FFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLV 224
FFHDPFKG+S+EGKVRKVLKVEP PDG+G FFNLSVQN+L+N+DES+YIP+T+ EY V+V
Sbjct: 166 FFHDPFKGRSDEGKVRKVLKVEPTPDGNGRFFNLSVQNRLLNIDESVYIPITKGEYAVIV 225
Query: 225 SAFN 228
S FN
Sbjct: 226 STFN 229
>gi|217071924|gb|ACJ84322.1| unknown [Medicago truncatula]
Length = 239
Score = 236 bits (601), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 114/189 (60%), Positives = 146/189 (77%), Gaps = 1/189 (0%)
Query: 39 STSIKKKKLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAAL 98
S K K ++CR S+ + + P ++ + VG LP RVYVGHSIYKGKAAL
Sbjct: 36 SLPFKFKPFTIRCRHSDVFNPSPSNPPPPATTPPNNPL-VGALPPRVYVGHSIYKGKAAL 94
Query: 99 TVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALG 158
T+ P P+FV+LDSGA K+SR+G ++LQFAP+ G RQYDW+RKQ+F LSV E+GS+++LG
Sbjct: 95 TITPTPPKFVTLDSGAYKISRDGCLLLQFAPSVGPRQYDWNRKQLFMLSVDEMGSVISLG 154
Query: 159 ARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRA 218
ARESCEFFHDPFKG S+EGKVRKVLK+EP PDGSG FFNLSVQ+K++N+D S+ IPV++A
Sbjct: 155 ARESCEFFHDPFKGGSDEGKVRKVLKIEPFPDGSGFFFNLSVQDKIVNVDVSMNIPVSKA 214
Query: 219 EYTVLVSAF 227
E +VL S F
Sbjct: 215 ELSVLRSIF 223
>gi|357110788|ref|XP_003557198.1| PREDICTED: uncharacterized protein LOC100824321 [Brachypodium
distachyon]
Length = 274
Score = 236 bits (601), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 104/145 (71%), Positives = 127/145 (87%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RVY +SIYKGKAAL+ +PR P+FV LDSGA K+++EGFV+LQFAPA RQYDW+RKQV
Sbjct: 93 RVYASYSIYKGKAALSFDPRPPQFVPLDSGAYKVAKEGFVLLQFAPAVAARQYDWTRKQV 152
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
FSLSV E+G+L+ LG +SCEFFHDPFKG+S+EGKVRKVLKVEP PDG+G FFNLSVQN+
Sbjct: 153 FSLSVWEMGTLLTLGPTDSCEFFHDPFKGRSDEGKVRKVLKVEPTPDGNGRFFNLSVQNR 212
Query: 204 LINLDESIYIPVTRAEYTVLVSAFN 228
L+N+DES+YIP+T+ EY V+VS FN
Sbjct: 213 LLNIDESVYIPITKGEYAVIVSTFN 237
>gi|326493106|dbj|BAJ85014.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519312|dbj|BAJ96655.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 267
Score = 233 bits (593), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 103/145 (71%), Positives = 127/145 (87%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ +SIYKGKAAL +PR P+FV L+SGA K+++EGFV+LQFAPA G RQYDW+RKQV
Sbjct: 86 RVFASYSIYKGKAALAFDPRPPQFVPLESGAYKVAKEGFVLLQFAPAVGPRQYDWARKQV 145
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
FSLSV E+G+L+ LG +SCEFFHDPFKG+S+EGKVRKVLKVEP PDG+G FFNLSVQN+
Sbjct: 146 FSLSVWEMGTLLTLGLTDSCEFFHDPFKGRSDEGKVRKVLKVEPTPDGNGRFFNLSVQNR 205
Query: 204 LINLDESIYIPVTRAEYTVLVSAFN 228
L+N+DE+IYIP+T+ EY V+VS FN
Sbjct: 206 LLNVDENIYIPITKGEYAVIVSTFN 230
>gi|55296373|dbj|BAD68418.1| putative DNA-binding protein p24 [Oryza sativa Japonica Group]
gi|55297130|dbj|BAD68773.1| putative DNA-binding protein p24 [Oryza sativa Japonica Group]
gi|222634946|gb|EEE65078.1| hypothetical protein OsJ_20114 [Oryza sativa Japonica Group]
Length = 272
Score = 231 bits (588), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 104/159 (65%), Positives = 131/159 (82%)
Query: 70 SYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAP 129
+Y+P G RV+ +SIYKGKAA++++PR P+FV LDSGA K+ +EGFV+LQFAP
Sbjct: 77 AYSPPAAQGGQQNGRVFSTYSIYKGKAAMSLDPRPPQFVPLDSGAYKVVKEGFVLLQFAP 136
Query: 130 AAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLP 189
A RQYDW+RKQVFSLSV E+GSL+ LG +SCEFFHDPFKG+S+EGKVRKVLKVEP P
Sbjct: 137 AVATRQYDWTRKQVFSLSVWEMGSLLTLGPTDSCEFFHDPFKGRSDEGKVRKVLKVEPTP 196
Query: 190 DGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVSAFN 228
DG+ FFNLSVQN+L+N+DE+IYIP+T+ E+ V+VS FN
Sbjct: 197 DGNSRFFNLSVQNRLLNIDENIYIPITKGEFAVIVSTFN 235
>gi|218197563|gb|EEC79990.1| hypothetical protein OsI_21637 [Oryza sativa Indica Group]
Length = 274
Score = 231 bits (588), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 104/159 (65%), Positives = 131/159 (82%)
Query: 70 SYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAP 129
+Y+P G RV+ +SIYKGKAA++++PR P+FV LDSGA K+ +EGFV+LQFAP
Sbjct: 79 AYSPPAAQGGQQNGRVFSTYSIYKGKAAMSLDPRPPQFVPLDSGAYKVVKEGFVLLQFAP 138
Query: 130 AAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLP 189
A RQYDW+RKQVFSLSV E+GSL+ LG +SCEFFHDPFKG+S+EGKVRKVLKVEP P
Sbjct: 139 AVATRQYDWTRKQVFSLSVWEMGSLLTLGPTDSCEFFHDPFKGRSDEGKVRKVLKVEPTP 198
Query: 190 DGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVSAFN 228
DG+ FFNLSVQN+L+N+DE+IYIP+T+ E+ V+VS FN
Sbjct: 199 DGNSRFFNLSVQNRLLNIDENIYIPITKGEFAVIVSTFN 237
>gi|357486629|ref|XP_003613602.1| DNA-binding protein p24 [Medicago truncatula]
gi|355514937|gb|AES96560.1| DNA-binding protein p24 [Medicago truncatula]
Length = 261
Score = 220 bits (560), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 122/232 (52%), Positives = 151/232 (65%), Gaps = 16/232 (6%)
Query: 2 MLQLQCLSS-QTLNPKLCPFHSLSNSKGNGFGSISVTESTSIKKKKL----YVKCRQSE- 55
MLQLQ S T NP P HS I T SI +++ + C E
Sbjct: 3 MLQLQPPQSYTTTNPFSVPTHSF----------IINTPKKSIFLRRVGPTFSLTCHHPEL 52
Query: 56 YYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAV 115
++ + SS + +VG LP RV+V S+YKGKA L V P P+F S DSG
Sbjct: 53 FHPKPFPPPQRPQSSSSSFSSSVGELPARVHVSRSVYKGKAVLVVSPVLPKFTSSDSGTF 112
Query: 116 KLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSE 175
K+S+EG ++LQF P+AG RQYDW+RKQVFSLSV E+G+L+ LGARESCE FHDPF G+S+
Sbjct: 113 KISKEGLMLLQFVPSAGFRQYDWNRKQVFSLSVDEMGNLINLGARESCEIFHDPFMGRSD 172
Query: 176 EGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVSAF 227
EGKVRKVLKVEPL DGSGH F LSVQN+L N+DE+I+IPVT+AE+ V S F
Sbjct: 173 EGKVRKVLKVEPLHDGSGHMFKLSVQNQLKNIDENIFIPVTKAEFAVFNSLF 224
>gi|413942842|gb|AFW75491.1| whirly1 [Zea mays]
Length = 205
Score = 189 bits (479), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 89/147 (60%), Positives = 110/147 (74%), Gaps = 2/147 (1%)
Query: 52 RQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLD 111
R S+Y++ ++ + Y G RV+ +SIYKGKAAL+ +PR P FV LD
Sbjct: 55 RHSDYFDPRAPPPPRGDGGYG--RPPNGAQDGRVFTSYSIYKGKAALSFDPRPPLFVPLD 112
Query: 112 SGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFK 171
SGA K+++EGFV+LQFAPA RQYDW+RKQVFSLSV EIG+L+ LG +SCEFFHDPFK
Sbjct: 113 SGAYKVAKEGFVLLQFAPAVATRQYDWTRKQVFSLSVWEIGTLLTLGPTDSCEFFHDPFK 172
Query: 172 GKSEEGKVRKVLKVEPLPDGSGHFFNL 198
G+SEEGKVRKVLK+EP PDG+G FFNL
Sbjct: 173 GRSEEGKVRKVLKIEPTPDGNGRFFNL 199
>gi|116779826|gb|ABK21442.1| unknown [Picea sitchensis]
Length = 257
Score = 180 bits (457), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 115/146 (78%)
Query: 83 TRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQ 142
++YV H++YKG+ ALT++P+ P++++L+ G V +++EG + L+FAPA G RQYDWS+K+
Sbjct: 71 NKIYVKHTVYKGEGALTMKPKLPDYITLNMGGVTVAKEGCMFLEFAPAVGPRQYDWSKKK 130
Query: 143 VFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQN 202
+ +LSV E+G+L++LG ESCEF HDPF GKSE GK+ KVLKV L D G+FFNLSV +
Sbjct: 131 IIALSVVEVGTLLSLGPDESCEFTHDPFMGKSEAGKIMKVLKVGNLQDTGGYFFNLSVTD 190
Query: 203 KLINLDESIYIPVTRAEYTVLVSAFN 228
++ ++DES IP+T+ E++V+ S FN
Sbjct: 191 RIADVDESFSIPITKGEFSVMQSIFN 216
>gi|116783258|gb|ABK22859.1| unknown [Picea sitchensis]
Length = 259
Score = 180 bits (457), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 115/146 (78%)
Query: 83 TRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQ 142
++YV H++YKG+ ALT++P+ P++++L+ G V +++EG + L+FAPA G RQYDWS+K+
Sbjct: 71 NKIYVKHTVYKGEGALTMKPKLPDYITLNMGGVTVAKEGCMFLEFAPAVGPRQYDWSKKK 130
Query: 143 VFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQN 202
+ +LSV E+G+L++LG ESCEF HDPF GKSE GK+ KVLKV L D G+FFNLSV +
Sbjct: 131 IIALSVVEVGTLLSLGPDESCEFTHDPFMGKSEAGKIMKVLKVGNLQDTGGYFFNLSVTD 190
Query: 203 KLINLDESIYIPVTRAEYTVLVSAFN 228
++ ++DES IP+T+ E++V+ S FN
Sbjct: 191 RIADVDESFSIPITKGEFSVMQSIFN 216
>gi|356497854|ref|XP_003517771.1| PREDICTED: uncharacterized protein LOC100797370 [Glycine max]
Length = 263
Score = 173 bits (439), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 90/157 (57%), Positives = 116/157 (73%), Gaps = 6/157 (3%)
Query: 78 VGTLPT-RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQ- 135
V LP RVYVG+S+Y K LTV PR PEF S SGA K+S+EG+V+LQFAP+ G +
Sbjct: 68 VAELPQQRVYVGYSVYTRKGVLTVTPRPPEFESKSSGAFKVSKEGYVVLQFAPSVGADEP 127
Query: 136 -YDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGH 194
YDW++KQ+FSLSV+E+G+L+ LGAR+S EF H+ K KS E +VRKVLKVEPL D +GH
Sbjct: 128 IYDWNQKQIFSLSVSEMGTLITLGARDSWEFSHETVKLKSNETEVRKVLKVEPLLDATGH 187
Query: 195 FFNLSVQNKLINLD---ESIYIPVTRAEYTVLVSAFN 228
F+LSVQ K +N++ ++I +PVTRAE VL FN
Sbjct: 188 LFSLSVQKKPVNMEGIQKNISLPVTRAELAVLRVLFN 224
>gi|426021772|sp|D9J034.1|WHY2_SOLTU RecName: Full=Single-stranded DNA-bindig protein WHY2,
mitochondrial; AltName: Full=Protein WHIRLY 2;
Short=StWHY2; Flags: Precursor
gi|298359665|gb|ADI77438.1| Why2 protein [Solanum tuberosum]
Length = 238
Score = 164 bits (415), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 74/144 (51%), Positives = 104/144 (72%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ +S++KGKAAL+ EPR P F LDSG VKL+R G +ML F P+ G R+YDW ++Q+
Sbjct: 56 RVFAPYSVFKGKAALSAEPRLPTFNRLDSGGVKLNRRGVIMLTFWPSVGERKYDWEKRQL 115
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
F+LS TE+GSL+++G R+S EFFHDP S G+VRK L ++P DGSG+F +LSV N
Sbjct: 116 FALSATEVGSLISMGTRDSSEFFHDPSMLSSNAGQVRKSLSIKPNADGSGYFISLSVVNN 175
Query: 204 LINLDESIYIPVTRAEYTVLVSAF 227
+ ++ +PVT AE+ V+ +AF
Sbjct: 176 NLKTNDRFTVPVTTAEFAVMRTAF 199
>gi|302566179|pdb|3N1H|A Chain A, Crystal Structure Of Stwhy2
gi|302566180|pdb|3N1I|A Chain A, Crystal Structure Of A Stwhy2-Ere32 Complex
gi|302566182|pdb|3N1J|A Chain A, Crystal Structure Of A Stwhy2-Dt32 Complex
gi|302566184|pdb|3N1K|A Chain A, Crystal Structure Of A Stwhy2-Cere32 Complex
gi|302566186|pdb|3N1L|A Chain A, Crystal Structure Of A Stwhy2-Rcere32 Complex
Length = 178
Score = 164 bits (415), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 74/144 (51%), Positives = 104/144 (72%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ +S++KGKAAL+ EPR P F LDSG VKL+R G +ML F P+ G R+YDW ++Q+
Sbjct: 10 RVFAPYSVFKGKAALSAEPRLPTFNRLDSGGVKLNRRGVIMLTFWPSVGERKYDWEKRQL 69
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
F+LS TE+GSL+++G R+S EFFHDP S G+VRK L ++P DGSG+F +LSV N
Sbjct: 70 FALSATEVGSLISMGTRDSSEFFHDPSMLSSNAGQVRKSLSIKPNADGSGYFISLSVVNN 129
Query: 204 LINLDESIYIPVTRAEYTVLVSAF 227
+ ++ +PVT AE+ V+ +AF
Sbjct: 130 NLKTNDRFTVPVTTAEFAVMRTAF 153
>gi|356500463|ref|XP_003519051.1| PREDICTED: uncharacterized protein LOC100775220 [Glycine max]
Length = 263
Score = 164 bits (414), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 85/150 (56%), Positives = 109/150 (72%), Gaps = 5/150 (3%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQ--YDWSRK 141
RVYVG+S+Y K LTV PR PEF S SGA K+S+EG+V+LQFAP+ G + YDW+ K
Sbjct: 75 RVYVGYSVYTKKGMLTVIPRPPEFESKSSGAFKVSKEGYVVLQFAPSVGADEPIYDWNHK 134
Query: 142 QVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQ 201
Q FSLSV+E+G+L+ LGAR+S EF H+ K KS + VRKVLKVEPL D +GH F+L V
Sbjct: 135 QTFSLSVSEMGTLIILGARDSWEFSHETVKLKSSKIDVRKVLKVEPLLDATGHLFSLRVL 194
Query: 202 NKLINLD---ESIYIPVTRAEYTVLVSAFN 228
K N++ +SI++PVTRA+ VL S FN
Sbjct: 195 KKPANMEGIQKSIFLPVTRADLEVLRSLFN 224
>gi|347948612|pdb|3R9Y|A Chain A, Crystal Structure Of Stwhy2 K67a (Form I)
gi|347948613|pdb|3R9Z|A Chain A, Crystal Structure Of Stwhy2 K67a (Form Ii)
gi|347948614|pdb|3RA0|A Chain A, Crystal Structure Of A Stwhy2 K67a-Dt32 Complex
Length = 178
Score = 162 bits (409), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 73/144 (50%), Positives = 103/144 (71%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ +S++KG AAL+ EPR P F LDSG VKL+R G +ML F P+ G R+YDW ++Q+
Sbjct: 10 RVFAPYSVFKGAAALSAEPRLPTFNRLDSGGVKLNRRGVIMLTFWPSVGERKYDWEKRQL 69
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
F+LS TE+GSL+++G R+S EFFHDP S G+VRK L ++P DGSG+F +LSV N
Sbjct: 70 FALSATEVGSLISMGTRDSSEFFHDPSMLSSNAGQVRKSLSIKPNADGSGYFISLSVVNN 129
Query: 204 LINLDESIYIPVTRAEYTVLVSAF 227
+ ++ +PVT AE+ V+ +AF
Sbjct: 130 NLKTNDRFTVPVTTAEFAVMRTAF 153
>gi|22330568|ref|NP_177282.2| protein WHIRLY 2 [Arabidopsis thaliana]
gi|75161474|sp|Q8VYF7.1|WHY2_ARATH RecName: Full=Single-stranded DNA-binding protein WHY2,
mitochondrial; AltName: Full=Protein WHIRLY 2;
Short=AtWHY2; Flags: Precursor
gi|18175814|gb|AAL59932.1| unknown protein [Arabidopsis thaliana]
gi|21689867|gb|AAM67494.1| unknown protein [Arabidopsis thaliana]
gi|225898076|dbj|BAH30370.1| hypothetical protein [Arabidopsis thaliana]
gi|332197060|gb|AEE35181.1| protein WHIRLY 2 [Arabidopsis thaliana]
Length = 238
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 71/145 (48%), Positives = 106/145 (73%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
R++ +SI+KGKAAL+VEP P F +DSG +++ R G +M+ F PA G R+YDW +KQ
Sbjct: 53 RLFAPYSIFKGKAALSVEPVLPSFTEIDSGNLRIDRRGSLMMTFMPAIGERKYDWEKKQK 112
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
F+LS TE+GSL+++G+++S EFFHDP S G+VRK L V+P DGSG+F +LSV N
Sbjct: 113 FALSPTEVGSLISMGSKDSSEFFHDPSMKSSNAGQVRKSLSVKPHADGSGYFISLSVNNS 172
Query: 204 LINLDESIYIPVTRAEYTVLVSAFN 228
++ ++ +PVT+AE+ V+ +AF+
Sbjct: 173 ILKTNDYFVVPVTKAEFAVMKTAFS 197
>gi|12323827|gb|AAG51881.1|AC016162_2 unknown protein; 79476-81015 [Arabidopsis thaliana]
Length = 237
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 71/145 (48%), Positives = 106/145 (73%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
R++ +SI+KGKAAL+VEP P F +DSG +++ R G +M+ F PA G R+YDW +KQ
Sbjct: 52 RLFAPYSIFKGKAALSVEPVLPSFTEIDSGNLRIDRRGSLMMTFMPAIGERKYDWEKKQK 111
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
F+LS TE+GSL+++G+++S EFFHDP S G+VRK L V+P DGSG+F +LSV N
Sbjct: 112 FALSPTEVGSLISMGSKDSSEFFHDPSMKSSNAGQVRKSLSVKPHADGSGYFISLSVNNS 171
Query: 204 LINLDESIYIPVTRAEYTVLVSAFN 228
++ ++ +PVT+AE+ V+ +AF+
Sbjct: 172 ILKTNDYFVVPVTKAEFAVMKTAFS 196
>gi|297841891|ref|XP_002888827.1| ATWHY2 [Arabidopsis lyrata subsp. lyrata]
gi|297334668|gb|EFH65086.1| ATWHY2 [Arabidopsis lyrata subsp. lyrata]
Length = 242
Score = 160 bits (404), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 106/145 (73%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
R++ +SI+KGKAAL+VEP P F +DSG +++ R G +M+ F PA G R+YDW +KQ
Sbjct: 57 RLFAPYSIFKGKAALSVEPVLPSFTEIDSGNLRIDRRGSLMMTFMPAIGERKYDWEKKQK 116
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
F+LS TE+GSL+++G+++S EFFHDP S G+VRK L ++P DGSG+F +LSV N
Sbjct: 117 FALSPTEVGSLISMGSKDSSEFFHDPSMKSSNAGQVRKSLSIKPHADGSGYFISLSVNNG 176
Query: 204 LINLDESIYIPVTRAEYTVLVSAFN 228
++ ++ +PVT+AE+ V+ +AF+
Sbjct: 177 ILKTNDYFVVPVTKAEFAVMKTAFS 201
>gi|449533266|ref|XP_004173597.1| PREDICTED: single-stranded DNA-binding protein WHY2,
mitochondrial-like, partial [Cucumis sativus]
Length = 156
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 72/143 (50%), Positives = 103/143 (72%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ + +YKGKAAL++EP P F ++SG + R G +ML FAPA G R+YDW+RKQ+
Sbjct: 12 RVFASYYVYKGKAALSMEPCMPTFTKVESGNFIMDRRGSIMLTFAPAVGERKYDWTRKQL 71
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
F+LS TEIGSL++LG R+SCEFFHDP S G+VRK L ++ DG+G+FF+L+V NK
Sbjct: 72 FALSATEIGSLISLGPRDSCEFFHDPGMLSSTAGQVRKSLAIKAHTDGNGYFFSLNVVNK 131
Query: 204 LINLDESIYIPVTRAEYTVLVSA 226
N ++ + +P T E++V+ +A
Sbjct: 132 PQNTNDYLSVPFTTGEFSVMKTA 154
>gi|388498336|gb|AFK37234.1| unknown [Lotus japonicus]
Length = 235
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 73/153 (47%), Positives = 105/153 (68%)
Query: 74 TDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGV 133
T VA G RV+ + +YKGKAA+++ P P F LDSGA+ + R G +M+ F PA G
Sbjct: 40 TYVAKGYTTDRVFAPYYVYKGKAAMSLSPVLPTFTKLDSGALVVERRGSIMMVFTPAIGE 99
Query: 134 RQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSG 193
R+YDW ++Q F+LS TE+GSL+A+G ++SCEFFHDP S G+VRK L ++P + SG
Sbjct: 100 RKYDWEKRQKFALSATEVGSLIAMGPQDSCEFFHDPSMSSSNAGQVRKSLSIKPHANSSG 159
Query: 194 HFFNLSVQNKLINLDESIYIPVTRAEYTVLVSA 226
+F +L+V N L+N E+ +PVT AE+ V+ +A
Sbjct: 160 YFVSLTVVNNLLNAKENFNVPVTTAEFAVMKTA 192
>gi|449447529|ref|XP_004141520.1| PREDICTED: single-stranded DNA-binding protein WHY2,
mitochondrial-like [Cucumis sativus]
Length = 241
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 72/143 (50%), Positives = 103/143 (72%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ + +YKGKAAL++EP P F ++SG + R G +ML FAPA G R+YDW+RKQ+
Sbjct: 58 RVFASYHVYKGKAALSMEPCMPTFTKVESGNFIMDRRGSIMLTFAPAVGERKYDWTRKQL 117
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
F+LS TEIGSL++LG R+SCEFFHDP S G+VRK L ++ DG+G+FF+L+V NK
Sbjct: 118 FALSATEIGSLISLGPRDSCEFFHDPGMLSSTAGQVRKSLAIKAHTDGNGYFFSLNVVNK 177
Query: 204 LINLDESIYIPVTRAEYTVLVSA 226
N ++ + +P T E++V+ +A
Sbjct: 178 PQNTNDYLSVPFTTGEFSVMKTA 200
>gi|302399107|gb|ADL36848.1| WHY domain class transcription factor [Malus x domestica]
Length = 237
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 69/143 (48%), Positives = 100/143 (69%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
+VY I+KGKAAL++ P P F L+SG++ + R G VML+F PA G R+YDW ++Q+
Sbjct: 52 QVYASFDIFKGKAALSLTPVLPTFTKLESGSLVVDRRGSVMLKFTPAIGERKYDWEKRQM 111
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
F+LS TE+G+L++LG+ +SCE FHDP S G+VRK L ++P DGSG+F +L+V N
Sbjct: 112 FALSATEVGALISLGSNDSCELFHDPSMKSSNAGQVRKSLSIKPHADGSGYFVSLTVVNN 171
Query: 204 LINLDESIYIPVTRAEYTVLVSA 226
L+ ES +PV AE+ V+ +A
Sbjct: 172 LLKTRESFSVPVMTAEFAVMKTA 194
>gi|448278892|gb|AGE44298.1| whirly transcription factor domain containing protein [Musa AB
Group]
Length = 245
Score = 154 bits (388), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 74/161 (45%), Positives = 106/161 (65%)
Query: 68 NSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQF 127
+SS P G+ R YV ++++KGKAAL+V P P F +DSG ++ ++G V+L F
Sbjct: 44 SSSVRPPFSPTGSSSVRRYVEYTVFKGKAALSVSPILPTFREVDSGVSRVHKKGCVILTF 103
Query: 128 APAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEP 187
PA G R+YDW +KQ F+LS TE+GSL+ LG ESCEFFHDP S EG+V+K L + P
Sbjct: 104 WPAIGQRKYDWQKKQAFALSPTEVGSLIGLGPAESCEFFHDPSMKSSLEGQVKKSLSISP 163
Query: 188 LPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVSAFN 228
L D +G+ NLSV N + +E +PV++AE+T + + F+
Sbjct: 164 LNDKAGYLLNLSVVNNIQKTNERFSLPVSKAEFTAIRTVFS 204
>gi|357512363|ref|XP_003626470.1| hypothetical protein MTR_7g116270 [Medicago truncatula]
gi|355501485|gb|AES82688.1| hypothetical protein MTR_7g116270 [Medicago truncatula]
gi|388497124|gb|AFK36628.1| unknown [Medicago truncatula]
Length = 226
Score = 154 bits (388), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 111/163 (68%), Gaps = 6/163 (3%)
Query: 64 ASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFV 123
++ +N++Y+ A G R++ +S+YKGKAA ++ P P F LDSGA+ + R G +
Sbjct: 30 STATNNNYS----AKGYTSDRIFAPYSVYKGKAAFSLSPCLPTFTKLDSGALVVDRHGSI 85
Query: 124 MLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVL 183
M+ F PA G R+YDW ++Q+F+LS TE+GSL+A+G ++SCEFFHDP S G+VRK L
Sbjct: 86 MMSFMPAIGERKYDWEKRQIFALSATEVGSLIAIGPQDSCEFFHDPSMKSSNAGQVRKSL 145
Query: 184 KVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVSA 226
++ P +G+F +LSV N ++N ++ +PVT AE+ V+ +A
Sbjct: 146 SIK--PHSNGYFVSLSVVNSVLNTKDNFSVPVTTAEFAVMKTA 186
>gi|115444353|ref|NP_001045956.1| Os02g0158400 [Oryza sativa Japonica Group]
gi|50251252|dbj|BAD28032.1| putative Chain C, Structure Of The Plant Transcriptional Regulator
Pbf-2 [Oryza sativa Japonica Group]
gi|50252182|dbj|BAD28177.1| putative Chain C, Structure Of The Plant Transcriptional Regulator
Pbf-2 [Oryza sativa Japonica Group]
gi|113535487|dbj|BAF07870.1| Os02g0158400 [Oryza sativa Japonica Group]
gi|215692593|dbj|BAG88013.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704516|dbj|BAG94149.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740785|dbj|BAG96941.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218190103|gb|EEC72530.1| hypothetical protein OsI_05924 [Oryza sativa Indica Group]
gi|222622212|gb|EEE56344.1| hypothetical protein OsJ_05450 [Oryza sativa Japonica Group]
Length = 228
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 69/145 (47%), Positives = 105/145 (72%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
R + ++++KGKAAL+++P P F L+SG ++++ G VML F PA G R+YD+S+KQ+
Sbjct: 45 RKFASYTVFKGKAALSMQPILPSFSKLESGGSRVNKNGSVMLTFFPAVGQRKYDYSKKQL 104
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
F+LS TE+GSL++LG ESCEFFHDP S EG+V+K L V PL + SG+F N++V N
Sbjct: 105 FALSPTEVGSLISLGPAESCEFFHDPSMKSSHEGQVKKSLSVTPLGNDSGYFLNITVLNN 164
Query: 204 LINLDESIYIPVTRAEYTVLVSAFN 228
L E + +P+++AE+TV+ +A +
Sbjct: 165 LQKTTERLSLPISKAEFTVMRTALS 189
>gi|357148896|ref|XP_003574931.1| PREDICTED: uncharacterized protein LOC100825843 [Brachypodium
distachyon]
Length = 231
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 104/145 (71%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
R + ++++KGKAAL++ P P F ++SG ++ + G VML F PA G RQYD+S+KQ+
Sbjct: 47 RKFASYTVFKGKAALSISPILPNFTKIESGGSRVKKNGSVMLTFFPAVGQRQYDYSKKQL 106
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
F+LS TE+GSL++LG+ ESCEFFHDP S EG+V+K L + PL + +G+F N++V N
Sbjct: 107 FALSPTEVGSLISLGSAESCEFFHDPSMKSSHEGQVKKSLSITPLGNDNGYFVNITVLNN 166
Query: 204 LINLDESIYIPVTRAEYTVLVSAFN 228
+ +E + +PVT+AE+ V+ +A +
Sbjct: 167 VQKTNERLSVPVTKAEFAVMRTALS 191
>gi|118484514|gb|ABK94132.1| unknown [Populus trichocarpa]
Length = 229
Score = 151 bits (381), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 71/143 (49%), Positives = 100/143 (69%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ +S++KGKAAL+VEP P F SG +++ R G +ML F PA G R+YD+ ++Q
Sbjct: 47 RVFAPYSVFKGKAALSVEPVLPTFSKFGSGNLRVDRRGSMMLTFLPAIGERKYDYEKRQK 106
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
F+LS TE+GSL++ G ++SCEFFHDP S G+VRK L ++P DGSG+F +LSV N
Sbjct: 107 FALSATEVGSLISTGPKDSCEFFHDPSMLSSNAGQVRKNLSIKPHADGSGYFVSLSVVNN 166
Query: 204 LINLDESIYIPVTRAEYTVLVSA 226
++ E +PVT AE+TVL +A
Sbjct: 167 ILKTTERFTVPVTTAEFTVLKTA 189
>gi|302764906|ref|XP_002965874.1| hypothetical protein SELMODRAFT_84708 [Selaginella moellendorffii]
gi|302802736|ref|XP_002983122.1| hypothetical protein SELMODRAFT_117665 [Selaginella moellendorffii]
gi|300149275|gb|EFJ15931.1| hypothetical protein SELMODRAFT_117665 [Selaginella moellendorffii]
gi|300166688|gb|EFJ33294.1| hypothetical protein SELMODRAFT_84708 [Selaginella moellendorffii]
Length = 226
Score = 151 bits (381), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/150 (50%), Positives = 99/150 (66%), Gaps = 1/150 (0%)
Query: 80 TLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWS 139
T P RV+ H YKGK AL + P F DSG LSREG VML+FAP+ RQYDW
Sbjct: 27 TRPRRVFADHVFYKGKCALNMRLIKPTFKISDSGDAILSREGTVMLEFAPSISQRQYDWG 86
Query: 140 RKQV-FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
+KQV F+LSV+E+G ++AL ES EFFHDP GKS+ G VRK LK+EP D +G FF L
Sbjct: 87 KKQVLFALSVSELGQILALTPSESLEFFHDPNMGKSDAGMVRKSLKIEPTTDRNGFFFGL 146
Query: 199 SVQNKLINLDESIYIPVTRAEYTVLVSAFN 228
+V NK+ + + IP+++ E+ ++ SA N
Sbjct: 147 TVANKVEKAEARLNIPISKGEFAIIRSAAN 176
>gi|224070977|ref|XP_002303313.1| predicted protein [Populus trichocarpa]
gi|222840745|gb|EEE78292.1| predicted protein [Populus trichocarpa]
Length = 183
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 71/143 (49%), Positives = 100/143 (69%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ +S++KGKAAL+VEP P F SG +++ R G +ML F PA G R+YD+ ++Q
Sbjct: 1 RVFAPYSVFKGKAALSVEPVLPTFSKFGSGNLRVDRRGSMMLTFLPAIGERKYDYEKRQK 60
Query: 144 FSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNK 203
F+LS TE+GSL++ G ++SCEFFHDP S G+VRK L ++P DGSG+F +LSV N
Sbjct: 61 FALSATEVGSLISTGPKDSCEFFHDPSMLSSNAGQVRKNLSIKPHADGSGYFVSLSVVNN 120
Query: 204 LINLDESIYIPVTRAEYTVLVSA 226
++ E +PVT AE+TVL +A
Sbjct: 121 ILKTTERFTVPVTTAEFTVLKTA 143
>gi|225459963|ref|XP_002267315.1| PREDICTED: uncharacterized protein LOC100258449 [Vitis vinifera]
gi|297734756|emb|CBI16990.3| unnamed protein product [Vitis vinifera]
Length = 231
Score = 150 bits (378), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 72/148 (48%), Positives = 99/148 (66%), Gaps = 1/148 (0%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G P RVY + +YKGKA+LTV P P+F LDSG +K+ R G +MLQF+PA G R+YDW
Sbjct: 52 GNYPDRVYAPYCVYKGKASLTVYPVLPKFSRLDSGGLKVDRHGVMMLQFSPAVGERKYDW 111
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
+KQ F+LS E+GSL++L CEFFHDP S G+VRK L V+ + DG +F +L
Sbjct: 112 EKKQFFALSAVEVGSLLSLSPGGGCEFFHDPSMKTSNAGQVRKSLSVKSM-DGGSYFLSL 170
Query: 199 SVQNKLINLDESIYIPVTRAEYTVLVSA 226
SV N + +E + +P+T AE+ V+ +A
Sbjct: 171 SVVNNIQKTNERLAVPLTAAEFAVMQTA 198
>gi|242064094|ref|XP_002453336.1| hypothetical protein SORBIDRAFT_04g004060 [Sorghum bicolor]
gi|241933167|gb|EES06312.1| hypothetical protein SORBIDRAFT_04g004060 [Sorghum bicolor]
Length = 230
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 102/143 (71%)
Query: 81 LPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSR 140
L ++ Y ++++KGKAAL+++P P F L+SG ++SR G +ML F PA G R+YD+++
Sbjct: 44 LSSKKYASYTVFKGKAALSIQPILPSFSKLESGGSRVSRNGSIMLTFFPAVGPRKYDFTK 103
Query: 141 KQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSV 200
KQ+F+LS TE+GSL++LG ESCEFFHDP S EG V+K L + PL SG+F N++V
Sbjct: 104 KQLFALSPTEVGSLISLGPAESCEFFHDPSMKSSNEGMVKKSLSITPLGSDSGYFVNITV 163
Query: 201 QNKLINLDESIYIPVTRAEYTVL 223
N + ++ + +P+T+AE+ V+
Sbjct: 164 VNSVEKTNDRLSVPITKAEFAVM 186
>gi|224286043|gb|ACN40733.1| unknown [Picea sitchensis]
Length = 184
Score = 146 bits (369), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 65/116 (56%), Positives = 90/116 (77%)
Query: 113 GAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKG 172
G V +++EG + L+FAPA G RQYDWS+K++ +LSV E+G+L++LG ESCEF HDPF G
Sbjct: 28 GGVTVAKEGCMFLEFAPAVGPRQYDWSKKKIIALSVVEVGTLLSLGPDESCEFTHDPFMG 87
Query: 173 KSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVSAFN 228
KSE GK+ KVLKV L D G+FFNLSV +++ ++DES IP+T+ E++V+ S FN
Sbjct: 88 KSEAGKIMKVLKVGNLQDTGGYFFNLSVTDRIADVDESFSIPITKGEFSVMQSIFN 143
>gi|195627490|gb|ACG35575.1| DNA binding protein [Zea mays]
Length = 230
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 66/145 (45%), Positives = 100/145 (68%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G L + + ++++KGKAAL++ P P F L+SG ++S+ G VML F PA G R+YD+
Sbjct: 42 GNLSGKKFASYTVFKGKAALSIHPILPSFSKLESGGSRVSKNGSVMLTFFPAVGQRKYDY 101
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
++KQ+F+LS TE+GSL++LG ESCEFFHDP S EG V+K L + PL SG+F N+
Sbjct: 102 TKKQLFALSPTEVGSLISLGPAESCEFFHDPSMKSSNEGTVKKSLSITPLGSDSGYFVNI 161
Query: 199 SVQNKLINLDESIYIPVTRAEYTVL 223
+V N ++ + +P+T+AE+ V+
Sbjct: 162 TVVNSAERTNDRLSVPITKAEFAVI 186
>gi|194703090|gb|ACF85629.1| unknown [Zea mays]
gi|323388661|gb|ADX60135.1| PBF-2 like transcription factor [Zea mays]
gi|323388771|gb|ADX60190.1| PBF-2 like transcription factor [Zea mays]
Length = 232
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 66/145 (45%), Positives = 100/145 (68%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G L + + ++++KGKAAL++ P P F L+SG ++S+ G VML F PA G R+YD+
Sbjct: 42 GNLSGKKFASYTVFKGKAALSIHPILPSFSKLESGGSRVSKNGSVMLTFFPAVGQRKYDY 101
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
++KQ+F+LS TE+GSL++LG ESCEFFHDP S EG V+K L + PL SG+F N+
Sbjct: 102 TKKQLFALSPTEVGSLISLGPAESCEFFHDPSMKSSNEGTVKKSLSITPLGSDSGYFVNI 161
Query: 199 SVQNKLINLDESIYIPVTRAEYTVL 223
+V N ++ + +P+T+AE+ V+
Sbjct: 162 TVVNSAERTNDRLSVPITKAEFAVI 186
>gi|413926543|gb|AFW66475.1| DNA binding protein isoform 1 [Zea mays]
gi|413926544|gb|AFW66476.1| DNA binding protein isoform 2 [Zea mays]
Length = 274
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 66/145 (45%), Positives = 100/145 (68%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G L + + ++++KGKAAL++ P P F L+SG ++S+ G VML F PA G R+YD+
Sbjct: 84 GNLSGKKFASYTVFKGKAALSIHPILPSFSKLESGGSRVSKNGSVMLTFFPAVGQRKYDY 143
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
++KQ+F+LS TE+GSL++LG ESCEFFHDP S EG V+K L + PL SG+F N+
Sbjct: 144 TKKQLFALSPTEVGSLISLGPAESCEFFHDPSMKSSNEGTVKKSLSITPLGSDSGYFVNI 203
Query: 199 SVQNKLINLDESIYIPVTRAEYTVL 223
+V N ++ + +P+T+AE+ V+
Sbjct: 204 TVVNSAERTNDRLSVPITKAEFAVI 228
>gi|223945821|gb|ACN26994.1| unknown [Zea mays]
gi|413942841|gb|AFW75490.1| whirly1 [Zea mays]
Length = 177
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 68/122 (55%), Positives = 87/122 (71%), Gaps = 2/122 (1%)
Query: 52 RQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLD 111
R S+Y++ ++ + Y G RV+ +SIYKGKAAL+ +PR P FV LD
Sbjct: 55 RHSDYFDPRAPPPPRGDGGYG--RPPNGAQDGRVFTSYSIYKGKAALSFDPRPPLFVPLD 112
Query: 112 SGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFK 171
SGA K+++EGFV+LQFAPA RQYDW+RKQVFSLSV EIG+L+ LG +SCEFFHDPFK
Sbjct: 113 SGAYKVAKEGFVLLQFAPAVATRQYDWTRKQVFSLSVWEIGTLLTLGPTDSCEFFHDPFK 172
Query: 172 GK 173
G+
Sbjct: 173 GR 174
>gi|297605166|ref|NP_001056790.2| Os06g0145800 [Oryza sativa Japonica Group]
gi|255676711|dbj|BAF18704.2| Os06g0145800 [Oryza sativa Japonica Group]
Length = 190
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 65/104 (62%), Positives = 82/104 (78%)
Query: 70 SYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAP 129
+Y+P G RV+ +SIYKGKAA++++PR P+FV LDSGA K+ +EGFV+LQFAP
Sbjct: 77 AYSPPAAQGGQQNGRVFSTYSIYKGKAAMSLDPRPPQFVPLDSGAYKVVKEGFVLLQFAP 136
Query: 130 AAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGK 173
A RQYDW+RKQVFSLSV E+GSL+ LG +SCEFFHDPFKG+
Sbjct: 137 AVATRQYDWTRKQVFSLSVWEMGSLLTLGPTDSCEFFHDPFKGR 180
>gi|356573153|ref|XP_003554728.1| PREDICTED: uncharacterized protein LOC100817863 [Glycine max]
Length = 235
Score = 142 bits (359), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 75/191 (39%), Positives = 120/191 (62%), Gaps = 10/191 (5%)
Query: 40 TSIKKKKLY--VKCRQSEYYEQKSFSA--SPSNSSYAPTDVAVGTLPTRVYVGHSIYKGK 95
TS + +L + R+ E ++ S SA S + ++YA A G R++ +++YKGK
Sbjct: 11 TSTSRHRLLEVLSSRKVEVGDRLSHSAGISTATNNYA----AKGYASDRIFAPYTVYKGK 66
Query: 96 AALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLV 155
AA ++ P P F L+SG V + R G +M+ F + G R+YDW ++Q F+LS TE+GSL+
Sbjct: 67 AAFSLIPCLPTFTKLNSGTVVVDRRGSIMMTFMHSIGERKYDWEKRQRFALSATEVGSLI 126
Query: 156 ALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPV 215
+GA++SC+FFHDP S G+VRK L ++ P +G+F +L+V N L+N ++ +PV
Sbjct: 127 TMGAQDSCDFFHDPSMLSSNAGQVRKSLSIK--PHANGYFVSLTVVNNLLNTNDYFSVPV 184
Query: 216 TRAEYTVLVSA 226
T AE+ V+ +A
Sbjct: 185 TTAEFAVMKTA 195
>gi|226506170|ref|NP_001152589.1| LOC100286229 [Zea mays]
gi|195657845|gb|ACG48390.1| DNA binding protein [Zea mays]
Length = 232
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 65/145 (44%), Positives = 99/145 (68%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G L + + ++++KGKAAL++ P F L+SG ++S+ G VML F PA G R+YD+
Sbjct: 42 GNLSGKKFASYTVFKGKAALSIHPILXSFSKLESGGSRVSKNGSVMLTFFPAVGQRKYDY 101
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNL 198
++KQ+F+LS TE+GSL++LG ESCEFFHDP S EG V+K L + PL SG+F N+
Sbjct: 102 TKKQLFALSPTEVGSLISLGPAESCEFFHDPSMKSSNEGTVKKSLSITPLGSDSGYFVNI 161
Query: 199 SVQNKLINLDESIYIPVTRAEYTVL 223
+V N ++ + +P+T+AE+ V+
Sbjct: 162 TVVNSAERTNDRLSVPITKAEFAVI 186
>gi|255632067|gb|ACU16386.1| unknown [Glycine max]
Length = 264
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 115/189 (60%), Gaps = 6/189 (3%)
Query: 38 ESTSIKKKKLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAA 97
E S +K ++ + RQ E + F+ S + S+ A G R++ +++YKGKAA
Sbjct: 20 EVLSSRKVEVGDRLRQ----EGRDFTYSAAISTATNNYAAKGHASDRIFAPYTVYKGKAA 75
Query: 98 LTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVAL 157
++ P P F LDSG V + R G +M+ F + G R+YDW ++Q F+LS TE+GSL+ +
Sbjct: 76 FSLIPCLPTFTKLDSGTVVVDRRGSIMMSFMHSIGERKYDWDKRQKFALSATEVGSLITM 135
Query: 158 GARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTR 217
A++SC+FFHDP S G+VRK L ++ P +G+F +L+V N L+N + +PVT
Sbjct: 136 DAQDSCDFFHDPSMLSSNAGQVRKSLSIK--PHANGYFVSLTVVNNLLNTKDYFSVPVTT 193
Query: 218 AEYTVLVSA 226
AE+ V+ +A
Sbjct: 194 AEFAVMKTA 202
>gi|255637711|gb|ACU19178.1| unknown [Glycine max]
Length = 235
Score = 139 bits (350), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 74/191 (38%), Positives = 119/191 (62%), Gaps = 10/191 (5%)
Query: 40 TSIKKKKLY--VKCRQSEYYEQKSFSA--SPSNSSYAPTDVAVGTLPTRVYVGHSIYKGK 95
TS + +L + R+ E ++ S SA S ++YA A G R++ +++YKGK
Sbjct: 11 TSTSRHRLLEVLSSRKVEVGDRLSHSAGISTVTNNYA----AKGYASDRIFAPYTVYKGK 66
Query: 96 AALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLV 155
AA ++ P P F L+SG V + R G +M+ F + G R+YDW ++Q F+LS TE+GSL+
Sbjct: 67 AAFSLIPCLPTFTKLNSGTVVVDRRGSIMMTFMHSIGERKYDWEKRQRFALSATEVGSLI 126
Query: 156 ALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPV 215
+GA++SC+FFHDP S G+VRK L ++ P +G+F +L+V + L+N ++ +PV
Sbjct: 127 TMGAQDSCDFFHDPSMLSSNAGQVRKSLSIK--PHANGYFVSLTVVDNLLNTNDYFSVPV 184
Query: 216 TRAEYTVLVSA 226
T AE+ V+ +A
Sbjct: 185 TTAEFAVMKTA 195
>gi|147819709|emb|CAN74120.1| hypothetical protein VITISV_034895 [Vitis vinifera]
Length = 185
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 51/91 (56%), Positives = 65/91 (71%)
Query: 79 GTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDW 138
G P RVY + +YKGKA+LTV P P+F LDSG +K+ R G +MLQF+PA G R+YDW
Sbjct: 52 GNYPDRVYAPYCVYKGKASLTVYPVLPKFSRLDSGGLKVDRHGVMMLQFSPAVGERKYDW 111
Query: 139 SRKQVFSLSVTEIGSLVALGARESCEFFHDP 169
+KQ F+LS E+GSL++L CEFFHDP
Sbjct: 112 EKKQFFALSAVEVGSLLSLSPGGGCEFFHDP 142
>gi|449445768|ref|XP_004140644.1| PREDICTED: single-stranded DNA-binding protein WHY1,
chloroplastic-like [Cucumis sativus]
Length = 108
Score = 103 bits (256), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 46/53 (86%), Positives = 52/53 (98%)
Query: 175 EEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVSAF 227
+EGKVRK+LKVEPLPDGSGHFFNL+VQNKLIN+DESIYIP+T+AEYTVLV AF
Sbjct: 19 DEGKVRKILKVEPLPDGSGHFFNLTVQNKLINVDESIYIPITKAEYTVLVEAF 71
>gi|303289333|ref|XP_003063954.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454270|gb|EEH51576.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 302
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 61/145 (42%), Positives = 83/145 (57%), Gaps = 6/145 (4%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAP---AAGVRQYDWSR 140
+VY +I+K K A+ + P F L +G+ + R+G + L+FAP AG +QYDWSR
Sbjct: 139 KVYCNFAIHKSKTAVQMSAIKPTFELLPNGSKQKKRDGAMFLEFAPVAAGAGQKQYDWSR 198
Query: 141 KQVFSLSVTEIGSLV-ALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGH-FFNL 198
KQ SLS E L AL A FFHDP+ G S +G+ K LK EP+PDGSG F NL
Sbjct: 199 KQSISLSPLEFMELSEALAANRGVNFFHDPWMGTSRQGETTKSLKAEPMPDGSGGIFLNL 258
Query: 199 SVQNKLINLDESIYIPVTRAEYTVL 223
+V + + E + I V+ E+ V
Sbjct: 259 TVASGGGRV-EKLNIAVSFQEFAVF 282
>gi|412985999|emb|CCO17199.1| predicted protein [Bathycoccus prasinos]
Length = 419
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 58/146 (39%), Positives = 81/146 (55%), Gaps = 7/146 (4%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
R +V SIYK K+AL+V+ P F + G + R G ++L+FAP+ G R+YDW+RK
Sbjct: 238 RTFVDFSIYKSKSALSVKLVKPTFETDHQGRTIMKRSGGILLEFAPSIGTRKYDWTRKGS 297
Query: 144 FSLSVTEIGSLVAL--GARES----CEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFN 197
F LS E L +R+S EFFHDP G S +G V K LK+E +PDG+G F
Sbjct: 298 FMLSPIEAAELANRLNPSRQSIAQKVEFFHDPGMGGSSQGSVTKSLKMEAMPDGTGGVF- 356
Query: 198 LSVQNKLINLDESIYIPVTRAEYTVL 223
L+ ++ IPV+ E + L
Sbjct: 357 LNYNQTKFGEKLNVNIPVSFGEVSAL 382
>gi|255084617|ref|XP_002508883.1| predicted protein [Micromonas sp. RCC299]
gi|226524160|gb|ACO70141.1| predicted protein [Micromonas sp. RCC299]
Length = 347
Score = 96.3 bits (238), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 62/174 (35%), Positives = 90/174 (51%), Gaps = 19/174 (10%)
Query: 60 KSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSR 119
+ F+A + +S A D +RVY +++YK KAA + P F G+ R
Sbjct: 163 RQFTAGNNKASSANDDAG-----SRVYCDYAVYKSKAAAKFQVIKPTFEVKPDGSRAKKR 217
Query: 120 EGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLV-ALGARESCEFFHDPFKGKSEEGK 178
+G V+L+ APA G RQYDW++KQ LS E+ L +L FFHDP G + +G
Sbjct: 218 DGGVLLEMAPAVGPRQYDWAQKQTIMLSPLELVELTESLHFGRGVNFFHDPGMGTNRQGA 277
Query: 179 VRKVLKVEPLPDGSGH-FFNLSVQ--------NKLINLDESIYIPVTRAEYTVL 223
+ K LK EP+PDGSG F N+ V + +N++ I V+ AE+ L
Sbjct: 278 MTKSLKAEPMPDGSGGIFLNMGVTTGGDGANGGQRVNMN----IAVSFAEFAAL 327
>gi|307106759|gb|EFN55004.1| hypothetical protein CHLNCDRAFT_134820 [Chlorella variabilis]
Length = 1274
Score = 96.3 bits (238), Expect = 1e-17, Method: Composition-based stats.
Identities = 51/116 (43%), Positives = 77/116 (66%), Gaps = 3/116 (2%)
Query: 85 VYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVF 144
VY ++IYKGKAA+ ++ P + S+ SG +K+SREG ++L+FA + G +QYDW++K+ F
Sbjct: 1097 VYSDYTIYKGKAAMAIKVIKPTWESIGSG-LKISREGTLLLEFAASVGPQQYDWTKKETF 1155
Query: 145 SLSVTEIGSLV-ALGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLS 199
LS E +++ A AR+S E HDP KG+ EG V K + P PD G FF+++
Sbjct: 1156 GLSALECAAVLEAADARQSFEALHDPNKGRGGEGTVYKKFNMRPAPD-KGWFFSIA 1210
>gi|223947295|gb|ACN27731.1| unknown [Zea mays]
gi|413942840|gb|AFW75489.1| whirly1 [Zea mays]
Length = 153
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 46/92 (50%), Positives = 61/92 (66%), Gaps = 2/92 (2%)
Query: 52 RQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLD 111
R S+Y++ ++ + Y G RV+ +SIYKGKAAL+ +PR P FV LD
Sbjct: 55 RHSDYFDPRAPPPPRGDGGYG--RPPNGAQDGRVFTSYSIYKGKAALSFDPRPPLFVPLD 112
Query: 112 SGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
SGA K+++EGFV+LQFAPA RQYDW+RKQV
Sbjct: 113 SGAYKVAKEGFVLLQFAPAVATRQYDWTRKQV 144
>gi|384244785|gb|EIE18283.1| ssDNA-binding transcriptional regulator [Coccomyxa subellipsoidea
C-169]
Length = 250
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 82/156 (52%), Gaps = 20/156 (12%)
Query: 85 VYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAA-------GVRQYD 137
+Y ++IYKGKAA + R P +V G++ L R G ++++FAP A G R Y
Sbjct: 69 LYANYAIYKGKAAASFRVRKPRWVEAQDGSISLDRAGSLIVEFAPVAPGSGTNVGNRSYQ 128
Query: 138 WSRKQVFSLSVTEIGSLVALGARESC-------EFFHDPFKGKSEEGKVRKVLKVEPLPD 190
W +KQ F+LS E+ LV ESC E FHDP KG ++ GK+ K L ++
Sbjct: 129 WDKKQTFALSPVELAGLV-----ESCTTGKSMKELFHDPNKGGTDSGKIAKTLSLQRFDQ 183
Query: 191 GSGHFFNLSVQNKLI-NLDESIYIPVTRAEYTVLVS 225
G + L+V++ + ++ +P+T AE L S
Sbjct: 184 GQDWYLQLAVKDSASKDGSATMGLPITGAELYTLKS 219
>gi|145355417|ref|XP_001421958.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582197|gb|ABP00252.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 152
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 71/139 (51%), Gaps = 2/139 (1%)
Query: 86 YVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFS 145
+V + +YK K AL ++ F S SG + R G V+L+ A A RQYDW K F
Sbjct: 1 FVDYGVYKTKGALKLKAVRATFESDASGRRIMKRAGGVLLELANATAPRQYDWGNKGSFM 60
Query: 146 LSVTEIGSLVA-LGARESCEFFHDPFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKL 204
LS TE L + + C FFHDP G + G V K KVEP+PDGSG F +++Q
Sbjct: 61 LSATEAAELADRMASNAPCSFFHDPGAGGANRGNVNKAFKVEPMPDGSGGLF-VNLQTTS 119
Query: 205 INLDESIYIPVTRAEYTVL 223
+ +PV+ E L
Sbjct: 120 GGNKSFVSVPVSYGESAAL 138
>gi|255561490|ref|XP_002521755.1| conserved hypothetical protein [Ricinus communis]
gi|223538968|gb|EEF40565.1| conserved hypothetical protein [Ricinus communis]
Length = 173
Score = 93.2 bits (230), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 66/96 (68%)
Query: 109 SLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFHD 168
++ SG +K+ R G ++L F PA G R+YD+ ++Q F+LS TE+GSL++LG ++S + FHD
Sbjct: 76 TVQSGHLKVERRGVILLTFLPAIGERKYDYEKRQSFALSTTEVGSLISLGPKDSFDCFHD 135
Query: 169 PFKGKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKL 204
P S G+VRK L ++P +G G+F +LS N +
Sbjct: 136 PGMLSSNAGEVRKSLSLKPHAEGGGYFISLSSWNNV 171
>gi|308812987|ref|XP_003083800.1| DNA-binding protein p24 (ISS) [Ostreococcus tauri]
gi|116055682|emb|CAL57767.1| DNA-binding protein p24 (ISS) [Ostreococcus tauri]
Length = 240
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 70/145 (48%), Gaps = 4/145 (2%)
Query: 82 PTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRK 141
P +V +YK + A+ ++ P +L AV + R G V+L+ A A R YDW K
Sbjct: 78 PPTTFVDFGVYKTRGAMKMKAVRPTLGALGENAV-VRRPGGVLLELANATAPRTYDWQNK 136
Query: 142 QVFSLSVTEIGSLV-ALGARESCEFFHDPFKGKSEEGK-VRKVLKVEPLPDGSGHFF-NL 198
F LS TE L + A SC FFHD K G + K KVEP+PDGSG F NL
Sbjct: 137 GSFMLSGTEAAELADRMAANGSCSFFHDSAKANGGGGGTLGKAFKVEPMPDGSGGMFVNL 196
Query: 199 SVQNKLINLDESIYIPVTRAEYTVL 223
+V + +PV+ E +
Sbjct: 197 TVTLSENRGQQRFSVPVSYGESAAI 221
>gi|124484361|dbj|BAF46291.1| transcription regulator Pbf-2 [Chlamydomonas reinhardtii]
Length = 238
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 93/186 (50%), Gaps = 28/186 (15%)
Query: 57 YEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVK 116
Y S + P N + AP D A + RVY + +YK +AA+ + P F +G V
Sbjct: 47 YSNGSAAPVPPNFA-APNDRAATSSSDRVYTNYYVYKTRAAMCLRLLPPTFAKAQAGKV- 104
Query: 117 LSREGFVMLQFAPAAGV-------------RQYDWSRKQVFSLSVTEIGSLV---ALGAR 160
L R+G ++L+FA A R Y+W K F+LS E+G+++ A+ +
Sbjct: 105 LERDGTMLLEFATANAAAPGAGNGPAGNVNRTYNWGNKVTFALSPVELGNILAGDAVASD 164
Query: 161 ESCEFFHDPFK-GKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAE 219
+ +HDP K GK+ G+ K L ++ LPDG+ FNL+ E+ +PVT+ E
Sbjct: 165 KGLVLWHDPAKLGKT--GEPIKKLSLKQLPDGNIS-FNLTAG------PENFSVPVTKGE 215
Query: 220 YTVLVS 225
+ V+ S
Sbjct: 216 FEVIKS 221
>gi|302829905|ref|XP_002946519.1| hypothetical protein VOLCADRAFT_86539 [Volvox carteri f. nagariensis]
gi|300268265|gb|EFJ52446.1| hypothetical protein VOLCADRAFT_86539 [Volvox carteri f. nagariensis]
Length = 3754
Score = 70.9 bits (172), Expect = 4e-10, Method: Composition-based stats.
Identities = 56/159 (35%), Positives = 86/159 (54%), Gaps = 27/159 (16%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFA-----------PAAG 132
RVYV IYK +AA+ V P F + ++G L R+G ++L+FA PAAG
Sbjct: 3589 RVYVNFHIYKTRAAMAVRLLPPSFTT-ENGYKTLERDGVMLLEFANANPGQPSGTAPAAG 3647
Query: 133 --VRQYDWSRKQVFSLSVTEIGSLV---ALGARESCEFFHDPFK-GKSEEGKVRKVLKVE 186
R Y+WS K F+LS +E+G+++ A+ + + +HDP K GK G+ K L ++
Sbjct: 3648 GINRTYNWSNKISFALSPSELGTMLAGDAIASDKGLVMYHDPTKLGKV--GEPMKRLTMK 3705
Query: 187 PLPDGSGHFFNLSVQNKLINLDESIYIPVTRAEYTVLVS 225
+PDG+ F L ++I +PV+R E+ VL S
Sbjct: 3706 QMPDGAISF-------SLSAAPDNISLPVSRGEFEVLKS 3737
>gi|159487549|ref|XP_001701785.1| transcription factor [Chlamydomonas reinhardtii]
gi|158281004|gb|EDP06760.1| transcription factor [Chlamydomonas reinhardtii]
Length = 238
Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 93/186 (50%), Gaps = 28/186 (15%)
Query: 57 YEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVK 116
Y S + P N + AP D A + RVY + +YK +AA+ + P F +G V
Sbjct: 47 YSNGSAAPVPPNFA-APNDRAATSSSDRVYTNYYVYKTRAAMCLRLLPPTFAKAQAGKV- 104
Query: 117 LSREGFVMLQFAPAAGV-------------RQYDWSRKQVFSLSVTEIGSLV---ALGAR 160
L R+G ++L+FA A R Y+W K F+LS E+G+++ A+ +
Sbjct: 105 LERDGTMLLEFATANAAAPGAGSGPAGNVNRTYNWGNKVTFALSPVELGNILAGDAVASD 164
Query: 161 ESCEFFHDPFK-GKSEEGKVRKVLKVEPLPDGSGHFFNLSVQNKLINLDESIYIPVTRAE 219
+ +HDP K GK+ G+ K L ++ LPDG+ FNL+ E+ +PVT+ E
Sbjct: 165 KGLVLWHDPAKLGKT--GEPIKKLSLKQLPDGNIS-FNLTAG------PENFSVPVTKGE 215
Query: 220 YTVLVS 225
+ V+ S
Sbjct: 216 FEVIKS 221
>gi|2947067|gb|AAC05348.1| hypothetical protein [Arabidopsis thaliana]
Length = 192
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 7/76 (9%)
Query: 46 KLYVKCRQSEYYEQKSFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGP 105
KL VK RQS+Y+E++ F S S+ + + R YVGHSIYKGKAALT+EPR P
Sbjct: 53 KLTVKSRQSDYFEKQRFGDSSSSQNAEVSS-------PRFYVGHSIYKGKAALTIEPRAP 105
Query: 106 EFVSLDSGAVKLSREG 121
EFV+L+ +K++ G
Sbjct: 106 EFVALEIVMMKINWFG 121
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 59/104 (56%), Gaps = 14/104 (13%)
Query: 134 RQYDWSRKQVFSLSVTEIGSLVALGARESCEFF--HDPFKGKSE---EGKVRKVLKVEPL 188
RQ D+ KQ F S + + V+ S F+ H +KGK+ E + + + +E +
Sbjct: 59 RQSDYFEKQRFGDSSSSQNAEVS-----SPRFYVGHSIYKGKAALTIEPRAPEFVALEIV 113
Query: 189 PDGSGHF----FNLSVQNKLINLDESIYIPVTRAEYTVLVSAFN 228
F ++ VQNKL+N+DES+YIP+T+AE+ VL+SAFN
Sbjct: 114 MMKINWFGERYYDCGVQNKLLNVDESVYIPITKAEFAVLISAFN 157
>gi|297836056|ref|XP_002885910.1| hypothetical protein ARALYDRAFT_899639 [Arabidopsis lyrata subsp.
lyrata]
gi|297331750|gb|EFH62169.1| hypothetical protein ARALYDRAFT_899639 [Arabidopsis lyrata subsp.
lyrata]
Length = 74
Score = 59.7 bits (143), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 26/44 (59%), Positives = 35/44 (79%)
Query: 111 DSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSL 154
DS A KLS++GF++LQFAP+AGVRQY+W +KQV+ +T G L
Sbjct: 3 DSEAFKLSKDGFLLLQFAPSAGVRQYNWGKKQVWFYLLTSYGPL 46
>gi|449458111|ref|XP_004146791.1| PREDICTED: single-stranded DNA-binding protein WHY2,
mitochondrial-like [Cucumis sativus]
Length = 198
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 30/34 (88%)
Query: 194 HFFNLSVQNKLINLDESIYIPVTRAEYTVLVSAF 227
+F +VQNKLIN+DESIYIP+T+AEYTVLV AF
Sbjct: 124 YFAAQAVQNKLINVDESIYIPITKAEYTVLVEAF 157
>gi|348676880|gb|EGZ16697.1| hypothetical protein PHYSODRAFT_500116 [Phytophthora sojae]
Length = 224
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 4/104 (3%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ S+Y +A V P P++ S S +K+ R G +ML +A A Y++ K
Sbjct: 29 RVFPNFSVYGSDSAFQVAPSAPQYTSAGS-YMKMKRVGAIMLSWAKATN-SGYNYQNKTF 86
Query: 144 FSLSVTEIGSLVALGARESCE--FFHDPFKGKSEEGKVRKVLKV 185
FSLS TE+G ++ L E F H P SEE K K L +
Sbjct: 87 FSLSPTEVGLVLELLDSRIPELSFTHSPNMNASEEDKNSKTLHI 130
>gi|253761700|ref|XP_002489225.1| hypothetical protein SORBIDRAFT_0012s017770 [Sorghum bicolor]
gi|241947085|gb|EES20230.1| hypothetical protein SORBIDRAFT_0012s017770 [Sorghum bicolor]
Length = 91
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 21/31 (67%), Positives = 27/31 (87%)
Query: 112 SGAVKLSREGFVMLQFAPAAGVRQYDWSRKQ 142
SGA K+++EG+V+LQFAPA RQYDW+RKQ
Sbjct: 5 SGAYKVAKEGYVLLQFAPAVATRQYDWTRKQ 35
>gi|397620242|gb|EJK65621.1| hypothetical protein THAOC_13502 [Thalassiosira oceanica]
Length = 317
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/167 (28%), Positives = 75/167 (44%), Gaps = 21/167 (12%)
Query: 81 LPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPA-AGVRQYDWS 139
+P R Y ++++ +AL+++ P F V + R G +ML+F P A + W+
Sbjct: 4 MPRRGYPQYTMFSSDSALSMKAAMPVFKKAGMDGVAVERRGKMMLEFVPRNASGSGFAWN 63
Query: 140 RKQVFSLSVTEIGSLVALGARESCEFFHDPFKGKSEEGK--------------VRKVLKV 185
K +FSL+V E+G L++ + E H F S++G V KVL V
Sbjct: 64 DKTIFSLTVEEVGLLLSQLPGNAVELSHPTF--SSDDGAFGQESQVTQVSGDIVEKVLTV 121
Query: 186 EPLPDGSGHFFNLS-VQNKLINLDESIY--IPVTRAEYTVLVSAFNV 229
+P DG+ F + V N + + IP T E T+ F V
Sbjct: 122 DP-GDGATMTFKVDYVTNGVGGQTPPGFDGIPSTPLEITIQAGEFEV 167
>gi|223994305|ref|XP_002286836.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220978151|gb|EED96477.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 288
Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 48/95 (50%), Gaps = 7/95 (7%)
Query: 77 AVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAP----AAG 132
++G +P R + ++++ +AL+V P F + + + R G ++L+F P AG
Sbjct: 73 SMGMMPRRGFPQYTVFGPDSALSVRAVLPNFKRAGTDGISVDRRGKIVLEFVPRNPSGAG 132
Query: 133 VRQYDWSRKQVFSLSVTEIGSLVALGARESCEFFH 167
+ W+ K FS+SV E+G V+ + E H
Sbjct: 133 ---FQWADKTTFSMSVEEVGLFVSQLPQSGIELSH 164
>gi|301101636|ref|XP_002899906.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262102481|gb|EEY60533.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 226
Score = 47.0 bits (110), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 49/104 (47%), Gaps = 4/104 (3%)
Query: 84 RVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV 143
RV+ SIY +A V P P++ + +K R G +ML +A A Y++ K
Sbjct: 29 RVFPNFSIYGSDSAFQVSPSPPQYTN-GGNYLKTKRVGAIMLSWAKATN-SGYNYQNKTF 86
Query: 144 FSLSVTEIGSLVALGARESCE--FFHDPFKGKSEEGKVRKVLKV 185
FSLS +E+G ++ L E H P SEE K K L +
Sbjct: 87 FSLSPSEVGLVLELLDSRIPELSLTHSPNMNASEEDKNTKSLHI 130
>gi|124262628|ref|YP_001023098.1| hypothetical protein Mpe_B0084 [Methylibium petroleiphilum PM1]
gi|124261874|gb|ABM96863.1| hypothetical protein Mpe_B0084 [Methylibium petroleiphilum PM1]
Length = 421
Score = 40.4 bits (93), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 48/108 (44%), Gaps = 24/108 (22%)
Query: 52 RQSEYYEQK--SFSASPSNSSYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPRGPEFVS 109
RQ + +EQ+ + A P+ +S + Y+ IY GKAAL S
Sbjct: 239 RQDQRHEQRQQARQAEPAEASA-----------DQEYLSQHIYGGKAALCF--------S 279
Query: 110 LDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVAL 157
D K+ V L+ A A+ RQYDW RK LS E+ ++A+
Sbjct: 280 ADQTRAKVH---TVRLEAAEASATRQYDWQRKIAIQLSQRELPLVLAV 324
>gi|406976196|gb|EKD98717.1| hypothetical protein ACD_23C00300G0005 [uncultured bacterium]
Length = 290
Score = 40.4 bits (93), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 33/63 (52%), Gaps = 5/63 (7%)
Query: 89 HSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQVFSLSV 148
H IY KAALTVE E ++ +G L V+L+ A A G R YDWSRK F
Sbjct: 130 HHIYGLKAALTVEM--DELRTMGNGGGLLQ---TVILEAATATGHRSYDWSRKIAFQFMR 184
Query: 149 TEI 151
E+
Sbjct: 185 REL 187
>gi|325185050|emb|CCA19542.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 211
Score = 38.9 bits (89), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 51/105 (48%), Gaps = 4/105 (3%)
Query: 83 TRVYVGHSIYKGKAALTVEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQ 142
+++Y ++Y+ A V P PE+V S +K R G ++L +A YD+++K
Sbjct: 21 SKIYPSFTVYESDAVFQVSPIQPEYVQ-QSNYLKTKRVGSLLLSWAKQRD-GSYDYTKKL 78
Query: 143 VFSLSVTEIGSLVALGARESCEFF--HDPFKGKSEEGKVRKVLKV 185
F+L+ +EIG ++ + + E H P + + LKV
Sbjct: 79 YFALTPSEIGLVLEVLDSKVGEINIKHTPNRNDPAVQTTERTLKV 123
>gi|290243159|ref|YP_003494829.1| hypothetical protein TK90_2878 [Thioalkalivibrio sp. K90mix]
gi|288945664|gb|ADC73362.1| hypothetical protein TK90_2878 [Thioalkalivibrio sp. K90mix]
Length = 259
Score = 38.1 bits (87), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 24/45 (53%), Gaps = 2/45 (4%)
Query: 123 VMLQFAPAAGVRQYDWSRKQVFSLSVTEIGSLVAL--GARESCEF 165
V + A A G R YDW K + L+ TE+ + A+ G R CEF
Sbjct: 138 VQIDAAKAKGERAYDWGAKTIIQLTPTELAVVTAVFTGVRAKCEF 182
>gi|170077972|ref|YP_001734610.1| hypothetical protein SYNPCC7002_A1358 [Synechococcus sp. PCC 7002]
gi|169885641|gb|ACA99354.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
Length = 168
Score = 38.1 bits (87), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 60/123 (48%), Gaps = 9/123 (7%)
Query: 42 IKKKKLYVKCRQSEYYEQKSFSASPSNSSYA-PTDVAVGTLPTRVYV-GHSIYKGKAALT 99
++KK+L + RQ YY +SF ++ DV T+ + + G SI++G++ +
Sbjct: 4 LRKKQLQQRLRQDPYYRLRSFEEVALAAAIGIKIDVNQATVDDWLRLPGISIHQGRSLVQ 63
Query: 100 VEPRGPEFVSLDSGAVKLSREGFVMLQFAPAAGVRQYDWSRKQV----FSLSVTEIGSLV 155
+ +G +F SLD A L G + P A + + + ++ +L+ ++G+LV
Sbjct: 64 LTSQGVQFYSLDDVAAAL---GVPRQRLQPLAAILSFAYYAPELAPVQINLNRADLGALV 120
Query: 156 ALG 158
G
Sbjct: 121 DFG 123
>gi|148237294|ref|NP_001088381.1| perforin 1 (pore forming protein) precursor [Xenopus laevis]
gi|80476913|gb|AAI08843.1| LOC495232 protein [Xenopus laevis]
Length = 532
Score = 36.6 bits (83), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 48/109 (44%), Gaps = 21/109 (19%)
Query: 13 LNPKLCPFHSLSNSKGNGFGSISVTESTSIKKKKLYVKCRQS--EYYEQK-SFSASPSNS 69
LN L P H+L NS+ S+ + S IKK+ L ++C S Y Q+ S S S S
Sbjct: 331 LNYSLMPIHTLINSRSPKRESLRIALSEYIKKRSLKIECPCSGNTYPSQREDCSCSCSPS 390
Query: 70 SYAPTDVAVGTLPTRVYVGHSIYKGKAALTVEPR------GPEFVSLDS 112
Y +D PTR KG A L +E R G F S DS
Sbjct: 391 KYTNSDCC----PTR--------KGLAKLIIEIRSATGLNGDYFSSTDS 427
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.132 0.377
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,440,455,557
Number of Sequences: 23463169
Number of extensions: 138296256
Number of successful extensions: 303789
Number of sequences better than 100.0: 92
Number of HSP's better than 100.0 without gapping: 74
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 303666
Number of HSP's gapped (non-prelim): 94
length of query: 229
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 91
effective length of database: 9,121,278,045
effective search space: 830036302095
effective search space used: 830036302095
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 74 (33.1 bits)