BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011271
(489 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O14308|SWC4_SCHPO SWR1-complex protein 4 OS=Schizosaccharomyces pombe (strain 972 /
ATCC 24843) GN=swc4 PE=3 SV=2
Length = 437
Score = 147 bits (372), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 99/284 (34%), Positives = 143/284 (50%), Gaps = 55/284 (19%)
Query: 2 DAKDILGIPKTQLPTTQEKKSRPQKEPQRKPDGISREVYALTG-GLAPLMPSIDVSQLKK 60
D +D+ +P ++ Q+ K+ +R+P+GISRE+Y+L G APL +I + K+
Sbjct: 5 DIRDVFELPPPEIGNKQKSKT----PTERRPEGISRELYSLLGENSAPL--AIYQKKFKE 58
Query: 61 RPPSDEKI-TWQWLPFTNSARKDNLQLYHWVGFGNFWKCFHELGKRSFLMWGVVYNIFFN 119
+P K W PF+ S+RKD+ L+HWV
Sbjct: 59 KPKVSHKAKNWVRQPFSISSRKDDFTLHHWV----------------------------- 89
Query: 120 FFGVSHAFQVRVVNGVPPTGDYSFAKYNKSVDVVKYTDEEYEKYLTDPMWTKEETDQLFE 179
+ + V Y F K+N + ++ YTDEEY+ YL D W K+ETD LF
Sbjct: 90 -----------LKSEVDSEASYKFEKFNVPLFIIDYTDEEYQNYLKDEDWNKDETDYLFR 138
Query: 180 LCERFDLRFIVIADRFPSS-----RTVEELKDRYYGVSRAILIARAP--SPTDVSGHPLV 232
LC+ +DLRF VIADR+ + RT+E+LKDR+Y VSR IL+AR P S T L
Sbjct: 139 LCKDYDLRFFVIADRYDNEKYKKHRTLEDLKDRFYSVSRKILLARNPINSMTAAQSSLLN 198
Query: 233 KDPYNVSQEVERKRALSMVLSQTKHQERKDAEVLAEAKRITDSR 276
YN QEV RK+ L + S+T + ++ + E KRI S+
Sbjct: 199 TMEYNKEQEVIRKKYLIGLASRTPEEVAEEEALFIELKRIETSQ 242
>sp|Q9NPF5|DMAP1_HUMAN DNA methyltransferase 1-associated protein 1 OS=Homo sapiens
GN=DMAP1 PE=1 SV=1
Length = 467
Score = 144 bits (362), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 127/414 (30%), Positives = 184/414 (44%), Gaps = 101/414 (24%)
Query: 30 RKPDGISREVYAL----TGGLAPLMPSIDVSQ----LKKRPPSDEKITWQWLPFTNSARK 81
++P+G+ REVYAL PL+PS D Q +K + S + W+W+PFTN ARK
Sbjct: 48 KRPEGMHREVYALLYSDKKDAPPLLPS-DTGQGYRTVKAKLGSKKVRPWKWMPFTNPARK 106
Query: 82 DNLQLYHWVGFGNFWKCFHELGKRSFLMWGVVYNIFFNFFGVSHAFQVRVVNGVPPTGDY 141
D +HW + E GK DY
Sbjct: 107 DGAMFFHW-------RRAAEEGK-----------------------------------DY 124
Query: 142 SFAKYNKSVDVVKYTDEEYEKYLTDPMWTKEETDQLFELCERFDLRFIVIADRFP----S 197
FA++NK+V V Y+++EY+ YL D WTK ETD LF+L RFDLRF+VI DR+
Sbjct: 125 PFARFNKTVQVPVYSEQEYQLYLHDDAWTKAETDHLFDLSRRFDLRFVVIHDRYDHQQFK 184
Query: 198 SRTVEELKDRYYGVSRAILIARAPSPTDVSGHPLVKDP-YNVSQEVERKRALSMVLSQTK 256
R+VE+LK+RYY + + RA TD +K P ++ E RK L + ++T
Sbjct: 185 KRSVEDLKERYYHICAKLANVRAVPGTD------LKIPVFDAGHERRRKEQLERLYNRTP 238
Query: 257 HQERKDAEVLAEAKRITDSRMASRAAEEPEMPVASHVGSESADRAVVLGDTVSPSSNIQ- 315
Q ++ +L E ++I ++R R S+ + + DT + +
Sbjct: 239 EQVAEEEYLLQELRKI-EARKKEREKR-----------SQDLQKLITAADTTAEQRRTER 286
Query: 316 ------LPSAT-----VVPSTSIIA--DSASTLASLRMLRVYLRTYALEQMVQAASSSAG 362
LP VP T+ I D S +LR R+ L SS G
Sbjct: 287 KAPKKKLPQKKEAEKPAVPETAGIKFPDFKSAGVTLRSQRMKL------------PSSVG 334
Query: 363 LRTIKRVEQALQELGVNLKPKVPTKAVCAEHLELRKEILTLLNLQKQLQYKEAE 416
+ IK +EQ L ELGV L P PT+ + ELR +++ L L++ E E
Sbjct: 335 QKKIKALEQMLLELGVELSP-TPTEELVHMFNELRSDLVLLYELKQACANCEYE 387
>sp|Q9JI44|DMAP1_MOUSE DNA methyltransferase 1-associated protein 1 OS=Mus musculus
GN=Dmap1 PE=1 SV=1
Length = 468
Score = 143 bits (360), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 127/414 (30%), Positives = 184/414 (44%), Gaps = 101/414 (24%)
Query: 30 RKPDGISREVYAL----TGGLAPLMPSIDVSQ----LKKRPPSDEKITWQWLPFTNSARK 81
++P+G+ REVYAL PL+PS D Q +K + S + W+W+PFTN ARK
Sbjct: 48 KRPEGMHREVYALLYSDKKDAPPLLPS-DTGQGYRTVKAKLGSKKVRPWKWMPFTNPARK 106
Query: 82 DNLQLYHWVGFGNFWKCFHELGKRSFLMWGVVYNIFFNFFGVSHAFQVRVVNGVPPTGDY 141
D +HW + E GK DY
Sbjct: 107 DGAMFFHW-------RRAAEEGK-----------------------------------DY 124
Query: 142 SFAKYNKSVDVVKYTDEEYEKYLTDPMWTKEETDQLFELCERFDLRFIVIADRFP----S 197
FA++NK+V V Y+++EY+ YL D WTK ETD LF+L RFDLRF+VI DR+
Sbjct: 125 PFARFNKTVQVPVYSEQEYQLYLHDDAWTKAETDHLFDLSRRFDLRFVVIHDRYDHQQFK 184
Query: 198 SRTVEELKDRYYGVSRAILIARAPSPTDVSGHPLVKDP-YNVSQEVERKRALSMVLSQTK 256
R+VE+LK+RYY + + RA TD +K P ++ E RK L + ++T
Sbjct: 185 KRSVEDLKERYYHICAKLANVRAVPGTD------LKIPVFDAGHERRRKEQLERLYNRTP 238
Query: 257 HQERKDAEVLAEAKRITDSRMASRAAEEPEMPVASHVGSESADRAVVLGDTVSPSSNIQ- 315
Q ++ +L E ++I ++R R S+ + + DT + +
Sbjct: 239 EQVAEEEYLLQELRKI-EARKKEREKR-----------SQDLQKLITAADTTAEQRRTER 286
Query: 316 ------LPSAT-----VVPSTSIIA--DSASTLASLRMLRVYLRTYALEQMVQAASSSAG 362
LP VP T+ I D S +LR R+ L SS G
Sbjct: 287 KAPKKKLPQKKEAEKPAVPETAGIKFPDFKSAGVTLRSQRMKL------------PSSVG 334
Query: 363 LRTIKRVEQALQELGVNLKPKVPTKAVCAEHLELRKEILTLLNLQKQLQYKEAE 416
+ IK +EQ L ELGV L P PT+ + ELR +++ L L++ E E
Sbjct: 335 QKKIKALEQMLLELGVELSP-TPTEELVHMFNELRSDLVLLYELKQACANCEYE 387
>sp|P0CO96|SWC4_CRYNJ SWR1-complex protein 4 OS=Cryptococcus neoformans var. neoformans
serotype D (strain JEC21 / ATCC MYA-565) GN=SWC4 PE=3
SV=1
Length = 463
Score = 121 bits (303), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 141/300 (47%), Gaps = 61/300 (20%)
Query: 1 MDAKDILGI----PKTQLPTTQEKKSRPQKEPQRKPDGISREVYALTGGLAPLMP----S 52
M A+D+ I P T LPT K P RKPDGI+RE+YAL G AP + S
Sbjct: 1 MSAQDVRSILSLPPSTPLPTLSSSKKVPVP---RKPDGITRELYALIGDNAPSLADAQAS 57
Query: 53 IDVSQLKKRPP-SDEKITWQWLPFTNSARKDN-LQLYHWVGFGNFWKCFHELGKRSFLMW 110
+ + +++P +K+ W+W FT +AR+DN ++L HW
Sbjct: 58 LAAVKYREKPALKGKKVHWEWTKFTPAARRDNPVRLGHWA-------------------- 97
Query: 111 GVVYNIFFNFFGVSHAFQVRVVNGVPPTGDYSFAKYN-KSVDVVKYTDEEYEKYLTDPMW 169
R+ + P F K+N V++Y+ EY+++L DP W
Sbjct: 98 -------------------RITDSDPNDSVEYFGKFNLHGPSVMEYSQFEYDQHLVDPNW 138
Query: 170 TKEETDQLFELCERFDLRFIVIADRFP--------SSRTVEELKDRYYGVSRAILIARAP 221
T +ET+ LFEL + +DLRFIV ADR+ R+VE++KDRYY + R ++ R
Sbjct: 139 TLQETEYLFELLKEYDLRFIVAADRYAYVSPEGEKRKRSVEDMKDRYYTICRRLIRTRTA 198
Query: 222 SPTDVSGHPLVKDPYNVSQEVERKRALSMVLSQTKHQERKDAEVLAEAKRITDSRMASRA 281
S H + ++ ++E++RK+ S + T + ++ + E R+ + RA
Sbjct: 199 SDPVHQQHLIQAYAFDKAREIKRKQYASDLFHLTPAEIAEEEALYVEITRMQQNERRFRA 258
>sp|P0CO97|SWC4_CRYNB SWR1-complex protein 4 OS=Cryptococcus neoformans var. neoformans
serotype D (strain B-3501A) GN=SWC4 PE=3 SV=1
Length = 463
Score = 121 bits (303), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 141/300 (47%), Gaps = 61/300 (20%)
Query: 1 MDAKDILGI----PKTQLPTTQEKKSRPQKEPQRKPDGISREVYALTGGLAPLMP----S 52
M A+D+ I P T LPT K P RKPDGI+RE+YAL G AP + S
Sbjct: 1 MSAQDVRSILSLPPSTPLPTLSSSKKVPVP---RKPDGITRELYALIGDNAPSLADAQAS 57
Query: 53 IDVSQLKKRPP-SDEKITWQWLPFTNSARKDN-LQLYHWVGFGNFWKCFHELGKRSFLMW 110
+ + +++P +K+ W+W FT +AR+DN ++L HW
Sbjct: 58 LAAVKYREKPALKGKKVHWEWTKFTPAARRDNPVRLGHWA-------------------- 97
Query: 111 GVVYNIFFNFFGVSHAFQVRVVNGVPPTGDYSFAKYN-KSVDVVKYTDEEYEKYLTDPMW 169
R+ + P F K+N V++Y+ EY+++L DP W
Sbjct: 98 -------------------RITDSDPNDSVEYFGKFNLHGPSVMEYSQFEYDQHLVDPNW 138
Query: 170 TKEETDQLFELCERFDLRFIVIADRFP--------SSRTVEELKDRYYGVSRAILIARAP 221
T +ET+ LFEL + +DLRFIV ADR+ R+VE++KDRYY + R ++ R
Sbjct: 139 TLQETEYLFELLKEYDLRFIVAADRYAYVSPEGEKRKRSVEDMKDRYYTICRRLIRTRTA 198
Query: 222 SPTDVSGHPLVKDPYNVSQEVERKRALSMVLSQTKHQERKDAEVLAEAKRITDSRMASRA 281
S H + ++ ++E++RK+ S + T + ++ + E R+ + RA
Sbjct: 199 SDPVHQQHLIQAYAFDKAREIKRKQYASDLFHLTPAEIAEEEALYVEITRMQQNERRFRA 258
>sp|Q4WNY4|SWC4_ASPFU SWR1-complex protein 4 OS=Neosartorya fumigata (strain ATCC
MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=swc4 PE=3
SV=1
Length = 588
Score = 119 bits (299), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 96/304 (31%), Positives = 151/304 (49%), Gaps = 52/304 (17%)
Query: 2 DAKDILGIPKTQLPTTQEKKSRPQKEPQ---RKPDGISREVYALTGGLAPLMPSIDVSQL 58
D +D+L +P P +K+ +K P + +GI+RE+YAL G AP + +I+ ++
Sbjct: 5 DVRDMLDLPAEGQPRPHKKQKVVEKRPDTDVQASEGITRELYALLGERAPPI-AINENRY 63
Query: 59 KKRPPSDEKIT---WQWLPFTNSARKDNLQLYHWVGFGNFWKCFHELGKRSFLMWGVVYN 115
K RP K+ WQ PFTN+AR D L L HW + E K L
Sbjct: 64 KGRPKWMSKLRVRPWQMTPFTNNARSDGLVLRHW-------QRQSESAKAPALEGASEME 116
Query: 116 IFFNFFGVSHAFQVRVVNGVPPTGDYSFAKYNKSVDV-VKYTDEEYEKYLTDPMWTKEET 174
+ G A P +Y FAKYN V +YTDEEY ++L W+++ET
Sbjct: 117 VDQAKAGGGAAT---------PEKEYPFAKYNVKPRVPRRYTDEEYNRHLKSDDWSRQET 167
Query: 175 DQLFELCERFDLRFIVIADRF-----------------PSS--RTVEELKDRYYGVSRAI 215
D L +L E +DLR++VIADR+ P+ RT+E++K RYY ++ ++
Sbjct: 168 DYLMDLVEEYDLRWVVIADRYDFQPQPIDAEANATALVPAKQYRTMEQMKARYYFIAASM 227
Query: 216 LIARAPSPTDVS------GHPLVKDPYNVSQEVERKRALSMVLSQTKHQERKDAEVLAEA 269
L P P+++S ++K ++ +E RK ++ L++T + R++ +L E
Sbjct: 228 LALEHP-PSEMSEAEFDLHEKMMK--FDPDRERARKELAALQLNRTADEVREEGILLEEL 284
Query: 270 KRIT 273
KRIT
Sbjct: 285 KRIT 288
>sp|Q4PG15|SWC4_USTMA SWR1-complex protein 4 OS=Ustilago maydis (strain 521 / FGSC 9021)
GN=SWC4 PE=3 SV=1
Length = 615
Score = 109 bits (272), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 90/317 (28%), Positives = 148/317 (46%), Gaps = 73/317 (23%)
Query: 15 PTTQEKKSRPQKEPQRKPDGISREVYALTGGLAPLM-----------PSIDVSQL----- 58
P++ SRP+ K DG++RE++AL G AP + P + + L
Sbjct: 44 PSSAASASRPKP----KYDGMTRELFALLGDNAPSLAMTHGLDAEGKPVMGLGGLFKPKF 99
Query: 59 KKRPPSDEKI-TWQWLPFTNSARKDNLQLYHWVGFGNFWKCFHELGKRSFLMWGVVYNIF 117
K+R EK+ W+W PF NSAR D+ Q+ V N + +
Sbjct: 100 KRR---KEKVRQWRWTPFLNSAR-DDTQIDDDVPEIN-------------------HGLI 136
Query: 118 FNFFGVSHAFQVRVVNGVPPTG-----DYSFAKYNKSVDVVKYTDEEYEKYLTDPMWTKE 172
+ + + +F +G+ Y +A +N + V Y+++EY ++L D WTKE
Sbjct: 137 LHHWAPARSFSTTAADGISAEDADIDTKYQYAAFNTTSGVYSYSNDEYIQHLRDDDWTKE 196
Query: 173 ETDQLFELCERFDLRFIVIADRFP--------------------SSRTVEELKDRYYGVS 212
ETD L ELC +DLRF+VI DR+ R++E+LK RYY +
Sbjct: 197 ETDYLMELCTAYDLRFVVIHDRYDWAAAQASFLAGSTSAVPQPVKERSMEDLKVRYYAIC 256
Query: 213 RAILIARAPSPTDVSGHPLVKDPY--NVSQEVERKRALSMVLSQTKHQERKDAEVLAEAK 270
R ++ +R S DV ++ Y + +EVERK+A++ + ++T Q ++ + E +
Sbjct: 257 RRLIRSRI-STDDVETRQMLLSTYAFDKQREVERKKAVARLYTRTPEQLAEEEALYVEIR 315
Query: 271 RITDSRMASRAAEEPEM 287
RI + A A+E E+
Sbjct: 316 RI-EQNEAKYASEREEL 331
>sp|Q870Q1|SWC4_NEUCR SWR1-complex protein 4 OS=Neurospora crassa (strain ATCC 24698 /
74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=swc-4
PE=3 SV=2
Length = 733
Score = 97.4 bits (241), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 95/301 (31%), Positives = 135/301 (44%), Gaps = 55/301 (18%)
Query: 2 DAKDILGIPKTQLPTTQEKKSRPQKEPQRKPDGISREVYALTGGLAPLMPSIDVSQLKKR 61
D +D+L +P KK+R P+ G++REV L GG P+ +VS KKR
Sbjct: 5 DVRDVLNLPSDHAGPRPSKKARTAT-PRPNLKGLAREVQNL-GGDNPIAIVPEVSIFKKR 62
Query: 62 PPSDEK--ITWQWLPFTNSARKDN--LQLYHWVGFGNFWKCFHELGKRSFLMWGVVYNIF 117
K W+ FTNSAR D+ L L HW + G V +
Sbjct: 63 RTVSRKPAAKWELKAFTNSARGDDGALVLRHWKRKPD----------------GTVQD-- 104
Query: 118 FNFFGVSHAFQVRVVNGVPPTGDYSFAKYNKSVDVVKYTDEEYEKYLTDPMWTKEETDQL 177
+ G A N D SFAK+N V V +Y++++Y L P WTKEETD L
Sbjct: 105 GSAEGQDSA--ATADNSADKPEDSSFAKFNVRVSVPQYSEDQYNTNLKHPDWTKEETDYL 162
Query: 178 FELCERFDLRFIVIADRF------------------PSS--RTVEELKDRYYGVSRAILI 217
EL + FDLR+ +I DR+ P+S RT+E+LK RYY V+ ++
Sbjct: 163 LELAKDFDLRWPIIWDRYEYAPQQPEGETPDGMAVVPASKPRTMEDLKARYYEVAAKMMA 222
Query: 218 ARAPSPT------DVSGHPLVKDPYNVSQEVERKRALSMVLSQTKHQERKDAEVLAEAKR 271
+ P+ ++ L DP QE RKR L ++ + R++ +L E KR
Sbjct: 223 VQKPAQYMTRPEFELYEMMLHFDP---KQEQNRKRFAENTLKRSSDEAREEEALLLEIKR 279
Query: 272 I 272
I
Sbjct: 280 I 280
>sp|Q5B4T5|SWC4_EMENI SWR1-complex protein 4 OS=Emericella nidulans (strain FGSC A4 /
ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=swc4 PE=3
SV=1
Length = 586
Score = 96.7 bits (239), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 90/302 (29%), Positives = 144/302 (47%), Gaps = 64/302 (21%)
Query: 2 DAKDILGIPKTQLPTTQEKKSRPQKEPQRKPDGISREVYALTGGLAPLMPSIDVSQLKKR 61
D +D+L +P P +K QK +++P+GI+RE++AL G AP + +I+ ++ K R
Sbjct: 5 DVRDMLDLPAEGQPRPHKK----QKVVEKRPEGITRELFALLGERAPPI-AINENRYKGR 59
Query: 62 PPSDEKITWQWLPFTN-SARKDNLQLYHWVGFGNFWKCFHELGKRSFLMWGVVYNIFFNF 120
P WQ AR D+L L HW +R +
Sbjct: 60 P------KWQTKARVRPCARSDDLVLRHW--------------QREPESTNIPAIEDTRA 99
Query: 121 FGVSHAFQVRVVNGVPPTGD--YSFAKYNKSVDVV-KYTDEEYEKYLTDPMWTKEETDQL 177
G + G T D Y FAKYN + +YT +EY ++L W++EETD L
Sbjct: 100 EGETKE------QGEHKTADREYPFAKYNIKLKFSNRYTTDEYNRHLRSEDWSREETDYL 153
Query: 178 FELCERFDLRFIVIADRF------------------PSS--RTVEELKDRYYGVSRAILI 217
+L E +DLR++VIADR+ PS RT+E++K RYY V+ ++L
Sbjct: 154 MDLVEEYDLRWVVIADRYDFQPQRVDNTEETSSALVPSKQFRTMEQMKARYYFVAASMLA 213
Query: 218 ARAPSPTDVS------GHPLVKDPYNVSQEVERKRALSMVLSQTKHQERKDAEVLAEAKR 271
P P+++S ++K ++ +E RK ++ L++T + R++ +L E KR
Sbjct: 214 LEHP-PSEMSEAEFDLHERMMK--FDPERERHRKELAALQLNRTADEVREETVLLEELKR 270
Query: 272 IT 273
IT
Sbjct: 271 IT 272
>sp|Q4HY90|SWC4_GIBZE SWR1-complex protein 4 OS=Gibberella zeae (strain PH-1 / ATCC
MYA-4620 / FGSC 9075 / NRRL 31084) GN=SWC4 PE=3 SV=2
Length = 624
Score = 90.1 bits (222), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 126/283 (44%), Gaps = 69/283 (24%)
Query: 2 DAKDILGIPKTQLPTTQEKKSRPQKEPQRKPD--GISREVYALTGGLAPLMPSIDVSQLK 59
D +D+L + T + S+ QK +P+ G++REV L GG P+ +V+ K
Sbjct: 5 DVRDVLNLGDG---TVGPRSSKKQKLAAPRPNLKGLAREVQNL-GGDNPIAIVPEVTHFK 60
Query: 60 KRPPSDEKIT--WQWLPFTNSARKD-NLQLYHWVGFGNFWKCFHELGKRSFLMWGVVYNI 116
KR + K T W+ PF NSAR D N L HW + E
Sbjct: 61 KRRFTSRKPTAKWEMRPFKNSARSDSNFTLRHWRRKDEKQEGIDE--------------- 105
Query: 117 FFNFFGVSHAFQVRVVNGVPPT------GDYSFAKYNKSVDVVKYTDEEYEKYLTDPMWT 170
Q ++ G P D +FAKYN V V +Y++ +Y++ L WT
Sbjct: 106 ----------SQEQISQGDQPQPQKNELEDSAFAKYNVQVSVPQYSEGQYQQSLQHVDWT 155
Query: 171 KEETDQLFELCERFDLRFIVIADRF----PS--------------------SRTVEELKD 206
KEETD L EL + FDLR+ +I DR+ P+ SRT+E+LK
Sbjct: 156 KEETDYLLELAQDFDLRWPLIWDRYEWNPPATNGEADDDGDESKAIVPATRSRTLEDLKA 215
Query: 207 RYYGVSRAILIARAP----SPTDVSGHPLVKDPYNVSQEVERK 245
RYY V+ ++ A+ P + + S H L+ +N QE RK
Sbjct: 216 RYYEVASKMMAAQKPVQYMTQPEFSLHELMAH-FNPQQEKLRK 257
>sp|Q752S6|SWC4_ASHGO SWR1-complex protein 4 OS=Ashbya gossypii (strain ATCC 10895 / CBS
109.51 / FGSC 9923 / NRRL Y-1056) GN=SWC4 PE=3 SV=1
Length = 488
Score = 85.1 bits (209), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 75/287 (26%), Positives = 133/287 (46%), Gaps = 62/287 (21%)
Query: 2 DAKDILGI-PKTQLPTTQEKKSRP--QKEPQRKPDGISREVYALTGGLAPLMPSIDVSQL 58
D D+L I PK+ P Q +S K P+ + G+ RE+Y L G P + S+
Sbjct: 5 DIFDVLNIQPKSSSPHPQTSQSNAGASKTPKPQVTGMQRELYNLLGDNTPPIVIQPTSKF 64
Query: 59 KKRPPSDEKIT-WQWLPFTNSARKDNLQLYHWVGFGNFWKCFHELGKRSFLMWGVVYNIF 117
K R S K + W F + ++L HWV G + L
Sbjct: 65 KDRLASLTKPSPWTHTEFEATPY---VKLSHWVK-----------GSKELL--------- 101
Query: 118 FNFFGVSHAFQVRVVNGVPPTGDYSFAKYNKSVDVVKYTDEEYEKYLTD---------PM 168
G P SFAKY++ + + ++T+ EY++++ P
Sbjct: 102 ---------------EGQSPKS--SFAKYDQKLTLPEFTEGEYQEFMAQAAKGANSDAPT 144
Query: 169 WTKEETDQLFELCERFDLRFIVIADR--FPSSRTVEELKDRYYGVSRAILIARAPSPTDV 226
W+ EE LF+LC R+DLR+ ++ DR + SRT+E++++ +Y V + A+ P
Sbjct: 145 WSYEEVQYLFDLCRRYDLRWHIVYDRYMYDESRTMEDIREMFYTVCQKYFQAKDP----- 199
Query: 227 SGHPLVKD-PYNVSQEVERKRALSMVLSQTKHQERKDAEVLAEAKRI 272
G+PL+ Y+ QE++RK+ L+ +LS++ + ++ ++ E+++
Sbjct: 200 -GNPLLPSLAYSKDQEIQRKKYLTRLLSRSAAEIAEEEALIMESRKF 245
Score = 43.9 bits (102), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 33/49 (67%)
Query: 368 RVEQALQELGVNLKPKVPTKAVCAEHLELRKEILTLLNLQKQLQYKEAE 416
+V LQELG+ ++P +P+ AV H EL + I+TLL+L++Q EAE
Sbjct: 435 KVVSVLQELGLPVRPAMPSAAVVQHHDELLRRIVTLLDLKRQQDKLEAE 483
>sp|Q6C9M6|SWC4_YARLI SWR1-complex protein 4 OS=Yarrowia lipolytica (strain CLIB 122 / E
150) GN=SWC4 PE=3 SV=1
Length = 504
Score = 77.4 bits (189), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/205 (30%), Positives = 85/205 (41%), Gaps = 53/205 (25%)
Query: 2 DAKDILGIPKTQLPT---TQEKKSRPQKEP--QRKPDGISREVYALTGGLAP---LMPSI 53
D +D+L +P + P TQ+ K + P ++ DG+ RE++AL G P +
Sbjct: 6 DVRDVLDLPDLE-PNDKLTQQPKRQKLAAPVGGKRMDGMQRELFALMGENTPSVSVTKDS 64
Query: 54 DVSQLKKRPPSDEKIT-WQWLPFTNSARKDNLQLYHWVGFGNFWKCFHELGKRSFLMWGV 112
S K +P K+T W W PF N AR+D L L HWV G +
Sbjct: 65 HTSLFKDKPQWQAKLTPWMWTPFQNQAREDGLILSHWVRGGELTQ--------------- 109
Query: 113 VYNIFFNFFGVSHAFQVRVVNGVPPTGD-YSFAKYNKSVDVVKYTDEEYEKY-LTDPMWT 170
GD Y FA N + + T E+Y+ L P WT
Sbjct: 110 --------------------------GDQYPFAALNTQISFPELTQEDYDGLKLATPGWT 143
Query: 171 KEETDQLFELCERFDLRFIVIADRF 195
EET L LC FDLR+ VI DR+
Sbjct: 144 LEETRYLMHLCSEFDLRWPVIHDRW 168
>sp|Q6CSS3|SWC4_KLULA SWR1-complex protein 4 OS=Kluyveromyces lactis (strain ATCC 8585 /
CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37)
GN=SWC4 PE=3 SV=1
Length = 497
Score = 73.6 bits (179), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 122/265 (46%), Gaps = 56/265 (21%)
Query: 2 DAKDILGI-PKTQLPTTQEKKSRPQ----KEPQRKPDGISREVYALTGGLAPLMPSIDVS 56
D D+L I PK+ P + + P K + + GI RE+Y+L G P +P +
Sbjct: 5 DIFDVLNIRPKSSSPDAEAQTPTPSSGTVKHNKPQVTGIQRELYSLLGDNTPTVPIKQGN 64
Query: 57 QLKKRPPSDEKIT-WQWLPFTNSARKDNLQLYHWVGFGNFWKCFHELGKRSFLMWGVVYN 115
+ K S+ K + W ++ F ++ +L+L HWV K EL
Sbjct: 65 KFKDSLNSNIKPSPWSFVEFEANS---HLKLRHWV------KGSKEL------------- 102
Query: 116 IFFNFFGVSHAFQVRVVNGVPPTGDYSFAKYNKSVDVVKYTDEEYEKYLT--DPMWTKEE 173
V V P+ SFAK+N+ + + + ++E+++++ + W E
Sbjct: 103 ---------------VGATVEPS---SFAKFNQHLTIPSFDEQEFQQFMNGNNSEWKYGE 144
Query: 174 TDQLFELCERFDLRFIVIADRFP---SSRTVEELKDRYYGVSRAILIARAPSPTDVSGHP 230
LF+LC +D+++ VIADRF + R +++LK+ +Y V + + + P +S
Sbjct: 145 VQYLFDLCRNYDMKWFVIADRFNYNGTERKLDDLKEMFYTVCQWYFLYKDPDNPLISQLN 204
Query: 231 LVKDPYNVSQEVERKRALSMVLSQT 255
K+ +E+ERK+ L +L +T
Sbjct: 205 FPKE-----KELERKKYLERLLKRT 224
Score = 40.0 bits (92), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 8/74 (10%)
Query: 343 VYLRTYALEQMVQAASSSAGLRTIKRVEQALQELGVNLKPKVPTKAVCAEHLELRKEILT 402
VYLR+ + A + ++ LQELG+ ++P +P+ V EH +L K+I T
Sbjct: 427 VYLRSSKISTYKPAVQN--------KIVATLQELGLPVRPAMPSLEVVQEHEKLLKKIAT 478
Query: 403 LLNLQKQLQYKEAE 416
LL L+KQ+ AE
Sbjct: 479 LLELKKQVDKLTAE 492
>sp|P53201|SWC4_YEAST SWR1-complex protein 4 OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=SWC4 PE=1 SV=1
Length = 476
Score = 71.2 bits (173), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/306 (22%), Positives = 129/306 (42%), Gaps = 81/306 (26%)
Query: 2 DAKDILGIPKTQLPTTQEKKSRPQKEPQRKPD----GISREVYALTGGLAPLMPSIDVSQ 57
D D+L I + T + S P +P G+ RE++ L G P + +
Sbjct: 5 DIFDVLNIKQKSRSPTNGQVSVPSSSAANRPKPQVTGMQRELFNLLGENQPPVVIKSGNN 64
Query: 58 LKKRPPSDEKIT-WQWLPFTNSARKDNLQLYHWVGFGNFWKCFHELGKRSFLMWGVVYNI 116
K++ S K + W ++ F + +++ L HWV G +
Sbjct: 65 FKEKMLSTSKPSPWSFVEFKAN---NSVTLRHWVK-----------GSK----------- 99
Query: 117 FFNFFGVSHAFQVRVVNGVPPTGDYSFAKYNKSVDVVKYTDEEYEKYLTD---------- 166
++ P YS K+N+ + + +T EEYE ++ +
Sbjct: 100 -------------ELIGDTPKESPYS--KFNQHLSIPSFTKEEYEAFMNENEGTQKSVES 144
Query: 167 -----------------PMWTKEETDQLFELCERFDLRFIVIADRFP--SSRTVEELKDR 207
W+ EE + LF LC+++DLR+ +I DR+ +SRT+E+LK++
Sbjct: 145 EKNHNENFTNEKKDESKNSWSFEEIEYLFNLCKKYDLRWFLIFDRYSYNNSRTLEDLKEK 204
Query: 208 YYGVSRAILIARAPSPTDVSGHPLVKD-PYNVSQEVERKRALSMVLSQTKHQERKDAEVL 266
+Y R A PS +PL+ ++ +E+ERK+ L +LS++ + ++ ++
Sbjct: 205 FYYTCRNYFKASDPS------NPLLSSLNFSAEKEIERKKYLQRLLSRSAAEIAEEEALV 258
Query: 267 AEAKRI 272
E+K+
Sbjct: 259 VESKKF 264
>sp|Q6FTV1|SWC4_CANGA SWR1-complex protein 4 OS=Candida glabrata (strain ATCC 2001 / CBS
138 / JCM 3761 / NBRC 0622 / NRRL Y-65) GN=SWC4 PE=3
SV=1
Length = 532
Score = 62.8 bits (151), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 75/280 (26%), Positives = 125/280 (44%), Gaps = 37/280 (13%)
Query: 2 DAKDILGIPKTQLPTTQEKKSRPQKEPQRKPD--GISREVYALTG-GLAPLMPSIDVSQL 58
D D+L I + S P KP G+ RE+Y L G AP++ +++
Sbjct: 5 DIFDVLNIKQKSSSPNAASPSVPASGKSSKPQLTGMQRELYNLLGENEAPVVVQ-SMNRF 63
Query: 59 KKRPPSDEKIT-WQWLPFTNSARKDNLQLYHWVGFGNFWKCFHELGKRSFLMWGVVYNI- 116
K++ S+ K T W N + L L HWV G+ E + F + V I
Sbjct: 64 KEKLASNAKPTPWS---LANFKANEYLTLQHWVK-GSRELIGEEPQESEFKKYDVHLTIP 119
Query: 117 ------FFNFFGVSHAFQVRVVN---GVPPTGDYSFAKYNK--SVDVVKYTDEEYEKYLT 165
+ +F S+A + N V GD + + + D VK + + E T
Sbjct: 120 EFTEDEYNSFIPTSNAEENEKQNIGEKVEANGDSTDVNMTEEDTNDKVKESVPQDENKST 179
Query: 166 ------DPMWTKEETDQLFELCERFDLRFIVIADRFP---SSRTVEELKDRYYGVSRAIL 216
+ W E LF+LC+++DLR+ VI DR+ S+RT+E+LK ++Y VS+
Sbjct: 180 SDNKKNNEKWEYNEVKYLFDLCKKYDLRWFVIQDRYDYENSNRTLEDLKSKFYEVSKCYF 239
Query: 217 IARAPSPTDVSGHPLVKD-PYNVSQEVERKRALSMVLSQT 255
A+ P P+++ Y+ +E +RK+ L +L+++
Sbjct: 240 KAKKPD------DPMLQSLNYSKDKETQRKKYLERLLARS 273
Score = 36.6 bits (83), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 35/61 (57%), Gaps = 5/61 (8%)
Query: 360 SAGLRTIK-----RVEQALQELGVNLKPKVPTKAVCAEHLELRKEILTLLNLQKQLQYKE 414
SA + T K ++ QELG+ +P +PT V E+ L K+I TLL+++KQ+ E
Sbjct: 466 SAKITTFKPAIQNKIISTAQELGIPARPVMPTFEVVTEYEALLKKIATLLDIKKQIDKIE 525
Query: 415 A 415
A
Sbjct: 526 A 526
>sp|Q5AAJ7|SWC4_CANAL SWR1-complex protein 4 OS=Candida albicans (strain SC5314 / ATCC
MYA-2876) GN=SWC4 PE=3 SV=1
Length = 635
Score = 61.6 bits (148), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 64/111 (57%), Gaps = 8/111 (7%)
Query: 165 TDPMWTKEETDQLFELCERFDLRFIVIADRFPS-SRTVEELKDRYYGVSRAILIARAPSP 223
T+ WT +ET LFELC+ F+L++ +I DRFP+ +RT E+LK+++Y + IL +
Sbjct: 303 TESEWTYKETKHLFELCQAFELKWPIIHDRFPNPNRTAEDLKEQFYRICIKILENQK--- 359
Query: 224 TDVSGHPLVKD--PYNVSQEVERKRALSMVLSQTKHQERKDAEVLAEAKRI 272
L+ Y +E+ERK+ L +L +T + ++ ++ EA+R
Sbjct: 360 --NKNQALIDSLKAYCKPRELERKQYLENLLKRTPAEIAEEESLVIEARRF 408
>sp|Q5NZG9|DXR_AROAE 1-deoxy-D-xylulose 5-phosphate reductoisomerase OS=Aromatoleum
aromaticum (strain EbN1) GN=dxr PE=3 SV=1
Length = 403
Score = 37.0 bits (84), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 39/83 (46%), Gaps = 8/83 (9%)
Query: 149 SVDVVKYTDEEYEKYLTDPMWTKEETDQLFELCERFDLRFIVIADRFPSSRTVEELKDRY 208
++DV+ + +E + + + + ++LFELC RF RF V+ D S +L+ R
Sbjct: 25 TLDVLARHPDRFEAF---ALTAQIQVERLFELCLRFSPRFAVLVD----SAAASDLRQRL 77
Query: 209 YGVSRAILIARAPSP-TDVSGHP 230
A + P DV+ HP
Sbjct: 78 KAAGSATEVLAGPGALVDVAAHP 100
>sp|B2UR37|RHO_AKKM8 Transcription termination factor Rho OS=Akkermansia muciniphila
(strain ATCC BAA-835) GN=rho PE=3 SV=1
Length = 474
Score = 35.0 bits (79), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 27/45 (60%)
Query: 47 APLMPSIDVSQLKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVG 91
AP++ ID+++L+KRP +D + + LP N+A QL +G
Sbjct: 54 APVLEQIDINELRKRPLNDLQEMAEGLPIRNAASLTKSQLVFELG 98
>sp|Q6CNI5|SHE1_KLULA Mitotic spindle-associated protein SHE1 OS=Kluyveromyces lactis
(strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 /
NRRL Y-1140 / WM37) GN=SHE1 PE=3 SV=1
Length = 331
Score = 33.9 bits (76), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 8/104 (7%)
Query: 369 VEQALQELGVNLKPKVPTKAVCAEHLELRKEILTLLNL------QKQLQYKEAEGSSYRD 422
V Q L++ NL P V + HL E+ +++ +K+ Q ++ + Y
Sbjct: 29 VLQELEQRASNLLPNVSPEKTRLMHLGHDSEMKNSISMKFQDKHEKRFQVMDSIDTHYSL 88
Query: 423 GSYIDMPGTPKRSQRAGDQDRTFVPESINFGGERVGKRDQKRKG 466
Y MP TP + + D D T V ++N G K+ QK G
Sbjct: 89 AKYSGMPQTPGKRKNGSDADETSV--AVNLGSPGSVKKKQKSVG 130
>sp|O14299|WIS4_SCHPO MAP kinase kinase kinase wis4 OS=Schizosaccharomyces pombe (strain
972 / ATCC 24843) GN=wis4 PE=3 SV=1
Length = 1401
Score = 32.7 bits (73), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 21/117 (17%)
Query: 213 RAILIARAPSPTDVSGHPLVKDPYNVSQEVERKRALSMVLSQTKHQERKDAEVLA----E 268
RA+ + P TD ++ P + +K +LS ++ E K A++LA +
Sbjct: 1296 RAVDLLTHPWITDFRKKTIITMP---PATITKKTSLSHTIT-----EEKTAQLLAGRHDD 1347
Query: 269 AKRITDSRMASRAAEEPEMPVASHVGSESADRAVVLGDTVSPSSNIQLPSATVVPST 325
+K TDS +A+ EE +PVAS+VG + + +I LP A V P T
Sbjct: 1348 SKAETDS-LAASYKEESALPVASNVGLRQPNELRI--------DSINLPPAIVTPDT 1395
>sp|O64732|U87A1_ARATH UDP-glycosyltransferase 87A1 OS=Arabidopsis thaliana GN=UGT87A1
PE=2 SV=1
Length = 440
Score = 32.3 bits (72), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 8/107 (7%)
Query: 63 PSDEKI--TWQWLPFTNSARKDNLQLYHWVG---FGNFWKCFHELGKRSFLMWGVVYNI- 116
PS+ K+ ++P + R +LQ+ H F F K F EL K +L++ Y +
Sbjct: 147 PSESKLDEIVDYIPGLSPTRLSDLQILHGYSHQVFNIFKKSFGELYKAKYLLFPSAYELE 206
Query: 117 --FFNFFGVSHAFQVRVVNGVPPTGDYSFAKYNKSVDVVKYTDEEYE 161
+FF F V + P + S N+ +D K+ DE+ E
Sbjct: 207 PKAIDFFTSKFDFPVYSTGPLIPLEELSVGNENRELDYFKWLDEQPE 253
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.315 0.131 0.379
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 183,348,489
Number of Sequences: 539616
Number of extensions: 7922420
Number of successful extensions: 24164
Number of sequences better than 100.0: 42
Number of HSP's better than 100.0 without gapping: 23
Number of HSP's successfully gapped in prelim test: 19
Number of HSP's that attempted gapping in prelim test: 24065
Number of HSP's gapped (non-prelim): 82
length of query: 489
length of database: 191,569,459
effective HSP length: 122
effective length of query: 367
effective length of database: 125,736,307
effective search space: 46145224669
effective search space used: 46145224669
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 63 (28.9 bits)