BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 009022
(546 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q6NQ88|DDB2_ARATH Protein DAMAGED DNA-BINDING 2 OS=Arabidopsis thaliana GN=DDB2 PE=1
SV=1
Length = 557
Score = 780 bits (2015), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/509 (73%), Positives = 443/509 (87%), Gaps = 7/509 (1%)
Query: 4 QTRRMAFPRVVIERDTDTEQSSSEDEEEDREEGPFSESEEE--VTENGCEEKIEEDLDAK 61
++RR P +VI RDTD+E SSSE+EEE+ + PFSESEEE +NG + ++E++ K
Sbjct: 5 RSRRKRDPEIVIARDTDSELSSSEEEEEEEDNYPFSESEEEDEAVKNGGKIELEKN---K 61
Query: 62 RKGKAPITISL-KKVCKVCKKPGHEAGFKGATYIDCPMKPCFLCKMPGHTTMSCPHRVAT 120
KGKAPIT+ L KKVCKVCK+PGHEAGFKGATYIDCPMKPCFLCKMPGHTTMSCPHRV T
Sbjct: 62 AKGKAPITVKLIKKVCKVCKQPGHEAGFKGATYIDCPMKPCFLCKMPGHTTMSCPHRVVT 121
Query: 121 EYGVTPASHRNAGNPVEYVFERQLRPNMTYMKPAHVIPDQVNCAVIRYHSRRVTCLEFHP 180
++G+ P SHRN NP+++VF+RQL+P + +KP +VIPDQV+CAVIRYHSRRVTCLEFHP
Sbjct: 122 DHGILPTSHRNTKNPIDFVFKRQLQPRIPPIKPKYVIPDQVHCAVIRYHSRRVTCLEFHP 181
Query: 181 TNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNPTNDGTVYAASSDGTVS 240
T N+ILLSGDKKGQ+GVWDF KV EK VYGNIHS VNN+RF+PTND VY+ASSDGT+
Sbjct: 182 TKNNILLSGDKKGQIGVWDFGKVYEKNVYGNIHSVQVNNMRFSPTNDDMVYSASSDGTIG 241
Query: 241 CTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNFGFLYLVDARTNSR 300
TDLETG + +L+N+NP+GW G +W+MLYGMDIN EKGVVL ADNFGFL+++D RTN+
Sbjct: 242 YTDLETGTSSTLLNLNPDGWQGANSWKMLYGMDINSEKGVVLAADNFGFLHMIDHRTNNS 301
Query: 301 SGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGSSLCDLPHKRVVN 360
+GE ILIH++GSKV GL CNP+QPELLLSCGNDHFARIWD+R+L+ +SL DL HKRVVN
Sbjct: 302 TGEPILIHKQGSKVCGLDCNPVQPELLLSCGNDHFARIWDMRKLQPKASLHDLAHKRVVN 361
Query: 361 SAYFSPS-GSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRHLTPFRAEWDPKDP 419
SAYFSPS G+KILTT QDNR+RIWDSIFGNLD PSREIVHS+DFNRHLTPF+AEWDPKD
Sbjct: 362 SAYFSPSSGTKILTTCQDNRIRIWDSIFGNLDLPSREIVHSNDFNRHLTPFKAEWDPKDT 421
Query: 420 SESLAVIGRYISENYNGAALHPIDFIDITTGQLVAEVMDPNITTISPVNKLHPRDDVLAS 479
SESL VIGRYISENYNG ALHPIDFID + GQLVAEVMDPNITTI+PVNKLHPRDDVLAS
Sbjct: 422 SESLIVIGRYISENYNGTALHPIDFIDASNGQLVAEVMDPNITTITPVNKLHPRDDVLAS 481
Query: 480 GSSRSIFIWRPKEKSELVEQKEEMKIIVC 508
GSSRS+FIWRP++ +E+VE+K++ KII+C
Sbjct: 482 GSSRSLFIWRPQDNTEMVEEKKDKKIIIC 510
>sp|Q5ZJL7|DDB2_CHICK DNA damage-binding protein 2 OS=Gallus gallus GN=DDB2 PE=2 SV=1
Length = 507
Score = 172 bits (435), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 124/378 (32%), Positives = 197/378 (52%), Gaps = 29/378 (7%)
Query: 171 RRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEK-IVYGNIHSCIVNNIRFNPTNDGT 229
RRVTCLE+HPT+ + G K G + +WD+ +++ + G + +I+F+P
Sbjct: 125 RRVTCLEWHPTHPSTVAVGSKGGDIILWDYEVLTKTCFIKGKGPGDSLGDIKFSPYEAVK 184
Query: 230 VYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLY-GMDINPEKGVVLVADNFG 288
+Y AS DGT+S DLE G A+ +++ P+ H Y +D++ V+ DN G
Sbjct: 185 LYVASGDGTLSLQDLE-GRAVQVISRAPDCGHENHNVCCWYCSVDVSASCRAVVTGDNLG 243
Query: 289 FLYLVDARTNSRSGEAIL---IHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLE 345
+ L+ S SGE I +H+K KV + N LL + D +IWD+R ++
Sbjct: 244 NVVLL-----STSGEEIWKLKLHKK--KVTHVEFNSRCEWLLATASVDQTVKIWDLRNIK 296
Query: 346 AGSSLCD-LPHKRVVNSAYFSPS-GSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDF 403
++ LPH + VN+AYFSP+ G+K+L+T Q N +R++ + P I H H
Sbjct: 297 DKANFLHVLPHDKPVNAAYFSPTDGAKLLSTDQRNEIRVYSC--SDWTKPQHLIPHPHRQ 354
Query: 404 NRHLTPFRAEWDPKDPSESLAVIGRYISENYNGAA---LHPIDFIDITTGQLVAEVMDPN 460
+HLTP +A W P+ L V+GRY + G L +D D TG++V ++ DPN
Sbjct: 355 FQHLTPIKATWHPR---YDLIVVGRYPDPKFPGYTVNELRTVDIFDGNTGEMVCQLYDPN 411
Query: 461 ITTISPVNKLHPRDDVLASGSSRSIFIWRPKEKSELVEQKEE--MKIIVCGKADKKQKHK 518
+ I +NK +P D LASG +I IW + E+V +K+E +K + + +
Sbjct: 412 ASGIISLNKFNPMGDTLASGMGFNILIW---SREEMVMKKQEHLLKAMTEQGIGSRSLSR 468
Query: 519 FGDESEDSDDDTSKLKRK 536
G + + ++ TSKLK K
Sbjct: 469 RGGQRQ-ANPGTSKLKAK 485
Score = 33.1 bits (74), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 19/83 (22%), Positives = 39/83 (46%), Gaps = 5/83 (6%)
Query: 166 IRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNI--HSCIVNNIRFN 223
++ H ++VT +EF+ +L + V +WD + +K + ++ H VN F+
Sbjct: 258 LKLHKKKVTHVEFNSRCEWLLATASVDQTVKIWDLRNIKDKANFLHVLPHDKPVNAAYFS 317
Query: 224 PTNDGTVYAASSDGTV---SCTD 243
PT+ + + + SC+D
Sbjct: 318 PTDGAKLLSTDQRNEIRVYSCSD 340
>sp|Q66JG1|DDB2_XENTR DNA damage-binding protein 2 OS=Xenopus tropicalis GN=ddb2 PE=2
SV=1
Length = 501
Score = 166 bits (420), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 109/342 (31%), Positives = 175/342 (51%), Gaps = 24/342 (7%)
Query: 171 RRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNPTNDGTV 230
RRVT LE+HPT+ + + G K G + +WD+ +++ ++ G + ++F+P N +
Sbjct: 122 RRVTTLEWHPTHPNTVAVGSKGGDIILWDYEELNNTLIPGIGAGGCITGMKFDPFNPNQL 181
Query: 231 YAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLY-GMDINPEKGVVLVADNFGF 289
Y +S G+ D + N W M Y +D++ E+ V+ DN G
Sbjct: 182 YTSSVAGSTVLQDFSGRNIQTFTNTE--------DWAMWYCSLDVSAERQCVVTGDNVGN 233
Query: 290 LYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGSS 349
+ L++ T + + +H+K KV + NP LL S D ++WD+R ++ SS
Sbjct: 234 VVLLE--TCGKEIWKLRLHKK--KVTHVEFNPRCDWLLASASVDQTVKLWDLRNIKDKSS 289
Query: 350 -LCDLPHKRVVNSAYFSP-SGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRHL 407
L LPH R VNSAYFSP G+K+LTT Q + +R++ + + P I H H +HL
Sbjct: 290 YLYTLPHARGVNSAYFSPWDGAKLLTTDQHSEIRVYSAC--DWAKPQHIIPHPHRQFQHL 347
Query: 408 TPFRAEWDPKDPSESLAVIGRY---ISENYNGAALHPIDFIDITTGQLVAEVMDPNITTI 464
T +A W P+ L V+GRY + Y L +D D G +V ++ DP + I
Sbjct: 348 TAIKATWHPR---YDLIVVGRYPDPLFPGYMSDELRTVDVFDGQKGNIVCQLYDPYASGI 404
Query: 465 SPVNKLHPRDDVLASGSSRSIFIWRPKEKSELVEQKEEMKII 506
+NK +P D+LASG +I IW +E +++Q+E MK +
Sbjct: 405 VSLNKFNPMGDLLASGMGFNILIWS-REILLMMKQEEMMKAL 445
Score = 35.8 bits (81), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 2/61 (3%)
Query: 166 IRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEK--IVYGNIHSCIVNNIRFN 223
+R H ++VT +EF+P + +L S V +WD + +K +Y H+ VN+ F+
Sbjct: 247 LRLHKKKVTHVEFNPRCDWLLASASVDQTVKLWDLRNIKDKSSYLYTLPHARGVNSAYFS 306
Query: 224 P 224
P
Sbjct: 307 P 307
>sp|Q99J79|DDB2_MOUSE DNA damage-binding protein 2 OS=Mus musculus GN=Ddb2 PE=1 SV=1
Length = 432
Score = 156 bits (395), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 100/327 (30%), Positives = 164/327 (50%), Gaps = 21/327 (6%)
Query: 171 RRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSC-IVNNIRFNPTNDGT 229
RR T L +HPT+ L G K G + +W+F + I I + + ++FN N
Sbjct: 112 RRTTSLAWHPTHPSTLAVGSKGGDIMIWNFGIKDKPIFLKGIGAGGSITGLKFNHLNTNQ 171
Query: 230 VYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNFGF 289
+A+S +GT D + + + N + W +D++ + VV+ DN G
Sbjct: 172 FFASSMEGTTRLQDFKGNILRVYTSSN-----SCKVW--FCSLDVSAKSRVVVTGDNMGH 224
Query: 290 LYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGSS 349
+ L+ T+ + + +H+K KV + NP LL + D +IWD+R+++ S
Sbjct: 225 VILLS--TDGKELWNLRMHKK--KVAHVALNPCCDWLLATASIDQTVKIWDLRQIKGKDS 280
Query: 350 -LCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRHLT 408
L LPH+ VN+A FSP G+++LTT Q+N +R++ + DSP I H H +HLT
Sbjct: 281 FLYSLPHRHPVNAACFSPDGARLLTTDQNNEIRVYSA--SQWDSPLNLISHPHRHFQHLT 338
Query: 409 PFRAEWDPKDPSESLAVIGRYISENYNGAA---LHPIDFIDITTGQLVAEVMDPNITTIS 465
P +A W + +L V+GRY N L ID D ++G+++ ++ DP + I+
Sbjct: 339 PIKATWHSR---HNLIVVGRYPDPNLKSCVPYELRTIDVFDGSSGKMMCQLYDPGYSGIT 395
Query: 466 PVNKLHPRDDVLASGSSRSIFIWRPKE 492
+N+ +P D LAS I IW +E
Sbjct: 396 SLNEFNPMGDTLASTMGYHILIWSQEE 422
>sp|Q0VBY8|DDB2_BOVIN DNA damage-binding protein 2 OS=Bos taurus GN=DDB2 PE=2 SV=1
Length = 426
Score = 155 bits (391), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 100/328 (30%), Positives = 165/328 (50%), Gaps = 23/328 (7%)
Query: 171 RRVTCLEFHPTNNHILLSGDKKGQVGVWDF-YKVSEKIVYGNIHSCIVNNIRFNPTNDGT 229
RR T L +HPT+ L G K G + +W+F K + G + ++FNP N
Sbjct: 111 RRATSLAWHPTHPSTLAVGSKGGDILLWNFGIKDKPTFIKGIGAGGSITGMKFNPLNTNQ 170
Query: 230 VYAASSDGTVSCTDLE-TGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNFG 288
+ +S +GT D + L + + N W +D++ + VV+ DN G
Sbjct: 171 FFTSSMEGTTRLQDFKGNTLRVFASSDTCNVW--------FCSLDVSVKSRVVVTGDNVG 222
Query: 289 FLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGS 348
+ L++ + R + +H+K KV + NP LL + D +IWD+R++ S
Sbjct: 223 HVILLN--MDGRELWNLRMHKK--KVTHVALNPCCDWLLATASVDQTVKIWDLRQVRGKS 278
Query: 349 S-LCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRHL 407
S L LPH+ VN+A+FSP G+++LTT Q + +R++ + D P I H H +HL
Sbjct: 279 SFLHSLPHRHPVNAAHFSPDGAQLLTTDQKSEIRVYSAC--QWDCPPSLIPHPHRHFQHL 336
Query: 408 TPFRAEWDPKDPSESLAVIGRYISENYNGAALH---PIDFIDITTGQLVAEVMDPNITTI 464
TP +A W P+ +L V+GRY N+ + H ID D ++G+++ ++ DP + I
Sbjct: 337 TPIKASWHPR---YNLIVVGRYPDPNFKSCSPHELRTIDVFDGSSGKIMYQLYDPESSGI 393
Query: 465 SPVNKLHPRDDVLASGSSRSIFIWRPKE 492
+N+ +P D LAS I +W P++
Sbjct: 394 MSLNEFNPMGDTLASVMGYHILVWSPED 421
>sp|Q92466|DDB2_HUMAN DNA damage-binding protein 2 OS=Homo sapiens GN=DDB2 PE=1 SV=1
Length = 427
Score = 152 bits (385), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 98/328 (29%), Positives = 161/328 (49%), Gaps = 23/328 (7%)
Query: 171 RRVTCLEFHPTNNHILLSGDKKGQVGVWDF-YKVSEKIVYGNIHSCIVNNIRFNPTNDGT 229
RR T L +HPT+ + G K G + +W+F K + G + ++FNP N
Sbjct: 112 RRATSLAWHPTHPSTVAVGSKGGDIMLWNFGIKDKPTFIKGIGAGGSITGLKFNPLNTNQ 171
Query: 230 VYAASSDGTVSCTDLETGLALSLMNVNP-NGWHGPRTWRMLYGMDINPEKGVVLVADNFG 288
YA+S +GT D + + + + N W +D++ +V+ DN G
Sbjct: 172 FYASSMEGTTRLQDFKGNILRVFASSDTINIW--------FCSLDVSASSRMVVTGDNVG 223
Query: 289 FLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGS 348
+ L++ + + + +H+K KV + NP L + D +IWD+R++ +
Sbjct: 224 NVILLN--MDGKELWNLRMHKK--KVTHVALNPCCDWFLATASVDQTVKIWDLRQVRGKA 279
Query: 349 S-LCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRHL 407
S L LPH+ VN+A FSP G+++LTT Q + +R++ + D P I H H +HL
Sbjct: 280 SFLYSLPHRHPVNAACFSPDGARLLTTDQKSEIRVYSA--SQWDCPLGLIPHPHRHFQHL 337
Query: 408 TPFRAEWDPKDPSESLAVIGRYISENYNGAA---LHPIDFIDITTGQLVAEVMDPNITTI 464
TP +A W P+ +L V+GRY N+ L ID D +G+++ ++ DP + I
Sbjct: 338 TPIKAAWHPR---YNLIVVGRYPDPNFKSCTPYELRTIDVFDGNSGKMMCQLYDPESSGI 394
Query: 465 SPVNKLHPRDDVLASGSSRSIFIWRPKE 492
S +N+ +P D LAS I IW +E
Sbjct: 395 SSLNEFNPMGDTLASAMGYHILIWSQEE 422
>sp|Q2YDS1|DDB2_DANRE DNA damage-binding protein 2 OS=Danio rerio GN=ddb2 PE=1 SV=2
Length = 496
Score = 134 bits (336), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/344 (29%), Positives = 167/344 (48%), Gaps = 24/344 (6%)
Query: 171 RRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSE-KIVYGNIHSCIVNNIRFNPTNDGT 229
RRVT LE+HPT+ + G K G + +WD+ +++ + G + ++FN N
Sbjct: 114 RRVTSLEWHPTHPTTVAVGSKGGDIILWDYDVLNKTSFIQGMGPGDAITGMKFNQFNTNQ 173
Query: 230 VYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYG-MDINPEKGVVLVADNFG 288
++ +S G + D + + +W Y +D++ + ++ D+ G
Sbjct: 174 LFVSSIWGATTLRDFSGSVIQVFAKTD--------SWDYWYCCVDVSVSRQMLATGDSTG 225
Query: 289 FLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRL-EAG 347
L L+ + E + H+ +KV NP L+ + D ++WD+R + +
Sbjct: 226 RLLLLGLDGHEIFKEKL--HK--AKVTHAEFNPRCDWLMATSSVDATVKLWDLRNIKDKN 281
Query: 348 SSLCDLPHKRVVNSAYFSPSGS-KILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRH 406
S + ++PH++ VN+AYF+P+ S K+LTT Q N +R++ S + P + I+H H +H
Sbjct: 282 SYIAEMPHEKPVNAAYFNPTDSTKLLTTDQRNEIRVYSSY--DWSKPDQIIIHPHRQFQH 339
Query: 407 LTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDFIDITTGQLVAEVMDPNITTISP 466
LTP +A W P L V GRY + ID D +G LV ++ DPN I
Sbjct: 340 LTPIKATWHPM---YDLIVAGRYPDDQLLLNDKRTIDIYDANSGGLVHQLRDPNAAGIIS 396
Query: 467 VNKLHPRDDVLASGSSRSIFIWRPKEKSELVEQKEEMKIIVCGK 510
+NK P DVLASG +I IW ++ V +K+ IV G+
Sbjct: 397 LNKFSPTGDVLASGMGFNILIWNREDTLSSVNRKQ---TIVTGE 437
>sp|Q4KLQ5|WDR76_XENLA WD repeat-containing protein 76 OS=Xenopus laevis GN=wdr76 PE=2
SV=1
Length = 580
Score = 103 bits (256), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 91/325 (28%), Positives = 159/325 (48%), Gaps = 40/325 (12%)
Query: 172 RVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYKVS--EKIVYGNIHSCIVNNIRFNPTNDG 228
R+ + HP+ + I+ +GDK GQ+G+WD +S + + HS ++ + F+P N
Sbjct: 273 RIFSVAIHPSESRTIVAAGDKWGQIGLWDLADLSGNDGVYVFEPHSRPISCMSFSPVNSA 332
Query: 229 TVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMD-INPEKGVVLVADNF 287
+++ S DGTV C D+ + + + + D ++ + V++V+
Sbjct: 333 QLFSLSYDGTVRCGDVCRSVFDEVYRDEQDSFSS---------FDYLSADCSVLIVSHWD 383
Query: 288 GFLYLVDARTNSRSGEA-ILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLE- 345
+L +VD RT S E ++ + ++ +H P+ +L + G I+D+R+L+
Sbjct: 384 SYLSVVDCRTPGTSCEQRASLNMRSARTTSVH--PVNRDLCVVAGAGDVC-IFDVRQLKK 440
Query: 346 -AGSSLCDLPHKRVVNSAYFSP-SGSKILTTSQDNRLRIWDSIFGNLDSPS-REIVHSHD 402
A L H + V SAYFSP +G++ILTT D+ +R++DS ++P H+++
Sbjct: 441 KAQPVLSLTGHSKSVASAYFSPVTGNRILTTCADDYIRVYDSSSLCSEAPLLTAFRHNNN 500
Query: 403 FNRHLTPFRAEWDPKDPSESLAVIG-----RYISENYNGAALHPIDFIDITTGQLVAEVM 457
R LT FRA WDPK ES V+G R I E YN + F D
Sbjct: 501 TGRWLTRFRAVWDPKQ--ESCFVVGSMARPRQI-EVYNESGKLEHSFWD----------- 546
Query: 458 DPNITTISPVNKLHPRDDVLASGSS 482
++ ++ +N +HP ++L G+S
Sbjct: 547 SEHLGSVCSINAMHPTRNLLVGGNS 571
>sp|B2KIQ4|WDR76_RHIFE WD repeat-containing protein 76 OS=Rhinolophus ferrumequinum
GN=WDR76 PE=3 SV=2
Length = 630
Score = 103 bits (256), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 97/321 (30%), Positives = 151/321 (47%), Gaps = 33/321 (10%)
Query: 173 VTCLEFHPTNNHILLS-GDKKGQVGVWDF-YKVSEKIVY-GNIHSCIVNNIRFNPTNDGT 229
+ + FHP+ L++ G K GQVG+WD ++ E VY HS V+ + F+P N
Sbjct: 323 IFSIAFHPSEIKTLVAAGAKSGQVGLWDLTHQPKEDGVYVFQPHSQPVSCLYFSPANPAH 382
Query: 230 VYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNF-G 288
+ + S DGT+ C D+ + V + R+ L D E + ++ G
Sbjct: 383 MLSLSYDGTLRCGDISSA-------VFEEVYRNERS--SLSSFDFLAEDASTFIVGHWDG 433
Query: 289 FLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGS 348
+ LVD RT S E LI K+ +H +P+Q + ++ G I+D RRL
Sbjct: 434 SISLVDRRTPGASYEK-LISSSLRKIRTVHVHPVQRQYFITAGLRD-THIYDARRLTPSG 491
Query: 349 S--LCDLP-HKRVVNSAYFSP-SGSKILTTSQDNRLRIWDSIFGNLDSP-SREIVHSHDF 403
S L L H + + SAYFSP +G++I+TT D +LR +DS + P I H+
Sbjct: 492 SQPLISLTEHTKSIASAYFSPLTGNRIVTTCADCKLRFFDSSCISSQIPLLTTIRHNTIT 551
Query: 404 NRHLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHP--IDFIDITTGQLVAEVMDPNI 461
R LT RA WDPK E +I G+ HP ++ T Q+ + + +
Sbjct: 552 GRWLTRLRAVWDPKQ--EDCVII---------GSMAHPRQVEIFHETGEQVHSFLGGECL 600
Query: 462 TTISPVNKLHPRDDVLASGSS 482
++ +N +HP +LA G+S
Sbjct: 601 VSVCSINAVHPTRYILAGGNS 621
>sp|A6PWY4|WDR76_MOUSE WD repeat-containing protein 76 OS=Mus musculus GN=Wdr76 PE=2 SV=1
Length = 622
Score = 100 bits (248), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 89/318 (27%), Positives = 153/318 (48%), Gaps = 30/318 (9%)
Query: 173 VTCLEFHPTNNHILLS-GDKKGQVGVWDFYKVSEKIVY-GNIHSCIVNNIRFNPTNDGTV 230
++ + HP+ L++ G K GQ+G+WD + SE +Y HS V+ + F+PTN +
Sbjct: 315 ISSVALHPSEVRTLVAAGAKSGQIGLWDLTQQSEDAMYVFYAHSRYVSCLSFSPTNPAHL 374
Query: 231 YAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNFGFL 290
+ S DGT+ C D + + V N + P ++ D + +LV G L
Sbjct: 375 LSLSYDGTLRCGDFSSAV---FEEVYRNEGNSPSSF------DFLNDSSSLLVGHWDGHL 425
Query: 291 YLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGSS- 349
LVD RT S E + K+ +H +P+ + ++ G ++D R L++ S
Sbjct: 426 SLVDRRTPGTSYEK-FFNSSLEKIRTVHVHPLSRQYFVTAGLRD-VHVYDARFLKSRGSQ 483
Query: 350 -LCDLP-HKRVVNSAYFSP-SGSKILTTSQDNRLRIWDSIFGNLDSPSREIV-HSHDFNR 405
L L H + + SAYFSP +G++++TT D +LR++DS + P + H+ R
Sbjct: 484 PLISLTEHSKSIASAYFSPVTGNRVVTTCADCKLRVFDSSSISSQLPLLSTIRHNTVTGR 543
Query: 406 HLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDF-IDITTGQLVAEVMDPNITTI 464
LT F+A WDPK E ++ G+ HP + +G+ V + + ++
Sbjct: 544 WLTRFQAVWDPKQ--EDCFIV---------GSMDHPRRVEVFHESGKNVHSLWGECLVSV 592
Query: 465 SPVNKLHPRDDVLASGSS 482
++ +HP +LA G+S
Sbjct: 593 CSLSAVHPTRYILAGGNS 610
>sp|Q9H967|WDR76_HUMAN WD repeat-containing protein 76 OS=Homo sapiens GN=WDR76 PE=1 SV=2
Length = 626
Score = 95.1 bits (235), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 90/322 (27%), Positives = 149/322 (46%), Gaps = 36/322 (11%)
Query: 173 VTCLEFHPTNNHILLS-GDKKGQVGVWDFYKVSEK--IVYGNIHSCIVNNIRFNPTNDGT 229
+ + HP+ L++ G K GQVG+ D + ++ + + HS V+ + F+P N
Sbjct: 316 IFSMALHPSETRTLVAVGAKFGQVGLCDLTQQPKEDGVYVFHPHSQPVSCLYFSPANPAH 375
Query: 230 VYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNF-G 288
+ + S DGT+ C D + + + R+ D E L+ ++ G
Sbjct: 376 ILSLSYDGTLRCGDFSRAIFEEV-------YRNERS--SFSSFDFLAEDASTLIVGHWDG 426
Query: 289 FLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCG--NDHFARIWDIRRLEA 346
+ LVD RT S E + G K+ +H +P+ + ++ G + H I+D RRL +
Sbjct: 427 NMSLVDRRTPGTSYEKLTSSSMG-KIRTVHVHPVHRQYFITAGLRDTH---IYDARRLNS 482
Query: 347 GSS--LCDLP-HKRVVNSAYFSP-SGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHD 402
S L L H + + SAYFSP +G++++TT D LRI+DS + P + +
Sbjct: 483 RRSQPLISLTEHTKSIASAYFSPLTGNRVVTTCADCNLRIFDSSCISSKIPLLTTIRHNT 542
Query: 403 FN-RHLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDF-IDITTGQLVAEVMDPN 460
F R LT F+A WDPK E ++ G+ HP I TG+ V
Sbjct: 543 FTGRWLTRFQAMWDPKQ--EDCVIV---------GSMAHPRRVEIFHETGKRVHSFGGEY 591
Query: 461 ITTISPVNKLHPRDDVLASGSS 482
+ ++ +N +HP +LA G+S
Sbjct: 592 LVSVCSINAMHPTRYILAGGNS 613
>sp|Q0UYV9|YD156_PHANO WD repeat-containing protein SNOG_03055 OS=Phaeosphaeria nodorum
(strain SN15 / ATCC MYA-4574 / FGSC 10173) GN=SNOG_03055
PE=3 SV=1
Length = 519
Score = 94.7 bits (234), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 102/407 (25%), Positives = 171/407 (42%), Gaps = 76/407 (18%)
Query: 133 GNPVEYVFE------------RQLRPNMTYMKPAHVIPDQVNCAVIRYHSRRVTCLEFHP 180
NP E F+ R LR M+ ++ + + I+ R+ + HP
Sbjct: 137 ANPYERTFDFDDVKETTDKELRALREKMSGLQ----LWEDFEPNEIKITPERIYAMGMHP 192
Query: 181 TNNH-ILLSGDKKGQVGVWDF-YKVSE--------------KIVYGNIHSCIVNNIRFNP 224
T ++ +GDK G +G+ D KV+E I H+ ++ +F+P
Sbjct: 193 TTEKPLVFAGDKLGNLGICDASQKVAEVKQEDDEDADNEGPTITTLKPHTRTIHTFQFSP 252
Query: 225 TNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVA 284
+ +Y+AS D +V DL G+A+ + G P + L G++I+ + L
Sbjct: 253 HDSNALYSASYDSSVRKLDLAKGVAVEVY-----GPSDPNEDQPLSGLEISKDDANTLYF 307
Query: 285 DNF-GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRR 343
G + D RT S E + K K+ G +P QP L+ + D +IWD+R+
Sbjct: 308 STLDGRFGIYDMRTPSDQAELFQLSEK--KIGGFSLHPQQPHLVATASLDRTLKIWDLRK 365
Query: 344 LEAGSSLCDLP------HKRVVNSAYFSPSGSKILTTSQDNRLRI--------WDSIFGN 389
+ +G LP R+ S S ++ T S D+ ++I W +
Sbjct: 366 I-SGKGDSRLPALVGEHESRLSVSHAAWNSAGQVATASYDDTIKIHDFSKSAEWATGTAL 424
Query: 390 LDS---PSREIVHSHDFNRHLTPFRAEWD--PKDPSESLAVIGRYISENYNGAALHPIDF 444
D+ PS + H++ R +T RA+W P+D + R+ N N F
Sbjct: 425 TDADMKPSVVVPHNNQTGRWVTILRAQWQQFPQDG------VQRFCIGNMN-------RF 471
Query: 445 IDITT--GQLVAEVMDPNITTISPVNKLHPRDDVLASGS-SRSIFIW 488
+DI T GQ +A++ IT + V K HP D +A+G+ S + +W
Sbjct: 472 VDIYTAKGQQLAQLGGDGITAVPAVAKFHPTLDWVAAGTASGKLCLW 518
>sp|Q2HHH2|YD156_CHAGB WD repeat-containing protein CHGG_00332 OS=Chaetomium globosum
(strain ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 /
NRRL 1970) GN=CHGG_00332 PE=3 SV=1
Length = 524
Score = 90.5 bits (223), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 93/387 (24%), Positives = 173/387 (44%), Gaps = 63/387 (16%)
Query: 142 RQLRPNMTYMKPAHVIPDQVNCAVIRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDF 200
+ LR M+ +K P V A + +RV L FHPT + I+ +GDK+G +GV+D
Sbjct: 160 KDLRLRMSGLKLYEKWP--VQGAYPKLVPQRVYSLGFHPTESKPIIFAGDKEGAMGVFDA 217
Query: 201 YK---------------VSEKIVYG-NIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDL 244
+ + + I+ HS + + F+P + VY+AS D ++ DL
Sbjct: 218 SQEPVKAEDDDDDEEAEIPDPIISAFKTHSRTITSFHFSPVDANAVYSASYDSSIRKLDL 277
Query: 245 ETGLALSLMNVNPNGWHGPRTWRMLYGMDI-NPEKGVVLVADNFGFLYLVDARTNSRSGE 303
+ G++ P + +D+ + +++ + G L D RT S + E
Sbjct: 278 DKGVSTEAFAPADADEDLP-----ISAIDMPTSDPNMIIFSTLQGTLGRHDLRTKSSTAE 332
Query: 304 AILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLE------AGSSLCDLPHKR 357
+ K+ G +P QP L+ + D +IWD+R+++ A + L +
Sbjct: 333 --IWGLTDQKIGGFSLHPAQPHLVATASLDRTLKIWDLRKIQGKGDARAPALLGTHDSRL 390
Query: 358 VVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDS-------------PSREIVHSHDFN 404
V+ A +S +G + T+S D+R++I++ F + D P+R+I H++
Sbjct: 391 SVSHASWSSAG-HVATSSYDDRIKIYN--FPDADKWTAGAALTEAQMEPARQIPHNNQTG 447
Query: 405 RHLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDFIDI--TTGQLVAEVMDPNIT 462
R +T + +W + P + L +++ N N F+D+ G+ +A++ IT
Sbjct: 448 RWVTILKPQWQ-RSPRDGLQ---KFVIGNMN-------RFVDVFAADGEQLAQLGGDGIT 496
Query: 463 TISPVNKLHPRDDVLASGS-SRSIFIW 488
+ V HP D +A G+ S + +W
Sbjct: 497 AVPAVAHFHPTMDWVAGGNGSGKLCLW 523
>sp|A9X1C6|WDR76_PAPAN WD repeat-containing protein 76 OS=Papio anubis GN=WDR76 PE=3 SV=1
Length = 626
Score = 89.4 bits (220), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 86/319 (26%), Positives = 147/319 (46%), Gaps = 30/319 (9%)
Query: 173 VTCLEFHPTNNHILLS-GDKKGQVGVWDFYKVSEK--IVYGNIHSCIVNNIRFNPTNDGT 229
+ + HP+ L++ G K GQVG+ D + ++ + + HS V+ + F+P N
Sbjct: 316 IFSMALHPSETRTLVAVGAKFGQVGLCDLTQQPKEDGVYVFHPHSQPVSCLYFSPANPAH 375
Query: 230 VYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNFGF 289
+ + S DGT+ C D + + + R+ + + ++V G
Sbjct: 376 ILSLSYDGTLRCGDFSRAIFEEV-------YRNERSSFSSFDFLSE-DASTLIVGHWDGN 427
Query: 290 LYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGSS 349
+ LVD RT S E + G K+ +H +P+ + ++ G I+D R+L++ S
Sbjct: 428 MSLVDRRTPGTSYEKLTSSSMG-KIRTVHVHPVHRQYFITAGLRD-THIYDARQLKSRGS 485
Query: 350 --LCDLP-HKRVVNSAYFSP-SGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFN- 404
L L H + + SAYFSP +G++++TT D LRI+DS + P + + F
Sbjct: 486 QPLISLTEHTKSIASAYFSPLTGNRVVTTCADCNLRIFDSSCVSSKIPLLTTIRHNTFTG 545
Query: 405 RHLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDF-IDITTGQLVAEVMDPNITT 463
R LT F+A WDPK E ++ G+ HP I TG+ V + +
Sbjct: 546 RWLTRFQAMWDPKQ--EDCVIV---------GSMAHPRRVEIFHETGKRVHSFGGECLVS 594
Query: 464 ISPVNKLHPRDDVLASGSS 482
+ +N +HP +LA G+S
Sbjct: 595 VCSINAMHPTRYILAGGNS 613
>sp|Q7S1H9|YD156_NEUCR WD repeat-containing protein NCU09302/NCU11420 OS=Neurospora crassa
(strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257
/ FGSC 987) GN=NCU09302 PE=3 SV=1
Length = 521
Score = 85.9 bits (211), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 87/363 (23%), Positives = 158/363 (43%), Gaps = 63/363 (17%)
Query: 166 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYKVSEKIVYGN------------- 211
I+ +R+ + FHPT I+ +GDK+G +GV+D + + KI +
Sbjct: 181 IKIVPQRIYSMCFHPTEEKPIIFAGDKEGAMGVFDASQPTPKIEDDDEDAEYPDPIISAF 240
Query: 212 -IHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLY 270
HS +++ F+PT+ +Y+AS D ++ DL+ G++ + + + P +
Sbjct: 241 KTHSRTISSFHFSPTDANAIYSASYDSSIRKLDLDKGISTEIFAPSSSSEDLP-----IS 295
Query: 271 GMDI-NPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLS 329
+DI + +++ + G L D RT S E + K+ G +P P L+ +
Sbjct: 296 AIDIPTTDPNMIIFSTLHGSLGRQDQRTKPSSAE--IWGLTDHKIGGFSLHPRHPYLVAT 353
Query: 330 CGNDHFARIWDIRRLEAGSSLCDLPH--------KRVVNSAYFSPSGSKILTTSQDNRLR 381
D +IWD+R++ + DL H R+ S S I T+S D+R++
Sbjct: 354 ASLDRTLKIWDLRKI---TGKGDLRHPALLGEHESRLSVSHASWSSSGHIATSSYDDRIK 410
Query: 382 IWD-----------SIFGNLDSPSREIVHSHDFNRHLTPFRAEW--DPKDPSESLAVIGR 428
I+ I P+ EI H++ R +T + +W +P+D + A+
Sbjct: 411 IYSFPSAGEWKAGHDIPAKEMQPTVEIPHNNQTGRWVTILKPQWQRNPQDGWQKFAI--- 467
Query: 429 YISENYNGAALHPIDFIDITT--GQLVAEVMDPNITTISPVNKLHPRDDVLASGS-SRSI 485
N N F+D+ G+ +A++ IT + V HP D +A G+ S +
Sbjct: 468 ---GNMN-------RFVDVYAEDGEQLAQLGGDGITAVPAVAHFHPTKDWVAGGTASGKL 517
Query: 486 FIW 488
+W
Sbjct: 518 CLW 520
>sp|Q6C0U2|YD156_YARLI WD repeat-containing protein YALI0F21747g OS=Yarrowia lipolytica
(strain CLIB 122 / E 150) GN=YALI0F21747g PE=3 SV=1
Length = 539
Score = 82.8 bits (203), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 85/373 (22%), Positives = 150/373 (40%), Gaps = 77/373 (20%)
Query: 172 RVTCLEFHP-TNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNPTNDGTV 230
R+ HP T+ I+L+GDK G +G+WD +E + +H + + F+ ++ +
Sbjct: 186 RIYITAVHPGTDKRIVLAGDKIGVLGIWDVDSDNEPLQL-QLHHATIPALCFDQNSNDIL 244
Query: 231 YAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNFGFL 290
Y+AS DG+V +L+TG + ++++ + + NP+ ++ + G L
Sbjct: 245 YSASYDGSVRSLELKTGKSGDVLDL-----EAKKNASVGVSDVANPQPHLLYASTLCGHL 299
Query: 291 YLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEA---- 346
D RT S E +++ K K+ G +PI LL + D RIWD+R E
Sbjct: 300 IRKDLRTKSTEYETLILGEK--KIGGFSVDPINTHLLATGSLDRSMRIWDLRATETARTI 357
Query: 347 -GSSLCD----LPHKRVVNSAYFSPSGS------KILTTSQDNRLRIW------------ 383
G + D +PH + + ++ S S + +I+ D+ + I+
Sbjct: 358 PGGEVIDTQFQMPHLQAIYNSRLSVSSTDWNLAGQIVCNGYDDTINIFNQSDYFLDMLND 417
Query: 384 ------------------------DSIFGNLDSPSREIVHSHDFNRHLTPFRAEW--DPK 417
D + PS I H+ R +T +A W P
Sbjct: 418 GNGTEPVKKTRRTRNSKLAEPEISDQELPEIKKPSVRIKHNCQTGRWVTILKARWQQQPL 477
Query: 418 DPSESLAV--IGRYISENYNGAALHPIDFIDITTGQLVAEVMDPNITTISPVNKLHPRDD 475
D + A+ + RYI + Y+G TG +A + D +T + HP +
Sbjct: 478 DGVQKFAIANMNRYI-DIYSG------------TGHQLAHLGDALMTAVPSALAFHPTQN 524
Query: 476 VLASGSSRSIFIW 488
+A G+S W
Sbjct: 525 WIAGGNSSGKMYW 537
>sp|Q1E6Q0|YD156_COCIM WD repeat-containing protein CIMG_01763 OS=Coccidioides immitis
(strain RS) GN=CIMG_01763 PE=3 SV=1
Length = 525
Score = 79.7 bits (195), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 88/370 (23%), Positives = 151/370 (40%), Gaps = 73/370 (19%)
Query: 166 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYKVSEK-------------IVYGN 211
I+ R+ + FHPT + ++ +GDK G +G+ D + ++ I
Sbjct: 181 IKITRERIYSMLFHPTESKPLIFAGDKTGHLGILDASQQPDQNESDEEDEYPDPTITTIK 240
Query: 212 IHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYG 271
H+ ++ + +P++ +Y+ S D ++ DLE +A + P L G
Sbjct: 241 PHTNTISAMHIHPSDPSKLYSGSYDSSIRALDLEKSVATEAYAPASSSDDEP-----LSG 295
Query: 272 MDINPEKGVVLVADNF-GFLYLVDARTNSRS----GEAILIHRKGSKVVG-LHCNPIQPE 325
+D+ P VL GF D R +S++ G A+ ++ K +G P QP
Sbjct: 296 IDMAPTDPHVLYFTTLDGFFGRHDMRVSSKANPGDGSAVTFYQLSEKKIGGFSLCPTQPH 355
Query: 326 LLLSCGNDHFARIWDIRRL---------EAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQ 376
+ + D ++WD+R L E SSL V+ A F+ G +I TTS
Sbjct: 356 YMATASLDRTMKVWDLRHLSTKHPKPVGEHESSLS-------VSHAAFNQKG-QIATTSY 407
Query: 377 DNRLRIWDSIFGNLD-------------SPSREIVHSHDFNRHLTPFRAEWD--PKDPSE 421
DN ++I+D L +P I H+ + +T R +W P P E
Sbjct: 408 DNSIKIYDLASKGLKDWKPNHTLSEDEMAPDAVIRHNCQTGKWVTILRPQWQACPDSPVE 467
Query: 422 SLAVIGRYISENYNGAALHPIDFIDI--TTGQLVAEVMDPNITTISPVNKLHPRDDVLAS 479
R+ N N F+DI +TG+ +A++ IT + V H + +
Sbjct: 468 ------RFCIGNMN-------RFVDIYTSTGEQLAQLGADVITAVPAVAVFHRTQNWVVG 514
Query: 480 GS-SRSIFIW 488
G+ S + +W
Sbjct: 515 GTGSAKVCLW 524
>sp|P0CS56|YD156_CRYNJ WD repeat-containing protein CNI03070 OS=Cryptococcus neoformans
var. neoformans serotype D (strain JEC21 / ATCC MYA-565)
GN=CNI03070 PE=3 SV=1
Length = 595
Score = 68.2 bits (165), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 93/419 (22%), Positives = 153/419 (36%), Gaps = 120/419 (28%)
Query: 172 RVTCLEFHPTNNHIL-LSGDKKGQVGVWDFY-----------------------KVSEKI 207
RV + HP L L GDK GQ+G+WD + E
Sbjct: 189 RVFSMCVHPEKTKTLVLVGDKYGQLGIWDALGPPMEKPENEDDTSGLLRAEGEDEYQEGR 248
Query: 208 VYGNIHSCIVNNI---RFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPR 264
V+ + + N+I + +P N +++ + D ++ T + L +
Sbjct: 249 VW-RVQAHAKNSISCMKVDPVNGSGLFSTAYDCSLRHLSFSTLQSTELFSFQDED----- 302
Query: 265 TWRMLYGMDINPEKGVVLVADNFGFLYLVDARTNSR-SGEAILIHR---KGSKVVGLHCN 320
++ D+ P + D G + D R + R SG + + +G+K+ G+ N
Sbjct: 303 --LLINHFDLLPSAQEAWMVDKNGGISHWDTRESKRESGRRRWVVQEEGRGAKLGGVSVN 360
Query: 321 PIQPELLLSCGNDHFARIWDIRRLEAGSS----------------------LCDLPHK-- 356
P+ P L+ + GND RIWD R L + SS LPH
Sbjct: 361 PLMPHLICTAGNDQHVRIWDTRHLFSISSNLVPSAAAIEEEEEGTSTLSGQSSSLPHDTH 420
Query: 357 ------------------------------RVVNSAYFSPSGSKILTTSQDNRLRIW--- 383
+ +SAY+ P G +ILTTS D+ LR++
Sbjct: 421 PTRESDYSTVTSYLASPRGKGLMRAKWQHGKSCSSAYWDPWGRRILTTSYDDHLRVFNID 480
Query: 384 -------DSIFGNLDS-----PSREIVHSHDFNRHLTPFRAEWDPKDPSESLAVIGRYIS 431
D G+L P++ + H+ R LT RA+W SL + Y+
Sbjct: 481 PGSSLVDDRAVGSLLQPNGFKPTKVVRHNCQTGRWLTILRAQW-------SLNM--EYMP 531
Query: 432 ENYNGAALHPIDFIDITTGQLVAEVMDPNITTISPVNKLHPR--DDVLASGSSRSIFIW 488
G +D + TG+ + + ++T + V HP D V+ +S I +W
Sbjct: 532 HFTVGNMKRTLDVVS-ATGEKIVGLWTDDVTAVPTVTASHPNIVDRVVGGNTSGRIQLW 589
>sp|P0CS57|YD156_CRYNB WD repeat-containing protein CNBH2930 OS=Cryptococcus neoformans
var. neoformans serotype D (strain B-3501A) GN=CNBH2930
PE=3 SV=1
Length = 595
Score = 68.2 bits (165), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 93/419 (22%), Positives = 153/419 (36%), Gaps = 120/419 (28%)
Query: 172 RVTCLEFHPTNNHIL-LSGDKKGQVGVWDFY-----------------------KVSEKI 207
RV + HP L L GDK GQ+G+WD + E
Sbjct: 189 RVFSMCVHPEKTKTLVLVGDKYGQLGIWDALGPPMEKPENEDDTSGLLRAEGEDEYQEGR 248
Query: 208 VYGNIHSCIVNNI---RFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPR 264
V+ + + N+I + +P N +++ + D ++ T + L +
Sbjct: 249 VW-RVQAHAKNSISCMKVDPVNGSGLFSTAYDCSLRHLSFSTLQSTELFSFQDED----- 302
Query: 265 TWRMLYGMDINPEKGVVLVADNFGFLYLVDARTNSR-SGEAILIHR---KGSKVVGLHCN 320
++ D+ P + D G + D R + R SG + + +G+K+ G+ N
Sbjct: 303 --LLINHFDLLPSAQEAWMVDKNGGISHWDTRESKRESGRRRWVVQEEGRGAKLGGVSVN 360
Query: 321 PIQPELLLSCGNDHFARIWDIRRLEAGSS----------------------LCDLPHK-- 356
P+ P L+ + GND RIWD R L + SS LPH
Sbjct: 361 PLMPHLICTAGNDQHVRIWDTRHLFSISSNLVPSAAAIEEEEEGTSTLSGQSSSLPHDTH 420
Query: 357 ------------------------------RVVNSAYFSPSGSKILTTSQDNRLRIW--- 383
+ +SAY+ P G +ILTTS D+ LR++
Sbjct: 421 PTRESDYSTVTSYLASPRGKGLMRAKWQHGKSCSSAYWDPWGRRILTTSYDDHLRVFNID 480
Query: 384 -------DSIFGNLDS-----PSREIVHSHDFNRHLTPFRAEWDPKDPSESLAVIGRYIS 431
D G+L P++ + H+ R LT RA+W SL + Y+
Sbjct: 481 PGSSLVDDRAVGSLLQPNGFKPTKVVRHNCQTGRWLTILRAQW-------SLNM--EYMP 531
Query: 432 ENYNGAALHPIDFIDITTGQLVAEVMDPNITTISPVNKLHPR--DDVLASGSSRSIFIW 488
G +D + TG+ + + ++T + V HP D V+ +S I +W
Sbjct: 532 HFTVGNMKRTLDVVS-ATGEKIVGLWTDDVTAVPTVTASHPNIVDRVVGGNTSGRIQLW 589
>sp|A1DNV8|YD156_NEOFI WD repeat-containing protein NFIA_058290 OS=Neosartorya fischeri
(strain ATCC 1020 / DSM 3700 / FGSC A1164 / NRRL 181)
GN=NFIA_058290 PE=3 SV=1
Length = 527
Score = 66.2 bits (160), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 84/368 (22%), Positives = 146/368 (39%), Gaps = 69/368 (18%)
Query: 166 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYKVSEKIVYG-------------- 210
I+ R+ + FHP+ ++ +GDK G +GV D + EK +
Sbjct: 183 IKLTPERIYTMTFHPSEAKPLIFAGDKMGNLGVLDASQ--EKPISAVKQEDDEDAEDDDP 240
Query: 211 -------NIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGP 263
H+ ++++ +P+ +Y+AS D ++ DLE ++ P
Sbjct: 241 DPVLTTLKPHTRTISSMHVHPSKPTHLYSASYDSSIRELDLEKTTSVEKYAPESTSDDIP 300
Query: 264 RTWRMLYGMDINPEKGVVLVADNF-GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPI 322
+ G+D+ P+ L G D R + RS A K+ G P
Sbjct: 301 -----ISGIDMAPDDPNTLYWTTLDGAFGRYDTRASRRSAVATW-QLSEKKIGGFSLFPT 354
Query: 323 QPELLLSCGNDHFARIWDIRRLEAGSSLCDLPH--KRVVNSAYFSPSGSKILTTSQDNRL 380
P + D R+WDIR+L + H + V+ A F+ +G +I T+S D+ L
Sbjct: 355 HPHFFATASLDRTMRLWDIRKLSHDEPVPVGEHVSRLSVSHAAFNSAG-QIATSSYDDTL 413
Query: 381 RIWDSIFGNLD---------------SPSREIVHSHDFNRHLTPFRAEW--DPKDPSESL 423
+I+D FG+ P + H+ R +T R +W +P+ P
Sbjct: 414 KIYD--FGSKGIAAWKPGHTLSDAEMKPDTIVRHNCQTGRWVTILRPQWQANPQSP---- 467
Query: 424 AVIGRYISENYNGAALHPIDFIDI--TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGS 481
I R+ N N F+D+ ++G +A++ IT + V H + +A G+
Sbjct: 468 --IQRFCIGNMN-------RFVDVYSSSGDQLAQLGGDGITAVPAVAVFHRSTNWIAGGT 518
Query: 482 -SRSIFIW 488
S I +W
Sbjct: 519 ASGKICLW 526
>sp|Q4WLU1|YD156_ASPFU WD repeat-containing protein AFUA_6G12330 OS=Neosartorya fumigata
(strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100)
GN=AFUA_6G12330 PE=3 SV=1
Length = 527
Score = 65.9 bits (159), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 82/364 (22%), Positives = 143/364 (39%), Gaps = 61/364 (16%)
Query: 166 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYK-------------------VSE 205
I+ R+ + FHP+ ++ +GDK G +GV D +
Sbjct: 183 IKLTPERIYTMTFHPSEAKPLIFAGDKMGNLGVLDASQEKPTSAVKQEDDEDAEDDDPDP 242
Query: 206 KIVYGNIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRT 265
+ H+ ++++ +P+ +Y+AS D ++ DLE ++ P
Sbjct: 243 VLTTLKPHTRTISSLHIHPSKPTHLYSASYDSSIRELDLEKTTSVEKYAPESTSDDIP-- 300
Query: 266 WRMLYGMDINPEKGVVLVADNF-GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQP 324
+ G+D+ P+ L G D R + RS A K+ G P P
Sbjct: 301 ---ISGIDMAPDDPNTLYWTTLDGAFGRYDTRASRRSAVATW-QLSEKKIGGFSLFPTHP 356
Query: 325 ELLLSCGNDHFARIWDIRRLEAGSSLCDLPH--KRVVNSAYFSPSGSKILTTSQDNRLRI 382
+ D R+WDIR+L + H + V+ A F+ +G +I T+S D+ L+I
Sbjct: 357 HFFATASLDRTMRLWDIRKLSHDDPVPVGEHVSRLSVSHAAFNSAG-QIATSSYDDTLKI 415
Query: 383 WDSIFGNLD---------------SPSREIVHSHDFNRHLTPFRAEWDPKDPSESLAVIG 427
+D FG+ P + H+ R +T R +W +P S I
Sbjct: 416 YD--FGSKGIAAWEPGYTLSDAEMKPDTIVRHNCQTGRWVTILRPQWQ-ANPQSS---IQ 469
Query: 428 RYISENYNGAALHPIDFIDI--TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGS-SRS 484
R+ N N F+D+ ++G +A++ IT + V H + +A G+ S
Sbjct: 470 RFCIGNMN-------RFVDVYSSSGDQLAQLGGDGITAVPAVAVFHRSTNWIAGGTASGK 522
Query: 485 IFIW 488
I +W
Sbjct: 523 ICLW 526
>sp|P49695|PKWA_THECU Probable serine/threonine-protein kinase PkwA OS=Thermomonospora
curvata GN=pkwA PE=3 SV=1
Length = 742
Score = 65.1 bits (157), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 96/217 (44%), Gaps = 21/217 (9%)
Query: 169 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNPTNDG 228
H+ V + F P + +L SG V +WD E+ V+ H+ V +I F+P DG
Sbjct: 500 HTDWVRAVAFSP-DGALLASGSDDATVRLWDVAAAEERAVFEG-HTHYVLDIAFSP--DG 555
Query: 229 TVYAASS-DGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNF 287
++ A+ S DGT ++ TG +++ + + +Y + +P+ +V
Sbjct: 556 SMVASGSRDGTARLWNVATGTEHAVLKGHTD---------YVYAVAFSPDGSMVASGSRD 606
Query: 288 GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAG 347
G + L D T E ++ VV L +P +L G+D +WD+ EA
Sbjct: 607 GTIRLWDVATGK---ERDVLQAPAENVVSLAFSPDGS--MLVHGSDSTVHLWDVASGEAL 661
Query: 348 SSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWD 384
+ H V + FSP G+ + + S D +R+WD
Sbjct: 662 HTFEG--HTDWVRAVAFSPDGALLASGSDDRTIRLWD 696
Score = 60.5 bits (145), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 57/221 (25%), Positives = 97/221 (43%), Gaps = 21/221 (9%)
Query: 164 AVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFN 223
AV H+ V + F P + ++ SG + G +W+ +E V H+ V + F+
Sbjct: 537 AVFEGHTHYVLDIAFSP-DGSMVASGSRDGTARLWNVATGTEHAVLKG-HTDYVYAVAFS 594
Query: 224 PTNDGTVYAASS-DGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVL 282
P DG++ A+ S DGT+ D+ TG ++ + + +P+ G +L
Sbjct: 595 P--DGSMVASGSRDGTIRLWDVATGKERDVLQAPAEN---------VVSLAFSPD-GSML 642
Query: 283 VADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIR 342
V + ++L D SGEA+ + V LL S +D R+WD+
Sbjct: 643 VHGSDSTVHLWDVA----SGEALHTFEGHTDWVRAVAFSPDGALLASGSDDRTIRLWDVA 698
Query: 343 RLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIW 383
E ++L H V+S F P G+ + + S+D +RIW
Sbjct: 699 AQEEHTTLEG--HTEPVHSVAFHPEGTTLASASEDGTIRIW 737
Score = 37.4 bits (85), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 37/77 (48%), Gaps = 6/77 (7%)
Query: 326 LLLSCGNDHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDS 385
++ S D AR+W++ + L H V + FSP GS + + S+D +R+WD
Sbjct: 557 MVASGSRDGTARLWNVATGTEHAVLKG--HTDYVYAVAFSPDGSMVASGSRDGTIRLWDV 614
Query: 386 IFGN----LDSPSREIV 398
G L +P+ +V
Sbjct: 615 ATGKERDVLQAPAENVV 631
Score = 36.2 bits (82), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 2/64 (3%)
Query: 326 LLLSCGNDHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDS 385
LL S +D R+WD+ E + H V FSP GS + + S+D R+W+
Sbjct: 515 LLASGSDDATVRLWDVAAAEERAVFEG--HTHYVLDIAFSPDGSMVASGSRDGTARLWNV 572
Query: 386 IFGN 389
G
Sbjct: 573 ATGT 576
>sp|B0Y8S0|YD156_ASPFC WD repeat-containing protein AFUB_078330 OS=Neosartorya fumigata
(strain CEA10 / CBS 144.89 / FGSC A1163) GN=AFUB_078330
PE=3 SV=1
Length = 528
Score = 65.1 bits (157), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 82/365 (22%), Positives = 143/365 (39%), Gaps = 62/365 (16%)
Query: 166 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYK--------------------VS 204
I+ R+ + FHP+ ++ +GDK G +GV D +
Sbjct: 183 IKLTPERIYTMTFHPSEAKPLIFAGDKMGNLGVLDASQEKPTSAVKQEDDEEDAEDDDPD 242
Query: 205 EKIVYGNIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPR 264
+ H+ ++++ +P+ +Y+AS D ++ DLE ++ P
Sbjct: 243 PVLTTLKPHTRTISSLHIHPSKPTHLYSASYDSSIRELDLEKTTSVEKYAPESTSDDIP- 301
Query: 265 TWRMLYGMDINPEKGVVLVADNF-GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQ 323
+ G+D+ P+ L G D R + RS A K+ G P
Sbjct: 302 ----ISGIDMAPDDPNTLYWTTLDGAFGRYDTRASRRSAVATW-QLSEKKIGGFSLFPTH 356
Query: 324 PELLLSCGNDHFARIWDIRRLEAGSSLCDLPH--KRVVNSAYFSPSGSKILTTSQDNRLR 381
P + D R+WDIR+L + H + V+ A F+ +G +I T+S D+ L+
Sbjct: 357 PHFFATASLDRTMRLWDIRKLSHDDPVPVGEHVSRLSVSHAAFNSAG-QIATSSYDDTLK 415
Query: 382 IWDSIFGNLD---------------SPSREIVHSHDFNRHLTPFRAEWDPKDPSESLAVI 426
I+D FG+ P + H+ R +T R +W +P S I
Sbjct: 416 IYD--FGSKGIAAWEPGYTLSDAEMKPDTIVRHNCQTGRWVTILRPQWQ-ANPQSS---I 469
Query: 427 GRYISENYNGAALHPIDFIDI--TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGS-SR 483
R+ N N F+D+ ++G +A++ IT + V H + +A G+ S
Sbjct: 470 QRFCIGNMN-------RFVDVYSSSGDQLAQLGGDGITAVPAVAVFHCSTNWIAGGTASG 522
Query: 484 SIFIW 488
I +W
Sbjct: 523 KICLW 527
>sp|A1CU75|YD156_ASPCL WD repeat-containing protein ACLA_085580 OS=Aspergillus clavatus
(strain ATCC 1007 / CBS 513.65 / DSM 816 / NCTC 3887 /
NRRL 1) GN=ACLA_085580 PE=3 SV=1
Length = 531
Score = 63.5 bits (153), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 83/376 (22%), Positives = 144/376 (38%), Gaps = 82/376 (21%)
Query: 166 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYKVSEKIVYGNI------------ 212
I+ R+ + FHP+ ++ +GDK G +GV D S++ +I
Sbjct: 184 IKVTPERIYTMTFHPSEAKPLIFAGDKMGNLGVLD---ASQERPVSSIKHEDGDEEEQED 240
Query: 213 -------------HSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNG 259
H+ ++++ +P+ +Y AS D ++ DLE S+ P+
Sbjct: 241 DDDPDPVLTTLKPHTRTISSMHIHPSKPTHLYTASYDSSIRELDLEK--TTSVETYAPDS 298
Query: 260 WHGPRTWRMLYGMDINPEKGVVLVADNFGFLYLV---------DARTNSRSGEAILIHRK 310
D P G+ + AD+ LY D R + R+ A
Sbjct: 299 -----------PSDDVPISGIDMAADDPNTLYWTTLDGAFGRYDTRASRRTAVATW-QLS 346
Query: 311 GSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGSSLCDLPH--KRVVNSAYFSPSG 368
K+ G P P + D R+WD+R+L L H + V+ A F+ +G
Sbjct: 347 EKKIGGFSLYPTHPHFFATASLDRTMRLWDLRKLSHDDPLPVGEHLSRLSVSHAAFNSAG 406
Query: 369 SKILTTSQDNRLRIWDSIFGNLDS-------------PSREIVHSHDFNRHLTPFRAEWD 415
++ T+S D+ L+I+D + S P + H+ R +T R +W
Sbjct: 407 -QVATSSYDDSLKIYDFGAKGIASWEQGHTLSDAEMKPDTVVRHNCQTGRWVTILRPQWQ 465
Query: 416 PKDPSESLAVIGRYISENYNGAALHPIDFIDI--TTGQLVAEVMDPNITTISPVNKLHPR 473
S I R+ N N F+D+ ++G +A++ IT + V H
Sbjct: 466 ANPQSH----IQRFCIGNMN-------RFVDVYSSSGDQLAQLGGDGITAVPAVAVFHRS 514
Query: 474 DDVLASGS-SRSIFIW 488
+ +A G+ S I +W
Sbjct: 515 KNWIAGGTASGKICLW 530
>sp|Q6BRR2|SEC31_DEBHA Protein transport protein SEC31 OS=Debaryomyces hansenii (strain
ATCC 36239 / CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968)
GN=SEC31 PE=3 SV=2
Length = 1265
Score = 62.0 bits (149), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 102/222 (45%), Gaps = 33/222 (14%)
Query: 169 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCI--VNNIRFNPTN 226
HS V L F+P +H+L++G G++ +WD K +E V G + + V ++ +N +
Sbjct: 117 HSGPVKTLSFNPNQDHVLVTGGSNGEIFIWDTKKFTEPSVPGQAMTPMDEVTSVAWNNSV 176
Query: 227 DGTVYAASSDGTVSCTDLETGLALSLMNVN-PNG--------WHGPRTWRMLYGMDINPE 277
+A + G S DL++ + ++ N P+G WH ++ +++ D
Sbjct: 177 SHIFASAGNGGYTSIWDLKSKREVLHLSYNGPSGRANFSCVAWHPTQSTKLITASD---- 232
Query: 278 KGVVLVADNFGFLYLVDARTNSRSGEAIL-IHRKGSKVVGLHCNPIQPELLLSCGNDHFA 336
D + D R N+ + E I+ H+KG V+ L PELL+S G D+
Sbjct: 233 ------NDGCPLILTWDLR-NANAPEKIMEGHKKG--VLSLDWCKHDPELLISSGKDNST 283
Query: 337 RIWDIRRLEAGSSLCDLPHKRVVNSAY---FSPSGSKILTTS 375
+W+ + G L + P N A+ F+P+ +I TS
Sbjct: 284 MLWNPIK---GEKLGEYP--TTANWAFHTKFAPAAPEIFATS 320
>sp|A5DB75|SEC31_PICGU Protein transport protein SEC31 OS=Meyerozyma guilliermondii
(strain ATCC 6260 / CBS 566 / DSM 6381 / JCM 1539 / NBRC
10279 / NRRL Y-324) GN=SEC31 PE=3 SV=2
Length = 1266
Score = 62.0 bits (149), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/225 (24%), Positives = 102/225 (45%), Gaps = 29/225 (12%)
Query: 169 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCI--VNNIRFNPTN 226
HS V L+F+P H+LLSG GQ+ VWD K+S+ + G + + ++ + +N +
Sbjct: 116 HSGPVKTLQFNPLQEHVLLSGGSNGQIFVWDTKKLSDPVAPGKAMTPMDEISCVSWNNSV 175
Query: 227 DGTVYAASSDGTVSCTDLETGLALSLMNVNPN----GWHGPRTWRMLYGMDINPEKGVVL 282
+ G S DL++ + ++ + N WH ++ ++ V
Sbjct: 176 SHIFATTGNSGYTSIWDLKSKREVLHLSYSANFSCVAWHPTQSTKL-----------VTA 224
Query: 283 VADNFGFLYLVDARTNSRSGEAILI-HRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDI 341
++ L L N+ + E I+ H+KG ++ L PE+L+S G D+ +W+
Sbjct: 225 TGNDSDALILTWDLKNANAPEKIMRGHKKG--ILSLDWCKQDPEILISSGKDNATMLWNP 282
Query: 342 RRLEAGSSLCDLPHKRVVNSAY---FSPSGSKILTTSQ-DNRLRI 382
+ G L + P N A+ F+P+ +I T+ D ++ I
Sbjct: 283 IK---GEKLGEYP--TTANWAFHTRFAPAAPEIFATASFDGKIVI 322
Score = 32.3 bits (72), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 46/223 (20%), Positives = 89/223 (39%), Gaps = 37/223 (16%)
Query: 185 ILLSGDKKGQVGVWDFYKV-------SEKIVYGNIHSCIVNNIRFNPTNDGTVYAASSDG 237
+L + G + +WD ++ I + HS V ++FNP + + + S+G
Sbjct: 81 VLAGAFENGTIELWDVQELITSKDLQKASIFKSSAHSGPVKTLQFNPLQEHVLLSGGSNG 140
Query: 238 TVSCTD-------LETGLALSLMN-VNPNGWHGPRTWRMLYGMDINPEKGVVLVADNFGF 289
+ D + G A++ M+ ++ W+ N + N G+
Sbjct: 141 QIFVWDTKKLSDPVAPGKAMTPMDEISCVSWN-------------NSVSHIFATTGNSGY 187
Query: 290 LYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQP-ELLLSCGNDHFARI--WDIRRLEA 346
+ D + S +L + + +P Q +L+ + GND A I WD++ A
Sbjct: 188 TSIWDLK----SKREVLHLSYSANFSCVAWHPTQSTKLVTATGNDSDALILTWDLKNANA 243
Query: 347 GSSLCDLPHKRVVNSAYFSPSGSKILTTS-QDNRLRIWDSIFG 388
+ HK+ + S + +IL +S +DN +W+ I G
Sbjct: 244 PEKIMRG-HKKGILSLDWCKQDPEILISSGKDNATMLWNPIKG 285
>sp|Q16960|DYI3_HELCR Dynein intermediate chain 3, ciliary OS=Heliocidaris crassispina
PE=2 SV=1
Length = 597
Score = 61.2 bits (147), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 96/229 (41%), Gaps = 14/229 (6%)
Query: 173 VTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNI----HSCIVNNIRFNPTNDG 228
+ CLE++P + H+L+ G GQV WD K S+ + + H + I
Sbjct: 218 LVCLEYNPKDVHVLIGGCYNGQVAFWDTRKGSQAVEMSPVEHSHHDPVYKTIWLQSKTGT 277
Query: 229 TVYAASSDGTVSCTDL-ETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGV-VLVADN 286
++AS+DG V D+ + G + ++P+ + + ++ P +V
Sbjct: 278 ECFSASTDGQVLWWDMRKLGEPTEKLIMDPSKKGKMENAQGVISLEYEPTIPTKFMVGTE 337
Query: 287 FGFLYLVDARTNSRSGEAILIHRKG-SKVVGLHCNPIQPELLLSCGNDHFARIW--DIRR 343
G + + + + + + I+++ V L NP P+ L+ G D ARIW DIR
Sbjct: 338 QGTIISCNRKAKTPPEKIVAIYKEHIGPVYSLQRNPFFPKNFLTVG-DWTARIWSEDIR- 395
Query: 344 LEAGSSLCDLPHKRVVNSAYFSPSGSKI-LTTSQDNRLRIWDSIFGNLD 391
S + H + +SP + TT D L +WD +F D
Sbjct: 396 --DSSIMWTKYHMSYLTDGCWSPVRPAVFFTTKMDGSLDVWDYLFKQKD 442
>sp|Q4PGT8|YD156_USTMA WD repeat-containing protein UM00675 OS=Ustilago maydis (strain 521
/ FGSC 9021) GN=UM00675 PE=3 SV=1
Length = 637
Score = 60.5 bits (145), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 66/285 (23%), Positives = 116/285 (40%), Gaps = 70/285 (24%)
Query: 171 RRVTCLEFHP-TNNHILLSGDKKGQVGVWDFYKVS-------------------EKIVYG 210
+R+ + +HP T+ ++ GDK+G +GVWD V+ E+ G
Sbjct: 188 KRIYSMAYHPSTDKDLVFVGDKEGSIGVWDAAPVAFASNRNGVKTADDQDEDAEERFPEG 247
Query: 211 -----NIHS-CIVNNIRFNPTNDGTVYAASSDGTVSCTDLETG-----------LALSLM 253
+H+ V I+F+P N +V ++S D TV DL T + LS+
Sbjct: 248 KAWTLQVHARSPVTCIKFDPVNHNSVLSSSYDSTVRKLDLATAKSEEIWAGEEDVLLSIF 307
Query: 254 NVNPNGWHGPRTWRMLYGMDINP--EKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKG 311
+V P T +Y NP ++ + +AD+ G L +D R +R G +
Sbjct: 308 DV-----LSPSTHPSVYMDTPNPSLDERSMWIADHRGGLLHIDLRERTRRGNNTRRWQVC 362
Query: 312 SKVVG-LHCNPIQPELLLSCGNDHFARIWDIRRLEA-GSSLCDLPH-------------- 355
K +G + N + P + + D R++D+R L + D P+
Sbjct: 363 EKKIGAMSVNRLAPHCIATASLDQHIRLFDVRALASVVKQTADAPYNYKGVDADDLESAQ 422
Query: 356 ----------KRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNL 390
++ S FSP G +++ S D+ +++W G+L
Sbjct: 423 TKAQFASSKARQACTSVDFSPRGDQLVGVSYDDVVKVWSMEPGSL 467
>sp|Q6CKE8|PRP46_KLULA Pre-mRNA-splicing factor PRP46 OS=Kluyveromyces lactis (strain ATCC
8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 /
WM37) GN=PRP46 PE=3 SV=1
Length = 434
Score = 60.1 bits (144), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 59/245 (24%), Positives = 105/245 (42%), Gaps = 24/245 (9%)
Query: 165 VIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKI-VYGNIHSCIVNNIRFN 223
VI H+ V C+ P +N +G + +WD KI + G++ S V +I +
Sbjct: 117 VINGHTGWVRCVCVDPVDNEWFATGSNDTTIKIWDLAAGKLKITLIGHVMS--VRDIAIS 174
Query: 224 PTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLV 283
+ +++AS D V C DLE A+ + + +G H +D++P ++
Sbjct: 175 KRHP-YMFSASEDKLVKCWDLERNTAIRDFHGHLSGVH---------TVDVHPSLDIIAT 224
Query: 284 ADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRR 343
A + L D R+ S E +++ S + + C P+ P+ ++SC D R+WDI
Sbjct: 225 AGRDAVVRLWDIRSRS---EIMVLPGHKSPINKVKCLPVDPQ-IISCSGDATVRLWDIIA 280
Query: 344 LEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWD----SIFGNLDSPSREIVH 399
+A L H R + P+ + S N +R W + N S + I++
Sbjct: 281 GKASKVLTH--HSRNIRDLTLHPAEFSFASVST-NDVRSWKLPEGQLLTNFQSQNTGILN 337
Query: 400 SHDFN 404
+ N
Sbjct: 338 TVSIN 342
Score = 43.9 bits (102), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 9/101 (8%)
Query: 316 GLHCNPIQPEL--LLSCGNDHFARIWDIRRLEAGSSLCDLP-HKRVVNSAYFSPSGSKIL 372
G+H + P L + + G D R+WDIR + S + LP HK +N P +I+
Sbjct: 209 GVHTVDVHPSLDIIATAGRDAVVRLWDIR---SRSEIMVLPGHKSPINKVKCLPVDPQII 265
Query: 373 TTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRHLTPFRAE 413
+ S D +R+WD I G S+ + H R LT AE
Sbjct: 266 SCSGDATVRLWDIIAGK---ASKVLTHHSRNIRDLTLHPAE 303
>sp|Q75BY3|PRP46_ASHGO Pre-mRNA-splicing factor PRP46 OS=Ashbya gossypii (strain ATCC
10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056) GN=PRP46
PE=3 SV=2
Length = 425
Score = 59.7 bits (143), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 60/227 (26%), Positives = 99/227 (43%), Gaps = 20/227 (8%)
Query: 165 VIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNP 224
VI H+ V C+ P +N +G + VWD K+ H V +I +
Sbjct: 108 VINGHTGWVRCVCVDPVDNAWFATGSNDSTIRVWDLATGKLKVTLQG-HIMTVRDICISA 166
Query: 225 TNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVA 284
+ +++AS D V C DLE N +HG T ++ +D++P +++ A
Sbjct: 167 RHP-YMFSASQDKLVKCWDLE-------RNTVVRDFHG--TLSGVHSVDLHPSLDLIVSA 216
Query: 285 DNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRL 344
+ + D R SRS L +G + + C P+ P+ ++SC D ++WD L
Sbjct: 217 GRDSVVRVWDIR--SRSCVLTLAGHRGP-INKVRCLPVDPQ-IVSCSTDATVKLWD---L 269
Query: 345 EAGSSLCDLP-HKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNL 390
AG + L HKR V F+P+ + D+ +R W + G L
Sbjct: 270 VAGKPMKTLTHHKRNVRDLAFNPTEFSFASACTDD-IRSWKLVDGQL 315
Score = 38.9 bits (89), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 96/226 (42%), Gaps = 31/226 (13%)
Query: 173 VTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNPTNDGTVYA 232
V ++ HP+ + +++S + V VWD S + H +N +R P D + +
Sbjct: 201 VHSVDLHPSLD-LIVSAGRDSVVRVWDIRSRSCVLTLAG-HRGPINKVRCLPV-DPQIVS 257
Query: 233 ASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVA--DNFGFL 290
S+D TV DL G + + H R R L NP + A D+
Sbjct: 258 CSTDATVKLWDLVAGKPMKTLT------HHKRNVRDLA---FNPTEFSFASACTDDIRSW 308
Query: 291 YLVDAR--TNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWD------IR 342
LVD + TN S EA+ I V L CN Q +L + G+ +D +
Sbjct: 309 KLVDGQLLTNFNS-EALGI------VNTLACN--QDGVLFAGGDTGELSFFDYKTGHKFQ 359
Query: 343 RLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFG 388
+LE + L ++ V ++ F +G ++LT +D ++IW I G
Sbjct: 360 KLETTAMPGSLESEKGVLASTFDRTGLRLLTCERDKSIKIWKHIDG 405
>sp|Q0CSP9|YD156_ASPTN WD repeat-containing protein ATEG_03285 OS=Aspergillus terreus
(strain NIH 2624 / FGSC A1156) GN=ATEG_03285 PE=3 SV=1
Length = 530
Score = 59.3 bits (142), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 81/370 (21%), Positives = 140/370 (37%), Gaps = 73/370 (19%)
Query: 166 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYKV-------------------SE 205
I+ R+ + FHP+ + ++ +GDK G +GV D +
Sbjct: 186 IKLTPERIYAMTFHPSESKPLIFAGDKMGHLGVLDASQTKPVSAATHDEDEEDDDDDPDP 245
Query: 206 KIVYGNIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRT 265
+ H+ ++ + +P+ +Y AS D ++ DLE ++ + P +
Sbjct: 246 VLTTLKPHTRTISCMTIHPSKPTHLYTASYDSSIREMDLEKTTSVER--------YAPAS 297
Query: 266 WRMLYGMDINPEKGVVLVADNFGFLYLV---------DARTNSRSGEAILIHRKGSKVVG 316
D P G+ + D+ LY D RT R A K+ G
Sbjct: 298 -----TADDVPISGLDMALDDPHCLYWTTLDGEFGRYDMRT-PRQDSATRWTLSDKKIGG 351
Query: 317 LHCNPIQPELLLSCGNDHFARIWDIRRLEAGSSLCDLPH--KRVVNSAYFSPSGSKILTT 374
P P + D R+WD+R+L S + H + V+ A F+ +G ++ T+
Sbjct: 352 FSLYPTHPHYFATASLDRTMRLWDLRKLSHKSPVAVGEHESRLSVSHAAFNGAG-QVATS 410
Query: 375 SQDNRLRIWD-------------SIFGNLDSPSREIVHSHDFNRHLTPFRAEWDPKDPSE 421
S D+ L+I+D S+ P + H+ R +T R +W S
Sbjct: 411 SYDDSLKIYDFGAKGIASWKPGHSLSDAQMKPDVVVRHNCQTGRWVTILRPQWQQNPQSH 470
Query: 422 SLAVIGRYISENYNGAALHPIDFIDITTGQ--LVAEVMDPNITTISPVNKLHPRDDVLAS 479
I R+ N N F+DI +G +A++ IT + V H + +A
Sbjct: 471 ----IQRFCIGNMN-------RFVDIYSGSGDQLAQLGGDGITAVPAVAVFHRSKNWVAG 519
Query: 480 GS-SRSIFIW 488
G+ S I +W
Sbjct: 520 GTASGKICLW 529
>sp|Q5AAU3|SEC31_CANAL Protein transport protein SEC31 OS=Candida albicans (strain SC5314
/ ATCC MYA-2876) GN=SEC31 PE=3 SV=1
Length = 1265
Score = 58.9 bits (141), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 58/223 (26%), Positives = 97/223 (43%), Gaps = 35/223 (15%)
Query: 169 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCI--VNNIRFNPTN 226
H+ V L+F+P NH+L++G GQ+ +WD SE G + + + + +N +
Sbjct: 117 HTGAVKSLQFNPIQNHVLVTGGSNGQIFIWDTKTFSEPFAPGQAMTPMDEITFVSWNNSV 176
Query: 227 DGTVYAASSDGTVSCTDLETG---LALSLM------NVNPNGWHGPRTWRMLYGMDINPE 277
+ + + G S DL+T L LS N + WH ++ +++ D
Sbjct: 177 SHILASTGNGGYTSIWDLKTKREVLHLSYTGAGGRANFSYVSWHPSQSTKLITASD---- 232
Query: 278 KGVVLVADNFGFLYLVDARTNSRSGEAILI-HRKGSKVVGLHCNPIQPELLLSCGNDHFA 336
D+ + D R NS + E IL H+KG V+ L P LLLS G D+
Sbjct: 233 ------NDSCPLILTWDLR-NSNAPEKILEGHKKG--VLSLDWCKQDPTLLLSSGKDNST 283
Query: 337 RIWD-IRRLEAGSSLCDLPHKRVVNSAY---FSPSGSKILTTS 375
+W+ I ++ G + N A+ F+P+ I T+
Sbjct: 284 FLWNPIEGIKLGE------YPTTANWAFETKFAPAAPDIFATA 320
>sp|A3GFK8|SEC31_PICST Protein transport protein SEC31 OS=Scheffersomyces stipitis (strain
ATCC 58785 / CBS 6054 / NBRC 10063 / NRRL Y-11545)
GN=SEC31 PE=3 SV=2
Length = 1244
Score = 58.5 bits (140), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 66/252 (26%), Positives = 110/252 (43%), Gaps = 36/252 (14%)
Query: 169 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCI--VNNIRFNPTN 226
HS V L+F+P +H+L+SG GQ+ +WD K +E G+ + + ++++ +N +
Sbjct: 117 HSGPVRSLQFNPLQSHVLVSGGSHGQIFIWDTKKFTEPFSPGSAMTPMDEISSVAWNNSV 176
Query: 227 DGTVYAASSDGTVSCTDLETG---LALSLM------NVNPNGWHGPRTWRMLYGMDINPE 277
+ + + G S DL++ L LS N + WH ++ ++ D
Sbjct: 177 SHILASTGNSGYTSIWDLKSKREVLHLSYTGASGRANFSHVAWHPTKSTELITASD---- 232
Query: 278 KGVVLVADNFGFLYLVDARTNSRSGEAILI-HRKGSKVVGLHCNPIQPELLLSCGNDHFA 336
D + D R NS + E IL H+KG V+ L PELL+S G D+
Sbjct: 233 ------NDACPLILTWDLR-NSNAPEKILEGHKKG--VLSLDWCQQDPELLISSGKDNTT 283
Query: 337 RIWDIRRLEAGSSLCDLPHKRVVNSAY---FSPSGSKILTTSQ-DNRLRIWDSIFGNLDS 392
+W+ G L + P N A+ F+P I T+ D ++ + +
Sbjct: 284 FLWNPT---TGQKLGEYP--TTANWAFQTAFAPKVPDIFATASFDGKIVVQS--LQDTSP 336
Query: 393 PSREIVHSHDFN 404
P E V S+D N
Sbjct: 337 PVSEKVTSNDDN 348
>sp|Q96DI7|SNR40_HUMAN U5 small nuclear ribonucleoprotein 40 kDa protein OS=Homo sapiens
GN=SNRNP40 PE=1 SV=1
Length = 357
Score = 58.2 bits (139), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/335 (22%), Positives = 135/335 (40%), Gaps = 55/335 (16%)
Query: 160 QVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNN 219
Q ++ H V C +FHP N L S + +W+ Y + HS V
Sbjct: 56 QAPIMLLSGHEGEVYCCKFHP-NGSTLASAGFDRLILLWNVYGDCDNYATLKGHSGAVME 114
Query: 220 IRFNPTNDGTVYAASSDGTVSCTDLETGLAL-------SLMNVNPNGWHGPRTWRMLYGM 272
+ +N T+ +++AS+D TV+ D ETG + S +N GP+
Sbjct: 115 LHYN-TDGSMLFSASTDKTVAVWDSETGERVKRLKGHTSFVNSCYPARRGPQ-------- 165
Query: 273 DINPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGN 332
+V + G + L D R + AI + +V+ + N +++ S G
Sbjct: 166 -------LVCTGSDDGTVKLWDIRKKA----AIQTFQNTYQVLAVTFNDTSDQII-SGGI 213
Query: 333 DHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDS 392
D+ ++WD+R+ + ++ H V S GS +L+ + DN +R+WD +
Sbjct: 214 DNDIKVWDLRQNKLTYTM--RGHADSVTGLSLSSEGSYLLSNAMDNTVRVWDV---RPFA 268
Query: 393 PSREIV-----HSHDFNRHLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDFIDI 447
P V + H+F ++L R W P + R++ ++
Sbjct: 269 PKERCVKIFQGNVHNFEKNL--LRCSWSPDGSKIAAGSADRFV-------------YVWD 313
Query: 448 TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGSS 482
TT + + + + +I+ V HP + ++ S SS
Sbjct: 314 TTSRRILYKLPGHAGSINEV-AFHPDEPIIISASS 347
>sp|Q2HJH6|SNR40_BOVIN U5 small nuclear ribonucleoprotein 40 kDa protein OS=Bos taurus
GN=SNRNP40 PE=2 SV=1
Length = 358
Score = 58.2 bits (139), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/335 (22%), Positives = 135/335 (40%), Gaps = 55/335 (16%)
Query: 160 QVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNN 219
Q ++ H V C +FHP N L S + +W+ Y + HS V
Sbjct: 57 QAPIMLLSGHEGEVYCCKFHP-NGSTLASAGFDRLILLWNVYGDCDNYATLKGHSGAVME 115
Query: 220 IRFNPTNDGTVYAASSDGTVSCTDLETGLAL-------SLMNVNPNGWHGPRTWRMLYGM 272
+ +N T+ +++AS+D TV+ D ETG + S +N GP+
Sbjct: 116 LHYN-TDGSMLFSASTDKTVAVWDSETGERVKRLKGHTSFVNSCYPARRGPQ-------- 166
Query: 273 DINPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGN 332
+V + G + L D R + AI + +V+ + N +++ S G
Sbjct: 167 -------LVCTGSDDGTVKLWDIRKKA----AIQTFQNTYQVLAVTFNDTSDQII-SGGI 214
Query: 333 DHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDS 392
D+ ++WD+R+ + ++ H V S GS +L+ + DN +R+WD +
Sbjct: 215 DNDIKVWDLRQNKLTYTM--RGHADSVTGLSLSSEGSYLLSNAMDNTVRVWDV---RPFA 269
Query: 393 PSREIV-----HSHDFNRHLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDFIDI 447
P V + H+F ++L R W P + R++ ++
Sbjct: 270 PKERCVRIFQGNVHNFEKNL--LRCSWSPDGSKIAAGSADRFV-------------YVWD 314
Query: 448 TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGSS 482
TT + + + + +I+ V HP + ++ S SS
Sbjct: 315 TTSRRILYKLPGHAGSINEV-AFHPDEPIILSASS 348
>sp|Q6PE01|SNR40_MOUSE U5 small nuclear ribonucleoprotein 40 kDa protein OS=Mus musculus
GN=Snrnp40 PE=2 SV=1
Length = 358
Score = 58.2 bits (139), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/335 (22%), Positives = 135/335 (40%), Gaps = 55/335 (16%)
Query: 160 QVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNN 219
Q ++ H V C +FHP N L S + +W+ Y + HS V
Sbjct: 57 QAPIMLLSGHEGEVYCCKFHP-NGSTLASAGFDRLILLWNVYGDCDNYATLKGHSGAVME 115
Query: 220 IRFNPTNDGTVYAASSDGTVSCTDLETGLAL-------SLMNVNPNGWHGPRTWRMLYGM 272
+ +N T+ +++AS+D TV+ D ETG + S +N GP+
Sbjct: 116 LHYN-TDGSMLFSASTDKTVAVWDSETGERVKRLKGHTSFVNSCYPARRGPQ-------- 166
Query: 273 DINPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGN 332
+V + G + L D R + A+ + +V+ + N +++ S G
Sbjct: 167 -------LVCTGSDDGTVKLWDIRKKA----AVQTFQNTYQVLAVTFNDTSDQII-SGGI 214
Query: 333 DHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDS 392
D+ ++WD+R+ + ++ H V S GS +L+ + DN +R+WD +
Sbjct: 215 DNDIKVWDLRQNKLTYTM--RGHADSVTGLSLSSEGSYLLSNAMDNTVRVWDV---RPFA 269
Query: 393 PSREIV-----HSHDFNRHLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDFIDI 447
P V + H+F ++L R W P + R++ ++
Sbjct: 270 PKERCVKIFQGNVHNFEKNL--LRCSWSPDGSKIAAGSADRFV-------------YVWD 314
Query: 448 TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGSS 482
TT + V + + +I+ V HP + ++ S SS
Sbjct: 315 TTSRRVLYKLPGHAGSINEV-AFHPDEPIILSASS 348
>sp|Q8CFD5|ERCC8_MOUSE DNA excision repair protein ERCC-8 OS=Mus musculus GN=Ercc8 PE=2
SV=2
Length = 397
Score = 57.8 bits (138), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 56/254 (22%), Positives = 99/254 (38%), Gaps = 42/254 (16%)
Query: 165 VIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYG------------NI 212
V R H V L+ P +LSG G V ++D S + Y ++
Sbjct: 38 VERIHGSGVNTLDIEPVEGRYMLSGGSDGVVVLYDLENASRQPHYTCKAVCSVGRSHPDV 97
Query: 213 HSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGM 272
H V +++ P + G ++S D T+ D T A + N +Y
Sbjct: 98 HKYSVETVQWYPHDTGMFTSSSFDKTLKVWDTNTLQAADVFNFE----------ETVYSH 147
Query: 273 DINP---EKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLS 329
++P + +V V + L D ++ S S + HR+ +++ + +P +L +
Sbjct: 148 HMSPAATKHCLVAVGTRGPKVQLCDLKSGSCS-HILQGHRQ--EILAVSWSPRHDYILAT 204
Query: 330 CGNDHFARIWDIRRLEA--------------GSSLCDLPHKRVVNSAYFSPSGSKILTTS 375
D ++WD+RR + + H VN F+ G +LT
Sbjct: 205 ASADSRVKLWDVRRASGCLLTLDQHNGKKSQAAESANTAHNGKVNGLCFTSDGLHLLTIG 264
Query: 376 QDNRLRIWDSIFGN 389
DNR+R+W+S G+
Sbjct: 265 TDNRMRLWNSSSGD 278
>sp|Q6C414|SEC31_YARLI Protein transport protein SEC31 OS=Yarrowia lipolytica (strain CLIB
122 / E 150) GN=SEC31 PE=3 SV=1
Length = 1184
Score = 57.8 bits (138), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 85/197 (43%), Gaps = 31/197 (15%)
Query: 157 IPDQVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIV-------- 208
I D ++ HS + L+F P N L+SG KG++ VWD + I
Sbjct: 101 IKDSSTSVSVKEHSGPIKTLQFDPHNPTRLVSGGTKGEIFVWDLSDPKKPIAKKLGTDNK 160
Query: 209 YGNIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMN----VNPNGWHGPR 264
G+I S NNI N + +SS+G + +++ L+ + V+ WH +
Sbjct: 161 AGDIESLAFNNITRN-----ILATSSSNGITTIWNVDQNKELTRVKHDKPVSHVVWHPSK 215
Query: 265 TWRMLYGMDINPEKGVVLVADNFGFLYLVDARTNSRSGEAILI-HRKGSKVVGLHCNPIQ 323
P K + VAD+ + L+ N+ + E +L H KG ++ + +
Sbjct: 216 -----------PTKLITAVADDAEPVMLIWDLKNANAPEGVLQGHSKG--ILSVDWCQLD 262
Query: 324 PELLLSCGNDHFARIWD 340
P LLSCG D+ +W+
Sbjct: 263 PRFLLSCGKDNRTLLWN 279
>sp|Q6FQU2|YD156_CANGA WD repeat-containing protein CAGL0I03542g OS=Candida glabrata
(strain ATCC 2001 / CBS 138 / JCM 3761 / NBRC 0622 /
NRRL Y-65) GN=CAGL0I03542g PE=3 SV=1
Length = 534
Score = 57.8 bits (138), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 73/353 (20%), Positives = 144/353 (40%), Gaps = 52/353 (14%)
Query: 166 IRYHSRRVTCLEFHP-TNNHILLSGDKKGQVGVWDFYK-----------VSEKIVYGNIH 213
I+ + R+T + FHP T+ +++ GD G VG+W+ V I
Sbjct: 196 IKITNERITSMFFHPSTDKKLIVGGDTSGTVGLWNVRDEPLAENGEDDLVEPDITKVKFF 255
Query: 214 SCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMD 273
+ V I PT+ T+ S DG++ L+ + +M + N + P +
Sbjct: 256 TKNVGKIECFPTDTSTLLITSYDGSIRTLGLKDLKSADIMTLR-NSYEEPLG---ISDCQ 311
Query: 274 INPEKGVVLVADNFGFLYL-VDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGN 332
+ + VL G + +D R ++ E K+ + NP +P + +
Sbjct: 312 FSYDNSQVLFLTTLGGEFTQLDLR--AKPTETKFWRLSDKKIGSMAINPQRPYEIATGSL 369
Query: 333 DHFARIWDIRR------------LEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRL 380
D RIWD+R+ + + + V++ +SP+ ++ D+ +
Sbjct: 370 DRTLRIWDVRKTVETPEWSQYEDYHSHEIVSTFDSRLSVSAVSYSPTDGTLVCNGYDDTI 429
Query: 381 RIWD---SIFGNLDSPSREIV-HSHDFNRHLTPFRAEWDPKDPSESLAVIGRYISENYNG 436
R++D + +LD ++ ++ H+ R + +A + P ++A +GR I + YN
Sbjct: 430 RLFDVNGELPEDLDEKNKTVLKHNCQSGRWTSILKARFKPDQNVFAIANMGRAI-DIYN- 487
Query: 437 AALHPIDFIDITTGQLVAEVMDPNITTISPVNKLHPRDDVLASG-SSRSIFIW 488
++GQ +A + T+ V HP + +A G SS +F++
Sbjct: 488 -----------SSGQQLAHL---TTATVPAVLGWHPLKNWIAGGNSSGKVFLF 526
>sp|Q2UUT4|YD156_ASPOR WD repeat-containing protein AO090009000186 OS=Aspergillus oryzae
(strain ATCC 42149 / RIB 40) GN=AO090009000186 PE=3 SV=1
Length = 522
Score = 57.8 bits (138), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/368 (21%), Positives = 140/368 (38%), Gaps = 71/368 (19%)
Query: 166 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYKVSEKIVYGNI------------ 212
I+ RV + FHP+ ++ +GDK G +G+ D + V
Sbjct: 180 IKLTPERVYTMTFHPSETKPLIFAGDKMGHLGILDASQEKPTSVKQEDEDEEDDDPDPVL 239
Query: 213 -----HSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWR 267
H+ ++++ +P+ +Y AS D ++ DL+ S+ P+
Sbjct: 240 TTLKPHTRTISSMVIHPSKPTHLYTASYDSSIREMDLDK--TTSVERYAPDS-------- 289
Query: 268 MLYGMDINPEKGVVLVADNFGFLYLV---------DARTNSRSGEAILIHRKGSKVVGLH 318
D P G+ + AD+ LY D RT + G + K+ G
Sbjct: 290 ---TSDDVPLSGLDMAADDPNTLYWTTLEGEFGRYDMRT-PKQGSVAVWSLSEKKIGGFS 345
Query: 319 CNPIQPELLLSCGNDHFARIWDIRRLEAGSSLCDLPHKR--VVNSAYFSPSGSKILTTSQ 376
P + D R+WDIR+L + H+ V+ A F+ +G ++ T+S
Sbjct: 346 LFPTHSHYFATASLDRTMRLWDIRKLSRREPVPVGEHQSRLSVSHAAFNSAG-QVATSSY 404
Query: 377 DNRLRIWDSIFGNLDS-------------PSREIVHSHDFNRHLTPFRAEWDPKDPSESL 423
D+ L+++D + S P + H+ R +T R +W S
Sbjct: 405 DDSLKLYDFGAKGIASWKPGHTLSDAEMKPDTVVRHNCQTGRWVTILRPQWQINPQSH-- 462
Query: 424 AVIGRYISENYNGAALHPIDFIDI--TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGS 481
I R+ N N F+D+ ++G +A++ IT + V H + +A G+
Sbjct: 463 --IQRFCIGNMN-------RFVDVYSSSGDQLAQLGGDGITAVPAVAVFHRSKNWIAGGT 513
Query: 482 -SRSIFIW 488
S I +W
Sbjct: 514 ASGKICLW 521
>sp|O61585|KTNB1_STRPU Katanin p80 WD40 repeat-containing subunit B1 OS=Strongylocentrotus
purpuratus GN=KATNB1 PE=1 SV=1
Length = 690
Score = 57.4 bits (137), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 72/325 (22%), Positives = 120/325 (36%), Gaps = 85/325 (26%)
Query: 169 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNPTNDG 228
HS V CL P + ++++G + +V +W K I+ + H+ V++++FN +++
Sbjct: 15 HSSNVNCLALGPMSGRVMVTGGEDKKVNLWAVGK-QNCIISLSGHTSPVDSVKFN-SSEE 72
Query: 229 TVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNFG 288
V A S GT+ DLE + + + N + MD +P FG
Sbjct: 73 LVVAGSQSGTMKIYDLEPAKIVRTLTGHRNS---------IRCMDFHP----------FG 113
Query: 289 FLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGS 348
E + S D ++WD+RR G
Sbjct: 114 ------------------------------------EFVASGSTDTNVKLWDVRR--KGC 135
Query: 349 SLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRHLT 408
H VN FSP G ++T S+D +++WD G L +F H
Sbjct: 136 IYTYKGHSDQVNMIKFSPDGKWLVTASEDTTIKLWDLTMGKL---------FQEFKNHTG 186
Query: 409 PFRA-EWDPKDPSESLAVIGRYISENYNGAALHPIDFIDITTGQLVAEVMDPNITTISPV 467
E+ P+E L +G++ + F D+ T QLV+ P + + +
Sbjct: 187 GVTGIEF---HPNEFLLA---------SGSSDRTVQFWDLETFQLVSST-SPGASAVRSI 233
Query: 468 NKLHPRDDVLASGSSRSI--FIWRP 490
+ HP L S + F W P
Sbjct: 234 S-FHPDGSYLFCSSQDMLHAFGWEP 257
>sp|Q9D7H2|WDR5B_MOUSE WD repeat-containing protein 5B OS=Mus musculus GN=Wdr5b PE=1 SV=1
Length = 328
Score = 56.2 bits (134), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 51/217 (23%), Positives = 99/217 (45%), Gaps = 19/217 (8%)
Query: 169 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVS-EKIVYGNIHSCIVNNIRFNPTND 227
HS ++ ++F P N L S + +W Y + +K +YG HS ++++ ++ ++
Sbjct: 38 HSAAISSVKFSP-NGEWLASSAADALIIIWGAYDGNCKKTLYG--HSLEISDVAWS-SDS 93
Query: 228 GTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNF 287
+ +AS D T+ D+ +G L + + + ++ D NP +++
Sbjct: 94 SRLVSASDDKTLKVWDMRSGKCLKTLKGHSD---------FVFCCDFNPPSNLIVSGSFD 144
Query: 288 GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAG 347
+ + + +T + + + H V +CN L++S D RIWD +
Sbjct: 145 ESVKIWEVKTG-KCLKTLSAHSDPISAVNFNCNG---SLIVSGSYDGLCRIWDAASGQCL 200
Query: 348 SSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWD 384
+L D + V + FSP+G ILT + DN L++WD
Sbjct: 201 RTLADEGNPPV-SFVKFSPNGKYILTATLDNTLKLWD 236
Score = 34.3 bits (77), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 49/223 (21%), Positives = 94/223 (42%), Gaps = 16/223 (7%)
Query: 165 VIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNP 224
++ HS V C +F+P +N +++SG V +W+ K + + + HS ++ + FN
Sbjct: 118 TLKGHSDFVFCCDFNPPSN-LIVSGSFDESVKIWEV-KTGKCLKTLSAHSDPISAVNFN- 174
Query: 225 TNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVA 284
N + + S DG D +G L + N P ++ + +P +L A
Sbjct: 175 CNGSLIVSGSYDGLCRIWDAASGQCLRTLADEGN---PPVSF-----VKFSPNGKYILTA 226
Query: 285 DNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRL 344
L L D + R + H+ + + + ++S D+ IW+++
Sbjct: 227 TLDNTLKLWD-YSRGRCLKTYTGHKNEKYCLFASFSVTGRKWVVSGSEDNMVYIWNLQTK 285
Query: 345 EAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDN--RLRIWDS 385
E L H VV SA P+ + I + + +N +++W S
Sbjct: 286 EIVQRL--QGHTDVVISAACHPTKNIIASAALENDKTIKVWSS 326
Score = 33.5 bits (75), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 6/91 (6%)
Query: 300 RSGEAILIHRKGSKVVGLHCNPIQP--ELLLSCGNDHFARIWDIRRLEAGSSLCDLPHKR 357
RSG+ + + S V C P L++S D +IW+++ + +L H
Sbjct: 111 RSGKCLKTLKGHSDFV--FCCDFNPPSNLIVSGSFDESVKIWEVKTGKCLKTLS--AHSD 166
Query: 358 VVNSAYFSPSGSKILTTSQDNRLRIWDSIFG 388
+++ F+ +GS I++ S D RIWD+ G
Sbjct: 167 PISAVNFNCNGSLIVSGSYDGLCRIWDAASG 197
>sp|Q4V8C4|WDR5B_RAT WD repeat-containing protein 5B OS=Rattus norvegicus GN=Wdr5b PE=2
SV=1
Length = 328
Score = 56.2 bits (134), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 51/217 (23%), Positives = 98/217 (45%), Gaps = 19/217 (8%)
Query: 169 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYK-VSEKIVYGNIHSCIVNNIRFNPTND 227
HS ++ ++F P N L S + +W Y +K +YG HS ++++ ++ ++
Sbjct: 38 HSAAISSVKFSP-NGEWLASSAADALIIIWGAYDGKCKKTLYG--HSLEISDVAWS-SDS 93
Query: 228 GTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNF 287
+ +AS D T+ D+ +G L + + + ++ D NP +++
Sbjct: 94 SRLVSASDDKTLKLWDVRSGKCLKTLKGHSD---------FVFCCDFNPPSNLIVSGSFD 144
Query: 288 GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAG 347
+ + + +T + + + H V HCN L++S D RIWD +
Sbjct: 145 ESVKIWEVKTG-KCLKTLSAHSDPISAVHFHCNG---SLIVSGSYDGLCRIWDAASGQCL 200
Query: 348 SSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWD 384
+L D + V + FSP+G ILT + D+ L++WD
Sbjct: 201 RTLADEGNPPV-SFVKFSPNGKYILTATLDSTLKLWD 236
Score = 33.9 bits (76), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 44/92 (47%), Gaps = 6/92 (6%)
Query: 300 RSGEAILIHRKGSKVVGLHCNPIQP--ELLLSCGNDHFARIWDIRRLEAGSSLCDLPHKR 357
RSG+ + + S V C P L++S D +IW+++ + +L H
Sbjct: 111 RSGKCLKTLKGHSDFV--FCCDFNPPSNLIVSGSFDESVKIWEVKTGKCLKTLS--AHSD 166
Query: 358 VVNSAYFSPSGSKILTTSQDNRLRIWDSIFGN 389
+++ +F +GS I++ S D RIWD+ G
Sbjct: 167 PISAVHFHCNGSLIVSGSYDGLCRIWDAASGQ 198
Score = 33.9 bits (76), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 49/224 (21%), Positives = 95/224 (42%), Gaps = 16/224 (7%)
Query: 166 IRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNPT 225
++ HS V C +F+P +N +++SG V +W+ K + + + HS ++ + F+
Sbjct: 119 LKGHSDFVFCCDFNPPSN-LIVSGSFDESVKIWEV-KTGKCLKTLSAHSDPISAVHFH-C 175
Query: 226 NDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVAD 285
N + + S DG D +G L + N P ++ + +P +L A
Sbjct: 176 NGSLIVSGSYDGLCRIWDAASGQCLRTLADEGN---PPVSF-----VKFSPNGKYILTAT 227
Query: 286 NFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLE 345
L L D + R + H+ + + + ++S D+ IW+++ E
Sbjct: 228 LDSTLKLWD-YSRGRCLKTYTGHKNEKYCIFASFSVTGRKWVVSGSEDNMVYIWNLQTKE 286
Query: 346 AGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDN--RLRIWDSIF 387
L H VV SA P+ + I + + +N ++IW S +
Sbjct: 287 IVQRL--QGHTDVVISAACHPTENIIASAALENDKTIKIWSSDY 328
>sp|O22212|PRP4L_ARATH U4/U6 small nuclear ribonucleoprotein PRP4-like protein
OS=Arabidopsis thaliana GN=EMB2776 PE=2 SV=1
Length = 554
Score = 55.8 bits (133), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 91/217 (41%), Gaps = 45/217 (20%)
Query: 182 NNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNPTNDGTVYAAS------- 234
+ IL + G +W+ +V+ I H ++ F+P +D A++
Sbjct: 266 DGKILATCSLSGVTKLWEMPQVTNTIAVLKDHKERATDVVFSPVDDCLATASADRTAKLW 325
Query: 235 -SDGTVSCTDLETGL-ALSLMNVNPNGWH-----GPRTWRMLYGMDINPEKGVVLVADNF 287
+DGT+ T E L L+ + +P+G + +TWR+ DIN ++L
Sbjct: 326 KTDGTLLQT-FEGHLDRLARVAFHPSGKYLGTTSYDKTWRL---WDINTGAELLL----- 376
Query: 288 GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAG 347
+SRS I + G+ L SCG D AR+WD+R
Sbjct: 377 -------QEGHSRSVYGIAFQQDGA-------------LAASCGLDSLARVWDLR--TGR 414
Query: 348 SSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWD 384
S L H + V S FSP+G + + +DN+ RIWD
Sbjct: 415 SILVFQGHIKPVFSVNFSPNGYHLASGGEDNQCRIWD 451
>sp|Q5M786|WDR5_XENTR WD repeat-containing protein 5 OS=Xenopus tropicalis GN=wdr5 PE=2
SV=1
Length = 334
Score = 55.8 bits (133), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 55/241 (22%), Positives = 102/241 (42%), Gaps = 33/241 (13%)
Query: 152 KPAHVIPDQVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVS-EKIVYG 210
KPA V P+ + H++ V+ ++F P N L S + +W Y EK + G
Sbjct: 27 KPAPVKPNYTLKFTLAGHTKAVSSVKFSP-NGEWLASSSADKLIKIWGAYDGKFEKTISG 85
Query: 211 NIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLY 270
H ++++ ++ ++ + +AS D T+ D+ +G L + + N ++
Sbjct: 86 --HKLGISDVAWS-SDSNLLVSASDDKTLKIWDVSSGKCLKTLKGHSN---------YVF 133
Query: 271 GMDINPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQP------ 324
+ NP+ +++ + + D +T K K + H +P+
Sbjct: 134 CCNFNPQSNLIVSGSFDESVRIWDVKTG-----------KCLKTLPAHSDPVSAVHFNRD 182
Query: 325 -ELLLSCGNDHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIW 383
L++S D RIWD + +L D V+ FSP+G IL + DN L++W
Sbjct: 183 GSLIVSSSYDGLCRIWDTASGQCLKTLID-DDNPPVSFVKFSPNGKYILAATLDNTLKLW 241
Query: 384 D 384
D
Sbjct: 242 D 242
>sp|Q498M4|WDR5_RAT WD repeat-containing protein 5 OS=Rattus norvegicus GN=Wdr5 PE=2
SV=1
Length = 334
Score = 55.5 bits (132), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 103/246 (41%), Gaps = 33/246 (13%)
Query: 147 NMTYMKPAHVIPDQVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVS-E 205
+ T KP V P+ + H++ V+ ++F P N L S + +W Y E
Sbjct: 22 SATQSKPTPVKPNYALKFTLAGHTKAVSSVKFSP-NGEWLASSSADKLIKIWGAYDGKFE 80
Query: 206 KIVYGNIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRT 265
K + G H ++++ ++ ++ + +AS D T+ D+ +G L + + N
Sbjct: 81 KTISG--HKLGISDVAWS-SDSNLLVSASDDKTLKIWDVSSGKCLKTLKGHSN------- 130
Query: 266 WRMLYGMDINPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQP- 324
++ + NP+ +++ + + D +T K K + H +P+
Sbjct: 131 --YVFCCNFNPQSNLIVSGSFDESVRIWDVKTG-----------KCLKTLPAHSDPVSAV 177
Query: 325 ------ELLLSCGNDHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDN 378
L++S D RIWD + +L D V+ FSP+G IL + DN
Sbjct: 178 HFNRDGSLIVSSSYDGLCRIWDTASGQCLKTLID-DDNPPVSFVKFSPNGKYILAATLDN 236
Query: 379 RLRIWD 384
L++WD
Sbjct: 237 TLKLWD 242
>sp|P61965|WDR5_MOUSE WD repeat-containing protein 5 OS=Mus musculus GN=Wdr5 PE=1 SV=1
Length = 334
Score = 55.5 bits (132), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 103/246 (41%), Gaps = 33/246 (13%)
Query: 147 NMTYMKPAHVIPDQVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVS-E 205
+ T KP V P+ + H++ V+ ++F P N L S + +W Y E
Sbjct: 22 SATQSKPTPVKPNYALKFTLAGHTKAVSSVKFSP-NGEWLASSSADKLIKIWGAYDGKFE 80
Query: 206 KIVYGNIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRT 265
K + G H ++++ ++ ++ + +AS D T+ D+ +G L + + N
Sbjct: 81 KTISG--HKLGISDVAWS-SDSNLLVSASDDKTLKIWDVSSGKCLKTLKGHSN------- 130
Query: 266 WRMLYGMDINPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQP- 324
++ + NP+ +++ + + D +T K K + H +P+
Sbjct: 131 --YVFCCNFNPQSNLIVSGSFDESVRIWDVKTG-----------KCLKTLPAHSDPVSAV 177
Query: 325 ------ELLLSCGNDHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDN 378
L++S D RIWD + +L D V+ FSP+G IL + DN
Sbjct: 178 HFNRDGSLIVSSSYDGLCRIWDTASGQCLKTLID-DDNPPVSFVKFSPNGKYILAATLDN 236
Query: 379 RLRIWD 384
L++WD
Sbjct: 237 TLKLWD 242
>sp|P61964|WDR5_HUMAN WD repeat-containing protein 5 OS=Homo sapiens GN=WDR5 PE=1 SV=1
Length = 334
Score = 55.5 bits (132), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 103/246 (41%), Gaps = 33/246 (13%)
Query: 147 NMTYMKPAHVIPDQVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVS-E 205
+ T KP V P+ + H++ V+ ++F P N L S + +W Y E
Sbjct: 22 SATQSKPTPVKPNYALKFTLAGHTKAVSSVKFSP-NGEWLASSSADKLIKIWGAYDGKFE 80
Query: 206 KIVYGNIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRT 265
K + G H ++++ ++ ++ + +AS D T+ D+ +G L + + N
Sbjct: 81 KTISG--HKLGISDVAWS-SDSNLLVSASDDKTLKIWDVSSGKCLKTLKGHSN------- 130
Query: 266 WRMLYGMDINPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQP- 324
++ + NP+ +++ + + D +T K K + H +P+
Sbjct: 131 --YVFCCNFNPQSNLIVSGSFDESVRIWDVKTG-----------KCLKTLPAHSDPVSAV 177
Query: 325 ------ELLLSCGNDHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDN 378
L++S D RIWD + +L D V+ FSP+G IL + DN
Sbjct: 178 HFNRDGSLIVSSSYDGLCRIWDTASGQCLKTLID-DDNPPVSFVKFSPNGKYILAATLDN 236
Query: 379 RLRIWD 384
L++WD
Sbjct: 237 TLKLWD 242
>sp|Q9UTC7|YIDC_SCHPO Uncharacterized WD repeat-containing protein C227.12
OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=SPAC227.12 PE=4 SV=1
Length = 462
Score = 55.5 bits (132), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 112/251 (44%), Gaps = 37/251 (14%)
Query: 145 RPNMTYMKPAHVIPDQVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVS 204
+ ++ +++ A ++ Q+ R + + F NH SG GQV VW+ +S
Sbjct: 153 KSSIEHLQKAELMGSQIGG------ERPIAIVRFSNNGNH-FASGSWGGQVKVWNSDNLS 205
Query: 205 EKIVYGNIHSCIVNNIRFNP--------TNDGTVYAASSDGTVSCTDLETGLALSLMNVN 256
E ++ H+ V+ + + P + T+ ++D TV L +
Sbjct: 206 EVQLFRG-HTDRVSGLDWYPLCQAWDADSEQLTLATGAADNTVCLWKASQSTPLLRLEG- 263
Query: 257 PNGWHGPRTWRMLYGMDINPEKGVVLVADNFGFLY-LVDARTNSRSGEAILIHRKGSK-V 314
H R R+ + +P G LV+ +F + L D T G +L+ S+ +
Sbjct: 264 ----HLARVGRVAF----HP-SGDYLVSASFDTTWRLWDVHT----GVELLMQEGHSEGI 310
Query: 315 VGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGSSLCDL-PHKRVVNSAYFSPSGSKILT 373
+ C P L+ S GND RIWD+R +G S+ L H R + + +SP+G ++ T
Sbjct: 311 FSIACQP-DGSLVSSGGNDAIGRIWDLR---SGKSIMVLDEHIRQIVAMAWSPNGYQLAT 366
Query: 374 TSQDNRLRIWD 384
+S D+ ++IWD
Sbjct: 367 SSADDTVKIWD 377
>sp|Q2KIG2|WDR5_BOVIN WD repeat-containing protein 5 OS=Bos taurus GN=WDR5 PE=2 SV=1
Length = 334
Score = 55.1 bits (131), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 103/246 (41%), Gaps = 33/246 (13%)
Query: 147 NMTYMKPAHVIPDQVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVS-E 205
+ T KP V P+ + H++ V+ ++F P N L S + +W Y E
Sbjct: 22 SATQSKPTPVKPNYALKFTLAGHTKAVSSVKFSP-NGEWLASSSADKLIKIWGAYDGKFE 80
Query: 206 KIVYGNIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRT 265
K + G H ++++ ++ ++ + +AS D T+ D+ +G L + + N
Sbjct: 81 KTISG--HKLGISDVAWS-SDSNLLVSASDDKTLKIWDVSSGKCLKTLKGHSN------- 130
Query: 266 WRMLYGMDINPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQP- 324
++ + NP+ +++ + + D +T K K + H +P+
Sbjct: 131 --YVFCCNFNPQSNLIVSGSFDESVRIWDVKTG-----------KCLKTLPAHSDPVSAV 177
Query: 325 ------ELLLSCGNDHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDN 378
L++S D RIWD + +L D V+ FSP+G IL + DN
Sbjct: 178 HFNRDGSLIVSSSYDGLCRIWDTASGQCLKTLID-DDNPPVSFVKFSPNGKYILAATLDN 236
Query: 379 RLRIWD 384
L++WD
Sbjct: 237 TLKLWD 242
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.134 0.408
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 220,731,761
Number of Sequences: 539616
Number of extensions: 10012897
Number of successful extensions: 64569
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 362
Number of HSP's successfully gapped in prelim test: 906
Number of HSP's that attempted gapping in prelim test: 54888
Number of HSP's gapped (non-prelim): 7868
length of query: 546
length of database: 191,569,459
effective HSP length: 122
effective length of query: 424
effective length of database: 125,736,307
effective search space: 53312194168
effective search space used: 53312194168
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 64 (29.3 bits)
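Note on the statistics above: the effective lengths, the effective search space, and the bit scores in the alignment headers can be re-derived from the footer values. The sketch below is illustrative and not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score = (Lambda * raw - ln K) / ln 2, E = search space * 2^-bits) and uses the second Lambda/K pair, which are the gapped values. The function names are hypothetical. Reported Expect values may differ slightly from this estimate because of compositional matrix adjustment and rounding of the printed parameters.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 546
DB_LETTERS = 191_569_459
DB_SEQS = 539_616
EFF_HSP_LEN = 122
GAPPED_LAMBDA = 0.267   # second "Lambda K H" block (gapped)
GAPPED_K = 0.0410


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # The length correction is applied once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Approximate E-value for a given bit score and search space."""
    return search_space * 2.0 ** (-bits)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
)
print(eff_query, eff_db, space)

# Raw score 134 is the first WDR5B_RAT HSP (reported as 56.2 bits).
bits_134 = bit_score(134, GAPPED_LAMBDA, GAPPED_K)
print(round(bits_134, 1), expect(bits_134, space))
```

The three printed integers should match the "effective length of query", "effective length of database", and "effective search space" lines, and the bit score for raw score 134 should round to the 56.2 bits shown for that hit.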