BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 016032
(396 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q6NQ88|DDB2_ARATH Protein DAMAGED DNA-BINDING 2 OS=Arabidopsis thaliana GN=DDB2 PE=1
SV=1
Length = 557
Score = 598 bits (1543), Expect = e-170, Method: Compositional matrix adjust.
Identities = 279/359 (77%), Positives = 322/359 (89%), Gaps = 1/359 (0%)
Query: 1 MKPAHVIPDQVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYG 60
+KP +VIPDQV+CAVIRYHSRRVTCLEFHPT N+ILLSGDKKGQ+GVWDF KV EK VYG
Sbjct: 152 IKPKYVIPDQVHCAVIRYHSRRVTCLEFHPTKNNILLSGDKKGQIGVWDFGKVYEKNVYG 211
Query: 61 NIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLY 120
NIHS VNN+RF+PTND VY+ASSDGT+ TDLETG + +L+N+NP+GW G +W+MLY
Sbjct: 212 NIHSVQVNNMRFSPTNDDMVYSASSDGTIGYTDLETGTSSTLLNLNPDGWQGANSWKMLY 271
Query: 121 GMDINPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSC 180
GMDIN EKGVVL ADNFGFL+++D RTN+ +GE ILIH++GSKV GL CNP+QPELLLSC
Sbjct: 272 GMDINSEKGVVLAADNFGFLHMIDHRTNNSTGEPILIHKQGSKVCGLDCNPVQPELLLSC 331
Query: 181 GNDHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSP-SGSKILTTSQDNRLRIWDSIFGN 239
GNDHFARIWD+R+L+ +SL DL HKRVVNSAYFSP SG+KILTT QDNR+RIWDSIFGN
Sbjct: 332 GNDHFARIWDMRKLQPKASLHDLAHKRVVNSAYFSPSSGTKILTTCQDNRIRIWDSIFGN 391
Query: 240 LDSPSREIVHSHDFNRHLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDFIDITT 299
LD PSREIVHS+DFNRHLTPF+AEWDPKD SESL VIGRYISENYNG ALHPIDFID +
Sbjct: 392 LDLPSREIVHSNDFNRHLTPFKAEWDPKDTSESLIVIGRYISENYNGTALHPIDFIDASN 451
Query: 300 GQLVAEVMDPNITTISPVNKLHPRDDVLASGSSRSIFIWRPKEKSELVEQKEEMKIIVC 358
GQLVAEVMDPNITTI+PVNKLHPRDDVLASGSSRS+FIWRP++ +E+VE+K++ KII+C
Sbjct: 452 GQLVAEVMDPNITTITPVNKLHPRDDVLASGSSRSLFIWRPQDNTEMVEEKKDKKIIIC 510
>sp|Q5ZJL7|DDB2_CHICK DNA damage-binding protein 2 OS=Gallus gallus GN=DDB2 PE=2 SV=1
Length = 507
Score = 172 bits (435), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 124/378 (32%), Positives = 197/378 (52%), Gaps = 29/378 (7%)
Query: 21 RRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEK-IVYGNIHSCIVNNIRFNPTNDGT 79
RRVTCLE+HPT+ + G K G + +WD+ +++ + G + +I+F+P
Sbjct: 125 RRVTCLEWHPTHPSTVAVGSKGGDIILWDYEVLTKTCFIKGKGPGDSLGDIKFSPYEAVK 184
Query: 80 VYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLY-GMDINPEKGVVLVADNFG 138
+Y AS DGT+S DLE G A+ +++ P+ H Y +D++ V+ DN G
Sbjct: 185 LYVASGDGTLSLQDLE-GRAVQVISRAPDCGHENHNVCCWYCSVDVSASCRAVVTGDNLG 243
Query: 139 FLYLVDARTNSRSGEAIL---IHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLE 195
+ L+ S SGE I +H+K KV + N LL + D +IWD+R ++
Sbjct: 244 NVVLL-----STSGEEIWKLKLHKK--KVTHVEFNSRCEWLLATASVDQTVKIWDLRNIK 296
Query: 196 AGSSLCD-LPHKRVVNSAYFSPS-GSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDF 253
++ LPH + VN+AYFSP+ G+K+L+T Q N +R++ + P I H H
Sbjct: 297 DKANFLHVLPHDKPVNAAYFSPTDGAKLLSTDQRNEIRVYSC--SDWTKPQHLIPHPHRQ 354
Query: 254 NRHLTPFRAEWDPKDPSESLAVIGRYISENYNGAA---LHPIDFIDITTGQLVAEVMDPN 310
+HLTP +A W P+ L V+GRY + G L +D D TG++V ++ DPN
Sbjct: 355 FQHLTPIKATWHPR---YDLIVVGRYPDPKFPGYTVNELRTVDIFDGNTGEMVCQLYDPN 411
Query: 311 ITTISPVNKLHPRDDVLASGSSRSIFIWRPKEKSELVEQKEE--MKIIVCGKADKKQKHK 368
+ I +NK +P D LASG +I IW + E+V +K+E +K + + +
Sbjct: 412 ASGIISLNKFNPMGDTLASGMGFNILIW---SREEMVMKKQEHLLKAMTEQGIGSRSLSR 468
Query: 369 FGDESEDSDDDTSKLKRK 386
G + + ++ TSKLK K
Sbjct: 469 RGGQRQ-ANPGTSKLKAK 485
Score = 32.7 bits (73), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 19/83 (22%), Positives = 39/83 (46%), Gaps = 5/83 (6%)
Query: 16 IRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNI--HSCIVNNIRFN 73
++ H ++VT +EF+ +L + V +WD + +K + ++ H VN F+
Sbjct: 258 LKLHKKKVTHVEFNSRCEWLLATASVDQTVKIWDLRNIKDKANFLHVLPHDKPVNAAYFS 317
Query: 74 PTNDGTVYAASSDGTV---SCTD 93
PT+ + + + SC+D
Sbjct: 318 PTDGAKLLSTDQRNEIRVYSCSD 340
>sp|Q66JG1|DDB2_XENTR DNA damage-binding protein 2 OS=Xenopus tropicalis GN=ddb2 PE=2
SV=1
Length = 501
Score = 164 bits (415), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/342 (31%), Positives = 175/342 (51%), Gaps = 24/342 (7%)
Query: 21 RRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNPTNDGTV 80
RRVT LE+HPT+ + + G K G + +WD+ +++ ++ G + ++F+P N +
Sbjct: 122 RRVTTLEWHPTHPNTVAVGSKGGDIILWDYEELNNTLIPGIGAGGCITGMKFDPFNPNQL 181
Query: 81 YAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLY-GMDINPEKGVVLVADNFGF 139
Y +S G+ D + N W M Y +D++ E+ V+ DN G
Sbjct: 182 YTSSVAGSTVLQDFSGRNIQTFTNTE--------DWAMWYCSLDVSAERQCVVTGDNVGN 233
Query: 140 LYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGSS 199
+ L++ T + + +H+K KV + NP LL S D ++WD+R ++ SS
Sbjct: 234 VVLLE--TCGKEIWKLRLHKK--KVTHVEFNPRCDWLLASASVDQTVKLWDLRNIKDKSS 289
Query: 200 -LCDLPHKRVVNSAYFSP-SGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRHL 257
L LPH R VNSAYFSP G+K+LTT Q + +R++ + + P I H H +HL
Sbjct: 290 YLYTLPHARGVNSAYFSPWDGAKLLTTDQHSEIRVYSAC--DWAKPQHIIPHPHRQFQHL 347
Query: 258 TPFRAEWDPKDPSESLAVIGRY---ISENYNGAALHPIDFIDITTGQLVAEVMDPNITTI 314
T +A W P+ L V+GRY + Y L +D D G +V ++ DP + I
Sbjct: 348 TAIKATWHPR---YDLIVVGRYPDPLFPGYMSDELRTVDVFDGQKGNIVCQLYDPYASGI 404
Query: 315 SPVNKLHPRDDVLASGSSRSIFIWRPKEKSELVEQKEEMKII 356
+NK +P D+LASG +I IW +E +++Q+E MK +
Sbjct: 405 VSLNKFNPMGDLLASGMGFNILIWS-REILLMMKQEEMMKAL 445
Score = 35.4 bits (80), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 2/61 (3%)
Query: 16 IRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEK--IVYGNIHSCIVNNIRFN 73
+R H ++VT +EF+P + +L S V +WD + +K +Y H+ VN+ F+
Sbjct: 247 LRLHKKKVTHVEFNPRCDWLLASASVDQTVKLWDLRNIKDKSSYLYTLPHARGVNSAYFS 306
Query: 74 P 74
P
Sbjct: 307 P 307
>sp|Q99J79|DDB2_MOUSE DNA damage-binding protein 2 OS=Mus musculus GN=Ddb2 PE=1 SV=1
Length = 432
Score = 155 bits (392), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 100/327 (30%), Positives = 164/327 (50%), Gaps = 21/327 (6%)
Query: 21 RRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSC-IVNNIRFNPTNDGT 79
RR T L +HPT+ L G K G + +W+F + I I + + ++FN N
Sbjct: 112 RRTTSLAWHPTHPSTLAVGSKGGDIMIWNFGIKDKPIFLKGIGAGGSITGLKFNHLNTNQ 171
Query: 80 VYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNFGF 139
+A+S +GT D + + + N + W +D++ + VV+ DN G
Sbjct: 172 FFASSMEGTTRLQDFKGNILRVYTSSN-----SCKVW--FCSLDVSAKSRVVVTGDNMGH 224
Query: 140 LYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGSS 199
+ L+ T+ + + +H+K KV + NP LL + D +IWD+R+++ S
Sbjct: 225 VILLS--TDGKELWNLRMHKK--KVAHVALNPCCDWLLATASIDQTVKIWDLRQIKGKDS 280
Query: 200 -LCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRHLT 258
L LPH+ VN+A FSP G+++LTT Q+N +R++ + DSP I H H +HLT
Sbjct: 281 FLYSLPHRHPVNAACFSPDGARLLTTDQNNEIRVYSA--SQWDSPLNLISHPHRHFQHLT 338
Query: 259 PFRAEWDPKDPSESLAVIGRYISENYNGAA---LHPIDFIDITTGQLVAEVMDPNITTIS 315
P +A W + +L V+GRY N L ID D ++G+++ ++ DP + I+
Sbjct: 339 PIKATWHSR---HNLIVVGRYPDPNLKSCVPYELRTIDVFDGSSGKMMCQLYDPGYSGIT 395
Query: 316 PVNKLHPRDDVLASGSSRSIFIWRPKE 342
+N+ +P D LAS I IW +E
Sbjct: 396 SLNEFNPMGDTLASTMGYHILIWSQEE 422
>sp|Q0VBY8|DDB2_BOVIN DNA damage-binding protein 2 OS=Bos taurus GN=DDB2 PE=2 SV=1
Length = 426
Score = 154 bits (389), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 100/328 (30%), Positives = 165/328 (50%), Gaps = 23/328 (7%)
Query: 21 RRVTCLEFHPTNNHILLSGDKKGQVGVWDF-YKVSEKIVYGNIHSCIVNNIRFNPTNDGT 79
RR T L +HPT+ L G K G + +W+F K + G + ++FNP N
Sbjct: 111 RRATSLAWHPTHPSTLAVGSKGGDILLWNFGIKDKPTFIKGIGAGGSITGMKFNPLNTNQ 170
Query: 80 VYAASSDGTVSCTDLE-TGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNFG 138
+ +S +GT D + L + + N W +D++ + VV+ DN G
Sbjct: 171 FFTSSMEGTTRLQDFKGNTLRVFASSDTCNVW--------FCSLDVSVKSRVVVTGDNVG 222
Query: 139 FLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGS 198
+ L++ + R + +H+K KV + NP LL + D +IWD+R++ S
Sbjct: 223 HVILLN--MDGRELWNLRMHKK--KVTHVALNPCCDWLLATASVDQTVKIWDLRQVRGKS 278
Query: 199 S-LCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRHL 257
S L LPH+ VN+A+FSP G+++LTT Q + +R++ + D P I H H +HL
Sbjct: 279 SFLHSLPHRHPVNAAHFSPDGAQLLTTDQKSEIRVYSAC--QWDCPPSLIPHPHRHFQHL 336
Query: 258 TPFRAEWDPKDPSESLAVIGRYISENYNGAALH---PIDFIDITTGQLVAEVMDPNITTI 314
TP +A W P+ +L V+GRY N+ + H ID D ++G+++ ++ DP + I
Sbjct: 337 TPIKASWHPR---YNLIVVGRYPDPNFKSCSPHELRTIDVFDGSSGKIMYQLYDPESSGI 393
Query: 315 SPVNKLHPRDDVLASGSSRSIFIWRPKE 342
+N+ +P D LAS I +W P++
Sbjct: 394 MSLNEFNPMGDTLASVMGYHILVWSPED 421
>sp|Q92466|DDB2_HUMAN DNA damage-binding protein 2 OS=Homo sapiens GN=DDB2 PE=1 SV=1
Length = 427
Score = 152 bits (383), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 98/328 (29%), Positives = 161/328 (49%), Gaps = 23/328 (7%)
Query: 21 RRVTCLEFHPTNNHILLSGDKKGQVGVWDF-YKVSEKIVYGNIHSCIVNNIRFNPTNDGT 79
RR T L +HPT+ + G K G + +W+F K + G + ++FNP N
Sbjct: 112 RRATSLAWHPTHPSTVAVGSKGGDIMLWNFGIKDKPTFIKGIGAGGSITGLKFNPLNTNQ 171
Query: 80 VYAASSDGTVSCTDLETGLALSLMNVNP-NGWHGPRTWRMLYGMDINPEKGVVLVADNFG 138
YA+S +GT D + + + + N W +D++ +V+ DN G
Sbjct: 172 FYASSMEGTTRLQDFKGNILRVFASSDTINIW--------FCSLDVSASSRMVVTGDNVG 223
Query: 139 FLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGS 198
+ L++ + + + +H+K KV + NP L + D +IWD+R++ +
Sbjct: 224 NVILLN--MDGKELWNLRMHKK--KVTHVALNPCCDWFLATASVDQTVKIWDLRQVRGKA 279
Query: 199 S-LCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRHL 257
S L LPH+ VN+A FSP G+++LTT Q + +R++ + D P I H H +HL
Sbjct: 280 SFLYSLPHRHPVNAACFSPDGARLLTTDQKSEIRVYSA--SQWDCPLGLIPHPHRHFQHL 337
Query: 258 TPFRAEWDPKDPSESLAVIGRYISENYNGAA---LHPIDFIDITTGQLVAEVMDPNITTI 314
TP +A W P+ +L V+GRY N+ L ID D +G+++ ++ DP + I
Sbjct: 338 TPIKAAWHPR---YNLIVVGRYPDPNFKSCTPYELRTIDVFDGNSGKMMCQLYDPESSGI 394
Query: 315 SPVNKLHPRDDVLASGSSRSIFIWRPKE 342
S +N+ +P D LAS I IW +E
Sbjct: 395 SSLNEFNPMGDTLASAMGYHILIWSQEE 422
>sp|Q2YDS1|DDB2_DANRE DNA damage-binding protein 2 OS=Danio rerio GN=ddb2 PE=1 SV=2
Length = 496
Score = 132 bits (332), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 100/344 (29%), Positives = 167/344 (48%), Gaps = 24/344 (6%)
Query: 21 RRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSE-KIVYGNIHSCIVNNIRFNPTNDGT 79
RRVT LE+HPT+ + G K G + +WD+ +++ + G + ++FN N
Sbjct: 114 RRVTSLEWHPTHPTTVAVGSKGGDIILWDYDVLNKTSFIQGMGPGDAITGMKFNQFNTNQ 173
Query: 80 VYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYG-MDINPEKGVVLVADNFG 138
++ +S G + D + + +W Y +D++ + ++ D+ G
Sbjct: 174 LFVSSIWGATTLRDFSGSVIQVFAKTD--------SWDYWYCCVDVSVSRQMLATGDSTG 225
Query: 139 FLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRL-EAG 197
L L+ + E + H+ +KV NP L+ + D ++WD+R + +
Sbjct: 226 RLLLLGLDGHEIFKEKL--HK--AKVTHAEFNPRCDWLMATSSVDATVKLWDLRNIKDKN 281
Query: 198 SSLCDLPHKRVVNSAYFSPSGS-KILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRH 256
S + ++PH++ VN+AYF+P+ S K+LTT Q N +R++ S + P + I+H H +H
Sbjct: 282 SYIAEMPHEKPVNAAYFNPTDSTKLLTTDQRNEIRVYSSY--DWSKPDQIIIHPHRQFQH 339
Query: 257 LTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDFIDITTGQLVAEVMDPNITTISP 316
LTP +A W P L V GRY + ID D +G LV ++ DPN I
Sbjct: 340 LTPIKATWHPM---YDLIVAGRYPDDQLLLNDKRTIDIYDANSGGLVHQLRDPNAAGIIS 396
Query: 317 VNKLHPRDDVLASGSSRSIFIWRPKEKSELVEQKEEMKIIVCGK 360
+NK P DVLASG +I IW ++ V +K+ IV G+
Sbjct: 397 LNKFSPTGDVLASGMGFNILIWNREDTLSSVNRKQ---TIVTGE 437
Score = 32.0 bits (71), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 18/62 (29%), Positives = 29/62 (46%), Gaps = 2/62 (3%)
Query: 17 RYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVY--GNIHSCIVNNIRFNP 74
+ H +VT EF+P + ++ + V +WD + +K Y H VN FNP
Sbjct: 241 KLHKAKVTHAEFNPRCDWLMATSSVDATVKLWDLRNIKDKNSYIAEMPHEKPVNAAYFNP 300
Query: 75 TN 76
T+
Sbjct: 301 TD 302
>sp|B2KIQ4|WDR76_RHIFE WD repeat-containing protein 76 OS=Rhinolophus ferrumequinum
GN=WDR76 PE=3 SV=2
Length = 630
Score = 102 bits (254), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 98/320 (30%), Positives = 152/320 (47%), Gaps = 37/320 (11%)
Query: 26 LEFHPTNNHILLS-GDKKGQVGVWDF-YKVSEKIVY-GNIHSCIVNNIRFNPTNDGTVYA 82
+ FHP+ L++ G K GQVG+WD ++ E VY HS V+ + F+P N + +
Sbjct: 326 IAFHPSEIKTLVAAGAKSGQVGLWDLTHQPKEDGVYVFQPHSQPVSCLYFSPANPAHMLS 385
Query: 83 ASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNF-GFLY 141
S DGT+ C D+ + V + R+ L D E + ++ G +
Sbjct: 386 LSYDGTLRCGDISSA-------VFEEVYRNERS--SLSSFDFLAEDASTFIVGHWDGSIS 436
Query: 142 LVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCG--NDHFARIWDIRRLEAGSS 199
LVD RT S E LI K+ +H +P+Q + ++ G + H I+D RRL S
Sbjct: 437 LVDRRTPGASYEK-LISSSLRKIRTVHVHPVQRQYFITAGLRDTH---IYDARRLTPSGS 492
Query: 200 --LCDLP-HKRVVNSAYFSP-SGSKILTTSQDNRLRIWDSIFGNLDSP-SREIVHSHDFN 254
L L H + + SAYFSP +G++I+TT D +LR +DS + P I H+
Sbjct: 493 QPLISLTEHTKSIASAYFSPLTGNRIVTTCADCKLRFFDSSCISSQIPLLTTIRHNTITG 552
Query: 255 RHLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHP--IDFIDITTGQLVAEVMDPNIT 312
R LT RA WDPK E +I G+ HP ++ T Q+ + + +
Sbjct: 553 RWLTRLRAVWDPKQ--EDCVII---------GSMAHPRQVEIFHETGEQVHSFLGGECLV 601
Query: 313 TISPVNKLHPRDDVLASGSS 332
++ +N +HP +LA G+S
Sbjct: 602 SVCSINAVHPTRYILAGGNS 621
>sp|Q4KLQ5|WDR76_XENLA WD repeat-containing protein 76 OS=Xenopus laevis GN=wdr76 PE=2
SV=1
Length = 580
Score = 100 bits (250), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 91/325 (28%), Positives = 159/325 (48%), Gaps = 40/325 (12%)
Query: 22 RVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYKVS--EKIVYGNIHSCIVNNIRFNPTNDG 78
R+ + HP+ + I+ +GDK GQ+G+WD +S + + HS ++ + F+P N
Sbjct: 273 RIFSVAIHPSESRTIVAAGDKWGQIGLWDLADLSGNDGVYVFEPHSRPISCMSFSPVNSA 332
Query: 79 TVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMD-INPEKGVVLVADNF 137
+++ S DGTV C D+ + + + + D ++ + V++V+
Sbjct: 333 QLFSLSYDGTVRCGDVCRSVFDEVYRDEQDSFSS---------FDYLSADCSVLIVSHWD 383
Query: 138 GFLYLVDARTNSRSGEA-ILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLE- 195
+L +VD RT S E ++ + ++ +H P+ +L + G I+D+R+L+
Sbjct: 384 SYLSVVDCRTPGTSCEQRASLNMRSARTTSVH--PVNRDLCVVAGAGDVC-IFDVRQLKK 440
Query: 196 -AGSSLCDLPHKRVVNSAYFSP-SGSKILTTSQDNRLRIWDSIFGNLDSPS-REIVHSHD 252
A L H + V SAYFSP +G++ILTT D+ +R++DS ++P H+++
Sbjct: 441 KAQPVLSLTGHSKSVASAYFSPVTGNRILTTCADDYIRVYDSSSLCSEAPLLTAFRHNNN 500
Query: 253 FNRHLTPFRAEWDPKDPSESLAVIG-----RYISENYNGAALHPIDFIDITTGQLVAEVM 307
R LT FRA WDPK ES V+G R I E YN + F D
Sbjct: 501 TGRWLTRFRAVWDPKQ--ESCFVVGSMARPRQI-EVYNESGKLEHSFWD----------- 546
Query: 308 DPNITTISPVNKLHPRDDVLASGSS 332
++ ++ +N +HP ++L G+S
Sbjct: 547 SEHLGSVCSINAMHPTRNLLVGGNS 571
>sp|A6PWY4|WDR76_MOUSE WD repeat-containing protein 76 OS=Mus musculus GN=Wdr76 PE=2 SV=1
Length = 622
Score = 98.2 bits (243), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 89/318 (27%), Positives = 153/318 (48%), Gaps = 30/318 (9%)
Query: 23 VTCLEFHPTNNHILLS-GDKKGQVGVWDFYKVSEKIVY-GNIHSCIVNNIRFNPTNDGTV 80
++ + HP+ L++ G K GQ+G+WD + SE +Y HS V+ + F+PTN +
Sbjct: 315 ISSVALHPSEVRTLVAAGAKSGQIGLWDLTQQSEDAMYVFYAHSRYVSCLSFSPTNPAHL 374
Query: 81 YAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNFGFL 140
+ S DGT+ C D + + V N + P ++ D + +LV G L
Sbjct: 375 LSLSYDGTLRCGDFSSAV---FEEVYRNEGNSPSSF------DFLNDSSSLLVGHWDGHL 425
Query: 141 YLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGSS- 199
LVD RT S E + K+ +H +P+ + ++ G ++D R L++ S
Sbjct: 426 SLVDRRTPGTSYEK-FFNSSLEKIRTVHVHPLSRQYFVTAGLRD-VHVYDARFLKSRGSQ 483
Query: 200 -LCDLP-HKRVVNSAYFSP-SGSKILTTSQDNRLRIWDSIFGNLDSPSREIV-HSHDFNR 255
L L H + + SAYFSP +G++++TT D +LR++DS + P + H+ R
Sbjct: 484 PLISLTEHSKSIASAYFSPVTGNRVVTTCADCKLRVFDSSSISSQLPLLSTIRHNTVTGR 543
Query: 256 HLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDF-IDITTGQLVAEVMDPNITTI 314
LT F+A WDPK E ++ G+ HP + +G+ V + + ++
Sbjct: 544 WLTRFQAVWDPKQ--EDCFIV---------GSMDHPRRVEVFHESGKNVHSLWGECLVSV 592
Query: 315 SPVNKLHPRDDVLASGSS 332
++ +HP +LA G+S
Sbjct: 593 CSLSAVHPTRYILAGGNS 610
Score = 35.0 bits (79), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 23/47 (48%)
Query: 15 VIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGN 61
V HSR V+CL F PTN LLS G + DF + VY N
Sbjct: 353 VFYAHSRYVSCLSFSPTNPAHLLSLSYDGTLRCGDFSSAVFEEVYRN 399
>sp|Q9H967|WDR76_HUMAN WD repeat-containing protein 76 OS=Homo sapiens GN=WDR76 PE=1 SV=2
Length = 626
Score = 94.0 bits (232), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 90/319 (28%), Positives = 148/319 (46%), Gaps = 36/319 (11%)
Query: 26 LEFHPTNNHILLS-GDKKGQVGVWDFYKVSEK--IVYGNIHSCIVNNIRFNPTNDGTVYA 82
+ HP+ L++ G K GQVG+ D + ++ + + HS V+ + F+P N + +
Sbjct: 319 MALHPSETRTLVAVGAKFGQVGLCDLTQQPKEDGVYVFHPHSQPVSCLYFSPANPAHILS 378
Query: 83 ASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNF-GFLY 141
S DGT+ C D + + + R+ D E L+ ++ G +
Sbjct: 379 LSYDGTLRCGDFSRAIFEEV-------YRNERS--SFSSFDFLAEDASTLIVGHWDGNMS 429
Query: 142 LVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCG--NDHFARIWDIRRLEAGSS 199
LVD RT S E + G K+ +H +P+ + ++ G + H I+D RRL + S
Sbjct: 430 LVDRRTPGTSYEKLTSSSMG-KIRTVHVHPVHRQYFITAGLRDTH---IYDARRLNSRRS 485
Query: 200 --LCDLP-HKRVVNSAYFSP-SGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFN- 254
L L H + + SAYFSP +G++++TT D LRI+DS + P + + F
Sbjct: 486 QPLISLTEHTKSIASAYFSPLTGNRVVTTCADCNLRIFDSSCISSKIPLLTTIRHNTFTG 545
Query: 255 RHLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDF-IDITTGQLVAEVMDPNITT 313
R LT F+A WDPK E ++ G+ HP I TG+ V + +
Sbjct: 546 RWLTRFQAMWDPKQ--EDCVIV---------GSMAHPRRVEIFHETGKRVHSFGGEYLVS 594
Query: 314 ISPVNKLHPRDDVLASGSS 332
+ +N +HP +LA G+S
Sbjct: 595 VCSINAMHPTRYILAGGNS 613
>sp|Q0UYV9|YD156_PHANO WD repeat-containing protein SNOG_03055 OS=Phaeosphaeria nodorum
(strain SN15 / ATCC MYA-4574 / FGSC 10173) GN=SNOG_03055
PE=3 SV=1
Length = 519
Score = 94.0 bits (232), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 157/362 (43%), Gaps = 60/362 (16%)
Query: 16 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDF-YKVSE--------------KIVY 59
I+ R+ + HPT ++ +GDK G +G+ D KV+E I
Sbjct: 178 IKITPERIYAMGMHPTTEKPLVFAGDKLGNLGICDASQKVAEVKQEDDEDADNEGPTITT 237
Query: 60 GNIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRML 119
H+ ++ +F+P + +Y+AS D +V DL G+A+ + G P + L
Sbjct: 238 LKPHTRTIHTFQFSPHDSNALYSASYDSSVRKLDLAKGVAVEVY-----GPSDPNEDQPL 292
Query: 120 YGMDINPEKGVVLVADNF-GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLL 178
G++I+ + L G + D RT S E + K K+ G +P QP L+
Sbjct: 293 SGLEISKDDANTLYFSTLDGRFGIYDMRTPSDQAELFQLSEK--KIGGFSLHPQQPHLVA 350
Query: 179 SCGNDHFARIWDIRRLEAGSSLCDLP------HKRVVNSAYFSPSGSKILTTSQDNRLRI 232
+ D +IWD+R++ +G LP R+ S S ++ T S D+ ++I
Sbjct: 351 TASLDRTLKIWDLRKI-SGKGDSRLPALVGEHESRLSVSHAAWNSAGQVATASYDDTIKI 409
Query: 233 --------WDSIFGNLDS---PSREIVHSHDFNRHLTPFRAEWD--PKDPSESLAVIGRY 279
W + D+ PS + H++ R +T RA+W P+D + R+
Sbjct: 410 HDFSKSAEWATGTALTDADMKPSVVVPHNNQTGRWVTILRAQWQQFPQDG------VQRF 463
Query: 280 ISENYNGAALHPIDFIDITT--GQLVAEVMDPNITTISPVNKLHPRDDVLASGS-SRSIF 336
N N F+DI T GQ +A++ IT + V K HP D +A+G+ S +
Sbjct: 464 CIGNMN-------RFVDIYTAKGQQLAQLGGDGITAVPAVAKFHPTLDWVAAGTASGKLC 516
Query: 337 IW 338
+W
Sbjct: 517 LW 518
>sp|A9X1C6|WDR76_PAPAN WD repeat-containing protein 76 OS=Papio anubis GN=WDR76 PE=3 SV=1
Length = 626
Score = 89.0 bits (219), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 87/318 (27%), Positives = 148/318 (46%), Gaps = 34/318 (10%)
Query: 26 LEFHPTNNHILLS-GDKKGQVGVWDFYKVSEK--IVYGNIHSCIVNNIRFNPTNDGTVYA 82
+ HP+ L++ G K GQVG+ D + ++ + + HS V+ + F+P N + +
Sbjct: 319 MALHPSETRTLVAVGAKFGQVGLCDLTQQPKEDGVYVFHPHSQPVSCLYFSPANPAHILS 378
Query: 83 ASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNFGFLYL 142
S DGT+ C D + + + R+ + + ++V G + L
Sbjct: 379 LSYDGTLRCGDFSRAIFEEV-------YRNERSSFSSFDFLSE-DASTLIVGHWDGNMSL 430
Query: 143 VDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCG--NDHFARIWDIRRLEAGSS- 199
VD RT S E + G K+ +H +P+ + ++ G + H I+D R+L++ S
Sbjct: 431 VDRRTPGTSYEKLTSSSMG-KIRTVHVHPVHRQYFITAGLRDTH---IYDARQLKSRGSQ 486
Query: 200 -LCDLP-HKRVVNSAYFSP-SGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFN-R 255
L L H + + SAYFSP +G++++TT D LRI+DS + P + + F R
Sbjct: 487 PLISLTEHTKSIASAYFSPLTGNRVVTTCADCNLRIFDSSCVSSKIPLLTTIRHNTFTGR 546
Query: 256 HLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDF-IDITTGQLVAEVMDPNITTI 314
LT F+A WDPK E ++ G+ HP I TG+ V + ++
Sbjct: 547 WLTRFQAMWDPKQ--EDCVIV---------GSMAHPRRVEIFHETGKRVHSFGGECLVSV 595
Query: 315 SPVNKLHPRDDVLASGSS 332
+N +HP +LA G+S
Sbjct: 596 CSINAMHPTRYILAGGNS 613
>sp|Q2HHH2|YD156_CHAGB WD repeat-containing protein CHGG_00332 OS=Chaetomium globosum
(strain ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 /
NRRL 1970) GN=CHGG_00332 PE=3 SV=1
Length = 524
Score = 87.8 bits (216), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 88/368 (23%), Positives = 166/368 (45%), Gaps = 61/368 (16%)
Query: 11 VNCAVIRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYK---------------VS 54
V A + +RV L FHPT + I+ +GDK+G +GV+D + +
Sbjct: 177 VQGAYPKLVPQRVYSLGFHPTESKPIIFAGDKEGAMGVFDASQEPVKAEDDDDDEEAEIP 236
Query: 55 EKIVYG-NIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGP 113
+ I+ HS + + F+P + VY+AS D ++ DL+ G++ P
Sbjct: 237 DPIISAFKTHSRTITSFHFSPVDANAVYSASYDSSIRKLDLDKGVSTEAFAPADADEDLP 296
Query: 114 RTWRMLYGMDI-NPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPI 172
+ +D+ + +++ + G L D RT S + E + + K+ G +P
Sbjct: 297 -----ISAIDMPTSDPNMIIFSTLQGTLGRHDLRTKSSTAEIWGLTDQ--KIGGFSLHPA 349
Query: 173 QPELLLSCGNDHFARIWDIRRLE------AGSSLCDLPHKRVVNSAYFSPSGSKILTTSQ 226
QP L+ + D +IWD+R+++ A + L + V+ A +S +G + T+S
Sbjct: 350 QPHLVATASLDRTLKIWDLRKIQGKGDARAPALLGTHDSRLSVSHASWSSAG-HVATSSY 408
Query: 227 DNRLRIWDSIFGNLD-------------SPSREIVHSHDFNRHLTPFRAEWDPKDPSESL 273
D+R++I++ F + D P+R+I H++ R +T + +W + P + L
Sbjct: 409 DDRIKIYN--FPDADKWTAGAALTEAQMEPARQIPHNNQTGRWVTILKPQWQ-RSPRDGL 465
Query: 274 AVIGRYISENYNGAALHPIDFIDI--TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGS 331
+++ N N F+D+ G+ +A++ IT + V HP D +A G+
Sbjct: 466 Q---KFVIGNMN-------RFVDVFAADGEQLAQLGGDGITAVPAVAHFHPTMDWVAGGN 515
Query: 332 -SRSIFIW 338
S + +W
Sbjct: 516 GSGKLCLW 523
>sp|Q7S1H9|YD156_NEUCR WD repeat-containing protein NCU09302/NCU11420 OS=Neurospora crassa
(strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257
/ FGSC 987) GN=NCU09302 PE=3 SV=1
Length = 521
Score = 85.5 bits (210), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 85/360 (23%), Positives = 156/360 (43%), Gaps = 57/360 (15%)
Query: 16 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYKVSEKIVYGN------------- 61
I+ +R+ + FHPT I+ +GDK+G +GV+D + + KI +
Sbjct: 181 IKIVPQRIYSMCFHPTEEKPIIFAGDKEGAMGVFDASQPTPKIEDDDEDAEYPDPIISAF 240
Query: 62 -IHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLY 120
HS +++ F+PT+ +Y+AS D ++ DL+ G++ + + + P +
Sbjct: 241 KTHSRTISSFHFSPTDANAIYSASYDSSIRKLDLDKGISTEIFAPSSSSEDLP-----IS 295
Query: 121 GMDI-NPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLS 179
+DI + +++ + G L D RT S E + K+ G +P P L+ +
Sbjct: 296 AIDIPTTDPNMIIFSTLHGSLGRQDQRTKPSSAE--IWGLTDHKIGGFSLHPRHPYLVAT 353
Query: 180 CGNDHFARIWDIRRLEAGS-----SLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWD 234
D +IWD+R++ +L R+ S S I T+S D+R++I+
Sbjct: 354 ASLDRTLKIWDLRKITGKGDLRHPALLGEHESRLSVSHASWSSSGHIATSSYDDRIKIYS 413
Query: 235 -----------SIFGNLDSPSREIVHSHDFNRHLTPFRAEW--DPKDPSESLAVIGRYIS 281
I P+ EI H++ R +T + +W +P+D + A+
Sbjct: 414 FPSAGEWKAGHDIPAKEMQPTVEIPHNNQTGRWVTILKPQWQRNPQDGWQKFAI------ 467
Query: 282 ENYNGAALHPIDFIDITT--GQLVAEVMDPNITTISPVNKLHPRDDVLASGS-SRSIFIW 338
N N F+D+ G+ +A++ IT + V HP D +A G+ S + +W
Sbjct: 468 GNMN-------RFVDVYAEDGEQLAQLGGDGITAVPAVAHFHPTKDWVAGGTASGKLCLW 520
>sp|Q6C0U2|YD156_YARLI WD repeat-containing protein YALI0F21747g OS=Yarrowia lipolytica
(strain CLIB 122 / E 150) GN=YALI0F21747g PE=3 SV=1
Length = 539
Score = 82.0 bits (201), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 85/373 (22%), Positives = 150/373 (40%), Gaps = 77/373 (20%)
Query: 22 RVTCLEFHP-TNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNPTNDGTV 80
R+ HP T+ I+L+GDK G +G+WD +E + +H + + F+ ++ +
Sbjct: 186 RIYITAVHPGTDKRIVLAGDKIGVLGIWDVDSDNEPLQL-QLHHATIPALCFDQNSNDIL 244
Query: 81 YAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNFGFL 140
Y+AS DG+V +L+TG + ++++ + + NP+ ++ + G L
Sbjct: 245 YSASYDGSVRSLELKTGKSGDVLDL-----EAKKNASVGVSDVANPQPHLLYASTLCGHL 299
Query: 141 YLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEA---- 196
D RT S E +++ K K+ G +PI LL + D RIWD+R E
Sbjct: 300 IRKDLRTKSTEYETLILGEK--KIGGFSVDPINTHLLATGSLDRSMRIWDLRATETARTI 357
Query: 197 -GSSLCD----LPHKRVVNSAYFSPSGS------KILTTSQDNRLRIW------------ 233
G + D +PH + + ++ S S + +I+ D+ + I+
Sbjct: 358 PGGEVIDTQFQMPHLQAIYNSRLSVSSTDWNLAGQIVCNGYDDTINIFNQSDYFLDMLND 417
Query: 234 ------------------------DSIFGNLDSPSREIVHSHDFNRHLTPFRAEW--DPK 267
D + PS I H+ R +T +A W P
Sbjct: 418 GNGTEPVKKTRRTRNSKLAEPEISDQELPEIKKPSVRIKHNCQTGRWVTILKARWQQQPL 477
Query: 268 DPSESLAV--IGRYISENYNGAALHPIDFIDITTGQLVAEVMDPNITTISPVNKLHPRDD 325
D + A+ + RYI + Y+G TG +A + D +T + HP +
Sbjct: 478 DGVQKFAIANMNRYI-DIYSG------------TGHQLAHLGDALMTAVPSALAFHPTQN 524
Query: 326 VLASGSSRSIFIW 338
+A G+S W
Sbjct: 525 WIAGGNSSGKMYW 537
>sp|Q1E6Q0|YD156_COCIM WD repeat-containing protein CIMG_01763 OS=Coccidioides immitis
(strain RS) GN=CIMG_01763 PE=3 SV=1
Length = 525
Score = 79.0 bits (193), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 85/363 (23%), Positives = 149/363 (41%), Gaps = 59/363 (16%)
Query: 16 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYKVSEK-------------IVYGN 61
I+ R+ + FHPT + ++ +GDK G +G+ D + ++ I
Sbjct: 181 IKITRERIYSMLFHPTESKPLIFAGDKTGHLGILDASQQPDQNESDEEDEYPDPTITTIK 240
Query: 62 IHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYG 121
H+ ++ + +P++ +Y+ S D ++ DLE +A + P L G
Sbjct: 241 PHTNTISAMHIHPSDPSKLYSGSYDSSIRALDLEKSVATEAYAPASSSDDEP-----LSG 295
Query: 122 MDINPEKGVVLVADNF-GFLYLVDARTNSRS----GEAILIHRKGSKVVG-LHCNPIQPE 175
+D+ P VL GF D R +S++ G A+ ++ K +G P QP
Sbjct: 296 IDMAPTDPHVLYFTTLDGFFGRHDMRVSSKANPGDGSAVTFYQLSEKKIGGFSLCPTQPH 355
Query: 176 LLLSCGNDHFARIWDIRRLEAGSSLCDLPHKR--VVNSAYFSPSGSKILTTSQDNRLRIW 233
+ + D ++WD+R L H+ V+ A F+ G +I TTS DN ++I+
Sbjct: 356 YMATASLDRTMKVWDLRHLSTKHPKPVGEHESSLSVSHAAFNQKG-QIATTSYDNSIKIY 414
Query: 234 DSIFGNLD-------------SPSREIVHSHDFNRHLTPFRAEWD--PKDPSESLAVIGR 278
D L +P I H+ + +T R +W P P E R
Sbjct: 415 DLASKGLKDWKPNHTLSEDEMAPDAVIRHNCQTGKWVTILRPQWQACPDSPVE------R 468
Query: 279 YISENYNGAALHPIDFIDI--TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGS-SRSI 335
+ N N F+DI +TG+ +A++ IT + V H + + G+ S +
Sbjct: 469 FCIGNMN-------RFVDIYTSTGEQLAQLGADVITAVPAVAVFHRTQNWVVGGTGSAKV 521
Query: 336 FIW 338
+W
Sbjct: 522 CLW 524
>sp|P0CS56|YD156_CRYNJ WD repeat-containing protein CNI03070 OS=Cryptococcus neoformans
var. neoformans serotype D (strain JEC21 / ATCC MYA-565)
GN=CNI03070 PE=3 SV=1
Length = 595
Score = 67.0 bits (162), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 95/411 (23%), Positives = 155/411 (37%), Gaps = 104/411 (25%)
Query: 22 RVTCLEFHPTNNHIL-LSGDKKGQVGVWD-FYKVSEKIVYGNIHSCIVNNIRFNPTNDGT 79
RV + HP L L GDK GQ+G+WD EK + S ++ + +G
Sbjct: 189 RVFSMCVHPEKTKTLVLVGDKYGQLGIWDALGPPMEKPENEDDTSGLLRAEGEDEYQEGR 248
Query: 80 VY--AASSDGTVSCTDLE---------TGLALSLMNVNPNGWHGPRTWR------MLYGM 122
V+ A + ++SC ++ T SL +++ + + ++
Sbjct: 249 VWRVQAHAKNSISCMKVDPVNGSGLFSTAYDCSLRHLSFSTLQSTELFSFQDEDLLINHF 308
Query: 123 DINPEKGVVLVADNFGFLYLVDARTNSR-SGE---AILIHRKGSKVVGLHCNPIQPELLL 178
D+ P + D G + D R + R SG + +G+K+ G+ NP+ P L+
Sbjct: 309 DLLPSAQEAWMVDKNGGISHWDTRESKRESGRRRWVVQEEGRGAKLGGVSVNPLMPHLIC 368
Query: 179 SCGNDHFARIWDIRRLEAGSS----------------------LCDLPHK---------- 206
+ GND RIWD R L + SS LPH
Sbjct: 369 TAGNDQHVRIWDTRHLFSISSNLVPSAAAIEEEEEGTSTLSGQSSSLPHDTHPTRESDYS 428
Query: 207 ----------------------RVVNSAYFSPSGSKILTTSQDNRLRIW----------D 234
+ +SAY+ P G +ILTTS D+ LR++ D
Sbjct: 429 TVTSYLASPRGKGLMRAKWQHGKSCSSAYWDPWGRRILTTSYDDHLRVFNIDPGSSLVDD 488
Query: 235 SIFGNLDS-----PSREIVHSHDFNRHLTPFRAEWDPKDPSESLAVIGRYISENYNGAAL 289
G+L P++ + H+ R LT RA+W SL + Y+ G
Sbjct: 489 RAVGSLLQPNGFKPTKVVRHNCQTGRWLTILRAQW-------SLNM--EYMPHFTVGNMK 539
Query: 290 HPIDFIDITTGQLVAEVMDPNITTISPVNKLHPR--DDVLASGSSRSIFIW 338
+D + TG+ + + ++T + V HP D V+ +S I +W
Sbjct: 540 RTLDVVS-ATGEKIVGLWTDDVTAVPTVTASHPNIVDRVVGGNTSGRIQLW 589
>sp|P0CS57|YD156_CRYNB WD repeat-containing protein CNBH2930 OS=Cryptococcus neoformans
var. neoformans serotype D (strain B-3501A) GN=CNBH2930
PE=3 SV=1
Length = 595
Score = 67.0 bits (162), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 95/411 (23%), Positives = 155/411 (37%), Gaps = 104/411 (25%)
Query: 22 RVTCLEFHPTNNHIL-LSGDKKGQVGVWD-FYKVSEKIVYGNIHSCIVNNIRFNPTNDGT 79
RV + HP L L GDK GQ+G+WD EK + S ++ + +G
Sbjct: 189 RVFSMCVHPEKTKTLVLVGDKYGQLGIWDALGPPMEKPENEDDTSGLLRAEGEDEYQEGR 248
Query: 80 VY--AASSDGTVSCTDLE---------TGLALSLMNVNPNGWHGPRTWR------MLYGM 122
V+ A + ++SC ++ T SL +++ + + ++
Sbjct: 249 VWRVQAHAKNSISCMKVDPVNGSGLFSTAYDCSLRHLSFSTLQSTELFSFQDEDLLINHF 308
Query: 123 DINPEKGVVLVADNFGFLYLVDARTNSR-SGE---AILIHRKGSKVVGLHCNPIQPELLL 178
D+ P + D G + D R + R SG + +G+K+ G+ NP+ P L+
Sbjct: 309 DLLPSAQEAWMVDKNGGISHWDTRESKRESGRRRWVVQEEGRGAKLGGVSVNPLMPHLIC 368
Query: 179 SCGNDHFARIWDIRRLEAGSS----------------------LCDLPHK---------- 206
+ GND RIWD R L + SS LPH
Sbjct: 369 TAGNDQHVRIWDTRHLFSISSNLVPSAAAIEEEEEGTSTLSGQSSSLPHDTHPTRESDYS 428
Query: 207 ----------------------RVVNSAYFSPSGSKILTTSQDNRLRIW----------D 234
+ +SAY+ P G +ILTTS D+ LR++ D
Sbjct: 429 TVTSYLASPRGKGLMRAKWQHGKSCSSAYWDPWGRRILTTSYDDHLRVFNIDPGSSLVDD 488
Query: 235 SIFGNLDS-----PSREIVHSHDFNRHLTPFRAEWDPKDPSESLAVIGRYISENYNGAAL 289
G+L P++ + H+ R LT RA+W SL + Y+ G
Sbjct: 489 RAVGSLLQPNGFKPTKVVRHNCQTGRWLTILRAQW-------SLNM--EYMPHFTVGNMK 539
Query: 290 HPIDFIDITTGQLVAEVMDPNITTISPVNKLHPR--DDVLASGSSRSIFIW 338
+D + TG+ + + ++T + V HP D V+ +S I +W
Sbjct: 540 RTLDVVS-ATGEKIVGLWTDDVTAVPTVTASHPNIVDRVVGGNTSGRIQLW 589
>sp|A1DNV8|YD156_NEOFI WD repeat-containing protein NFIA_058290 OS=Neosartorya fischeri
(strain ATCC 1020 / DSM 3700 / FGSC A1164 / NRRL 181)
GN=NFIA_058290 PE=3 SV=1
Length = 527
Score = 66.2 bits (160), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 84/368 (22%), Positives = 146/368 (39%), Gaps = 69/368 (18%)
Query: 16 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYKVSEKIVYG-------------- 60
I+ R+ + FHP+ ++ +GDK G +GV D + EK +
Sbjct: 183 IKLTPERIYTMTFHPSEAKPLIFAGDKMGNLGVLDASQ--EKPISAVKQEDDEDAEDDDP 240
Query: 61 -------NIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGP 113
H+ ++++ +P+ +Y+AS D ++ DLE ++ P
Sbjct: 241 DPVLTTLKPHTRTISSMHVHPSKPTHLYSASYDSSIRELDLEKTTSVEKYAPESTSDDIP 300
Query: 114 RTWRMLYGMDINPEKGVVLVADNF-GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPI 172
+ G+D+ P+ L G D R + RS A K+ G P
Sbjct: 301 -----ISGIDMAPDDPNTLYWTTLDGAFGRYDTRASRRSAVATW-QLSEKKIGGFSLFPT 354
Query: 173 QPELLLSCGNDHFARIWDIRRLEAGSSLCDLPH--KRVVNSAYFSPSGSKILTTSQDNRL 230
P + D R+WDIR+L + H + V+ A F+ +G +I T+S D+ L
Sbjct: 355 HPHFFATASLDRTMRLWDIRKLSHDEPVPVGEHVSRLSVSHAAFNSAG-QIATSSYDDTL 413
Query: 231 RIWDSIFGNLD---------------SPSREIVHSHDFNRHLTPFRAEW--DPKDPSESL 273
+I+D FG+ P + H+ R +T R +W +P+ P
Sbjct: 414 KIYD--FGSKGIAAWKPGHTLSDAEMKPDTIVRHNCQTGRWVTILRPQWQANPQSP---- 467
Query: 274 AVIGRYISENYNGAALHPIDFIDI--TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGS 331
I R+ N N F+D+ ++G +A++ IT + V H + +A G+
Sbjct: 468 --IQRFCIGNMN-------RFVDVYSSSGDQLAQLGGDGITAVPAVAVFHRSTNWIAGGT 518
Query: 332 -SRSIFIW 338
S I +W
Sbjct: 519 ASGKICLW 526
>sp|Q4WLU1|YD156_ASPFU WD repeat-containing protein AFUA_6G12330 OS=Neosartorya fumigata
(strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100)
GN=AFUA_6G12330 PE=3 SV=1
Length = 527
Score = 65.5 bits (158), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 82/364 (22%), Positives = 143/364 (39%), Gaps = 61/364 (16%)
Query: 16 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYK-------------------VSE 55
I+ R+ + FHP+ ++ +GDK G +GV D +
Sbjct: 183 IKLTPERIYTMTFHPSEAKPLIFAGDKMGNLGVLDASQEKPTSAVKQEDDEDAEDDDPDP 242
Query: 56 KIVYGNIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRT 115
+ H+ ++++ +P+ +Y+AS D ++ DLE ++ P
Sbjct: 243 VLTTLKPHTRTISSLHIHPSKPTHLYSASYDSSIRELDLEKTTSVEKYAPESTSDDIP-- 300
Query: 116 WRMLYGMDINPEKGVVLVADNF-GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQP 174
+ G+D+ P+ L G D R + RS A K+ G P P
Sbjct: 301 ---ISGIDMAPDDPNTLYWTTLDGAFGRYDTRASRRSAVATW-QLSEKKIGGFSLFPTHP 356
Query: 175 ELLLSCGNDHFARIWDIRRLEAGSSLCDLPH--KRVVNSAYFSPSGSKILTTSQDNRLRI 232
+ D R+WDIR+L + H + V+ A F+ +G +I T+S D+ L+I
Sbjct: 357 HFFATASLDRTMRLWDIRKLSHDDPVPVGEHVSRLSVSHAAFNSAG-QIATSSYDDTLKI 415
Query: 233 WDSIFGNLD---------------SPSREIVHSHDFNRHLTPFRAEWDPKDPSESLAVIG 277
+D FG+ P + H+ R +T R +W +P S I
Sbjct: 416 YD--FGSKGIAAWEPGYTLSDAEMKPDTIVRHNCQTGRWVTILRPQWQ-ANPQSS---IQ 469
Query: 278 RYISENYNGAALHPIDFIDI--TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGS-SRS 334
R+ N N F+D+ ++G +A++ IT + V H + +A G+ S
Sbjct: 470 RFCIGNMN-------RFVDVYSSSGDQLAQLGGDGITAVPAVAVFHRSTNWIAGGTASGK 522
Query: 335 IFIW 338
I +W
Sbjct: 523 ICLW 526
>sp|B0Y8S0|YD156_ASPFC WD repeat-containing protein AFUB_078330 OS=Neosartorya fumigata
(strain CEA10 / CBS 144.89 / FGSC A1163) GN=AFUB_078330
PE=3 SV=1
Length = 528
Score = 64.7 bits (156), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 82/365 (22%), Positives = 143/365 (39%), Gaps = 62/365 (16%)
Query: 16 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYK--------------------VS 54
I+ R+ + FHP+ ++ +GDK G +GV D +
Sbjct: 183 IKLTPERIYTMTFHPSEAKPLIFAGDKMGNLGVLDASQEKPTSAVKQEDDEEDAEDDDPD 242
Query: 55 EKIVYGNIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPR 114
+ H+ ++++ +P+ +Y+AS D ++ DLE ++ P
Sbjct: 243 PVLTTLKPHTRTISSLHIHPSKPTHLYSASYDSSIRELDLEKTTSVEKYAPESTSDDIP- 301
Query: 115 TWRMLYGMDINPEKGVVLVADNF-GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQ 173
+ G+D+ P+ L G D R + RS A K+ G P
Sbjct: 302 ----ISGIDMAPDDPNTLYWTTLDGAFGRYDTRASRRSAVATW-QLSEKKIGGFSLFPTH 356
Query: 174 PELLLSCGNDHFARIWDIRRLEAGSSLCDLPH--KRVVNSAYFSPSGSKILTTSQDNRLR 231
P + D R+WDIR+L + H + V+ A F+ +G +I T+S D+ L+
Sbjct: 357 PHFFATASLDRTMRLWDIRKLSHDDPVPVGEHVSRLSVSHAAFNSAG-QIATSSYDDTLK 415
Query: 232 IWDSIFGNLD---------------SPSREIVHSHDFNRHLTPFRAEWDPKDPSESLAVI 276
I+D FG+ P + H+ R +T R +W +P S I
Sbjct: 416 IYD--FGSKGIAAWEPGYTLSDAEMKPDTIVRHNCQTGRWVTILRPQWQ-ANPQSS---I 469
Query: 277 GRYISENYNGAALHPIDFIDI--TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGS-SR 333
R+ N N F+D+ ++G +A++ IT + V H + +A G+ S
Sbjct: 470 QRFCIGNMN-------RFVDVYSSSGDQLAQLGGDGITAVPAVAVFHCSTNWIAGGTASG 522
Query: 334 SIFIW 338
I +W
Sbjct: 523 KICLW 527
>sp|P49695|PKWA_THECU Probable serine/threonine-protein kinase PkwA OS=Thermomonospora
curvata GN=pkwA PE=3 SV=1
Length = 742
Score = 63.5 bits (153), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 96/217 (44%), Gaps = 21/217 (9%)
Query: 19 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNPTNDG 78
H+ V + F P + +L SG V +WD E+ V+ H+ V +I F+P DG
Sbjct: 500 HTDWVRAVAFSP-DGALLASGSDDATVRLWDVAAAEERAVFEG-HTHYVLDIAFSP--DG 555
Query: 79 TVYAASS-DGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNF 137
++ A+ S DGT ++ TG +++ + + +Y + +P+ +V
Sbjct: 556 SMVASGSRDGTARLWNVATGTEHAVLKGHTD---------YVYAVAFSPDGSMVASGSRD 606
Query: 138 GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAG 197
G + L D T E ++ VV L +P +L G+D +WD+ EA
Sbjct: 607 GTIRLWDVATGK---ERDVLQAPAENVVSLAFSPDG--SMLVHGSDSTVHLWDVASGEAL 661
Query: 198 SSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWD 234
+ H V + FSP G+ + + S D +R+WD
Sbjct: 662 HTFEG--HTDWVRAVAFSPDGALLASGSDDRTIRLWD 696
Score = 59.7 bits (143), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 57/221 (25%), Positives = 97/221 (43%), Gaps = 21/221 (9%)
Query: 14 AVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFN 73
AV H+ V + F P + ++ SG + G +W+ +E V H+ V + F+
Sbjct: 537 AVFEGHTHYVLDIAFSP-DGSMVASGSRDGTARLWNVATGTEHAVLKG-HTDYVYAVAFS 594
Query: 74 PTNDGTVYAASS-DGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVL 132
P DG++ A+ S DGT+ D+ TG ++ + + +P+ G +L
Sbjct: 595 P--DGSMVASGSRDGTIRLWDVATGKERDVLQAPAEN---------VVSLAFSPD-GSML 642
Query: 133 VADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIR 192
V + ++L D SGEA+ + V LL S +D R+WD+
Sbjct: 643 VHGSDSTVHLWDVA----SGEALHTFEGHTDWVRAVAFSPDGALLASGSDDRTIRLWDVA 698
Query: 193 RLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIW 233
E ++L H V+S F P G+ + + S+D +RIW
Sbjct: 699 AQEEHTTLEG--HTEPVHSVAFHPEGTTLASASEDGTIRIW 737
Score = 37.0 bits (84), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 37/77 (48%), Gaps = 6/77 (7%)
Query: 176 LLLSCGNDHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDS 235
++ S D AR+W++ + L H V + FSP GS + + S+D +R+WD
Sbjct: 557 MVASGSRDGTARLWNVATGTEHAVLKG--HTDYVYAVAFSPDGSMVASGSRDGTIRLWDV 614
Query: 236 IFGN----LDSPSREIV 248
G L +P+ +V
Sbjct: 615 ATGKERDVLQAPAENVV 631
Score = 35.8 bits (81), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 2/64 (3%)
Query: 176 LLLSCGNDHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDS 235
LL S +D R+WD+ E + H V FSP GS + + S+D R+W+
Sbjct: 515 LLASGSDDATVRLWDVAAAEERAVFEG--HTHYVLDIAFSPDGSMVASGSRDGTARLWNV 572
Query: 236 IFGN 239
G
Sbjct: 573 ATGT 576
>sp|A1CU75|YD156_ASPCL WD repeat-containing protein ACLA_085580 OS=Aspergillus clavatus
(strain ATCC 1007 / CBS 513.65 / DSM 816 / NCTC 3887 /
NRRL 1) GN=ACLA_085580 PE=3 SV=1
Length = 531
Score = 62.8 bits (151), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 83/376 (22%), Positives = 144/376 (38%), Gaps = 82/376 (21%)
Query: 16 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYKVSEKIVYGNI------------ 62
I+ R+ + FHP+ ++ +GDK G +GV D S++ +I
Sbjct: 184 IKVTPERIYTMTFHPSEAKPLIFAGDKMGNLGVLD---ASQERPVSSIKHEDGDEEEQED 240
Query: 63 -------------HSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNG 109
H+ ++++ +P+ +Y AS D ++ DLE S+ P+
Sbjct: 241 DDDPDPVLTTLKPHTRTISSMHIHPSKPTHLYTASYDSSIRELDLEK--TTSVETYAPDS 298
Query: 110 WHGPRTWRMLYGMDINPEKGVVLVADNFGFLYLV---------DARTNSRSGEAILIHRK 160
D P G+ + AD+ LY D R + R+ A
Sbjct: 299 -----------PSDDVPISGIDMAADDPNTLYWTTLDGAFGRYDTRASRRTAVATW-QLS 346
Query: 161 GSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGSSLCDLPH--KRVVNSAYFSPSG 218
K+ G P P + D R+WD+R+L L H + V+ A F+ +G
Sbjct: 347 EKKIGGFSLYPTHPHFFATASLDRTMRLWDLRKLSHDDPLPVGEHLSRLSVSHAAFNSAG 406
Query: 219 SKILTTSQDNRLRIWDSIFGNLDS-------------PSREIVHSHDFNRHLTPFRAEWD 265
++ T+S D+ L+I+D + S P + H+ R +T R +W
Sbjct: 407 -QVATSSYDDSLKIYDFGAKGIASWEQGHTLSDAEMKPDTVVRHNCQTGRWVTILRPQWQ 465
Query: 266 PKDPSESLAVIGRYISENYNGAALHPIDFIDI--TTGQLVAEVMDPNITTISPVNKLHPR 323
S I R+ N N F+D+ ++G +A++ IT + V H
Sbjct: 466 ANPQSH----IQRFCIGNMN-------RFVDVYSSSGDQLAQLGGDGITAVPAVAVFHRS 514
Query: 324 DDVLASGS-SRSIFIW 338
+ +A G+ S I +W
Sbjct: 515 KNWIAGGTASGKICLW 530
>sp|Q6BRR2|SEC31_DEBHA Protein transport protein SEC31 OS=Debaryomyces hansenii (strain
ATCC 36239 / CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968)
GN=SEC31 PE=3 SV=2
Length = 1265
Score = 61.2 bits (147), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 102/222 (45%), Gaps = 33/222 (14%)
Query: 19 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCI--VNNIRFNPTN 76
HS V L F+P +H+L++G G++ +WD K +E V G + + V ++ +N +
Sbjct: 117 HSGPVKTLSFNPNQDHVLVTGGSNGEIFIWDTKKFTEPSVPGQAMTPMDEVTSVAWNNSV 176
Query: 77 DGTVYAASSDGTVSCTDLETGLALSLMNVN-PNG--------WHGPRTWRMLYGMDINPE 127
+A + G S DL++ + ++ N P+G WH ++ +++ D
Sbjct: 177 SHIFASAGNGGYTSIWDLKSKREVLHLSYNGPSGRANFSCVAWHPTQSTKLITASD---- 232
Query: 128 KGVVLVADNFGFLYLVDARTNSRSGEAILI-HRKGSKVVGLHCNPIQPELLLSCGNDHFA 186
D + D R N+ + E I+ H+KG V+ L PELL+S G D+
Sbjct: 233 ------NDGCPLILTWDLR-NANAPEKIMEGHKKG--VLSLDWCKHDPELLISSGKDNST 283
Query: 187 RIWDIRRLEAGSSLCDLPHKRVVNSAY---FSPSGSKILTTS 225
+W+ + G L + P N A+ F+P+ +I TS
Sbjct: 284 MLWNPIK---GEKLGEYP--TTANWAFHTKFAPAAPEIFATS 320
>sp|Q16960|DYI3_HELCR Dynein intermediate chain 3, ciliary OS=Heliocidaris crassispina
PE=2 SV=1
Length = 597
Score = 60.8 bits (146), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 96/229 (41%), Gaps = 14/229 (6%)
Query: 23 VTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNI----HSCIVNNIRFNPTNDG 78
+ CLE++P + H+L+ G GQV WD K S+ + + H + I
Sbjct: 218 LVCLEYNPKDVHVLIGGCYNGQVAFWDTRKGSQAVEMSPVEHSHHDPVYKTIWLQSKTGT 277
Query: 79 TVYAASSDGTVSCTDL-ETGLALSLMNVNPNGWHGPRTWRMLYGMDINPE-KGVVLVADN 136
++AS+DG V D+ + G + ++P+ + + ++ P +V
Sbjct: 278 ECFSASTDGQVLWWDMRKLGEPTEKLIMDPSKKGKMENAQGVISLEYEPTIPTKFMVGTE 337
Query: 137 FGFLYLVDARTNSRSGEAILIHRKG-SKVVGLHCNPIQPELLLSCGNDHFARIW--DIRR 193
G + + + + + + I+++ V L NP P+ L+ G D ARIW DIR
Sbjct: 338 QGTIISCNRKAKTPPEKIVAIYKEHIGPVYSLQRNPFFPKNFLTVG-DWTARIWSEDIR- 395
Query: 194 LEAGSSLCDLPHKRVVNSAYFSPSGSKI-LTTSQDNRLRIWDSIFGNLD 241
S + H + +SP + TT D L +WD +F D
Sbjct: 396 --DSSIMWTKYHMSYLTDGCWSPVRPAVFFTTKMDGSLDVWDYLFKQKD 442
>sp|A5DB75|SEC31_PICGU Protein transport protein SEC31 OS=Meyerozyma guilliermondii
(strain ATCC 6260 / CBS 566 / DSM 6381 / JCM 1539 / NBRC
10279 / NRRL Y-324) GN=SEC31 PE=3 SV=2
Length = 1266
Score = 60.1 bits (144), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/225 (24%), Positives = 102/225 (45%), Gaps = 29/225 (12%)
Query: 19 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCI--VNNIRFNPTN 76
HS V L+F+P H+LLSG GQ+ VWD K+S+ + G + + ++ + +N +
Sbjct: 116 HSGPVKTLQFNPLQEHVLLSGGSNGQIFVWDTKKLSDPVAPGKAMTPMDEISCVSWNNSV 175
Query: 77 DGTVYAASSDGTVSCTDLETGLALSLMNVNPN----GWHGPRTWRMLYGMDINPEKGVVL 132
+ G S DL++ + ++ + N WH ++ ++ V
Sbjct: 176 SHIFATTGNSGYTSIWDLKSKREVLHLSYSANFSCVAWHPTQSTKL-----------VTA 224
Query: 133 VADNFGFLYLVDARTNSRSGEAILI-HRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDI 191
++ L L N+ + E I+ H+KG ++ L PE+L+S G D+ +W+
Sbjct: 225 TGNDSDALILTWDLKNANAPEKIMRGHKKG--ILSLDWCKQDPEILISSGKDNATMLWNP 282
Query: 192 RRLEAGSSLCDLPHKRVVNSAY---FSPSGSKILTTSQ-DNRLRI 232
+ G L + P N A+ F+P+ +I T+ D ++ I
Sbjct: 283 IK---GEKLGEYP--TTANWAFHTRFAPAAPEIFATASFDGKIVI 322
>sp|Q6CKE8|PRP46_KLULA Pre-mRNA-splicing factor PRP46 OS=Kluyveromyces lactis (strain ATCC
8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 /
WM37) GN=PRP46 PE=3 SV=1
Length = 434
Score = 59.7 bits (143), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 57/227 (25%), Positives = 98/227 (43%), Gaps = 20/227 (8%)
Query: 15 VIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKI-VYGNIHSCIVNNIRFN 73
VI H+ V C+ P +N +G + +WD KI + G++ S V +I +
Sbjct: 117 VINGHTGWVRCVCVDPVDNEWFATGSNDTTIKIWDLAAGKLKITLIGHVMS--VRDIAIS 174
Query: 74 PTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLV 133
+ +++AS D V C DLE A+ + + +G H +D++P ++
Sbjct: 175 KRHP-YMFSASEDKLVKCWDLERNTAIRDFHGHLSGVH---------TVDVHPSLDIIAT 224
Query: 134 ADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRR 193
A + L D R+ S E +++ S + + C P+ P+ ++SC D R+WDI
Sbjct: 225 AGRDAVVRLWDIRSRS---EIMVLPGHKSPINKVKCLPVDPQ-IISCSGDATVRLWDIIA 280
Query: 194 LEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNL 240
+A L H R + P+ + S N +R W G L
Sbjct: 281 GKASKVLTH--HSRNIRDLTLHPAEFSFASVST-NDVRSWKLPEGQL 324
Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 9/102 (8%)
Query: 166 GLHCNPIQPEL--LLSCGNDHFARIWDIRRLEAGSSLCDLP-HKRVVNSAYFSPSGSKIL 222
G+H + P L + + G D R+WDIR + S + LP HK +N P +I+
Sbjct: 209 GVHTVDVHPSLDIIATAGRDAVVRLWDIR---SRSEIMVLPGHKSPINKVKCLPVDPQII 265
Query: 223 TTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRHLTPFRAEW 264
+ S D +R+WD I G S+ + H R LT AE+
Sbjct: 266 SCSGDATVRLWDIIAGK---ASKVLTHHSRNIRDLTLHPAEF 304
>sp|Q4PGT8|YD156_USTMA WD repeat-containing protein UM00675 OS=Ustilago maydis (strain 521
/ FGSC 9021) GN=UM00675 PE=3 SV=1
Length = 637
Score = 59.3 bits (142), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 66/285 (23%), Positives = 116/285 (40%), Gaps = 70/285 (24%)
Query: 21 RRVTCLEFHP-TNNHILLSGDKKGQVGVWDFYKVS-------------------EKIVYG 60
+R+ + +HP T+ ++ GDK+G +GVWD V+ E+ G
Sbjct: 188 KRIYSMAYHPSTDKDLVFVGDKEGSIGVWDAAPVAFASNRNGVKTADDQDEDAEERFPEG 247
Query: 61 -----NIHS-CIVNNIRFNPTNDGTVYAASSDGTVSCTDLETG-----------LALSLM 103
+H+ V I+F+P N +V ++S D TV DL T + LS+
Sbjct: 248 KAWTLQVHARSPVTCIKFDPVNHNSVLSSSYDSTVRKLDLATAKSEEIWAGEEDVLLSIF 307
Query: 104 NVNPNGWHGPRTWRMLYGMDINP--EKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKG 161
+V P T +Y NP ++ + +AD+ G L +D R +R G +
Sbjct: 308 DV-----LSPSTHPSVYMDTPNPSLDERSMWIADHRGGLLHIDLRERTRRGNNTRRWQVC 362
Query: 162 SKVVG-LHCNPIQPELLLSCGNDHFARIWDIRRLEA-GSSLCDLPH-------------- 205
K +G + N + P + + D R++D+R L + D P+
Sbjct: 363 EKKIGAMSVNRLAPHCIATASLDQHIRLFDVRALASVVKQTADAPYNYKGVDADDLESAQ 422
Query: 206 ----------KRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNL 240
++ S FSP G +++ S D+ +++W G+L
Sbjct: 423 TKAQFASSKARQACTSVDFSPRGDQLVGVSYDDVVKVWSMEPGSL 467
>sp|Q75BY3|PRP46_ASHGO Pre-mRNA-splicing factor PRP46 OS=Ashbya gossypii (strain ATCC
10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056) GN=PRP46
PE=3 SV=2
Length = 425
Score = 59.3 bits (142), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 60/227 (26%), Positives = 99/227 (43%), Gaps = 20/227 (8%)
Query: 15 VIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNP 74
VI H+ V C+ P +N +G + VWD K+ H V +I +
Sbjct: 108 VINGHTGWVRCVCVDPVDNAWFATGSNDSTIRVWDLATGKLKVTLQG-HIMTVRDICISA 166
Query: 75 TNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVA 134
+ +++AS D V C DLE N +HG T ++ +D++P +++ A
Sbjct: 167 RHP-YMFSASQDKLVKCWDLE-------RNTVVRDFHG--TLSGVHSVDLHPSLDLIVSA 216
Query: 135 DNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRL 194
+ + D R SRS L +G + + C P+ P+ ++SC D ++WD L
Sbjct: 217 GRDSVVRVWDIR--SRSCVLTLAGHRGP-INKVRCLPVDPQ-IVSCSTDATVKLWD---L 269
Query: 195 EAGSSLCDLP-HKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNL 240
AG + L HKR V F+P+ + D+ +R W + G L
Sbjct: 270 VAGKPMKTLTHHKRNVRDLAFNPTEFSFASACTDD-IRSWKLVDGQL 315
Score = 39.3 bits (90), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 96/226 (42%), Gaps = 31/226 (13%)
Query: 23 VTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNPTNDGTVYA 82
V ++ HP+ + +++S + V VWD S + H +N +R P D + +
Sbjct: 201 VHSVDLHPSLD-LIVSAGRDSVVRVWDIRSRSCVLTLAG-HRGPINKVRCLPV-DPQIVS 257
Query: 83 ASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVA--DNFGFL 140
S+D TV DL G + + H R R L NP + A D+
Sbjct: 258 CSTDATVKLWDLVAGKPMKTLT------HHKRNVRDLA---FNPTEFSFASACTDDIRSW 308
Query: 141 YLVDAR--TNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWD------IR 192
LVD + TN S EA+ I V L CN Q +L + G+ +D +
Sbjct: 309 KLVDGQLLTNFNS-EALGI------VNTLACN--QDGVLFAGGDTGELSFFDYKTGHKFQ 359
Query: 193 RLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFG 238
+LE + L ++ V ++ F +G ++LT +D ++IW I G
Sbjct: 360 KLETTAMPGSLESEKGVLASTFDRTGLRLLTCERDKSIKIWKHIDG 405
>sp|Q0CSP9|YD156_ASPTN WD repeat-containing protein ATEG_03285 OS=Aspergillus terreus
(strain NIH 2624 / FGSC A1156) GN=ATEG_03285 PE=3 SV=1
Length = 530
Score = 58.9 bits (141), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 81/370 (21%), Positives = 140/370 (37%), Gaps = 73/370 (19%)
Query: 16 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYKV-------------------SE 55
I+ R+ + FHP+ + ++ +GDK G +GV D +
Sbjct: 186 IKLTPERIYAMTFHPSESKPLIFAGDKMGHLGVLDASQTKPVSAATHDEDEEDDDDDPDP 245
Query: 56 KIVYGNIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRT 115
+ H+ ++ + +P+ +Y AS D ++ DLE ++ + P +
Sbjct: 246 VLTTLKPHTRTISCMTIHPSKPTHLYTASYDSSIREMDLEKTTSVER--------YAPAS 297
Query: 116 WRMLYGMDINPEKGVVLVADNFGFLYLV---------DARTNSRSGEAILIHRKGSKVVG 166
D P G+ + D+ LY D RT R A K+ G
Sbjct: 298 -----TADDVPISGLDMALDDPHCLYWTTLDGEFGRYDMRT-PRQDSATRWTLSDKKIGG 351
Query: 167 LHCNPIQPELLLSCGNDHFARIWDIRRLEAGSSLCDLPH--KRVVNSAYFSPSGSKILTT 224
P P + D R+WD+R+L S + H + V+ A F+ +G ++ T+
Sbjct: 352 FSLYPTHPHYFATASLDRTMRLWDLRKLSHKSPVAVGEHESRLSVSHAAFNGAG-QVATS 410
Query: 225 SQDNRLRIWD-------------SIFGNLDSPSREIVHSHDFNRHLTPFRAEWDPKDPSE 271
S D+ L+I+D S+ P + H+ R +T R +W S
Sbjct: 411 SYDDSLKIYDFGAKGIASWKPGHSLSDAQMKPDVVVRHNCQTGRWVTILRPQWQQNPQSH 470
Query: 272 SLAVIGRYISENYNGAALHPIDFIDITTGQ--LVAEVMDPNITTISPVNKLHPRDDVLAS 329
I R+ N N F+DI +G +A++ IT + V H + +A
Sbjct: 471 ----IQRFCIGNMN-------RFVDIYSGSGDQLAQLGGDGITAVPAVAVFHRSKNWVAG 519
Query: 330 GS-SRSIFIW 338
G+ S I +W
Sbjct: 520 GTASGKICLW 529
>sp|Q96DI7|SNR40_HUMAN U5 small nuclear ribonucleoprotein 40 kDa protein OS=Homo sapiens
GN=SNRNP40 PE=1 SV=1
Length = 357
Score = 58.2 bits (139), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/335 (22%), Positives = 135/335 (40%), Gaps = 55/335 (16%)
Query: 10 QVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNN 69
Q ++ H V C +FHP N L S + +W+ Y + HS V
Sbjct: 56 QAPIMLLSGHEGEVYCCKFHP-NGSTLASAGFDRLILLWNVYGDCDNYATLKGHSGAVME 114
Query: 70 IRFNPTNDGTVYAASSDGTVSCTDLETGLAL-------SLMNVNPNGWHGPRTWRMLYGM 122
+ +N T+ +++AS+D TV+ D ETG + S +N GP+
Sbjct: 115 LHYN-TDGSMLFSASTDKTVAVWDSETGERVKRLKGHTSFVNSCYPARRGPQ-------- 165
Query: 123 DINPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGN 182
+V + G + L D R + AI + +V+ + N +++ S G
Sbjct: 166 -------LVCTGSDDGTVKLWDIRKKA----AIQTFQNTYQVLAVTFNDTSDQII-SGGI 213
Query: 183 DHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDS 242
D+ ++WD+R+ + ++ H V S GS +L+ + DN +R+WD +
Sbjct: 214 DNDIKVWDLRQNKLTYTM--RGHADSVTGLSLSSEGSYLLSNAMDNTVRVWDV---RPFA 268
Query: 243 PSREIV-----HSHDFNRHLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDFIDI 297
P V + H+F ++L R W P + R++ ++
Sbjct: 269 PKERCVKIFQGNVHNFEKNL--LRCSWSPDGSKIAAGSADRFV-------------YVWD 313
Query: 298 TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGSS 332
TT + + + + +I+ V HP + ++ S SS
Sbjct: 314 TTSRRILYKLPGHAGSINEV-AFHPDEPIIISASS 347
>sp|Q2HJH6|SNR40_BOVIN U5 small nuclear ribonucleoprotein 40 kDa protein OS=Bos taurus
GN=SNRNP40 PE=2 SV=1
Length = 358
Score = 58.2 bits (139), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/335 (22%), Positives = 135/335 (40%), Gaps = 55/335 (16%)
Query: 10 QVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNN 69
Q ++ H V C +FHP N L S + +W+ Y + HS V
Sbjct: 57 QAPIMLLSGHEGEVYCCKFHP-NGSTLASAGFDRLILLWNVYGDCDNYATLKGHSGAVME 115
Query: 70 IRFNPTNDGTVYAASSDGTVSCTDLETGLAL-------SLMNVNPNGWHGPRTWRMLYGM 122
+ +N T+ +++AS+D TV+ D ETG + S +N GP+
Sbjct: 116 LHYN-TDGSMLFSASTDKTVAVWDSETGERVKRLKGHTSFVNSCYPARRGPQ-------- 166
Query: 123 DINPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGN 182
+V + G + L D R + AI + +V+ + N +++ S G
Sbjct: 167 -------LVCTGSDDGTVKLWDIRKKA----AIQTFQNTYQVLAVTFNDTSDQII-SGGI 214
Query: 183 DHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDS 242
D+ ++WD+R+ + ++ H V S GS +L+ + DN +R+WD +
Sbjct: 215 DNDIKVWDLRQNKLTYTM--RGHADSVTGLSLSSEGSYLLSNAMDNTVRVWDV---RPFA 269
Query: 243 PSREIV-----HSHDFNRHLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDFIDI 297
P V + H+F ++L R W P + R++ ++
Sbjct: 270 PKERCVRIFQGNVHNFEKNL--LRCSWSPDGSKIAAGSADRFV-------------YVWD 314
Query: 298 TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGSS 332
TT + + + + +I+ V HP + ++ S SS
Sbjct: 315 TTSRRILYKLPGHAGSINEV-AFHPDEPIILSASS 348
>sp|Q6PE01|SNR40_MOUSE U5 small nuclear ribonucleoprotein 40 kDa protein OS=Mus musculus
GN=Snrnp40 PE=2 SV=1
Length = 358
Score = 57.8 bits (138), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/335 (22%), Positives = 135/335 (40%), Gaps = 55/335 (16%)
Query: 10 QVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNN 69
Q ++ H V C +FHP N L S + +W+ Y + HS V
Sbjct: 57 QAPIMLLSGHEGEVYCCKFHP-NGSTLASAGFDRLILLWNVYGDCDNYATLKGHSGAVME 115
Query: 70 IRFNPTNDGTVYAASSDGTVSCTDLETGLAL-------SLMNVNPNGWHGPRTWRMLYGM 122
+ +N T+ +++AS+D TV+ D ETG + S +N GP+
Sbjct: 116 LHYN-TDGSMLFSASTDKTVAVWDSETGERVKRLKGHTSFVNSCYPARRGPQ-------- 166
Query: 123 DINPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGN 182
+V + G + L D R + A+ + +V+ + N +++ S G
Sbjct: 167 -------LVCTGSDDGTVKLWDIRKKA----AVQTFQNTYQVLAVTFNDTSDQII-SGGI 214
Query: 183 DHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDS 242
D+ ++WD+R+ + ++ H V S GS +L+ + DN +R+WD +
Sbjct: 215 DNDIKVWDLRQNKLTYTM--RGHADSVTGLSLSSEGSYLLSNAMDNTVRVWDV---RPFA 269
Query: 243 PSREIV-----HSHDFNRHLTPFRAEWDPKDPSESLAVIGRYISENYNGAALHPIDFIDI 297
P V + H+F ++L R W P + R++ ++
Sbjct: 270 PKERCVKIFQGNVHNFEKNL--LRCSWSPDGSKIAAGSADRFV-------------YVWD 314
Query: 298 TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGSS 332
TT + V + + +I+ V HP + ++ S SS
Sbjct: 315 TTSRRVLYKLPGHAGSINEV-AFHPDEPIILSASS 348
>sp|A3GFK8|SEC31_PICST Protein transport protein SEC31 OS=Scheffersomyces stipitis (strain
ATCC 58785 / CBS 6054 / NBRC 10063 / NRRL Y-11545)
GN=SEC31 PE=3 SV=2
Length = 1244
Score = 57.4 bits (137), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/252 (26%), Positives = 110/252 (43%), Gaps = 36/252 (14%)
Query: 19 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCI--VNNIRFNPTN 76
HS V L+F+P +H+L+SG GQ+ +WD K +E G+ + + ++++ +N +
Sbjct: 117 HSGPVRSLQFNPLQSHVLVSGGSHGQIFIWDTKKFTEPFSPGSAMTPMDEISSVAWNNSV 176
Query: 77 DGTVYAASSDGTVSCTDLETG---LALSLM------NVNPNGWHGPRTWRMLYGMDINPE 127
+ + + G S DL++ L LS N + WH ++ ++ D
Sbjct: 177 SHILASTGNSGYTSIWDLKSKREVLHLSYTGASGRANFSHVAWHPTKSTELITASD---- 232
Query: 128 KGVVLVADNFGFLYLVDARTNSRSGEAILI-HRKGSKVVGLHCNPIQPELLLSCGNDHFA 186
D + D R NS + E IL H+KG V+ L PELL+S G D+
Sbjct: 233 ------NDACPLILTWDLR-NSNAPEKILEGHKKG--VLSLDWCQQDPELLISSGKDNTT 283
Query: 187 RIWDIRRLEAGSSLCDLPHKRVVNSAY---FSPSGSKILTTSQ-DNRLRIWDSIFGNLDS 242
+W+ G L + P N A+ F+P I T+ D ++ + +
Sbjct: 284 FLWNPT---TGQKLGEYP--TTANWAFQTAFAPKVPDIFATASFDGKIVVQS--LQDTSP 336
Query: 243 PSREIVHSHDFN 254
P E V S+D N
Sbjct: 337 PVSEKVTSNDDN 348
>sp|Q8CFD5|ERCC8_MOUSE DNA excision repair protein ERCC-8 OS=Mus musculus GN=Ercc8 PE=2
SV=2
Length = 397
Score = 57.4 bits (137), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 56/254 (22%), Positives = 99/254 (38%), Gaps = 42/254 (16%)
Query: 15 VIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYG------------NI 62
V R H V L+ P +LSG G V ++D S + Y ++
Sbjct: 38 VERIHGSGVNTLDIEPVEGRYMLSGGSDGVVVLYDLENASRQPHYTCKAVCSVGRSHPDV 97
Query: 63 HSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGM 122
H V +++ P + G ++S D T+ D T A + N +Y
Sbjct: 98 HKYSVETVQWYPHDTGMFTSSSFDKTLKVWDTNTLQAADVFNFE----------ETVYSH 147
Query: 123 DINP---EKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLS 179
++P + +V V + L D ++ S S + HR+ +++ + +P +L +
Sbjct: 148 HMSPAATKHCLVAVGTRGPKVQLCDLKSGSCS-HILQGHRQ--EILAVSWSPRHDYILAT 204
Query: 180 CGNDHFARIWDIRRLEA--------------GSSLCDLPHKRVVNSAYFSPSGSKILTTS 225
D ++WD+RR + + H VN F+ G +LT
Sbjct: 205 ASADSRVKLWDVRRASGCLLTLDQHNGKKSQAAESANTAHNGKVNGLCFTSDGLHLLTIG 264
Query: 226 QDNRLRIWDSIFGN 239
DNR+R+W+S G+
Sbjct: 265 TDNRMRLWNSSSGD 278
>sp|Q2UUT4|YD156_ASPOR WD repeat-containing protein AO090009000186 OS=Aspergillus oryzae
(strain ATCC 42149 / RIB 40) GN=AO090009000186 PE=3 SV=1
Length = 522
Score = 57.0 bits (136), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/368 (21%), Positives = 140/368 (38%), Gaps = 71/368 (19%)
Query: 16 IRYHSRRVTCLEFHPTNNH-ILLSGDKKGQVGVWDFYKVSEKIVYGNI------------ 62
I+ RV + FHP+ ++ +GDK G +G+ D + V
Sbjct: 180 IKLTPERVYTMTFHPSETKPLIFAGDKMGHLGILDASQEKPTSVKQEDEDEEDDDPDPVL 239
Query: 63 -----HSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWR 117
H+ ++++ +P+ +Y AS D ++ DL+ S+ P+
Sbjct: 240 TTLKPHTRTISSMVIHPSKPTHLYTASYDSSIREMDLDK--TTSVERYAPDS-------- 289
Query: 118 MLYGMDINPEKGVVLVADNFGFLYLV---------DARTNSRSGEAILIHRKGSKVVGLH 168
D P G+ + AD+ LY D RT + G + K+ G
Sbjct: 290 ---TSDDVPLSGLDMAADDPNTLYWTTLEGEFGRYDMRT-PKQGSVAVWSLSEKKIGGFS 345
Query: 169 CNPIQPELLLSCGNDHFARIWDIRRLEAGSSLCDLPHKR--VVNSAYFSPSGSKILTTSQ 226
P + D R+WDIR+L + H+ V+ A F+ +G ++ T+S
Sbjct: 346 LFPTHSHYFATASLDRTMRLWDIRKLSRREPVPVGEHQSRLSVSHAAFNSAG-QVATSSY 404
Query: 227 DNRLRIWDSIFGNLDS-------------PSREIVHSHDFNRHLTPFRAEWDPKDPSESL 273
D+ L+++D + S P + H+ R +T R +W S
Sbjct: 405 DDSLKLYDFGAKGIASWKPGHTLSDAEMKPDTVVRHNCQTGRWVTILRPQWQINPQSH-- 462
Query: 274 AVIGRYISENYNGAALHPIDFIDI--TTGQLVAEVMDPNITTISPVNKLHPRDDVLASGS 331
I R+ N N F+D+ ++G +A++ IT + V H + +A G+
Sbjct: 463 --IQRFCIGNMN-------RFVDVYSSSGDQLAQLGGDGITAVPAVAVFHRSKNWIAGGT 513
Query: 332 -SRSIFIW 338
S I +W
Sbjct: 514 ASGKICLW 521
>sp|Q6FQU2|YD156_CANGA WD repeat-containing protein CAGL0I03542g OS=Candida glabrata
(strain ATCC 2001 / CBS 138 / JCM 3761 / NBRC 0622 /
NRRL Y-65) GN=CAGL0I03542g PE=3 SV=1
Length = 534
Score = 57.0 bits (136), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 73/353 (20%), Positives = 144/353 (40%), Gaps = 52/353 (14%)
Query: 16 IRYHSRRVTCLEFHP-TNNHILLSGDKKGQVGVWDFYK-----------VSEKIVYGNIH 63
I+ + R+T + FHP T+ +++ GD G VG+W+ V I
Sbjct: 196 IKITNERITSMFFHPSTDKKLIVGGDTSGTVGLWNVRDEPLAENGEDDLVEPDITKVKFF 255
Query: 64 SCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMD 123
+ V I PT+ T+ S DG++ L+ + +M + N + P +
Sbjct: 256 TKNVGKIECFPTDTSTLLITSYDGSIRTLGLKDLKSADIMTLR-NSYEEPLG---ISDCQ 311
Query: 124 INPEKGVVLVADNFGFLYL-VDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGN 182
+ + VL G + +D R ++ E K+ + NP +P + +
Sbjct: 312 FSYDNSQVLFLTTLGGEFTQLDLR--AKPTETKFWRLSDKKIGSMAINPQRPYEIATGSL 369
Query: 183 DHFARIWDIRR------------LEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRL 230
D RIWD+R+ + + + V++ +SP+ ++ D+ +
Sbjct: 370 DRTLRIWDVRKTVETPEWSQYEDYHSHEIVSTFDSRLSVSAVSYSPTDGTLVCNGYDDTI 429
Query: 231 RIWD---SIFGNLDSPSREIV-HSHDFNRHLTPFRAEWDPKDPSESLAVIGRYISENYNG 286
R++D + +LD ++ ++ H+ R + +A + P ++A +GR I + YN
Sbjct: 430 RLFDVNGELPEDLDEKNKTVLKHNCQSGRWTSILKARFKPDQNVFAIANMGRAI-DIYN- 487
Query: 287 AALHPIDFIDITTGQLVAEVMDPNITTISPVNKLHPRDDVLASG-SSRSIFIW 338
++GQ +A + T+ V HP + +A G SS +F++
Sbjct: 488 -----------SSGQQLAHL---TTATVPAVLGWHPLKNWIAGGNSSGKVFLF 526
>sp|Q5AAU3|SEC31_CANAL Protein transport protein SEC31 OS=Candida albicans (strain SC5314
/ ATCC MYA-2876) GN=SEC31 PE=3 SV=1
Length = 1265
Score = 56.6 bits (135), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 59/222 (26%), Positives = 96/222 (43%), Gaps = 33/222 (14%)
Query: 19 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCI--VNNIRFNPTN 76
H+ V L+F+P NH+L++G GQ+ +WD SE G + + + + +N +
Sbjct: 117 HTGAVKSLQFNPIQNHVLVTGGSNGQIFIWDTKTFSEPFAPGQAMTPMDEITFVSWNNSV 176
Query: 77 DGTVYAASSDGTVSCTDLETG---LALSLM------NVNPNGWHGPRTWRMLYGMDINPE 127
+ + + G S DL+T L LS N + WH ++ +++ D
Sbjct: 177 SHILASTGNGGYTSIWDLKTKREVLHLSYTGAGGRANFSYVSWHPSQSTKLITASD---- 232
Query: 128 KGVVLVADNFGFLYLVDARTNSRSGEAILI-HRKGSKVVGLHCNPIQPELLLSCGNDHFA 186
D+ + D R NS + E IL H+KG V+ L P LLLS G D+
Sbjct: 233 ------NDSCPLILTWDLR-NSNAPEKILEGHKKG--VLSLDWCKQDPTLLLSSGKDNST 283
Query: 187 RIWDIRRLEAGSSLCDLPHKRVVNSAY---FSPSGSKILTTS 225
+W+ G L + P N A+ F+P+ I T+
Sbjct: 284 FLWNPIE---GIKLGEYP--TTANWAFETKFAPAAPDIFATA 320
>sp|O61585|KTNB1_STRPU Katanin p80 WD40 repeat-containing subunit B1 OS=Strongylocentrotus
purpuratus GN=KATNB1 PE=1 SV=1
Length = 690
Score = 56.2 bits (134), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 71/328 (21%), Positives = 118/328 (35%), Gaps = 91/328 (27%)
Query: 19 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNPTNDG 78
HS V CL P + ++++G + +V +W K I+ + H+ V++++FN +++
Sbjct: 15 HSSNVNCLALGPMSGRVMVTGGEDKKVNLWAVGK-QNCIISLSGHTSPVDSVKFN-SSEE 72
Query: 79 TVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNFG 138
V A S GT+ DLE + + + N + MD +P FG
Sbjct: 73 LVVAGSQSGTMKIYDLEPAKIVRTLTGHRNS---------IRCMDFHP----------FG 113
Query: 139 FLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAGS 198
E + S D ++WD+RR G
Sbjct: 114 ------------------------------------EFVASGSTDTNVKLWDVRR--KGC 135
Query: 199 SLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRHLT 258
H VN FSP G ++T S+D +++WD G L +F H
Sbjct: 136 IYTYKGHSDQVNMIKFSPDGKWLVTASEDTTIKLWDLTMGKL---------FQEFKNH-- 184
Query: 259 PFRAEWDPKDPSESLAVIGRYISEN----YNGAALHPIDFIDITTGQLVAEVMDPNITTI 314
+ V G N +G++ + F D+ T QLV+ P + +
Sbjct: 185 -------------TGGVTGIEFHPNEFLLASGSSDRTVQFWDLETFQLVSST-SPGASAV 230
Query: 315 SPVNKLHPRDDVLASGSSRSI--FIWRP 340
++ HP L S + F W P
Sbjct: 231 RSIS-FHPDGSYLFCSSQDMLHAFGWEP 257
>sp|Q6C414|SEC31_YARLI Protein transport protein SEC31 OS=Yarrowia lipolytica (strain CLIB
122 / E 150) GN=SEC31 PE=3 SV=1
Length = 1184
Score = 56.2 bits (134), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 85/197 (43%), Gaps = 31/197 (15%)
Query: 7 IPDQVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIV-------- 58
I D ++ HS + L+F P N L+SG KG++ VWD + I
Sbjct: 101 IKDSSTSVSVKEHSGPIKTLQFDPHNPTRLVSGGTKGEIFVWDLSDPKKPIAKKLGTDNK 160
Query: 59 YGNIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMN----VNPNGWHGPR 114
G+I S NNI N + +SS+G + +++ L+ + V+ WH +
Sbjct: 161 AGDIESLAFNNITRN-----ILATSSSNGITTIWNVDQNKELTRVKHDKPVSHVVWHPSK 215
Query: 115 TWRMLYGMDINPEKGVVLVADNFGFLYLVDARTNSRSGEAILI-HRKGSKVVGLHCNPIQ 173
P K + VAD+ + L+ N+ + E +L H KG ++ + +
Sbjct: 216 -----------PTKLITAVADDAEPVMLIWDLKNANAPEGVLQGHSKG--ILSVDWCQLD 262
Query: 174 PELLLSCGNDHFARIWD 190
P LLSCG D+ +W+
Sbjct: 263 PRFLLSCGKDNRTLLWN 279
>sp|Q4V8C4|WDR5B_RAT WD repeat-containing protein 5B OS=Rattus norvegicus GN=Wdr5b PE=2
SV=1
Length = 328
Score = 55.8 bits (133), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 51/217 (23%), Positives = 98/217 (45%), Gaps = 19/217 (8%)
Query: 19 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYK-VSEKIVYGNIHSCIVNNIRFNPTND 77
HS ++ ++F P N L S + +W Y +K +YG HS ++++ ++ ++
Sbjct: 38 HSAAISSVKFSP-NGEWLASSAADALIIIWGAYDGKCKKTLYG--HSLEISDVAWS-SDS 93
Query: 78 GTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNF 137
+ +AS D T+ D+ +G L + + + ++ D NP +++
Sbjct: 94 SRLVSASDDKTLKLWDVRSGKCLKTLKGHSD---------FVFCCDFNPPSNLIVSGSFD 144
Query: 138 GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAG 197
+ + + +T + + + H V HCN L++S D RIWD +
Sbjct: 145 ESVKIWEVKTG-KCLKTLSAHSDPISAVHFHCNG---SLIVSGSYDGLCRIWDAASGQCL 200
Query: 198 SSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWD 234
+L D + V + FSP+G ILT + D+ L++WD
Sbjct: 201 RTLADEGNPPV-SFVKFSPNGKYILTATLDSTLKLWD 236
Score = 34.7 bits (78), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 49/225 (21%), Positives = 95/225 (42%), Gaps = 16/225 (7%)
Query: 15 VIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNP 74
++ HS V C +F+P +N +++SG V +W+ K + + + HS ++ + F+
Sbjct: 118 TLKGHSDFVFCCDFNPPSN-LIVSGSFDESVKIWEV-KTGKCLKTLSAHSDPISAVHFH- 174
Query: 75 TNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVA 134
N + + S DG D +G L + N P ++ + +P +L A
Sbjct: 175 CNGSLIVSGSYDGLCRIWDAASGQCLRTLADEGN---PPVSF-----VKFSPNGKYILTA 226
Query: 135 DNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRL 194
L L D + R + H+ + + + ++S D+ IW+++
Sbjct: 227 TLDSTLKLWD-YSRGRCLKTYTGHKNEKYCIFASFSVTGRKWVVSGSEDNMVYIWNLQTK 285
Query: 195 EAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDN--RLRIWDSIF 237
E L H VV SA P+ + I + + +N ++IW S +
Sbjct: 286 EIVQRL--QGHTDVVISAACHPTENIIASAALENDKTIKIWSSDY 328
Score = 34.3 bits (77), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 44/92 (47%), Gaps = 6/92 (6%)
Query: 150 RSGEAILIHRKGSKVVGLHCNPIQP--ELLLSCGNDHFARIWDIRRLEAGSSLCDLPHKR 207
RSG+ + + S V C P L++S D +IW+++ + +L H
Sbjct: 111 RSGKCLKTLKGHSDFV--FCCDFNPPSNLIVSGSFDESVKIWEVKTGKCLKTLS--AHSD 166
Query: 208 VVNSAYFSPSGSKILTTSQDNRLRIWDSIFGN 239
+++ +F +GS I++ S D RIWD+ G
Sbjct: 167 PISAVHFHCNGSLIVSGSYDGLCRIWDAASGQ 198
>sp|Q9D7H2|WDR5B_MOUSE WD repeat-containing protein 5B OS=Mus musculus GN=Wdr5b PE=1 SV=1
Length = 328
Score = 55.8 bits (133), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 51/217 (23%), Positives = 99/217 (45%), Gaps = 19/217 (8%)
Query: 19 HSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVS-EKIVYGNIHSCIVNNIRFNPTND 77
HS ++ ++F P N L S + +W Y + +K +YG HS ++++ ++ ++
Sbjct: 38 HSAAISSVKFSP-NGEWLASSAADALIIIWGAYDGNCKKTLYG--HSLEISDVAWS-SDS 93
Query: 78 GTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVADNF 137
+ +AS D T+ D+ +G L + + + ++ D NP +++
Sbjct: 94 SRLVSASDDKTLKVWDMRSGKCLKTLKGHSD---------FVFCCDFNPPSNLIVSGSFD 144
Query: 138 GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAG 197
+ + + +T + + + H V +CN L++S D RIWD +
Sbjct: 145 ESVKIWEVKTG-KCLKTLSAHSDPISAVNFNCNG---SLIVSGSYDGLCRIWDAASGQCL 200
Query: 198 SSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWD 234
+L D + V + FSP+G ILT + DN L++WD
Sbjct: 201 RTLADEGNPPV-SFVKFSPNGKYILTATLDNTLKLWD 236
Score = 35.0 bits (79), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 49/223 (21%), Positives = 94/223 (42%), Gaps = 16/223 (7%)
Query: 15 VIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNP 74
++ HS V C +F+P +N +++SG V +W+ K + + + HS ++ + FN
Sbjct: 118 TLKGHSDFVFCCDFNPPSN-LIVSGSFDESVKIWEV-KTGKCLKTLSAHSDPISAVNFN- 174
Query: 75 TNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVA 134
N + + S DG D +G L + N P ++ + +P +L A
Sbjct: 175 CNGSLIVSGSYDGLCRIWDAASGQCLRTLADEGN---PPVSF-----VKFSPNGKYILTA 226
Query: 135 DNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRL 194
L L D + R + H+ + + + ++S D+ IW+++
Sbjct: 227 TLDNTLKLWD-YSRGRCLKTYTGHKNEKYCLFASFSVTGRKWVVSGSEDNMVYIWNLQTK 285
Query: 195 EAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDN--RLRIWDS 235
E L H VV SA P+ + I + + +N +++W S
Sbjct: 286 EIVQRL--QGHTDVVISAACHPTKNIIASAALENDKTIKVWSS 326
Score = 33.1 bits (74), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 44/92 (47%), Gaps = 6/92 (6%)
Query: 150 RSGEAILIHRKGSKVVGLHCNPIQP--ELLLSCGNDHFARIWDIRRLEAGSSLCDLPHKR 207
RSG+ + + S V C P L++S D +IW+++ + +L H
Sbjct: 111 RSGKCLKTLKGHSDFV--FCCDFNPPSNLIVSGSFDESVKIWEVKTGKCLKTLS--AHSD 166
Query: 208 VVNSAYFSPSGSKILTTSQDNRLRIWDSIFGN 239
+++ F+ +GS I++ S D RIWD+ G
Sbjct: 167 PISAVNFNCNGSLIVSGSYDGLCRIWDAASGQ 198
>sp|Q5M786|WDR5_XENTR WD repeat-containing protein 5 OS=Xenopus tropicalis GN=wdr5 PE=2
SV=1
Length = 334
Score = 55.5 bits (132), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 55/241 (22%), Positives = 102/241 (42%), Gaps = 33/241 (13%)
Query: 2 KPAHVIPDQVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVS-EKIVYG 60
KPA V P+ + H++ V+ ++F P N L S + +W Y EK + G
Sbjct: 27 KPAPVKPNYTLKFTLAGHTKAVSSVKFSP-NGEWLASSSADKLIKIWGAYDGKFEKTISG 85
Query: 61 NIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLY 120
H ++++ ++ ++ + +AS D T+ D+ +G L + + N ++
Sbjct: 86 --HKLGISDVAWS-SDSNLLVSASDDKTLKIWDVSSGKCLKTLKGHSN---------YVF 133
Query: 121 GMDINPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQP------ 174
+ NP+ +++ + + D +T K K + H +P+
Sbjct: 134 CCNFNPQSNLIVSGSFDESVRIWDVKTG-----------KCLKTLPAHSDPVSAVHFNRD 182
Query: 175 -ELLLSCGNDHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIW 233
L++S D RIWD + +L D V+ FSP+G IL + DN L++W
Sbjct: 183 GSLIVSSSYDGLCRIWDTASGQCLKTLID-DDNPPVSFVKFSPNGKYILAATLDNTLKLW 241
Query: 234 D 234
D
Sbjct: 242 D 242
>sp|O14727|APAF_HUMAN Apoptotic protease-activating factor 1 OS=Homo sapiens GN=APAF1
PE=1 SV=2
Length = 1248
Score = 55.1 bits (131), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 55/213 (25%), Positives = 96/213 (45%), Gaps = 32/213 (15%)
Query: 148 NSRSGEAILIHRKGSKVVGL-HCNPIQPELLLSCGN-DHFARIWDIRRLEAGSSLCDLPH 205
NS +GE + + + S+ V H LLL+ G+ D F ++WD+ + E +++ H
Sbjct: 685 NSMTGELVHTYDEHSEQVNCCHFTNSSHHLLLATGSSDCFLKLWDLNQKECRNTM--FGH 742
Query: 206 KRVVNSAYFSPSGSKILTTSQDNRLRIWDSIFGNLDSPSREIVHSHDFNRHLTPFRAEWD 265
VN FSP + + S D L++WD+ N R+ ++ F +L
Sbjct: 743 TNSVNHCRFSPDDKLLASCSADGTLKLWDATSAN----ERKSINVKQFFLNL-------- 790
Query: 266 PKDPSESLAVIGRYISENYNG-----AALHPIDFIDITTGQLVAEVMDPNITTI-----S 315
+DP E + VI + S + +G AA + I DI T L+ E+ + +TI S
Sbjct: 791 -EDPQEDMEVIVKCCSWSADGARIMVAAKNKIFLFDIHTSGLLGEIHTGHHSTIQYCDFS 849
Query: 316 PVNKLHPRDDVLASGSSRSIFIWRPKEKSELVE 348
P N L + + S + +W +S++ +
Sbjct: 850 PQNHL-----AVVALSQYCVELWNTDSRSKVAD 877
Score = 51.2 bits (121), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 97/223 (43%), Gaps = 15/223 (6%)
Query: 19 HSRRVTCLEFHPTNNHILL-SGDKKGQVGVWDF-YKVSEKIVYGNIHSCIVNNIRFNPTN 76
HS +V C F +++H+LL +G + +WD K ++G+ +S VN+ RF+P +
Sbjct: 698 HSEQVNCCHFTNSSHHLLLATGSSDCFLKLWDLNQKECRNTMFGHTNS--VNHCRFSP-D 754
Query: 77 DGTVYAASSDGTVSCTDLETGLALSLMNVNP---NGWHGPRTWRMLYGMDINPEKGVVLV 133
D + + S+DGT+ D + +NV N ++ G ++
Sbjct: 755 DKLLASCSADGTLKLWDATSANERKSINVKQFFLNLEDPQEDMEVIVKCCSWSADGARIM 814
Query: 134 ADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRR 193
++L D T+ GE IH + Q L + + + +W+
Sbjct: 815 VAAKNKIFLFDIHTSGLLGE---IHTGHHSTIQYCDFSPQNHLAVVALSQYCVELWNT-- 869
Query: 194 LEAGSSLCDL-PHKRVVNSAYFSPSGSKILTTSQDNRLRIWDS 235
++ S + D H V+ FSP GS LT+S D +R+W++
Sbjct: 870 -DSRSKVADCRGHLSWVHGVMFSPDGSSFLTSSDDQTIRLWET 911
>sp|O22212|PRP4L_ARATH U4/U6 small nuclear ribonucleoprotein PRP4-like protein
OS=Arabidopsis thaliana GN=EMB2776 PE=2 SV=1
Length = 554
Score = 55.1 bits (131), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 91/217 (41%), Gaps = 45/217 (20%)
Query: 32 NNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNPTNDGTVYAAS------- 84
+ IL + G +W+ +V+ I H ++ F+P +D A++
Sbjct: 266 DGKILATCSLSGVTKLWEMPQVTNTIAVLKDHKERATDVVFSPVDDCLATASADRTAKLW 325
Query: 85 -SDGTVSCTDLETGL-ALSLMNVNPNGWH-----GPRTWRMLYGMDINPEKGVVLVADNF 137
+DGT+ T E L L+ + +P+G + +TWR+ DIN ++L
Sbjct: 326 KTDGTLLQT-FEGHLDRLARVAFHPSGKYLGTTSYDKTWRL---WDINTGAELLL----- 376
Query: 138 GFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRLEAG 197
+SRS I + G+ L SCG D AR+WD+R
Sbjct: 377 -------QEGHSRSVYGIAFQQDGA-------------LAASCGLDSLARVWDLR--TGR 414
Query: 198 SSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIWD 234
S L H + V S FSP+G + + +DN+ RIWD
Sbjct: 415 SILVFQGHIKPVFSVNFSPNGYHLASGGEDNQCRIWD 451
>sp|Q5BIM8|ERCC8_BOVIN DNA excision repair protein ERCC-8 OS=Bos taurus GN=ERCC8 PE=2 SV=1
Length = 397
Score = 54.7 bits (130), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/253 (21%), Positives = 98/253 (38%), Gaps = 42/253 (16%)
Query: 15 VIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYG------------NI 62
V R H V L+ P +LSG G + ++D S + Y ++
Sbjct: 38 VERIHCSGVNTLDIEPVEGRYMLSGGSDGVIVLYDLENSSRQPYYTCKAVCSVGRSHPDV 97
Query: 63 HSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGM 122
H V +++ P + G ++S D T+ D T + N +Y
Sbjct: 98 HKYSVETVQWYPHDTGMFTSSSFDKTLKVWDTNTLQIADVFNFE----------ETVYSH 147
Query: 123 DINP---EKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLS 179
++P + +V V + L D ++ S S + HR+ +++ + +P +L +
Sbjct: 148 HMSPVATKHCLVAVGTRGPKVQLCDLKSGSCS-HILQGHRQ--EILAVSWSPRYEYILAT 204
Query: 180 CGNDHFARIWDIRRLEA--------------GSSLCDLPHKRVVNSAYFSPSGSKILTTS 225
D A++WD+RR + + H VN F+ G +LT
Sbjct: 205 ASADSRAKLWDVRRASGCLITLDQHNGKKSQAAESANTAHNGKVNGLCFTSDGLHLLTVG 264
Query: 226 QDNRLRIWDSIFG 238
DNR+R+W+S G
Sbjct: 265 TDNRMRLWNSSNG 277
>sp|Q12417|PRP46_YEAST Pre-mRNA-splicing factor PRP46 OS=Saccharomyces cerevisiae (strain
ATCC 204508 / S288c) GN=PRP46 PE=1 SV=1
Length = 451
Score = 54.7 bits (130), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 101/246 (41%), Gaps = 24/246 (9%)
Query: 15 VIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCIVNNIRFNP 74
VI H V C+ P +N ++G + VWD K H V ++ +
Sbjct: 134 VINGHLGWVRCVAIDPVDNEWFITGSNDTTMKVWDLATGKLKTTLAG-HVMTVRDVAVSD 192
Query: 75 TNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLYGMDINPEKGVVLVA 134
+ +++ S D TV C DLE ++ G RT + I+P ++ A
Sbjct: 193 RHP-YLFSVSEDKTVKCWDLEKN---QIIRDYYGHLSGVRT------VSIHPTLDLIATA 242
Query: 135 DNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDHFARIWDIRRL 194
+ L D RT R L+ KG + + C P+ P+ ++S D R+WD+
Sbjct: 243 GRDSVIKLWDMRT--RIPVITLVGHKGP-INQVQCTPVDPQ-VVSSSTDATVRLWDVV-- 296
Query: 195 EAGSSLCDLP-HKRVVNSAYFSPSGSKILTTSQDNRLRIW----DSIFGNLDSPSREIVH 249
AG ++ L HKR V + P + + D+ +R W S+ N +S I++
Sbjct: 297 -AGKTMKVLTHHKRSVRATALHPKEFSVASACTDD-IRSWGLAEGSLLTNFESEKTGIIN 354
Query: 250 SHDFNR 255
+ N+
Sbjct: 355 TLSINQ 360
>sp|A5DTX3|SEC31_LODEL Protein transport protein SEC31 OS=Lodderomyces elongisporus
(strain ATCC 11503 / CBS 2605 / JCM 1781 / NBRC 1676 /
NRRL YB-4239) GN=SEC31 PE=3 SV=1
Length = 953
Score = 54.3 bits (129), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/254 (22%), Positives = 106/254 (41%), Gaps = 33/254 (12%)
Query: 17 RYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVSEKIVYGNIHSCI--VNNIRFNP 74
++ V L+F+P +H+L++G GQ+ VWD E G + + ++ + +N
Sbjct: 116 KHSGGAVKSLQFNPIQHHVLVTGGSNGQIFVWDTKTFGEPFSPGQAMTPMDEISCVAWNN 175
Query: 75 TNDGTVYAASSDGTVSCTDLE----------TGLALSLMNVNPNGWHGPRTWRMLYGMDI 124
+ + + + G S DL+ TG A N + WH ++ +++ D
Sbjct: 176 SVSHILASTGNSGYTSIWDLKSKKEVLHLSYTG-ATGKANFSHVAWHPTKSTKLVTASD- 233
Query: 125 NPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQPELLLSCGNDH 184
+++ + D R ++ + + H+KG V+ L PELL+S G D+
Sbjct: 234 ---------SESCPVILTWDLRNSNAPEKVMEGHKKG--VLSLDWCKQDPELLISSGKDN 282
Query: 185 FARIWDIRRLEAGSSLCDLPHKRVVNSAY---FSPSGSKILTTSQDNRLRIWDSIFGNLD 241
+W+ G L + P N A+ F+PS I T+ + I +I
Sbjct: 283 ATILWNPIE---GKKLGEYP--TTANWAFKTRFAPSAPDIFATASFDGKIIVQTIQDTSP 337
Query: 242 SPSREIVHSHDFNR 255
S ++ SHD N
Sbjct: 338 PVSTKVSASHDDNE 351
>sp|Q498M4|WDR5_RAT WD repeat-containing protein 5 OS=Rattus norvegicus GN=Wdr5 PE=2
SV=1
Length = 334
Score = 54.3 bits (129), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 54/241 (22%), Positives = 101/241 (41%), Gaps = 33/241 (13%)
Query: 2 KPAHVIPDQVNCAVIRYHSRRVTCLEFHPTNNHILLSGDKKGQVGVWDFYKVS-EKIVYG 60
KP V P+ + H++ V+ ++F P N L S + +W Y EK + G
Sbjct: 27 KPTPVKPNYALKFTLAGHTKAVSSVKFSP-NGEWLASSSADKLIKIWGAYDGKFEKTISG 85
Query: 61 NIHSCIVNNIRFNPTNDGTVYAASSDGTVSCTDLETGLALSLMNVNPNGWHGPRTWRMLY 120
H ++++ ++ ++ + +AS D T+ D+ +G L + + N ++
Sbjct: 86 --HKLGISDVAWS-SDSNLLVSASDDKTLKIWDVSSGKCLKTLKGHSN---------YVF 133
Query: 121 GMDINPEKGVVLVADNFGFLYLVDARTNSRSGEAILIHRKGSKVVGLHCNPIQP------ 174
+ NP+ +++ + + D +T K K + H +P+
Sbjct: 134 CCNFNPQSNLIVSGSFDESVRIWDVKTG-----------KCLKTLPAHSDPVSAVHFNRD 182
Query: 175 -ELLLSCGNDHFARIWDIRRLEAGSSLCDLPHKRVVNSAYFSPSGSKILTTSQDNRLRIW 233
L++S D RIWD + +L D V+ FSP+G IL + DN L++W
Sbjct: 183 GSLIVSSSYDGLCRIWDTASGQCLKTLID-DDNPPVSFVKFSPNGKYILAATLDNTLKLW 241
Query: 234 D 234
D
Sbjct: 242 D 242
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.135 0.413
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 158,317,474
Number of Sequences: 539616
Number of extensions: 6936068
Number of successful extensions: 27146
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 250
Number of HSP's successfully gapped in prelim test: 749
Number of HSP's that attempted gapping in prelim test: 23271
Number of HSP's gapped (non-prelim): 3930
length of query: 396
length of database: 191,569,459
effective HSP length: 120
effective length of query: 276
effective length of database: 126,815,539
effective search space: 35001088764
effective search space used: 35001088764
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (28.5 bits)