BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019258
(343 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q8NFH3|NUP43_HUMAN Nucleoporin Nup43 OS=Homo sapiens GN=NUP43 PE=1 SV=1
Length = 380
Score = 61.6 bits (148), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 63/228 (27%), Positives = 93/228 (40%), Gaps = 40/228 (17%)
Query: 82 QSLVAASTLSGSLHVLSSAMQVLLSVDEN---SGFHVGP---------ISGVDLTGSECV 129
+ +VAAS+ L LSV++ + +H GP +GV E V
Sbjct: 86 ERIVAASSTGCVTVFLHHPNNQTLSVNQQWTTAHYHTGPGSPSYSSAPCTGVVCNNPEIV 145
Query: 130 SVGEDGRVNLVSVIGGGEFKRVYDGNGLVGFNCVKWASPTEFVTGGFGYGLQWWDLRRAG 189
+VGEDGR+NL E R D + V + E +T L+ WD R+ G
Sbjct: 146 TVGEDGRINLFRA-DHKEAVRTIDNADSSTLHAVTFLRTPEILTVNSIGQLKIWDFRQQG 204
Query: 190 SG-GPVLQFKGNFGSQGKASGMIHSVDIHPSRKHTCLAGGSSGTVFAWDLRWQQQSIVLS 248
+ +L G+ +H VD HP+++H GG G + WD+R Q ++ +S
Sbjct: 205 NEPSQILSLTGD-------RVPLHCVDRHPNQQHVVATGGQDGMLSIWDVR--QGTMPVS 255
Query: 249 GVGTGEAVTHSLSESEVWEVQYDCHTKSSNISSSRVLPAMFCSEDGIL 296
+ EA E+WEV + S CSEDG L
Sbjct: 256 LLKAHEA--------EMWEVHF---------HPSNPEHLFTCSEDGSL 286
>sp|P59235|NUP43_MOUSE Nucleoporin Nup43 OS=Mus musculus GN=Nup43 PE=2 SV=2
Length = 380
Score = 59.7 bits (143), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/179 (30%), Positives = 72/179 (40%), Gaps = 28/179 (15%)
Query: 119 SGVDLTGSECVSVGEDGRVNLVSVIGGGEFKRVYDGNGLVGFNCVKWASPTEFVTGGFGY 178
+G+ E V+VGEDGR+NL V E R D + V + E VT
Sbjct: 135 TGIVCDNPEIVTVGEDGRINLFRV-DHKEAVRTIDNADSSTLHAVTFLRTPEIVTVNSIG 193
Query: 179 GLQWWDLRRAGSG-GPVLQFKGNFGSQGKASGMIHSVDIHPSRKHTCLAGGSSGTVFAWD 237
L+ WD R+ GS +L G+ +H VD HP ++H GG G + WD
Sbjct: 194 QLKIWDFRQQGSEPCQILSLTGD-------RVPLHCVDRHPDQQHVVATGGQDGMLSIWD 246
Query: 238 LRWQQQSIVLSGVGTGEAVTHSLSESEVWEVQYDCHTKSSNISSSRVLPAMFCSEDGIL 296
+R GT E+E+WEV H SN CSEDG L
Sbjct: 247 VR----------QGTMPVSLLKAHEAEMWEV----HFHPSNPDH-----LFTCSEDGSL 286
>sp|Q54Z22|NUP43_DICDI Nuclear pore complex protein nup43 OS=Dictyostelium discoideum
GN=nup43 PE=3 SV=1
Length = 417
Score = 52.0 bits (123), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 55/199 (27%), Positives = 84/199 (42%), Gaps = 38/199 (19%)
Query: 129 VSVGEDGRVNLVSV--IGGGEFKRVYDGNGLVGFNCVKWASPTEFVTGG-FGYGLQWWDL 185
V+VG DG +NL+S+ + R DG + N VK + + +T G G +++WD+
Sbjct: 206 VTVGNDGVMNLLSIENLIPIYTNRYIDG---LSVNVVKSITSNQIITSGELGRYIKFWDI 262
Query: 186 RRAGSGGPVLQFKGNFGSQGKASGMIHSVDIHPSRKHTCLAGGSSGTVFAWDLRWQQQSI 245
R + S PV K I S+ IH +H AG S G V +D+R
Sbjct: 263 R-SNSSQPVKTIK-------TQCPRIFSLAIHKDEQHIIAAGSSDGQVNLFDIR------ 308
Query: 246 VLSGVGTGEAVTHSLSESEVWEVQYDCHTKSSNISSSRVLPAMFCSEDGILAVVEQGEEP 305
+ + TH+ S VWE+ + S S CSEDG + ++
Sbjct: 309 --NDYSIDQNKTHN---SNVWELSF---------SKSNPNQLYSCSEDGFIYQYSYNKDN 354
Query: 306 F---ELLAEPCAI-NSFDI 320
++ P I N+FDI
Sbjct: 355 LGGGGMMTLPNQINNTFDI 373
>sp|A5DB75|SEC31_PICGU Protein transport protein SEC31 OS=Meyerozyma guilliermondii
(strain ATCC 6260 / CBS 566 / DSM 6381 / JCM 1539 / NBRC
10279 / NRRL Y-324) GN=SEC31 PE=3 SV=2
Length = 1266
Score = 43.5 bits (101), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 44/86 (51%), Gaps = 8/86 (9%)
Query: 160 FNCVKWASPTE-----FVTGGFGYG-LQWWDLRRAGSGGPVLQFKGNFGSQGKASGMIHS 213
F + W+ P+E + G F G ++ WD++ + LQ F S SG + +
Sbjct: 65 FYAIAWSKPSEGRSKGVLAGAFENGTIELWDVQELITSK-DLQKASIFKSSAH-SGPVKT 122
Query: 214 VDIHPSRKHTCLAGGSSGTVFAWDLR 239
+ +P ++H L+GGS+G +F WD +
Sbjct: 123 LQFNPLQEHVLLSGGSNGQIFVWDTK 148
>sp|Q810D6|GRWD1_MOUSE Glutamate-rich WD repeat-containing protein 1 OS=Mus musculus
GN=Grwd1 PE=2 SV=2
Length = 446
Score = 40.8 bits (94), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 12/89 (13%)
Query: 151 VYDGNGLVGFNCVKWASPTEFV-TGGFGYGLQWWDLRRAGSGGPVLQFKGNFGSQGKASG 209
+DG+ N + W+ F+ +GG L+ WDLR+ SG PV FK +
Sbjct: 306 AHDGD----VNVISWSRREPFLLSGGDDGALKVWDLRQFKSGSPVATFKQHMAP------ 355
Query: 210 MIHSVDIHPSRKHTCLAGGSSGTVFAWDL 238
+ SV+ HP A G+ + WDL
Sbjct: 356 -VTSVEWHPQDSGVFAASGADNQITQWDL 383
>sp|Q1JQD2|GRWD1_BOVIN Glutamate-rich WD repeat-containing protein 1 OS=Bos taurus
GN=GRWD1 PE=2 SV=1
Length = 446
Score = 40.4 bits (93), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 12/89 (13%)
Query: 151 VYDGNGLVGFNCVKWASPTEFV-TGGFGYGLQWWDLRRAGSGGPVLQFKGNFGSQGKASG 209
+DG+ N + W+ F+ +GG L+ WDLR+ SG PV FK +
Sbjct: 306 AHDGD----VNVINWSHREPFLLSGGDDGALKVWDLRQFKSGSPVATFKQHVAP------ 355
Query: 210 MIHSVDIHPSRKHTCLAGGSSGTVFAWDL 238
+ SV+ HP A G+ + WDL
Sbjct: 356 -VTSVEWHPQDSGVFAASGADNQITQWDL 383
>sp|Q9BQ67|GRWD1_HUMAN Glutamate-rich WD repeat-containing protein 1 OS=Homo sapiens
GN=GRWD1 PE=1 SV=1
Length = 446
Score = 40.0 bits (92), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 12/89 (13%)
Query: 151 VYDGNGLVGFNCVKWASPTEFV-TGGFGYGLQWWDLRRAGSGGPVLQFKGNFGSQGKASG 209
+DG+ N + W+ F+ +GG L+ WDLR+ SG PV FK +
Sbjct: 306 AHDGD----VNVISWSRREPFLLSGGDDGALKIWDLRQFKSGSPVATFKQHVAP------ 355
Query: 210 MIHSVDIHPSRKHTCLAGGSSGTVFAWDL 238
+ SV+ HP A G+ + WDL
Sbjct: 356 -VTSVEWHPQDSGVFAASGADHQITQWDL 383
>sp|Q8SRB0|GBLP_ENCCU Guanine nucleotide-binding protein subunit beta-like protein
OS=Encephalitozoon cuniculi (strain GB-M1) GN=ECU08_1110
PE=1 SV=1
Length = 334
Score = 39.7 bits (91), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 96/215 (44%), Gaps = 29/215 (13%)
Query: 112 GFHVGPISGVDLT--GSECVSVGEDGRVNLVSVIGGGEFKR--VYDGNGLVGFNCVKWAS 167
G H I+G+D++ G+ V+VG DG + I E K+ + +G+G ++
Sbjct: 64 GGHSKRINGLDVSKDGNMMVTVGSDG----IGRIWDTESKKSILLEGHGRDVLCVSINSN 119
Query: 168 PTEFVTGGFGYGLQWWDLRRAGSGGPVLQFKGNFGSQGKASGMIHSVDIHPSRKHTCLAG 227
T+ VTG + ++ + G VL+ + + G I+ V HP+ + +G
Sbjct: 120 DTKIVTGSVDRTMNLYNTK----GDLVLKMGRDMEMMHR--GWINCVTFHPTEESILASG 173
Query: 228 GSSGTVFAWDLRWQQQSIVLSGVGTGEAVTHSLSESEVWEVQYDCHTKSSNISSSRVLPA 287
+ GTV WDL Q+ + + +G E + V YD S+ + A
Sbjct: 174 SADGTVKIWDLDTQEH--LQTYLGGAYVDYEKAKEKKTSPVDYD---------ESKSVTA 222
Query: 288 MFCSEDG-ILAVVEQGEEPF--ELLAEPCAINSFD 319
M S+DG IL E+ + + ++ ++ C I SFD
Sbjct: 223 MAFSKDGSILTYGEKSGKMYLVKVDSKEC-IQSFD 256
>sp|Q6BRR2|SEC31_DEBHA Protein transport protein SEC31 OS=Debaryomyces hansenii (strain
ATCC 36239 / CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968)
GN=SEC31 PE=3 SV=2
Length = 1265
Score = 39.7 bits (91), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 51/111 (45%), Gaps = 16/111 (14%)
Query: 138 NLVSVIGGGEFKRVYDGNGLVGFNCVKWASPTE-----FVTGGFGYGL-QWWD---LRRA 188
++ S GG K V+ + F + W+ P E + G F G+ ++WD L ++
Sbjct: 46 DIFSPTNGG--KPVFSASVDNRFYALAWSKPFEGRPKGLLAGAFENGVVEFWDAEVLIKS 103
Query: 189 GSGGPVLQFKGNFGSQGKASGMIHSVDIHPSRKHTCLAGGSSGTVFAWDLR 239
KG K SG + ++ +P++ H + GGS+G +F WD +
Sbjct: 104 KDLNKASVHKGT-----KHSGPVKTLSFNPNQDHVLVTGGSNGEIFIWDTK 149
>sp|Q5XI13|GRWD1_RAT Glutamate-rich WD repeat-containing protein 1 OS=Rattus norvegicus
GN=Grwd1 PE=2 SV=1
Length = 445
Score = 39.7 bits (91), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 12/89 (13%)
Query: 151 VYDGNGLVGFNCVKWASPTEFV-TGGFGYGLQWWDLRRAGSGGPVLQFKGNFGSQGKASG 209
+DG+ N + W+ F+ +GG L+ WDLR+ SG PV FK +
Sbjct: 305 AHDGD----VNVISWSRREPFLLSGGDDGTLKVWDLRQFKSGSPVATFKQHVAP------ 354
Query: 210 MIHSVDIHPSRKHTCLAGGSSGTVFAWDL 238
+ SV+ HP A G+ + WDL
Sbjct: 355 -VTSVEWHPQDSGVFAASGADNQITQWDL 382
>sp|Q5AAU3|SEC31_CANAL Protein transport protein SEC31 OS=Candida albicans (strain SC5314
/ ATCC MYA-2876) GN=SEC31 PE=3 SV=1
Length = 1265
Score = 39.3 bits (90), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 22/86 (25%), Positives = 39/86 (45%), Gaps = 8/86 (9%)
Query: 160 FNCVKWASPTE-----FVTGGFGYG-LQWWDLRRAGSGGPVLQFKGNFGSQGKASGMIHS 213
F + W+ P E + G F G +++WD + K + K +G + S
Sbjct: 66 FYALAWSKPFEGRPQGLLAGAFENGTVEFWDADVLIKTKDLA--KASVHKSNKHTGAVKS 123
Query: 214 VDIHPSRKHTCLAGGSSGTVFAWDLR 239
+ +P + H + GGS+G +F WD +
Sbjct: 124 LQFNPIQNHVLVTGGSNGQIFIWDTK 149
>sp|A3GFK8|SEC31_PICST Protein transport protein SEC31 OS=Scheffersomyces stipitis (strain
ATCC 58785 / CBS 6054 / NBRC 10063 / NRRL Y-11545)
GN=SEC31 PE=3 SV=2
Length = 1244
Score = 38.1 bits (87), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 22/86 (25%), Positives = 38/86 (44%), Gaps = 8/86 (9%)
Query: 160 FNCVKWASPTE-----FVTGGFGYG-LQWWDLRRAGSGGPVLQFKGNFGSQGKASGMIHS 213
F + W+ P E + F G +++WD + K + K SG + S
Sbjct: 66 FYALAWSKPFEGRPRGLIAAAFENGVIEFWDAEVLIISKDLA--KASVHKSSKHSGPVRS 123
Query: 214 VDIHPSRKHTCLAGGSSGTVFAWDLR 239
+ +P + H ++GGS G +F WD +
Sbjct: 124 LQFNPLQSHVLVSGGSHGQIFIWDTK 149
>sp|Q6FNU4|SEC31_CANGA Protein transport protein SEC31 OS=Candida glabrata (strain ATCC
2001 / CBS 138 / JCM 3761 / NBRC 0622 / NRRL Y-65)
GN=SEC31 PE=3 SV=1
Length = 1281
Score = 37.7 bits (86), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 23/116 (19%), Positives = 53/116 (45%), Gaps = 5/116 (4%)
Query: 124 TGSECVSVGEDGRVNLVSVIGGGEFKRVYDGNGLVGFNCVKWASPTEFVTGGFGYGLQWW 183
+G+ + ++ L S++G K + + F + W++ +++ G G
Sbjct: 29 SGTIDADFSNESKLELWSLLGTDADKPSHSVSTDAKFKDLDWSADNKYIAGAMDNGA--V 86
Query: 184 DLRRAGSGGPVLQFKGNFGSQGKASGMIHSVDIHPSRKHTCLAGGSSGTVFAWDLR 239
++ S L+ +G+F + S ++ +V + + + L+GG+S +F WDL
Sbjct: 87 EIYEFNSSKQELKKQGSFKNH---STVVRTVKFNSKQNNVLLSGGNSQEIFIWDLN 139
>sp|Q5A7Q3|PRP46_CANAL Pre-mRNA-splicing factor PRP46 OS=Candida albicans (strain SC5314 /
ATCC MYA-2876) GN=PRP46 PE=3 SV=1
Length = 389
Score = 37.4 bits (85), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 42/90 (46%), Gaps = 11/90 (12%)
Query: 162 CVKWASPTEFVTGGFGYGLQWWDLRRAGS--GGPVLQFKGNFGSQGKASGMIHSVDIHPS 219
C+ P F +GG L+ WDL R+ S G + + G+ G ++S+ +HP
Sbjct: 135 CISKRHPYLF-SGGEDKSLRCWDLERSNSDAGCQIRSYHGHLGG-------VYSIGLHPE 186
Query: 220 RKHTCLAGGSSGTVFAWDLRWQQQSIVLSG 249
+GG V WD+R + +++ L G
Sbjct: 187 LD-VLFSGGKDCVVRVWDIRSRVEAMTLLG 215
>sp|Q9VAT2|DCA10_DROME DDB1- and CUL4-associated factor 10 homolog OS=Drosophila
melanogaster GN=CG1523 PE=1 SV=1
Length = 621
Score = 37.4 bits (85), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 29/124 (23%), Positives = 53/124 (42%), Gaps = 11/124 (8%)
Query: 125 GSECVSVGEDGRVNLVSVIGGGEFKRVYDGNGLVGFNCVKWASPTEFVTGGFGYGLQWWD 184
G+ V+ E V + I E +V D + NC+K+ F TG + + WD
Sbjct: 60 GNVVVAATERKCVLVFDAITQKEIFKVPDAH-TDSVNCIKFFDERLFATGSDDFTVALWD 118
Query: 185 LRRAGSGGPVLQFKGNFGSQGKASGMIHSVDIHPSRKHTCLAGGSSGTVFAWDLRWQQQS 244
LR VL N+ + +++ + S+ ++ G G++F WD+ Q +
Sbjct: 119 LRNMKQKLRVLHGHSNW---------VKNIE-YSSKDKLLVSSGFDGSIFTWDINSQTEQ 168
Query: 245 IVLS 248
++S
Sbjct: 169 GLIS 172
>sp|A5DTX3|SEC31_LODEL Protein transport protein SEC31 OS=Lodderomyces elongisporus
(strain ATCC 11503 / CBS 2605 / JCM 1781 / NBRC 1676 /
NRRL YB-4239) GN=SEC31 PE=3 SV=1
Length = 953
Score = 37.4 bits (85), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 23/89 (25%), Positives = 42/89 (47%), Gaps = 13/89 (14%)
Query: 160 FNCVKWASPTE-----FVTGGFGYG-LQWWD---LRRAGSGGPVLQFKGNFGSQGKASGM 210
F + W+ P E + G F G +++WD L ++ + L+ S + G
Sbjct: 66 FYALAWSKPFENRPKGLIAGAFEDGTVEFWDADVLIKSKN----LKKASVHKSTKHSGGA 121
Query: 211 IHSVDIHPSRKHTCLAGGSSGTVFAWDLR 239
+ S+ +P + H + GGS+G +F WD +
Sbjct: 122 VKSLQFNPIQHHVLVTGGSNGQIFVWDTK 150
>sp|A8PT44|YTM1_MALGO Ribosome biogenesis protein YTM1 OS=Malassezia globosa (strain ATCC
MYA-4612 / CBS 7966) GN=YTM1 PE=3 SV=1
Length = 528
Score = 35.0 bits (79), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 27/119 (22%), Positives = 47/119 (39%), Gaps = 15/119 (12%)
Query: 134 DGRVNLVSVIGGGEF--KRVYDG-----NGLVGFNCVKWASPTEFVTGGFGYGLQWWDLR 186
DG V V+ GG ++ D + L + + +P + V G + L +D R
Sbjct: 358 DGTVKYWDVVAGGGLAGRKQTDKVPLCLDPLQSLSLSSFTTP-QLVCGHMDHSLALYDFR 416
Query: 187 RAGSGGPVLQFKGNFGSQGKASGMIHSVDIHPSRKHTCLAGGSSGTVFAWDLRWQQQSI 245
A S G + + +V +HP H +G G + WD+R +Q++
Sbjct: 417 DAVS-------NAALAMSGAHAAPVSAVKVHPVSAHLFASGAYDGRIKVWDVRSPKQAL 468
>sp|Q54JS5|WDR24_DICDI WD repeat-containing protein 24 homolog OS=Dictyostelium discoideum
GN=wdr24 PE=3 SV=1
Length = 1023
Score = 35.0 bits (79), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 41/189 (21%), Positives = 78/189 (41%), Gaps = 33/189 (17%)
Query: 69 TPPTRISSLSTSHQSLVAASTLSGSLHVLSSAMQVLLSVDENSGFHVGPISGVDLTGSEC 128
+P + ISS S + +VA + + V ++ +V S G ++ TG++C
Sbjct: 19 SPLSAISSSPDSTKLIVAGRDIVKIVSVQNNEFKVT------SNLRAGKTQSLNYTGNDC 72
Query: 129 V---SVGEDGRVNLVSVIGGGE---FKRVYDGNGLV---------GFNCVKWASPTEF-- 171
S+ E+ R + + G + V +G+ V N + W P +
Sbjct: 73 CWHPSLLENYRFLIATAATNGAVVIWNTVREGSKSVERVFTDHSRAVNKLAW-HPEKLDC 131
Query: 172 -VTGGFGYGLQWWDLRRAGSGGPVLQFKGNFGSQGKASGMIHSVDIHPSRKHTCLAGGSS 230
+TG L+ WD+R + + + F + ++ I V +PS+ + A +
Sbjct: 132 ILTGSQDNTLRMWDIRDSANASKIT-----FSPKSES---IRDVQFNPSQANQFAAAFDN 183
Query: 231 GTVFAWDLR 239
GT+ WD+R
Sbjct: 184 GTIQLWDIR 192
>sp|Q6BU94|PRP46_DEBHA Pre-mRNA-splicing factor PRP46 OS=Debaryomyces hansenii (strain
ATCC 36239 / CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968)
GN=PRP46 PE=3 SV=2
Length = 417
Score = 34.7 bits (78), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 42/183 (22%), Positives = 71/183 (38%), Gaps = 35/183 (19%)
Query: 171 FVTGGFGYGLQWWDLRRAGS--GGPVLQFKGNFGSQGKASGMIHSVDIHPSRKHTCLAGG 228
+G ++ WDL R S G + + G+ G I+++ +HP GG
Sbjct: 179 LFSGSEDKTVKCWDLERTNSSSGCQIRNYHGHVGG-------IYAMALHPELD-LLFTGG 230
Query: 229 SSGTVFAWDLRWQQQSIVLSG-----------VGTGEAVTHSLSES-EVWEVQYDC---- 272
+ WDLR + + +VLSG +G + +T S+ + +W+++
Sbjct: 231 RDSVIRVWDLRSRTEIMVLSGHRSDITSIASQIGDPQIITSSMDATIRLWDIRKATTQLA 290
Query: 273 ---HTKSSNISSSRVLPAMFCSEDGILAVVEQGEEPFELL------AEPCAINSFDIDPQ 323
H+KS + CS D + E ELL E IN+ I+P
Sbjct: 291 LTHHSKSIRSMAMHPQEMTMCSGDTSGNLKEWLLPGGELLNEFGHSGENKIINTLSINPS 350
Query: 324 NPS 326
N +
Sbjct: 351 NNT 353
>sp|A8IR43|WDR12_CHLRE Ribosome biogenesis protein WDR12 homolog OS=Chlamydomonas
reinhardtii GN=CHLREDRAFT_128420 PE=3 SV=2
Length = 444
Score = 33.9 bits (76), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 41/100 (41%), Gaps = 7/100 (7%)
Query: 143 IGGGEFKRVYDGNGLVGFNCVKWASPTEFVTGGFGYGLQWWDLR-RAGSGGPVLQFKGNF 201
+ G Y+G+ V ASP G L+ WD R +AGS + +G
Sbjct: 302 VSSGAAADTYNGSKAVLCIASHAASPALVAFGCSDRALRLWDTRGKAGSDALAVTTQGAH 361
Query: 202 GSQGKASGMIHSVDIHPSRKHTCLAGGSSGTVFAWDLRWQ 241
G G + +V PS +H + GT+ WD+R Q
Sbjct: 362 G------GWVTAVAWCPSSQHHIASASHDGTIKMWDIRTQ 395
>sp|A7TMF9|YTM1_VANPO Ribosome biogenesis protein YTM1 OS=Vanderwaltozyma polyspora
(strain ATCC 22028 / DSM 70294) GN=YTM1 PE=3 SV=1
Length = 453
Score = 33.5 bits (75), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 24/106 (22%), Positives = 42/106 (39%), Gaps = 6/106 (5%)
Query: 146 GEFKRVYDGNGLVGFNCVKWASPTEFVTGGFGYGLQWWDLRRAGSGGPVLQFKGNFGSQG 205
G+ ++ Y G+ VK+ S T V+GG L+ W + PV+ G
Sbjct: 133 GKIEKQYSGHS-APIRAVKYISNTRMVSGGNDRTLRLWKTKNEDLKQPVVDEDDEDIEDG 191
Query: 206 KASGMIHS-----VDIHPSRKHTCLAGGSSGTVFAWDLRWQQQSIV 246
K ++ V I S L+G T+ W +++ ++V
Sbjct: 192 KTLAILEGHKAPVVSIDVSDNSRILSGSYDNTIGFWSTIYKEMTVV 237
>sp|Q7Z5J8|ANKAR_HUMAN Ankyrin and armadillo repeat-containing protein OS=Homo sapiens
GN=ANKAR PE=2 SV=3
Length = 1434
Score = 33.1 bits (74), Expect = 3.1, Method: Composition-based stats.
Identities = 15/37 (40%), Positives = 24/37 (64%)
Query: 216 IHPSRKHTCLAGGSSGTVFAWDLRWQQQSIVLSGVGT 252
+H + K CL G + T+FA++ R+QQ I+ SG+ T
Sbjct: 1136 LHSTEKDICLRAGYALTLFAFNNRFQQYLILESGIMT 1172
>sp|Q9P2H3|IFT80_HUMAN Intraflagellar transport protein 80 homolog OS=Homo sapiens
GN=IFT80 PE=1 SV=3
Length = 777
Score = 33.1 bits (74), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 45/94 (47%), Gaps = 6/94 (6%)
Query: 53 THSISLPASLTP-HSQWTPPTRISSLSTSHQSLVAASTLSGSLHVLSSAMQVLLSVDENS 111
T + LP + P W P + T +S V S+ G H++S +V SV+ +
Sbjct: 48 TQIVKLPDDIYPIDFHWFPKSLGVKKQTQAESFVLTSS-DGKFHLISKLGRVEKSVEAHC 106
Query: 112 GFHVGPISGV-DLTGSECVSVGEDGRVNLVSVIG 144
G ++G + G+ V+VGEDG++ + S G
Sbjct: 107 G---AVLAGRWNYEGTALVTVGEDGQIKIWSKTG 137
>sp|Q10281|GBLP_SCHPO Guanine nucleotide-binding protein subunit beta-like protein
OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=rkp1 PE=1 SV=3
Length = 314
Score = 33.1 bits (74), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 29/115 (25%), Positives = 50/115 (43%), Gaps = 15/115 (13%)
Query: 127 ECVSVGEDGRVNLVSVIGGGEFKRVYDGNGLVGFNCVKWA---SPTEFVTGGFGYGLQWW 183
+ VS D + + ++IG ++ + DG +CV+++ FV+ G+ ++ W
Sbjct: 119 QVVSGSRDKTIKIWNIIGNCKYT-ITDGGHSDWVSCVRFSPNPDNLTFVSAGWDKAVKVW 177
Query: 184 DLRRAGSGGPVLQFKGNFGSQGKASGMIHSVDIHPSRKHTCLAGGSSGTVFAWDL 238
DL F G +G + +V I P C +GG GT+ WDL
Sbjct: 178 DLE---------TFSLRTSHYGH-TGYVSAVTISPDGS-LCASGGRDGTLMLWDL 221
>sp|O13168|TLE1_DANRE Protein groucho-1 (Fragment) OS=Danio rerio GN=gro1 PE=1 SV=1
Length = 535
Score = 33.1 bits (74), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 52/115 (45%), Gaps = 17/115 (14%)
Query: 163 VKWASPTEFV-TGGFGYGLQWWDLRRAGSGGPVLQF----KGNFGSQGKASGMIHSVDIH 217
V ++PT V TGG G ++ WD+ + GS PV Q + N+ I S +
Sbjct: 255 VTISNPTRHVYTGGKGC-VKIWDISQPGSKSPVSQLDCLNRDNY---------IRSCKLL 304
Query: 218 PSRKHTCLAGGSSGTVFAWDLRWQQQSIVLSGVGTGEAVTHSLSESEVWEVQYDC 272
P + T +AGG T+ WDL Q I + A ++L+ S +V + C
Sbjct: 305 PDGR-TLIAGGEVSTLTIWDLASQTPRIKAELTSSAPAC-YALAISPDAKVCFSC 357
>sp|E9Q349|WDR25_MOUSE WD repeat-containing protein 25 OS=Mus musculus GN=Wdr25 PE=2 SV=1
Length = 535
Score = 32.7 bits (73), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 30/133 (22%), Positives = 52/133 (39%), Gaps = 20/133 (15%)
Query: 114 HVGPISGVDLTGSEC-----VSVGEDGRVNLVSVIGGGEFKRVYDGNGLVGFNCVKWASP 168
H GP++ + C +S D + + + G + Y + +W SP
Sbjct: 236 HRGPVNSIQWCPVFCKSHMLLSASMDKTFKVWNAVDSGHCLQTYSVHS-EAVRAARW-SP 293
Query: 169 T--EFVTGGFGYGLQWWDLRRAGSGGPVLQFKGNFGSQGKASGMIHSVDIHPSRKHTCLA 226
++GGF + L DL +G V + +F + ++ HP + L
Sbjct: 294 CGRRILSGGFDFALHLTDLE---TGTQVFSGQSDF--------RVTTLKFHPKDHNVFLC 342
Query: 227 GGSSGTVFAWDLR 239
GG S + AWD+R
Sbjct: 343 GGFSSEIKAWDMR 355
>sp|Q54ED4|GRWD1_DICDI Glutamate-rich WD repeat-containing protein 1 OS=Dictyostelium
discoideum GN=grwd1 PE=3 SV=1
Length = 482
Score = 32.7 bits (73), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 21/86 (24%), Positives = 36/86 (41%), Gaps = 9/86 (10%)
Query: 161 NCVKWASPTEF--VTGGFGYGLQWWDLRRAGSGGPVLQFKGNFGSQGKASGMIHSVDIHP 218
N + W+ E+ V+G + WDLR PV FK + +G I S++ +P
Sbjct: 343 NVISWSRNVEYLLVSGCDDGSFRVWDLRAFKDNSPVSDFKYH-------TGPITSIEWNP 395
Query: 219 SRKHTCLAGGSSGTVFAWDLRWQQQS 244
+ + S V WD ++ +
Sbjct: 396 YEESQVIVSSSDDQVTIWDFSLEEDT 421
>sp|Q6C709|PRP46_YARLI Pre-mRNA-splicing factor PRP46 OS=Yarrowia lipolytica (strain CLIB
122 / E 150) GN=PRP46 PE=3 SV=2
Length = 441
Score = 32.7 bits (73), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 29/135 (21%), Positives = 55/135 (40%), Gaps = 12/135 (8%)
Query: 171 FVTGGFGYGLQWWDLRRAGSGGPVLQFKGNFGSQGKASGMIHSVDIHPSRKHTCLAGGSS 230
+GG ++ WDL + V + G+ + ++S+DIHP+ ++ G
Sbjct: 189 MFSGGEDKMVKCWDLE---TNKVVRHYHGHLSA-------VYSLDIHPTLD-VLVSAGRD 237
Query: 231 GTVFAWDLRWQQQSIVLSG-VGTGEAVTHSLSESEVWEVQYDCHTKSSNISSSRVLPAMF 289
WD+R + +VLSG T V SE +V D + N+ + + + +
Sbjct: 238 AVARVWDIRTRDPVVVLSGHKSTINRVKFQASEPQVITASADETVRLWNLQAGKTMTTLT 297
Query: 290 CSEDGILAVVEQGEE 304
+ + + EE
Sbjct: 298 HHKKSVRGLTLHPEE 312
>sp|Q9WVB3|TLE6_MOUSE Transducin-like enhancer protein 6 OS=Mus musculus GN=Tle6 PE=1
SV=1
Length = 581
Score = 32.7 bits (73), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 41/173 (23%), Positives = 72/173 (41%), Gaps = 19/173 (10%)
Query: 155 NGLVGFNCVKWASPTEFVTGGFGY-GLQWWDLRRAGSGGPVLQFKGNFGSQGKASGMIHS 213
N + C+ ++ GG+ G+ WDL P L K +G + + +
Sbjct: 342 NKVYLRTCLLSSNSRTLFAGGYNLPGVIVWDL-----AAPSLYEKCQLPCEGLSCQALAN 396
Query: 214 VDIHPSRKHTCLAGGSSGTVFAWDLRWQQQSIVLSGVGTGEAVTHSLSESEVWEVQYDCH 273
++++ LAG + GTV WDLR Q+ L G T A + + +W D
Sbjct: 397 -----TKENMALAGFTDGTVRIWDLRTQEIVRNLKG-PTNSARNLVVKDDNIWTGGLDAC 450
Query: 274 TKSSNISSSRV-LPAMFCSEDGILAVVEQGEEPFELL----AEPCAINSFDID 321
+ ++ ++V L +F S+ I+++ E + LL + C NS D
Sbjct: 451 LRCWDLRMAKVSLEHLFQSQ--IMSLAHSPTEDWLLLGLANGQHCLFNSRKRD 501
>sp|O13166|TLE2_DANRE Protein groucho-2 OS=Danio rerio GN=gro2 PE=1 SV=1
Length = 761
Score = 32.7 bits (73), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 52/115 (45%), Gaps = 17/115 (14%)
Query: 163 VKWASPTEFV-TGGFGYGLQWWDLRRAGSGGPVLQF----KGNFGSQGKASGMIHSVDIH 217
V ++PT V TGG G ++ WD+ + GS PV Q + N+ I S +
Sbjct: 481 VTISNPTRHVYTGGKGC-VKIWDISQPGSKSPVSQLDCLNRDNY---------IRSCKLL 530
Query: 218 PSRKHTCLAGGSSGTVFAWDLRWQQQSIVLSGVGTGEAVTHSLSESEVWEVQYDC 272
P + T + GG + T+ WDL Q I + A ++L+ S +V + C
Sbjct: 531 PDGR-TLIVGGEASTLTIWDLASQTPRIKAELTSSAPAC-YALAISPDAKVCFSC 583
>sp|Q6CKE8|PRP46_KLULA Pre-mRNA-splicing factor PRP46 OS=Kluyveromyces lactis (strain ATCC
8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 /
WM37) GN=PRP46 PE=3 SV=1
Length = 434
Score = 32.3 bits (72), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 31/67 (46%), Gaps = 11/67 (16%)
Query: 183 WDLRRAGSGGPVLQFKGNFGSQGKASGMIHSVDIHPSRKHTCLAGGSSGTVFAWDLRWQQ 242
WDL R + + F G+ +H+VD+HPS A G V WD+R +
Sbjct: 192 WDLERNTA---IRDFHGHLSG-------VHTVDVHPSLDIIATA-GRDAVVRLWDIRSRS 240
Query: 243 QSIVLSG 249
+ +VL G
Sbjct: 241 EIMVLPG 247
>sp|P16371|GROU_DROME Protein groucho OS=Drosophila melanogaster GN=gro PE=1 SV=3
Length = 730
Score = 32.3 bits (72), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 41/81 (50%), Gaps = 16/81 (19%)
Query: 163 VKWASPTEFV-TGGFGYGLQWWDLRRAGSGGPVLQF----KGNFGSQGKASGMIHSVDIH 217
V ++PT++V TGG G ++ WD+ + G+ PV Q + N+ I SV +
Sbjct: 450 VTISNPTKYVYTGGKGC-VKVWDISQPGNKNPVSQLDCLQRDNY---------IRSVKLL 499
Query: 218 PSRKHTCLAGGSSGTVFAWDL 238
P + T + GG + + WDL
Sbjct: 500 PDGR-TLIVGGEASNLSIWDL 519
>sp|Q8K057|IFT80_MOUSE Intraflagellar transport protein 80 homolog OS=Mus musculus
GN=Ift80 PE=2 SV=1
Length = 777
Score = 32.0 bits (71), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 6/91 (6%)
Query: 56 ISLPASLTPHS-QWTPPTRISSLSTSHQSLVAASTLSGSLHVLSSAMQVLLSVDENSGFH 114
+ LP + P W P + T +S V S+ G H++S +V SV+ + G
Sbjct: 51 VKLPDDIYPIDLHWFPKSLGIKKQTQAESFVLTSS-DGKFHLISKLGRVEKSVEAHCG-- 107
Query: 115 VGPISGV-DLTGSECVSVGEDGRVNLVSVIG 144
++G + G+ V+VGEDG+V + S G
Sbjct: 108 -AVLAGRWNYEGTALVTVGEDGQVKIWSKTG 137
>sp|A1DHK2|SEC31_NEOFI Protein transport protein sec31 OS=Neosartorya fischeri (strain
ATCC 1020 / DSM 3700 / FGSC A1164 / NRRL 181) GN=sec31
PE=3 SV=1
Length = 1263
Score = 32.0 bits (71), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 14/51 (27%), Positives = 27/51 (52%)
Query: 201 FGSQGKASGMIHSVDIHPSRKHTCLAGGSSGTVFAWDLRWQQQSIVLSGVG 251
GS + I +D + H + G S+G V WD++ +++S+ L+ +G
Sbjct: 159 LGSAAARADDIECLDWNKKVAHILVTGSSAGFVTVWDVKTKKESLTLNNMG 209
>sp|Q66HB3|IFT80_RAT Intraflagellar transport protein 80 homolog OS=Rattus norvegicus
GN=Ift80 PE=2 SV=1
Length = 777
Score = 32.0 bits (71), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 6/91 (6%)
Query: 56 ISLPASLTPHS-QWTPPTRISSLSTSHQSLVAASTLSGSLHVLSSAMQVLLSVDENSGFH 114
+ LP + P W P + T +S V S+ G H++S +V SV+ + G
Sbjct: 51 VKLPDDIYPIDLHWFPKSLGIKKQTQAESFVLTSS-DGKFHLISKLGRVEKSVEAHCG-- 107
Query: 115 VGPISGV-DLTGSECVSVGEDGRVNLVSVIG 144
++G + G+ V+VGEDG+V + S G
Sbjct: 108 -AVLAGRWNYEGTALVTVGEDGQVKIWSKTG 137
>sp|Q9VYN8|TENA_DROME Teneurin-a OS=Drosophila melanogaster GN=Ten-a PE=1 SV=2
Length = 3004
Score = 32.0 bits (71), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 37/79 (46%), Gaps = 12/79 (15%)
Query: 65 HSQWTPPTRISSLSTSHQSL-----VAASTLSGSLHVLSSAMQVLLSVDENSGFHVGPIS 119
H+ W+P +L + L +A S L GSLH + + + L+ D + ++
Sbjct: 1400 HNHWSPAPCSGTLMANQAQLQWPTGLALSPLDGSLHFIDDRLVLRLTSD----MKIRVVA 1455
Query: 120 GVDLTGSECVSVGEDGRVN 138
G L C + G+DGRVN
Sbjct: 1456 GTPL---HCSNGGQDGRVN 1471
>sp|Q4X0M4|SEC31_ASPFU Protein transport protein sec31 OS=Neosartorya fumigata (strain
ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100)
GN=sec31 PE=3 SV=1
Length = 1263
Score = 32.0 bits (71), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 14/51 (27%), Positives = 27/51 (52%)
Query: 201 FGSQGKASGMIHSVDIHPSRKHTCLAGGSSGTVFAWDLRWQQQSIVLSGVG 251
GS + I +D + H + G S+G V WD++ +++S+ L+ +G
Sbjct: 159 LGSAAARADDIECLDWNKKVAHILVTGSSAGFVTVWDVKTRKESLTLNNMG 209
>sp|A1C6X5|SEC31_ASPCL Protein transport protein sec31 OS=Aspergillus clavatus (strain
ATCC 1007 / CBS 513.65 / DSM 816 / NCTC 3887 / NRRL 1)
GN=sec31 PE=3 SV=1
Length = 1276
Score = 32.0 bits (71), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 14/51 (27%), Positives = 27/51 (52%)
Query: 201 FGSQGKASGMIHSVDIHPSRKHTCLAGGSSGTVFAWDLRWQQQSIVLSGVG 251
GS + I +D + H + G S+G V WD++ +++S+ L+ +G
Sbjct: 159 LGSTAARADDIECLDWNKKVAHILVTGSSAGFVTVWDVKTKKESLTLNNMG 209
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.316 0.132 0.407
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 138,693,135
Number of Sequences: 539616
Number of extensions: 6173567
Number of successful extensions: 14474
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 57
Number of HSP's that attempted gapping in prelim test: 14412
Number of HSP's gapped (non-prelim): 109
length of query: 343
length of database: 191,569,459
effective HSP length: 118
effective length of query: 225
effective length of database: 127,894,771
effective search space: 28776323475
effective search space used: 28776323475
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (28.1 bits)