BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 045469
(938 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P57740|NU107_HUMAN Nuclear pore complex protein Nup107 OS=Homo sapiens GN=NUP107 PE=1
SV=1
Length = 925
Score = 101 bits (252), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 148/567 (26%), Positives = 232/567 (40%), Gaps = 98/567 (17%)
Query: 51 TFALFASLLDSALQGT-EEPPVELILSPSTSHIEACQFVVN-DHTAQLCLRIVQWLEGLA 108
T+ L ASL +Q EE V + + + S + + D + +V WLE +A
Sbjct: 218 TWRLLASLYRDRIQSALEEESVFAVTAVNASEKTVVEALFQRDSLVRQSQLVVDWLESIA 277
Query: 109 SKSL-DLESKVRGSHVGTYLPNSGVWHHT--QRYLKKGVADANT-VHHLDFDAPTREHAH 164
+ + + Y N+ HT QR L V V LD DAP R+
Sbjct: 278 KDEIGEFSDNIEFYAKSVYWENT---LHTLKQRQLTSYVGSVRPLVTELDPDAPIRQKM- 333
Query: 165 QLPDDKKQDE-SLLEDVWTLLRAGRQEEACDLCRSAGQPWRAATLCPFGVIDFSPSVEAL 223
L D ++DE LL+ ++TL+RAG EEA LC+ GQ WRAATL + + P+V
Sbjct: 334 PLDDLDREDEVRLLKYLFTLIRAGMTEEAQRLCKRCGQAWRAATLEGWKLY-HDPNV--- 389
Query: 224 IKNGRSRTLQAIELESGIGHQW-RLWKWASYCTSEKIFEQRGSKFEAAIYAAQCSNLKHV 282
NG ELE G+ + R+WK + + +E ++ +++E AIYAA NLK +
Sbjct: 390 --NG------GTELEPVEGNPYRRIWKISCWRMAE---DELFNRYERAIYAALSGNLKQL 438
Query: 283 LPICTNWETACWAMAKSWLGVQLDLELARSQPGRMEQVKSFGDEIEGSPGQMNGISQPSV 342
LP+C WE WA + + ++ E+ S V + DE E P + G
Sbjct: 439 LPVCDTWEDTVWAYFRVMVDSLVEQEIQTS-------VATL-DETEELPREYLGA----- 485
Query: 343 GPESWPVQVLNQQPRDLSALLQKLHSGEMVHEVVTQVCKEQQRQIEMKLMLGNIPHVLQL 402
+W ++ + ++ LQ ++ E +E ++ L+LG+I ++
Sbjct: 486 ---NWTLEKVFEE-------LQATDKKRVLEE-----NQEHYHIVQKFLILGDIDGLMDE 530
Query: 403 IWSWIAPSEDDQNVFRPHGDPQMIRFGAHLVLVLRY--LLTDELKDPFRKDLMDAGDLII 460
W++ S R + ++RF HL+L R L T E + ++
Sbjct: 531 FSKWLSKS-------RNNLPGHLLRFMTHLILFFRTLGLQTKE----------EVSIEVL 573
Query: 461 HMYAMFLFSEHHEELVGVYASQLARHRCIDLFVHMMELRLNSSVHVKYKIFLSAMEYLPF 520
Y L E H L+ Y L + DL V L L S F L
Sbjct: 574 KTYIQLLIREKHTNLIAFYTCHLPQ----DLAVAQYALFLESVTE-----FEQRHHCLEL 624
Query: 521 SSGDDLKGSFEEIIERVLSRSREIKLGKYDK-------STDVAEQHRLQSLQKAMVIQWL 573
+ DL + I + V+ R+ G++ T E+ RL K VI WL
Sbjct: 625 AKEADLDVA--TITKTVVENIRKKDNGEFSHHDLAPALDTGTTEEDRL----KIDVIDWL 678
Query: 574 CFTPPSTIADVKDVSAKLLLRALIHSN 600
F P +K +A ++R + S
Sbjct: 679 VFDPAQRAEALKQGNA--IMRKFLASK 703
>sp|P52590|NU107_RAT Nuclear pore complex protein Nup107 OS=Rattus norvegicus GN=Nup107
PE=1 SV=1
Length = 926
Score = 95.9 bits (237), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 131/503 (26%), Positives = 206/503 (40%), Gaps = 114/503 (22%)
Query: 100 IVQWLEGLASKSL-DLESKVRGSHVGTYLPNSGVWHHT-----QRYLKKGVADANT-VHH 152
+V WLE +A + D + + S W +T QR L + V
Sbjct: 270 VVDWLESIAKDEIGDFSDNIE------FYAKSVYWENTLHSLKQRQLLSYIGSTRPLVTE 323
Query: 153 LDFDAPTREHAHQLPDDKKQDE-SLLEDVWTLLRAGRQEEACDLCRSAGQPWRAATLCPF 211
LD DAP R+ L D ++DE LL+ ++TL+RAG EEA LC+ GQ WRAATL +
Sbjct: 324 LDPDAPIRQKM-PLDDLDREDEVRLLKYLFTLIRAGMTEEAQRLCKRCGQAWRAATLEGW 382
Query: 212 GVIDFSPSVEALIKNGRSRTLQAIELESGIGHQW-RLWKWASYCTSEKIFEQRGSKFEAA 270
+ P+V NG ELE G+ + R+WK + + +E ++ +K+E A
Sbjct: 383 K-LHHDPNV-----NG------GTELEPVEGNPYRRIWKISCWRMAE---DELFNKYERA 427
Query: 271 IYAAQCSNLKHVLPICTNWETACWAMAKSWLGVQLDLELARSQPGRMEQVKSFGDEIEGS 330
IYAA NLK +LP+C WE WA + + ++ E+ S V + DE E
Sbjct: 428 IYAALSGNLKQLLPVCDTWEDTVWAYFRVMVDSLVEQEIRTS-------VMTL-DETEEL 479
Query: 331 PGQMNGISQPSVGPESWPVQVLNQQPRDLSALLQKLHSGEMVHEVVTQVCKEQQRQIEMK 390
P + +W ++ + ++ LQ ++ E +E ++
Sbjct: 480 PRE--------YMEANWTLEKVFEE-------LQATDKKRVLEE-----NQEHYHVVQKF 519
Query: 391 LMLGNIPHVLQLIWSWIAPSEDDQNVFRPHGDPQMIRFGAHLVLVLRY--LLTDELKDPF 448
L+LG+I ++ W++ S R ++RF HL+L R L T E
Sbjct: 520 LILGDIDGLMDEFSKWLSKS-------RSSLPGHLLRFMTHLILFFRTLGLQTKE----- 567
Query: 449 RKDLMDAGDLIIHMYAMFLFSEHHEELVGVYASQLARHRCIDLFVHMMELRLNSSVHVKY 508
+ ++ Y L +E H L+ Y L + DL V +Y
Sbjct: 568 -----EVSIEVLKTYIQLLINEKHTNLIAFYTCHLPQ----DLAV------------AQY 606
Query: 509 KIFLSAM-------EYLPFSSGDDLKGSFEEIIERVLSRSREIKLGKYDK-------STD 554
+FL + + L + DL + I + V+ R+ G++ T
Sbjct: 607 ALFLEGVTECEQRHQCLELAKEADLDVA--TITKTVVENIRKKDNGEFSHHDLAPSLDTA 664
Query: 555 VAEQHRLQSLQKAMVIQWLCFTP 577
E+ RL K VI WL F P
Sbjct: 665 TTEEDRL----KIDVIDWLVFDP 683
>sp|Q8BH74|NU107_MOUSE Nuclear pore complex protein Nup107 OS=Mus musculus GN=Nup107 PE=2
SV=1
Length = 926
Score = 90.5 bits (223), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 108/408 (26%), Positives = 173/408 (42%), Gaps = 76/408 (18%)
Query: 100 IVQWLEGLASKSLDLESKVRGSHVGTYLPNSGVWHHT-----QRYLKKGVADANT-VHHL 153
+V WLE +A + S + S W +T QR L + V L
Sbjct: 270 VVDWLESIAKDEIG-----EFSDNIEFYAKSVYWENTLHSLKQRQLLSHMGSTRPLVTEL 324
Query: 154 DFDAPTREHAHQLPDDKKQDE-SLLEDVWTLLRAGRQEEACDLCRSAGQPWRAATLCPFG 212
D DAP R+ L D ++DE LL+ ++TL+RAG EEA LC+ GQ WRAATL +
Sbjct: 325 DPDAPIRQKL-PLDDLDREDEVRLLKYLFTLIRAGMTEEAQRLCKRCGQAWRAATLEGWK 383
Query: 213 VIDFSPSVEALIKNGRSRTLQAIELESGIGHQW-RLWKWASYCTSEKIFEQRGSKFEAAI 271
+ P+V NG ELE G+ + R+WK + + +E ++ +K+E AI
Sbjct: 384 LY-HDPNV-----NG------GTELEPVEGNPYRRIWKISCWRMAE---DELFNKYERAI 428
Query: 272 YAAQCSNLKHVLPICTNWETACWAMAKSWLGVQLDLELARSQPGRMEQVKSFGDEIEGSP 331
YAA NLK +LP+C WE WA + + ++ E+ S V + D E
Sbjct: 429 YAALSGNLKQLLPVCDTWEDTVWAYFRVMVDSLVEQEIRTS-------VMTQDDSEELPR 481
Query: 332 GQMNGISQPSVGPESWPVQVLNQQPRDLSALLQKLHSGEMVHEVVTQVCKEQQRQIEMKL 391
M +W ++ + ++ LQ ++ E +E ++ L
Sbjct: 482 EYMEA---------NWTLEKVFEE-------LQATDKKRVLEE-----NQEHYHIVQKFL 520
Query: 392 MLGNIPHVLQLIWSWIAPSEDDQNVFRPHGDPQMIRFGAHLVLVLRY--LLTDELKDPFR 449
+LG++ ++ W++ S + H ++RF HL+L LR L T E
Sbjct: 521 ILGDVDGLMDEFSKWLSKS---GSSLPGH----LLRFMTHLILFLRTLGLQTKE------ 567
Query: 450 KDLMDAGDLIIHMYAMFLFSEHHEELVGVYASQLARHRCIDLFVHMME 497
+ ++ Y L SE H L+ Y L + + + +E
Sbjct: 568 ----EVSIEVLKTYIQLLISEKHTSLIAFYTCHLPQDLAVAQYALFLE 611
>sp|Q10331|NU107_SCHPO Nucleoporin nup107 OS=Schizosaccharomyces pombe (strain 972 / ATCC
24843) GN=nup107 PE=1 SV=4
Length = 813
Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/340 (23%), Positives = 133/340 (39%), Gaps = 58/340 (17%)
Query: 150 VHHLDFDAPTREHAHQLPDDKKQDESLLEDVWTLLRAGRQEEACDLCRSAGQPWRAATLC 209
V +LD DA R+ D + + L R+G EE +LCR G WR+A+L
Sbjct: 225 VSNLDPDADIRDDKRLDERDDNFERQFFHTAFCLFRSGSFEELLELCRRTGNHWRSASLQ 284
Query: 210 PFGVIDFSPSVEALIKNGRSRTLQAIELESGIGHQWRLWKWASYCTSEKIFEQRGSKFEA 269
G++++ + LI + ++ E+ + L + + ++ +R +E
Sbjct: 285 --GILEYR---DNLIDD-------VLQSETSGNKRKELLRRSCLALTKN---KRIDSYER 329
Query: 270 AIYAAQCSNLKHVLPICTNWETACWAMAKSWLGVQLDLELARSQPGRMEQVKSFGDEIEG 329
A+Y A C +L VL +CT WE A WA S LD+ L+ P Q+ +
Sbjct: 330 ALYGALCGDLNSVLDVCTTWEDAMWAYYNSMTQYNLDVYLSSKAPQTETQLPPVDSGLGL 389
Query: 330 SPGQM-NGISQPSVGPESWPVQVLNQQPRDLSALLQKLHSGEMVHEVVTQVCKEQQRQIE 388
+P + N +S S+ +Q P L KL + ++ ++++ Q+E
Sbjct: 390 TPELIFNSLSNSSIA----SIQEEASHP------LIKLQT-HIICNKISEILSSAHIQLE 438
Query: 389 MKLMLGNIPHVLQLIWSWIAPSEDDQNVFRPHGDPQMIRFGAHLVLVLRY--LLTDELKD 446
+ GN P L+ P ++R H++L L+ L DE
Sbjct: 439 A-IRTGNAPESGDLV------------------TPPLLRILTHIILFLKISGLAVDEY-- 477
Query: 447 PFRKDLMDAGDLIIHMYAMFLFSEHHEELVGVYASQLARH 486
D II Y L S LV +Y L+
Sbjct: 478 --------TSDSIIQAYIELLASAKKVNLVPLYIQYLSNQ 509
>sp|Q4QMI0|RAPA_HAEI8 RNA polymerase-associated protein RapA OS=Haemophilus influenzae
(strain 86-028NP) GN=rapA PE=3 SV=1
Length = 923
Score = 37.4 bits (85), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 67/159 (42%), Gaps = 19/159 (11%)
Query: 618 PIGAHELLSFLAEPLKQLSENPDTLEDNVSENLKEFQDWSEYYSCDATYRKWLKIELENA 677
P+G L S + L+++ N L +N ++SE T R+ LKIELE
Sbjct: 590 PMGM-ALFSQFEDELEKVRSNSTALSEN---------EFSELLKQTKTVREKLKIELEKG 639
Query: 678 NVPALELSLEEKQRAIAAAQETLNMSLILLQRKENPWLVSLEDPIYESVEALYLELHATA 737
LEL+ ++A A A + + +P LV+ +++ + +L A +
Sbjct: 640 RDRLLELNSNGGKQAQALADQ-------IADEDNSPELVNFALKLFDIIGVEQEDLGANS 692
Query: 738 ILCLPSGECLSPDATMCTALMSALYSTLSEEVVLNRELM 776
I+ P+G L PD + T E+ L RE M
Sbjct: 693 IVISPTGTMLVPDFPGLKE--EGVTVTFDRELALAREEM 729
>sp|A5UAT0|RAPA_HAEIE RNA polymerase-associated protein RapA OS=Haemophilus influenzae
(strain PittEE) GN=rapA PE=3 SV=1
Length = 923
Score = 36.6 bits (83), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 67/159 (42%), Gaps = 19/159 (11%)
Query: 618 PIGAHELLSFLAEPLKQLSENPDTLEDNVSENLKEFQDWSEYYSCDATYRKWLKIELENA 677
P+G L S A+ L+++ N L +N ++S T R+ LKIELE
Sbjct: 590 PMGM-ALFSQFADELEKVRSNSTALSEN---------EFSGLLKQTKTAREKLKIELEKG 639
Query: 678 NVPALELSLEEKQRAIAAAQETLNMSLILLQRKENPWLVSLEDPIYESVEALYLELHATA 737
LEL+ ++A A A + + +P LV+ +++ + +L A +
Sbjct: 640 RDRLLELNSHGGEQAQALADQ-------IADEDNSPELVNFALKLFDIIGVEQEDLGANS 692
Query: 738 ILCLPSGECLSPDATMCTALMSALYSTLSEEVVLNRELM 776
I+ P+G L PD + T E+ L RE M
Sbjct: 693 IVISPTGTMLVPDFPGLKE--EGVTVTFDRELALAREEM 729
>sp|P44781|RAPA_HAEIN RNA polymerase-associated protein RapA OS=Haemophilus influenzae
(strain ATCC 51907 / DSM 11121 / KW20 / Rd) GN=rapA PE=3
SV=1
Length = 923
Score = 36.2 bits (82), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 67/159 (42%), Gaps = 19/159 (11%)
Query: 618 PIGAHELLSFLAEPLKQLSENPDTLEDNVSENLKEFQDWSEYYSCDATYRKWLKIELENA 677
P+G L S A+ L+++ N L +N ++S T R+ LKIELE
Sbjct: 590 PMGM-ALFSQFADELEKVRSNSTALSEN---------EFSGLLKQTKTAREKLKIELEKG 639
Query: 678 NVPALELSLEEKQRAIAAAQETLNMSLILLQRKENPWLVSLEDPIYESVEALYLELHATA 737
LEL+ ++A A A + + +P LV+ +++ + +L A +
Sbjct: 640 RDRLLELNSHGGEQAQALADQ-------IADEDNSPELVNFALKLFDIIGVEQEDLGANS 692
Query: 738 ILCLPSGECLSPDATMCTALMSALYSTLSEEVVLNRELM 776
I+ P+G L PD + T E+ L RE M
Sbjct: 693 IVISPTGTMLVPDFPGLKE--EGVTVTFDRELALAREEM 729
>sp|C4QV62|ETT1_PICPG Enhancer of translation termination 1 OS=Komagataella pastoris
(strain GS115 / ATCC 20864) GN=ETT1 PE=3 SV=1
Length = 438
Score = 35.4 bits (80), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 49/103 (47%), Gaps = 18/103 (17%)
Query: 453 MDAGDLIIHMYAM---FLFSEHHEELV--GVYASQLARHRCIDLF----VHMMELRLNSS 503
+DA D + ++A+ FL +E ELV G+ H C L + +LN
Sbjct: 46 VDADDEVSQLFALWKTFLGAEKDNELVLNGII------HECDRLLRNSQTEKNKYKLNGE 99
Query: 504 VHVKYKIFLSAMEYLPFSSGDDLKGSFEEIIERV---LSRSRE 543
H Y + L+ + + + +D+K F+E +ER+ LS S+E
Sbjct: 100 FHAIYALALNELAFFHAAKENDVKPYFDEALERIDLGLSHSKE 142
>sp|C3LTS1|MOAA_VIBCM Cyclic pyranopterin monophosphate synthase OS=Vibrio cholerae
serotype O1 (strain M66-2) GN=moaA PE=3 SV=1
Length = 334
Score = 33.5 bits (75), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 50/116 (43%), Gaps = 12/116 (10%)
Query: 47 SGETTFALFASLLDSALQ-GTEEPPVELILSPSTSHIEACQFVVNDHTAQLCLRIVQWLE 105
+GE F S +D A + G E+ V ++L + +H+E QF+ T + LR ++ ++
Sbjct: 141 TGENRFQQVMSGIDRAFEVGFEQVKVNVVLMKNLNHLELPQFMAWIKTRPIQLRFIELMQ 200
Query: 106 GLASKSLDLESKVRGSHVGTYLPNSGVWHHTQRYLKKGVAD----ANTVHHLDFDA 157
+L V G + YL +G +L K AD A H DF
Sbjct: 201 TGEMDALFARHHVSGISIRDYLFGNG-------WLLKARADNDGPAQVFIHPDFQG 249
>sp|Q9KT81|MOAA_VIBCH Cyclic pyranopterin monophosphate synthase OS=Vibrio cholerae
serotype O1 (strain ATCC 39315 / El Tor Inaba N16961)
GN=moaA PE=3 SV=1
Length = 334
Score = 33.5 bits (75), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 50/116 (43%), Gaps = 12/116 (10%)
Query: 47 SGETTFALFASLLDSALQ-GTEEPPVELILSPSTSHIEACQFVVNDHTAQLCLRIVQWLE 105
+GE F S +D A + G E+ V ++L + +H+E QF+ T + LR ++ ++
Sbjct: 141 TGENRFQQVMSGIDRAFEVGFEQVKVNVVLMKNLNHLELPQFMAWIKTRPIQLRFIELMQ 200
Query: 106 GLASKSLDLESKVRGSHVGTYLPNSGVWHHTQRYLKKGVAD----ANTVHHLDFDA 157
+L V G + YL +G +L K AD A H DF
Sbjct: 201 TGEMDALFARHHVSGISIRDYLFGNG-------WLLKARADNDGPAQVFIHPDFQG 249
>sp|A5F2Q5|MOAA_VIBC3 Cyclic pyranopterin monophosphate synthase OS=Vibrio cholerae
serotype O1 (strain ATCC 39541 / Ogawa 395 / O395)
GN=moaA PE=3 SV=1
Length = 334
Score = 33.5 bits (75), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 50/116 (43%), Gaps = 12/116 (10%)
Query: 47 SGETTFALFASLLDSALQ-GTEEPPVELILSPSTSHIEACQFVVNDHTAQLCLRIVQWLE 105
+GE F S +D A + G E+ V ++L + +H+E QF+ T + LR ++ ++
Sbjct: 141 TGENRFQQVMSGIDRAFEVGFEQVKVNVVLMKNLNHLELPQFMAWIKTRPIQLRFIELMQ 200
Query: 106 GLASKSLDLESKVRGSHVGTYLPNSGVWHHTQRYLKKGVAD----ANTVHHLDFDA 157
+L V G + YL +G +L K AD A H DF
Sbjct: 201 TGEMDALFARHHVSGISIRDYLFGNG-------WLLKARADNDGPAQVFIHPDFQG 249
>sp|Q7UKT5|Y9953_RHOBA UPF0061 protein RB9953 OS=Rhodopirellula baltica (strain SH1)
GN=RB9953 PE=3 SV=1
Length = 540
Score = 33.5 bits (75), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 39/98 (39%), Gaps = 10/98 (10%)
Query: 25 SIHRFGSQSLKACKIE---DDSLTDS-------GETTFALFASLLDSALQGTEEPPVELI 74
S H + L K E DD L DS ET +F L GT E PV L
Sbjct: 357 SWHSMMAGKLGLSKYESETDDELVDSLLTLLQLAETDMTIFYRRLADIELGTREQPVTLE 416
Query: 75 LSPSTSHIEACQFVVNDHTAQLCLRIVQWLEGLASKSL 112
L+ H+ +V ++ T + ++ W+ S+ L
Sbjct: 417 LAAVLRHLSEAHYVADEVTEEYQQALMDWMRSYQSRVL 454
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.134 0.404
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 343,463,851
Number of Sequences: 539616
Number of extensions: 14350864
Number of successful extensions: 34351
Number of sequences better than 100.0: 15
Number of HSP's better than 100.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 34334
Number of HSP's gapped (non-prelim): 18
length of query: 938
length of database: 191,569,459
effective HSP length: 127
effective length of query: 811
effective length of database: 123,038,227
effective search space: 99784002097
effective search space used: 99784002097
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 66 (30.0 bits)