BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 021664
(309 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P38821|DNPEP_YEAST Aspartyl aminopeptidase 4 OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=APE4 PE=1 SV=1
Length = 490
Score = 38.1 bits (87), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 55/117 (47%), Gaps = 15/117 (12%)
Query: 105 YGYVWWKGWKLPDMMFATRRSLSDA--CNSVARQLEDVYSSISAAQRQLSSKITSVDRDV 162
YG W W D+ A R + DA S+AR ++ ++ ++ + +DRDV
Sbjct: 124 YGGAIWHSWFDKDLGVAGRVFVKDAKTGKSIARLVD-----LNRPLLKIPTLAIHLDRDV 178
Query: 163 NKIVEISQATQ--------EEVTILRGRSKLIGDEFQSVRDIVQTLESKLIEIEGKQ 211
N+ E ++ TQ E+ T + ++ EF S++ IVQ ++L+ + K+
Sbjct: 179 NQKFEFNRETQLLPIGGLQEDKTEAKTEKEINNGEFTSIKTIVQRHHAELLGLIAKE 235
>sp|Q076A3|MYH13_CANFA Myosin-13 OS=Canis familiaris GN=MYH13 PE=3 SV=1
Length = 1940
Score = 34.7 bits (78), Expect = 0.82, Method: Composition-based stats.
Identities = 25/100 (25%), Positives = 49/100 (49%), Gaps = 7/100 (7%)
Query: 119 MFATRRSLSDACNSVARQLEDVYSSISAAQRQLSSKITSVDRDVNKIVEISQATQEEVTI 178
+ A +RSL D C+S+ R ++D+ +++ +++ + + V + E A +E ++
Sbjct: 940 LVAKKRSLEDKCSSLKRDIDDLELTLTKVEKEKH----ATENKVKNLSEEMTALEENISK 995
Query: 179 LRGRSKLIGDEFQSVRDIVQTLESK---LIEIEGKQDITT 215
L K + + Q D +Q E K LI+I K + T
Sbjct: 996 LTKEKKSLQEAHQQALDDLQVEEDKVNGLIKINVKLEQQT 1035
>sp|Q118E6|PAND_TRIEI Aspartate 1-decarboxylase OS=Trichodesmium erythraeum (strain
IMS101) GN=panD PE=1 SV=1
Length = 158
Score = 34.3 bits (77), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 20/61 (32%), Positives = 27/61 (44%), Gaps = 1/61 (1%)
Query: 210 KQDITTLGVKKLCDRARELENGRPTELVQASRYTLSRTTLELPGITPSSRSGSLHPLPLE 269
K DI + + CDRA LENG ++ A R L + P+ S H LP+
Sbjct: 82 KDDILIIWANEQCDRAEVLENGHKAKIFVADENNHCREFL-YQILNPNQGSVEFHSLPVI 140
Query: 270 P 270
P
Sbjct: 141 P 141
>sp|Q1CRC2|SYS_HELPH Serine--tRNA ligase OS=Helicobacter pylori (strain HPAG1) GN=serS
PE=3 SV=1
Length = 415
Score = 34.3 bits (77), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/64 (29%), Positives = 33/64 (51%), Gaps = 10/64 (15%)
Query: 179 LRGRSKLIGDEFQSVRDIVQTLESKLIEIEGKQDITT-----LGVK-----KLCDRAREL 228
L+ R+ +GDE + +R+++ + +LIE+EG Q G+K D +EL
Sbjct: 18 LKKRNHAMGDELERLREVITHYKKQLIELEGLQAFQNKVSKEFGIKMAQKVDTSDLKKEL 77
Query: 229 ENGR 232
EN +
Sbjct: 78 ENNK 81
>sp|Q640L5|CCD18_MOUSE Coiled-coil domain-containing protein 18 OS=Mus musculus GN=Ccdc18
PE=2 SV=1
Length = 1455
Score = 33.9 bits (76), Expect = 1.4, Method: Composition-based stats.
Identities = 24/107 (22%), Positives = 52/107 (48%), Gaps = 1/107 (0%)
Query: 124 RSLSDACNSVARQLEDVYSSISAAQRQLSSKITSVDRDVNKIVEISQATQEEVTILRGRS 183
+ L D LE+ Y++I +L ++ +++E+ QA +E L+ R+
Sbjct: 966 QELRDVLQKAQLSLEEKYTTIKDLTAELRECKMEIEDKKQELIEMDQALKERNWELKQRA 1025
Query: 184 KLIGDEFQSVRDIVQTLESKLIEIEGKQDITTLGVKKLCDRARELEN 230
+ ++R+ +E K+I++EG + + L +K+ C++ E N
Sbjct: 1026 AQVTHLDMTIREHRGEMEQKIIKLEGTLEKSELELKE-CNKQVESLN 1071
>sp|P35579|MYH9_HUMAN Myosin-9 OS=Homo sapiens GN=MYH9 PE=1 SV=4
Length = 1960
Score = 33.9 bits (76), Expect = 1.4, Method: Composition-based stats.
Identities = 31/136 (22%), Positives = 61/136 (44%), Gaps = 9/136 (6%)
Query: 124 RSLSDACNSVARQLEDVYSSISAAQRQLSSKITSVDRDVNKIVEISQATQEEVTILRGRS 183
R+L +A A +LE + L S V + V+++ + +A +++V ++ +
Sbjct: 1483 RALEEAMEQKA-ELERLNKQFRTEMEDLMSSKDDVGKSVHELEKSKRALEQQVEEMKTQL 1541
Query: 184 KLIGDEFQSVRDIVQTLESKLI--------EIEGKQDITTLGVKKLCDRARELENGRPTE 235
+ + DE Q+ D LE L +++G+ + + K+L + RE+E E
Sbjct: 1542 EELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDE 1601
Query: 236 LVQASRYTLSRTTLEL 251
Q S +R LE+
Sbjct: 1602 RKQRSMAVAARKKLEM 1617
>sp|Q258K2|MYH9_CANFA Myosin-9 OS=Canis familiaris GN=MYH9 PE=2 SV=1
Length = 1960
Score = 33.9 bits (76), Expect = 1.4, Method: Composition-based stats.
Identities = 31/136 (22%), Positives = 61/136 (44%), Gaps = 9/136 (6%)
Query: 124 RSLSDACNSVARQLEDVYSSISAAQRQLSSKITSVDRDVNKIVEISQATQEEVTILRGRS 183
R+L +A A +LE + L S V + V+++ + +A +++V ++ +
Sbjct: 1483 RALEEAMEQKA-ELERLNKQFRTEMEDLMSSKDDVGKSVHELEKSKRALEQQVEEMKTQL 1541
Query: 184 KLIGDEFQSVRDIVQTLESKLI--------EIEGKQDITTLGVKKLCDRARELENGRPTE 235
+ + DE Q+ D LE L +++G+ + + K+L + RE+E E
Sbjct: 1542 EELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDE 1601
Query: 236 LVQASRYTLSRTTLEL 251
Q S +R LE+
Sbjct: 1602 KKQRSMAVAARKKLEM 1617
>sp|Q8VDD5|MYH9_MOUSE Myosin-9 OS=Mus musculus GN=Myh9 PE=1 SV=4
Length = 1960
Score = 33.9 bits (76), Expect = 1.7, Method: Composition-based stats.
Identities = 31/136 (22%), Positives = 61/136 (44%), Gaps = 9/136 (6%)
Query: 124 RSLSDACNSVARQLEDVYSSISAAQRQLSSKITSVDRDVNKIVEISQATQEEVTILRGRS 183
R+L +A A +LE + L S V + V+++ + +A +++V ++ +
Sbjct: 1483 RALEEAMEQKA-ELERLNKQFRTEMEDLMSSKDDVGKSVHELEKSKRALEQQVEEMKTQL 1541
Query: 184 KLIGDEFQSVRDIVQTLESKLI--------EIEGKQDITTLGVKKLCDRARELENGRPTE 235
+ + DE Q+ D LE L +++G+ + + K+L + RE+E E
Sbjct: 1542 EELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDE 1601
Query: 236 LVQASRYTLSRTTLEL 251
Q S +R LE+
Sbjct: 1602 RKQRSMAMAARKKLEM 1617
>sp|O60861|GAS7_HUMAN Growth arrest-specific protein 7 OS=Homo sapiens GN=GAS7 PE=1 SV=3
Length = 476
Score = 33.5 bits (75), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 21/88 (23%), Positives = 46/88 (52%), Gaps = 4/88 (4%)
Query: 135 RQLEDVYSSISAAQRQLSSKITSVDRDVNKI-VEISQATQEEVTILRGRSKLIGDEFQSV 193
+QL Y+S+ A++ L+ + ++ ++ +++S T+E++ R +S GD+
Sbjct: 327 KQLASRYASVEKARKALTERQRDLEMKTQQLEIKLSNKTEEDIKKARRKSTQAGDDLMRC 386
Query: 194 RDIVQTLESKLIEIEGKQDITTLGVKKL 221
D+ +SK E + TTL +++L
Sbjct: 387 VDLYNQAQSKWFE---EMVTTTLELERL 411
>sp|Q5T9S5|CCD18_HUMAN Coiled-coil domain-containing protein 18 OS=Homo sapiens GN=CCDC18
PE=2 SV=1
Length = 1454
Score = 33.5 bits (75), Expect = 2.1, Method: Composition-based stats.
Identities = 24/109 (22%), Positives = 53/109 (48%), Gaps = 1/109 (0%)
Query: 124 RSLSDACNSVARQLEDVYSSISAAQRQLSSKITSVDRDVNKIVEISQATQEEVTILRGRS 183
+ L D LE+ Y++I +L ++ +++E+ QA +E L+ R+
Sbjct: 965 QELRDVLQKAQLSLEEKYTTIKDLTAELRECKMEIEDKKQELLEMDQALKERNWELKQRA 1024
Query: 184 KLIGDEFQSVRDIVQTLESKLIEIEGKQDITTLGVKKLCDRARELENGR 232
+ ++R+ +E K+I++EG + + L +K+ C++ E N +
Sbjct: 1025 AQVTHLDMTIREHRGEMEQKIIKLEGTLEKSELELKE-CNKQIESLNDK 1072
>sp|O22267|CKI1_ARATH Histidine kinase CKI1 OS=Arabidopsis thaliana GN=CKI1 PE=1 SV=1
Length = 1122
Score = 33.1 bits (74), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 34/63 (53%), Gaps = 10/63 (15%)
Query: 71 ELSHVP-RSVIIETSSGSGTGAK------KYGVIVVIVAVGYGYVWWKGWKLPDMMFATR 123
E+S VP R ++ + G T K KY +IVV++ +G+G+ W W MM ATR
Sbjct: 314 EVSGVPLRYTLMFPNKGGATRIKHQAEKAKYQLIVVMIFLGFGWPVWFVWF---MMQATR 370
Query: 124 RSL 126
R +
Sbjct: 371 REM 373
>sp|Q60780|GAS7_MOUSE Growth arrest-specific protein 7 OS=Mus musculus GN=Gas7 PE=1 SV=1
Length = 421
Score = 33.1 bits (74), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 17/73 (23%), Positives = 38/73 (52%), Gaps = 1/73 (1%)
Query: 135 RQLEDVYSSISAAQRQLSSKITSVDRDVNKI-VEISQATQEEVTILRGRSKLIGDEFQSV 193
+QL Y+S+ A++ L+ + ++ ++ +++S T+E++ R +S GD+
Sbjct: 272 KQLASRYASVEKARKALTERQKDLEMKTQQLEIKLSNKTEEDIKKARRKSTQAGDDLMRC 331
Query: 194 RDIVQTLESKLIE 206
D+ +SK E
Sbjct: 332 VDLYNQAQSKWFE 344
>sp|Q10077|SNT2_SCHPO Lid2 complex component snt2 OS=Schizosaccharomyces pombe (strain
972 / ATCC 24843) GN=snt2 PE=4 SV=1
Length = 1131
Score = 32.7 bits (73), Expect = 3.1, Method: Composition-based stats.
Identities = 36/142 (25%), Positives = 62/142 (43%), Gaps = 19/142 (13%)
Query: 146 AAQRQLSS-KITSVDRDVNKIVEIS---QATQEEVTILRGRSKLIGD------------- 188
A QR+ S+ + + + D N E + QATQ+ V+IL+G +K + +
Sbjct: 311 ATQRKKSTFQKENANVDANHATENNLEGQATQKSVSILKGHNKALSNVSLQEDHGKRRNL 370
Query: 189 -EFQSVRDIVQTLESKLIEIEGKQDITTLGVKKLCDRARELENGRPTELVQASRYTLSRT 247
+S R++ Q L E + +K+L R L+ + + S SR
Sbjct: 371 KSLRSSRNLHQQSRKSLDENKPNSFSNVSKLKRLPWNMRYLDLKSDLTVEKKSDIYPSRA 430
Query: 248 TLELPGITPSSRSGSLHPL-PL 268
+ + + P+S +LHPL PL
Sbjct: 431 RISISPMLPTSSEDNLHPLQPL 452
>sp|Q8IBG1|DYHC1_PLAF7 Dynein heavy chain-like protein MAL7P1.162 OS=Plasmodium falciparum
(isolate 3D7) GN=MAL7P1.162 PE=2 SV=2
Length = 4985
Score = 32.7 bits (73), Expect = 3.1, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 35/62 (56%), Gaps = 1/62 (1%)
Query: 145 SAAQRQLSSKITSVDRDVNKIVEISQATQEEVTILRGRSKLIGDEFQSVRDIVQTLESKL 204
S A L+ + SV +N I+E Q ++E+ L+ +K+ D++ RDI+ LE KL
Sbjct: 3587 SRAAGPLAKWVESVITFLN-ILETVQPLEKEIEKLQEETKVAEDQYNEQRDIICELEKKL 3645
Query: 205 IE 206
++
Sbjct: 3646 VQ 3647
>sp|Q02224|CENPE_HUMAN Centromere-associated protein E OS=Homo sapiens GN=CENPE PE=1 SV=2
Length = 2701
Score = 32.3 bits (72), Expect = 4.6, Method: Composition-based stats.
Identities = 24/94 (25%), Positives = 48/94 (51%), Gaps = 21/94 (22%)
Query: 136 QLEDVYSSISAAQRQLSSKITSVDRDVNKIVEISQATQEEVTILRGRSKLIGDEFQSVRD 195
+L+ + S+ A + QL + D+ + +E++ QEE+ +L+GDE + ++
Sbjct: 1054 ELQQMLESVIAEKEQLKT-------DLKENIEMTIENQEEL-------RLLGDELKKQQE 1099
Query: 196 IVQTLESKLIEIEGKQDITTLGVKKLCDRARELE 229
IV ++ I+ EG+ + + CDR E+E
Sbjct: 1100 IVAQEKNHAIKKEGE-------LSRTCDRLAEVE 1126
Score = 31.6 bits (70), Expect = 7.5, Method: Composition-based stats.
Identities = 21/63 (33%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 150 QLSSKITSVDRDVNKIVEISQATQEEVTILRGRSKLIGDEFQSVRDIVQTLESKLIEIEG 209
+L KIT + +++NK VE ++A +EEV IL K + E + +R +Q +L I
Sbjct: 717 ELEGKITDLQKELNKEVEENEALREEV-ILLSELKSLPSEVERLRKEIQDKSEELHIITS 775
Query: 210 KQD 212
++D
Sbjct: 776 EKD 778
>sp|Q99323|MYSN_DROME Myosin heavy chain, non-muscle OS=Drosophila melanogaster GN=zip PE=1
SV=2
Length = 2057
Score = 32.0 bits (71), Expect = 6.0, Method: Composition-based stats.
Identities = 25/120 (20%), Positives = 55/120 (45%), Gaps = 10/120 (8%)
Query: 123 RRSLSDACNSVARQLEDVYSSISAAQRQLSSKITSVDRDVNKIVEISQATQEEVTILRGR 182
R L + C + ++ E++ + + A+ + S+ + S +++ E Q +EE G
Sbjct: 1359 RSELQEKCTKLQQEAENITNQLEEAELKASAAVKSASNMESQLTEAQQLLEEETRQKLGL 1418
Query: 183 S---KLIGDEFQSVRDIVQ-------TLESKLIEIEGKQDITTLGVKKLCDRARELENGR 232
S + I E +++++ ++ E KL E+ + ++ D A+ELE G+
Sbjct: 1419 SSKLRQIESEKEALQEQLEEDDEAKRNYERKLAEVTTQMQEIKKKAEEDADLAKELEEGK 1478
>sp|Q6RT24|CENPE_MOUSE Centromere-associated protein E OS=Mus musculus GN=Cenpe PE=1 SV=1
Length = 2474
Score = 32.0 bits (71), Expect = 6.4, Method: Composition-based stats.
Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 8/72 (11%)
Query: 150 QLSSKITSVDRDVNKIVEISQATQEEVTILRGRSKLIGDEFQSVRDIVQTLESKLIEIEG 209
+L +IT + +++NK E Q QEEV +L E +S+ V+TL +L E
Sbjct: 717 ELEKRITDLQKELNKEAEEKQTLQEEVNLL--------SELKSLPSEVETLRRELYEKSE 768
Query: 210 KQDITTLGVKKL 221
+ I T +KL
Sbjct: 769 ELHIITTEREKL 780
>sp|P40806|PKSJ_BACSU Polyketide synthase PksJ OS=Bacillus subtilis (strain 168) GN=pksJ
PE=1 SV=3
Length = 5043
Score = 31.6 bits (70), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 48/180 (26%), Positives = 76/180 (42%), Gaps = 27/180 (15%)
Query: 21 LAKEGRLSSVSDAVGGTLKIVSKLIKQDDPGPSDRKLFNDLLAEVSSVQQELSHVPRSVI 80
L K+ LS + AV GT + V+ L Q+ DL AEVSS + P ++
Sbjct: 3973 LTKQWSLSPIGSAVPGT-RTVAILCCQETA---------DLAAEVSS------YFPNHLL 4016
Query: 81 IETS----SGSGTGAKKYGVIVVIVAVGY---GYVWWKGWKLPDMMFATRRSLSDACNSV 133
I+ S S K++ +V ++ G+ G + W W + F + L C V
Sbjct: 4017 IDVSRIENDQSDIDWKEFDGLVDVIGCGWDDEGRLDWIEWVQRLVEFGHKEGLRLLC--V 4074
Query: 134 ARQLEDVY-SSISAAQRQLSSKITSVDRDVNKIVEISQATQEEVTILRGRSKLIGDEFQS 192
+ LE +S+ A + + + + ++ EEVT R +KLI DEF S
Sbjct: 4075 TKGLESFQNTSVRMAGASRAGLYRMLQCEYSHLIS-RHMDAEEVTDHRRLAKLIADEFYS 4133
>sp|Q9VRP9|BRE1_DROME E3 ubiquitin-protein ligase Bre1 OS=Drosophila melanogaster GN=Bre1
PE=1 SV=2
Length = 1044
Score = 31.6 bits (70), Expect = 8.0, Method: Composition-based stats.
Identities = 24/100 (24%), Positives = 49/100 (49%), Gaps = 12/100 (12%)
Query: 60 DLLAEVSSVQQELSHVPRSVIIETSSGSGTGAKKYGVIVVIVAVGYGYVWWKGWKLPDMM 119
+ L EV ++ ++ +P SVI+ET+ +Y + +V Y + ++ M+
Sbjct: 371 ETLKEVEKLKMDIRQLPESVIVETT--------EYKCLQSQFSVLYN----ESMQIKTML 418
Query: 120 FATRRSLSDACNSVARQLEDVYSSISAAQRQLSSKITSVD 159
TR L + N RQ+E + S AQ+++ S++ ++
Sbjct: 419 DETRNQLQTSKNQHLRQIEVMESEELIAQKKVRSEMIQME 458
>sp|O55148|GAS7_RAT Growth arrest-specific protein 7 OS=Rattus norvegicus GN=Gas7 PE=2
SV=2
Length = 422
Score = 31.2 bits (69), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 16/70 (22%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 138 EDVYSSISAAQRQLSSKITSVDRDVNKI-VEISQATQEEVTILRGRSKLIGDEFQSVRDI 196
E Y+S+ A++ L+ + ++ ++ +++S T+E++ R +S GD+ D+
Sbjct: 276 ESRYASVEKARKALTERQKDLEMKTQQLEIKLSNKTEEDIKKARRKSTQAGDDLMRCVDL 335
Query: 197 VQTLESKLIE 206
+SK E
Sbjct: 336 YNQAQSKWFE 345
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.316 0.133 0.371
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 101,649,316
Number of Sequences: 539616
Number of extensions: 4014000
Number of successful extensions: 15582
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 8
Number of HSP's successfully gapped in prelim test: 70
Number of HSP's that attempted gapping in prelim test: 15510
Number of HSP's gapped (non-prelim): 149
length of query: 309
length of database: 191,569,459
effective HSP length: 117
effective length of query: 192
effective length of database: 128,434,387
effective search space: 24659402304
effective search space used: 24659402304
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (28.1 bits)