BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 026650
(235 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9SAK5|APL_ARATH Myb family transcription factor APL OS=Arabidopsis thaliana GN=APL
PE=2 SV=2
Length = 358
Score = 131 bits (330), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 68/114 (59%), Positives = 83/114 (72%), Gaps = 9/114 (7%)
Query: 1 MRTMGVKGLTLYHLKSHLQKYRLGKQACKETTENSKDVSCVAESQDTGSSTTSSTRMVAQ 60
MR MGVKGLTLYHLKSHLQK+RLGKQ KE ++S A + D + SS+ M+++
Sbjct: 66 MRVMGVKGLTLYHLKSHLQKFRLGKQPHKEYGDHSTKEGSRASAMDIQRNVASSSGMMSR 125
Query: 61 DPNDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKAL 114
+ N+ +QMEVQRRLHEQLEVQR LQLRIEAQGKY+QSILE+AC+ L
Sbjct: 126 NMNE---------MQMEVQRRLHEQLEVQRHLQLRIEAQGKYMQSILERACQTL 170
>sp|Q8GUN5|PHL1_ARATH Protein PHR1-LIKE 1 OS=Arabidopsis thaliana GN=PHL1 PE=1 SV=1
Length = 413
Score = 93.2 bits (230), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 75/118 (63%), Gaps = 9/118 (7%)
Query: 1 MRTMGVKGLTLYHLKSHLQKYRLGKQACKETTENSKDVSCVAESQDTGSSTTSSTRMVAQ 60
++ + GLT+YH+KSHLQKYR + ET+E E Q+ TS + +
Sbjct: 263 LKLLNNPGLTIYHVKSHLQKYRTARYK-PETSE------VTGEPQEK--KMTSIEDIKSL 313
Query: 61 DPNDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKALNDQA 118
D ++T+ALR+QMEVQ+RLHEQLE+QR LQL+IE QG+YLQ + EK K ++++
Sbjct: 314 DMKTSVEITQALRLQMEVQKRLHEQLEIQRSLQLQIEKQGRYLQMMFEKQQKIQDNKS 371
>sp|Q700D9|MYBF_ARATH Putative Myb family transcription factor At1g14600 OS=Arabidopsis
thaliana GN=At1g14600 PE=2 SV=2
Length = 255
Score = 35.0 bits (79), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 14/22 (63%), Positives = 18/22 (81%)
Query: 1 MRTMGVKGLTLYHLKSHLQKYR 22
++ M VKGLT+ H+KSHLQ YR
Sbjct: 55 LKIMDVKGLTISHVKSHLQMYR 76
>sp|Q941I2|KAN3_ARATH Probable transcription factor KAN3 OS=Arabidopsis thaliana GN=KAN3
PE=2 SV=1
Length = 322
Score = 34.7 bits (78), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 19/29 (65%)
Query: 4 MGVKGLTLYHLKSHLQKYRLGKQACKETT 32
M V+ LTL H+KSHLQ YR K K TT
Sbjct: 199 MDVQDLTLAHVKSHLQMYRTIKSTEKPTT 227
>sp|Q9C616|KAN2_ARATH Probable transcription factor KAN2 OS=Arabidopsis thaliana GN=KAN2
PE=2 SV=1
Length = 388
Score = 33.9 bits (76), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 48/105 (45%), Gaps = 11/105 (10%)
Query: 4 MGVKGLTLYHLKSHLQKYRLGKQACKETTENSKDVSCVAESQDTGSSTTSSTRMVAQDPN 63
M VK LTL H+KSHLQ YR K TT+ + S ++ + GSS +++ D N
Sbjct: 248 MDVKDLTLAHVKSHLQMYRTVK-----TTDKAAASSGQSDVYENGSSGDNNSDDWMFDMN 302
Query: 64 DGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILE 108
+ +E L +E L + RL GK + ++ E
Sbjct: 303 RKSRDSEELTNPLEKSNGLWTNSSGEARL------HGKLIDNVAE 341
>sp|Q0J235|ROLL9_ORYSJ Probable transcription factor RL9 OS=Oryza sativa subsp. japonica
GN=RL9 PE=2 SV=2
Length = 532
Score = 33.1 bits (74), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 16/26 (61%), Positives = 17/26 (65%)
Query: 4 MGVKGLTLYHLKSHLQKYRLGKQACK 29
M VK LTL H+KSHLQ YR K K
Sbjct: 359 MDVKDLTLAHVKSHLQMYRTVKSTDK 384
>sp|Q93WJ9|KAN1_ARATH Transcription repressor KAN1 OS=Arabidopsis thaliana GN=KAN1 PE=1
SV=1
Length = 403
Score = 33.1 bits (74), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/26 (61%), Positives = 17/26 (65%)
Query: 4 MGVKGLTLYHLKSHLQKYRLGKQACK 29
M VK LTL H+KSHLQ YR K K
Sbjct: 254 MDVKDLTLAHVKSHLQMYRTVKTTNK 279
>sp|Q5ZQU0|SNED1_RAT Sushi, nidogen and EGF-like domain-containing protein 1 OS=Rattus
norvegicus GN=Sned1 PE=2 SV=2
Length = 1403
Score = 32.7 bits (73), Expect = 2.2, Method: Composition-based stats.
Identities = 31/111 (27%), Positives = 46/111 (41%), Gaps = 14/111 (12%)
Query: 131 LSELAIKVSNDCQGMVPLENIKMPSISELAAALESKNASTIPARIGDCSVESC------- 183
L A+ V + VPL+N + P + LA L N+ + G CS ++C
Sbjct: 1263 LPSRAVTVRSQPTTPVPLKNTEAPEQARLALQLPKNNSKDTESTPGSCSEDTCQNGGTCV 1322
Query: 184 --LTSTSSPVSPMGLGSQAA-AMKKRPRP---LFGNGESLPL-EGNMRQEV 227
+ S P G A +K PRP LF +S P+ EG++ V
Sbjct: 1323 PGANAHSCDCRPGFKGRHCELACEKVPRPCTRLFSETKSFPVWEGDVCHHV 1373
>sp|A2BTQ6|MNMG_PROMS tRNA uridine 5-carboxymethylaminomethyl modification enzyme MnmG
OS=Prochlorococcus marinus (strain AS9601) GN=mnmG PE=3
SV=1
Length = 655
Score = 32.3 bits (72), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 42/177 (23%), Positives = 78/177 (44%), Gaps = 23/177 (12%)
Query: 19 QKYRLGKQACKETTENSKDVSCVAESQDTGSSTTSSTRMVAQDPNDGYQVTEALRVQM-- 76
+K+RL K T E SK + S+ GS+T + + + PN + ++ ++ +
Sbjct: 493 EKFRLNNTRLKNTDEISKKIELETGSKIKGSTT---LKELLKRPN--FHYSDLIKYNLNE 547
Query: 77 -EVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKAL----NDQAIVAAGLEAAREEL 131
+ + E +E+ + + ++ Q ++ I ++CK+L N + I LE ARE L
Sbjct: 548 RNLGSSIQEGVEIDIKYEGYLKRQKNNIEQINRQSCKSLPQEINYEKIETLSLE-ARENL 606
Query: 132 SELAIKVSNDCQGMVPLENIKMPSIS--ELAAALESKNASTIPARIGDCSVESCLTS 186
+++ K D K+P +S +L A L I + VE L+S
Sbjct: 607 NKIKPKNFGDAS--------KIPGVSKADLTALLVWLKIREIKKEKANIFVEKKLSS 655
>sp|Q9FJV5|KAN4_ARATH Probable transcription factor KAN4 OS=Arabidopsis thaliana GN=KAN4
PE=1 SV=1
Length = 276
Score = 32.3 bits (72), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 15/22 (68%), Positives = 16/22 (72%)
Query: 4 MGVKGLTLYHLKSHLQKYRLGK 25
M VK LTL H+KSHLQ YR K
Sbjct: 140 MNVKDLTLAHVKSHLQMYRTVK 161
>sp|Q9SJZ6|MED18_ARATH Mediator of RNA polymerase II transcription subunit 18
OS=Arabidopsis thaliana GN=MED18 PE=1 SV=1
Length = 219
Score = 32.0 bits (71), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 46/97 (47%), Gaps = 10/97 (10%)
Query: 71 ALRVQMEVQRRL-HEQLEVQRRLQLRIEAQGKYLQSILEKACK--ALNDQAIVAAGLE-- 125
ALR+ + +L HE L+V + A S + K K A+++ V G++
Sbjct: 100 ALRMFYALGYKLDHELLKVGFAFHFQRTAHISVSVSSVNKMPKVHAIDEAVPVTPGMQIV 159
Query: 126 -----AAREELSELAIKVSNDCQGMVPLENIKMPSIS 157
A E SE+A VS+ C+ + PL ++ PSIS
Sbjct: 160 DVTAPATSENYSEVAAAVSSFCEFLAPLVHLSKPSIS 196
>sp|P97386|DNLI3_MOUSE DNA ligase 3 OS=Mus musculus GN=Lig3 PE=1 SV=2
Length = 1015
Score = 31.6 bits (70), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 58/114 (50%), Gaps = 12/114 (10%)
Query: 11 LYHLKSHLQKYRLGKQACKETTE--NSKDVSCVAE--SQDTGSSTTSSTRMVAQDPNDGY 66
L HL S L K +QA ++ + D+ C+ D ++ + + A DPN Y
Sbjct: 386 LLHL-SKLTKEDEQQQALQDIASRCTANDLKCIIRLIKHDLKMNSGAKHVLDALDPN-AY 443
Query: 67 QVTEALR-VQMEVQRRLHEQLEVQ----RRLQLRIEAQ-GKYLQSILEKACKAL 114
+ +A R +Q V+R LH + EV+ RR LR++A +Q +L +ACK++
Sbjct: 444 EAFKASRNLQDVVERVLHNEQEVEKDPGRRRALRVQASLMTPVQPMLAEACKSI 497
>sp|P34036|DYHC_DICDI Dynein heavy chain, cytoplasmic OS=Dictyostelium discoideum GN=dhcA
PE=1 SV=2
Length = 4730
Score = 31.6 bits (70), Expect = 5.4, Method: Composition-based stats.
Identities = 19/70 (27%), Positives = 39/70 (55%), Gaps = 1/70 (1%)
Query: 70 EALRVQM-EVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACKALNDQAIVAAGLEAAR 128
EAL+ + + ++ + L++Q Q+++ K L + L +A + D V + LE +
Sbjct: 3800 EALKTERPDTHKKRSDLLKIQGEFQVKLRILEKSLLNALSQASGNILDDDSVISTLETLK 3859
Query: 129 EELSELAIKV 138
+E +E+A+KV
Sbjct: 3860 KETTEIALKV 3869
>sp|Q55563|Y163_SYNY3 Uncharacterized WD repeat-containing protein sll0163 OS=Synechocystis
sp. (strain PCC 6803 / Kazusa) GN=sll0163 PE=4 SV=1
Length = 1693
Score = 31.2 bits (69), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 57/129 (44%), Gaps = 14/129 (10%)
Query: 53 SSTRMVAQDPNDGYQVTEALRVQMEVQRRLHEQLEVQRRLQLRIEAQGKYLQSILEKACK 112
S+ ++A P D Q E L ++E EQ+ Q+ +++ Q LEK C+
Sbjct: 889 SNPNLIAPSPGDRQQGNEDLIAELEAGSSAEEQVVAQKLKAAQLQYQKLVAGINLEKQCQ 948
Query: 113 ALNDQAIVAAGLEAAREELSELAIKVSNDCQGMVPLEN--IKMPSISELAAALESKNAST 170
+ Q +V LEA A+K D Q +V E ++ PS++ + +
Sbjct: 949 FILKQFLVYP-LEALLA-----AVKTGQDLQKLVTPETPLVQYPSLAPWLS------LHS 996
Query: 171 IPARIGDCS 179
I ARI +C+
Sbjct: 997 ILARINECN 1005
>sp|A7GUG3|LIPA_BACCN Lipoyl synthase OS=Bacillus cereus subsp. cytotoxis (strain NVH
391-98) GN=lipA PE=3 SV=1
Length = 298
Score = 30.8 bits (68), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 42/78 (53%), Gaps = 6/78 (7%)
Query: 62 PNDGYQVTEALRVQMEVQRR-LHEQLEVQRRLQLRIEAQGKYLQSI--LEKACKALND-- 116
P+D V E LR+ M+ + L+ +E RRL R+ A+ KY +S+ L +A + D
Sbjct: 142 PSDMGGVEENLRILMDAKPDILNHNIETVRRLSDRVRARAKYDRSLEFLRRAKEMQPDIP 201
Query: 117 -QAIVAAGLEAAREELSE 133
++ + GL RE+L E
Sbjct: 202 TKSSIMVGLGETREDLLE 219
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.312 0.126 0.348
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 77,569,469
Number of Sequences: 539616
Number of extensions: 2809634
Number of successful extensions: 8646
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 65
Number of HSP's that attempted gapping in prelim test: 8601
Number of HSP's gapped (non-prelim): 104
length of query: 235
length of database: 191,569,459
effective HSP length: 114
effective length of query: 121
effective length of database: 130,053,235
effective search space: 15736441435
effective search space used: 15736441435
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 59 (27.3 bits)