BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 038571
(321 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9FX53|TGT1_ARATH Trihelix transcription factor GT-1 OS=Arabidopsis thaliana GN=GT-1
PE=1 SV=1
Length = 406
Score = 42.0 bits (97), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 49/100 (49%), Gaps = 14/100 (14%)
Query: 41 WTRQETIVLIQGKRVVEDRIRGFRTSTSAFRSDHSEPKWDSVASYCKQYGVNRRPVQCRK 100
W + ET LI +R ++ F TS S ++ W+ ++S ++ G +R P C
Sbjct: 87 WVQDETRSLIMFRRGMDGL---FNTSKS------NKHLWEQISSKMREKGFDRSPTMCTD 137
Query: 101 RWGNLLVDFRKIKRWESQMKEEKQSFW-----VMRNESRK 135
+W NLL +F+K K + K S++ ++R S+K
Sbjct: 138 KWRNLLKEFKKAKHHDRGNGSAKMSYYKEIEDILRERSKK 177
>sp|P55128|RTX11_ACTPL RTX-I toxin determinant A from serotypes 1/9 OS=Actinobacillus
pleuropneumoniae GN=apxIA PE=1 SV=1
Length = 1023
Score = 38.1 bits (87), Expect = 0.090, Method: Composition-based stats.
Identities = 31/115 (26%), Positives = 54/115 (46%), Gaps = 8/115 (6%)
Query: 51 QGKRVVEDRIRGFRTSTSAFRSDHSEPKWDSVASYCKQYGVNRRPVQCRKRWGNLLVDFR 110
GK E+ G+ SAF D ++ ++ Y K+Y V R ++RW + +
Sbjct: 432 HGKNYFEN---GYDARHSAFLED----TFELLSQYNKEYSVERVVAITQQRWDVNIGELA 484
Query: 111 KIKRWESQMKEEKQSFWVMRNESRKQMKLPGYFDREVYDVLDGVLAMPAVPLTTM 165
I R S K K ++ E + K P FD++V+D L+G + + ++ TT+
Sbjct: 485 GITRKGSDTKSGK-AYVDFFEEGKLLEKEPDRFDKKVFDPLEGKIDLSSINKTTL 538
>sp|Q9LU92|TGT4_ARATH Trihelix transcription factor GT-4 OS=Arabidopsis thaliana GN=GT-4
PE=2 SV=1
Length = 372
Score = 37.0 bits (84), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 41/78 (52%), Gaps = 9/78 (11%)
Query: 41 WTRQETIVLIQGKRVVEDRIRGFRTSTSAFRSDHSEPKWDSVASYCKQYGVNRRPVQCRK 100
W + ET LI +R +++ F TS S ++ W+ ++ ++ G +R P C
Sbjct: 55 WAQDETRTLISLRREMDNL---FNTSKS------NKHLWEQISKKMREKGFDRSPSMCTD 105
Query: 101 RWGNLLVDFRKIKRWESQ 118
+W N+L +F+K K+ E +
Sbjct: 106 KWRNILKEFKKAKQHEDK 123
>sp|Q9LZS0|PTL_ARATH Trihelix transcription factor PTL OS=Arabidopsis thaliana GN=PTL
PE=2 SV=1
Length = 591
Score = 37.0 bits (84), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 20/75 (26%), Positives = 39/75 (52%), Gaps = 10/75 (13%)
Query: 40 RWTRQETIVLIQGKRVVEDRIRGFRTSTSAFRSDHSEPKWDSVASY-CKQYGVNRRPVQC 98
RW RQET+ L++ + ++ + + ++ P WD V+ +++G R +C
Sbjct: 119 RWPRQETLTLLEIRSRLDHKFK---------EANQKGPLWDEVSRIMSEEHGYQRSGKKC 169
Query: 99 RKRWGNLLVDFRKIK 113
R+++ NL +RK K
Sbjct: 170 REKFENLYKYYRKTK 184
>sp|P55129|RTX12_ACTPL RTX-I toxin determinant A from serotypes 5/10 OS=Actinobacillus
pleuropneumoniae GN=apxIA PE=3 SV=1
Length = 1023
Score = 36.6 bits (83), Expect = 0.26, Method: Composition-based stats.
Identities = 30/115 (26%), Positives = 54/115 (46%), Gaps = 8/115 (6%)
Query: 51 QGKRVVEDRIRGFRTSTSAFRSDHSEPKWDSVASYCKQYGVNRRPVQCRKRWGNLLVDFR 110
GK E+ G+ SAF D ++ ++ Y K+Y V R ++RW + +
Sbjct: 432 HGKNYFEN---GYDARHSAFLED----TFELLSQYNKEYSVERVVAITQQRWDVNIGELA 484
Query: 111 KIKRWESQMKEEKQSFWVMRNESRKQMKLPGYFDREVYDVLDGVLAMPAVPLTTM 165
I R + K K ++ E + K P FD++V+D L+G + + ++ TT+
Sbjct: 485 GITRKGADAKSGK-AYVDFFEEGKLLEKDPDRFDKKVFDPLEGKIDLSSINKTTL 538
>sp|Q39117|TGT2_ARATH Trihelix transcription factor GT-2 OS=Arabidopsis thaliana GN=GT-2
PE=2 SV=1
Length = 575
Score = 36.6 bits (83), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/88 (25%), Positives = 44/88 (50%), Gaps = 10/88 (11%)
Query: 27 GNTDD-RSKTARNPRWTRQETIVLIQGKRVVEDRIRGFRTSTSAFRSDHSEPKWDSVASY 85
GN D+ S + + RW + E LI+ ++ +E + T P W+ +++
Sbjct: 383 GNYDNNHSVSPSSSRWPKTEVEALIRIRKNLEANYQENGTKG---------PLWEEISAG 433
Query: 86 CKQYGVNRRPVQCRKRWGNLLVDFRKIK 113
++ G NR +C+++W N+ F+K+K
Sbjct: 434 MRRLGYNRSAKRCKEKWENINKYFKKVK 461
Score = 31.2 bits (69), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 9/74 (12%)
Query: 40 RWTRQETIVLIQGKRVVEDRIRGFRTSTSAFRSDHSEPKWDSVASYCKQYGVNRRPVQCR 99
RW R ET+ L+ R+ + + FR ST P W+ ++ + G R +C+
Sbjct: 41 RWPRPETLALL---RIRSEMDKAFRDSTLK------APLWEEISRKMMELGYKRSSKKCK 91
Query: 100 KRWGNLLVDFRKIK 113
+++ N+ ++ K
Sbjct: 92 EKFENVYKYHKRTK 105
>sp|Q8H181|GTL2_ARATH Trihelix transcription factor GTL2 OS=Arabidopsis thaliana
GN=At5g28300 PE=2 SV=1
Length = 619
Score = 36.2 bits (82), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 22/86 (25%), Positives = 41/86 (47%), Gaps = 3/86 (3%)
Query: 29 TDDRSKTARNPRWTRQETIVLIQGKRVVED-RIRGFRTSTSAFRSDHSEPKWDSVASYCK 87
+DD+S + RW + E + LI +R + + + S S + P W+ ++
Sbjct: 450 SDDKSDLGK--RWPKDEVLALINIRRSISNMNDDDHKDENSLSTSSKAVPLWERISKKML 507
Query: 88 QYGVNRRPVQCRKRWGNLLVDFRKIK 113
+ G R +C+++W N+ FRK K
Sbjct: 508 EIGYKRSAKRCKEKWENINKYFRKTK 533
>sp|Q56XP9|ERFL1_ARATH Ethylene-responsive transcription factor-like protein At4g13040
OS=Arabidopsis thaliana GN=At4g13040 PE=2 SV=2
Length = 226
Score = 35.8 bits (81), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 35/68 (51%), Gaps = 8/68 (11%)
Query: 16 PNGY-------NEEQAIDGNTDDRSKTARNPRWTRQETIVLIQGKRVVEDRIRGFRTSTS 68
PNGY E+ I G + ++ A NP T V+++ K+ +E+R R R+
Sbjct: 16 PNGYVTPLPFLTAEEMITGIPNPNARAAYNPGPAETVTTVIVE-KKAIEERSRRTRSKHM 74
Query: 69 AFRSDHSE 76
FRSD+S+
Sbjct: 75 HFRSDYSD 82
>sp|Q9C882|GTL1_ARATH Trihelix transcription factor GTL1 OS=Arabidopsis thaliana GN=GTL1
PE=1 SV=2
Length = 587
Score = 33.9 bits (76), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 19/77 (24%), Positives = 38/77 (49%), Gaps = 15/77 (19%)
Query: 40 RWTRQETIVLIQGKRVVEDRIRGFRTSTSAFRSDHSEPK---WDSVASYCKQYGVNRRPV 96
RW + E + LI + +E R + + PK W+ +++ K+ G NR
Sbjct: 435 RWPKAEILALINLRSGMEPRYQD------------NVPKGLLWEEISTSMKRMGYNRNAK 482
Query: 97 QCRKRWGNLLVDFRKIK 113
+C+++W N+ ++K+K
Sbjct: 483 RCKEKWENINKYYKKVK 499
Score = 31.6 bits (70), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 19/74 (25%), Positives = 36/74 (48%), Gaps = 9/74 (12%)
Query: 40 RWTRQETIVLIQGKRVVEDRIRGFRTSTSAFRSDHSEPKWDSVASYCKQYGVNRRPVQCR 99
RW R+ET+ L+ R+ D FR +T P W+ V+ + G R +C+
Sbjct: 62 RWPREETLALL---RIRSDMDSTFRDATL------KAPLWEHVSRKLLELGYKRSSKKCK 112
Query: 100 KRWGNLLVDFRKIK 113
+++ N+ +++ K
Sbjct: 113 EKFENVQKYYKRTK 126
>sp|Q5ZHX5|MSD2_CHICK Myb/SANT-like DNA-binding domain-containing protein 2 OS=Gallus
gallus GN=MSANTD2 PE=2 SV=1
Length = 556
Score = 33.9 bits (76), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 37/82 (45%), Gaps = 4/82 (4%)
Query: 36 ARNPRWTRQETIVLIQ---GKRVVEDRIRGFRTSTSAFRSDHSEPK-WDSVASYCKQYGV 91
R WT ET LI +R+VE R + + + F S P ++ V+ + G
Sbjct: 95 CRGMSWTPAETNALIAVWGNERLVEARYQQLEGAGTVFGSKAPGPAMYERVSRALAELGY 154
Query: 92 NRRPVQCRKRWGNLLVDFRKIK 113
R P QCR+R L + ++K
Sbjct: 155 ERTPSQCRERIKTLRRCYSRVK 176
>sp|Q6P1R3|MSD2_HUMAN Myb/SANT-like DNA-binding domain-containing protein 2 OS=Homo
sapiens GN=MSANTD2 PE=1 SV=1
Length = 559
Score = 33.9 bits (76), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 11/90 (12%)
Query: 36 ARNPRWTRQETIVLIQ---GKRVVEDRIRGFRTSTSAFRSDHSEPK-WDSVASYCKQYGV 91
R WT ET LI +R+VE R + + + F S P ++ V+ + G
Sbjct: 99 CRGMSWTPAETNALIAVWGNERLVEARYQQLEGAGTVFGSKAPGPAMYERVSRALAELGY 158
Query: 92 NRRPVQCRKRWGNLLVDFRKIKRWESQMKE 121
R P QCR+R + ++R S++KE
Sbjct: 159 ERTPSQCRER-------IKTLRRCYSRVKE 181
>sp|Q6NZR2|MSD2_MOUSE Myb/SANT-like DNA-binding domain-containing protein 2 OS=Mus
musculus GN=Msantd2 PE=1 SV=1
Length = 559
Score = 33.9 bits (76), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 11/90 (12%)
Query: 36 ARNPRWTRQETIVLIQ---GKRVVEDRIRGFRTSTSAFRSDHSEPK-WDSVASYCKQYGV 91
R WT ET LI +R+VE R + + + F S P ++ V+ + G
Sbjct: 99 CRGMSWTPAETNALIAVWGNERLVEARYQQLEGAGTVFGSKAPGPAMYERVSRALAELGY 158
Query: 92 NRRPVQCRKRWGNLLVDFRKIKRWESQMKE 121
R P QCR+R + ++R S++KE
Sbjct: 159 ERTPSQCRER-------IKTLRRCYSRVKE 181
>sp|Q4JL84|MYB59_ARATH Transcription factor MYB59 OS=Arabidopsis thaliana GN=MYB59 PE=2
SV=2
Length = 235
Score = 33.5 bits (75), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 46/110 (41%), Gaps = 25/110 (22%)
Query: 41 WTRQETIVLIQGKRVVEDRIRGFRTSTSAFRSDHSEPKWDSVASYCKQYGVNRRPVQCRK 100
WT QE I+L+ + DR +WD VA K G+NR CR
Sbjct: 13 WTEQEDILLVNFVHLFGDR------------------RWDFVA---KVSGLNRTGKSCRL 51
Query: 101 RWGNLL---VDFRKIKRWESQMKEEKQSFWVMRNESRKQMKLPGYFDREV 147
RW N L + K+ E ++ E + W R S+ KLPG D E+
Sbjct: 52 RWVNYLHPGLKRGKMTPQEERLVLELHAKWGNR-WSKIARKLPGRTDNEI 100
>sp|Q53NK6|MYBA1_ORYSJ Myb-related protein MYBAS1 OS=Oryza sativa subsp. japonica
GN=MYBAS1 PE=2 SV=1
Length = 237
Score = 33.5 bits (75), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 45/114 (39%), Gaps = 25/114 (21%)
Query: 37 RNPRWTRQETIVLIQGKRVVEDRIRGFRTSTSAFRSDHSEPKWDSVASYCKQYGVNRRPV 96
R WT QE + L+ R+ DR +WD VA K G+NR
Sbjct: 9 RKGPWTEQEDLQLVCTVRLFGDR------------------RWDFVA---KVSGLNRTGK 47
Query: 97 QCRKRWGNLLVDFRKIKRW---ESQMKEEKQSFWVMRNESRKQMKLPGYFDREV 147
CR RW N L K R E + E + W R SR +LPG D E+
Sbjct: 48 SCRLRWVNYLHPGLKHGRMSPKEEHLIIELHARWGNR-WSRIARRLPGRTDNEI 100
>sp|Q9LX82|MYB48_ARATH Transcription factor MYB48 OS=Arabidopsis thaliana GN=MYB48 PE=2
SV=1
Length = 256
Score = 33.1 bits (74), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 46/110 (41%), Gaps = 25/110 (22%)
Query: 41 WTRQETIVLIQGKRVVEDRIRGFRTSTSAFRSDHSEPKWDSVASYCKQYGVNRRPVQCRK 100
WT QE I+L+ + DR +WD +A K G+NR CR
Sbjct: 12 WTEQEDILLVNFVHLFGDR------------------RWDFIA---KVSGLNRTGKSCRL 50
Query: 101 RWGNLL---VDFRKIKRWESQMKEEKQSFWVMRNESRKQMKLPGYFDREV 147
RW N L + K+ E ++ E + W R S+ KLPG D E+
Sbjct: 51 RWVNYLHPGLKRGKMTPQEERLVLELHAKWGNR-WSKIARKLPGRTDNEI 99
>sp|C1F636|RS3_ACIC5 30S ribosomal protein S3 OS=Acidobacterium capsulatum (strain ATCC
51196 / DSM 11244 / JCM 7670) GN=rpsC PE=3 SV=1
Length = 223
Score = 32.7 bits (73), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 33/69 (47%), Gaps = 7/69 (10%)
Query: 252 RLSSCVSKETNMGDLLFKVLERNSSMLNTQLEAQNINCQLDREQKKEHSDNLIAAMNKLT 311
+L + + K TN ++ +LE N L+ QL A+NI QL E AM K
Sbjct: 85 KLKADLQKRTNR-EVFIDILEVNKPELDAQLIAENIALQL------EKRVGFRRAMRKAV 137
Query: 312 DALLRIGNK 320
D+ LR G K
Sbjct: 138 DSALRFGCK 146
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.312 0.127 0.364
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 118,436,871
Number of Sequences: 539616
Number of extensions: 4804334
Number of successful extensions: 13722
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 11
Number of HSP's successfully gapped in prelim test: 92
Number of HSP's that attempted gapping in prelim test: 13620
Number of HSP's gapped (non-prelim): 169
length of query: 321
length of database: 191,569,459
effective HSP length: 117
effective length of query: 204
effective length of database: 128,434,387
effective search space: 26200614948
effective search space used: 26200614948
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 61 (28.1 bits)