BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 028954
(201 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O80931|AS1_ARATH Transcription factor AS1 OS=Arabidopsis thaliana GN=AS1 PE=1 SV=1
Length = 367
Score = 232 bits (591), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 130/208 (62%), Positives = 163/208 (78%), Gaps = 14/208 (6%)
Query: 3 TSNGGFLHTDPAT-PPN-LLPPWL--SNSSSNIKPPSPSVTLSLSPSTVAAAPLLIP--W 56
SNGGFLH++ PPN ++PPWL SN+ +N+ PSVTL+LSPSTVAAA P W
Sbjct: 158 NSNGGFLHSEQQVQPPNPVIPPWLATSNNGNNVVARPPSVTLTLSPSTVAAAAPQPPIPW 217
Query: 57 LQ---PERGSDN--TLVLGNMPPHGSVPICGENLLVSELMECCRELKEGHRAWAAHKKEA 111
LQ PER + LVLG+M P S E++ +SEL+ECCREL+EGHRAWA HKKEA
Sbjct: 218 LQQQQPERAENGPGGLVLGSMMP--SCSGSSESVFLSELVECCRELEEGHRAWADHKKEA 275
Query: 112 AWRLRRVELQLESEKACQRRDKMEEIEAKVKALGDEQRATLDRIEAEYREQIAGLRKDAE 171
AWRLRR+ELQLESEK C++R+KMEEIEAK+KAL +EQ+ +++IE EYREQ+ GLR+DAE
Sbjct: 276 AWRLRRLELQLESEKTCRQREKMEEIEAKMKALREEQKNAMEKIEGEYREQLVGLRRDAE 335
Query: 172 AKEQKLAEQWSAKHLRLTKFLE-QMGCR 198
AK+QKLA+QW+++H+RLTKFLE QMGCR
Sbjct: 336 AKDQKLADQWTSRHIRLTKFLEQQMGCR 363
>sp|Q94IB1|RS2_ORYSJ Protein rough sheath 2 homolog OS=Oryza sativa subsp. japonica
GN=RS2 PE=2 SV=1
Length = 342
Score = 164 bits (416), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 78/115 (67%), Positives = 104/115 (90%), Gaps = 1/115 (0%)
Query: 85 LLVSELMECCRELKEGHRAWAAHKKEAAWRLRRVELQLESEKACQRRDKMEEIEAKVKAL 144
++V+ELMECCRE++EG RAWAAH+KEAAWR++RVE+QLE+E+AC+RR+ EE EAK++AL
Sbjct: 209 VVVAELMECCREMEEGQRAWAAHRKEAAWRMKRVEMQLETERACRRREATEEFEAKMRAL 268
Query: 145 GDEQRATLDRIEAEYREQIAGLRKDAEAKEQKLAEQWSAKHLRLTKFLEQM-GCR 198
+EQ A ++R+EAEYRE++AGLR+DAEAKEQK+AEQW+AKH RL KFL+Q+ CR
Sbjct: 269 REEQAAAVERVEAEYREKMAGLRRDAEAKEQKMAEQWAAKHARLAKFLDQVAACR 323
>sp|Q9S7B2|RS2_MAIZE Protein rough sheath 2 OS=Zea mays GN=RS2 PE=1 SV=1
Length = 370
Score = 129 bits (323), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/127 (55%), Positives = 96/127 (75%), Gaps = 2/127 (1%)
Query: 70 GNMPPHGSVPICGENLLVSELMECCRELKEGHRAWAAHKKEAAWRLRRVELQLESEKACQ 129
G P G + G+ L +EL ECCREL+EG RAWAAH++EAAWRL+RVE QLE E+ +
Sbjct: 232 GGAAPPGMAVVDGQAL--AELAECCRELEEGRRAWAAHRREAAWRLKRVEQQLEMEREMR 289
Query: 130 RRDKMEEIEAKVKALGDEQRATLDRIEAEYREQIAGLRKDAEAKEQKLAEQWSAKHLRLT 189
RR+ EE EAK++ + EQ A +R+E ++RE++A LR+DA+ KE+K+AEQW+AKH R+
Sbjct: 290 RREVWEEFEAKMRTMRLEQAAAAERVERDHREKVAELRRDAQVKEEKMAEQWAAKHARVA 349
Query: 190 KFLEQMG 196
KF+EQMG
Sbjct: 350 KFVEQMG 356
>sp|Q9C9X0|MP701_ARATH Microtubule-associated protein 70-1 OS=Arabidopsis thaliana
GN=MAP70.1 PE=1 SV=1
Length = 622
Score = 34.3 bits (77), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 66/145 (45%), Gaps = 32/145 (22%)
Query: 3 TSNGGFLHTDPATPPNLLP---PWLSNSSSNIKPPSPSVTLSLSPSTVAAAPLLIPWLQP 59
+++GGFL + AT P +P P L+ S+S K S + PS AAA
Sbjct: 5 SADGGFLSAEQATTPVAIPTPYPSLTVSAS-YKEKSSGRRRPVRPSFDAAA--------- 54
Query: 60 ERGSDNTLVLGNMPPHGSVPICGE-NLLVSELMECCRELKEGHRAWAAHKKEAAWRLRRV 118
DN + HGS P+ E N L +E+ + REL E + A LR
Sbjct: 55 ----DNEFI---TLLHGSDPVKVELNRLENEVRDKDRELSEANAEIKA--------LRLS 99
Query: 119 ELQLESEKACQR-RDKMEEIEAKVK 142
E Q EKAC+ D++ +++ K+K
Sbjct: 100 ERQ--REKACEELTDELAKLDGKLK 122
>sp|Q8MI01|MUC15_BOVIN Mucin-15 OS=Bos taurus GN=MUC15 PE=1 SV=1
Length = 330
Score = 33.5 bits (75), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 21/75 (28%), Positives = 41/75 (54%), Gaps = 4/75 (5%)
Query: 23 WLSNSSSNIKPPSPSVTLSLSPSTVAAAPLL--IPWLQPERGSDNTLVLGNMPPHGSVPI 80
+ SN S++ SPS+ +LSP + + + +PW +DN+L+ + PP+ +VP+
Sbjct: 86 FYSNLSTDNSSRSPSLMPTLSPRSPSTHSFVSKLPWNS--SIADNSLLPASAPPNTTVPV 143
Query: 81 CGENLLVSELMECCR 95
EN +S + + +
Sbjct: 144 SSENFTLSSINDTMK 158
>sp|Q9C0A1|ZFHX2_HUMAN Zinc finger homeobox protein 2 OS=Homo sapiens GN=ZFHX2 PE=2 SV=3
Length = 2572
Score = 33.1 bits (74), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 25/43 (58%)
Query: 70 GNMPPHGSVPICGENLLVSELMECCRELKEGHRAWAAHKKEAA 112
G+MPP VPIC + L E++ RE H +AH+++AA
Sbjct: 2482 GSMPPPLRVPICTYHCLACEVLLSGREALASHLRSSAHRRKAA 2524
>sp|Q9Z1J1|TF7L1_MOUSE Transcription factor 7-like 1 OS=Mus musculus GN=Tcf7l1 PE=1 SV=2
Length = 584
Score = 32.3 bits (72), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 39/90 (43%), Gaps = 10/90 (11%)
Query: 10 HTDPATPPNLLPPWLSNSSSNIKPPSP---SVTLSLSPSTVAAAPLLIPWLQPERGSDNT 66
H PA+PP L P + + +PP P S LSP V P + WL P++G
Sbjct: 198 HFSPASPPTHLSPEIDPKTGIPRPPHPSELSPYYPLSPGAVGQIPHPLGWLVPQQGQP-- 255
Query: 67 LVLGNMPPHG---SVPICGENLLVSELMEC 93
+ ++PP G P N +S L+
Sbjct: 256 --MYSLPPGGFRHPYPALAMNASMSSLVSS 283
>sp|P70063|T7L1C_XENLA Transcription factor 7-like 1-C OS=Xenopus laevis GN=tcf7l1-c PE=2
SV=1
Length = 550
Score = 32.0 bits (71), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 32/70 (45%), Gaps = 7/70 (10%)
Query: 10 HTDPATPPNLLPPWLSNSSSNIKPPSP---SVTLSLSPSTVAAAPLLIPWLQPERGSDNT 66
H P TPP L P + + +PP P S LSP V P + WL P++G
Sbjct: 178 HFSPGTPPGHLSPEIDPKTGIPRPPHPSELSPYYPLSPGAVGQIPHRLGWLVPQQGQP-- 235
Query: 67 LVLGNMPPHG 76
+ ++PP G
Sbjct: 236 --MYSIPPGG 243
>sp|Q6GL68|TF7L1_XENTR Transcription factor 7-like 1 OS=Xenopus tropicalis GN=tcf7l1 PE=2
SV=1
Length = 553
Score = 32.0 bits (71), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 32/70 (45%), Gaps = 7/70 (10%)
Query: 10 HTDPATPPNLLPPWLSNSSSNIKPPSP---SVTLSLSPSTVAAAPLLIPWLQPERGSDNT 66
H P TPP L P + + +PP P S LSP V P + WL P++G
Sbjct: 179 HFSPGTPPGHLSPEIDPKTGIPRPPHPSELSPYYPLSPGAVGQIPHPLGWLVPQQGQP-- 236
Query: 67 LVLGNMPPHG 76
+ ++PP G
Sbjct: 237 --MYSIPPGG 244
>sp|Q8IDX6|RBP2A_PLAF7 Reticulocyte-binding protein 2 homolog a OS=Plasmodium falciparum
(isolate 3D7) GN=PF13_0198 PE=3 SV=1
Length = 3130
Score = 32.0 bits (71), Expect = 3.2, Method: Composition-based stats.
Identities = 26/99 (26%), Positives = 51/99 (51%), Gaps = 2/99 (2%)
Query: 92 ECCRELKEGHRAWAAHKKEAAWRLRRVELQLESEKACQRRDKMEEIEAKVKALGDEQRAT 151
E R+ +E + A K++ RL++ E E+ R+K E+++ K + L +++
Sbjct: 2774 ELKRQEQERLQKEEALKRQEQERLQKEEELKRQEQERLEREKQEQLQ-KEEELKRQEQER 2832
Query: 152 LDRIEAEYREQIAGLRKDAEAKEQKLAEQWSAKHLRLTK 190
L + EA R++ L+K+ E K Q+ E+ K + L +
Sbjct: 2833 LQKEEALKRQEQERLQKEEELKRQE-QERLERKKIELAE 2870
Score = 30.8 bits (68), Expect = 6.2, Method: Composition-based stats.
Identities = 23/85 (27%), Positives = 43/85 (50%), Gaps = 1/85 (1%)
Query: 106 AHKKEAAWRLRRVELQLESEKACQRRDKMEEIEAKVKALGDEQRATLDRIEAEYREQIAG 165
A K++ RL++ E E+ R+K E+++ K + L +++ L + EA R++
Sbjct: 2738 ALKRQEQERLQKEEELKRQEQERLEREKQEQLQ-KEEELKRQEQERLQKEEALKRQEQER 2796
Query: 166 LRKDAEAKEQKLAEQWSAKHLRLTK 190
L+K+ E K Q+ K +L K
Sbjct: 2797 LQKEEELKRQEQERLEREKQEQLQK 2821
>sp|Q90ZB6|T7L1B_XENLA Transcription factor 7-like 1-B OS=Xenopus laevis GN=tcf7l1-b PE=1
SV=1
Length = 551
Score = 31.6 bits (70), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 38/90 (42%), Gaps = 10/90 (11%)
Query: 10 HTDPATPPNLLPPWLSNSSSNIKPPSP---SVTLSLSPSTVAAAPLLIPWLQPERGSDNT 66
H P TPP L P + + +PP P S LSP V P + WL P++G
Sbjct: 179 HFSPGTPPGHLSPEIDPKTGIPRPPHPSELSPYYPLSPGAVGQIPHPLGWLVPQQGQP-- 236
Query: 67 LVLGNMPPHG---SVPICGENLLVSELMEC 93
+ ++PP G P N +S L+
Sbjct: 237 --MYSIPPGGFRHPYPALAMNASMSSLVSS 264
>sp|P70064|T7L1D_XENLA Transcription factor 7-like 1-D OS=Xenopus laevis GN=tcf7l1-d PE=2
SV=1
Length = 550
Score = 31.2 bits (69), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 32/70 (45%), Gaps = 7/70 (10%)
Query: 10 HTDPATPPNLLPPWLSNSSSNIKPPSP---SVTLSLSPSTVAAAPLLIPWLQPERGSDNT 66
H P TPP L P + + +PP P S LSP V P + WL P++G
Sbjct: 178 HFSPGTPPGHLSPEIDPKTGIPRPPHPSELSPYYPLSPGAVGQIPHPLGWLVPQQGQP-- 235
Query: 67 LVLGNMPPHG 76
+ ++PP G
Sbjct: 236 --MYSIPPGG 243
>sp|P70062|T7L1A_XENLA Transcription factor 7-like 1-A OS=Xenopus laevis GN=tcf7l1-a PE=1
SV=1
Length = 554
Score = 31.2 bits (69), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 31/70 (44%), Gaps = 7/70 (10%)
Query: 10 HTDPATPPNLLPPWLSNSSSNIKPPSP---SVTLSLSPSTVAAAPLLIPWLQPERGSDNT 66
H P TPP L P + + +PP P S LSP V P + WL P +G
Sbjct: 179 HFSPGTPPGHLSPEIDPKTGIPRPPHPSELSPYYPLSPGAVGQIPHPLGWLVPPQGQP-- 236
Query: 67 LVLGNMPPHG 76
+ ++PP G
Sbjct: 237 --MYSIPPGG 244
>sp|Q924A0|TF7L2_MOUSE Transcription factor 7-like 2 OS=Mus musculus GN=Tcf7l2 PE=1 SV=2
Length = 459
Score = 30.8 bits (68), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 26/56 (46%), Gaps = 3/56 (5%)
Query: 10 HTDPATPPNLLPPWLSNSSSNIKPPSP---SVTLSLSPSTVAAAPLLIPWLQPERG 62
H P PP LP + + +PP P S LSP TV P + WL P++G
Sbjct: 187 HFTPGNPPPHLPADVDPKTGIPRPPHPPDISPYYPLSPGTVGQIPHPLGWLVPQQG 242
>sp|Q09YK4|CTTB2_ATEGE Cortactin-binding protein 2 OS=Ateles geoffroyi GN=CTTNBP2 PE=3
SV=1
Length = 1660
Score = 30.4 bits (67), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 35/74 (47%), Gaps = 18/74 (24%)
Query: 14 ATPPNLLPPW--------LSNSSSNIKPPSPSVTLSLSP----------STVAAAPLLIP 55
A+PP+ LP LS SSS PP PS+ L+++P S V A P P
Sbjct: 585 ASPPSSLPQGNRVINEENLSKSSSPQLPPKPSIDLTVAPAGCAVSALATSQVGAWPAATP 644
Query: 56 WLQPERGSDNTLVL 69
L SD++LV+
Sbjct: 645 GLNQPACSDSSLVI 658
>sp|Q9NQB0|TF7L2_HUMAN Transcription factor 7-like 2 OS=Homo sapiens GN=TCF7L2 PE=1 SV=2
Length = 619
Score = 30.4 bits (67), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 26/56 (46%), Gaps = 3/56 (5%)
Query: 10 HTDPATPPNLLPPWLSNSSSNIKPPSP---SVTLSLSPSTVAAAPLLIPWLQPERG 62
H P PP LP + + +PP P S LSP TV P + WL P++G
Sbjct: 210 HFTPGNPPPHLPADVDPKTGIPRPPHPPDISPYYPLSPGTVGQIPHPLGWLVPQQG 265
>sp|P47460|HMW2_MYCGE Cytadherence high molecular weight protein 2 OS=Mycoplasma
genitalium (strain ATCC 33530 / G-37 / NCTC 10195)
GN=hmw2 PE=3 SV=1
Length = 1805
Score = 30.4 bits (67), Expect = 8.7, Method: Composition-based stats.
Identities = 21/69 (30%), Positives = 34/69 (49%), Gaps = 3/69 (4%)
Query: 125 EKACQRRDKMEEIEAKVKALGDEQRATLDRIEAEYREQIAGLRKDAEAKEQKLAEQWSAK 184
EK Q R ++EE+ K D++ L E + ++ A + EAK+Q+L E AK
Sbjct: 485 EKVAQERKELEELYLVKKQKQDQKENELLFFEKQLKQHQADFENELEAKQQELFE---AK 541
Query: 185 HLRLTKFLE 193
H F++
Sbjct: 542 HALERSFIK 550
>sp|Q9HRY2|DNAK_HALSA Chaperone protein DnaK OS=Halobacterium salinarum (strain ATCC
700922 / JCM 11081 / NRC-1) GN=dnaK PE=3 SV=1
Length = 629
Score = 30.4 bits (67), Expect = 9.1, Method: Composition-based stats.
Identities = 37/135 (27%), Positives = 57/135 (42%), Gaps = 24/135 (17%)
Query: 33 PPSPSVTLSLSPSTVAAAPLLIPWLQPERGSDNTLVLGNMPPHGSVPICGENLLVSELME 92
PP+P+ T + + A ++ ++GS N S+ I G L E ++
Sbjct: 436 PPAPAGTPQIEVTFEIDADGIVNVEAEDQGSGNA---------ESITIEGGAGLSDEQID 486
Query: 93 CCRELKEGHRAWAAHKKEAAWRLRRVELQLESEKACQR---------RDKMEEIEAKVKA 143
+E E AH +E R RR+E + E+E A QR E++EA V
Sbjct: 487 EMQEDAE------AHAEEDEQRRRRIEARNEAETAIQRAESLLEENEELVDEDLEADVND 540
Query: 144 LGDEQRATLDRIEAE 158
D+ +A LD E E
Sbjct: 541 AIDDVQAVLDEDEPE 555
>sp|B0R3H4|DNAK_HALS3 Chaperone protein DnaK OS=Halobacterium salinarum (strain ATCC
29341 / DSM 671 / R1) GN=dnaK PE=3 SV=1
Length = 629
Score = 30.4 bits (67), Expect = 9.1, Method: Composition-based stats.
Identities = 37/135 (27%), Positives = 57/135 (42%), Gaps = 24/135 (17%)
Query: 33 PPSPSVTLSLSPSTVAAAPLLIPWLQPERGSDNTLVLGNMPPHGSVPICGENLLVSELME 92
PP+P+ T + + A ++ ++GS N S+ I G L E ++
Sbjct: 436 PPAPAGTPQIEVTFEIDADGIVNVEAEDQGSGNA---------ESITIEGGAGLSDEQID 486
Query: 93 CCRELKEGHRAWAAHKKEAAWRLRRVELQLESEKACQR---------RDKMEEIEAKVKA 143
+E E AH +E R RR+E + E+E A QR E++EA V
Sbjct: 487 EMQEDAE------AHAEEDEQRRRRIEARNEAETAIQRAESLLEENEELVDEDLEADVND 540
Query: 144 LGDEQRATLDRIEAE 158
D+ +A LD E E
Sbjct: 541 AIDDVQAVLDEDEPE 555
>sp|Q9HCS4|TF7L1_HUMAN Transcription factor 7-like 1 OS=Homo sapiens GN=TCF7L1 PE=2 SV=1
Length = 588
Score = 30.0 bits (66), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 38/90 (42%), Gaps = 10/90 (11%)
Query: 10 HTDPATPPNLLPPWLSNSSSNIKPPSP---SVTLSLSPSTVAAAPLLIPWLQPERGSDNT 66
H P +PP L P + + +PP P S LSP V P + WL P++G
Sbjct: 201 HFSPGSPPTHLSPEIDPKTGIPRPPHPSELSPYYPLSPGAVGQIPHPLGWLVPQQGQP-- 258
Query: 67 LVLGNMPPHG---SVPICGENLLVSELMEC 93
+ ++PP G P N +S L+
Sbjct: 259 --MYSLPPGGFRHPYPALAMNASMSSLVSS 286
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.314 0.130 0.392
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 78,464,689
Number of Sequences: 539616
Number of extensions: 3215001
Number of successful extensions: 14687
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 71
Number of HSP's successfully gapped in prelim test: 301
Number of HSP's that attempted gapping in prelim test: 14135
Number of HSP's gapped (non-prelim): 820
length of query: 201
length of database: 191,569,459
effective HSP length: 112
effective length of query: 89
effective length of database: 131,132,467
effective search space: 11670789563
effective search space used: 11670789563
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 58 (26.9 bits)