BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 028992
(200 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P92792|TOM20_SOLTU Mitochondrial import receptor subunit TOM20 OS=Solanum tuberosum
GN=TOM20 PE=1 SV=1
Length = 204
Score = 246 bits (627), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 124/192 (64%), Positives = 153/192 (79%), Gaps = 2/192 (1%)
Query: 5 QSDFDRLLLSEHNRKTAEANYAKDPLDADNLTRWGEALLELSQFESVSDSKKIINEAISK 64
QS+FDRLL EH RK+AE YA++PLDADNLTRWG ALLELSQF+ V++SK++I++A SK
Sbjct: 4 QSEFDRLLFFEHARKSAETTYAQNPLDADNLTRWGGALLELSQFQPVAESKQMISDATSK 63
Query: 65 FEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNELYQ 124
EEAL ++P KH LW LGNAHTS FLT D+ EAK F+KA++CFQ+A D +P+N+LY+
Sbjct: 64 LEEALTVNPEKHDALWCLGNAHTSHVFLTPDMDEAKVYFEKATQCFQQAFDADPSNDLYR 123
Query: 125 KSLEVSTKAPELHMELHKHGINQQTLGG--GSSASSAQSSKKKSSDLKYDIFGWAILAVG 182
KSLEV+ KAPELHME+H+HG QQT+ +S S+ S K KSSDLKYDIFGW ILAVG
Sbjct: 124 KSLEVTAKAPELHMEIHRHGPMQQTMAAEPSTSTSTKSSKKTKSSDLKYDIFGWVILAVG 183
Query: 183 IVAWVGMANSRI 194
IVAWVG A S +
Sbjct: 184 IVAWVGFAKSNM 195
>sp|P82873|TO202_ARATH Mitochondrial import receptor subunit TOM20-2 OS=Arabidopsis
thaliana GN=TOM20-2 PE=1 SV=2
Length = 210
Score = 229 bits (583), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 111/210 (52%), Positives = 145/210 (69%), Gaps = 10/210 (4%)
Query: 1 MEFSQSDFDRLLLSEHNRKTAEANYAKDPLDADNLTRWGEALLELSQFESVSDSKKIINE 60
MEFS +DF+RL++ EH RK +EA Y DPLD++NL +WG ALLELSQF+ + ++K ++N+
Sbjct: 1 MEFSTADFERLIMFEHARKNSEAQYKNDPLDSENLLKWGGALLELSQFQPIPEAKLMLND 60
Query: 61 AISKFEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTN 120
AISK EEAL I+P KH LW + NA+T+ F D EAK FDKA+E FQRA +E+P N
Sbjct: 61 AISKLEEALTINPGKHQALWCIANAYTAHAFYVHDPEEAKEHFDKATEYFQRAENEDPGN 120
Query: 121 ELYQKSLEVSTKAPELHMELHKHGINQQTLG----------GGSSASSAQSSKKKSSDLK 170
+ Y+KSL+ S KAPELHM+ G+ QQ LG S+ S + KK++++
Sbjct: 121 DTYRKSLDSSLKAPELHMQFMNQGMGQQILGGGGGGGGGGMASSNVSQSSKKKKRNTEFT 180
Query: 171 YDIFGWAILAVGIVAWVGMANSRIPPPPAR 200
YD+ GW ILA GIVAWVGMA S PPPPAR
Sbjct: 181 YDVCGWIILACGIVAWVGMAKSLGPPPPAR 210
>sp|P82874|TO203_ARATH Mitochondrial import receptor subunit TOM20-3 OS=Arabidopsis
thaliana GN=TOM20-3 PE=1 SV=1
Length = 202
Score = 225 bits (573), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 108/200 (54%), Positives = 142/200 (71%), Gaps = 3/200 (1%)
Query: 4 SQSDFDRLLLSEHNRKTAEANYAKDPLDADNLTRWGEALLELSQFESVSDSKKIINEAIS 63
++++FDR+LL E R+ AE Y +PLDADNLTRWG LLELSQF S+SD+K++I EAI+
Sbjct: 3 TETEFDRILLFEQIRQDAENTYKSNPLDADNLTRWGGVLLELSQFHSISDAKQMIQEAIT 62
Query: 64 KFEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNELY 123
KFEEAL+IDP K +W +GNA+TS FLT D +EAK +FD A++ FQ+AVDE+P N Y
Sbjct: 63 KFEEALLIDPKKDEAVWCIGNAYTSFAFLTPDETEAKHNFDLATQFFQQAVDEQPDNTHY 122
Query: 124 QKSLEVSTKAPELHMELHKHGINQQTLG---GGSSASSAQSSKKKSSDLKYDIFGWAILA 180
KSLE++ KAP+LH E +K G+ Q +G + SS KKSSD KYD GW ILA
Sbjct: 123 LKSLEMTAKAPQLHAEAYKQGLGSQPMGRVEAPAPPSSKAVKNKKSSDAKYDAMGWVILA 182
Query: 181 VGIVAWVGMANSRIPPPPAR 200
+G+VAW+ A + +P P R
Sbjct: 183 IGVVAWISFAKANVPVSPPR 202
>sp|A2WYG9|TOM20_ORYSI Probable mitochondrial import receptor subunit TOM20 OS=Oryza
sativa subsp. indica GN=TOM20 PE=2 SV=2
Length = 201
Score = 203 bits (516), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 101/197 (51%), Positives = 136/197 (69%)
Query: 4 SQSDFDRLLLSEHNRKTAEANYAKDPLDADNLTRWGEALLELSQFESVSDSKKIINEAIS 63
+ SD +R+ + + A+ Y ++P DADNLTRWG ALLELSQ + +S K + +A S
Sbjct: 5 AMSDPERMFFFDLACQNAKVTYEQNPHDADNLTRWGGALLELSQMRNGPESLKCLEDAES 64
Query: 64 KFEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNELY 123
K EEAL IDP K LW LGNA TS GF T+D +A F+KA++CFQ+AVD EP N+LY
Sbjct: 65 KLEEALKIDPMKADALWCLGNAQTSHGFFTSDTVKANEFFEKATQCFQKAVDVEPANDLY 124
Query: 124 QKSLEVSTKAPELHMELHKHGINQQTLGGGSSASSAQSSKKKSSDLKYDIFGWAILAVGI 183
+KSL++S+KAPELHME+H+ +Q + S++++ QS KKK SD YD+FGW +L VG+
Sbjct: 125 RKSLDLSSKAPELHMEIHRQMASQASQAASSTSNTRQSRKKKDSDFWYDVFGWVVLGVGM 184
Query: 184 VAWVGMANSRIPPPPAR 200
V WVG+A S PP R
Sbjct: 185 VVWVGLAKSNAPPQAPR 201
>sp|P82805|TO204_ARATH Mitochondrial import receptor subunit TOM20-4 OS=Arabidopsis
thaliana GN=TOM20-4 PE=1 SV=1
Length = 187
Score = 202 bits (514), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 105/194 (54%), Positives = 141/194 (72%), Gaps = 17/194 (8%)
Query: 5 QSDFDRLLLSEHNRKTAEANYAKDPLDADNLTRWGEALLELSQFESVSDSKKIINEAISK 64
Q++ +RL++ EH RK AEA Y K+PLDA+NLTRW ALLELSQF++ + K++I EAI K
Sbjct: 4 QNENERLMVFEHARKVAEATYVKNPLDAENLTRWAGALLELSQFQT--EPKQMILEAILK 61
Query: 65 FEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNELYQ 124
EALVIDP KH LW +GNAH S GFL++D +EA +F+KAS+ FQ AV+E+P +ELY+
Sbjct: 62 LGEALVIDPKKHDALWLIGNAHLSFGFLSSDQTEASDNFEKASQFFQLAVEEQPESELYR 121
Query: 125 KSLEVSTKAPELHMELHKHGINQQTLGG--GSSASSAQSSK-KKSSDLKYDIFGWAILAV 181
KSL +++KAPELH GG G S++SA++ K KK+S+ KYD+FGW ILA
Sbjct: 122 KSLTLASKAPELHT------------GGTAGPSSNSAKTMKQKKTSEFKYDVFGWVILAS 169
Query: 182 GIVAWVGMANSRIP 195
+VAW+ ANS+ P
Sbjct: 170 YVVAWISFANSQTP 183
>sp|Q5JJI4|TOM20_ORYSJ Probable mitochondrial import receptor subunit TOM20 OS=Oryza
sativa subsp. japonica GN=TOM20 PE=2 SV=1
Length = 202
Score = 198 bits (503), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 101/198 (51%), Positives = 136/198 (68%), Gaps = 1/198 (0%)
Query: 4 SQSDFDRLLLSEHNRKTAEANYAKDPLDADNLTRWGEALLELSQFESVSDSKKIINEAIS 63
+ SD +R+ + + A+ Y ++P DADNLTRWG ALLELSQ + +S K + +A S
Sbjct: 5 AMSDPERMFFFDLACQNAKVTYEQNPHDADNLTRWGGALLELSQMRNGPESLKCLEDAES 64
Query: 64 KFEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNELY 123
K EEAL IDP K LW LGNA TS GF T+D +A F+KA++CFQ+AVD EP N+LY
Sbjct: 65 KLEEALKIDPMKADALWCLGNAQTSHGFFTSDTVKANEFFEKATQCFQKAVDVEPANDLY 124
Query: 124 QKSLEVSTKAPELHMELHKHGINQQTLGGGSSASSAQS-SKKKSSDLKYDIFGWAILAVG 182
+KSL++S+KAPELHME+H+ +Q + S++++ QS KKK SD YD+FGW +L VG
Sbjct: 125 RKSLDLSSKAPELHMEIHRQMASQASQAASSTSNTRQSRKKKKDSDFWYDVFGWVVLGVG 184
Query: 183 IVAWVGMANSRIPPPPAR 200
+V WVG+A S PP R
Sbjct: 185 MVVWVGLAKSNAPPQAPR 202
>sp|P82872|TO201_ARATH Mitochondrial import receptor subunit TOM20-1 OS=Arabidopsis
thaliana GN=TOM20-1 PE=1 SV=1
Length = 188
Score = 171 bits (433), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 93/190 (48%), Positives = 123/190 (64%), Gaps = 6/190 (3%)
Query: 8 FDRLLLSEHNRKTAEANYAKDPLDADNLTRWGEALLELSQFESVSDSKKIINEAISKFEE 67
D+L E RK AE Y +P DADNL RWGEALLELSQF++V DS K+I +AISK E+
Sbjct: 1 MDKLNFFEEIRKDAEETYKLNPEDADNLMRWGEALLELSQFQNVIDSLKMIQDAISKLED 60
Query: 68 ALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNELYQKSL 127
A++IDP KH +W LGNA+TS LT D ++A+ +F A F AV ++P N++Y KSL
Sbjct: 61 AILIDPMKHDAVWCLGNAYTSYARLTPDDTQARLNFGLAYLFFGIAVAQQPDNQVYHKSL 120
Query: 128 EVSTKAPELHMELHKHGINQQTLGGGSSASSAQSSK----KKSSDLKYDIFGWAILAVGI 183
E++ KAP+LH HK+ + +L GG + S K KKSSD KY + GW ILA+G+
Sbjct: 121 EMADKAPQLHTGFHKNRL--LSLLGGVETLAIPSPKVVKNKKSSDEKYIVMGWVILAIGV 178
Query: 184 VAWVGMANSR 193
VA + R
Sbjct: 179 VACISFRKLR 188
>sp|Q96AY4|TTC28_HUMAN Tetratricopeptide repeat protein 28 OS=Homo sapiens GN=TTC28 PE=1
SV=4
Length = 2481
Score = 32.7 bits (73), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 14/30 (46%), Positives = 17/30 (56%)
Query: 80 WSLGNAHTSCGFLTADLSEAKGDFDKASEC 109
WSLGN S G++ DL AK D+ EC
Sbjct: 245 WSLGNTEKSTGYMQQDLDVAKTLGDQTGEC 274
>sp|O34384|YCEE_BACSU Uncharacterized protein YceE OS=Bacillus subtilis (strain 168)
GN=yceE PE=3 SV=1
Length = 192
Score = 32.7 bits (73), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 25/45 (55%), Gaps = 2/45 (4%)
Query: 102 DFDKASECFQRAVDEEPTNELYQKSL--EVSTKAPELHMELHKHG 144
+F + S F R VDEE NEL + L + S + + EL++HG
Sbjct: 121 NFGQVSNAFVRVVDEETQNELLRFDLGEDFSIETAVVVCELYRHG 165
>sp|A1WX31|DNAK_HALHL Chaperone protein DnaK OS=Halorhodospira halophila (strain DSM 244
/ SL1) GN=dnaK PE=3 SV=1
Length = 647
Score = 32.3 bits (72), Expect = 2.2, Method: Composition-based stats.
Identities = 25/85 (29%), Positives = 43/85 (50%), Gaps = 10/85 (11%)
Query: 94 ADLSEAKGDFDK-----ASECFQRAVDEEPTNELYQKSLEVSTKAPELHMELHKHGINQQ 148
+DL E GD +K A ++ D E + + QK+ E++TKA EL + + QQ
Sbjct: 551 SDLGEQVGDAEKQEIESAISELEQVKDGEDKDAIEQKTQELATKAGELAQKAY-----QQ 605
Query: 149 TLGGGSSASSAQSSKKKSSDLKYDI 173
GG +++ A + + S + K D+
Sbjct: 606 AGGGDEASADAGAGETASGEQKEDV 630
>sp|Q8NA56|TTC29_HUMAN Tetratricopeptide repeat protein 29 OS=Homo sapiens GN=TTC29 PE=2
SV=2
Length = 475
Score = 32.0 bits (71), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 51/112 (45%), Gaps = 16/112 (14%)
Query: 25 YAKDPLDADN---LTRWGEALLELSQFESVSDSKKIINEAISKFEEALVIDPAKHYTLWS 81
Y K D D+ L R EA+ ++ Q S+ + EAI K+ + V ++
Sbjct: 300 YCKISTDLDDDLSLGRGYEAIAKVLQ------SQGEMTEAI-KYLKKFVKIARNNFQSLD 352
Query: 82 LGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNELYQKSLEVSTKA 133
L A T G D+ KG ++KASECFQ+A D T EL L TK
Sbjct: 353 LVRASTMLG----DIYNEKGYYNKASECFQQAFD--TTVELMSMPLMDETKV 398
>sp|Q8LP10|SPY_EUSER Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY OS=Eustoma
exaltatum subsp. russellianum GN=SPY PE=2 SV=1
Length = 918
Score = 32.0 bits (71), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 28/60 (46%), Gaps = 11/60 (18%)
Query: 60 EAISKFEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPT 119
E I K+ EA+ IDP HY +C L SE +D A C++RA E PT
Sbjct: 156 EGIQKYYEAVKIDP--HYA--------PACYNLGVVYSEMM-QYDVALSCYERAATESPT 204
>sp|O67178|Y1088_AQUAE Uncharacterized protein aq_1088 OS=Aquifex aeolicus (strain VF5)
GN=aq_1088 PE=3 SV=1
Length = 761
Score = 31.2 bits (69), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 32/61 (52%), Gaps = 11/61 (18%)
Query: 58 INEAISKFEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEE 117
+ EA +E AL ++P K L++LG H + KG+ +KA + F+RA+ +
Sbjct: 119 LEEAQHYWERALSLNPNKVEILYNLGVLHLN-----------KGELEKALDLFERALRLK 167
Query: 118 P 118
P
Sbjct: 168 P 168
>sp|Q7SGD2|CLF1_NEUCR Pre-mRNA-splicing factor clf-1 OS=Neurospora crassa (strain ATCC
24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
GN=clf-1 PE=3 SV=1
Length = 695
Score = 30.8 bits (68), Expect = 6.2, Method: Composition-based stats.
Identities = 26/95 (27%), Positives = 41/95 (43%), Gaps = 16/95 (16%)
Query: 33 DNLTRWGEALLE--LSQFESVSDSKKIINEAI----SKFEEALVIDPAKHYTLWSLGNAH 86
DNL R L + FE K+ + + I + E V + AK+Y +W
Sbjct: 275 DNLPRSKSMTLHAHYTTFEKQFGDKEGVEDVILTKRRRLYEEQVKENAKNYDVW------ 328
Query: 87 TSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNE 121
F A L E+ GD D+ E ++RA+ + P +
Sbjct: 329 ----FDFARLEESGGDVDRTREVYERAIAQVPPTQ 359
>sp|Q9D706|RPAP3_MOUSE RNA polymerase II-associated protein 3 OS=Mus musculus GN=Rpap3
PE=1 SV=1
Length = 660
Score = 30.4 bits (67), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 50/111 (45%), Gaps = 20/111 (18%)
Query: 15 EHNRKTAEANYAKDPLDADNLTRWGE-----ALLELSQFESVSDSKKIINEAISKFEEAL 69
E +RKT E N K+ + + + W + L EL + +S DS +++ E+ +
Sbjct: 72 ESSRKTKEEN-TKNRIKSYDYDAWAKLDVDRILDELDKEDSTHDS---LSQESESDEDGI 127
Query: 70 VIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTN 120
+D K L GN + +G +D+A EC+ + +D +P N
Sbjct: 128 RVDSQKALVLKEKGNKYFK-----------QGKYDEAIECYTKGMDADPYN 167
>sp|Q4A600|ATPF_MYCS5 ATP synthase subunit b OS=Mycoplasma synoviae (strain 53) GN=atpF
PE=3 SV=2
Length = 193
Score = 30.4 bits (67), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 45/96 (46%), Gaps = 11/96 (11%)
Query: 33 DNLTRWGEALLELSQFESVSDSKKIINEAISKFEEALVIDPAKHYTLWSLGNAHTSCGFL 92
D+L + +A EL ES S I+N A SK +E L Y ++ +A+
Sbjct: 84 DSLAKLSQAQAEL--IESHKQSVNIVNNAKSKAQEILA-----SYKNKAISDANRLIEET 136
Query: 93 TADLSEAKGDFDKASECFQRAVDEEPTNELYQKSLE 128
DL+E K +FDK S R + E E+ Q+ L+
Sbjct: 137 QIDLNERKKEFDKNS----RILIAETATEIAQRILK 168
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.312 0.128 0.369
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 71,999,777
Number of Sequences: 539616
Number of extensions: 2592932
Number of successful extensions: 7195
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 9
Number of HSP's successfully gapped in prelim test: 42
Number of HSP's that attempted gapping in prelim test: 7162
Number of HSP's gapped (non-prelim): 66
length of query: 200
length of database: 191,569,459
effective HSP length: 112
effective length of query: 88
effective length of database: 131,132,467
effective search space: 11539657096
effective search space used: 11539657096
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 58 (26.9 bits)