BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 028992
         (200 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|P92792|TOM20_SOLTU Mitochondrial import receptor subunit TOM20 OS=Solanum tuberosum
           GN=TOM20 PE=1 SV=1
          Length = 204

 Score =  246 bits (627), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 124/192 (64%), Positives = 153/192 (79%), Gaps = 2/192 (1%)

Query: 5   QSDFDRLLLSEHNRKTAEANYAKDPLDADNLTRWGEALLELSQFESVSDSKKIINEAISK 64
           QS+FDRLL  EH RK+AE  YA++PLDADNLTRWG ALLELSQF+ V++SK++I++A SK
Sbjct: 4   QSEFDRLLFFEHARKSAETTYAQNPLDADNLTRWGGALLELSQFQPVAESKQMISDATSK 63

Query: 65  FEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNELYQ 124
            EEAL ++P KH  LW LGNAHTS  FLT D+ EAK  F+KA++CFQ+A D +P+N+LY+
Sbjct: 64  LEEALTVNPEKHDALWCLGNAHTSHVFLTPDMDEAKVYFEKATQCFQQAFDADPSNDLYR 123

Query: 125 KSLEVSTKAPELHMELHKHGINQQTLGG--GSSASSAQSSKKKSSDLKYDIFGWAILAVG 182
           KSLEV+ KAPELHME+H+HG  QQT+     +S S+  S K KSSDLKYDIFGW ILAVG
Sbjct: 124 KSLEVTAKAPELHMEIHRHGPMQQTMAAEPSTSTSTKSSKKTKSSDLKYDIFGWVILAVG 183

Query: 183 IVAWVGMANSRI 194
           IVAWVG A S +
Sbjct: 184 IVAWVGFAKSNM 195


>sp|P82873|TO202_ARATH Mitochondrial import receptor subunit TOM20-2 OS=Arabidopsis
           thaliana GN=TOM20-2 PE=1 SV=2
          Length = 210

 Score =  229 bits (583), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 111/210 (52%), Positives = 145/210 (69%), Gaps = 10/210 (4%)

Query: 1   MEFSQSDFDRLLLSEHNRKTAEANYAKDPLDADNLTRWGEALLELSQFESVSDSKKIINE 60
           MEFS +DF+RL++ EH RK +EA Y  DPLD++NL +WG ALLELSQF+ + ++K ++N+
Sbjct: 1   MEFSTADFERLIMFEHARKNSEAQYKNDPLDSENLLKWGGALLELSQFQPIPEAKLMLND 60

Query: 61  AISKFEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTN 120
           AISK EEAL I+P KH  LW + NA+T+  F   D  EAK  FDKA+E FQRA +E+P N
Sbjct: 61  AISKLEEALTINPGKHQALWCIANAYTAHAFYVHDPEEAKEHFDKATEYFQRAENEDPGN 120

Query: 121 ELYQKSLEVSTKAPELHMELHKHGINQQTLG----------GGSSASSAQSSKKKSSDLK 170
           + Y+KSL+ S KAPELHM+    G+ QQ LG            S+ S +   KK++++  
Sbjct: 121 DTYRKSLDSSLKAPELHMQFMNQGMGQQILGGGGGGGGGGMASSNVSQSSKKKKRNTEFT 180

Query: 171 YDIFGWAILAVGIVAWVGMANSRIPPPPAR 200
           YD+ GW ILA GIVAWVGMA S  PPPPAR
Sbjct: 181 YDVCGWIILACGIVAWVGMAKSLGPPPPAR 210


>sp|P82874|TO203_ARATH Mitochondrial import receptor subunit TOM20-3 OS=Arabidopsis
           thaliana GN=TOM20-3 PE=1 SV=1
          Length = 202

 Score =  225 bits (573), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 108/200 (54%), Positives = 142/200 (71%), Gaps = 3/200 (1%)

Query: 4   SQSDFDRLLLSEHNRKTAEANYAKDPLDADNLTRWGEALLELSQFESVSDSKKIINEAIS 63
           ++++FDR+LL E  R+ AE  Y  +PLDADNLTRWG  LLELSQF S+SD+K++I EAI+
Sbjct: 3   TETEFDRILLFEQIRQDAENTYKSNPLDADNLTRWGGVLLELSQFHSISDAKQMIQEAIT 62

Query: 64  KFEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNELY 123
           KFEEAL+IDP K   +W +GNA+TS  FLT D +EAK +FD A++ FQ+AVDE+P N  Y
Sbjct: 63  KFEEALLIDPKKDEAVWCIGNAYTSFAFLTPDETEAKHNFDLATQFFQQAVDEQPDNTHY 122

Query: 124 QKSLEVSTKAPELHMELHKHGINQQTLG---GGSSASSAQSSKKKSSDLKYDIFGWAILA 180
            KSLE++ KAP+LH E +K G+  Q +G     +  SS     KKSSD KYD  GW ILA
Sbjct: 123 LKSLEMTAKAPQLHAEAYKQGLGSQPMGRVEAPAPPSSKAVKNKKSSDAKYDAMGWVILA 182

Query: 181 VGIVAWVGMANSRIPPPPAR 200
           +G+VAW+  A + +P  P R
Sbjct: 183 IGVVAWISFAKANVPVSPPR 202


>sp|A2WYG9|TOM20_ORYSI Probable mitochondrial import receptor subunit TOM20 OS=Oryza
           sativa subsp. indica GN=TOM20 PE=2 SV=2
          Length = 201

 Score =  203 bits (516), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 101/197 (51%), Positives = 136/197 (69%)

Query: 4   SQSDFDRLLLSEHNRKTAEANYAKDPLDADNLTRWGEALLELSQFESVSDSKKIINEAIS 63
           + SD +R+   +   + A+  Y ++P DADNLTRWG ALLELSQ  +  +S K + +A S
Sbjct: 5   AMSDPERMFFFDLACQNAKVTYEQNPHDADNLTRWGGALLELSQMRNGPESLKCLEDAES 64

Query: 64  KFEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNELY 123
           K EEAL IDP K   LW LGNA TS GF T+D  +A   F+KA++CFQ+AVD EP N+LY
Sbjct: 65  KLEEALKIDPMKADALWCLGNAQTSHGFFTSDTVKANEFFEKATQCFQKAVDVEPANDLY 124

Query: 124 QKSLEVSTKAPELHMELHKHGINQQTLGGGSSASSAQSSKKKSSDLKYDIFGWAILAVGI 183
           +KSL++S+KAPELHME+H+   +Q +    S++++ QS KKK SD  YD+FGW +L VG+
Sbjct: 125 RKSLDLSSKAPELHMEIHRQMASQASQAASSTSNTRQSRKKKDSDFWYDVFGWVVLGVGM 184

Query: 184 VAWVGMANSRIPPPPAR 200
           V WVG+A S  PP   R
Sbjct: 185 VVWVGLAKSNAPPQAPR 201


>sp|P82805|TO204_ARATH Mitochondrial import receptor subunit TOM20-4 OS=Arabidopsis
           thaliana GN=TOM20-4 PE=1 SV=1
          Length = 187

 Score =  202 bits (514), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 105/194 (54%), Positives = 141/194 (72%), Gaps = 17/194 (8%)

Query: 5   QSDFDRLLLSEHNRKTAEANYAKDPLDADNLTRWGEALLELSQFESVSDSKKIINEAISK 64
           Q++ +RL++ EH RK AEA Y K+PLDA+NLTRW  ALLELSQF++  + K++I EAI K
Sbjct: 4   QNENERLMVFEHARKVAEATYVKNPLDAENLTRWAGALLELSQFQT--EPKQMILEAILK 61

Query: 65  FEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNELYQ 124
             EALVIDP KH  LW +GNAH S GFL++D +EA  +F+KAS+ FQ AV+E+P +ELY+
Sbjct: 62  LGEALVIDPKKHDALWLIGNAHLSFGFLSSDQTEASDNFEKASQFFQLAVEEQPESELYR 121

Query: 125 KSLEVSTKAPELHMELHKHGINQQTLGG--GSSASSAQSSK-KKSSDLKYDIFGWAILAV 181
           KSL +++KAPELH             GG  G S++SA++ K KK+S+ KYD+FGW ILA 
Sbjct: 122 KSLTLASKAPELHT------------GGTAGPSSNSAKTMKQKKTSEFKYDVFGWVILAS 169

Query: 182 GIVAWVGMANSRIP 195
            +VAW+  ANS+ P
Sbjct: 170 YVVAWISFANSQTP 183


>sp|Q5JJI4|TOM20_ORYSJ Probable mitochondrial import receptor subunit TOM20 OS=Oryza
           sativa subsp. japonica GN=TOM20 PE=2 SV=1
          Length = 202

 Score =  198 bits (503), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 101/198 (51%), Positives = 136/198 (68%), Gaps = 1/198 (0%)

Query: 4   SQSDFDRLLLSEHNRKTAEANYAKDPLDADNLTRWGEALLELSQFESVSDSKKIINEAIS 63
           + SD +R+   +   + A+  Y ++P DADNLTRWG ALLELSQ  +  +S K + +A S
Sbjct: 5   AMSDPERMFFFDLACQNAKVTYEQNPHDADNLTRWGGALLELSQMRNGPESLKCLEDAES 64

Query: 64  KFEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNELY 123
           K EEAL IDP K   LW LGNA TS GF T+D  +A   F+KA++CFQ+AVD EP N+LY
Sbjct: 65  KLEEALKIDPMKADALWCLGNAQTSHGFFTSDTVKANEFFEKATQCFQKAVDVEPANDLY 124

Query: 124 QKSLEVSTKAPELHMELHKHGINQQTLGGGSSASSAQS-SKKKSSDLKYDIFGWAILAVG 182
           +KSL++S+KAPELHME+H+   +Q +    S++++ QS  KKK SD  YD+FGW +L VG
Sbjct: 125 RKSLDLSSKAPELHMEIHRQMASQASQAASSTSNTRQSRKKKKDSDFWYDVFGWVVLGVG 184

Query: 183 IVAWVGMANSRIPPPPAR 200
           +V WVG+A S  PP   R
Sbjct: 185 MVVWVGLAKSNAPPQAPR 202


>sp|P82872|TO201_ARATH Mitochondrial import receptor subunit TOM20-1 OS=Arabidopsis
           thaliana GN=TOM20-1 PE=1 SV=1
          Length = 188

 Score =  171 bits (433), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 93/190 (48%), Positives = 123/190 (64%), Gaps = 6/190 (3%)

Query: 8   FDRLLLSEHNRKTAEANYAKDPLDADNLTRWGEALLELSQFESVSDSKKIINEAISKFEE 67
            D+L   E  RK AE  Y  +P DADNL RWGEALLELSQF++V DS K+I +AISK E+
Sbjct: 1   MDKLNFFEEIRKDAEETYKLNPEDADNLMRWGEALLELSQFQNVIDSLKMIQDAISKLED 60

Query: 68  ALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNELYQKSL 127
           A++IDP KH  +W LGNA+TS   LT D ++A+ +F  A   F  AV ++P N++Y KSL
Sbjct: 61  AILIDPMKHDAVWCLGNAYTSYARLTPDDTQARLNFGLAYLFFGIAVAQQPDNQVYHKSL 120

Query: 128 EVSTKAPELHMELHKHGINQQTLGGGSSASSAQSSK----KKSSDLKYDIFGWAILAVGI 183
           E++ KAP+LH   HK+ +   +L GG    +  S K    KKSSD KY + GW ILA+G+
Sbjct: 121 EMADKAPQLHTGFHKNRL--LSLLGGVETLAIPSPKVVKNKKSSDEKYIVMGWVILAIGV 178

Query: 184 VAWVGMANSR 193
           VA +     R
Sbjct: 179 VACISFRKLR 188


>sp|Q96AY4|TTC28_HUMAN Tetratricopeptide repeat protein 28 OS=Homo sapiens GN=TTC28 PE=1
           SV=4
          Length = 2481

 Score = 32.7 bits (73), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 14/30 (46%), Positives = 17/30 (56%)

Query: 80  WSLGNAHTSCGFLTADLSEAKGDFDKASEC 109
           WSLGN   S G++  DL  AK   D+  EC
Sbjct: 245 WSLGNTEKSTGYMQQDLDVAKTLGDQTGEC 274


>sp|O34384|YCEE_BACSU Uncharacterized protein YceE OS=Bacillus subtilis (strain 168)
           GN=yceE PE=3 SV=1
          Length = 192

 Score = 32.7 bits (73), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 17/45 (37%), Positives = 25/45 (55%), Gaps = 2/45 (4%)

Query: 102 DFDKASECFQRAVDEEPTNELYQKSL--EVSTKAPELHMELHKHG 144
           +F + S  F R VDEE  NEL +  L  + S +   +  EL++HG
Sbjct: 121 NFGQVSNAFVRVVDEETQNELLRFDLGEDFSIETAVVVCELYRHG 165


>sp|A1WX31|DNAK_HALHL Chaperone protein DnaK OS=Halorhodospira halophila (strain DSM 244
           / SL1) GN=dnaK PE=3 SV=1
          Length = 647

 Score = 32.3 bits (72), Expect = 2.2,   Method: Composition-based stats.
 Identities = 25/85 (29%), Positives = 43/85 (50%), Gaps = 10/85 (11%)

Query: 94  ADLSEAKGDFDK-----ASECFQRAVDEEPTNELYQKSLEVSTKAPELHMELHKHGINQQ 148
           +DL E  GD +K     A    ++  D E  + + QK+ E++TKA EL  + +     QQ
Sbjct: 551 SDLGEQVGDAEKQEIESAISELEQVKDGEDKDAIEQKTQELATKAGELAQKAY-----QQ 605

Query: 149 TLGGGSSASSAQSSKKKSSDLKYDI 173
             GG  +++ A + +  S + K D+
Sbjct: 606 AGGGDEASADAGAGETASGEQKEDV 630


>sp|Q8NA56|TTC29_HUMAN Tetratricopeptide repeat protein 29 OS=Homo sapiens GN=TTC29 PE=2
           SV=2
          Length = 475

 Score = 32.0 bits (71), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 51/112 (45%), Gaps = 16/112 (14%)

Query: 25  YAKDPLDADN---LTRWGEALLELSQFESVSDSKKIINEAISKFEEALVIDPAKHYTLWS 81
           Y K   D D+   L R  EA+ ++ Q      S+  + EAI K+ +  V     ++    
Sbjct: 300 YCKISTDLDDDLSLGRGYEAIAKVLQ------SQGEMTEAI-KYLKKFVKIARNNFQSLD 352

Query: 82  LGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNELYQKSLEVSTKA 133
           L  A T  G    D+   KG ++KASECFQ+A D   T EL    L   TK 
Sbjct: 353 LVRASTMLG----DIYNEKGYYNKASECFQQAFD--TTVELMSMPLMDETKV 398


>sp|Q8LP10|SPY_EUSER Probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY OS=Eustoma
           exaltatum subsp. russellianum GN=SPY PE=2 SV=1
          Length = 918

 Score = 32.0 bits (71), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 28/60 (46%), Gaps = 11/60 (18%)

Query: 60  EAISKFEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPT 119
           E I K+ EA+ IDP  HY          +C  L    SE    +D A  C++RA  E PT
Sbjct: 156 EGIQKYYEAVKIDP--HYA--------PACYNLGVVYSEMM-QYDVALSCYERAATESPT 204


>sp|O67178|Y1088_AQUAE Uncharacterized protein aq_1088 OS=Aquifex aeolicus (strain VF5)
           GN=aq_1088 PE=3 SV=1
          Length = 761

 Score = 31.2 bits (69), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 19/61 (31%), Positives = 32/61 (52%), Gaps = 11/61 (18%)

Query: 58  INEAISKFEEALVIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEE 117
           + EA   +E AL ++P K   L++LG  H +           KG+ +KA + F+RA+  +
Sbjct: 119 LEEAQHYWERALSLNPNKVEILYNLGVLHLN-----------KGELEKALDLFERALRLK 167

Query: 118 P 118
           P
Sbjct: 168 P 168


>sp|Q7SGD2|CLF1_NEUCR Pre-mRNA-splicing factor clf-1 OS=Neurospora crassa (strain ATCC
           24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
           GN=clf-1 PE=3 SV=1
          Length = 695

 Score = 30.8 bits (68), Expect = 6.2,   Method: Composition-based stats.
 Identities = 26/95 (27%), Positives = 41/95 (43%), Gaps = 16/95 (16%)

Query: 33  DNLTRWGEALLE--LSQFESVSDSKKIINEAI----SKFEEALVIDPAKHYTLWSLGNAH 86
           DNL R     L    + FE     K+ + + I     +  E  V + AK+Y +W      
Sbjct: 275 DNLPRSKSMTLHAHYTTFEKQFGDKEGVEDVILTKRRRLYEEQVKENAKNYDVW------ 328

Query: 87  TSCGFLTADLSEAKGDFDKASECFQRAVDEEPTNE 121
               F  A L E+ GD D+  E ++RA+ + P  +
Sbjct: 329 ----FDFARLEESGGDVDRTREVYERAIAQVPPTQ 359


>sp|Q9D706|RPAP3_MOUSE RNA polymerase II-associated protein 3 OS=Mus musculus GN=Rpap3
           PE=1 SV=1
          Length = 660

 Score = 30.4 bits (67), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 28/111 (25%), Positives = 50/111 (45%), Gaps = 20/111 (18%)

Query: 15  EHNRKTAEANYAKDPLDADNLTRWGE-----ALLELSQFESVSDSKKIINEAISKFEEAL 69
           E +RKT E N  K+ + + +   W +      L EL + +S  DS   +++     E+ +
Sbjct: 72  ESSRKTKEEN-TKNRIKSYDYDAWAKLDVDRILDELDKEDSTHDS---LSQESESDEDGI 127

Query: 70  VIDPAKHYTLWSLGNAHTSCGFLTADLSEAKGDFDKASECFQRAVDEEPTN 120
            +D  K   L   GN +             +G +D+A EC+ + +D +P N
Sbjct: 128 RVDSQKALVLKEKGNKYFK-----------QGKYDEAIECYTKGMDADPYN 167


>sp|Q4A600|ATPF_MYCS5 ATP synthase subunit b OS=Mycoplasma synoviae (strain 53) GN=atpF
           PE=3 SV=2
          Length = 193

 Score = 30.4 bits (67), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 45/96 (46%), Gaps = 11/96 (11%)

Query: 33  DNLTRWGEALLELSQFESVSDSKKIINEAISKFEEALVIDPAKHYTLWSLGNAHTSCGFL 92
           D+L +  +A  EL   ES   S  I+N A SK +E L       Y   ++ +A+      
Sbjct: 84  DSLAKLSQAQAEL--IESHKQSVNIVNNAKSKAQEILA-----SYKNKAISDANRLIEET 136

Query: 93  TADLSEAKGDFDKASECFQRAVDEEPTNELYQKSLE 128
             DL+E K +FDK S    R +  E   E+ Q+ L+
Sbjct: 137 QIDLNERKKEFDKNS----RILIAETATEIAQRILK 168


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.312    0.128    0.369 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 71,999,777
Number of Sequences: 539616
Number of extensions: 2592932
Number of successful extensions: 7195
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 9
Number of HSP's successfully gapped in prelim test: 42
Number of HSP's that attempted gapping in prelim test: 7162
Number of HSP's gapped (non-prelim): 66
length of query: 200
length of database: 191,569,459
effective HSP length: 112
effective length of query: 88
effective length of database: 131,132,467
effective search space: 11539657096
effective search space used: 11539657096
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 58 (26.9 bits)