BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 029886
         (186 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|F4J469|TI22L_ARATH Protein TIC 22-like, chloroplastic OS=Arabidopsis thaliana
           GN=TIC22L PE=3 SV=1
          Length = 313

 Score =  275 bits (702), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 140/203 (68%), Positives = 159/203 (78%), Gaps = 20/203 (9%)

Query: 1   MSAEAIEERLAGVPVYALSNCNEEFVLVSGAKTGKSLGLMCFKKEDAEALLHQMKSMDPA 60
           +SA+AIEERLAGVPVYALSN NEEFVLVSG  +GKSLGL+  K+EDAE LL +MKSMDP 
Sbjct: 110 LSADAIEERLAGVPVYALSNSNEEFVLVSGTSSGKSLGLLFCKEEDAETLLKEMKSMDPR 169

Query: 61  MRKEGSRVVPVPLNKVFQLKVNGVAFRLIPESTQVKNALREMEKAGFSDDAFAGVPVFQ- 119
           MRKEGS+VV + L+KVFQLKVNGVAFRLIPESTQVKNAL+E + AG  DD F GVPVFQ 
Sbjct: 170 MRKEGSKVVALALSKVFQLKVNGVAFRLIPESTQVKNALKERKTAGIDDDDFHGVPVFQS 229

Query: 120 -------------------EDLEKSLRRASSDQNKLNPAFRMGDIQVAVFEEIIKGMKES 160
                              EDLEKSL RASS QN+LNPA + GDIQVAVFE+I+KGM+ES
Sbjct: 230 KSLILRSENMSYRPVFFRKEDLEKSLIRASSQQNRLNPALKPGDIQVAVFEDIVKGMRES 289

Query: 161 TTSAWNDVVFIPPGFDVSTNPNQ 183
           TTS W+D+VFIPPGF+VST   Q
Sbjct: 290 TTSNWDDIVFIPPGFEVSTEQTQ 312


>sp|Q9ZST9|TIC22_PEA Protein TIC 22, chloroplastic OS=Pisum sativum GN=TIC22 PE=1 SV=1
          Length = 252

 Score =  133 bits (334), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 78/195 (40%), Positives = 114/195 (58%), Gaps = 28/195 (14%)

Query: 1   MSAEAIEERLAGVPVYALSNCNEEFVLVSGAKTGKSLGLMCFKKEDAEALLHQMKSMDPA 60
           ++   + + LAG  VY +SN + EFVL+S A+  KS+GL+CF++EDAEA L Q++S    
Sbjct: 53  LTPNHVAKSLAGTSVYTVSNSDNEFVLMSDAEGAKSIGLLCFRQEDAEAFLAQVRSRKKE 112

Query: 61  MRKEGSRVVPVPLNKVFQLKVNGVAFRLIPESTQVKNALREMEKAGFSDDAFAGVPVFQ- 119
            R  G++VVP+ L++V+ LKV G+AFR +P+  Q+KNAL E+  A  +  +F GVPVFQ 
Sbjct: 113 FRG-GAKVVPITLDQVYMLKVEGIAFRFLPDPVQIKNAL-ELRAA--NRGSFDGVPVFQS 168

Query: 120 -------------------EDLEKSLRRASSDQNKLNPAFRMGDIQVAVFEEIIKGMKES 160
                              EDLE  L + S     +  +     I V  FE+++K M+ S
Sbjct: 169 DLLVVKKKNKRYCPVYFSKEDLEYELSKVSRSSKGVGVS---QHIMVGSFEDVLKKMELS 225

Query: 161 -TTSAWNDVVFIPPG 174
             +S W D+VFIPPG
Sbjct: 226 EKSSGWEDLVFIPPG 240


>sp|Q9SZB2|TIC22_ARATH Protein TIC 22, chloroplastic OS=Arabidopsis thaliana GN=TIC22 PE=1
           SV=1
          Length = 268

 Score =  127 bits (320), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 77/190 (40%), Positives = 115/190 (60%), Gaps = 20/190 (10%)

Query: 1   MSAEAIEERLAGVPVYALSNCNEEFVLVSGAKTGKSLGLMCFKKEDAEALLHQMKSMDPA 60
           +S   + + LAG  V+ +SN N EFVL+S    GKS+GL+CF++EDAEA L Q +     
Sbjct: 70  LSPSLVAKALAGTSVFTVSNTNNEFVLISDPTGGKSIGLLCFRQEDAEAFLAQARLRRRE 129

Query: 61  MRKEGSRVVPVPLNKVFQLKVNGVAFRLIPESTQVKNALREMEKAGFSDDAFAGVPVFQE 120
           + K  ++VVP+ L++V+ LKV G++FR +P+  Q+KNAL E++ +G + + F GVPVFQ 
Sbjct: 130 L-KTNAKVVPITLDQVYLLKVEGISFRFLPDPIQIKNAL-ELKSSG-NKNGFDGVPVFQS 186

Query: 121 DL---EKSLRRAS----------SDQNKLNPAFRMGDIQVAV--FEEIIKGMKES-TTSA 164
           +L    K  RR             + +K   A R GD Q+ V   E++++ M+ S   S 
Sbjct: 187 ELLVVRKKNRRYCPVYFSKEDIERELSKYTRASR-GDQQIMVGSLEDVLRKMEMSEKNSG 245

Query: 165 WNDVVFIPPG 174
           W DV+FIPPG
Sbjct: 246 WEDVIFIPPG 255


>sp|O78449|YCF80_GUITH Uncharacterized protein ycf80 OS=Guillardia theta GN=ycf80 PE=3
           SV=1
          Length = 282

 Score = 42.0 bits (97), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/154 (24%), Positives = 60/154 (38%), Gaps = 36/154 (23%)

Query: 1   MSAEAIEERLAGVPVYALSNCNEEFVLVSGAKTGK------------------------S 36
           ++ + I E L  VPVY + N   E V+ S                              +
Sbjct: 95  LAQKHILESLKTVPVYTIVNDLNEIVIASPRDLTNFNSFNWVKRFYNDWFIWEKDEGNVN 154

Query: 37  LGLMCFKKEDAEALLHQMKSMDP-AMRKEGSRVVPVPLNKVFQLKVNG---VAFRLIPES 92
           +GL    +EDAE  LHQ+   DP  +   G  V  + L+  ++L       + F+LI + 
Sbjct: 155 IGLFFMNREDAELYLHQICLKDPRGVENVGVNVKTISLDTFYKLNRLSPPRLQFKLIADL 214

Query: 93  TQVKNALRE--------MEKAGFSDDAFAGVPVF 118
            ++KN L            K  +  + F G+P+F
Sbjct: 215 QEIKNILNVNSNSRVSFHPKQKYDKNWFKGIPIF 248


>sp|A8FY36|DXR_SHESH 1-deoxy-D-xylulose 5-phosphate reductoisomerase OS=Shewanella
           sediminis (strain HAW-EB3) GN=dxr PE=3 SV=1
          Length = 395

 Score = 34.3 bits (77), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 38/85 (44%), Gaps = 10/85 (11%)

Query: 61  MRKEGSRVVPVPLNKVFQLKVNGVAFRLIPESTQVKNALREMEKAGFSDDAFAGV--PVF 118
           MR+ G+RV+PV          +   ++ +PES Q      E+EKAG S     G   P  
Sbjct: 138 MRRSGARVLPVD-------SEHNAIYQALPESLQSNIGYCELEKAGVSHILLTGSGGPFL 190

Query: 119 QEDLEKSLRRASSDQNKLNPAFRMG 143
             +L  SL   +  Q   +P + MG
Sbjct: 191 TSEL-ASLASMTPAQACKHPNWSMG 214


>sp|P32109|YIBJ_ECOLI Putative uncharacterized protein YibJ OS=Escherichia coli (strain
           K12) GN=yibJ PE=5 SV=3
          Length = 233

 Score = 33.9 bits (76), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 26/108 (24%), Positives = 42/108 (38%), Gaps = 1/108 (0%)

Query: 23  EEFVLVSGAKTGKSLGLMCFKKEDAEALLHQMKSMDPAMRKEGSRVVPVPLNKVFQLKVN 82
           ++ + + G +  +  GL   +    + LL +  + DP   K G      PLN V  +   
Sbjct: 91  QQLIRLPGQQYDEESGLYYNRHRYYDPLLGRYITQDPIGLKGGWNFYQYPLNPVINVDPQ 150

Query: 83  G-VAFRLIPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRA 129
           G V   L PES  + +   E+   G       G P   E   +S+  A
Sbjct: 151 GLVDINLYPESDLIHSVADEINIPGVFTIGGHGTPTSIESATRSIMTA 198


>sp|Q50388|RPOB_MYCSM DNA-directed RNA polymerase subunit beta OS=Mycobacterium smegmatis
           GN=rpoB PE=3 SV=2
          Length = 1169

 Score = 32.7 bits (73), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 32/139 (23%), Positives = 56/139 (40%), Gaps = 26/139 (18%)

Query: 28  VSGAKTGKSLGLMCFKKEDAEAL--------------LHQMKSMDPAMRKEGSR-----V 68
           V   ++GK +G+  F +ED + L                ++   D   R+ G++     +
Sbjct: 829 VPHGESGKVIGIRVFSREDDDELPVGVNELVRVYVAQKRKISDGDKLARRHGNKGVIGKI 888

Query: 69  VPV------PLNKVFQLKVNGVAFRLIPESTQV-KNALREMEKAGFSDDAFAGVPVFQED 121
           +PV      P      + +N    R++    Q+ +  L    KAG++ D  AGVP +   
Sbjct: 889 LPVEDMPFLPDGTPVDIILNTHGCRVVLNIGQILETHLGWGAKAGWNIDVLAGVPDWASK 948

Query: 122 LEKSLRRASSDQNKLNPAF 140
           L + L  A +D     P F
Sbjct: 949 LPEELYSAPADSTVATPVF 967


>sp|Q6F8G6|CYSG_ACIAD Siroheme synthase OS=Acinetobacter sp. (strain ADP1) GN=cysG PE=3
           SV=1
          Length = 457

 Score = 32.3 bits (72), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 29/127 (22%), Positives = 52/127 (40%), Gaps = 23/127 (18%)

Query: 6   IEERLAGVPVYALSNCNEEFVLVSGAKTGKS-----LGLMCFKKEDAEALLHQMKSMDPA 60
           I +RL   P+  L   + E V V  A++  S     +  +  K   A   + ++K  DP 
Sbjct: 246 IYDRLVSAPILELCRRDAEKVYVGKARSNHSVPQEGINALLVKYAQAGKRVCRLKGGDPF 305

Query: 61  MRKEGSRVVPVPLNKVFQLKVNGVAFRLIPESTQVKNALREMEKAGFSDDAFAGVPVFQE 120
           +   G         ++ +L   G+ F+++P  T           A     A+AG+P+   
Sbjct: 306 IFGRGG-------EEIQELFAAGIPFQVVPGIT-----------AASGCSAYAGIPLTHR 347

Query: 121 DLEKSLR 127
           D  +S+R
Sbjct: 348 DYAQSVR 354


>sp|Q8RBK0|CARB_THETN Carbamoyl-phosphate synthase large chain OS=Thermoanaerobacter
           tengcongensis (strain DSM 15242 / JCM 11007 / NBRC
           100824 / MB4) GN=carB PE=3 SV=1
          Length = 1072

 Score = 32.0 bits (71), Expect = 2.2,   Method: Composition-based stats.
 Identities = 19/67 (28%), Positives = 36/67 (53%), Gaps = 4/67 (5%)

Query: 89  IPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRASSDQNKLNPAFRMGDIQVA 148
           I E+   K  L++ ++ GFSD   A +   +E+  ++LR+    +  + P+++M D   A
Sbjct: 472 IEENELSKEILKKAKRMGFSDREIATIKGIKEEDVRTLRK----KYGIYPSYKMVDTCAA 527

Query: 149 VFEEIIK 155
            FE I +
Sbjct: 528 EFESITQ 534


>sp|B0KBW4|CARB_THEP3 Carbamoyl-phosphate synthase large chain OS=Thermoanaerobacter
           pseudethanolicus (strain ATCC 33223 / 39E) GN=carB PE=3
           SV=1
          Length = 1072

 Score = 31.6 bits (70), Expect = 3.1,   Method: Composition-based stats.
 Identities = 18/69 (26%), Positives = 37/69 (53%), Gaps = 4/69 (5%)

Query: 87  RLIPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRASSDQNKLNPAFRMGDIQ 146
           R + +S   +  L++ ++ GFSD   A +   +E+  + LR+    Q+ + P+++M D  
Sbjct: 470 REVEKSELSEGILKKAKRMGFSDREIATIKGIKEEDVRMLRK----QHGIYPSYKMVDTC 525

Query: 147 VAVFEEIIK 155
            A FE + +
Sbjct: 526 AAEFESVTQ 534


>sp|B0V3H4|CS054_DANRE UPF0692 protein C19orf54 homolog OS=Danio rerio GN=si:dkey-233h2.2
           PE=2 SV=1
          Length = 307

 Score = 31.2 bits (69), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 26/82 (31%), Positives = 36/82 (43%), Gaps = 5/82 (6%)

Query: 16  YALSNCNEEFVLVSGAKTGKSLGLMCFKKEDAEALLHQMKSMDPAMRKEGSRVVPVPLNK 75
           + + +  E FVL   AK GKSL    ++ E       Q+K MDP    +G+R V  P   
Sbjct: 231 WPIDDIEEVFVL---AKQGKSLRYQLWEFESVAQSNKQLKEMDPQRASDGTRYVLPPGG- 286

Query: 76  VFQLKVNGVAFRLIPESTQVKN 97
             Q  + G    L   + Q KN
Sbjct: 287 -VQDGLAGQVLLLHTNTEQTKN 307


>sp|B0K4D7|CARB_THEPX Carbamoyl-phosphate synthase large chain OS=Thermoanaerobacter sp.
           (strain X514) GN=carB PE=3 SV=1
          Length = 1072

 Score = 30.8 bits (68), Expect = 4.7,   Method: Composition-based stats.
 Identities = 18/69 (26%), Positives = 37/69 (53%), Gaps = 4/69 (5%)

Query: 87  RLIPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRASSDQNKLNPAFRMGDIQ 146
           R I ++   +  L++ ++ GFSD   A +   +E+  ++LR+  S    + P+++M D  
Sbjct: 470 REIEKNELSEEILKKAKRMGFSDREIATIKGIEEEDVRALRKEYS----IYPSYKMVDTC 525

Query: 147 VAVFEEIIK 155
            A FE + +
Sbjct: 526 AAEFESVTQ 534


>sp|P53009|SCY1_YEAST Protein kinase-like protein SCY1 OS=Saccharomyces cerevisiae
           (strain ATCC 204508 / S288c) GN=SCY1 PE=1 SV=1
          Length = 804

 Score = 30.8 bits (68), Expect = 5.5,   Method: Composition-based stats.
 Identities = 38/159 (23%), Positives = 60/159 (37%), Gaps = 32/159 (20%)

Query: 43  KKEDAEALLH---QMKSMDPAMRKEGS---RVVPV--------PLNKVFQLKVNGVAFRL 88
           KKED E +     ++ +M P  +K GS     +P         PLN    L   G   R+
Sbjct: 642 KKEDPETVAAKNIEVAAMQPVKKKTGSSYGETLPQSKSILNSKPLNPKNALATRGFPTRI 701

Query: 89  I---PESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRASSDQNKLNPAFRMGDI 145
           +   P++   +   + M K G S+DA             S  +   + N+       G I
Sbjct: 702 LNSPPQTPSSRTGSKVMTKGG-SNDA-------------SSTKVEEEFNEFQSFSSTGSI 747

Query: 146 -QVAVFEEIIKGMKESTTSAWNDVVFIPPGFDVSTNPNQ 183
            Q +   ++      S T        +PPGF +S  PN+
Sbjct: 748 RQTSAPSDVWMNSTPSPTPTSASSTNLPPGFSISLQPNK 786


>sp|Q71YI1|CARB_LISMF Carbamoyl-phosphate synthase large chain OS=Listeria monocytogenes
           serotype 4b (strain F2365) GN=carB PE=3 SV=1
          Length = 1070

 Score = 30.8 bits (68), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 40/93 (43%), Gaps = 11/93 (11%)

Query: 89  IPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRASSDQNKLNPAFRMGDIQVA 148
           I E+ Q +  L E ++AGFSD   A      E     LR+A   QN L P ++M D   A
Sbjct: 472 IKENPQNQEILAEAKRAGFSDAFLATCWNVDEQAIYDLRKA---QN-LFPVYKMVDTCAA 527

Query: 149 VFEEIIKGM-------KESTTSAWNDVVFIPPG 174
            FE              EST SA   V+ +  G
Sbjct: 528 EFESTTPYFYSTYEEENESTRSAKESVIVLGSG 560


>sp|C1KWD4|CARB_LISMC Carbamoyl-phosphate synthase large chain OS=Listeria monocytogenes
           serotype 4b (strain CLIP80459) GN=carB PE=3 SV=1
          Length = 1070

 Score = 30.8 bits (68), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 40/93 (43%), Gaps = 11/93 (11%)

Query: 89  IPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRASSDQNKLNPAFRMGDIQVA 148
           I E+ Q +  L E ++AGFSD   A      E     LR+A   QN L P ++M D   A
Sbjct: 472 IKENPQNQEILAEAKRAGFSDAFLATCWNVDEQAIYDLRKA---QN-LFPVYKMVDTCAA 527

Query: 149 VFEEIIKGM-------KESTTSAWNDVVFIPPG 174
            FE              EST SA   V+ +  G
Sbjct: 528 EFESTTPYFYSTYEEENESTRSAKESVIVLGSG 560


>sp|Q8Y665|CARB_LISMO Carbamoyl-phosphate synthase large chain OS=Listeria monocytogenes
           serovar 1/2a (strain ATCC BAA-679 / EGD-e) GN=carB PE=3
           SV=1
          Length = 1070

 Score = 30.0 bits (66), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 40/93 (43%), Gaps = 11/93 (11%)

Query: 89  IPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRASSDQNKLNPAFRMGDIQVA 148
           I E+ Q +  L E ++AGFSD   A      E     LR+A   QN L P ++M D   A
Sbjct: 472 IKENPQNQEILAEAKQAGFSDAFLATCWNVDEQAIYDLRKA---QN-LFPVYKMVDTCAA 527

Query: 149 VFEEIIKGM-------KESTTSAWNDVVFIPPG 174
            FE              EST SA   V+ +  G
Sbjct: 528 EFESTTPYFYSTYEEENESTRSAKESVIVLGSG 560


>sp|A1AX51|HEM3_RUTMC Porphobilinogen deaminase OS=Ruthia magnifica subsp. Calyptogena
           magnifica GN=hemC PE=3 SV=1
          Length = 304

 Score = 30.0 bits (66), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 25/117 (21%), Positives = 49/117 (41%), Gaps = 12/117 (10%)

Query: 64  EGSRVVPVPLNKVFQLKVNGVAFRLIPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLE 123
           + ++V    + ++ QLK      +++     V   L++++K GF     A   + +   E
Sbjct: 120 QNAKVGTCSIRRIVQLKAIRPDLKILDLRGNVNTRLKKLDKGGFDGIILACAGLIRLGFE 179

Query: 124 KSLRRASSDQNKLNPAFRMGDIQVAVFEEIIKGMKESTTSAWNDVVFIPPGFDVSTN 180
             +++  SDQ  L PA   G + + +        +E+ T   +    I P  DV T 
Sbjct: 180 SRIKQQISDQQSL-PAVGQGVVGIEI--------RENDTEILD---LIKPLIDVETT 224


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.316    0.131    0.366 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 66,091,245
Number of Sequences: 539616
Number of extensions: 2537398
Number of successful extensions: 7026
Number of sequences better than 100.0: 28
Number of HSP's better than 100.0 without gapping: 3
Number of HSP's successfully gapped in prelim test: 25
Number of HSP's that attempted gapping in prelim test: 7009
Number of HSP's gapped (non-prelim): 30
length of query: 186
length of database: 191,569,459
effective HSP length: 111
effective length of query: 75
effective length of database: 131,672,083
effective search space: 9875406225
effective search space used: 9875406225
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 57 (26.6 bits)