BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 029886
(186 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|F4J469|TI22L_ARATH Protein TIC 22-like, chloroplastic OS=Arabidopsis thaliana
GN=TIC22L PE=3 SV=1
Length = 313
Score = 275 bits (702), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 140/203 (68%), Positives = 159/203 (78%), Gaps = 20/203 (9%)
Query: 1 MSAEAIEERLAGVPVYALSNCNEEFVLVSGAKTGKSLGLMCFKKEDAEALLHQMKSMDPA 60
+SA+AIEERLAGVPVYALSN NEEFVLVSG +GKSLGL+ K+EDAE LL +MKSMDP
Sbjct: 110 LSADAIEERLAGVPVYALSNSNEEFVLVSGTSSGKSLGLLFCKEEDAETLLKEMKSMDPR 169
Query: 61 MRKEGSRVVPVPLNKVFQLKVNGVAFRLIPESTQVKNALREMEKAGFSDDAFAGVPVFQ- 119
MRKEGS+VV + L+KVFQLKVNGVAFRLIPESTQVKNAL+E + AG DD F GVPVFQ
Sbjct: 170 MRKEGSKVVALALSKVFQLKVNGVAFRLIPESTQVKNALKERKTAGIDDDDFHGVPVFQS 229
Query: 120 -------------------EDLEKSLRRASSDQNKLNPAFRMGDIQVAVFEEIIKGMKES 160
EDLEKSL RASS QN+LNPA + GDIQVAVFE+I+KGM+ES
Sbjct: 230 KSLILRSENMSYRPVFFRKEDLEKSLIRASSQQNRLNPALKPGDIQVAVFEDIVKGMRES 289
Query: 161 TTSAWNDVVFIPPGFDVSTNPNQ 183
TTS W+D+VFIPPGF+VST Q
Sbjct: 290 TTSNWDDIVFIPPGFEVSTEQTQ 312
>sp|Q9ZST9|TIC22_PEA Protein TIC 22, chloroplastic OS=Pisum sativum GN=TIC22 PE=1 SV=1
Length = 252
Score = 133 bits (334), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 78/195 (40%), Positives = 114/195 (58%), Gaps = 28/195 (14%)
Query: 1 MSAEAIEERLAGVPVYALSNCNEEFVLVSGAKTGKSLGLMCFKKEDAEALLHQMKSMDPA 60
++ + + LAG VY +SN + EFVL+S A+ KS+GL+CF++EDAEA L Q++S
Sbjct: 53 LTPNHVAKSLAGTSVYTVSNSDNEFVLMSDAEGAKSIGLLCFRQEDAEAFLAQVRSRKKE 112
Query: 61 MRKEGSRVVPVPLNKVFQLKVNGVAFRLIPESTQVKNALREMEKAGFSDDAFAGVPVFQ- 119
R G++VVP+ L++V+ LKV G+AFR +P+ Q+KNAL E+ A + +F GVPVFQ
Sbjct: 113 FRG-GAKVVPITLDQVYMLKVEGIAFRFLPDPVQIKNAL-ELRAA--NRGSFDGVPVFQS 168
Query: 120 -------------------EDLEKSLRRASSDQNKLNPAFRMGDIQVAVFEEIIKGMKES 160
EDLE L + S + + I V FE+++K M+ S
Sbjct: 169 DLLVVKKKNKRYCPVYFSKEDLEYELSKVSRSSKGVGVS---QHIMVGSFEDVLKKMELS 225
Query: 161 -TTSAWNDVVFIPPG 174
+S W D+VFIPPG
Sbjct: 226 EKSSGWEDLVFIPPG 240
>sp|Q9SZB2|TIC22_ARATH Protein TIC 22, chloroplastic OS=Arabidopsis thaliana GN=TIC22 PE=1
SV=1
Length = 268
Score = 127 bits (320), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 77/190 (40%), Positives = 115/190 (60%), Gaps = 20/190 (10%)
Query: 1 MSAEAIEERLAGVPVYALSNCNEEFVLVSGAKTGKSLGLMCFKKEDAEALLHQMKSMDPA 60
+S + + LAG V+ +SN N EFVL+S GKS+GL+CF++EDAEA L Q +
Sbjct: 70 LSPSLVAKALAGTSVFTVSNTNNEFVLISDPTGGKSIGLLCFRQEDAEAFLAQARLRRRE 129
Query: 61 MRKEGSRVVPVPLNKVFQLKVNGVAFRLIPESTQVKNALREMEKAGFSDDAFAGVPVFQE 120
+ K ++VVP+ L++V+ LKV G++FR +P+ Q+KNAL E++ +G + + F GVPVFQ
Sbjct: 130 L-KTNAKVVPITLDQVYLLKVEGISFRFLPDPIQIKNAL-ELKSSG-NKNGFDGVPVFQS 186
Query: 121 DL---EKSLRRAS----------SDQNKLNPAFRMGDIQVAV--FEEIIKGMKES-TTSA 164
+L K RR + +K A R GD Q+ V E++++ M+ S S
Sbjct: 187 ELLVVRKKNRRYCPVYFSKEDIERELSKYTRASR-GDQQIMVGSLEDVLRKMEMSEKNSG 245
Query: 165 WNDVVFIPPG 174
W DV+FIPPG
Sbjct: 246 WEDVIFIPPG 255
>sp|O78449|YCF80_GUITH Uncharacterized protein ycf80 OS=Guillardia theta GN=ycf80 PE=3
SV=1
Length = 282
Score = 42.0 bits (97), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/154 (24%), Positives = 60/154 (38%), Gaps = 36/154 (23%)
Query: 1 MSAEAIEERLAGVPVYALSNCNEEFVLVSGAKTGK------------------------S 36
++ + I E L VPVY + N E V+ S +
Sbjct: 95 LAQKHILESLKTVPVYTIVNDLNEIVIASPRDLTNFNSFNWVKRFYNDWFIWEKDEGNVN 154
Query: 37 LGLMCFKKEDAEALLHQMKSMDP-AMRKEGSRVVPVPLNKVFQLKVNG---VAFRLIPES 92
+GL +EDAE LHQ+ DP + G V + L+ ++L + F+LI +
Sbjct: 155 IGLFFMNREDAELYLHQICLKDPRGVENVGVNVKTISLDTFYKLNRLSPPRLQFKLIADL 214
Query: 93 TQVKNALRE--------MEKAGFSDDAFAGVPVF 118
++KN L K + + F G+P+F
Sbjct: 215 QEIKNILNVNSNSRVSFHPKQKYDKNWFKGIPIF 248
>sp|A8FY36|DXR_SHESH 1-deoxy-D-xylulose 5-phosphate reductoisomerase OS=Shewanella
sediminis (strain HAW-EB3) GN=dxr PE=3 SV=1
Length = 395
Score = 34.3 bits (77), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 38/85 (44%), Gaps = 10/85 (11%)
Query: 61 MRKEGSRVVPVPLNKVFQLKVNGVAFRLIPESTQVKNALREMEKAGFSDDAFAGV--PVF 118
MR+ G+RV+PV + ++ +PES Q E+EKAG S G P
Sbjct: 138 MRRSGARVLPVD-------SEHNAIYQALPESLQSNIGYCELEKAGVSHILLTGSGGPFL 190
Query: 119 QEDLEKSLRRASSDQNKLNPAFRMG 143
+L SL + Q +P + MG
Sbjct: 191 TSEL-ASLASMTPAQACKHPNWSMG 214
>sp|P32109|YIBJ_ECOLI Putative uncharacterized protein YibJ OS=Escherichia coli (strain
K12) GN=yibJ PE=5 SV=3
Length = 233
Score = 33.9 bits (76), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 26/108 (24%), Positives = 42/108 (38%), Gaps = 1/108 (0%)
Query: 23 EEFVLVSGAKTGKSLGLMCFKKEDAEALLHQMKSMDPAMRKEGSRVVPVPLNKVFQLKVN 82
++ + + G + + GL + + LL + + DP K G PLN V +
Sbjct: 91 QQLIRLPGQQYDEESGLYYNRHRYYDPLLGRYITQDPIGLKGGWNFYQYPLNPVINVDPQ 150
Query: 83 G-VAFRLIPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRA 129
G V L PES + + E+ G G P E +S+ A
Sbjct: 151 GLVDINLYPESDLIHSVADEINIPGVFTIGGHGTPTSIESATRSIMTA 198
>sp|Q50388|RPOB_MYCSM DNA-directed RNA polymerase subunit beta OS=Mycobacterium smegmatis
GN=rpoB PE=3 SV=2
Length = 1169
Score = 32.7 bits (73), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/139 (23%), Positives = 56/139 (40%), Gaps = 26/139 (18%)
Query: 28 VSGAKTGKSLGLMCFKKEDAEAL--------------LHQMKSMDPAMRKEGSR-----V 68
V ++GK +G+ F +ED + L ++ D R+ G++ +
Sbjct: 829 VPHGESGKVIGIRVFSREDDDELPVGVNELVRVYVAQKRKISDGDKLARRHGNKGVIGKI 888
Query: 69 VPV------PLNKVFQLKVNGVAFRLIPESTQV-KNALREMEKAGFSDDAFAGVPVFQED 121
+PV P + +N R++ Q+ + L KAG++ D AGVP +
Sbjct: 889 LPVEDMPFLPDGTPVDIILNTHGCRVVLNIGQILETHLGWGAKAGWNIDVLAGVPDWASK 948
Query: 122 LEKSLRRASSDQNKLNPAF 140
L + L A +D P F
Sbjct: 949 LPEELYSAPADSTVATPVF 967
>sp|Q6F8G6|CYSG_ACIAD Siroheme synthase OS=Acinetobacter sp. (strain ADP1) GN=cysG PE=3
SV=1
Length = 457
Score = 32.3 bits (72), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/127 (22%), Positives = 52/127 (40%), Gaps = 23/127 (18%)
Query: 6 IEERLAGVPVYALSNCNEEFVLVSGAKTGKS-----LGLMCFKKEDAEALLHQMKSMDPA 60
I +RL P+ L + E V V A++ S + + K A + ++K DP
Sbjct: 246 IYDRLVSAPILELCRRDAEKVYVGKARSNHSVPQEGINALLVKYAQAGKRVCRLKGGDPF 305
Query: 61 MRKEGSRVVPVPLNKVFQLKVNGVAFRLIPESTQVKNALREMEKAGFSDDAFAGVPVFQE 120
+ G ++ +L G+ F+++P T A A+AG+P+
Sbjct: 306 IFGRGG-------EEIQELFAAGIPFQVVPGIT-----------AASGCSAYAGIPLTHR 347
Query: 121 DLEKSLR 127
D +S+R
Sbjct: 348 DYAQSVR 354
>sp|Q8RBK0|CARB_THETN Carbamoyl-phosphate synthase large chain OS=Thermoanaerobacter
tengcongensis (strain DSM 15242 / JCM 11007 / NBRC
100824 / MB4) GN=carB PE=3 SV=1
Length = 1072
Score = 32.0 bits (71), Expect = 2.2, Method: Composition-based stats.
Identities = 19/67 (28%), Positives = 36/67 (53%), Gaps = 4/67 (5%)
Query: 89 IPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRASSDQNKLNPAFRMGDIQVA 148
I E+ K L++ ++ GFSD A + +E+ ++LR+ + + P+++M D A
Sbjct: 472 IEENELSKEILKKAKRMGFSDREIATIKGIKEEDVRTLRK----KYGIYPSYKMVDTCAA 527
Query: 149 VFEEIIK 155
FE I +
Sbjct: 528 EFESITQ 534
>sp|B0KBW4|CARB_THEP3 Carbamoyl-phosphate synthase large chain OS=Thermoanaerobacter
pseudethanolicus (strain ATCC 33223 / 39E) GN=carB PE=3
SV=1
Length = 1072
Score = 31.6 bits (70), Expect = 3.1, Method: Composition-based stats.
Identities = 18/69 (26%), Positives = 37/69 (53%), Gaps = 4/69 (5%)
Query: 87 RLIPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRASSDQNKLNPAFRMGDIQ 146
R + +S + L++ ++ GFSD A + +E+ + LR+ Q+ + P+++M D
Sbjct: 470 REVEKSELSEGILKKAKRMGFSDREIATIKGIKEEDVRMLRK----QHGIYPSYKMVDTC 525
Query: 147 VAVFEEIIK 155
A FE + +
Sbjct: 526 AAEFESVTQ 534
>sp|B0V3H4|CS054_DANRE UPF0692 protein C19orf54 homolog OS=Danio rerio GN=si:dkey-233h2.2
PE=2 SV=1
Length = 307
Score = 31.2 bits (69), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 36/82 (43%), Gaps = 5/82 (6%)
Query: 16 YALSNCNEEFVLVSGAKTGKSLGLMCFKKEDAEALLHQMKSMDPAMRKEGSRVVPVPLNK 75
+ + + E FVL AK GKSL ++ E Q+K MDP +G+R V P
Sbjct: 231 WPIDDIEEVFVL---AKQGKSLRYQLWEFESVAQSNKQLKEMDPQRASDGTRYVLPPGG- 286
Query: 76 VFQLKVNGVAFRLIPESTQVKN 97
Q + G L + Q KN
Sbjct: 287 -VQDGLAGQVLLLHTNTEQTKN 307
>sp|B0K4D7|CARB_THEPX Carbamoyl-phosphate synthase large chain OS=Thermoanaerobacter sp.
(strain X514) GN=carB PE=3 SV=1
Length = 1072
Score = 30.8 bits (68), Expect = 4.7, Method: Composition-based stats.
Identities = 18/69 (26%), Positives = 37/69 (53%), Gaps = 4/69 (5%)
Query: 87 RLIPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRASSDQNKLNPAFRMGDIQ 146
R I ++ + L++ ++ GFSD A + +E+ ++LR+ S + P+++M D
Sbjct: 470 REIEKNELSEEILKKAKRMGFSDREIATIKGIEEEDVRALRKEYS----IYPSYKMVDTC 525
Query: 147 VAVFEEIIK 155
A FE + +
Sbjct: 526 AAEFESVTQ 534
>sp|P53009|SCY1_YEAST Protein kinase-like protein SCY1 OS=Saccharomyces cerevisiae
(strain ATCC 204508 / S288c) GN=SCY1 PE=1 SV=1
Length = 804
Score = 30.8 bits (68), Expect = 5.5, Method: Composition-based stats.
Identities = 38/159 (23%), Positives = 60/159 (37%), Gaps = 32/159 (20%)
Query: 43 KKEDAEALLH---QMKSMDPAMRKEGS---RVVPV--------PLNKVFQLKVNGVAFRL 88
KKED E + ++ +M P +K GS +P PLN L G R+
Sbjct: 642 KKEDPETVAAKNIEVAAMQPVKKKTGSSYGETLPQSKSILNSKPLNPKNALATRGFPTRI 701
Query: 89 I---PESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRASSDQNKLNPAFRMGDI 145
+ P++ + + M K G S+DA S + + N+ G I
Sbjct: 702 LNSPPQTPSSRTGSKVMTKGG-SNDA-------------SSTKVEEEFNEFQSFSSTGSI 747
Query: 146 -QVAVFEEIIKGMKESTTSAWNDVVFIPPGFDVSTNPNQ 183
Q + ++ S T +PPGF +S PN+
Sbjct: 748 RQTSAPSDVWMNSTPSPTPTSASSTNLPPGFSISLQPNK 786
>sp|Q71YI1|CARB_LISMF Carbamoyl-phosphate synthase large chain OS=Listeria monocytogenes
serotype 4b (strain F2365) GN=carB PE=3 SV=1
Length = 1070
Score = 30.8 bits (68), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 40/93 (43%), Gaps = 11/93 (11%)
Query: 89 IPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRASSDQNKLNPAFRMGDIQVA 148
I E+ Q + L E ++AGFSD A E LR+A QN L P ++M D A
Sbjct: 472 IKENPQNQEILAEAKRAGFSDAFLATCWNVDEQAIYDLRKA---QN-LFPVYKMVDTCAA 527
Query: 149 VFEEIIKGM-------KESTTSAWNDVVFIPPG 174
FE EST SA V+ + G
Sbjct: 528 EFESTTPYFYSTYEEENESTRSAKESVIVLGSG 560
>sp|C1KWD4|CARB_LISMC Carbamoyl-phosphate synthase large chain OS=Listeria monocytogenes
serotype 4b (strain CLIP80459) GN=carB PE=3 SV=1
Length = 1070
Score = 30.8 bits (68), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 40/93 (43%), Gaps = 11/93 (11%)
Query: 89 IPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRASSDQNKLNPAFRMGDIQVA 148
I E+ Q + L E ++AGFSD A E LR+A QN L P ++M D A
Sbjct: 472 IKENPQNQEILAEAKRAGFSDAFLATCWNVDEQAIYDLRKA---QN-LFPVYKMVDTCAA 527
Query: 149 VFEEIIKGM-------KESTTSAWNDVVFIPPG 174
FE EST SA V+ + G
Sbjct: 528 EFESTTPYFYSTYEEENESTRSAKESVIVLGSG 560
>sp|Q8Y665|CARB_LISMO Carbamoyl-phosphate synthase large chain OS=Listeria monocytogenes
serovar 1/2a (strain ATCC BAA-679 / EGD-e) GN=carB PE=3
SV=1
Length = 1070
Score = 30.0 bits (66), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 40/93 (43%), Gaps = 11/93 (11%)
Query: 89 IPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLEKSLRRASSDQNKLNPAFRMGDIQVA 148
I E+ Q + L E ++AGFSD A E LR+A QN L P ++M D A
Sbjct: 472 IKENPQNQEILAEAKQAGFSDAFLATCWNVDEQAIYDLRKA---QN-LFPVYKMVDTCAA 527
Query: 149 VFEEIIKGM-------KESTTSAWNDVVFIPPG 174
FE EST SA V+ + G
Sbjct: 528 EFESTTPYFYSTYEEENESTRSAKESVIVLGSG 560
>sp|A1AX51|HEM3_RUTMC Porphobilinogen deaminase OS=Ruthia magnifica subsp. Calyptogena
magnifica GN=hemC PE=3 SV=1
Length = 304
Score = 30.0 bits (66), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 25/117 (21%), Positives = 49/117 (41%), Gaps = 12/117 (10%)
Query: 64 EGSRVVPVPLNKVFQLKVNGVAFRLIPESTQVKNALREMEKAGFSDDAFAGVPVFQEDLE 123
+ ++V + ++ QLK +++ V L++++K GF A + + E
Sbjct: 120 QNAKVGTCSIRRIVQLKAIRPDLKILDLRGNVNTRLKKLDKGGFDGIILACAGLIRLGFE 179
Query: 124 KSLRRASSDQNKLNPAFRMGDIQVAVFEEIIKGMKESTTSAWNDVVFIPPGFDVSTN 180
+++ SDQ L PA G + + + +E+ T + I P DV T
Sbjct: 180 SRIKQQISDQQSL-PAVGQGVVGIEI--------RENDTEILD---LIKPLIDVETT 224
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.316 0.131 0.366
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 66,091,245
Number of Sequences: 539616
Number of extensions: 2537398
Number of successful extensions: 7026
Number of sequences better than 100.0: 28
Number of HSP's better than 100.0 without gapping: 3
Number of HSP's successfully gapped in prelim test: 25
Number of HSP's that attempted gapping in prelim test: 7009
Number of HSP's gapped (non-prelim): 30
length of query: 186
length of database: 191,569,459
effective HSP length: 111
effective length of query: 75
effective length of database: 131,672,083
effective search space: 9875406225
effective search space used: 9875406225
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 57 (26.6 bits)