BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 047178
(287 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q2V3H0|PP322_ARATH Pentatricopeptide repeat-containing protein At4g18975,
chloroplastic OS=Arabidopsis thaliana GN=At4g18975 PE=2
SV=2
Length = 287
Score = 141 bits (356), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 73/164 (44%), Positives = 100/164 (60%), Gaps = 1/164 (0%)
Query: 79 LVNTLLDLKNSKEDVYGTLDAWVAWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWMLSK 138
LV L L N KE VYG L+ WVAWE FP+ + KAL L K QWHRV+Q+ KWMLSK
Sbjct: 101 LVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAAKALQILRKRSQWHRVIQLAKWMLSK 160
Query: 139 GQGSTMGTCGQLIRALDMDHRAEEAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNMLER 198
GQG+TMGT L+ A DMD RA+EA W + S+P +L MIA+Y +++ ++
Sbjct: 161 GQGATMGTYDILLLAFDMDERADEAESLWNMILHTHTRSIPRRLFARMIALYAHHDLHDK 220
Query: 199 LIKLFKGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVLEKY 242
+I++F +E P E S +RVA A+ L E ++ +L +Y
Sbjct: 221 VIEVFADMEELKVSPDEDS-ARRVARAFRELNQEENRKLILRRY 263
>sp|Q8LG95|PP332_ARATH Pentatricopeptide repeat-containing protein At4g21190
OS=Arabidopsis thaliana GN=EMB1417 PE=2 SV=1
Length = 307
Score = 133 bits (335), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 66/164 (40%), Positives = 103/164 (62%), Gaps = 1/164 (0%)
Query: 79 LVNTLLDLKNSKEDVYGTLDAWVAWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWMLSK 138
++ + L N KE+VYG LD+++AWE FP+ +KKAL+ LE E++W +++QV KWMLSK
Sbjct: 61 MIACIKGLSNVKEEVYGALDSFIAWELEFPLVIVKKALVILEDEKEWKKIIQVTKWMLSK 120
Query: 139 GQGSTMGTCGQLIRALDMDHRAEEAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNMLER 198
GQG TMGT L+ AL D+R +EA + W K L P + MI+IYY+ +M ++
Sbjct: 121 GQGRTMGTYFSLLNALAEDNRLDEAEELWNKLFMEHLEGTPRKFFNKMISIYYKRDMHQK 180
Query: 199 LIKLFKGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVLEKY 242
L ++F +E K P +IV V + L + ++ E++++KY
Sbjct: 181 LFEVFADMEELGVK-PNVAIVSMVGKVFVKLEMKDKYEKLMKKY 223
>sp|Q8GYM2|PP393_ARATH Pentatricopeptide repeat-containing protein At5g18950
OS=Arabidopsis thaliana GN=At5g18950 PE=2 SV=2
Length = 483
Score = 37.7 bits (86), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 21/81 (25%), Positives = 44/81 (54%), Gaps = 5/81 (6%)
Query: 135 MLSKGQGSTMGTCGQLIRALDMDHRAEEAHKFWEK--RIGIDLHSVPWQLCKSMIAIYYR 192
ML G G TM +C +I+ +++EA + ++ G+ +++ + ++I + +
Sbjct: 379 MLRNGYGGTMLSCNTMIKGFCSHGKSDEAFEIFKNMSETGVTPNAITYN---ALIKGFCK 435
Query: 193 NNMLERLIKLFKGLEAFDRKP 213
N +E+ +KL+K L+A KP
Sbjct: 436 ENKVEKGLKLYKELKALGLKP 456
>sp|Q9SNB7|PP264_ARATH Pentatricopeptide repeat-containing protein At3g46610
OS=Arabidopsis thaliana GN=At3g46610 PE=2 SV=1
Length = 665
Score = 36.6 bits (83), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 22/89 (24%), Positives = 45/89 (50%), Gaps = 2/89 (2%)
Query: 115 ALLALEKEQQWHRVVQVIKWMLSKGQGSTMGTCGQLIRALDMDHRAEEAHKFWEK--RIG 172
L+A K + +Q+ K M+ G+ T+ + G L+ AL+ +EA + W ++G
Sbjct: 464 VLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSALEKGKLYDEAFRVWNHMIKVG 523
Query: 173 IDLHSVPWQLCKSMIAIYYRNNMLERLIK 201
I+ + + S++ + N+L+ L+K
Sbjct: 524 IEPNLYAYTTMASVLTGQQKFNLLDTLLK 552
>sp|Q0WVV0|PPR31_ARATH Pentatricopeptide repeat-containing protein At1g10910,
chloroplastic OS=Arabidopsis thaliana GN=At1g10910 PE=2
SV=1
Length = 664
Score = 34.3 bits (77), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/92 (22%), Positives = 45/92 (48%), Gaps = 7/92 (7%)
Query: 124 QWHRVVQVIKWMLSKGQ--GSTMGTCGQLIRALDMDHRAEEAHKFWEKRIGIDLHSVPWQ 181
+W ++Q+ +WM G+ ST +C + + A ++ E ++ I+++
Sbjct: 113 RWQDLIQLFEWMQQHGKISVSTYSSCIKFVGAKNVSKALEIYQSIPDESTKINVY----- 167
Query: 182 LCKSMIAIYYRNNMLERLIKLFKGLEAFDRKP 213
+C S+++ +N L+ IKLF ++ KP
Sbjct: 168 ICNSILSCLVKNGKLDSCIKLFDQMKRDGLKP 199
>sp|Q28C74|LPPRC_XENTR Leucine-rich PPR motif-containing protein, mitochondrial OS=Xenopus
tropicalis GN=lrpprc PE=2 SV=1
Length = 1391
Score = 34.3 bits (77), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 8/104 (7%)
Query: 146 TCGQLIRALDMDHRAEEAHKFWEK--RIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLF 203
+CG L+ L M R E AH+ WEK +G + +++ +Y +N +
Sbjct: 113 SCGSLLPELQMSERTEMAHRIWEKLQELGAVFDVSHYN---ALLKVYLQNEHKFSPTEYL 169
Query: 204 KGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVL--EKYKDL 245
+EA + + P + QR+ AY G +E ++L K KDL
Sbjct: 170 AKMEAANVQ-PNRVTYQRLIAAYCNEGDIEGASKILGFMKNKDL 212
>sp|Q9XIM8|PP155_ARATH Pentatricopeptide repeat-containing protein At2g15980
OS=Arabidopsis thaliana GN=At2g15980 PE=2 SV=1
Length = 498
Score = 33.9 bits (76), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/127 (23%), Positives = 54/127 (42%), Gaps = 12/127 (9%)
Query: 129 VQVIKWMLSKGQGSTMGTCGQLIRALDMDHRAEEAHKFWEKRIGIDLHSVP--------- 179
V V++ + S+G + + TC LI + A +K + + G+D SV
Sbjct: 182 VMVMRKLRSRGINAQISTCNALITEVSRRRGASNGYKMYREVFGLDDVSVDEAKKMIGKI 241
Query: 180 ---WQLCKSMIAIYYRNNMLERLIKLFKGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKE 236
SM+ +YR E + ++++ +E P + +AY GL+ E E
Sbjct: 242 KPNATTFNSMMVSFYREGETEMVERIWREMEEEVGCSPNVYSYNVLMEAYCARGLMSEAE 301
Query: 237 RVLEKYK 243
+V E+ K
Sbjct: 302 KVWEEMK 308
>sp|Q9FIB2|PP373_ARATH Putative pentatricopeptide repeat-containing protein At5g09950
OS=Arabidopsis thaliana GN=PCMP-H35 PE=3 SV=1
Length = 995
Score = 33.9 bits (76), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 30/62 (48%)
Query: 185 SMIAIYYRNNMLERLIKLFKGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVLEKYKD 244
SMI+ Y R+ E +KLF+ ++ + PP+ V A GLLEE + E D
Sbjct: 690 SMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFESMSD 749
Query: 245 LF 246
+
Sbjct: 750 SY 751
>sp|Q6PB66|LPPRC_MOUSE Leucine-rich PPR motif-containing protein, mitochondrial OS=Mus
musculus GN=Lrpprc PE=1 SV=2
Length = 1392
Score = 33.9 bits (76), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/104 (25%), Positives = 49/104 (47%), Gaps = 8/104 (7%)
Query: 146 TCGQLIRALDMDHRAEEAHKFWEK--RIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLF 203
+CG L+ L + R E AHK W+K ++G+ + +++ +Y +N
Sbjct: 128 SCGSLLPELSLAERTEFAHKIWDKLQQLGVVYDVSHYN---ALLKVYLQNEYKFSPTDFL 184
Query: 204 KGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVL--EKYKDL 245
+E + + P + QR+ AY +G +E ++L K KDL
Sbjct: 185 AKMEGANIQ-PNRVTYQRLIAAYCNVGDIEGASKILGFMKTKDL 227
>sp|Q5SGE0|LPPRC_RAT Leucine-rich PPR motif-containing protein, mitochondrial OS=Rattus
norvegicus GN=Lrpprc PE=1 SV=1
Length = 1392
Score = 32.3 bits (72), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 30/140 (21%), Positives = 56/140 (40%), Gaps = 32/140 (22%)
Query: 146 TCGQLIRALDMDHRAEEAHKFWEK--RIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLF 203
+CG L+ L + R E AHK W+K ++G + +++ +Y +N
Sbjct: 128 SCGSLLPELSLAERTEFAHKIWDKLQQLGTVYDVSHYN---ALLKVYLQNEYRFSPTDFL 184
Query: 204 KGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVLEKYKDLFTEKEKRSNKKSKSSSMK 263
+E + + P + QR+ AY +G +E ++L K
Sbjct: 185 AKMEGANIQ-PNRVTYQRLIAAYCSVGDIEGASKILGFMK-------------------- 223
Query: 264 GKKSGRTRDTPVSDGVTNAI 283
TRD P+++ V +A+
Sbjct: 224 ------TRDLPITEAVFSAL 237
>sp|Q84J46|PP262_ARATH Pentatricopeptide repeat-containing protein At3g29290
OS=Arabidopsis thaliana GN=EMB2076 PE=2 SV=1
Length = 540
Score = 32.0 bits (71), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 21/88 (23%), Positives = 39/88 (44%), Gaps = 5/88 (5%)
Query: 118 ALEKEQQWHRVVQVIKWMLSKGQGSTMGTCGQLIRALDMDHRAEEAHKFWE--KRIGIDL 175
A KE++W +++ + ML KG + C LI +L + K + K +G
Sbjct: 293 ACTKEEKWDLALKIFQSMLKKGMKPNLVACNTLINSLGKAGKVGLVFKVYSVLKSLGHKP 352
Query: 176 HSVPWQLCKSMIAIYYRNNMLERLIKLF 203
W +++ Y+ N E +++LF
Sbjct: 353 DEYTWN---ALLTALYKANRYEDVLQLF 377
>sp|O04647|PP399_ARATH Pentatricopeptide repeat-containing protein At5g27270
OS=Arabidopsis thaliana GN=EMB976 PE=2 SV=2
Length = 1038
Score = 32.0 bits (71), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 185 SMIAIYYRNNMLERLIKLFKGLEAFDRKPPEKSIVQRVADAYEVLGLLEE 234
++IA+Y R + L+ +L+ L A + K P KS+++ + DAY G LE+
Sbjct: 675 TLIAVYGRQHKLKEAKRLY--LAAGESKTPGKSVIRSMIDAYVRCGWLED 722
>sp|Q9LS88|PP250_ARATH Pentatricopeptide repeat-containing protein At3g23020
OS=Arabidopsis thaliana GN=At3g23020 PE=2 SV=1
Length = 842
Score = 31.2 bits (69), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 21/81 (25%), Positives = 34/81 (41%), Gaps = 16/81 (19%)
Query: 59 ASTRNFRIGENVPRKDKINFLVNTLLDLKNSKEDVYGTLDAWVAWEQNFPVGSLKKALLA 118
ST+ G +P +L+ +S EDV L W N K+ +
Sbjct: 117 CSTKRLSYGGCIP----------AILEALDSIEDVEDALSPWAERLSN------KERTII 160
Query: 119 LEKEQQWHRVVQVIKWMLSKG 139
L+++ W R V++ +W SKG
Sbjct: 161 LKEQIHWERAVEIFEWFKSKG 181
>sp|Q9CEC5|META_LACLA Homoserine O-succinyltransferase OS=Lactococcus lactis subsp.
lactis (strain IL1403) GN=metA PE=3 SV=1
Length = 321
Score = 31.2 bits (69), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 37/172 (21%), Positives = 66/172 (38%), Gaps = 30/172 (17%)
Query: 50 VDQYPERNAASTRNFRIGENVPRKDKINFLVNTLLDLKNSKEDVYGTLDAWV-------- 101
V+ P + + R+ N P + + FL T D KN+K+ G LD++
Sbjct: 41 VNLMPRKLITERQILRLLSNTPLQINVEFLYMTSHDFKNTKQ---GHLDSFYKSFSEIKS 97
Query: 102 AWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWMLSKGQGSTMGTCGQLIRALDMDHRAE 161
+ V L E+ W ++++I W S S++ C AL
Sbjct: 98 QYYDGLIVTGAPVEQLNFEEVDYWSELLKIIDWSKSHVY-SSLHICWGAQAAL------- 149
Query: 162 EAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLFKGLEAFDRKP 213
R G+ ++P +LC Y++++ + LF+G + F P
Sbjct: 150 ------YARYGVTKENLPQKLCG-----IYKSSVEQPKNPLFRGFDDFFNYP 190
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.315 0.130 0.376
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 104,248,086
Number of Sequences: 539616
Number of extensions: 4288897
Number of successful extensions: 17627
Number of sequences better than 100.0: 49
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 33
Number of HSP's that attempted gapping in prelim test: 17551
Number of HSP's gapped (non-prelim): 103
length of query: 287
length of database: 191,569,459
effective HSP length: 116
effective length of query: 171
effective length of database: 128,974,003
effective search space: 22054554513
effective search space used: 22054554513
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (27.7 bits)