BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 047178
         (287 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q2V3H0|PP322_ARATH Pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic OS=Arabidopsis thaliana GN=At4g18975 PE=2
           SV=2
          Length = 287

 Score =  141 bits (356), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 73/164 (44%), Positives = 100/164 (60%), Gaps = 1/164 (0%)

Query: 79  LVNTLLDLKNSKEDVYGTLDAWVAWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWMLSK 138
           LV  L  L N KE VYG L+ WVAWE  FP+ +  KAL  L K  QWHRV+Q+ KWMLSK
Sbjct: 101 LVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAAKALQILRKRSQWHRVIQLAKWMLSK 160

Query: 139 GQGSTMGTCGQLIRALDMDHRAEEAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNMLER 198
           GQG+TMGT   L+ A DMD RA+EA   W   +     S+P +L   MIA+Y  +++ ++
Sbjct: 161 GQGATMGTYDILLLAFDMDERADEAESLWNMILHTHTRSIPRRLFARMIALYAHHDLHDK 220

Query: 199 LIKLFKGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVLEKY 242
           +I++F  +E     P E S  +RVA A+  L   E ++ +L +Y
Sbjct: 221 VIEVFADMEELKVSPDEDS-ARRVARAFRELNQEENRKLILRRY 263


>sp|Q8LG95|PP332_ARATH Pentatricopeptide repeat-containing protein At4g21190
           OS=Arabidopsis thaliana GN=EMB1417 PE=2 SV=1
          Length = 307

 Score =  133 bits (335), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 66/164 (40%), Positives = 103/164 (62%), Gaps = 1/164 (0%)

Query: 79  LVNTLLDLKNSKEDVYGTLDAWVAWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWMLSK 138
           ++  +  L N KE+VYG LD+++AWE  FP+  +KKAL+ LE E++W +++QV KWMLSK
Sbjct: 61  MIACIKGLSNVKEEVYGALDSFIAWELEFPLVIVKKALVILEDEKEWKKIIQVTKWMLSK 120

Query: 139 GQGSTMGTCGQLIRALDMDHRAEEAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNMLER 198
           GQG TMGT   L+ AL  D+R +EA + W K     L   P +    MI+IYY+ +M ++
Sbjct: 121 GQGRTMGTYFSLLNALAEDNRLDEAEELWNKLFMEHLEGTPRKFFNKMISIYYKRDMHQK 180

Query: 199 LIKLFKGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVLEKY 242
           L ++F  +E    K P  +IV  V   +  L + ++ E++++KY
Sbjct: 181 LFEVFADMEELGVK-PNVAIVSMVGKVFVKLEMKDKYEKLMKKY 223


>sp|Q8GYM2|PP393_ARATH Pentatricopeptide repeat-containing protein At5g18950
           OS=Arabidopsis thaliana GN=At5g18950 PE=2 SV=2
          Length = 483

 Score = 37.7 bits (86), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 21/81 (25%), Positives = 44/81 (54%), Gaps = 5/81 (6%)

Query: 135 MLSKGQGSTMGTCGQLIRALDMDHRAEEAHKFWEK--RIGIDLHSVPWQLCKSMIAIYYR 192
           ML  G G TM +C  +I+      +++EA + ++     G+  +++ +    ++I  + +
Sbjct: 379 MLRNGYGGTMLSCNTMIKGFCSHGKSDEAFEIFKNMSETGVTPNAITYN---ALIKGFCK 435

Query: 193 NNMLERLIKLFKGLEAFDRKP 213
            N +E+ +KL+K L+A   KP
Sbjct: 436 ENKVEKGLKLYKELKALGLKP 456


>sp|Q9SNB7|PP264_ARATH Pentatricopeptide repeat-containing protein At3g46610
           OS=Arabidopsis thaliana GN=At3g46610 PE=2 SV=1
          Length = 665

 Score = 36.6 bits (83), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 22/89 (24%), Positives = 45/89 (50%), Gaps = 2/89 (2%)

Query: 115 ALLALEKEQQWHRVVQVIKWMLSKGQGSTMGTCGQLIRALDMDHRAEEAHKFWEK--RIG 172
            L+A  K  +    +Q+ K M+  G+  T+ + G L+ AL+     +EA + W    ++G
Sbjct: 464 VLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSALEKGKLYDEAFRVWNHMIKVG 523

Query: 173 IDLHSVPWQLCKSMIAIYYRNNMLERLIK 201
           I+ +   +    S++    + N+L+ L+K
Sbjct: 524 IEPNLYAYTTMASVLTGQQKFNLLDTLLK 552


>sp|Q0WVV0|PPR31_ARATH Pentatricopeptide repeat-containing protein At1g10910,
           chloroplastic OS=Arabidopsis thaliana GN=At1g10910 PE=2
           SV=1
          Length = 664

 Score = 34.3 bits (77), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/92 (22%), Positives = 45/92 (48%), Gaps = 7/92 (7%)

Query: 124 QWHRVVQVIKWMLSKGQ--GSTMGTCGQLIRALDMDHRAEEAHKFWEKRIGIDLHSVPWQ 181
           +W  ++Q+ +WM   G+   ST  +C + + A ++    E      ++   I+++     
Sbjct: 113 RWQDLIQLFEWMQQHGKISVSTYSSCIKFVGAKNVSKALEIYQSIPDESTKINVY----- 167

Query: 182 LCKSMIAIYYRNNMLERLIKLFKGLEAFDRKP 213
           +C S+++   +N  L+  IKLF  ++    KP
Sbjct: 168 ICNSILSCLVKNGKLDSCIKLFDQMKRDGLKP 199


>sp|Q28C74|LPPRC_XENTR Leucine-rich PPR motif-containing protein, mitochondrial OS=Xenopus
           tropicalis GN=lrpprc PE=2 SV=1
          Length = 1391

 Score = 34.3 bits (77), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 8/104 (7%)

Query: 146 TCGQLIRALDMDHRAEEAHKFWEK--RIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLF 203
           +CG L+  L M  R E AH+ WEK   +G       +    +++ +Y +N       +  
Sbjct: 113 SCGSLLPELQMSERTEMAHRIWEKLQELGAVFDVSHYN---ALLKVYLQNEHKFSPTEYL 169

Query: 204 KGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVL--EKYKDL 245
             +EA + + P +   QR+  AY   G +E   ++L   K KDL
Sbjct: 170 AKMEAANVQ-PNRVTYQRLIAAYCNEGDIEGASKILGFMKNKDL 212


>sp|Q9XIM8|PP155_ARATH Pentatricopeptide repeat-containing protein At2g15980
           OS=Arabidopsis thaliana GN=At2g15980 PE=2 SV=1
          Length = 498

 Score = 33.9 bits (76), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 30/127 (23%), Positives = 54/127 (42%), Gaps = 12/127 (9%)

Query: 129 VQVIKWMLSKGQGSTMGTCGQLIRALDMDHRAEEAHKFWEKRIGIDLHSVP--------- 179
           V V++ + S+G  + + TC  LI  +     A   +K + +  G+D  SV          
Sbjct: 182 VMVMRKLRSRGINAQISTCNALITEVSRRRGASNGYKMYREVFGLDDVSVDEAKKMIGKI 241

Query: 180 ---WQLCKSMIAIYYRNNMLERLIKLFKGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKE 236
                   SM+  +YR    E + ++++ +E      P       + +AY   GL+ E E
Sbjct: 242 KPNATTFNSMMVSFYREGETEMVERIWREMEEEVGCSPNVYSYNVLMEAYCARGLMSEAE 301

Query: 237 RVLEKYK 243
           +V E+ K
Sbjct: 302 KVWEEMK 308


>sp|Q9FIB2|PP373_ARATH Putative pentatricopeptide repeat-containing protein At5g09950
           OS=Arabidopsis thaliana GN=PCMP-H35 PE=3 SV=1
          Length = 995

 Score = 33.9 bits (76), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 30/62 (48%)

Query: 185 SMIAIYYRNNMLERLIKLFKGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVLEKYKD 244
           SMI+ Y R+   E  +KLF+ ++   + PP+      V  A    GLLEE  +  E   D
Sbjct: 690 SMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFESMSD 749

Query: 245 LF 246
            +
Sbjct: 750 SY 751


>sp|Q6PB66|LPPRC_MOUSE Leucine-rich PPR motif-containing protein, mitochondrial OS=Mus
           musculus GN=Lrpprc PE=1 SV=2
          Length = 1392

 Score = 33.9 bits (76), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 49/104 (47%), Gaps = 8/104 (7%)

Query: 146 TCGQLIRALDMDHRAEEAHKFWEK--RIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLF 203
           +CG L+  L +  R E AHK W+K  ++G+      +    +++ +Y +N          
Sbjct: 128 SCGSLLPELSLAERTEFAHKIWDKLQQLGVVYDVSHYN---ALLKVYLQNEYKFSPTDFL 184

Query: 204 KGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVL--EKYKDL 245
             +E  + + P +   QR+  AY  +G +E   ++L   K KDL
Sbjct: 185 AKMEGANIQ-PNRVTYQRLIAAYCNVGDIEGASKILGFMKTKDL 227


>sp|Q5SGE0|LPPRC_RAT Leucine-rich PPR motif-containing protein, mitochondrial OS=Rattus
           norvegicus GN=Lrpprc PE=1 SV=1
          Length = 1392

 Score = 32.3 bits (72), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 30/140 (21%), Positives = 56/140 (40%), Gaps = 32/140 (22%)

Query: 146 TCGQLIRALDMDHRAEEAHKFWEK--RIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLF 203
           +CG L+  L +  R E AHK W+K  ++G       +    +++ +Y +N          
Sbjct: 128 SCGSLLPELSLAERTEFAHKIWDKLQQLGTVYDVSHYN---ALLKVYLQNEYRFSPTDFL 184

Query: 204 KGLEAFDRKPPEKSIVQRVADAYEVLGLLEEKERVLEKYKDLFTEKEKRSNKKSKSSSMK 263
             +E  + + P +   QR+  AY  +G +E   ++L   K                    
Sbjct: 185 AKMEGANIQ-PNRVTYQRLIAAYCSVGDIEGASKILGFMK-------------------- 223

Query: 264 GKKSGRTRDTPVSDGVTNAI 283
                 TRD P+++ V +A+
Sbjct: 224 ------TRDLPITEAVFSAL 237


>sp|Q84J46|PP262_ARATH Pentatricopeptide repeat-containing protein At3g29290
           OS=Arabidopsis thaliana GN=EMB2076 PE=2 SV=1
          Length = 540

 Score = 32.0 bits (71), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 21/88 (23%), Positives = 39/88 (44%), Gaps = 5/88 (5%)

Query: 118 ALEKEQQWHRVVQVIKWMLSKGQGSTMGTCGQLIRALDMDHRAEEAHKFWE--KRIGIDL 175
           A  KE++W   +++ + ML KG    +  C  LI +L    +     K +   K +G   
Sbjct: 293 ACTKEEKWDLALKIFQSMLKKGMKPNLVACNTLINSLGKAGKVGLVFKVYSVLKSLGHKP 352

Query: 176 HSVPWQLCKSMIAIYYRNNMLERLIKLF 203
               W    +++   Y+ N  E +++LF
Sbjct: 353 DEYTWN---ALLTALYKANRYEDVLQLF 377


>sp|O04647|PP399_ARATH Pentatricopeptide repeat-containing protein At5g27270
           OS=Arabidopsis thaliana GN=EMB976 PE=2 SV=2
          Length = 1038

 Score = 32.0 bits (71), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 31/50 (62%), Gaps = 2/50 (4%)

Query: 185 SMIAIYYRNNMLERLIKLFKGLEAFDRKPPEKSIVQRVADAYEVLGLLEE 234
           ++IA+Y R + L+   +L+  L A + K P KS+++ + DAY   G LE+
Sbjct: 675 TLIAVYGRQHKLKEAKRLY--LAAGESKTPGKSVIRSMIDAYVRCGWLED 722


>sp|Q9LS88|PP250_ARATH Pentatricopeptide repeat-containing protein At3g23020
           OS=Arabidopsis thaliana GN=At3g23020 PE=2 SV=1
          Length = 842

 Score = 31.2 bits (69), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 21/81 (25%), Positives = 34/81 (41%), Gaps = 16/81 (19%)

Query: 59  ASTRNFRIGENVPRKDKINFLVNTLLDLKNSKEDVYGTLDAWVAWEQNFPVGSLKKALLA 118
            ST+    G  +P           +L+  +S EDV   L  W     N      K+  + 
Sbjct: 117 CSTKRLSYGGCIP----------AILEALDSIEDVEDALSPWAERLSN------KERTII 160

Query: 119 LEKEQQWHRVVQVIKWMLSKG 139
           L+++  W R V++ +W  SKG
Sbjct: 161 LKEQIHWERAVEIFEWFKSKG 181


>sp|Q9CEC5|META_LACLA Homoserine O-succinyltransferase OS=Lactococcus lactis subsp.
           lactis (strain IL1403) GN=metA PE=3 SV=1
          Length = 321

 Score = 31.2 bits (69), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 37/172 (21%), Positives = 66/172 (38%), Gaps = 30/172 (17%)

Query: 50  VDQYPERNAASTRNFRIGENVPRKDKINFLVNTLLDLKNSKEDVYGTLDAWV-------- 101
           V+  P +     +  R+  N P +  + FL  T  D KN+K+   G LD++         
Sbjct: 41  VNLMPRKLITERQILRLLSNTPLQINVEFLYMTSHDFKNTKQ---GHLDSFYKSFSEIKS 97

Query: 102 AWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWMLSKGQGSTMGTCGQLIRALDMDHRAE 161
            +     V       L  E+   W  ++++I W  S    S++  C     AL       
Sbjct: 98  QYYDGLIVTGAPVEQLNFEEVDYWSELLKIIDWSKSHVY-SSLHICWGAQAAL------- 149

Query: 162 EAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLFKGLEAFDRKP 213
                   R G+   ++P +LC       Y++++ +    LF+G + F   P
Sbjct: 150 ------YARYGVTKENLPQKLCG-----IYKSSVEQPKNPLFRGFDDFFNYP 190


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.315    0.130    0.376 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 104,248,086
Number of Sequences: 539616
Number of extensions: 4288897
Number of successful extensions: 17627
Number of sequences better than 100.0: 49
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 33
Number of HSP's that attempted gapping in prelim test: 17551
Number of HSP's gapped (non-prelim): 103
length of query: 287
length of database: 191,569,459
effective HSP length: 116
effective length of query: 171
effective length of database: 128,974,003
effective search space: 22054554513
effective search space used: 22054554513
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (27.7 bits)