BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 026993
(229 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9STF9|PP266_ARATH Pentatricopeptide repeat-containing protein At3g46870
OS=Arabidopsis thaliana GN=At3g46870 PE=2 SV=1
Length = 257
Score = 65.1 bits (157), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 54/181 (29%), Positives = 95/181 (52%), Gaps = 11/181 (6%)
Query: 41 RGPLVKGR-ILSTEAIQAVQFLKRAHKQNPQNPTY--PSLSRLIKHDLLAALRELIRQGE 97
RGPL +G+ ++ EA+ + LKR + + + + + RL+K D+LA + EL RQ E
Sbjct: 63 RGPLWRGKKLIGKEALFVILGLKRLKEDDEKLDKFIKTHVFRLLKLDMLAVIGELERQEE 122
Query: 98 CAVAVHVFSTIQR-EYQQQDLGLLTDLINTLAKNGLTGEVDRLIGELEEID-GGDGRGLS 155
A+A+ +F IQ+ E+ Q D+ + DLI +LAK+ E L ++++ + D + +
Sbjct: 123 TALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAKSKRMDEAMALWEKMKKENLFPDSQTYT 182
Query: 156 RVVRAVVEAGSKESTVRIYGLMKRSGVGCSWKVDEYVGKVLSKGLRRFGEEELANEVERE 215
V+R + G + +Y M +S +E +VL KGL L N+V+++
Sbjct: 183 EVIRGFLRDGCPADAMNVYEDMLKSPDPP----EELPFRVLLKGL--LPHPLLRNKVKKD 236
Query: 216 F 216
F
Sbjct: 237 F 237
>sp|Q1PFH7|PPR89_ARATH Pentatricopeptide repeat-containing protein At1g62350
OS=Arabidopsis thaliana GN=At1g62350 PE=2 SV=1
Length = 196
Score = 49.3 bits (116), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 69/136 (50%), Gaps = 6/136 (4%)
Query: 50 LSTEAIQAVQFLKRAHKQNPQNPTY--PSLSRLIKHDLLAALRELIRQGECAVAVHVFST 107
+S E + A + LKR Q+ + + +SRL+K DL++ L E RQ + + + ++
Sbjct: 1 MSKEGLIAAKELKRLQTQSVRLDRFIGSHVSRLLKSDLVSVLAEFQRQNQVFLCMKLYEV 60
Query: 108 IQRE-YQQQDLGLLTDLINTLAKNGLTGEVDRLIGEL--EEIDGGDGRGLSRVVRAVVEA 164
++RE + + D+ D++ LA+N E ++ +L EE+ D +VR ++
Sbjct: 61 VRREIWYRPDMFFYRDMLMMLARNKKVDETKKVWEDLKKEEV-LFDQHTFGDLVRGFLDN 119
Query: 165 GSKESTVRIYGLMKRS 180
+R+YG M+ S
Sbjct: 120 ELPLEAMRLYGEMRES 135
>sp|Q9ZU27|PPR76_ARATH Pentatricopeptide repeat-containing protein At1g51965,
mitochondrial OS=Arabidopsis thaliana GN=At1g51965 PE=2
SV=1
Length = 650
Score = 34.3 bits (77), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 35/157 (22%), Positives = 67/157 (42%), Gaps = 29/157 (18%)
Query: 15 HHSPKHKPTIITTHHRLPIRCGPRSNRGPLVKGRILSTEAIQAVQFLKRAHKQN------ 68
H S H+ + P++ G R + +++ + + I+A++ L + H++
Sbjct: 387 HVSEAHR--LFCDMWSFPVK-GERDSYMSMLESLCGAGKTIEAIEMLSKIHEKGVVTDTM 443
Query: 69 PQNPTYPSLSRLIK----HDLLAALRE------------LI----RQGECAVAVHVFSTI 108
N + +L +L + HDL +++ LI R GE A+++F +
Sbjct: 444 MYNTVFSALGKLKQISHIHDLFEKMKKDGPSPDIFTYNILIASFGRVGEVDEAINIFEEL 503
Query: 109 QREYQQQDLGLLTDLINTLAKNGLTGEVDRLIGELEE 145
+R + D+ LIN L KNG E E++E
Sbjct: 504 ERSDCKPDIISYNSLINCLGKNGDVDEAHVRFKEMQE 540
>sp|Q5G1S8|PP241_ARATH Pentatricopeptide repeat-containing protein At3g18110,
chloroplastic OS=Arabidopsis thaliana GN=EMB1270 PE=2
SV=2
Length = 1440
Score = 33.9 bits (76), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 38/85 (44%), Gaps = 3/85 (3%)
Query: 101 AVHVFSTIQREYQQQDLGLLTDLINTLAKNGLTGEVDRLIGELEEIDGG--DGRGLSRVV 158
AV VF ++ Q DL +I+ + GL E +RL EL E+ G D + ++
Sbjct: 316 AVKVFEDMEAHRCQPDLWTYNAMISVYGRCGLAAEAERLFMEL-ELKGFFPDAVTYNSLL 374
Query: 159 RAVVEAGSKESTVRIYGLMKRSGVG 183
A + E +Y M++ G G
Sbjct: 375 YAFARERNTEKVKEVYQQMQKMGFG 399
>sp|A8M0Y9|OBG_SALAI GTPase obg OS=Salinispora arenicola (strain CNS-205) GN=obg PE=3
SV=1
Length = 481
Score = 33.9 bits (76), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 37/75 (49%), Gaps = 2/75 (2%)
Query: 91 ELIRQGE-CAVAVHVFSTIQREYQQQDLGLLTDLINTLAKNGLTGEVDRLIGELEEIDGG 149
E +R E CAV VHV T E + + + + L G + RL+ L +ID
Sbjct: 229 EFLRHVERCAVLVHVVDTATLETARDPVADIDAIEAELTAYGGLADRPRLVA-LNKIDVP 287
Query: 150 DGRGLSRVVRAVVEA 164
DGR L+ +VR +EA
Sbjct: 288 DGRDLAEIVRPDLEA 302
>sp|O64624|PP163_ARATH Pentatricopeptide repeat-containing protein At2g18940
OS=Arabidopsis thaliana GN=At2g18940 PE=2 SV=1
Length = 822
Score = 32.7 bits (73), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 15/53 (28%), Positives = 27/53 (50%)
Query: 93 IRQGECAVAVHVFSTIQREYQQQDLGLLTDLINTLAKNGLTGEVDRLIGELEE 145
+R+GEC A + T+++ + DL +I + GL E R++ E+ E
Sbjct: 677 VRRGECWKAEEILKTLEKSQLKPDLVSYNTVIKGFCRRGLMQEAVRMLSEMTE 729
>sp|Q3EDF8|PPR28_ARATH Pentatricopeptide repeat-containing protein At1g09900
OS=Arabidopsis thaliana GN=At1g09900 PE=2 SV=1
Length = 598
Score = 32.0 bits (71), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 34/150 (22%), Positives = 67/150 (44%), Gaps = 11/150 (7%)
Query: 35 CGPRS-NRGPLVKGRILSTEAIQAVQFLKRAHKQNPQNPTYPSLSRLIKHDLLAALRELI 93
C P S + PL+ G + +A+++L+R + YP + + + +L AL
Sbjct: 375 CQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVSRG----CYPDI--VTYNTMLTAL---C 425
Query: 94 RQGECAVAVHVFSTIQREYQQQDLGLLTDLINTLAKNGLTGEVDRLIGELEEID-GGDGR 152
+ G+ AV + + + + L +I+ LAK G TG+ +L+ E+ D D
Sbjct: 426 KDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKTGKAIKLLDEMRAKDLKPDTI 485
Query: 153 GLSRVVRAVVEAGSKESTVRIYGLMKRSGV 182
S +V + G + ++ + +R G+
Sbjct: 486 TYSSLVGGLSREGKVDEAIKFFHEFERMGI 515
>sp|Q9LS72|PP261_ARATH Pentatricopeptide repeat-containing protein At3g29230
OS=Arabidopsis thaliana GN=PCMP-E27 PE=2 SV=1
Length = 600
Score = 32.0 bits (71), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 35/66 (53%), Gaps = 5/66 (7%)
Query: 121 TDLINTLAKNGLTGEVDRLIGELEEIDGG---DGRGLSRVVRAVVEAGSKESTVRIYGLM 177
T +I A+ GL E DRL+ ++ + G D + ++ A E+G +RI+ ++
Sbjct: 284 TIIIAGYAEKGLLKEADRLVDQM--VASGLKFDAAAVISILAACTESGLLSLGMRIHSIL 341
Query: 178 KRSGVG 183
KRS +G
Sbjct: 342 KRSNLG 347
>sp|O04647|PP399_ARATH Pentatricopeptide repeat-containing protein At5g27270
OS=Arabidopsis thaliana GN=EMB976 PE=2 SV=2
Length = 1038
Score = 32.0 bits (71), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 39/80 (48%), Gaps = 1/80 (1%)
Query: 123 LINTLAKNGLTGEVDRLIGE-LEEIDGGDGRGLSRVVRAVVEAGSKESTVRIYGLMKRSG 181
L+N L G E + + LE+ D G + +++A++EAG + IY M SG
Sbjct: 745 LVNALTNRGKHREAEHISRTCLEKNIELDTVGYNTLIKAMLEAGKLQCASEIYERMHTSG 804
Query: 182 VGCSWKVDEYVGKVLSKGLR 201
V CS + + V +GL+
Sbjct: 805 VPCSIQTYNTMISVYGRGLQ 824
>sp|Q3MG20|LEPA_ANAVT Elongation factor 4 OS=Anabaena variabilis (strain ATCC 29413 / PCC
7937) GN=lepA PE=3 SV=1
Length = 603
Score = 31.2 bits (69), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 30/54 (55%), Gaps = 12/54 (22%)
Query: 122 DLINTLAKNGLTG-EVDRLIGELEEIDGGD-----------GRGLSRVVRAVVE 163
++I L K L G E DR+IGE+EEI G D G G+S ++ AVVE
Sbjct: 130 EIIPVLNKIDLPGAEPDRVIGEIEEIIGLDCSGAILASAKEGIGISEILEAVVE 183
>sp|Q9RSY3|Y1986_DEIRA DegV domain-containing protein DR_1986 OS=Deinococcus radiodurans
(strain ATCC 13939 / DSM 20539 / JCM 16871 / LMG 4051 /
NBRC 15346 / NCIMB 9279 / R1 / VKM B-1422) GN=DR_1986
PE=4 SV=1
Length = 281
Score = 31.2 bits (69), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 2/56 (3%)
Query: 91 ELIRQGECAVAVHVFSTIQREYQQQDLGLLTDLINTLAKNGLTGEVDRLIGELEEI 146
EL+R G+ + T++R Y Q DL D ++ L NG G L+G L I
Sbjct: 131 ELVRAGQSVP--QIVQTLERVYPQADLRFTVDTLDFLRLNGRIGGASALLGGLLNI 184
>sp|A4XAG1|OBG_SALTO GTPase obg OS=Salinispora tropica (strain ATCC BAA-916 / DSM 44818
/ CNB-440) GN=obg PE=3 SV=1
Length = 481
Score = 30.8 bits (68), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 2/75 (2%)
Query: 91 ELIRQGE-CAVAVHVFSTIQREYQQQDLGLLTDLINTLAKNGLTGEVDRLIGELEEIDGG 149
E +R E CAV +HV T E ++ + + + L G + RL+ L ++D
Sbjct: 229 EFLRHIERCAVLLHVVDTAALETERDPVADIDAIEAELVAYGGLVDRPRLVA-LNKVDVP 287
Query: 150 DGRGLSRVVRAVVEA 164
DGR L+ +VR +EA
Sbjct: 288 DGRDLAEIVRPDLEA 302
>sp|Q7XJN6|PP197_ARATH Pentatricopeptide repeat-containing protein At2g40720
OS=Arabidopsis thaliana GN=PCMP-E26 PE=3 SV=1
Length = 860
Score = 30.8 bits (68), Expect = 9.2, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 151 GRGLSRVVRAVVEAGSKESTVRIYGLMKRSGV----GCSW-KVDEYVGKVLSKGLRRFGE 205
G +++ +EAG K ++ GLMK G+ GCSW +V + S G +
Sbjct: 780 GSTYVQLINLYMEAGLKNEAAKLLGLMKEKGLHKQPGCSWIEVSDRTNVFFSGGSSSPMK 839
Query: 206 EELANEVER 214
E+ N + R
Sbjct: 840 AEIFNVLNR 848
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.136 0.398
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 86,233,253
Number of Sequences: 539616
Number of extensions: 3579401
Number of successful extensions: 9869
Number of sequences better than 100.0: 32
Number of HSP's better than 100.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 27
Number of HSP's that attempted gapping in prelim test: 9846
Number of HSP's gapped (non-prelim): 42
length of query: 229
length of database: 191,569,459
effective HSP length: 113
effective length of query: 116
effective length of database: 130,592,851
effective search space: 15148770716
effective search space used: 15148770716
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (27.3 bits)