BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 026468
(238 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|D9J034|WHY2_SOLTU Single-stranded DNA-bindig protein WHY2, mitochondrial OS=Solanum
tuberosum GN=WHY2 PE=1 SV=1
Length = 238
Score = 249 bits (637), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 130/209 (62%), Positives = 158/209 (75%), Gaps = 6/209 (2%)
Query: 1 MMKLSRSLLSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAK-GSLGGRI 59
M+K+SR LL R+QL K L GE V+ + H + + AG ST +V A G GR+
Sbjct: 1 MLKVSR-LLHPRNQLLHKKLPGEC--VKGSIWQHAINTFAGFSTVRQNVVADAGKREGRV 57
Query: 60 FAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFA 119
FAPY V+KGKAA S +P LPTF +LDSG +K+ R+GVI+LTF P++GERKYDW K+Q FA
Sbjct: 58 FAPYSVFKGKAALSAEPRLPTFNRLDSGGVKLNRRGVIMLTFWPSVGERKYDWEKRQLFA 117
Query: 120 LSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISLNVANNIL 177
LS TEVGSL++MG RDSSEFFHDP+MLSSNAGQ+RKSLSIK NAD G+FISL+V NN L
Sbjct: 118 LSATEVGSLISMGTRDSSEFFHDPSMLSSNAGQVRKSLSIKPNADGSGYFISLSVVNNNL 177
Query: 178 KTNERFVVPVSTAEFAVMKTACSVCTNEI 206
KTN+RF VPV+TAEFAVM+TA S I
Sbjct: 178 KTNDRFTVPVTTAEFAVMRTAFSFALPHI 206
>sp|Q8VYF7|WHY2_ARATH Single-stranded DNA-binding protein WHY2, mitochondrial
OS=Arabidopsis thaliana GN=WHY2 PE=2 SV=1
Length = 238
Score = 234 bits (596), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 127/214 (59%), Positives = 160/214 (74%), Gaps = 7/214 (3%)
Query: 1 MMKLSRSLLSSRSQLSEKLLAGEANYVRDGLKSHVLISQAGMSTTGHDVSAKGSLGGRIF 60
MMK +RSLLS RS + EA+ +R G S S G G D +AK S GR+F
Sbjct: 1 MMKQARSLLS-RSLCDQSKSLFEASTLR-GFASWSNSSTPGRGFPGKD-AAKPS--GRLF 55
Query: 61 APYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFAL 120
APY ++KGKAA SV+PVLP+F ++DSG+L++ R+G +++TF PAIGERKYDW KKQ FAL
Sbjct: 56 APYSIFKGKAALSVEPVLPSFTEIDSGNLRIDRRGSLMMTFMPAIGERKYDWEKKQKFAL 115
Query: 121 SPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISLNVANNILK 178
SPTEVGSL++MG +DSSEFFHDP+M SSNAGQ+RKSLS+K +AD G+FISL+V N+ILK
Sbjct: 116 SPTEVGSLISMGSKDSSEFFHDPSMKSSNAGQVRKSLSVKPHADGSGYFISLSVNNSILK 175
Query: 179 TNERFVVPVSTAEFAVMKTACSVCTNEIASRSQL 212
TN+ FVVPV+ AEFAVMKTA S I ++L
Sbjct: 176 TNDYFVVPVTKAEFAVMKTAFSFALPHIMGWNRL 209
>sp|B2LXS7|WHY1_MAIZE Single-stranded DNA-bindig protein WHY1, chloroplastic OS=Zea mays
GN=WHY1 PE=2 SV=1
Length = 266
Score = 152 bits (383), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 72/152 (47%), Positives = 99/152 (65%), Gaps = 2/152 (1%)
Query: 46 GHDVSAKGSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAI 105
G+ G+ GR+F Y +YKGKAA S DP P F+ LDSG KV ++G +LL FAPA+
Sbjct: 73 GYGRPPNGAQDGRVFTSYSIYKGKAALSFDPRPPLFVPLDSGAYKVAKEGFVLLQFAPAV 132
Query: 106 GERKYDWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD- 164
R+YDW +KQ F+LS E+G+LLT+GP DS EFFHDP S G++RK L I+ D
Sbjct: 133 ATRQYDWTRKQVFSLSVWEIGTLLTLGPTDSCEFFHDPFKGRSEEGKVRKVLKIEPTPDG 192
Query: 165 -GFFISLNVANNILKTNERFVVPVSTAEFAVM 195
G F +L+V N ++ +E +P++ EFAV+
Sbjct: 193 NGRFFNLSVQNRLINVDESIYIPITKGEFAVI 224
>sp|Q9M9S3|WHY1_ARATH Single-stranded DNA-binding protein WHY1, chloroplastic
OS=Arabidopsis thaliana GN=WHY1 PE=1 SV=1
Length = 263
Score = 145 bits (367), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 68/150 (45%), Positives = 102/150 (68%), Gaps = 2/150 (1%)
Query: 55 LGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAK 114
L R + + +YKGKAA +VDP P F+ LDSG K+ + G +LL FAP+ G R+YDW+K
Sbjct: 77 LPARFYVGHSIYKGKAALTVDPRAPEFVALDSGAFKLSKDGFLLLQFAPSAGVRQYDWSK 136
Query: 115 KQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISLNV 172
KQ F+LS TE+G+L+++GPR+S EFFHDP S+ G++RK L ++ D G F +L+V
Sbjct: 137 KQVFSLSVTEIGTLVSLGPRESCEFFHDPFKGKSDEGKVRKVLKVEPLPDGSGHFFNLSV 196
Query: 173 ANNILKTNERFVVPVSTAEFAVMKTACSVC 202
N ++ +E +P++ AEFAV+ +A +
Sbjct: 197 QNKLVNVDESIYIPITRAEFAVLISAFNFV 226
>sp|Q9LL85|WHY1_SOLTU Single-stranded DNA-bindig protein WHY1, chloroplastic OS=Solanum
tuberosum GN=WHY1 PE=1 SV=1
Length = 274
Score = 145 bits (366), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 68/148 (45%), Positives = 103/148 (69%), Gaps = 2/148 (1%)
Query: 53 GSLGGRIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDW 112
G+ ++F Y +YKGKAA +V+P P F LDSG K+ R+G+++L FAPA G R+YDW
Sbjct: 86 GASTPKVFVGYSIYKGKAALTVEPRSPEFSPLDSGAFKLSREGMVMLQFAPAAGVRQYDW 145
Query: 113 AKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANAD--GFFISL 170
++KQ F+LS TE+GS++++G +DS EFFHDP S+ G++RK L ++ D G F +L
Sbjct: 146 SRKQVFSLSVTEIGSIISLGAKDSCEFFHDPNKGRSDEGRVRKVLKVEPLPDGSGHFFNL 205
Query: 171 NVANNILKTNERFVVPVSTAEFAVMKTA 198
+V N ++ +E +PV+ AEFAV+ +A
Sbjct: 206 SVQNKLINLDENIYIPVTKAEFAVLVSA 233
>sp|Q66GR6|WHY3_ARATH Single-stranded DNA-binding protein WHY3, chloroplastic
OS=Arabidopsis thaliana GN=WHY3 PE=1 SV=1
Length = 268
Score = 138 bits (347), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 65/148 (43%), Positives = 102/148 (68%), Gaps = 3/148 (2%)
Query: 58 RIFAPYYVYKGKAAFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQH 117
R + + +YKGKAA +++P P F+ L+SG K+ ++G +LL FAPA G R+YDW++KQ
Sbjct: 84 RFYVGHSIYKGKAALTIEPRAPEFVALESGAFKLTKEGFLLLQFAPAAGVRQYDWSRKQV 143
Query: 118 FALSPTEVGSLLTMGPRDSSEFFHDPAM-LSSNAGQMRKSLSIKANAD--GFFISLNVAN 174
F+LS TE+G+L+++GPR+S EFFHDP S+ G++RK L ++ D G F +L+V N
Sbjct: 144 FSLSVTEIGNLVSLGPRESCEFFHDPFKGKGSDEGKVRKVLKVEPLPDGSGRFFNLSVQN 203
Query: 175 NILKTNERFVVPVSTAEFAVMKTACSVC 202
+L +E +P++ AEFAV+ +A +
Sbjct: 204 KLLNVDESVYIPITKAEFAVLISAFNFV 231
>sp|Q5L5F1|AROB_CHLAB 3-dehydroquinate synthase OS=Chlamydophila abortus (strain S26/3)
GN=aroB PE=3 SV=1
Length = 379
Score = 31.6 bits (70), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 34/136 (25%), Positives = 60/136 (44%), Gaps = 19/136 (13%)
Query: 71 AFSVDPVLPTFMKLDSGDLKVKRKGVILLTFAPAIGERKYDWAKKQHFALSPTEVGSLLT 130
A SV ++ T + L+SG +K PA+ E+ + +K+ H PT + L
Sbjct: 261 AVSVGMMIETKISLESGIMK-----------NPALLEQLHHLSKRFHL---PTTLEELRD 306
Query: 131 MGPRDSSEFFHDPAMLSSNAGQMRKSLSIKANADGFFISLNVANNILKTNERF-VVPVST 189
+ P+ F+DP + G +K+LS KA + + A N + VP
Sbjct: 307 LIPQHLHHEFYDPENIIHALGYDKKNLSKKAIR---MVMMEDAGKATSCNGIYCTVPKMA 363
Query: 190 AEFAVMKTAC-SVCTN 204
+ ++K+ C ++C N
Sbjct: 364 ILYEILKSECYAMCNN 379
>sp|P44817|PEPD_HAEIN Cytosol non-specific dipeptidase OS=Haemophilus influenzae (strain
ATCC 51907 / DSM 11121 / KW20 / Rd) GN=pepD PE=3 SV=2
Length = 484
Score = 31.2 bits (69), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 13/39 (33%), Positives = 22/39 (56%)
Query: 111 DWAKKQHFALSPTEVGSLLTMGPRDSSEFFHDPAMLSSN 149
+WAK++HF + EVG++L P H P +L ++
Sbjct: 38 NWAKEKHFFVERDEVGNVLIRKPATKGMENHTPVVLQAH 76
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.130 0.364
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 77,011,270
Number of Sequences: 539616
Number of extensions: 2953032
Number of successful extensions: 6300
Number of sequences better than 100.0: 9
Number of HSP's better than 100.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 6286
Number of HSP's gapped (non-prelim): 10
length of query: 238
length of database: 191,569,459
effective HSP length: 114
effective length of query: 124
effective length of database: 130,053,235
effective search space: 16126601140
effective search space used: 16126601140
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (27.3 bits)