BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 036571
(251 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P27061|PPA1_SOLLC Acid phosphatase 1 OS=Solanum lycopersicum GN=APS1 PE=2 SV=1
Length = 255
Score = 252 bits (643), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 115/215 (53%), Positives = 154/215 (71%)
Query: 36 DLSCLSWRLAVETNNIIGWKTTPEKCEGYLGHYMLGQQYREDSEAVAYEAIVYAQSLELA 95
+L C +WR VETNN+ WKT PE+C Y+ YM+G Y+ + + V+ EA YA+S++L
Sbjct: 40 ELKCTTWRFVVETNNLSPWKTIPEECADYVKEYMVGPGYKMEIDRVSDEAGEYAKSVDLG 99
Query: 96 GDGREIWIFDIDETSLSNLPYYAKHGFGVEPFNSTLFNEWVNKGEAPSLPESLKLYKKLL 155
DGR++WIFD+DET LSNLPYY+ H +G+E F+ F++WV G AP+L SLKLY+++L
Sbjct: 100 DDGRDVWIFDVDETLLSNLPYYSDHRYGLEVFDDVEFDKWVENGTAPALGSSLKLYQEVL 159
Query: 156 SLGIKIVFLTGRPEDQRSVTENNLKNVGFYTWENLILKGSSYSGETAVVYKSSERKRLEK 215
LG K+ LTGR E RSVT NL N GF+ W LIL+GS G+TA YKS R + +
Sbjct: 160 KLGFKVFLLTGRSERHRSVTVENLMNAGFHDWHKLILRGSDDHGKTATTYKSERRNAMVE 219
Query: 216 KGYRIIGNIGDQWSDLLGTNAGNRTFKLPDPMYYI 250
+G+RI+GN GDQWSDLLG++ R+FKLP+PMYYI
Sbjct: 220 EGFRIVGNSGDQWSDLLGSSMSYRSFKLPNPMYYI 254
>sp|P15490|VSPA_SOYBN Stem 28 kDa glycoprotein OS=Glycine max GN=VSPA PE=2 SV=1
Length = 254
Score = 251 bits (642), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 126/226 (55%), Positives = 167/226 (73%), Gaps = 7/226 (3%)
Query: 28 GARTNDFPDLSCLSWRLAVETNNIIGWKTTPEKCEGYLGHYMLGQQYREDSEAVAYEAIV 87
GART P++ C SWRLAVE +NI G++T PE+C Y+ G+QYR DS+ V +A
Sbjct: 34 GART---PEVKCASWRLAVEAHNIFGFETIPEECVEATKEYIHGEQYRSDSKTVNQQAYF 90
Query: 88 YAQSLELAGDGREIWIFDIDETSLSNLPYYAKHGFGVEPFNSTLFNEWVNKGEAPSLPES 147
YA+ LE+ ++ ++F ID T LSN+PYY KHG+GVE FNSTL++EWVNKG AP+LPE+
Sbjct: 91 YARDLEV--HPKDTFVFSIDGTVLSNIPYYKKHGYGVEKFNSTLYDEWVNKGNAPALPET 148
Query: 148 LKLYKKLLSLGIKIVFLTGRPEDQRSVTENNLKNVGFYTWENLILKG-SSYSGETAVVYK 206
LK Y KL+SLG KI+FL+GR D+++VTE NLK G++TWE LILK S AV YK
Sbjct: 149 LKNYNKLVSLGFKIIFLSGRTLDKQAVTEANLKKAGYHTWEKLILKDPQDPSTPNAVSYK 208
Query: 207 SSERKRLEKKGYRIIGNIGDQWSDLLGTNAG-NRTFKLPDPMYYIS 251
++ R++L ++GY I+G IGDQWSDLLG + G +RTFKLP+P+YYI
Sbjct: 209 TAAREKLIRQGYNIVGIIGDQWSDLLGGHRGESRTFKLPNPLYYIQ 254
>sp|P10742|S25K_SOYBN Stem 31 kDa glycoprotein (Fragment) OS=Glycine max GN=VSP25 PE=2
SV=2
Length = 291
Score = 245 bits (625), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 123/221 (55%), Positives = 163/221 (73%), Gaps = 7/221 (3%)
Query: 28 GARTNDFPDLSCLSWRLAVETNNIIGWKTTPEKCEGYLGHYMLGQQYREDSEAVAYEAIV 87
GART P++ C SWRLAVE +NI G++T PE+C Y+ G+QYR DS+ V +A
Sbjct: 31 GART---PEVKCASWRLAVEAHNIFGFETIPEECVEATKEYIHGEQYRSDSKTVNQQAYF 87
Query: 88 YAQSLELAGDGREIWIFDIDETSLSNLPYYAKHGFGVEPFNSTLFNEWVNKGEAPSLPES 147
YA+ LE+ ++ ++F ID T LSN+PYY KHG+GVE FNSTL++EWVNKG AP+LPE+
Sbjct: 88 YARDLEV--HPKDTFVFSIDGTVLSNIPYYKKHGYGVEKFNSTLYDEWVNKGNAPALPET 145
Query: 148 LKLYKKLLSLGIKIVFLTGRPEDQRSVTENNLKNVGFYTWENLILKG-SSYSGETAVVYK 206
LK Y KL+SLG KI+FL+GR D+++VTE NLK G++TWE LILK S AV YK
Sbjct: 146 LKNYNKLVSLGFKIIFLSGRTLDKQAVTEANLKKAGYHTWEKLILKDPQDPSTPNAVSYK 205
Query: 207 SSERKRLEKKGYRIIGNIGDQWSDLLGTNAG-NRTFKLPDP 246
++ R++L ++GY I+G IGDQWSDLLG + G +RTFKLP+P
Sbjct: 206 TAAREKLIRQGYNIVGIIGDQWSDLLGGHRGESRTFKLPNP 246
>sp|P10743|VSPB_SOYBN Stem 31 kDa glycoprotein OS=Glycine max GN=VSPB PE=2 SV=1
Length = 254
Score = 244 bits (623), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 123/216 (56%), Positives = 158/216 (73%), Gaps = 3/216 (1%)
Query: 36 DLSCLSWRLAVETNNIIGWKTTPEKCEGYLGHYMLGQQYREDSEAVAYEAIVYAQSLELA 95
++ C S+RLAVE +NI +KT PE+C Y+ G+Q+R DS+ V +A YA E+
Sbjct: 40 EVKCASFRLAVEAHNIRAFKTIPEECVEPTKDYINGEQFRSDSKTVNQQAFFYASEREVH 99
Query: 96 GDGREIWIFDIDETSLSNLPYYAKHGFGVEPFNSTLFNEWVNKGEAPSLPESLKLYKKLL 155
+ +I+IF ID T LSN+PYY KHG+GVE FN TL++EWVNKG+AP+LPE+LK Y KLL
Sbjct: 100 HN--DIFIFGIDNTVLSNIPYYEKHGYGVEEFNETLYDEWVNKGDAPALPETLKNYNKLL 157
Query: 156 SLGIKIVFLTGRPEDQRSVTENNLKNVGFYTWENLILKGSSYSGETAVVYKSSERKRLEK 215
SLG KIVFL+GR D+ +VTE NLK GF+TWE LILK A+ YKS+ R+ L +
Sbjct: 158 SLGFKIVFLSGRYLDKMAVTEANLKKAGFHTWEQLILKDPHLITPNALSYKSAMRENLLR 217
Query: 216 KGYRIIGNIGDQWSDLLGTNAG-NRTFKLPDPMYYI 250
+GYRI+G IGDQWSDLLG + G +RTFKLP+PMYYI
Sbjct: 218 QGYRIVGIIGDQWSDLLGDHRGESRTFKLPNPMYYI 253
>sp|O49195|VSP1_ARATH Vegetative storage protein 1 OS=Arabidopsis thaliana GN=VSP1 PE=2
SV=2
Length = 270
Score = 196 bits (497), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 108/230 (46%), Positives = 146/230 (63%), Gaps = 7/230 (3%)
Query: 23 LRPKSGARTNDFPDLSCLSWRLAVETNNIIGWKTTPEKCEGYLGHYML-GQQYREDSEAV 81
L K G N +P+ C SW L VET+NII + T P C+ Y+ Y++ +QY+ DS+ V
Sbjct: 44 LLEKEGLSIN-YPN--CRSWHLGVETSNIINFDTVPANCKAYVEDYLITSKQYQYDSKTV 100
Query: 82 AYEAIVYAQSLELAGDGREIWIFDIDETSLSNLPYYAKHGFGVEPFNSTLFNEWVNKGEA 141
EA YA+ L L D +WIFD+D+T LS++PYYAK+G+G E + W+ GE+
Sbjct: 101 NKEAYFYAKGLALKNDTVNVWIFDLDDTLLSSIPYYAKYGYGTENTAPGAYWSWLESGES 160
Query: 142 -PSLPESLKLYKKLLSLGIKIVFLTGRPEDQRSVTENNLKNVGFYTWENLILKGSSYSGE 200
P LPE+L LY+ LL LGI+ + ++ R + VT NLK VG W++LILK + S
Sbjct: 161 TPGLPETLHLYENLLELGIEPIIISDRWKKLSEVTVENLKAVGVTKWKHLILKPNG-SKL 219
Query: 201 TAVVYKSSERKRLEKKGYRIIGNIGDQWSDLLGTNAGNRTFKLPDPMYYI 250
T VVYKS R L KKGY I+GNIGDQW+DL+ G R FKLP+P+YY+
Sbjct: 220 TQVVYKSKVRNSLVKKGYNIVGNIGDQWADLVEDTPG-RVFKLPNPLYYV 268
>sp|O82122|VSP2_ARATH Vegetative storage protein 2 OS=Arabidopsis thaliana GN=VSP2 PE=1
SV=1
Length = 265
Score = 190 bits (483), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 99/215 (46%), Positives = 142/215 (66%), Gaps = 4/215 (1%)
Query: 38 SCLSWRLAVETNNIIGWKTTPEKCEGYLGHYML-GQQYREDSEAVAYEAIVYAQSLELAG 96
+C SW L VET+NII + T P C+ Y+ Y++ +QY+ DS+ V EA YA+ L L
Sbjct: 51 NCRSWHLGVETSNIIDFDTVPANCKDYVEDYLITSKQYQYDSKTVCKEAYFYAKGLALKN 110
Query: 97 DGREIWIFDIDETSLSNLPYYAKHGFGVEPFNSTLFNEWVNKGEA-PSLPESLKLYKKLL 155
D +WIFD+D+T LS++PYYAK+G+G E + + W+ G + P LPE+L LY+ ++
Sbjct: 111 DTVNVWIFDLDDTLLSSIPYYAKYGYGTEKTDPGAYWLWLGTGASTPGLPEALHLYQNII 170
Query: 156 SLGIKIVFLTGRPEDQRSVTENNLKNVGFYTWENLILKGSSYSGETAVVYKSSERKRLEK 215
LGI+ + L+ R + ++VT +NL+ G W++LILK + S VVYKS RK L K
Sbjct: 171 ELGIEPIILSDRWKLWKNVTLDNLEAAGVTYWKHLILKPNG-SNLRQVVYKSKVRKSLVK 229
Query: 216 KGYRIIGNIGDQWSDLLGTNAGNRTFKLPDPMYYI 250
KGY I+GNIGDQW+DL+ G R FKLP+P+YY+
Sbjct: 230 KGYNIVGNIGDQWADLVEDTPG-RVFKLPNPLYYV 263
>sp|O04195|Y2992_ARATH Uncharacterized protein At2g39920 OS=Arabidopsis thaliana
GN=At2g39920 PE=2 SV=2
Length = 283
Score = 63.5 bits (153), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 90/213 (42%), Gaps = 29/213 (13%)
Query: 39 CLSWRLAVETNNIIGWKTTPEKCEGYLGHYMLGQQYREDSEAVAYEAIVYAQSLELAGDG 98
C L + N++ P C H + Y + A+ Y Q+++ D
Sbjct: 88 CKILSLHSQLNSLDEESELPLLCRDVALHRIKQGIYLRELNFTIQMALTYFQTIKPMNDN 147
Query: 99 REIWIFDIDETSL-SNLPYYAKHGFGVEPFNSTLFNEWVNKGEAPSLPESLKLYKKLLSL 157
++ + DID+T+L YY K+ + S L L LY KL S
Sbjct: 148 CDVVVIDIDDTNLLEQDSYYMKYIEEAKHQKSILI---------------LALYSKLRSQ 192
Query: 158 GIKIVFLTGRPEDQRSVTENNLKNVGFYTWENLILKGSSYSGETAVVYKSSERKRLEKKG 217
G +V L+ RPE +R+ T LK+ G+ W +LI+ + +++ LE +G
Sbjct: 193 GYSMVLLSRRPETERNATIEQLKSRGYSDWSHLIMSRED----------TRQKEELE-RG 241
Query: 218 YRIIGNIGDQWSDLLGT-N-AGNRTFKLPDPMY 248
+R+IG IG+ L G N R FKLP Y
Sbjct: 242 HRVIGVIGNHMDVLRGQWNWQSKRLFKLPSLTY 274
>sp|P26093|HEL_HAEIN Lipoprotein E OS=Haemophilus influenzae (strain ATCC 51907 / DSM
11121 / KW20 / Rd) GN=hel PE=1 SV=2
Length = 274
Score = 52.0 bits (123), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 70/143 (48%), Gaps = 11/143 (7%)
Query: 105 DIDETSLSNLPYYAKHGFGVEPFNSTLFNEWVNKGEAPSLPESLKLYKKLLSLGIKIVFL 164
D+DET L N PY +PF+ + WV+ ++ ++P +++ + S K+ ++
Sbjct: 84 DLDETMLDNSPYAGWQVQNNKPFDGKDWTRWVDARQSRAVPGAVEFNNYVNSHNGKVFYV 143
Query: 165 TGRPED-QRSVTENNLKNVGFYTWENLILKGSSYSGETAVVYKSSERKRLEKKGYRIIGN 223
T R + ++S T +++K +GF E S++ + K++ +EK+GY I+
Sbjct: 144 TNRKDSTEKSGTIDDMKRLGFNGVEE-----SAFYLKKDKSAKAARFAEIEKQGYEIVLY 198
Query: 224 IGDQWSDLLGT-----NAGNRTF 241
+GD D T NA R F
Sbjct: 199 VGDNLDDFGNTVYGKLNADRRAF 221
>sp|B0JID6|AROB_MICAN 3-dehydroquinate synthase OS=Microcystis aeruginosa (strain
NIES-843) GN=aroB PE=3 SV=1
Length = 366
Score = 31.6 bits (70), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 38/80 (47%), Gaps = 9/80 (11%)
Query: 155 LSLGIKIVFLTGRPED---QRSVTENNLKNVGFYTWENLILKGSSYSGETAV--VYKSSE 209
LSLG K V L PE + N+L+ GF + +LI G +Y A+ VY S+
Sbjct: 33 LSLGKK-VLLVSNPEIFDYYGQIAVNSLEKAGFAVFTHLIPAGENYKTLDAIAQVYDSAL 91
Query: 210 RKRLEKKGYRII---GNIGD 226
RLE+ + G IGD
Sbjct: 92 AHRLERSSTMVALGGGVIGD 111
>sp|A6W230|APHA_MARMS Class B acid phosphatase OS=Marinomonas sp. (strain MWYL1) GN=aphA
PE=3 SV=1
Length = 236
Score = 31.6 bits (70), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 57/127 (44%), Gaps = 19/127 (14%)
Query: 93 ELAGDGREIWIFDIDETSLSNLPYYAKHGFGVEPFNSTLFNEWVNKG---------EAPS 143
EL G FDID+T+L + P + + G + F+ ++ N+ +A S
Sbjct: 58 ELQGKAPFAVGFDIDDTTLFSTPVFYR---GQQEFSPNGYSYLTNQDFWDKANCGWDAFS 114
Query: 144 LPESLKLYKKLLSL----GIKIVFLTGRP-EDQRSVTENNLKNVGFYTWENLILKGSSYS 198
+P+++ K+L+++ G I F+TGR D TE K ++I GSS +
Sbjct: 115 MPKNIA--KELIAMHQERGDSIYFITGRTGSDCNFTTEYLKKTFDIKDMHDVIFAGSSRT 172
Query: 199 GETAVVY 205
T Y
Sbjct: 173 EYTKTKY 179
>sp|Q6NTN5|MTMRD_XENLA Myotubularin-related protein 13 OS=Xenopus laevis GN=sbf2 PE=2 SV=1
Length = 1873
Score = 31.6 bits (70), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 25/102 (24%), Positives = 45/102 (44%), Gaps = 4/102 (3%)
Query: 132 FNEWVNKGEAPSLPESLKLYKKLLSLGIKIVFLTGRPEDQRSVTENNLKNVGFYTWENLI 191
F+E V+ + L ++ LS+ F G+ + + KN F T I
Sbjct: 982 FDEEVSAEMVDLFKKQLMKFRYPLSIFSAFAFAAGQTAPPIILPKQKEKNTSFRTISKTI 1041
Query: 192 LKGSSYSGETAVVYKSSERKR----LEKKGYRIIGNIGDQWS 229
+KG+ +G+ + + + +K+ LE++G R GN D S
Sbjct: 1042 VKGAKRAGKMTIGRQYAAKKKTGTILEERGSRSGGNEDDDIS 1083
>sp|P43547|AAD6_YEAST Putative aryl-alcohol dehydrogenase AAD6 OS=Saccharomyces
cerevisiae (strain ATCC 204508 / S288c) GN=AAD6 PE=3
SV=1
Length = 212
Score = 31.2 bits (69), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 223 NIGDQWSDLLGTNAGNRTFKLPDPMY 248
+IGD WS++LG+ + R F+L D Y
Sbjct: 38 SIGDAWSEILGSMSKERAFELLDAFY 63
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.316 0.135 0.408
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 103,378,728
Number of Sequences: 539616
Number of extensions: 4499545
Number of successful extensions: 10168
Number of sequences better than 100.0: 16
Number of HSP's better than 100.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 10140
Number of HSP's gapped (non-prelim): 16
length of query: 251
length of database: 191,569,459
effective HSP length: 115
effective length of query: 136
effective length of database: 129,513,619
effective search space: 17613852184
effective search space used: 17613852184
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (27.7 bits)