BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 044563
(290 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O04195|Y2992_ARATH Uncharacterized protein At2g39920 OS=Arabidopsis thaliana
GN=At2g39920 PE=2 SV=2
Length = 283
Score = 204 bits (519), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 127/298 (42%), Positives = 172/298 (57%), Gaps = 46/298 (15%)
Query: 1 MSAYAHQMERQLSSQSLSSRGGS-------------------VIFIGGLITVGVLLITLL 41
MSAY H MER+LS LSSRG S IFI L+T GVL+ITLL
Sbjct: 1 MSAYTHPMERELSG--LSSRGNSELGSRYSIESGCYMTSLAASIFIASLVTFGVLMITLL 58
Query: 42 VTLAVMLQSCQNRSSGVVELTKSSGDNNYCKLFALHAELNSL-EADNLPEICRGLAIRYI 100
+ L+ MLQSC+NR+ +VE + YCK+ +LH++LNSL E LP +CR +A+ I
Sbjct: 59 IALSTMLQSCENRNIAIVEAQRLDESFGYCKILSLHSQLNSLDEESELPLLCRDVALHRI 118
Query: 101 KEGQFARDLNLSIQIVESYFNTLTPSYNGLDVVLMDIDDIFASSSKYSNLLIDRVNVRGY 160
K+G + R+LN +IQ+ +YF T+ P + DVV++DIDD +NLL Y
Sbjct: 119 KQGIYLRELNFTIQMALTYFQTIKPMNDNCDVVVIDIDD--------TNLL---EQDSYY 167
Query: 161 IECIEEAKHLKHMFTLKLFMKLQVRGWPVILLSRKHEGQRNATTELLISAGYRGWSSLIM 220
++ IEEAKH K + L L+ KL+ +G+ ++LLSR+ E +RNAT E L S GY WS LIM
Sbjct: 168 MKYIEEAKHQKSILILALYSKLRSQGYSMVLLSRRPETERNATIEQLKSRGYSDWSHLIM 227
Query: 221 RLDNEMQMDSREYLSRRRTILQKEGFHITGLISNQMDALIGQ--SLGKRVFKLPNPLY 276
++ Q +E L R G + G+I N MD L GQ KR+FKLP+ Y
Sbjct: 228 SREDTRQ---KEELER--------GHRVIGVIGNHMDVLRGQWNWQSKRLFKLPSLTY 274
>sp|P27061|PPA1_SOLLC Acid phosphatase 1 OS=Solanum lycopersicum GN=APS1 PE=2 SV=1
Length = 255
Score = 90.5 bits (223), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 101/214 (47%), Gaps = 10/214 (4%)
Query: 71 CKLFALHAELNSLEA-DNLPEICRGLAIRYIKEGQFARDLNLSIQIVESYFNTLTPSYNG 129
C + E N+L +PE C Y+ + +++ Y ++ +G
Sbjct: 43 CTTWRFVVETNNLSPWKTIPEECADYVKEYMVGPGYKMEIDRVSDEAGEYAKSVDLGDDG 102
Query: 130 LDVVLMDIDDI------FASSSKYSNLLIDRVNVRGYIECIEEAKHLKHMFTLKLFMKLQ 183
DV + D+D+ + S +Y + D V ++E A L +LKL+ ++
Sbjct: 103 RDVWIFDVDETLLSNLPYYSDHRYGLEVFDDVEFDKWVEN-GTAPALGS--SLKLYQEVL 159
Query: 184 VRGWPVILLSRKHEGQRNATTELLISAGYRGWSSLIMRLDNEMQMDSREYLSRRRTILQK 243
G+ V LL+ + E R+ T E L++AG+ W LI+R ++ + Y S RR + +
Sbjct: 160 KLGFKVFLLTGRSERHRSVTVENLMNAGFHDWHKLILRGSDDHGKTATTYKSERRNAMVE 219
Query: 244 EGFHITGLISNQMDALIGQSLGKRVFKLPNPLYY 277
EGF I G +Q L+G S+ R FKLPNP+YY
Sbjct: 220 EGFRIVGNSGDQWSDLLGSSMSYRSFKLPNPMYY 253
>sp|P10743|VSPB_SOYBN Stem 31 kDa glycoprotein OS=Glycine max GN=VSPB PE=2 SV=1
Length = 254
Score = 76.3 bits (186), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 59/212 (27%), Positives = 97/212 (45%), Gaps = 7/212 (3%)
Query: 71 CKLFALHAELNSLEA-DNLPEICRGLAIRYIKEGQFARDLNLSIQIVESYFNTLTPSYNG 129
C F L E +++ A +PE C YI QF D Q +++F +
Sbjct: 43 CASFRLAVEAHNIRAFKTIPEECVEPTKDYINGEQFRSDSKTVNQ--QAFFYASEREVHH 100
Query: 130 LDVVLMDIDDIFASSSKY---SNLLIDRVNVRGYIECIEEAKHLKHMFTLKLFMKLQVRG 186
D+ + ID+ S+ Y ++ N Y E + + TLK + KL G
Sbjct: 101 NDIFIFGIDNTVLSNIPYYEKHGYGVEEFNETLYDEWVNKGDAPALPETLKNYNKLLSLG 160
Query: 187 WPVILLSRKHEGQRNATTELLISAGYRGWSSLIMRLDNEMQMDSREYLSRRRTILQKEGF 246
+ ++ LS ++ + T L AG+ W LI++ + + ++ Y S R L ++G+
Sbjct: 161 FKIVFLSGRYLDKMAVTEANLKKAGFHTWEQLILKDPHLITPNALSYKSAMRENLLRQGY 220
Query: 247 HITGLISNQMDALIGQSLGK-RVFKLPNPLYY 277
I G+I +Q L+G G+ R FKLPNP+YY
Sbjct: 221 RIVGIIGDQWSDLLGDHRGESRTFKLPNPMYY 252
>sp|P15490|VSPA_SOYBN Stem 28 kDa glycoprotein OS=Glycine max GN=VSPA PE=2 SV=1
Length = 254
Score = 72.4 bits (176), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 96/213 (45%), Gaps = 8/213 (3%)
Query: 71 CKLFALHAELNSLEA-DNLPEICRGLAIRYIKEGQFARDLNLSIQIVESYFNTLTPSYNG 129
C + L E +++ + +PE C YI Q+ D Q ++YF +
Sbjct: 42 CASWRLAVEAHNIFGFETIPEECVEATKEYIHGEQYRSDSKTVNQ--QAYFYARDLEVHP 99
Query: 130 LDVVLMDIDDIFASSSKY---SNLLIDRVNVRGYIECIEEAKHLKHMFTLKLFMKLQVRG 186
D + ID S+ Y +++ N Y E + + TLK + KL G
Sbjct: 100 KDTFVFSIDGTVLSNIPYYKKHGYGVEKFNSTLYDEWVNKGNAPALPETLKNYNKLVSLG 159
Query: 187 WPVILLSRKHEGQRNATTELLISAGYRGWSSLIMRLDNEMQM-DSREYLSRRRTILQKEG 245
+ +I LS + ++ T L AGY W LI++ + ++ Y + R L ++G
Sbjct: 160 FKIIFLSGRTLDKQAVTEANLKKAGYHTWEKLILKDPQDPSTPNAVSYKTAAREKLIRQG 219
Query: 246 FHITGLISNQMDALIGQSLGK-RVFKLPNPLYY 277
++I G+I +Q L+G G+ R FKLPNPLYY
Sbjct: 220 YNIVGIIGDQWSDLLGGHRGESRTFKLPNPLYY 252
>sp|O49195|VSP1_ARATH Vegetative storage protein 1 OS=Arabidopsis thaliana GN=VSP1 PE=2
SV=2
Length = 270
Score = 67.0 bits (162), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 93/213 (43%), Gaps = 8/213 (3%)
Query: 71 CKLFALHAEL-NSLEADNLPEICRGLAIRY-IKEGQFARDLNLSIQIVESYFNTLTPSYN 128
C+ + L E N + D +P C+ Y I Q+ D + Y L +
Sbjct: 57 CRSWHLGVETSNIINFDTVPANCKAYVEDYLITSKQYQYDSKTVNKEAYFYAKGLALKND 116
Query: 129 GLDVVLMDIDDIFASSSKYSNLL---IDRVNVRGYIECIEEAKHLKHMF-TLKLFMKLQV 184
++V + D+DD SS Y + Y +E + + TL L+ L
Sbjct: 117 TVNVWIFDLDDTLLSSIPYYAKYGYGTENTAPGAYWSWLESGESTPGLPETLHLYENLLE 176
Query: 185 RGWPVILLSRKHEGQRNATTELLISAGYRGWSSLIMRLDNEMQMDSREYLSRRRTILQKE 244
G I++S + + T E L + G W LI++ N ++ Y S+ R L K+
Sbjct: 177 LGIEPIIISDRWKKLSEVTVENLKAVGVTKWKHLILK-PNGSKLTQVVYKSKVRNSLVKK 235
Query: 245 GFHITGLISNQMDALIGQSLGKRVFKLPNPLYY 277
G++I G I +Q L+ + G RVFKLPNPLYY
Sbjct: 236 GYNIVGNIGDQWADLVEDTPG-RVFKLPNPLYY 267
>sp|P10742|S25K_SOYBN Stem 31 kDa glycoprotein (Fragment) OS=Glycine max GN=VSP25 PE=2
SV=2
Length = 291
Score = 65.9 bits (159), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 93/210 (44%), Gaps = 8/210 (3%)
Query: 71 CKLFALHAELNSLEA-DNLPEICRGLAIRYIKEGQFARDLNLSIQIVESYFNTLTPSYNG 129
C + L E +++ + +PE C YI Q+ D Q ++YF +
Sbjct: 39 CASWRLAVEAHNIFGFETIPEECVEATKEYIHGEQYRSDSKTVNQ--QAYFYARDLEVHP 96
Query: 130 LDVVLMDIDDIFASSSKY---SNLLIDRVNVRGYIECIEEAKHLKHMFTLKLFMKLQVRG 186
D + ID S+ Y +++ N Y E + + TLK + KL G
Sbjct: 97 KDTFVFSIDGTVLSNIPYYKKHGYGVEKFNSTLYDEWVNKGNAPALPETLKNYNKLVSLG 156
Query: 187 WPVILLSRKHEGQRNATTELLISAGYRGWSSLIMRLDNEMQM-DSREYLSRRRTILQKEG 245
+ +I LS + ++ T L AGY W LI++ + ++ Y + R L ++G
Sbjct: 157 FKIIFLSGRTLDKQAVTEANLKKAGYHTWEKLILKDPQDPSTPNAVSYKTAAREKLIRQG 216
Query: 246 FHITGLISNQMDALIGQSLGK-RVFKLPNP 274
++I G+I +Q L+G G+ R FKLPNP
Sbjct: 217 YNIVGIIGDQWSDLLGGHRGESRTFKLPNP 246
>sp|O82122|VSP2_ARATH Vegetative storage protein 2 OS=Arabidopsis thaliana GN=VSP2 PE=1
SV=1
Length = 265
Score = 65.5 bits (158), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 95/213 (44%), Gaps = 8/213 (3%)
Query: 71 CKLFALHAEL-NSLEADNLPEICRGLAIRY-IKEGQFARDLNLSIQIVESYFNTLTPSYN 128
C+ + L E N ++ D +P C+ Y I Q+ D + Y L +
Sbjct: 52 CRSWHLGVETSNIIDFDTVPANCKDYVEDYLITSKQYQYDSKTVCKEAYFYAKGLALKND 111
Query: 129 GLDVVLMDIDDIFASSSKYSNLL---IDRVNVRGYIECIEEAKHLKHMF-TLKLFMKLQV 184
++V + D+DD SS Y ++ + Y + + L L+ +
Sbjct: 112 TVNVWIFDLDDTLLSSIPYYAKYGYGTEKTDPGAYWLWLGTGASTPGLPEALHLYQNIIE 171
Query: 185 RGWPVILLSRKHEGQRNATTELLISAGYRGWSSLIMRLDNEMQMDSREYLSRRRTILQKE 244
G I+LS + + +N T + L +AG W LI++ N + Y S+ R L K+
Sbjct: 172 LGIEPIILSDRWKLWKNVTLDNLEAAGVTYWKHLILK-PNGSNLRQVVYKSKVRKSLVKK 230
Query: 245 GFHITGLISNQMDALIGQSLGKRVFKLPNPLYY 277
G++I G I +Q L+ + G RVFKLPNPLYY
Sbjct: 231 GYNIVGNIGDQWADLVEDTPG-RVFKLPNPLYY 262
>sp|Q07799|YL007_YEAST Uncharacterized protein YLL007C OS=Saccharomyces cerevisiae (strain
ATCC 204508 / S288c) GN=YLL007C PE=1 SV=1
Length = 665
Score = 33.9 bits (76), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 67/149 (44%), Gaps = 12/149 (8%)
Query: 101 KEGQFA---RDLNLSIQIVESYFNTLTPSYNGLDVVL-MDIDDIFASSSKYSNLLIDRVN 156
KE Q++ RDL L QI + N T Y L++ L I+ + S S +LL +
Sbjct: 321 KENQYSDSERDLKL--QIATAVLNRQTCFYKTLELFLRFWIESLAKSQSDLVSLLNLAII 378
Query: 157 VRGYIECIEEAKHLKHMFTLKLFMKLQVRGWPVILLSRKHEGQRNATTELLISAGYRGWS 216
Y+ C+ + L+ K +K QV V L S +++ R + + YR WS
Sbjct: 379 TLKYV-CLSSS-DLEAAIQTKSLLKTQV----VALDSMRYKFARTLQLDSIKKEQYRTWS 432
Query: 217 SLIMRLDNEMQMDSREYLSRRRTILQKEG 245
S I D + R+Y+ +R + ++G
Sbjct: 433 SSIASFDTMLSGQVRDYVRHQRLLQLQKG 461
>sp|Q6NVD0|FREM2_MOUSE FRAS1-related extracellular matrix protein 2 OS=Mus musculus GN=Frem2
PE=2 SV=2
Length = 3160
Score = 33.1 bits (74), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 4/51 (7%)
Query: 16 SLSSRGGSVIFIGGLITVGVLLITLLVTLAVMLQSCQNRSSGVVELTKSSG 66
S SS V +GGL TVG+L + L V AVM C+NRS+ + K SG
Sbjct: 3096 SPSSEVSLVTVLGGL-TVGLLTVCLAVAAAVM---CRNRSTKGKDTPKGSG 3142
>sp|P56417|SYY_HELPY Tyrosine--tRNA ligase OS=Helicobacter pylori (strain ATCC 700392 /
26695) GN=tyrS PE=3 SV=1
Length = 402
Score = 31.6 bits (70), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 48/113 (42%), Gaps = 14/113 (12%)
Query: 58 VVELTKSSGDNNYCKLFALHAE-------LNSLEADNLPEICRGLAI-RYIKEGQFARDL 109
V+E K+ + Y L H E L++L A + E+C ++ R ++ F +
Sbjct: 102 VLENAKTYEEQIYKILDEKHTEVCFNSTWLDALGAKGMIELCAKFSVARMLERDDFTKRY 161
Query: 110 --NLSIQIVESYFNTLTPSYNGLDVVLMDIDDIFASSSKYSNLLIDRVNVRGY 160
N I IVE L P G D V MD D + + NLL+ R R Y
Sbjct: 162 KENRPISIVE----FLYPLLQGYDSVAMDADIELGGNDQKFNLLVGRFLQRAY 210
>sp|Q95KD7|K1407_MACFA Coiled-coil domain-containing protein KIAA1407 homolog OS=Macaca
fascicularis GN=QflA-14927 PE=2 SV=1
Length = 935
Score = 31.2 bits (69), Expect = 9.6, Method: Composition-based stats.
Identities = 24/110 (21%), Positives = 46/110 (41%), Gaps = 7/110 (6%)
Query: 147 YSNLLIDRVNVRGYIECIEEAKHLKHMFTLKLFMKLQVRGW-----PVILLSR-KHEGQR 200
YS +L+ RV +R +++ + + + F ++ K R W V + SR KHE
Sbjct: 807 YSQILLKRV-IRSWLQYMTDLQEEVRKFCVRFLQKKIFRAWFNMVREVKIDSRGKHEIAA 865
Query: 201 NATTELLISAGYRGWSSLIMRLDNEMQMDSREYLSRRRTILQKEGFHITG 250
+ ++ +R W + + E + R RR+ + F + G
Sbjct: 866 EHSDRRILWITFRTWKKFVKFMKEERVKEERRQQLRRKVVEILPDFQVPG 915
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.322 0.136 0.389
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 96,303,776
Number of Sequences: 539616
Number of extensions: 3653663
Number of successful extensions: 9561
Number of sequences better than 100.0: 17
Number of HSP's better than 100.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 12
Number of HSP's that attempted gapping in prelim test: 9543
Number of HSP's gapped (non-prelim): 19
length of query: 290
length of database: 191,569,459
effective HSP length: 116
effective length of query: 174
effective length of database: 128,974,003
effective search space: 22441476522
effective search space used: 22441476522
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (28.1 bits)