BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 044563
         (290 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|O04195|Y2992_ARATH Uncharacterized protein At2g39920 OS=Arabidopsis thaliana
           GN=At2g39920 PE=2 SV=2
          Length = 283

 Score =  204 bits (519), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 127/298 (42%), Positives = 172/298 (57%), Gaps = 46/298 (15%)

Query: 1   MSAYAHQMERQLSSQSLSSRGGS-------------------VIFIGGLITVGVLLITLL 41
           MSAY H MER+LS   LSSRG S                    IFI  L+T GVL+ITLL
Sbjct: 1   MSAYTHPMERELSG--LSSRGNSELGSRYSIESGCYMTSLAASIFIASLVTFGVLMITLL 58

Query: 42  VTLAVMLQSCQNRSSGVVELTKSSGDNNYCKLFALHAELNSL-EADNLPEICRGLAIRYI 100
           + L+ MLQSC+NR+  +VE  +      YCK+ +LH++LNSL E   LP +CR +A+  I
Sbjct: 59  IALSTMLQSCENRNIAIVEAQRLDESFGYCKILSLHSQLNSLDEESELPLLCRDVALHRI 118

Query: 101 KEGQFARDLNLSIQIVESYFNTLTPSYNGLDVVLMDIDDIFASSSKYSNLLIDRVNVRGY 160
           K+G + R+LN +IQ+  +YF T+ P  +  DVV++DIDD        +NLL        Y
Sbjct: 119 KQGIYLRELNFTIQMALTYFQTIKPMNDNCDVVVIDIDD--------TNLL---EQDSYY 167

Query: 161 IECIEEAKHLKHMFTLKLFMKLQVRGWPVILLSRKHEGQRNATTELLISAGYRGWSSLIM 220
           ++ IEEAKH K +  L L+ KL+ +G+ ++LLSR+ E +RNAT E L S GY  WS LIM
Sbjct: 168 MKYIEEAKHQKSILILALYSKLRSQGYSMVLLSRRPETERNATIEQLKSRGYSDWSHLIM 227

Query: 221 RLDNEMQMDSREYLSRRRTILQKEGFHITGLISNQMDALIGQ--SLGKRVFKLPNPLY 276
             ++  Q   +E L R        G  + G+I N MD L GQ     KR+FKLP+  Y
Sbjct: 228 SREDTRQ---KEELER--------GHRVIGVIGNHMDVLRGQWNWQSKRLFKLPSLTY 274


>sp|P27061|PPA1_SOLLC Acid phosphatase 1 OS=Solanum lycopersicum GN=APS1 PE=2 SV=1
          Length = 255

 Score = 90.5 bits (223), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 61/214 (28%), Positives = 101/214 (47%), Gaps = 10/214 (4%)

Query: 71  CKLFALHAELNSLEA-DNLPEICRGLAIRYIKEGQFARDLNLSIQIVESYFNTLTPSYNG 129
           C  +    E N+L     +PE C      Y+    +  +++        Y  ++    +G
Sbjct: 43  CTTWRFVVETNNLSPWKTIPEECADYVKEYMVGPGYKMEIDRVSDEAGEYAKSVDLGDDG 102

Query: 130 LDVVLMDIDDI------FASSSKYSNLLIDRVNVRGYIECIEEAKHLKHMFTLKLFMKLQ 183
            DV + D+D+       + S  +Y   + D V    ++E    A  L    +LKL+ ++ 
Sbjct: 103 RDVWIFDVDETLLSNLPYYSDHRYGLEVFDDVEFDKWVEN-GTAPALGS--SLKLYQEVL 159

Query: 184 VRGWPVILLSRKHEGQRNATTELLISAGYRGWSSLIMRLDNEMQMDSREYLSRRRTILQK 243
             G+ V LL+ + E  R+ T E L++AG+  W  LI+R  ++    +  Y S RR  + +
Sbjct: 160 KLGFKVFLLTGRSERHRSVTVENLMNAGFHDWHKLILRGSDDHGKTATTYKSERRNAMVE 219

Query: 244 EGFHITGLISNQMDALIGQSLGKRVFKLPNPLYY 277
           EGF I G   +Q   L+G S+  R FKLPNP+YY
Sbjct: 220 EGFRIVGNSGDQWSDLLGSSMSYRSFKLPNPMYY 253


>sp|P10743|VSPB_SOYBN Stem 31 kDa glycoprotein OS=Glycine max GN=VSPB PE=2 SV=1
          Length = 254

 Score = 76.3 bits (186), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 59/212 (27%), Positives = 97/212 (45%), Gaps = 7/212 (3%)

Query: 71  CKLFALHAELNSLEA-DNLPEICRGLAIRYIKEGQFARDLNLSIQIVESYFNTLTPSYNG 129
           C  F L  E +++ A   +PE C      YI   QF  D     Q  +++F       + 
Sbjct: 43  CASFRLAVEAHNIRAFKTIPEECVEPTKDYINGEQFRSDSKTVNQ--QAFFYASEREVHH 100

Query: 130 LDVVLMDIDDIFASSSKY---SNLLIDRVNVRGYIECIEEAKHLKHMFTLKLFMKLQVRG 186
            D+ +  ID+   S+  Y       ++  N   Y E + +        TLK + KL   G
Sbjct: 101 NDIFIFGIDNTVLSNIPYYEKHGYGVEEFNETLYDEWVNKGDAPALPETLKNYNKLLSLG 160

Query: 187 WPVILLSRKHEGQRNATTELLISAGYRGWSSLIMRLDNEMQMDSREYLSRRRTILQKEGF 246
           + ++ LS ++  +   T   L  AG+  W  LI++  + +  ++  Y S  R  L ++G+
Sbjct: 161 FKIVFLSGRYLDKMAVTEANLKKAGFHTWEQLILKDPHLITPNALSYKSAMRENLLRQGY 220

Query: 247 HITGLISNQMDALIGQSLGK-RVFKLPNPLYY 277
            I G+I +Q   L+G   G+ R FKLPNP+YY
Sbjct: 221 RIVGIIGDQWSDLLGDHRGESRTFKLPNPMYY 252


>sp|P15490|VSPA_SOYBN Stem 28 kDa glycoprotein OS=Glycine max GN=VSPA PE=2 SV=1
          Length = 254

 Score = 72.4 bits (176), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 59/213 (27%), Positives = 96/213 (45%), Gaps = 8/213 (3%)

Query: 71  CKLFALHAELNSLEA-DNLPEICRGLAIRYIKEGQFARDLNLSIQIVESYFNTLTPSYNG 129
           C  + L  E +++   + +PE C      YI   Q+  D     Q  ++YF       + 
Sbjct: 42  CASWRLAVEAHNIFGFETIPEECVEATKEYIHGEQYRSDSKTVNQ--QAYFYARDLEVHP 99

Query: 130 LDVVLMDIDDIFASSSKY---SNLLIDRVNVRGYIECIEEAKHLKHMFTLKLFMKLQVRG 186
            D  +  ID    S+  Y       +++ N   Y E + +        TLK + KL   G
Sbjct: 100 KDTFVFSIDGTVLSNIPYYKKHGYGVEKFNSTLYDEWVNKGNAPALPETLKNYNKLVSLG 159

Query: 187 WPVILLSRKHEGQRNATTELLISAGYRGWSSLIMRLDNEMQM-DSREYLSRRRTILQKEG 245
           + +I LS +   ++  T   L  AGY  W  LI++   +    ++  Y +  R  L ++G
Sbjct: 160 FKIIFLSGRTLDKQAVTEANLKKAGYHTWEKLILKDPQDPSTPNAVSYKTAAREKLIRQG 219

Query: 246 FHITGLISNQMDALIGQSLGK-RVFKLPNPLYY 277
           ++I G+I +Q   L+G   G+ R FKLPNPLYY
Sbjct: 220 YNIVGIIGDQWSDLLGGHRGESRTFKLPNPLYY 252


>sp|O49195|VSP1_ARATH Vegetative storage protein 1 OS=Arabidopsis thaliana GN=VSP1 PE=2
           SV=2
          Length = 270

 Score = 67.0 bits (162), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 60/213 (28%), Positives = 93/213 (43%), Gaps = 8/213 (3%)

Query: 71  CKLFALHAEL-NSLEADNLPEICRGLAIRY-IKEGQFARDLNLSIQIVESYFNTLTPSYN 128
           C+ + L  E  N +  D +P  C+     Y I   Q+  D     +    Y   L    +
Sbjct: 57  CRSWHLGVETSNIINFDTVPANCKAYVEDYLITSKQYQYDSKTVNKEAYFYAKGLALKND 116

Query: 129 GLDVVLMDIDDIFASSSKYSNLL---IDRVNVRGYIECIEEAKHLKHMF-TLKLFMKLQV 184
            ++V + D+DD   SS  Y        +      Y   +E  +    +  TL L+  L  
Sbjct: 117 TVNVWIFDLDDTLLSSIPYYAKYGYGTENTAPGAYWSWLESGESTPGLPETLHLYENLLE 176

Query: 185 RGWPVILLSRKHEGQRNATTELLISAGYRGWSSLIMRLDNEMQMDSREYLSRRRTILQKE 244
            G   I++S + +     T E L + G   W  LI++  N  ++    Y S+ R  L K+
Sbjct: 177 LGIEPIIISDRWKKLSEVTVENLKAVGVTKWKHLILK-PNGSKLTQVVYKSKVRNSLVKK 235

Query: 245 GFHITGLISNQMDALIGQSLGKRVFKLPNPLYY 277
           G++I G I +Q   L+  + G RVFKLPNPLYY
Sbjct: 236 GYNIVGNIGDQWADLVEDTPG-RVFKLPNPLYY 267


>sp|P10742|S25K_SOYBN Stem 31 kDa glycoprotein (Fragment) OS=Glycine max GN=VSP25 PE=2
           SV=2
          Length = 291

 Score = 65.9 bits (159), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 93/210 (44%), Gaps = 8/210 (3%)

Query: 71  CKLFALHAELNSLEA-DNLPEICRGLAIRYIKEGQFARDLNLSIQIVESYFNTLTPSYNG 129
           C  + L  E +++   + +PE C      YI   Q+  D     Q  ++YF       + 
Sbjct: 39  CASWRLAVEAHNIFGFETIPEECVEATKEYIHGEQYRSDSKTVNQ--QAYFYARDLEVHP 96

Query: 130 LDVVLMDIDDIFASSSKY---SNLLIDRVNVRGYIECIEEAKHLKHMFTLKLFMKLQVRG 186
            D  +  ID    S+  Y       +++ N   Y E + +        TLK + KL   G
Sbjct: 97  KDTFVFSIDGTVLSNIPYYKKHGYGVEKFNSTLYDEWVNKGNAPALPETLKNYNKLVSLG 156

Query: 187 WPVILLSRKHEGQRNATTELLISAGYRGWSSLIMRLDNEMQM-DSREYLSRRRTILQKEG 245
           + +I LS +   ++  T   L  AGY  W  LI++   +    ++  Y +  R  L ++G
Sbjct: 157 FKIIFLSGRTLDKQAVTEANLKKAGYHTWEKLILKDPQDPSTPNAVSYKTAAREKLIRQG 216

Query: 246 FHITGLISNQMDALIGQSLGK-RVFKLPNP 274
           ++I G+I +Q   L+G   G+ R FKLPNP
Sbjct: 217 YNIVGIIGDQWSDLLGGHRGESRTFKLPNP 246


>sp|O82122|VSP2_ARATH Vegetative storage protein 2 OS=Arabidopsis thaliana GN=VSP2 PE=1
           SV=1
          Length = 265

 Score = 65.5 bits (158), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 59/213 (27%), Positives = 95/213 (44%), Gaps = 8/213 (3%)

Query: 71  CKLFALHAEL-NSLEADNLPEICRGLAIRY-IKEGQFARDLNLSIQIVESYFNTLTPSYN 128
           C+ + L  E  N ++ D +P  C+     Y I   Q+  D     +    Y   L    +
Sbjct: 52  CRSWHLGVETSNIIDFDTVPANCKDYVEDYLITSKQYQYDSKTVCKEAYFYAKGLALKND 111

Query: 129 GLDVVLMDIDDIFASSSKYSNLL---IDRVNVRGYIECIEEAKHLKHMF-TLKLFMKLQV 184
            ++V + D+DD   SS  Y        ++ +   Y   +        +   L L+  +  
Sbjct: 112 TVNVWIFDLDDTLLSSIPYYAKYGYGTEKTDPGAYWLWLGTGASTPGLPEALHLYQNIIE 171

Query: 185 RGWPVILLSRKHEGQRNATTELLISAGYRGWSSLIMRLDNEMQMDSREYLSRRRTILQKE 244
            G   I+LS + +  +N T + L +AG   W  LI++  N   +    Y S+ R  L K+
Sbjct: 172 LGIEPIILSDRWKLWKNVTLDNLEAAGVTYWKHLILK-PNGSNLRQVVYKSKVRKSLVKK 230

Query: 245 GFHITGLISNQMDALIGQSLGKRVFKLPNPLYY 277
           G++I G I +Q   L+  + G RVFKLPNPLYY
Sbjct: 231 GYNIVGNIGDQWADLVEDTPG-RVFKLPNPLYY 262


>sp|Q07799|YL007_YEAST Uncharacterized protein YLL007C OS=Saccharomyces cerevisiae (strain
           ATCC 204508 / S288c) GN=YLL007C PE=1 SV=1
          Length = 665

 Score = 33.9 bits (76), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 67/149 (44%), Gaps = 12/149 (8%)

Query: 101 KEGQFA---RDLNLSIQIVESYFNTLTPSYNGLDVVL-MDIDDIFASSSKYSNLLIDRVN 156
           KE Q++   RDL L  QI  +  N  T  Y  L++ L   I+ +  S S   +LL   + 
Sbjct: 321 KENQYSDSERDLKL--QIATAVLNRQTCFYKTLELFLRFWIESLAKSQSDLVSLLNLAII 378

Query: 157 VRGYIECIEEAKHLKHMFTLKLFMKLQVRGWPVILLSRKHEGQRNATTELLISAGYRGWS 216
              Y+ C+  +  L+     K  +K QV    V L S +++  R    + +    YR WS
Sbjct: 379 TLKYV-CLSSS-DLEAAIQTKSLLKTQV----VALDSMRYKFARTLQLDSIKKEQYRTWS 432

Query: 217 SLIMRLDNEMQMDSREYLSRRRTILQKEG 245
           S I   D  +    R+Y+  +R +  ++G
Sbjct: 433 SSIASFDTMLSGQVRDYVRHQRLLQLQKG 461


>sp|Q6NVD0|FREM2_MOUSE FRAS1-related extracellular matrix protein 2 OS=Mus musculus GN=Frem2
            PE=2 SV=2
          Length = 3160

 Score = 33.1 bits (74), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 29/51 (56%), Gaps = 4/51 (7%)

Query: 16   SLSSRGGSVIFIGGLITVGVLLITLLVTLAVMLQSCQNRSSGVVELTKSSG 66
            S SS    V  +GGL TVG+L + L V  AVM   C+NRS+   +  K SG
Sbjct: 3096 SPSSEVSLVTVLGGL-TVGLLTVCLAVAAAVM---CRNRSTKGKDTPKGSG 3142


>sp|P56417|SYY_HELPY Tyrosine--tRNA ligase OS=Helicobacter pylori (strain ATCC 700392 /
           26695) GN=tyrS PE=3 SV=1
          Length = 402

 Score = 31.6 bits (70), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 48/113 (42%), Gaps = 14/113 (12%)

Query: 58  VVELTKSSGDNNYCKLFALHAE-------LNSLEADNLPEICRGLAI-RYIKEGQFARDL 109
           V+E  K+  +  Y  L   H E       L++L A  + E+C   ++ R ++   F +  
Sbjct: 102 VLENAKTYEEQIYKILDEKHTEVCFNSTWLDALGAKGMIELCAKFSVARMLERDDFTKRY 161

Query: 110 --NLSIQIVESYFNTLTPSYNGLDVVLMDIDDIFASSSKYSNLLIDRVNVRGY 160
             N  I IVE     L P   G D V MD D     + +  NLL+ R   R Y
Sbjct: 162 KENRPISIVE----FLYPLLQGYDSVAMDADIELGGNDQKFNLLVGRFLQRAY 210


>sp|Q95KD7|K1407_MACFA Coiled-coil domain-containing protein KIAA1407 homolog OS=Macaca
           fascicularis GN=QflA-14927 PE=2 SV=1
          Length = 935

 Score = 31.2 bits (69), Expect = 9.6,   Method: Composition-based stats.
 Identities = 24/110 (21%), Positives = 46/110 (41%), Gaps = 7/110 (6%)

Query: 147 YSNLLIDRVNVRGYIECIEEAKHLKHMFTLKLFMKLQVRGW-----PVILLSR-KHEGQR 200
           YS +L+ RV +R +++ + + +     F ++   K   R W      V + SR KHE   
Sbjct: 807 YSQILLKRV-IRSWLQYMTDLQEEVRKFCVRFLQKKIFRAWFNMVREVKIDSRGKHEIAA 865

Query: 201 NATTELLISAGYRGWSSLIMRLDNEMQMDSREYLSRRRTILQKEGFHITG 250
             +   ++   +R W   +  +  E   + R    RR+ +     F + G
Sbjct: 866 EHSDRRILWITFRTWKKFVKFMKEERVKEERRQQLRRKVVEILPDFQVPG 915


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.322    0.136    0.389 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 96,303,776
Number of Sequences: 539616
Number of extensions: 3653663
Number of successful extensions: 9561
Number of sequences better than 100.0: 17
Number of HSP's better than 100.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 12
Number of HSP's that attempted gapping in prelim test: 9543
Number of HSP's gapped (non-prelim): 19
length of query: 290
length of database: 191,569,459
effective HSP length: 116
effective length of query: 174
effective length of database: 128,974,003
effective search space: 22441476522
effective search space used: 22441476522
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (28.1 bits)