BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 037924
         (158 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|O49813|OLEE1_BETPN Olee1-like protein OS=Betula pendula PE=2 SV=1
          Length = 166

 Score =  115 bits (288), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 61/155 (39%), Positives = 90/155 (58%), Gaps = 11/155 (7%)

Query: 1   MARLLLL--FALVALLLVELVNAGDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECK 58
           MA+ +++   AL  L L+    +   F + G+VYCD CR  F T  +TY++GA+V +EC+
Sbjct: 1   MAKSIIIQAPALCFLSLLGFAYSESRFFVEGKVYCDNCRTQFVTKLSTYMKGAKVSLECR 60

Query: 59  DRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTA------YPGCA 112
           +R    L YS D +TD +GTY I VDGDH+++IC + L  SS  DC              
Sbjct: 61  NREGGTLIYSSDSETDKSGTYRIPVDGDHEEEICEIALKKSSDPDCSEVSKDPFLKKSAR 120

Query: 113 LILTRSNGVVSNLHFANALGFLKNRPLAFYPELLK 147
           + LT++NG+ + +  AN LGF+K +PL   PE  K
Sbjct: 121 ISLTKNNGISTPVRLANPLGFMKKKPL---PECAK 152


>sp|P33050|PSC13_MAIZE Pollen-specific protein C13 OS=Zea mays GN=MGS1 PE=2 SV=1
          Length = 170

 Score =  111 bits (278), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 59/135 (43%), Positives = 82/135 (60%), Gaps = 7/135 (5%)

Query: 21  AGDP----FHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDST 76
           A DP    + I+GRVYCDTCR GF T+ T YI GA+VR+ECK   +  L+ ++DG TD+T
Sbjct: 27  ADDPNLPDYVIQGRVYCDTCRAGFVTNVTEYIAGAKVRLECKHFGTGKLERAIDGVTDAT 86

Query: 77  GTYNIHVDGDHQDQICYVKLISSSLADCKTAYP---GCALILTRSNGVVSNLHFANALGF 133
           GTY I +   H++ IC V L++S   DC           ++LTR+ G+  +L  AN LG+
Sbjct: 87  GTYTIELKDSHEEDICQVVLVASPRKDCDEVQALRDRAGVLLTRNVGISDSLRPANPLGY 146

Query: 134 LKNRPLAFYPELLKK 148
            K+ PL     LLK+
Sbjct: 147 FKDVPLPVCAALLKQ 161


>sp|Q8LGR0|CHE1_CHEAL Pollen allergen Che a 1 OS=Chenopodium album PE=1 SV=1
          Length = 168

 Score =  109 bits (273), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 58/154 (37%), Positives = 89/154 (57%), Gaps = 7/154 (4%)

Query: 2   ARLLLLFALVALLLVELVNAGDP-FHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDR 60
           A  LL+ AL  L L  + NA +  F ++G VYCDTCR  F T  +T ++GA V++EC++ 
Sbjct: 6   AVFLLVGALCVLSLAGVANAAENHFKVQGMVYCDTCRIQFMTRISTIMEGATVKLECRNI 65

Query: 61  NSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKT------AYPGCALI 114
            +    +  +  TD  G Y+I V+GD +D IC ++L+ S  ++C        A     + 
Sbjct: 66  TAGTQTFKAEAVTDKVGQYSIPVNGDFEDDICEIELVKSPNSECSEVSHDVYAKQSAKVS 125

Query: 115 LTRSNGVVSNLHFANALGFLKNRPLAFYPELLKK 148
           LT +NG  S++  ANALGF++  PL   PE+LK+
Sbjct: 126 LTSNNGEASDIRSANALGFMRKEPLKECPEVLKE 159


>sp|P13447|LAT52_SOLLC Anther-specific protein LAT52 OS=Solanum lycopersicum GN=LAT52 PE=2
           SV=1
          Length = 161

 Score = 93.6 bits (231), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 52/155 (33%), Positives = 88/155 (56%), Gaps = 9/155 (5%)

Query: 1   MARLLLLFALVALLLVELVNAG----DPFHIRGRVYCDTCRCGFETSATTYIQGARVRIE 56
           MA+ ++L  L AL ++ L N      + F + G+VYCDTCR  FET  +  ++GA V+++
Sbjct: 1   MAKAIVL--LSALCILALANFAHCRPEVFDVEGKVYCDTCRVQFETKLSENLEGATVKLQ 58

Query: 57  CKDRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCK---TAYPGCAL 113
           C++ ++    +SV+G TD  G Y + V+GDH++ IC V ++ S   DCK   + Y    +
Sbjct: 59  CRNISTEAETFSVEGVTDKDGKYKLTVNGDHENDICEVTVVKSPREDCKESVSGYEKARI 118

Query: 114 ILTRSNGVVSNLHFANALGFLKNRPLAFYPELLKK 148
             + + G+ + + FAN L F+K   +    E L +
Sbjct: 119 ECSDNVGIHNAVRFANPLFFMKAESVQGCKEALDE 153


>sp|O82015|LIGV1_LIGVU Major pollen allergen Lig v 1 OS=Ligustrum vulgare PE=1 SV=2
          Length = 145

 Score = 90.9 bits (224), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 48/130 (36%), Positives = 71/130 (54%), Gaps = 9/130 (6%)

Query: 25  FHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVD 84
           F+I+G+VYCDTCR  F T  + +I GA VR++CKD  +  + ++  G T + G YN+ ++
Sbjct: 11  FYIQGQVYCDTCRARFITELSEFIPGAGVRLQCKDGENGKVTFTEVGYTKAEGLYNMLIE 70

Query: 85  GDHQDQICYVKLISSSLADCKT------AYPGCALILTRSNGVVSNLHFANALGFLKNRP 138
            DH+++ C + LISSS  DC          P    +L   NG    +   N LGFLK   
Sbjct: 71  RDHKNEFCEITLISSSRKDCDEIPTEGWVKPSLKFVLNTVNGTTRTI---NPLGFLKKEV 127

Query: 139 LAFYPELLKK 148
           L   P++  K
Sbjct: 128 LPKCPQVFNK 137


>sp|P19963|ALL1_OLEEU Major pollen allergen Ole e 1 OS=Olea europaea PE=1 SV=2
          Length = 145

 Score = 89.7 bits (221), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 46/122 (37%), Positives = 68/122 (55%), Gaps = 9/122 (7%)

Query: 25  FHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVD 84
           FHI+G+VYCDTCR GF T  + +I GA +R++CKD+ + ++ ++  G T + G Y++ V+
Sbjct: 11  FHIQGQVYCDTCRAGFITELSEFIPGASLRLQCKDKENGDVTFTEVGYTRAEGLYSMLVE 70

Query: 85  GDHQDQICYVKLISSSLADCKT------AYPGCALILTRSNGVVSNLHFANALGFLKNRP 138
            DH+++ C + LISS   DC        A P     L   NG    +   N LGF K   
Sbjct: 71  RDHKNEFCEITLISSGRKDCNEIPTEGWAKPSLKFKLNTVNGTTRTV---NPLGFFKKEA 127

Query: 139 LA 140
           L 
Sbjct: 128 LP 129


>sp|Q8H6L7|PHLB_PHLPR Pollen allergen Phl p 11 OS=Phleum pratense PE=1 SV=1
          Length = 143

 Score = 88.6 bits (218), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 48/138 (34%), Positives = 71/138 (51%), Gaps = 4/138 (2%)

Query: 20  NAGDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTY 79
           + G  F + GRVYCD CR GFET+ +  +QGA V ++C+  N    K   +  TD  G Y
Sbjct: 1   DKGPGFVVTGRVYCDPCRAGFETNVSHNVQGATVAVDCRPFNGGESKLKAEATTDGLGWY 60

Query: 80  NIHVDGDHQDQICYVKLISSSLADCKTAYP---GCALILTRSNGVVSN-LHFANALGFLK 135
            I +D DHQ++IC V L  S    C           + LT +NG+    + +AN + F +
Sbjct: 61  KIEIDQDHQEEICEVVLAKSPDTTCSEIEEFRDRARVPLTSNNGIKQQGIRYANPIAFFR 120

Query: 136 NRPLAFYPELLKKYLLQN 153
             PL     +L+ Y L++
Sbjct: 121 KEPLKECGGILQAYDLRD 138


>sp|Q7M1X5|LOLB_LOLPR Major pollen allergen Lol p 11 OS=Lolium perenne PE=1 SV=1
          Length = 134

 Score = 82.4 bits (202), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/134 (32%), Positives = 67/134 (50%), Gaps = 4/134 (2%)

Query: 20  NAGDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTY 79
           + G  F + GRVYCD CR GFET+ +  ++GA V ++C+  +    K   +  TD  G Y
Sbjct: 1   DKGPGFVVTGRVYCDPCRAGFETNVSHNVEGATVAVDCRPFDGGESKLKAEATTDKDGWY 60

Query: 80  NIHVDGDHQDQICYVKLISSSLADCKTAYP---GCALILTRSNGVVSN-LHFANALGFLK 135
            I +D DHQ++IC V L  S    C           + LT + G+    + +AN + F +
Sbjct: 61  KIEIDQDHQEEICEVVLAKSPDKSCSEIEEFRDRARVPLTSNXGIKQQGIRYANPIAFFR 120

Query: 136 NRPLAFYPELLKKY 149
             PL     +L+ Y
Sbjct: 121 KEPLKECGGILQAY 134


>sp|P82242|PLAL1_PLALA Major pollen allergen Pla l 1 OS=Plantago lanceolata PE=1 SV=2
          Length = 131

 Score = 60.5 bits (145), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 48/81 (59%), Gaps = 4/81 (4%)

Query: 25  FHIRGRVYCDTCRC-GFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHV 83
           FH+ G VYC+ C         +  + GA+V+++CKD +S  + YS+ G+TD  G Y + V
Sbjct: 9   FHVEGEVYCNVCHSRNLINELSERMAGAQVQLDCKD-DSKKVIYSIGGETDQDGVYRLPV 67

Query: 84  DGDHQDQICYVKLISSSLADC 104
            G H+D  C +KL+ SS  DC
Sbjct: 68  VGYHED--CEIKLVKSSRPDC 86


>sp|Q9FZ35|PRP1_ARATH Proline-rich protein 1 OS=Arabidopsis thaliana GN=PRP1 PE=2 SV=1
          Length = 335

 Score = 34.7 bits (78), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 15/78 (19%)

Query: 36  CRCGFETSATTYIQGARVRIECKDRNSLN------LKYSVDGDTDSTGTYNIHVDGDHQD 89
           C+ G+ET     IQGA+ +I C +R S        + YS    TD  G +  HV   H  
Sbjct: 220 CKNGYETYP---IQGAKAKIVCSERGSYEKSKNEVVIYS--DPTDFKGYF--HVVLTHIK 272

Query: 90  QI--CYVKLISSSLADCK 105
            +  C VKL +S +  CK
Sbjct: 273 NLSNCRVKLYTSPVETCK 290


>sp|Q9LZJ7|PRP3_ARATH Proline-rich protein 3 OS=Arabidopsis thaliana GN=PRP3 PE=2 SV=1
          Length = 313

 Score = 32.3 bits (72), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 35/74 (47%), Gaps = 7/74 (9%)

Query: 36  CRCGFETSATTYIQGARVRIECKDRNSL---NLKYSV-DGDTDSTGTYNIHVDGDHQDQI 91
           C+ G+ET     I GA+++I C D  S    N +  +    TDS G +++ +        
Sbjct: 194 CKNGYETYP---ILGAKIQIVCSDPASYGKSNTEVVIYSNPTDSKGYFHLSLTSIKDLAY 250

Query: 92  CYVKLISSSLADCK 105
           C VKL  S +  CK
Sbjct: 251 CRVKLYLSPVETCK 264


>sp|Q18LF9|DNBI_ELHVK Major DNA-binding protein OS=Elephantid herpesvirus 1 (isolate
           Asian elephant/Berlin/Kiba/1998) PE=3 SV=1
          Length = 1143

 Score = 32.0 bits (71), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 23/86 (26%), Positives = 34/86 (39%), Gaps = 17/86 (19%)

Query: 81  IHVDGDHQDQICYVKLISSSLADCKTAYPGCALILTRSNGVVSNLH-------------- 126
           +H  G     I  V+ + ++L+D K     C L  +R+ G VS  H              
Sbjct: 860 LHASGHRGQYIGIVQALCTTLSDTKVDGLPCYLDSSRTFGTVSEYHAYCREYNGTVKVGR 919

Query: 127 ---FANALGFLKNRPLAFYPELLKKY 149
              + N  G  + RPL   P  L+KY
Sbjct: 920 TRPYCNTNGMFERRPLVTVPYALEKY 945


>sp|O51067|Y038_BORBU Uncharacterized protein BB_0038 OS=Borrelia burgdorferi (strain
           ATCC 35210 / B31 / CIP 102532 / DSM 4680) GN=BB_0038
           PE=4 SV=1
          Length = 505

 Score = 31.6 bits (70), Expect = 2.3,   Method: Composition-based stats.
 Identities = 26/99 (26%), Positives = 44/99 (44%), Gaps = 8/99 (8%)

Query: 52  RVRIECKDRNSLNLKYS-VDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTAYPG 110
           + +I+   +NS N+    + GD D    +NI + G  +D+        S L D  +  P 
Sbjct: 132 KTQIDIDPKNSYNMYLEDITGDYD----FNIVIQGFLKDKSVLYVFQKSVLNDVSSYRP- 186

Query: 111 CALILTRSNGVVSNLHFANALGFLKNRPLAFYPELLKKY 149
             +   + NG V    +A +  + +NR    YP  L+KY
Sbjct: 187 --IFFDKVNGTVLINKYARSSAYEENRSRESYPISLEKY 223


>sp|Q8AAD2|TRPB_BACTN Tryptophan synthase beta chain OS=Bacteroides thetaiotaomicron
           (strain ATCC 29148 / DSM 2079 / NCTC 10582 / E50 /
           VPI-5482) GN=trpB PE=3 SV=1
          Length = 394

 Score = 30.8 bits (68), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 26/97 (26%), Positives = 39/97 (40%), Gaps = 9/97 (9%)

Query: 48  IQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTA 107
           I GAR  +   +   +   YS+    D  G   IH +   Q +   + +      +    
Sbjct: 283 IHGARTYVIQNEDGQIEEPYSISAGLDYPGIGPIHANLAAQRRATVLAVNDDEAIEA--- 339

Query: 108 YPGCALILTRSNGVVSNLHFANALGFLKNRPLAFYPE 144
               A  LT+  G++  L  A+ALG LK   L F PE
Sbjct: 340 ----AYELTKLEGIIPALESAHALGALKK--LKFKPE 370


>sp|P53577|FIXA_BRAJA Protein FixA OS=Bradyrhizobium japonicum (strain USDA 110) GN=fixA
           PE=3 SV=2
          Length = 288

 Score = 30.0 bits (66), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 25/54 (46%), Gaps = 4/54 (7%)

Query: 28  RGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNL----KYSVDGDTDSTG 77
           R  +  D C  G +T ATTY     +R   K+    NL    K ++DGDT   G
Sbjct: 80  RAVLLTDRCFAGSDTLATTYALATAIRKIGKEYGPANLIFTGKQTIDGDTAQVG 133


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.325    0.141    0.426 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 55,706,147
Number of Sequences: 539616
Number of extensions: 2091233
Number of successful extensions: 5460
Number of sequences better than 100.0: 20
Number of HSP's better than 100.0 without gapping: 9
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 5442
Number of HSP's gapped (non-prelim): 20
length of query: 158
length of database: 191,569,459
effective HSP length: 108
effective length of query: 50
effective length of database: 133,290,931
effective search space: 6664546550
effective search space used: 6664546550
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 56 (26.2 bits)