BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 037924
(158 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O49813|OLEE1_BETPN Olee1-like protein OS=Betula pendula PE=2 SV=1
Length = 166
Score = 115 bits (288), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 61/155 (39%), Positives = 90/155 (58%), Gaps = 11/155 (7%)
Query: 1 MARLLLL--FALVALLLVELVNAGDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECK 58
MA+ +++ AL L L+ + F + G+VYCD CR F T +TY++GA+V +EC+
Sbjct: 1 MAKSIIIQAPALCFLSLLGFAYSESRFFVEGKVYCDNCRTQFVTKLSTYMKGAKVSLECR 60
Query: 59 DRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTA------YPGCA 112
+R L YS D +TD +GTY I VDGDH+++IC + L SS DC
Sbjct: 61 NREGGTLIYSSDSETDKSGTYRIPVDGDHEEEICEIALKKSSDPDCSEVSKDPFLKKSAR 120
Query: 113 LILTRSNGVVSNLHFANALGFLKNRPLAFYPELLK 147
+ LT++NG+ + + AN LGF+K +PL PE K
Sbjct: 121 ISLTKNNGISTPVRLANPLGFMKKKPL---PECAK 152
>sp|P33050|PSC13_MAIZE Pollen-specific protein C13 OS=Zea mays GN=MGS1 PE=2 SV=1
Length = 170
Score = 111 bits (278), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/135 (43%), Positives = 82/135 (60%), Gaps = 7/135 (5%)
Query: 21 AGDP----FHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDST 76
A DP + I+GRVYCDTCR GF T+ T YI GA+VR+ECK + L+ ++DG TD+T
Sbjct: 27 ADDPNLPDYVIQGRVYCDTCRAGFVTNVTEYIAGAKVRLECKHFGTGKLERAIDGVTDAT 86
Query: 77 GTYNIHVDGDHQDQICYVKLISSSLADCKTAYP---GCALILTRSNGVVSNLHFANALGF 133
GTY I + H++ IC V L++S DC ++LTR+ G+ +L AN LG+
Sbjct: 87 GTYTIELKDSHEEDICQVVLVASPRKDCDEVQALRDRAGVLLTRNVGISDSLRPANPLGY 146
Query: 134 LKNRPLAFYPELLKK 148
K+ PL LLK+
Sbjct: 147 FKDVPLPVCAALLKQ 161
>sp|Q8LGR0|CHE1_CHEAL Pollen allergen Che a 1 OS=Chenopodium album PE=1 SV=1
Length = 168
Score = 109 bits (273), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 89/154 (57%), Gaps = 7/154 (4%)
Query: 2 ARLLLLFALVALLLVELVNAGDP-FHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDR 60
A LL+ AL L L + NA + F ++G VYCDTCR F T +T ++GA V++EC++
Sbjct: 6 AVFLLVGALCVLSLAGVANAAENHFKVQGMVYCDTCRIQFMTRISTIMEGATVKLECRNI 65
Query: 61 NSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKT------AYPGCALI 114
+ + + TD G Y+I V+GD +D IC ++L+ S ++C A +
Sbjct: 66 TAGTQTFKAEAVTDKVGQYSIPVNGDFEDDICEIELVKSPNSECSEVSHDVYAKQSAKVS 125
Query: 115 LTRSNGVVSNLHFANALGFLKNRPLAFYPELLKK 148
LT +NG S++ ANALGF++ PL PE+LK+
Sbjct: 126 LTSNNGEASDIRSANALGFMRKEPLKECPEVLKE 159
>sp|P13447|LAT52_SOLLC Anther-specific protein LAT52 OS=Solanum lycopersicum GN=LAT52 PE=2
SV=1
Length = 161
Score = 93.6 bits (231), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 52/155 (33%), Positives = 88/155 (56%), Gaps = 9/155 (5%)
Query: 1 MARLLLLFALVALLLVELVNAG----DPFHIRGRVYCDTCRCGFETSATTYIQGARVRIE 56
MA+ ++L L AL ++ L N + F + G+VYCDTCR FET + ++GA V+++
Sbjct: 1 MAKAIVL--LSALCILALANFAHCRPEVFDVEGKVYCDTCRVQFETKLSENLEGATVKLQ 58
Query: 57 CKDRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCK---TAYPGCAL 113
C++ ++ +SV+G TD G Y + V+GDH++ IC V ++ S DCK + Y +
Sbjct: 59 CRNISTEAETFSVEGVTDKDGKYKLTVNGDHENDICEVTVVKSPREDCKESVSGYEKARI 118
Query: 114 ILTRSNGVVSNLHFANALGFLKNRPLAFYPELLKK 148
+ + G+ + + FAN L F+K + E L +
Sbjct: 119 ECSDNVGIHNAVRFANPLFFMKAESVQGCKEALDE 153
>sp|O82015|LIGV1_LIGVU Major pollen allergen Lig v 1 OS=Ligustrum vulgare PE=1 SV=2
Length = 145
Score = 90.9 bits (224), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 71/130 (54%), Gaps = 9/130 (6%)
Query: 25 FHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVD 84
F+I+G+VYCDTCR F T + +I GA VR++CKD + + ++ G T + G YN+ ++
Sbjct: 11 FYIQGQVYCDTCRARFITELSEFIPGAGVRLQCKDGENGKVTFTEVGYTKAEGLYNMLIE 70
Query: 85 GDHQDQICYVKLISSSLADCKT------AYPGCALILTRSNGVVSNLHFANALGFLKNRP 138
DH+++ C + LISSS DC P +L NG + N LGFLK
Sbjct: 71 RDHKNEFCEITLISSSRKDCDEIPTEGWVKPSLKFVLNTVNGTTRTI---NPLGFLKKEV 127
Query: 139 LAFYPELLKK 148
L P++ K
Sbjct: 128 LPKCPQVFNK 137
>sp|P19963|ALL1_OLEEU Major pollen allergen Ole e 1 OS=Olea europaea PE=1 SV=2
Length = 145
Score = 89.7 bits (221), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 68/122 (55%), Gaps = 9/122 (7%)
Query: 25 FHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVD 84
FHI+G+VYCDTCR GF T + +I GA +R++CKD+ + ++ ++ G T + G Y++ V+
Sbjct: 11 FHIQGQVYCDTCRAGFITELSEFIPGASLRLQCKDKENGDVTFTEVGYTRAEGLYSMLVE 70
Query: 85 GDHQDQICYVKLISSSLADCKT------AYPGCALILTRSNGVVSNLHFANALGFLKNRP 138
DH+++ C + LISS DC A P L NG + N LGF K
Sbjct: 71 RDHKNEFCEITLISSGRKDCNEIPTEGWAKPSLKFKLNTVNGTTRTV---NPLGFFKKEA 127
Query: 139 LA 140
L
Sbjct: 128 LP 129
>sp|Q8H6L7|PHLB_PHLPR Pollen allergen Phl p 11 OS=Phleum pratense PE=1 SV=1
Length = 143
Score = 88.6 bits (218), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/138 (34%), Positives = 71/138 (51%), Gaps = 4/138 (2%)
Query: 20 NAGDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTY 79
+ G F + GRVYCD CR GFET+ + +QGA V ++C+ N K + TD G Y
Sbjct: 1 DKGPGFVVTGRVYCDPCRAGFETNVSHNVQGATVAVDCRPFNGGESKLKAEATTDGLGWY 60
Query: 80 NIHVDGDHQDQICYVKLISSSLADCKTAYP---GCALILTRSNGVVSN-LHFANALGFLK 135
I +D DHQ++IC V L S C + LT +NG+ + +AN + F +
Sbjct: 61 KIEIDQDHQEEICEVVLAKSPDTTCSEIEEFRDRARVPLTSNNGIKQQGIRYANPIAFFR 120
Query: 136 NRPLAFYPELLKKYLLQN 153
PL +L+ Y L++
Sbjct: 121 KEPLKECGGILQAYDLRD 138
>sp|Q7M1X5|LOLB_LOLPR Major pollen allergen Lol p 11 OS=Lolium perenne PE=1 SV=1
Length = 134
Score = 82.4 bits (202), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 67/134 (50%), Gaps = 4/134 (2%)
Query: 20 NAGDPFHIRGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTY 79
+ G F + GRVYCD CR GFET+ + ++GA V ++C+ + K + TD G Y
Sbjct: 1 DKGPGFVVTGRVYCDPCRAGFETNVSHNVEGATVAVDCRPFDGGESKLKAEATTDKDGWY 60
Query: 80 NIHVDGDHQDQICYVKLISSSLADCKTAYP---GCALILTRSNGVVSN-LHFANALGFLK 135
I +D DHQ++IC V L S C + LT + G+ + +AN + F +
Sbjct: 61 KIEIDQDHQEEICEVVLAKSPDKSCSEIEEFRDRARVPLTSNXGIKQQGIRYANPIAFFR 120
Query: 136 NRPLAFYPELLKKY 149
PL +L+ Y
Sbjct: 121 KEPLKECGGILQAY 134
>sp|P82242|PLAL1_PLALA Major pollen allergen Pla l 1 OS=Plantago lanceolata PE=1 SV=2
Length = 131
Score = 60.5 bits (145), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 48/81 (59%), Gaps = 4/81 (4%)
Query: 25 FHIRGRVYCDTCRC-GFETSATTYIQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHV 83
FH+ G VYC+ C + + GA+V+++CKD +S + YS+ G+TD G Y + V
Sbjct: 9 FHVEGEVYCNVCHSRNLINELSERMAGAQVQLDCKD-DSKKVIYSIGGETDQDGVYRLPV 67
Query: 84 DGDHQDQICYVKLISSSLADC 104
G H+D C +KL+ SS DC
Sbjct: 68 VGYHED--CEIKLVKSSRPDC 86
>sp|Q9FZ35|PRP1_ARATH Proline-rich protein 1 OS=Arabidopsis thaliana GN=PRP1 PE=2 SV=1
Length = 335
Score = 34.7 bits (78), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 15/78 (19%)
Query: 36 CRCGFETSATTYIQGARVRIECKDRNSLN------LKYSVDGDTDSTGTYNIHVDGDHQD 89
C+ G+ET IQGA+ +I C +R S + YS TD G + HV H
Sbjct: 220 CKNGYETYP---IQGAKAKIVCSERGSYEKSKNEVVIYS--DPTDFKGYF--HVVLTHIK 272
Query: 90 QI--CYVKLISSSLADCK 105
+ C VKL +S + CK
Sbjct: 273 NLSNCRVKLYTSPVETCK 290
>sp|Q9LZJ7|PRP3_ARATH Proline-rich protein 3 OS=Arabidopsis thaliana GN=PRP3 PE=2 SV=1
Length = 313
Score = 32.3 bits (72), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 35/74 (47%), Gaps = 7/74 (9%)
Query: 36 CRCGFETSATTYIQGARVRIECKDRNSL---NLKYSV-DGDTDSTGTYNIHVDGDHQDQI 91
C+ G+ET I GA+++I C D S N + + TDS G +++ +
Sbjct: 194 CKNGYETYP---ILGAKIQIVCSDPASYGKSNTEVVIYSNPTDSKGYFHLSLTSIKDLAY 250
Query: 92 CYVKLISSSLADCK 105
C VKL S + CK
Sbjct: 251 CRVKLYLSPVETCK 264
>sp|Q18LF9|DNBI_ELHVK Major DNA-binding protein OS=Elephantid herpesvirus 1 (isolate
Asian elephant/Berlin/Kiba/1998) PE=3 SV=1
Length = 1143
Score = 32.0 bits (71), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/86 (26%), Positives = 34/86 (39%), Gaps = 17/86 (19%)
Query: 81 IHVDGDHQDQICYVKLISSSLADCKTAYPGCALILTRSNGVVSNLH-------------- 126
+H G I V+ + ++L+D K C L +R+ G VS H
Sbjct: 860 LHASGHRGQYIGIVQALCTTLSDTKVDGLPCYLDSSRTFGTVSEYHAYCREYNGTVKVGR 919
Query: 127 ---FANALGFLKNRPLAFYPELLKKY 149
+ N G + RPL P L+KY
Sbjct: 920 TRPYCNTNGMFERRPLVTVPYALEKY 945
>sp|O51067|Y038_BORBU Uncharacterized protein BB_0038 OS=Borrelia burgdorferi (strain
ATCC 35210 / B31 / CIP 102532 / DSM 4680) GN=BB_0038
PE=4 SV=1
Length = 505
Score = 31.6 bits (70), Expect = 2.3, Method: Composition-based stats.
Identities = 26/99 (26%), Positives = 44/99 (44%), Gaps = 8/99 (8%)
Query: 52 RVRIECKDRNSLNLKYS-VDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTAYPG 110
+ +I+ +NS N+ + GD D +NI + G +D+ S L D + P
Sbjct: 132 KTQIDIDPKNSYNMYLEDITGDYD----FNIVIQGFLKDKSVLYVFQKSVLNDVSSYRP- 186
Query: 111 CALILTRSNGVVSNLHFANALGFLKNRPLAFYPELLKKY 149
+ + NG V +A + + +NR YP L+KY
Sbjct: 187 --IFFDKVNGTVLINKYARSSAYEENRSRESYPISLEKY 223
>sp|Q8AAD2|TRPB_BACTN Tryptophan synthase beta chain OS=Bacteroides thetaiotaomicron
(strain ATCC 29148 / DSM 2079 / NCTC 10582 / E50 /
VPI-5482) GN=trpB PE=3 SV=1
Length = 394
Score = 30.8 bits (68), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 39/97 (40%), Gaps = 9/97 (9%)
Query: 48 IQGARVRIECKDRNSLNLKYSVDGDTDSTGTYNIHVDGDHQDQICYVKLISSSLADCKTA 107
I GAR + + + YS+ D G IH + Q + + + +
Sbjct: 283 IHGARTYVIQNEDGQIEEPYSISAGLDYPGIGPIHANLAAQRRATVLAVNDDEAIEA--- 339
Query: 108 YPGCALILTRSNGVVSNLHFANALGFLKNRPLAFYPE 144
A LT+ G++ L A+ALG LK L F PE
Sbjct: 340 ----AYELTKLEGIIPALESAHALGALKK--LKFKPE 370
>sp|P53577|FIXA_BRAJA Protein FixA OS=Bradyrhizobium japonicum (strain USDA 110) GN=fixA
PE=3 SV=2
Length = 288
Score = 30.0 bits (66), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 25/54 (46%), Gaps = 4/54 (7%)
Query: 28 RGRVYCDTCRCGFETSATTYIQGARVRIECKDRNSLNL----KYSVDGDTDSTG 77
R + D C G +T ATTY +R K+ NL K ++DGDT G
Sbjct: 80 RAVLLTDRCFAGSDTLATTYALATAIRKIGKEYGPANLIFTGKQTIDGDTAQVG 133
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.325 0.141 0.426
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 55,706,147
Number of Sequences: 539616
Number of extensions: 2091233
Number of successful extensions: 5460
Number of sequences better than 100.0: 20
Number of HSP's better than 100.0 without gapping: 9
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 5442
Number of HSP's gapped (non-prelim): 20
length of query: 158
length of database: 191,569,459
effective HSP length: 108
effective length of query: 50
effective length of database: 133,290,931
effective search space: 6664546550
effective search space used: 6664546550
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 56 (26.2 bits)