BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 031458
(159 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P33050|PSC13_MAIZE Pollen-specific protein C13 OS=Zea mays GN=MGS1 PE=2 SV=1
Length = 170
Score = 132 bits (331), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 64/147 (43%), Positives = 97/147 (65%)
Query: 6 LLIALCVLPALVSAVRPNPKPFSVEGRVYCDTCQAGFETPATTYIAGAKVKVECKDRKSM 65
+++ LCV+ + +A PN + ++GRVYCDTC+AGF T T YIAGAKV++ECK +
Sbjct: 14 VILCLCVVLSCAAADDPNLPDYVIQGRVYCDTCRAGFVTNVTEYIAGAKVRLECKHFGTG 73
Query: 66 QVVYSREGKTDSTGTYKILVSEDHQDQLCDAVLVSSPQNDCKTVAPGRERSRVILTSYNG 125
++ + +G TD+TGTY I + + H++ +C VLV+SP+ DC V R+R+ V+LT G
Sbjct: 74 KLERAIDGVTDATGTYTIELKDSHEEDICQVVLVASPRKDCDEVQALRDRAGVLLTRNVG 133
Query: 126 ISSETRYANSMGFMKAEPMSGCAEVLR 152
IS R AN +G+ K P+ CA +L+
Sbjct: 134 ISDSLRPANPLGYFKDVPLPVCAALLK 160
>sp|P13447|LAT52_SOLLC Anther-specific protein LAT52 OS=Solanum lycopersicum GN=LAT52 PE=2
SV=1
Length = 161
Score = 130 bits (327), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 96/152 (63%), Gaps = 2/152 (1%)
Query: 1 MARTLLLI-ALCVLPALVSAVRPNPKPFSVEGRVYCDTCQAGFETPATTYIAGAKVKVEC 59
MA+ ++L+ ALC+L AL + P+ F VEG+VYCDTC+ FET + + GA VK++C
Sbjct: 1 MAKAIVLLSALCIL-ALANFAHCRPEVFDVEGKVYCDTCRVQFETKLSENLEGATVKLQC 59
Query: 60 KDRKSMQVVYSREGKTDSTGTYKILVSEDHQDQLCDAVLVSSPQNDCKTVAPGRERSRVI 119
++ + +S EG TD G YK+ V+ DH++ +C+ +V SP+ DCK G E++R+
Sbjct: 60 RNISTEAETFSVEGVTDKDGKYKLTVNGDHENDICEVTVVKSPREDCKESVSGYEKARIE 119
Query: 120 LTSYNGISSETRYANSMGFMKAEPMSGCAEVL 151
+ GI + R+AN + FMKAE + GC E L
Sbjct: 120 CSDNVGIHNAVRFANPLFFMKAESVQGCKEAL 151
>sp|O49813|OLEE1_BETPN Olee1-like protein OS=Betula pendula PE=2 SV=1
Length = 166
Score = 124 bits (311), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 64/157 (40%), Positives = 99/157 (63%), Gaps = 7/157 (4%)
Query: 1 MARTLLLIA--LCVLPALVSAVRPNPKPFSVEGRVYCDTCQAGFETPATTYIAGAKVKVE 58
MA+++++ A LC L L A + F VEG+VYCD C+ F T +TY+ GAKV +E
Sbjct: 1 MAKSIIIQAPALCFLSLLGFAY--SESRFFVEGKVYCDNCRTQFVTKLSTYMKGAKVSLE 58
Query: 59 CKDRKSMQVVYSREGKTDSTGTYKILVSEDHQDQLCDAVLVSSPQNDCKTVAPG---RER 115
C++R+ ++YS + +TD +GTY+I V DH++++C+ L S DC V+ ++
Sbjct: 59 CRNREGGTLIYSSDSETDKSGTYRIPVDGDHEEEICEIALKKSSDPDCSEVSKDPFLKKS 118
Query: 116 SRVILTSYNGISSETRYANSMGFMKAEPMSGCAEVLR 152
+R+ LT NGIS+ R AN +GFMK +P+ CA+ LR
Sbjct: 119 ARISLTKNNGISTPVRLANPLGFMKKKPLPECAKALR 155
>sp|Q8H6L7|PHLB_PHLPR Pollen allergen Phl p 11 OS=Phleum pratense PE=1 SV=1
Length = 143
Score = 116 bits (290), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 56/133 (42%), Positives = 77/133 (57%), Gaps = 1/133 (0%)
Query: 27 FSVEGRVYCDTCQAGFETPATTYIAGAKVKVECKDRKSMQVVYSREGKTDSTGTYKILVS 86
F V GRVYCD C+AGFET + + GA V V+C+ + E TD G YKI +
Sbjct: 6 FVVTGRVYCDPCRAGFETNVSHNVQGATVAVDCRPFNGGESKLKAEATTDGLGWYKIEID 65
Query: 87 EDHQDQLCDAVLVSSPQNDCKTVAPGRERSRVILTSYNGISSE-TRYANSMGFMKAEPMS 145
+DHQ+++C+ VL SP C + R+R+RV LTS NGI + RYAN + F + EP+
Sbjct: 66 QDHQEEICEVVLAKSPDTTCSEIEEFRDRARVPLTSNNGIKQQGIRYANPIAFFRKEPLK 125
Query: 146 GCAEVLRLYQEED 158
C +L+ Y D
Sbjct: 126 ECGGILQAYDLRD 138
>sp|Q8LGR0|CHE1_CHEAL Pollen allergen Che a 1 OS=Chenopodium album PE=1 SV=1
Length = 168
Score = 115 bits (288), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/158 (39%), Positives = 92/158 (58%), Gaps = 4/158 (2%)
Query: 5 LLLIALCVLPALVSAVRPNPKPFSVEGRVYCDTCQAGFETPATTYIAGAKVKVECKDRKS 64
LL+ ALCVL +L F V+G VYCDTC+ F T +T + GA VK+EC++ +
Sbjct: 9 LLVGALCVL-SLAGVANAAENHFKVQGMVYCDTCRIQFMTRISTIMEGATVKLECRNITA 67
Query: 65 MQVVYSREGKTDSTGTYKILVSEDHQDQLCDAVLVSSPQNDCKTVAP---GRERSRVILT 121
+ E TD G Y I V+ D +D +C+ LV SP ++C V+ ++ ++V LT
Sbjct: 68 GTQTFKAEAVTDKVGQYSIPVNGDFEDDICEIELVKSPNSECSEVSHDVYAKQSAKVSLT 127
Query: 122 SYNGISSETRYANSMGFMKAEPMSGCAEVLRLYQEEDV 159
S NG +S+ R AN++GFM+ EP+ C EVL+ DV
Sbjct: 128 SNNGEASDIRSANALGFMRKEPLKECPEVLKELDLYDV 165
>sp|Q7M1X5|LOLB_LOLPR Major pollen allergen Lol p 11 OS=Lolium perenne PE=1 SV=1
Length = 134
Score = 112 bits (281), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 54/129 (41%), Positives = 75/129 (58%), Gaps = 1/129 (0%)
Query: 27 FSVEGRVYCDTCQAGFETPATTYIAGAKVKVECKDRKSMQVVYSREGKTDSTGTYKILVS 86
F V GRVYCD C+AGFET + + GA V V+C+ + E TD G YKI +
Sbjct: 6 FVVTGRVYCDPCRAGFETNVSHNVEGATVAVDCRPFDGGESKLKAEATTDKDGWYKIEID 65
Query: 87 EDHQDQLCDAVLVSSPQNDCKTVAPGRERSRVILTSYNGISSE-TRYANSMGFMKAEPMS 145
+DHQ+++C+ VL SP C + R+R+RV LTS GI + RYAN + F + EP+
Sbjct: 66 QDHQEEICEVVLAKSPDKSCSEIEEFRDRARVPLTSNXGIKQQGIRYANPIAFFRKEPLK 125
Query: 146 GCAEVLRLY 154
C +L+ Y
Sbjct: 126 ECGGILQAY 134
>sp|P19963|ALL1_OLEEU Major pollen allergen Ole e 1 OS=Olea europaea PE=1 SV=2
Length = 145
Score = 102 bits (253), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 77/131 (58%), Gaps = 4/131 (3%)
Query: 24 PKP----FSVEGRVYCDTCQAGFETPATTYIAGAKVKVECKDRKSMQVVYSREGKTDSTG 79
P+P F ++G+VYCDTC+AGF T + +I GA ++++CKD+++ V ++ G T + G
Sbjct: 4 PQPPVSQFHIQGQVYCDTCRAGFITELSEFIPGASLRLQCKDKENGDVTFTEVGYTRAEG 63
Query: 80 TYKILVSEDHQDQLCDAVLVSSPQNDCKTVAPGRERSRVILTSYNGISSETRYANSMGFM 139
Y +LV DH+++ C+ L+SS + DC + + N ++ TR N +GF
Sbjct: 64 LYSMLVERDHKNEFCEITLISSGRKDCNEIPTEGWAKPSLKFKLNTVNGTTRTVNPLGFF 123
Query: 140 KAEPMSGCAEV 150
K E + CA+V
Sbjct: 124 KKEALPKCAQV 134
>sp|O82015|LIGV1_LIGVU Major pollen allergen Lig v 1 OS=Ligustrum vulgare PE=1 SV=2
Length = 145
Score = 95.9 bits (237), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 73/126 (57%)
Query: 27 FSVEGRVYCDTCQAGFETPATTYIAGAKVKVECKDRKSMQVVYSREGKTDSTGTYKILVS 86
F ++G+VYCDTC+A F T + +I GA V+++CKD ++ +V ++ G T + G Y +L+
Sbjct: 11 FYIQGQVYCDTCRARFITELSEFIPGAGVRLQCKDGENGKVTFTEVGYTKAEGLYNMLIE 70
Query: 87 EDHQDQLCDAVLVSSPQNDCKTVAPGRERSRVILTSYNGISSETRYANSMGFMKAEPMSG 146
DH+++ C+ L+SS + DC + + N ++ TR N +GF+K E +
Sbjct: 71 RDHKNEFCEITLISSSRKDCDEIPTEGWVKPSLKFVLNTVNGTTRTINPLGFLKKEVLPK 130
Query: 147 CAEVLR 152
C +V
Sbjct: 131 CPQVFN 136
>sp|P82242|PLAL1_PLALA Major pollen allergen Pla l 1 OS=Plantago lanceolata PE=1 SV=2
Length = 131
Score = 70.1 bits (170), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 70/130 (53%), Gaps = 8/130 (6%)
Query: 23 NPKPFSVEGRVYCDTCQA-GFETPATTYIAGAKVKVECKDRKSMQVVYSREGKTDSTGTY 81
+P F VEG VYC+ C + + +AGA+V+++CKD S +V+YS G+TD G Y
Sbjct: 5 HPAKFHVEGEVYCNVCHSRNLINELSERMAGAQVQLDCKD-DSKKVIYSIGGETDQDGVY 63
Query: 82 KILVSEDHQDQLCDAVLVSSPQNDCKTV---APGR-ERSRVILTSYNGISSETRYANSMG 137
++ V H+D C+ LV S + DC + A G + S+V L+ I+ +TR+ +
Sbjct: 64 RLPVVGYHED--CEIKLVKSSRPDCSEIPKLAKGTIQTSKVDLSKNTTITEKTRHVKPLS 121
Query: 138 FMKAEPMSGC 147
F GC
Sbjct: 122 FRAKTDAPGC 131
>sp|A4IGN8|AMER2_XENTR APC membrane recruitment protein 2 OS=Xenopus tropicalis GN=amer2
PE=2 SV=2
Length = 566
Score = 37.0 bits (84), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 57/126 (45%), Gaps = 11/126 (8%)
Query: 43 ETPATTYIAGAKVKVECK---DRKS---MQVVYSREGKTDSTGTYKI-LVSEDHQDQLCD 95
E+PA +G K K RKS M ++ + K DS GT KI +V D L D
Sbjct: 9 ESPAAEQPSGKINKTAFKLFGKRKSGSGMPTIFGVKNKGDSKGTGKIGMVRSKTLDGLAD 68
Query: 96 AVLVSSPQND-CKTVAPGR---ERSRVILTSYNGISSETRYANSMGFMKAEPMSGCAEVL 151
VL S+ + + C G+ E+S +LT +SS + A S F +G +E +
Sbjct: 69 VVLESNKKEEPCTEAGAGQLNPEKSPKVLTINADVSSNSSVAKSHSFFSLLKKNGKSENV 128
Query: 152 RLYQEE 157
R Q E
Sbjct: 129 RGEQAE 134
>sp|Q9FZ35|PRP1_ARATH Proline-rich protein 1 OS=Arabidopsis thaliana GN=PRP1 PE=2 SV=1
Length = 335
Score = 35.8 bits (81), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 36/76 (47%), Gaps = 11/76 (14%)
Query: 38 CQAGFETPATTYIAGAKVKVECKDRKSMQ------VVYSREGKTDSTGTYKILVSEDHQD 91
C+ G+ET I GAK K+ C +R S + V+YS TD G + ++++
Sbjct: 220 CKNGYET---YPIQGAKAKIVCSERGSYEKSKNEVVIYSDP--TDFKGYFHVVLTHIKNL 274
Query: 92 QLCDAVLVSSPQNDCK 107
C L +SP CK
Sbjct: 275 SNCRVKLYTSPVETCK 290
>sp|P34303|YKQ8_CAEEL Putative zinc finger protein C06E1.8 OS=Caenorhabditis elegans
GN=C06E1.8 PE=4 SV=1
Length = 397
Score = 33.1 bits (74), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 10/101 (9%)
Query: 6 LLIALCVLPALVSAVRPNPKPFSVEGRVYCDTCQAGFETPATTYIAGAKVKVECKDRKSM 65
L IA +PAL S +R P+ S + T A ++A +K K+ KD
Sbjct: 182 LPIASIPIPALQSPIRSQPQTPS-----WLANLPKPIPTTAPIFVADSKEKI--KDEVEK 234
Query: 66 QVVYSREGKTDSTGTYKILVSEDHQDQLC-DAVLVSSPQND 105
VY+ + T S+ ++K +S H QLC D L+SSP D
Sbjct: 235 PDVYTSQSPTCSSVSFKSDLSAFH--QLCSDRALISSPLID 273
>sp|Q9LZJ7|PRP3_ARATH Proline-rich protein 3 OS=Arabidopsis thaliana GN=PRP3 PE=2 SV=1
Length = 313
Score = 33.1 bits (74), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 34/76 (44%), Gaps = 11/76 (14%)
Query: 38 CQAGFETPATTYIAGAKVKVECKDRKSMQ------VVYSREGKTDSTGTYKILVSEDHQD 91
C+ G+ET I GAK+++ C D S V+YS TDS G + + ++
Sbjct: 194 CKNGYET---YPILGAKIQIVCSDPASYGKSNTEVVIYSNP--TDSKGYFHLSLTSIKDL 248
Query: 92 QLCDAVLVSSPQNDCK 107
C L SP CK
Sbjct: 249 AYCRVKLYLSPVETCK 264
>sp|P41151|HFA1A_ARATH Heat stress transcription factor A-1a OS=Arabidopsis thaliana
GN=HSFA1A PE=1 SV=2
Length = 495
Score = 32.7 bits (73), Expect = 0.89, Method: Composition-based stats.
Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 3/57 (5%)
Query: 90 QDQLCDAVLVSSPQNDCKTVAPGRERSRVILTSY---NGISSETRYANSMGFMKAEP 143
+D DA++ SP N+ V E SR +L Y N SS R N+ GF K +P
Sbjct: 62 EDPATDAIVSWSPTNNSFIVWDPPEFSRDLLPKYFKHNNFSSFVRQLNTYGFRKVDP 118
>sp|Q606Q2|HISX_METCA Histidinol dehydrogenase OS=Methylococcus capsulatus (strain ATCC
33009 / NCIMB 11132 / Bath) GN=hisD PE=1 SV=1
Length = 436
Score = 31.2 bits (69), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 36/137 (26%), Positives = 55/137 (40%), Gaps = 24/137 (17%)
Query: 7 LIALCVLPALVSAVRPNPKPFSVEGRVYCDTCQAGFET---------PATTYIAGAKVKV 57
L AL + A +S V + F + G G ET P Y+A AK V
Sbjct: 173 LNALVLAAAYISGV---DRVFRIGGAQAVAALAYGTETVPRVDKIVGPGNIYVATAKKLV 229
Query: 58 ECKDRKSM-----QVVYSREGKTDSTGTYKILVSEDHQDQLCDAVLVS-------SPQND 105
+ M +++ +G+TD L S+ D+ A+L+S + Q
Sbjct: 230 FGQVGIDMVAGPSEILVISDGRTDPDWIAMDLFSQAEHDEDAQAILISPDAAHLEAVQAS 289
Query: 106 CKTVAPGRERSRVILTS 122
+ + PG ER+ VI TS
Sbjct: 290 IERLLPGMERAEVIRTS 306
>sp|Q29RQ1|CO7_BOVIN Complement component C7 OS=Bos taurus GN=C7 PE=2 SV=1
Length = 843
Score = 29.6 bits (65), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 31/60 (51%), Gaps = 4/60 (6%)
Query: 11 CVLPALVSAVRPNP-KPFSVEGRVYCDTCQAG--FETPATTYIAGA-KVKVECKDRKSMQ 66
CVLPAL+ ++ +P KPF G +C G E P+T + + K E KD + +Q
Sbjct: 631 CVLPALMDGIQSHPHKPFYTIGEKVTISCSGGRSLEGPSTFLCSSSLKWSPEVKDVQCVQ 690
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.131 0.379
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 55,568,411
Number of Sequences: 539616
Number of extensions: 2003838
Number of successful extensions: 4397
Number of sequences better than 100.0: 19
Number of HSP's better than 100.0 without gapping: 8
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 4379
Number of HSP's gapped (non-prelim): 19
length of query: 159
length of database: 191,569,459
effective HSP length: 108
effective length of query: 51
effective length of database: 133,290,931
effective search space: 6797837481
effective search space used: 6797837481
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 56 (26.2 bits)