BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 033096
(127 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O65426|LOL2_ARATH Protein LOL2 OS=Arabidopsis thaliana GN=LOL2 PE=2 SV=1
Length = 155
Score = 128 bits (321), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 66/105 (62%), Positives = 79/105 (75%), Gaps = 13/105 (12%)
Query: 19 GPPPGWQP--IPPPLPV------PSPS-----EMAQMVCGSCRRLLAYPRGARHVKCSCC 65
GPPPGW+ +PPP+ P+P+ E AQMVCGSCRRLL+Y RG++HVKCS C
Sbjct: 25 GPPPGWESAVLPPPIVTITAAVNPNPTTVEIPEKAQMVCGSCRRLLSYLRGSKHVKCSSC 84
Query: 66 QTVNFVLEAHQVGQVKCASCAVLLMYPYGAPSVRCSSCCFVTEIG 110
QTVN VLEA+QVGQV C +C +LLMYPYGAP+VRCSSC VT+I
Sbjct: 85 QTVNLVLEANQVGQVNCNNCKLLLMYPYGAPAVRCSSCNSVTDIS 129
>sp|Q704V3|LOL5_ORYSJ Protein LOL5 OS=Oryza sativa subsp. japonica GN=LOL5 PE=2 SV=1
Length = 163
Score = 114 bits (286), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 52/73 (71%), Positives = 62/73 (84%)
Query: 38 EMAQMVCGSCRRLLAYPRGARHVKCSCCQTVNFVLEAHQVGQVKCASCAVLLMYPYGAPS 97
E+ QMVCGSCR LLAY RGA +V C+CCQT+N+VLEAH+VG+V C CA LLMYP+GAP+
Sbjct: 67 EVGQMVCGSCRILLAYFRGAGYVHCTCCQTMNYVLEAHEVGKVHCGHCATLLMYPFGAPA 126
Query: 98 VRCSSCCFVTEIG 110
V+CS C FVTEIG
Sbjct: 127 VKCSLCLFVTEIG 139
Score = 32.3 bits (72), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 21/36 (58%)
Query: 38 EMAQMVCGSCRRLLAYPRGARHVKCSCCQTVNFVLE 73
E+ ++ CG C LL YP GA VKCS C V + E
Sbjct: 105 EVGKVHCGHCATLLMYPFGAPAVKCSLCLFVTEIGE 140
>sp|Q93ZB1|LOL1_ARATH Protein LOL1 OS=Arabidopsis thaliana GN=LOL1 PE=2 SV=1
Length = 154
Score = 99.0 bits (245), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 48/84 (57%), Positives = 58/84 (69%), Gaps = 5/84 (5%)
Query: 27 IPPPLPVPSPSEMAQMVCGSCRRLLAYPRGARHVKCSCCQTVNFVLEAHQVGQVKCASCA 86
+PPP +EMAQ+VCG C LL Y RGA V+CSCC TVN LEA+QV V C +C
Sbjct: 64 VPPP-----GTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLALEANQVAHVNCGNCM 118
Query: 87 VLLMYPYGAPSVRCSSCCFVTEIG 110
+LLMY YGA SV+C+ C FVT +G
Sbjct: 119 MLLMYQYGARSVKCAVCNFVTSVG 142
Score = 31.6 bits (70), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 20/31 (64%)
Query: 79 QVKCASCAVLLMYPYGAPSVRCSSCCFVTEI 109
Q+ C+ C LLMYP GA SV C+ C VT +
Sbjct: 34 QLVCSGCRNLLMYPVGATSVCCAVCNAVTAV 64
>sp|Q0J7V9|LSD1_ORYSJ Protein LSD1 OS=Oryza sativa subsp. japonica GN=LSD1 PE=2 SV=1
Length = 184
Score = 97.8 bits (242), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/79 (59%), Positives = 58/79 (73%), Gaps = 1/79 (1%)
Query: 33 VPSP-SEMAQMVCGSCRRLLAYPRGARHVKCSCCQTVNFVLEAHQVGQVKCASCAVLLMY 91
VP+P +EMAQ+VCG C LL Y RGA V+CSCC TVN +EA+QV V C +C +LLMY
Sbjct: 95 VPAPGTEMAQLVCGGCHTLLMYIRGATSVQCSCCHTVNLAMEANQVAHVNCGNCRMLLMY 154
Query: 92 PYGAPSVRCSSCCFVTEIG 110
YGA SV+C+ C FVT +G
Sbjct: 155 QYGARSVKCAVCNFVTSVG 173
Score = 32.3 bits (72), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 20/31 (64%)
Query: 79 QVKCASCAVLLMYPYGAPSVRCSSCCFVTEI 109
Q+ C+ C LLMYP GA SV C+ C VT +
Sbjct: 65 QLVCSGCRNLLMYPAGATSVCCAVCSTVTAV 95
>sp|Q6ASS2|LOL3_ORYSJ Protein LOL3 OS=Oryza sativa subsp. japonica GN=LOL3 PE=2 SV=1
Length = 186
Score = 92.8 bits (229), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 54/85 (63%), Gaps = 3/85 (3%)
Query: 27 IPPPLPVPSPSEMAQMVCGSCRRLLAYPRGARHVKCSCCQTVNFVLEAHQVGQVKCASCA 86
+PPP PV EMA ++CG CR LL Y R A V+CSCC TVN V + + V C C
Sbjct: 34 VPPPAPV---MEMAHLICGGCRTLLMYTRNADTVRCSCCSTVNLVRPVNNIAHVSCGQCR 90
Query: 87 VLLMYPYGAPSVRCSSCCFVTEIGV 111
LMYPYGAPSV+C+ C ++T G+
Sbjct: 91 TTLMYPYGAPSVKCAICHYITNTGM 115
Score = 29.6 bits (65), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 79 QVKCASCAVLLMYPYGAPSVRCSSCCFVTEI 109
Q+ C C +L YP GAPSV C+ C +T +
Sbjct: 4 QIVCHGCRSVLRYPSGAPSVCCALCQAITTV 34
>sp|Q2QMB3|LOL2_ORYSJ Protein LOL2 OS=Oryza sativa subsp. japonica GN=LOL2 PE=2 SV=1
Length = 172
Score = 87.4 bits (215), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 39/79 (49%), Positives = 51/79 (64%), Gaps = 1/79 (1%)
Query: 34 PSPS-EMAQMVCGSCRRLLAYPRGARHVKCSCCQTVNFVLEAHQVGQVKCASCAVLLMYP 92
PSP ++A ++CG CR LL Y R A V+CSCC TVN V + + C C +LMYP
Sbjct: 36 PSPGMDIAHLICGGCRTLLMYTRNATSVRCSCCDTVNLVRPVSSIAHLNCGQCQTVLMYP 95
Query: 93 YGAPSVRCSSCCFVTEIGV 111
YGAPSV+C+ C F+T G+
Sbjct: 96 YGAPSVKCAICNFITNTGM 114
>sp|P94077|LSD1_ARATH Protein LSD1 OS=Arabidopsis thaliana GN=LSD1 PE=1 SV=1
Length = 189
Score = 83.2 bits (204), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 41/83 (49%), Positives = 51/83 (61%), Gaps = 9/83 (10%)
Query: 38 EMAQMVCGSCRRLLAYPRGARHVKCSCCQTVNFV-----LEAH----QVGQVKCASCAVL 88
+MA ++CG CR +L Y RGA V+CSCCQT N V AH QV Q+ C C
Sbjct: 47 DMAHIICGGCRTMLMYTRGASSVRCSCCQTTNLVPAHSNQVAHAPSSQVAQINCGHCRTT 106
Query: 89 LMYPYGAPSVRCSSCCFVTEIGV 111
LMYPYGA SV+C+ C FVT + +
Sbjct: 107 LMYPYGASSVKCAVCQFVTNVNM 129
Score = 68.6 bits (166), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/72 (47%), Positives = 43/72 (59%), Gaps = 4/72 (5%)
Query: 41 QMVCGSCRRLLAYPRGARHVKCSCCQTVNFV---LEAHQVGQVKCASCAVLLMYPYGAPS 97
Q+VC CR LL YPRGA +V+C+ C T+N V H + + C C +LMY GA S
Sbjct: 9 QLVCHGCRNLLMYPRGASNVRCALCNTINMVPPPPPPHDMAHIICGGCRTMLMYTRGASS 68
Query: 98 VRCSSCCFVTEI 109
VRC SCC T +
Sbjct: 69 VRC-SCCQTTNL 79
>sp|Q84UR0|LOL4_ORYSJ Protein LOL4 OS=Oryza sativa subsp. japonica GN=LOL4 PE=2 SV=1
Length = 147
Score = 61.6 bits (148), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 45/73 (61%)
Query: 37 SEMAQMVCGSCRRLLAYPRGARHVKCSCCQTVNFVLEAHQVGQVKCASCAVLLMYPYGAP 96
++M++++C C LL Y RGA +++C C +N A+Q+ + C C LM+P GA
Sbjct: 40 ADMSELICSGCPTLLFYNRGASNIRCPSCNRLNSTRSANQIAHLTCGQCRTTLMHPPGAS 99
Query: 97 SVRCSSCCFVTEI 109
+V+C++C +V +
Sbjct: 100 TVQCATCRYVNHV 112
Score = 38.9 bits (89), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 26/41 (63%)
Query: 35 SPSEMAQMVCGSCRRLLAYPRGARHVKCSCCQTVNFVLEAH 75
S +++A + CG CR L +P GA V+C+ C+ VN V +A
Sbjct: 76 SANQIAHLTCGQCRTTLMHPPGASTVQCATCRYVNHVRDAR 116
>sp|Q69UP7|LOL1_ORYSJ Protein LOL1 OS=Oryza sativa subsp. japonica GN=LOL1 PE=2 SV=1
Length = 147
Score = 37.4 bits (85), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 25/48 (52%), Gaps = 2/48 (4%)
Query: 24 WQPIPPPLPVPSPSEMAQMVCGSCRRLLAYPRGARHVKCSCCQTVNFV 71
W +PP S +Q+VC CR LL YP GA + C+ C TV V
Sbjct: 32 WSSMPPS--SHSLGAQSQLVCSGCRNLLMYPAGATSICCAVCGTVTAV 77
Score = 32.7 bits (73), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 3/40 (7%)
Query: 73 EAHQVG---QVKCASCAVLLMYPYGAPSVRCSSCCFVTEI 109
+H +G Q+ C+ C LLMYP GA S+ C+ C VT +
Sbjct: 38 SSHSLGAQSQLVCSGCRNLLMYPAGATSICCAVCGTVTAV 77
>sp|A6NNT2|CP096_HUMAN Putative uncharacterized protein C16orf96 OS=Homo sapiens GN=C16orf96
PE=5 SV=3
Length = 1141
Score = 35.4 bits (80), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 20/35 (57%)
Query: 80 VKCASCAVLLMYPYGAPSVRCSSCCFVTEIGVCGL 114
+KC SC +L +YPYG P V V +GV G+
Sbjct: 986 LKCKSCNLLTLYPYGDPHVIDYDSAEVDILGVDGI 1020
>sp|Q7XJE6|MCA1_ARATH Metacaspase-1 OS=Arabidopsis thaliana GN=AMC1 PE=1 SV=1
Length = 367
Score = 33.5 bits (75), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 28 PPPLPVPSPSEMAQMVCGSCRRLLAYPRGARHVKCSCCQTVNFVLE 73
PPP + +P + C CR L P GAR ++C+ CQ V + +
Sbjct: 4 PPPSSIYAPPMLVN--CSGCRTPLQLPSGARSIRCALCQAVTHIAD 47
Score = 28.9 bits (63), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 14/30 (46%), Positives = 17/30 (56%)
Query: 80 VKCASCAVLLMYPYGAPSVRCSSCCFVTEI 109
V C+ C L P GA S+RC+ C VT I
Sbjct: 16 VNCSGCRTPLQLPSGARSIRCALCQAVTHI 45
>sp|P24851|LT_POVBO Large T antigen OS=Bovine polyomavirus PE=2 SV=2
Length = 619
Score = 31.6 bits (70), Expect = 1.7, Method: Composition-based stats.
Identities = 14/34 (41%), Positives = 17/34 (50%)
Query: 3 NREKESMENQEENEEAGPPPGWQPIPPPLPVPSP 36
+ E E +N+EEN PG Q PP P SP
Sbjct: 96 DEELEPSDNEEENPAGSQAPGSQATPPKKPRTSP 129
>sp|O94275|MUG30_SCHPO Probable E3 ubiquitin-protein ligase mug30 OS=Schizosaccharomyces
pombe (strain 972 / ATCC 24843) GN=mug30 PE=1 SV=1
Length = 807
Score = 31.2 bits (69), Expect = 2.2, Method: Composition-based stats.
Identities = 15/37 (40%), Positives = 18/37 (48%), Gaps = 3/37 (8%)
Query: 44 CGSCRRLLAYPRGARHVKCSCCQTVN---FVLEAHQV 77
C C +L YP A +CS C TVN F L Q+
Sbjct: 83 CICCNSVLRYPAQAMCFRCSLCMTVNDVYFCLSGSQI 119
>sp|P55833|RS27_HOMAM 40S ribosomal protein S27 OS=Homarus americanus GN=RPS27 PE=3 SV=2
Length = 84
Score = 30.4 bits (67), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 35/81 (43%), Gaps = 4/81 (4%)
Query: 26 PIPPPLPVPSPSEMAQMVCGSCRRLLAYPRGA-RHVKCSCCQTVNFVLEAHQVGQVKCAS 84
P+ L PSPSE + +RL+ +P VKC C ++ V +H V C
Sbjct: 2 PLAKDLLHPSPSEEKRKC--KLKRLVQHPNSYFMDVKCPGCFKISTVF-SHAQTVVACVG 58
Query: 85 CAVLLMYPYGAPSVRCSSCCF 105
CA +L P G + C F
Sbjct: 59 CATVLCQPTGGKAKLTDGCSF 79
>sp|Q8R0G7|SPNS1_MOUSE Protein spinster homolog 1 OS=Mus musculus GN=Spns1 PE=2 SV=1
Length = 528
Score = 30.0 bits (66), Expect = 4.4, Method: Composition-based stats.
Identities = 13/46 (28%), Positives = 22/46 (47%)
Query: 11 NQEENEEAGPPPGWQPIPPPLPVPSPSEMAQMVCGSCRRLLAYPRG 56
+Q ++ + GP PG +P P+ P E+ C +R+ RG
Sbjct: 11 SQADDPDDGPAPGHPGLPGPMGNPKSGELEVPDCEGLQRITGLSRG 56
>sp|Q2YDU8|SPNS1_RAT Protein spinster homolog 1 OS=Rattus norvegicus GN=Spns1 PE=2
SV=2
Length = 528
Score = 30.0 bits (66), Expect = 4.5, Method: Composition-based stats.
Identities = 13/46 (28%), Positives = 22/46 (47%)
Query: 11 NQEENEEAGPPPGWQPIPPPLPVPSPSEMAQMVCGSCRRLLAYPRG 56
+Q ++ + GP PG +P P+ P E+ C +R+ RG
Sbjct: 11 SQADDPDDGPAPGHPGLPGPMGNPKSGELEVPDCEGLQRITGLSRG 56
>sp|Q4DTX9|ATAT2_TRYCC Alpha-tubulin N-acetyltransferase 2 OS=Trypanosoma cruzi (strain CL
Brener) GN=Tc00.1047053509233.160 PE=3 SV=1
Length = 328
Score = 29.6 bits (65), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 23/45 (51%), Gaps = 3/45 (6%)
Query: 6 KESMENQEENEEAGPPPGWQPIPPPLPVPSPSEMAQMVCGSCRRL 50
+E + Q G P QP+P P+ S SE+A CG+ RR+
Sbjct: 269 EEYLREQAYRRRQGGDPRLQPVPNPV---SSSEIAAASCGARRRM 310
>sp|Q8G9P0|MERB_PSEPU Alkylmercury lyase OS=Pseudomonas putida GN=merB PE=3 SV=1
Length = 212
Score = 28.9 bits (63), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 23/50 (46%), Gaps = 3/50 (6%)
Query: 60 VKCSCCQT---VNFVLEAHQVGQVKCASCAVLLMYPYGAPSVRCSSCCFV 106
V C T + + V QV+ A V L+ P +P +RCS CC V
Sbjct: 116 VTSRCVATGRPITLTVAPEAVLQVEPAETMVSLLTPDASPDIRCSFCCHV 165
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.323 0.136 0.457
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 50,732,220
Number of Sequences: 539616
Number of extensions: 2122083
Number of successful extensions: 18666
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 116
Number of HSP's successfully gapped in prelim test: 99
Number of HSP's that attempted gapping in prelim test: 17451
Number of HSP's gapped (non-prelim): 1074
length of query: 127
length of database: 191,569,459
effective HSP length: 93
effective length of query: 34
effective length of database: 141,385,171
effective search space: 4807095814
effective search space used: 4807095814
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 55 (25.8 bits)