Your job contains 1 sequence.
>026574
MARRASSRREVLHSDFLTPPVLKESMMALEKLADVKAVAQGGYPQAERCRLSVGHPEALT
SDPDIVAALSITGNFGFQPCSHGDFLGSILGTGIAREKIGDIILQGEKGAQFLVVPELAD
YLITSLEKVGNVSVSCTRIPLLALEYEPPRTKSFKTIEASLRVDALASAGFKLSRSKLVN
LISNGDVRVNWTTVTKNGTTLRTGDIVSVSGKGRIKIGEINSTRKGKFAVELIQYL
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 026574
(236 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
                                                                      Smallest
                                                                         Sum
                                                               High  Probability
Sequences producing High-scoring Segment Pairs:               Score  P(N)       N
TAIR|locus:2037048 - symbol:AT1G53120 species:3702 "Arabi...   1010  6.9e-102   1
UNIPROTKB|P74082 - symbol:sll1252 "Sll1252 protein" speci...    561  2.6e-54    1
UNIPROTKB|Q3AAH6 - symbol:CHY_2039 "S4 domain protein" sp...    360  5.2e-33    1
TIGR_CMR|CHY_2039 - symbol:CHY_2039 "S4 domain protein" s...    360  5.2e-33    1
UNIPROTKB|Q81WE2 - symbol:BAS3748 "S4 domain protein" spe...    246  6.3e-21    1
TIGR_CMR|BA_4036 - symbol:BA_4036 "S4 domain protein" spe...    246  6.3e-21    1
UNIPROTKB|Q71XY5 - symbol:LMOf2365_2060 "S4 domain protei...    233  1.5e-19    1
>TAIR|locus:2037048
symbol:AT1G53120 species:3702 "Arabidopsis thaliana"
[GO:0003723 "RNA binding" evidence=IEA;ISS] [GO:0008150
"biological_process" evidence=ND] [GO:0009507 "chloroplast"
evidence=ISM;IDA] InterPro:IPR002942 Pfam:PF01479 PROSITE:PS50889
SMART:SM00363 EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0009507
GO:GO:0003723 Gene3D:3.10.290.10 EMBL:AY059839 EMBL:BT001219
EMBL:BT002414 EMBL:BT008717 IPI:IPI00520703 RefSeq:NP_564622.1
UniGene:At.16238 UniGene:At.66953 ProteinModelPortal:Q93YQ0
SMR:Q93YQ0 STRING:Q93YQ0 PaxDb:Q93YQ0 PRIDE:Q93YQ0
EnsemblPlants:AT1G53120.1 GeneID:841746 KEGG:ath:AT1G53120
TAIR:At1g53120 eggNOG:COG2302 HOGENOM:HOG000048758
InParanoid:Q93YQ0 OMA:NTVEAST PhylomeDB:Q93YQ0 ProtClustDB:PLN00051
ArrayExpress:Q93YQ0 Genevestigator:Q93YQ0 InterPro:IPR017506
TIGRFAMs:TIGR03069 Uniprot:Q93YQ0
Length = 320
Score = 1010 (360.6 bits), Expect = 6.9e-102, P = 6.9e-102
Identities = 186/236 (78%), Positives = 222/236 (94%)
Query: 1 MARRASSRREVLHSDFLTPPVLKESMMALEKLADVKAVAQGGYPQAERCRLSVGHPEALT 60
MARRASS+REVLH+DFLTPP++KES+ LEK ADVK VAQGGYP+AERCR+S+GHP+ LT
Sbjct: 85 MARRASSKREVLHTDFLTPPIVKESVSLLEKFADVKIVAQGGYPEAERCRISIGHPDVLT 144
Query: 61 SDPDIVAALSITGNFGFQPCSHGDFLGSILGTGIAREKIGDIILQGEKGAQFLVVPELAD 120
SDPDIVAALSITGNFGFQPCSHGDFLG+ILGTGI+REK+GDI++Q EKGAQ L+VPEL D
Sbjct: 145 SDPDIVAALSITGNFGFQPCSHGDFLGAILGTGISREKLGDILIQEEKGAQVLIVPELVD 204
Query: 121 YLITSLEKVGNVSVSCTRIPLLALEYEPPRTKSFKTIEASLRVDALASAGFKLSRSKLVN 180
+++T+L+KVGNV V+C++IPLLALEYEPPRT SFKT+EASLR+DA+ASAGFK+SRSKLV+
Sbjct: 205 FVVTALDKVGNVGVTCSKIPLLALEYEPPRTNSFKTVEASLRIDAVASAGFKISRSKLVD 264
Query: 181 LISNGDVRVNWTTVTKNGTTLRTGDIVSVSGKGRIKIGEINSTRKGKFAVELIQYL 236
LIS+ DVRVNW TVTKNGT ++TGD+VSVSGKGR+KIGEIN T+KGKFAVE+I+YL
Sbjct: 265 LISSKDVRVNWATVTKNGTIVKTGDVVSVSGKGRLKIGEINETKKGKFAVEIIRYL 320
>UNIPROTKB|P74082
symbol:sll1252 "Sll1252 protein" species:1111708
"Synechocystis sp. PCC 6803 substr. Kazusa" [GO:0030096 "plasma
membrane-derived thylakoid photosystem II" evidence=IDA]
InterPro:IPR002942 Pfam:PF01479 PROSITE:PS50889 SMART:SM00363
GO:GO:0003723 EMBL:BA000022 GenomeReviews:BA000022_GR GO:GO:0030096
Gene3D:3.10.290.10 eggNOG:COG2302 HOGENOM:HOG000048758
InterPro:IPR017506 TIGRFAMs:TIGR03069 PIR:S75599 RefSeq:NP_441480.1
RefSeq:YP_005651538.1 ProteinModelPortal:P74082 IntAct:P74082
STRING:P74082 GeneID:12254190 GeneID:954860 KEGG:syn:sll1252
KEGG:syy:SYNGTS_1585 PATRIC:23840371 OMA:DFLDPRE
ProtClustDB:CLSK893139 Uniprot:P74082
Length = 259
Score = 561 (202.5 bits), Expect = 2.6e-54, P = 2.6e-54
Identities = 111/236 (47%), Positives = 160/236 (67%)
Query: 2 ARRASSRREVLHSDFLTPPVLKESMMALEKLADVKAVAQGGYPQAERCRLSVGHPEA-LT 60
A +A EV+ SDF PP++ E + L ++ + GGYPQAER RL++ E L
Sbjct: 24 AEQALRTWEVVVSDFCAPPLIAEIQTRFDSLTELHFLPWGGYPQAERQRLAIARAEIPLE 83
Query: 61 SDPDIVAALSITGNFGFQPCSHGDFLGSILGTGIAREKIGDIILQGEKGAQFLVVPELAD 120
++ + AL + GNF F P SH DFLG++LGTG+ REK+GDIIL GE+GAQ +VVPE+A+
Sbjct: 84 TEQIPLTALDVAGNFLFDPASHRDFLGALLGTGLVREKVGDIILLGERGAQAIVVPEVAE 143
Query: 121 YLITSLEKVGNVSVSCTRIPLLALEYEPPRTKSFKTIEASLRVDALASAGFKLSRSKLVN 180
++ L +V V V I L L+ PP+TK T+EASLR+DA+ASAGF LSRSK+ +
Sbjct: 144 FISLHLTQVRTVPVKAKAIALEELKIRPPKTKEMTTVEASLRLDAIASAGFGLSRSKMAD 203
Query: 181 LISNGDVRVNWTTVTKNGTTLRTGDIVSVSGKGRIKIGEINSTRKGKFAVELIQYL 236
++ G+V+VNW VT++ L+ GD+V+ GKGR++IGEI T+K ++ ++L +YL
Sbjct: 204 AVTQGNVQVNWKPVTQSSYALKAGDLVTYRGKGRLEIGEITVTKKERYRIQLTRYL 259
>UNIPROTKB|Q3AAH6
symbol:CHY_2039 "S4 domain protein" species:246194
"Carboxydothermus hydrogenoformans Z-2901" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR002942 Pfam:PF01479 PROSITE:PS50889
SMART:SM00363 EMBL:CP000141 GenomeReviews:CP000141_GR GO:GO:0003723
Gene3D:3.10.290.10 eggNOG:COG2302 HOGENOM:HOG000048758 OMA:DFLDPRE
RefSeq:YP_360858.1 ProteinModelPortal:Q3AAH6 STRING:Q3AAH6
GeneID:3726280 KEGG:chy:CHY_2039 PATRIC:21277159
BioCyc:CHYD246194:GJCN-2038-MONOMER Uniprot:Q3AAH6
Length = 246
Score = 360 (131.8 bits), Expect = 5.2e-33, P = 5.2e-33
Identities = 85/233 (36%), Positives = 128/233 (54%)
Query: 6 SSRREVLHSDFLTPPVLKESMMALEKLADVKAVAQGGYPQAERCRLSVGHPEALTSDPDI 65
+++ +++ +DFL P + L +V GGYP AE+ R + D ++
Sbjct: 15 AAKGQIVLTDFLDPAAASLTRDLLRNYPEVNYKVDGGYPNAEKVRFAFYPIYVFPEDVEL 74
Query: 66 -VAALSITGNFGFQPCSHGDFLGSILGTGIAREKIGDIILQGEKGAQFLVVPELADYLIT 124
+ + ITGNF FQ +H DFLG+ILGTGI REK+GD+IL G Q +V +L YLI
Sbjct: 75 NLGFIEITGNFKFQAVTHRDFLGAILGTGIKREKVGDLILI-PNGCQVVVDRDLVPYLIQ 133
Query: 125 SLEKVGNVSVSCTRIPLLALEYEPPRTKSFKTIEASLRVDALASAGFKLSRSKLVNLISN 184
L+KV V V+ I L +T+ SLR+D++ ++GF +SRS++V I
Sbjct: 134 QLDKVHRVGVTVKEIVREELLLPEAKTREVVAFVKSLRLDSVGASGFGISRSQMVKEIEG 193
Query: 185 GDVRVNWTTVTKNGTTLRTGDIVSVSGKGRIKIGEINSTRK-GKFAVELIQYL 236
VRVNW K + GD++S+ G+GR+++ EI K G+ V L +YL
Sbjct: 194 QKVRVNWKLQVKPSYEVAPGDVISLRGRGRVEVLEIAGVSKSGRNKVILRRYL 246
>TIGR_CMR|CHY_2039
symbol:CHY_2039 "S4 domain protein" species:246194
"Carboxydothermus hydrogenoformans Z-2901" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR002942 Pfam:PF01479 PROSITE:PS50889
SMART:SM00363 EMBL:CP000141 GenomeReviews:CP000141_GR GO:GO:0003723
Gene3D:3.10.290.10 eggNOG:COG2302 HOGENOM:HOG000048758 OMA:DFLDPRE
RefSeq:YP_360858.1 ProteinModelPortal:Q3AAH6 STRING:Q3AAH6
GeneID:3726280 KEGG:chy:CHY_2039 PATRIC:21277159
BioCyc:CHYD246194:GJCN-2038-MONOMER Uniprot:Q3AAH6
Length = 246
Score = 360 (131.8 bits), Expect = 5.2e-33, P = 5.2e-33
Identities = 85/233 (36%), Positives = 128/233 (54%)
Query: 6 SSRREVLHSDFLTPPVLKESMMALEKLADVKAVAQGGYPQAERCRLSVGHPEALTSDPDI 65
+++ +++ +DFL P + L +V GGYP AE+ R + D ++
Sbjct: 15 AAKGQIVLTDFLDPAAASLTRDLLRNYPEVNYKVDGGYPNAEKVRFAFYPIYVFPEDVEL 74
Query: 66 -VAALSITGNFGFQPCSHGDFLGSILGTGIAREKIGDIILQGEKGAQFLVVPELADYLIT 124
+ + ITGNF FQ +H DFLG+ILGTGI REK+GD+IL G Q +V +L YLI
Sbjct: 75 NLGFIEITGNFKFQAVTHRDFLGAILGTGIKREKVGDLILI-PNGCQVVVDRDLVPYLIQ 133
Query: 125 SLEKVGNVSVSCTRIPLLALEYEPPRTKSFKTIEASLRVDALASAGFKLSRSKLVNLISN 184
L+KV V V+ I L +T+ SLR+D++ ++GF +SRS++V I
Sbjct: 134 QLDKVHRVGVTVKEIVREELLLPEAKTREVVAFVKSLRLDSVGASGFGISRSQMVKEIEG 193
Query: 185 GDVRVNWTTVTKNGTTLRTGDIVSVSGKGRIKIGEINSTRK-GKFAVELIQYL 236
VRVNW K + GD++S+ G+GR+++ EI K G+ V L +YL
Sbjct: 194 QKVRVNWKLQVKPSYEVAPGDVISLRGRGRVEVLEIAGVSKSGRNKVILRRYL 246
>UNIPROTKB|Q81WE2
symbol:BAS3748 "S4 domain protein" species:1392 "Bacillus
anthracis" [GO:0008150 "biological_process" evidence=ND]
InterPro:IPR002942 Pfam:PF01479 PROSITE:PS50889 SMART:SM00363
EMBL:AE016879 EMBL:AE017334 EMBL:AE017225 GenomeReviews:AE016879_GR
GenomeReviews:AE017225_GR GenomeReviews:AE017334_GR GO:GO:0003723
Gene3D:3.10.290.10 HOGENOM:HOG000048758 OMA:DFLDPRE
RefSeq:NP_846276.1 RefSeq:YP_020678.1 RefSeq:YP_029999.1
ProteinModelPortal:Q81WE2 DNASU:1086627
EnsemblBacteria:EBBACT00000010693 EnsemblBacteria:EBBACT00000014679
EnsemblBacteria:EBBACT00000021217 GeneID:1086627 GeneID:2818410
GeneID:2850101 KEGG:ban:BA_4036 KEGG:bar:GBAA_4036 KEGG:bat:BAS3748
ProtClustDB:CLSK917204 BioCyc:BANT260799:GJAJ-3806-MONOMER
BioCyc:BANT261594:GJ7F-3924-MONOMER Uniprot:Q81WE2
Length = 255
Score = 246 (91.7 bits), Expect = 6.3e-21, P = 6.3e-21
Identities = 69/232 (29%), Positives = 117/232 (50%)
Query: 3 RRASSRREVLHSDFLTPPVLKESMMALEKLADVKAVAQGGYPQAERCRLSVGHPEALTSD 62
++A+ +V +DFL P + M + + D+ G +AER R + +P+ L +
Sbjct: 22 KQAAEYHQVKLTDFLDPRQQQIVTMVIGQ-GDIAVQFDGATSRAERKRALI-YPDYLVVN 79
Query: 63 PDI--VAALSITGNFGFQPCSHGDFLGSILGTGIAREKIGDIILQGEKGAQFLVVPELAD 120
+ V L I F H LG+ + G+ REK GDI+LQ ++ AQ +V E+
Sbjct: 80 EEEFQVEGLEIDYPSKFYTLEHRQILGTFMSLGLTREKCGDILLQEDR-AQIVVAKEVVS 138
Query: 121 YLITSLEKVGNVSVSCTRIP-LLALEYEPPRTKSFKTIEASLRVDALASAGFKLSRSKLV 179
Y+ +L+ +G V VS + + L+ + + T+ +SLR+D + + +SR K+
Sbjct: 139 YIEMNLQSIGKVKVSLSPVKGEKILQIQETWGEKSGTV-SSLRLDVMLAEMLHISRQKVQ 197
Query: 180 NLISNGDVRVNWTTVTKNGTTLRTGDIVSVSGKGRIKIGEINS-TRKGKFAV 230
I NG V+VNW TV + GD+ SV G GR K+ + T++ K+ +
Sbjct: 198 PFIKNGLVKVNWKTVEQTSYECYPGDVFSVRGYGRSKLFSVEGRTKRDKWRI 249
>TIGR_CMR|BA_4036
symbol:BA_4036 "S4 domain protein" species:198094 "Bacillus
anthracis str. Ames" [GO:0003723 "RNA binding" evidence=ISS]
[GO:0008150 "biological_process" evidence=ND] InterPro:IPR002942
Pfam:PF01479 PROSITE:PS50889 SMART:SM00363 EMBL:AE016879
EMBL:AE017334 EMBL:AE017225 GenomeReviews:AE016879_GR
GenomeReviews:AE017225_GR GenomeReviews:AE017334_GR GO:GO:0003723
Gene3D:3.10.290.10 HOGENOM:HOG000048758 OMA:DFLDPRE
RefSeq:NP_846276.1 RefSeq:YP_020678.1 RefSeq:YP_029999.1
ProteinModelPortal:Q81WE2 DNASU:1086627
EnsemblBacteria:EBBACT00000010693 EnsemblBacteria:EBBACT00000014679
EnsemblBacteria:EBBACT00000021217 GeneID:1086627 GeneID:2818410
GeneID:2850101 KEGG:ban:BA_4036 KEGG:bar:GBAA_4036 KEGG:bat:BAS3748
ProtClustDB:CLSK917204 BioCyc:BANT260799:GJAJ-3806-MONOMER
BioCyc:BANT261594:GJ7F-3924-MONOMER Uniprot:Q81WE2
Length = 255
Score = 246 (91.7 bits), Expect = 6.3e-21, P = 6.3e-21
Identities = 69/232 (29%), Positives = 117/232 (50%)
Query: 3 RRASSRREVLHSDFLTPPVLKESMMALEKLADVKAVAQGGYPQAERCRLSVGHPEALTSD 62
++A+ +V +DFL P + M + + D+ G +AER R + +P+ L +
Sbjct: 22 KQAAEYHQVKLTDFLDPRQQQIVTMVIGQ-GDIAVQFDGATSRAERKRALI-YPDYLVVN 79
Query: 63 PDI--VAALSITGNFGFQPCSHGDFLGSILGTGIAREKIGDIILQGEKGAQFLVVPELAD 120
+ V L I F H LG+ + G+ REK GDI+LQ ++ AQ +V E+
Sbjct: 80 EEEFQVEGLEIDYPSKFYTLEHRQILGTFMSLGLTREKCGDILLQEDR-AQIVVAKEVVS 138
Query: 121 YLITSLEKVGNVSVSCTRIP-LLALEYEPPRTKSFKTIEASLRVDALASAGFKLSRSKLV 179
Y+ +L+ +G V VS + + L+ + + T+ +SLR+D + + +SR K+
Sbjct: 139 YIEMNLQSIGKVKVSLSPVKGEKILQIQETWGEKSGTV-SSLRLDVMLAEMLHISRQKVQ 197
Query: 180 NLISNGDVRVNWTTVTKNGTTLRTGDIVSVSGKGRIKIGEINS-TRKGKFAV 230
I NG V+VNW TV + GD+ SV G GR K+ + T++ K+ +
Sbjct: 198 PFIKNGLVKVNWKTVEQTSYECYPGDVFSVRGYGRSKLFSVEGRTKRDKWRI 249
>UNIPROTKB|Q71XY5
symbol:LMOf2365_2060 "S4 domain protein" species:265669
"Listeria monocytogenes serotype 4b str. F2365" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR002942 Pfam:PF01479 PROSITE:PS50889
SMART:SM00363 GO:GO:0003723 EMBL:AE017262 GenomeReviews:AE017262_GR
Gene3D:3.10.290.10 eggNOG:COG2302 HOGENOM:HOG000048758 OMA:DFLDPRE
RefSeq:YP_014653.1 ProteinModelPortal:Q71XY5 STRING:Q71XY5
GeneID:2799062 KEGG:lmf:LMOf2365_2060 PATRIC:20325415
ProtClustDB:CLSK564719 Uniprot:Q71XY5
Length = 258
Score = 233 (87.1 bits), Expect = 1.5e-19, P = 1.5e-19
Identities = 68/226 (30%), Positives = 105/226 (46%)
Query: 14 SDFLTPPVLKESMMALEKLADVKAVAQGGYPQAERCRLSVGHPEALT-SDPDI-VAALSI 71
+DFL P + + ++ GG AER R + +P+ T ++ D +A I
Sbjct: 35 TDFLDPRQRFITETVIGGYDEINVQFFGGVAHAERRRALI-YPDYYTPTEADFEIALFHI 93
Query: 72 TGNFGFQPCSHGDFLGSILGTGIAREKIGDIILQGEKGAQFLVVPELADYLITSLEKVGN 131
F +H LG+++ G+ R+ GDI+ G + Q LV + DYL LEK+G
Sbjct: 94 RYPVKFTTLTHQKILGTLMSLGMKRDIFGDILNNGTEW-QLLVESTMKDYLTLQLEKIGK 152
Query: 132 VSVSCTRIPLLALEYEPPRTKSFKTIEASLRVDALASAGFKLSRSKLVNLISNGDVRVNW 191
V+V L Y P + +S+R+D + S +SR K L++ G V+VNW
Sbjct: 153 VNVMLEETDLANAVYAPVVWEEMGLTVSSMRLDVIISNAHHISRQKAKQLVTAGLVKVNW 212
Query: 192 TTVTKNGTTLRTGDIVSVSGKGRIKIGEINS-TRKGKFAVELIQYL 236
TV D++S G GR+K+ T+K K +E I YL
Sbjct: 213 KTVENPDFECEEEDVLSARGYGRVKVLSTGGRTKKDKIRIE-IGYL 257
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.317   0.135   0.378    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
  Query
  Frame  MatID  Length  Eff.Length     E     S  W   T   X   E2     S2
   +0      0      236       236   0.00089  113  3  11  22  0.43    33
                                                       32  0.43    36
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 7
No. of states in DFA: 580 (62 KB)
Total size of DFA: 149 KB (2090 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 19.41u 0.10s 19.51t Elapsed: 00:00:01
Total cpu time: 19.41u 0.10s 19.51t Elapsed: 00:00:01
Start: Fri May 10 20:18:01 2013 End: Fri May 10 20:18:02 2013
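
Note on the bit scores: each alignment above reports a raw score followed by a value in bits, for example "Score = 1010 (360.6 bits)". A minimal sketch of how the two appear to be related is given below, using the standard Karlin-Altschul normalisation, bits = (Lambda * S - ln K) / ln 2. It assumes that the gapped "Q=9,R=2" row of the Parameters table (Lambda = 0.244, K = 0.0300) is the one applied; that assumption is not stated in the report, although it is consistent with the values printed. The names bit_score, GAPPED_LAMBDA, GAPPED_K and reported are illustrative and are not part of BLAST.

```python
import math

# Gapped Karlin-Altschul parameters, copied from the "Q=9,R=2" row of the
# Parameters table in this report. Assumption: this is the row used to
# derive the bit scores printed next to each raw score.
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores and the bit scores printed in this report for the distinct hits.
reported = {1010: 360.6, 561: 202.5, 360: 131.8, 246: 91.7, 233: 87.1}

for raw, bits in reported.items():
    print(f"raw={raw:5d}  computed={bit_score(raw):6.1f}  reported={bits:6.1f}")
```

Because Lambda and K are printed to only three significant figures, the computed values should be expected to agree with the reported ones only to about the first decimal place.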