Your job contains 1 sequence.
>043788
MKAHAPSQRPTPAFISKPVRRHQSAHDISPRHRFNLIHSKRSSHVTTLSLNKNVPPQSAE
FSRRHVFLSPLIAVGASILLQSATASADETQPSPPAQPTTSPVPQNPETVKAEEVVVSRI
YDATVIGEPLAVGKDKRKVWEKLMNARVVYLGEAEQVPVRDDRELELQIVKNLRKRCVES
ERTITLALEAFPSDLQDQLNQYTDKRIDGETLKSYASHWPPQRWQEYEPLLSYCRDNGVQ
LLACGTPLKVLRTVQAEGIHGLSKADRKLYAPPAGSGFISGFTSISHRSSVDMNSLTQSV
PFGPSSYLSAQARVVEDYAMSQIILKAIMDGGANGMLVVVTGASHVTYGSRGTGLPARIS
KKLQKKNQVVILLDLKGNIFEEREKFL
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 043788
(387 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
TAIR|locus:2078446 - symbol:AT3G56140 "AT3G56140" species... 1094 8.7e-111 1
TAIR|locus:2063136 - symbol:AT2G40400 species:3702 "Arabi... 1088 3.8e-110 1
UNIPROTKB|Q747X6 - symbol:GSU3139 "Uncharacterized protei... 171 1.1e-10 1
TIGR_CMR|GSU_3139 - symbol:GSU_3139 "conserved hypothetic... 171 1.1e-10 1
>TAIR|locus:2078446 [details] [associations]
symbol:AT3G56140 "AT3G56140" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
"biological_process" evidence=ND] [GO:0009507 "chloroplast"
evidence=ISM;IDA] [GO:0009543 "chloroplast thylakoid lumen"
evidence=ISS] GO:GO:0009507 EMBL:CP002686 InterPro:IPR021825
Pfam:PF11891 EMBL:AY093111 EMBL:BT010340 EMBL:AK227110
IPI:IPI00520041 RefSeq:NP_191173.2 UniGene:At.34962
ProteinModelPortal:Q8RWG3 STRING:Q8RWG3 PaxDb:Q8RWG3 PRIDE:Q8RWG3
EnsemblPlants:AT3G56140.1 GeneID:824780 KEGG:ath:AT3G56140
TAIR:At3g56140 eggNOG:NOG242091 HOGENOM:HOG000082965
InParanoid:Q8RWG3 OMA:RRKENFF PhylomeDB:Q8RWG3
ProtClustDB:CLSN2688835 ArrayExpress:Q8RWG3 Genevestigator:Q8RWG3
InterPro:IPR007314 Pfam:PF04187 Uniprot:Q8RWG3
Length = 745
Score = 1094 (390.2 bits), Expect = 8.7e-111, P = 8.7e-111
Identities = 219/351 (62%), Positives = 259/351 (73%)
Query: 35 NLIHSKRSSHVTTLSLNKNVPPQSAEFSRRHVFLSP-LIAVGASILLQSATASADEXXXX 93
NL K +S ++ ++L+ + P FSRR L+P L+ AS+ L+ + + A E
Sbjct: 38 NLTSEKNNS-LSIVALSDSDLPSRTAFSRRAFLLAPPLLVSAASLFLKPSVSLASEESSS 96
Query: 94 XXXXXXXXXXX----------XNPETVKAEEVVVSRIYDATVIGEPLAVGKDKRKVWEKL 143
P V EE + SRIYDAT IGEP+A+GKDK+KVWEKL
Sbjct: 97 ATVTSPAESAAPPPPPATTTPSPPPPVNKEETITSRIYDATAIGEPMAMGKDKKKVWEKL 156
Query: 144 MNARVVYLGEAEQVPVRDDRELELQIVKNLRKRCVESERTITLALEAFPSDLQDQLNQYT 203
+NARVVYLGEAEQVP +DD+ELEL+IV+NLRKRCVESER I++ALEAFP DLQDQLNQY
Sbjct: 157 LNARVVYLGEAEQVPTKDDKELELEIVRNLRKRCVESERQISVALEAFPLDLQDQLNQYM 216
Query: 204 DKRIDGETLKSYASHWPPQRWQEYEPLLSYCRDNGVQLLACGTPLKVLRTVQAEGIHGLS 263
DKR+DGETLKSY +HWP QRWQEYEPLLSYCRDN V+L+ACGTPLKVLRTVQAEGI GLS
Sbjct: 217 DKRMDGETLKSYVTHWPAQRWQEYEPLLSYCRDNSVRLIACGTPLKVLRTVQAEGIRGLS 276
Query: 264 KADRKLYAPPAXXXXXXXXXXXXHRSSVDMNSLTQSVPFGPSSYLSAQARVVEDYAMSQI 323
K++RKLY PPA RS+ DM+ TQ VPFGPSSYLSAQARVVED+ MSQ+
Sbjct: 277 KSERKLYTPPAGSGFISGFSSFSRRSTFDMSLPTQIVPFGPSSYLSAQARVVEDHTMSQV 336
Query: 324 ILKAIMDGGANGMLVVVTGASHVTYGSRGTGLPARISKKLQKKNQVVILLD 374
IL+A+ DGG G+L+VVTGASHV YGSRGTGLPARIS+K KKNQVV+LLD
Sbjct: 337 ILQAVADGGGTGLLLVVTGASHVEYGSRGTGLPARISRKFPKKNQVVVLLD 387
>TAIR|locus:2063136 [details] [associations]
symbol:AT2G40400 species:3702 "Arabidopsis thaliana"
[GO:0009507 "chloroplast" evidence=ISM] [GO:0009543 "chloroplast
thylakoid lumen" evidence=ISS] [GO:0015995 "chlorophyll
biosynthetic process" evidence=RCA] EMBL:CP002685 EMBL:AC007020
InterPro:IPR021825 Pfam:PF11891 ProtClustDB:CLSN2688835
InterPro:IPR007314 Pfam:PF04187 EMBL:AF410285 EMBL:AY102131
IPI:IPI00518341 PIR:A84829 RefSeq:NP_565930.1 RefSeq:NP_850329.1
UniGene:At.14284 ProteinModelPortal:Q9SIY5 STRING:Q9SIY5
PRIDE:Q9SIY5 EnsemblPlants:AT2G40400.1 EnsemblPlants:AT2G40400.2
GeneID:818633 KEGG:ath:AT2G40400 TAIR:At2g40400 InParanoid:Q9SIY5
OMA:IEHRISD PhylomeDB:Q9SIY5 ArrayExpress:Q9SIY5
Genevestigator:Q9SIY5 Uniprot:Q9SIY5
Length = 735
Score = 1088 (388.1 bits), Expect = 3.8e-110, P = 3.8e-110
Identities = 211/334 (63%), Positives = 259/334 (77%)
Query: 44 HVTTLSLNK--NVPPQSAEFSRRHVFLSP-LIAVGASILLQSATASADEXXXXXXXXXXX 100
+V TL L+ NV +RR + ++P L+A AS+ L ++A++ E
Sbjct: 46 NVVTLCLHSHSNVSSSQIAVTRRAILVAPPLLAAAASLFLSISSAASAETSAESVALPPV 105
Query: 101 XXXXXNPETVKAEEVVVSRIYDATVIGEPLAVGKDKRKVWEKLMNARVVYLGEAEQVPVR 160
P V+ EE + SRIYDA+V+GEP+AVGKDK++VWEKL+NAR+VYLGEAEQVP R
Sbjct: 106 ATAPP-PPPVEKEEAITSRIYDASVLGEPMAVGKDKKRVWEKLLNARIVYLGEAEQVPTR 164
Query: 161 DDRELELQIVKNLRKRCVESERTITLALEAFPSDLQDQLNQYTDKRIDGETLKSYASHWP 220
DD+ LEL+IV+NLRKRC+ES+R ++LALEAFP DLQ+QLNQY DKR+DGE LKSY SHWP
Sbjct: 165 DDKVLELEIVRNLRKRCIESDRQLSLALEAFPLDLQEQLNQYMDKRMDGEVLKSYVSHWP 224
Query: 221 PQRWQEYEPLLSYCRDNGVQLLACGTPLKVLRTVQAEGIHGLSKADRKLYAPPAXXXXXX 280
QRWQEYEPLLSYCRDNGV+L+ACGTPLKVLRTVQAEGI GLS+++RKLY PPA
Sbjct: 225 VQRWQEYEPLLSYCRDNGVKLIACGTPLKVLRTVQAEGIRGLSESERKLYTPPAGSGFIS 284
Query: 281 XXXXXXHRSSVDMNSLTQSVPFGPSSYLSAQARVVEDYAMSQIILKAIMDGGANGMLVVV 340
SS++MN LTQ VPFGPSSYLSAQARVVED+ MSQ+I++A+ DGG GMLVVV
Sbjct: 285 GFTSFSRSSSLNMNPLTQIVPFGPSSYLSAQARVVEDHTMSQVIVQAVADGGGTGMLVVV 344
Query: 341 TGASHVTYGSRGTGLPARISKKLQKKNQVVILLD 374
TGA+HV YGSRGTGLPARIS+K+ KK+Q+V+LLD
Sbjct: 345 TGANHVEYGSRGTGLPARISRKIPKKSQLVVLLD 378
>UNIPROTKB|Q747X6 [details] [associations]
symbol:GSU3139 "Uncharacterized protein" species:243231
"Geobacter sulfurreducens PCA" [GO:0003674 "molecular_function"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
[GO:0008150 "biological_process" evidence=ND] EMBL:AE017180
GenomeReviews:AE017180_GR InterPro:IPR007314 Pfam:PF04187
InterPro:IPR016773 PIRSF:PIRSF020419 RefSeq:NP_954180.1
GeneID:2688434 KEGG:gsu:GSU3139 PATRIC:22029137
HOGENOM:HOG000012442 OMA:NTHTEFL ProtClustDB:CLSK829132
BioCyc:GSUL243231:GH27-3157-MONOMER Uniprot:Q747X6
Length = 282
Score = 171 (65.3 bits), Expect = 1.1e-10, P = 1.1e-10
Identities = 58/248 (23%), Positives = 117/248 (47%)
Query: 134 KDKRKV-WEKLMN----ARVVYLGEAEQVPVRDDRELELQIVKNLRKRCVESERTITLAL 188
KD++++ +E+++ +V+Y+GE P D L+L+IV+ L + V + +A+
Sbjct: 34 KDRKEISFEEMLRDLKAGKVIYVGETHDNPYHHD--LQLRIVRELHRAGVP----LAIAM 87
Query: 189 EAFPSDLQDQLNQYTDKRIDGETLKS-YASHWP-PQRWQEYEPLLSYCRDNGVQLLACGT 246
E F + Q++L+++ + D + Y +W P W Y +L + RD + L+
Sbjct: 88 EMFTYESQEELDRWVAGKTDPALFQQIYLKNWNFP--WALYGDILLFARDRRIPLVGLNV 145
Query: 247 PLKVLRTVQAEGIHGLSKADRKLYAPPAXXXXXXXXXXXXHRSSVDMNSLTQSVPFGPSS 306
P +V R V +G LS+ +R+ P RS D + T + F +
Sbjct: 146 PREVTRKVARQGFESLSREERRKLPPSITCDVDDAYMAMIRRSYSDHD--TSAKTF--KN 201
Query: 307 YLSAQARVVEDYAMSQIILKAIMDGGANGMLVVVTGASHVTYGSRGTGLPARISKKLQKK 366
+ AQ ++ + +M+ +++ + + +VV+TG+ H G G+P ++ ++
Sbjct: 202 FCEAQ--MLWNKSMAYHLVEYLKNNPGR-TVVVITGSGHAVRG----GMPVQVDREKPGL 254
Query: 367 NQVVILLD 374
V+L D
Sbjct: 255 ASRVVLPD 262
>TIGR_CMR|GSU_3139 [details] [associations]
symbol:GSU_3139 "conserved hypothetical protein"
species:243231 "Geobacter sulfurreducens PCA" [GO:0003674
"molecular_function" evidence=ND] [GO:0005575 "cellular_component"
evidence=ND] [GO:0008150 "biological_process" evidence=ND]
EMBL:AE017180 GenomeReviews:AE017180_GR InterPro:IPR007314
Pfam:PF04187 InterPro:IPR016773 PIRSF:PIRSF020419
RefSeq:NP_954180.1 GeneID:2688434 KEGG:gsu:GSU3139 PATRIC:22029137
HOGENOM:HOG000012442 OMA:NTHTEFL ProtClustDB:CLSK829132
BioCyc:GSUL243231:GH27-3157-MONOMER Uniprot:Q747X6
Length = 282
Score = 171 (65.3 bits), Expect = 1.1e-10, P = 1.1e-10
Identities = 58/248 (23%), Positives = 117/248 (47%)
Query: 134 KDKRKV-WEKLMN----ARVVYLGEAEQVPVRDDRELELQIVKNLRKRCVESERTITLAL 188
KD++++ +E+++ +V+Y+GE P D L+L+IV+ L + V + +A+
Sbjct: 34 KDRKEISFEEMLRDLKAGKVIYVGETHDNPYHHD--LQLRIVRELHRAGVP----LAIAM 87
Query: 189 EAFPSDLQDQLNQYTDKRIDGETLKS-YASHWP-PQRWQEYEPLLSYCRDNGVQLLACGT 246
E F + Q++L+++ + D + Y +W P W Y +L + RD + L+
Sbjct: 88 EMFTYESQEELDRWVAGKTDPALFQQIYLKNWNFP--WALYGDILLFARDRRIPLVGLNV 145
Query: 247 PLKVLRTVQAEGIHGLSKADRKLYAPPAXXXXXXXXXXXXHRSSVDMNSLTQSVPFGPSS 306
P +V R V +G LS+ +R+ P RS D + T + F +
Sbjct: 146 PREVTRKVARQGFESLSREERRKLPPSITCDVDDAYMAMIRRSYSDHD--TSAKTF--KN 201
Query: 307 YLSAQARVVEDYAMSQIILKAIMDGGANGMLVVVTGASHVTYGSRGTGLPARISKKLQKK 366
+ AQ ++ + +M+ +++ + + +VV+TG+ H G G+P ++ ++
Sbjct: 202 FCEAQ--MLWNKSMAYHLVEYLKNNPGR-TVVVITGSGHAVRG----GMPVQVDREKPGL 254
Query: 367 NQVVILLD 374
V+L D
Sbjct: 255 ASRVVLPD 262
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.316 0.131 0.375 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 387 359 0.00081 117 3 11 22 0.41 34
34 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 4
No. of states in DFA: 610 (65 KB)
Total size of DFA: 219 KB (2121 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 27.69u 0.10s 27.79t Elapsed: 00:00:01
Total cpu time: 27.69u 0.10s 27.79t Elapsed: 00:00:01
Start: Fri May 10 18:10:16 2013 End: Fri May 10 18:10:17 2013