Your job contains 1 sequence.
>023545
MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG
PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK
AVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA
NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG
CGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 023545
(281 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Sequences producing High-scoring Segment Pairs:  High Score  Smallest Sum Probability P(N)  N
TAIR|locus:2202033 - symbol:AT1G12250 "AT1G12250" species... 795 4.2e-79 1
TAIR|locus:2055023 - symbol:AT2G44920 "AT2G44920" species... 157 1.1e-10 1
UNIPROTKB|Q0BZZ2 - symbol:HNE_2256 "Pentapeptide repeat d... 125 5.9e-06 1
UNIPROTKB|Q74B06 - symbol:GSU2404 "Pentapeptide repeat do... 122 1.5e-05 1
TIGR_CMR|GSU_2404 - symbol:GSU_2404 "pentapeptide repeat ... 122 1.5e-05 1
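
The summary rows above can be read programmatically. The following is a minimal sketch, assuming each row ends with three whitespace-separated fields (Score, P(N), N) as it does in this report; the `Hit` type and `parse_hit_line` function are hypothetical names introduced for illustration and are not part of BLAST.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of the high-scoring segment pair summary (illustrative type)."""
    description: str
    score: int
    p_value: float
    n: int


def parse_hit_line(line: str) -> Hit:
    # Assumption: the last three whitespace-separated fields are
    # Score, P(N) and N; everything before them is the description,
    # which BLAST may have truncated with "...".
    description, score, p_value, n = line.rsplit(None, 3)
    return Hit(description.strip(), int(score), float(p_value), int(n))


hit = parse_hit_line(
    'TAIR|locus:2202033 - symbol:AT1G12250 "AT1G12250" species... 795 4.2e-79 1'
)
print(hit.score, hit.p_value, hit.n)
```

Splitting from the right keeps the parser independent of spaces inside the description, which vary from hit to hit.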
>TAIR|locus:2202033
symbol:AT1G12250 "AT1G12250" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005576
"extracellular region" evidence=ISM] [GO:0008150
"biological_process" evidence=ND] [GO:0009507 "chloroplast"
evidence=ISM;IDA] [GO:0009543 "chloroplast thylakoid lumen"
evidence=ISS] [GO:0009535 "chloroplast thylakoid membrane"
evidence=IDA] [GO:0009579 "thylakoid" evidence=IDA] [GO:0009534
"chloroplast thylakoid" evidence=IDA] EMBL:CP002684
GenomeReviews:CT485782_GR GO:GO:0009535 GO:GO:0009543 EMBL:AC022522
eggNOG:COG1357 EMBL:AY142640 EMBL:AY035122 IPI:IPI00519055
PIR:F86257 RefSeq:NP_563902.1 UniGene:At.19174
ProteinModelPortal:Q8H1Q1 SMR:Q8H1Q1 IntAct:Q8H1Q1 STRING:Q8H1Q1
PaxDb:Q8H1Q1 PRIDE:Q8H1Q1 ProMEX:Q8H1Q1 EnsemblPlants:AT1G12250.1
GeneID:837778 KEGG:ath:AT1G12250 TAIR:At1g12250
HOGENOM:HOG000239303 InParanoid:Q8H1Q1 OMA:GAYLEKA PhylomeDB:Q8H1Q1
ProtClustDB:CLSN2687778 Genevestigator:Q8H1Q1 Uniprot:Q8H1Q1
Length = 280
Score = 795 (284.9 bits), Expect = 4.2e-79, P = 4.2e-79
Identities = 171/267 (64%), Positives = 189/267 (70%)
Query: 18 SSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--CAGPYAKLKNWRXXXXXX 75
SS S+ PY H + L Q+SS+ S+ + D SN + C A+ W+
Sbjct: 21 SSVSRSPY--H-FQRYLLRRLQLSSR--SNLEIKDSSNTREGCCSS-AESNTWKRILSAA 74
Query: 76 XXXXXXXXXXXXXXXLADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTS 134
+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH ENFR ANFTS
Sbjct: 75 MAAAVIASSSGVPA-MAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTS 133
Query: 135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEANLTNAVLVR+VLTR
Sbjct: 134 ADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVRSVLTR 193
Query: 195 SDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGXXXX 254
SDLGGA IEGADFSDAVIDL QKQALCKYA GTNP+TGV TRKSLGCGNSRRNAYG
Sbjct: 194 SDLGGAKIEGADFSDAVIDLLQKQALCKYATGTNPLTGVDTRKSLGCGNSRRNAYGSPSS 253
Query: 255 XXXXXXXXXXXDRDGFCDSGTGLCDAK 281
RDGFCD TGLCD K
Sbjct: 254 PLLSAPPQRLLGRDGFCDEKTGLCDVK 280
>TAIR|locus:2055023
symbol:AT2G44920 "AT2G44920" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0009507
"chloroplast" evidence=ISM;IDA] [GO:0009543 "chloroplast thylakoid
lumen" evidence=IDA] [GO:0031977 "thylakoid lumen" evidence=IDA]
[GO:0009579 "thylakoid" evidence=IDA] [GO:0009535 "chloroplast
thylakoid membrane" evidence=IDA] [GO:0005515 "protein binding"
evidence=IPI] [GO:0009534 "chloroplast thylakoid" evidence=IDA]
[GO:0006098 "pentose-phosphate shunt" evidence=RCA] [GO:0006636
"unsaturated fatty acid biosynthetic process" evidence=RCA]
[GO:0009409 "response to cold" evidence=RCA] [GO:0015979
"photosynthesis" evidence=RCA] [GO:0015995 "chlorophyll
biosynthetic process" evidence=RCA] [GO:0016117 "carotenoid
biosynthetic process" evidence=RCA] [GO:0019288 "isopentenyl
diphosphate biosynthetic process, mevalonate-independent pathway"
evidence=RCA] [GO:0042742 "defense response to bacterium"
evidence=RCA] [GO:0043085 "positive regulation of catalytic
activity" evidence=RCA] Pfam:PF00805 EMBL:CP002685
GenomeReviews:CT485783_GR EMBL:AC002388 GO:GO:0009535 GO:GO:0009543
eggNOG:COG1357 InterPro:IPR001646 EMBL:AY050941 EMBL:BT000902
IPI:IPI00534350 IPI:IPI00535173 PIR:T00401 RefSeq:NP_566030.1
RefSeq:NP_566031.1 UniGene:At.12323 PDB:3N90 PDBsum:3N90
ProteinModelPortal:O22160 SMR:O22160 IntAct:O22160 STRING:O22160
PaxDb:O22160 PRIDE:O22160 ProMEX:O22160 EnsemblPlants:AT2G44920.2
GeneID:819101 KEGG:ath:AT2G44920 TAIR:At2g44920
HOGENOM:HOG000232693 InParanoid:O22160 OMA:FLKYFLC PhylomeDB:O22160
ProtClustDB:CLSN2688933 Genevestigator:O22160 GermOnline:AT2G44920
Uniprot:O22160
Length = 224
Score = 157 (60.3 bits), Expect = 1.1e-10, P = 1.1e-10
Identities = 45/119 (37%), Positives = 67/119 (56%)
Query: 129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 183
R +F ++ +R+++F G+K GA + A+ TGADLS+ + D + N + NLT
Sbjct: 110 RQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGADFSLANVTKVNLT 164
Query: 184 NAVLV-RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
NA L TV + G+ I GADF+D + Q+ LCK A+G N TG +TR +L C
Sbjct: 165 NANLEGATVTGNTSFKGSNITGADFTDVPLRDDQRVYLCKVADGVNATTGNATRDTLLC 223
>UNIPROTKB|Q0BZZ2
symbol:HNE_2256 "Pentapeptide repeat domain protein"
species:228405 "Hyphomonas neptunium ATCC 15444" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] Pfam:PF00805 EMBL:CP000158 GenomeReviews:CP000158_GR
eggNOG:COG1357 InterPro:IPR001646 HOGENOM:HOG000148292
RefSeq:YP_760951.1 ProteinModelPortal:Q0BZZ2 STRING:Q0BZZ2
GeneID:4289822 KEGG:hne:HNE_2256 PATRIC:32217357 OMA:GRADFDK
BioCyc:HNEP228405:GI69-2278-MONOMER Uniprot:Q0BZZ2
Length = 245
Score = 125 (49.1 bits), Expect = 5.9e-06, P = 5.9e-06
Identities = 41/107 (38%), Positives = 56/107 (52%)
Query: 110 AAQFGSADLRKAVHVKENFR-ANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFT 163
+A ADLR A F A F +A M++ +DFS ++ GA LEKA NF
Sbjct: 82 SANVTGADLRGADLTSARFADATFNNARMQDVLASGADFSRARLQGANLEKARLIGVNFE 141
Query: 164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
GA L L R L A+L+ A T+L R++L G I +GA+ S+A
Sbjct: 142 GASL---LFAR--LETADLSGANCTGTILDRANLRGTIFDGANLSEA 183
>UNIPROTKB|Q74B06
symbol:GSU2404 "Pentapeptide repeat domain protein"
species:243231 "Geobacter sulfurreducens PCA" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] Pfam:PF00805 EMBL:AE017180 GenomeReviews:AE017180_GR
InterPro:IPR001646 HOGENOM:HOG000148292 RefSeq:NP_953450.1
ProteinModelPortal:Q74B06 GeneID:2686536 KEGG:gsu:GSU2404
PATRIC:22027655 OMA:QAWALEA ProtClustDB:CLSK743141
BioCyc:GSUL243231:GH27-2385-MONOMER Uniprot:Q74B06
Length = 254
Score = 122 (48.0 bits), Expect = 1.5e-05, P = 1.5e-05
Identities = 38/106 (35%), Positives = 57/106 (53%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A AD+RK V+V+ + NF+ A++ ++FSG+K A L AV NF+ ADLS T
Sbjct: 122 ANLSGADMRK-VNVE---KGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSAT 177
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-SDAVIDLA 215
+ + L AN A T+L + L GA + + F S ++ D A
Sbjct: 178 DLGSLDLEGANFRGATFNGTLLRDAKLKGADLRQSRFHSVSIYDTA 223
Score = 119 (46.9 bits), Expect = 3.5e-05, P = 3.5e-05
Identities = 31/81 (38%), Positives = 46/81 (56%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F +ADMR + SG AY+ A AN +GAD+ +++ ++ANLTNA
Sbjct: 97 AIFDTADMRSAHCSG-----AYIHHAKFVGANLSGADMRKVNVEKGNFSQANLTNANFSG 151
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L ++LGGA++ G +FS A
Sbjct: 152 AKLKYANLGGAVLRGTNFSFA 172
>TIGR_CMR|GSU_2404
symbol:GSU_2404 "pentapeptide repeat domain protein"
species:243231 "Geobacter sulfurreducens PCA" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] Pfam:PF00805 EMBL:AE017180 GenomeReviews:AE017180_GR
InterPro:IPR001646 HOGENOM:HOG000148292 RefSeq:NP_953450.1
ProteinModelPortal:Q74B06 GeneID:2686536 KEGG:gsu:GSU2404
PATRIC:22027655 OMA:QAWALEA ProtClustDB:CLSK743141
BioCyc:GSUL243231:GH27-2385-MONOMER Uniprot:Q74B06
Length = 254
Score = 122 (48.0 bits), Expect = 1.5e-05, P = 1.5e-05
Identities = 38/106 (35%), Positives = 57/106 (53%)
Query: 111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
A AD+RK V+V+ + NF+ A++ ++FSG+K A L AV NF+ ADLS T
Sbjct: 122 ANLSGADMRK-VNVE---KGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSAT 177
Query: 171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-SDAVIDLA 215
+ + L AN A T+L + L GA + + F S ++ D A
Sbjct: 178 DLGSLDLEGANFRGATFNGTLLRDAKLKGADLRQSRFHSVSIYDTA 223
Score = 119 (46.9 bits), Expect = 3.5e-05, P = 3.5e-05
Identities = 31/81 (38%), Positives = 46/81 (56%)
Query: 130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
A F +ADMR + SG AY+ A AN +GAD+ +++ ++ANLTNA
Sbjct: 97 AIFDTADMRSAHCSG-----AYIHHAKFVGANLSGADMRKVNVEKGNFSQANLTNANFSG 151
Query: 190 TVLTRSDLGGAIIEGADFSDA 210
L ++LGGA++ G +FS A
Sbjct: 152 AKLKYANLGGAVLRGTNFSFA 172
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.316   0.130   0.386    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E      S   W   T   X    E2    S2
   +0      0      281       233     0.00086  113   3  11  22  0.44    33
                                                          32  0.43    36
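
The bit scores and P values reported in the alignments can be related to the parameters above. The following is a minimal sketch, assuming the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2, and assuming the gapped row (Q=9,R=2: Lambda = 0.244, K = 0.0300) is the one used for the reported bit scores; that choice is an inference from the numbers in this report, not something the report states. The function names are hypothetical.

```python
import math

# Assumption: gapped parameters taken from the "Q=9,R=2" row of this report.
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score: float, lam: float = GAPPED_LAMBDA, k: float = GAPPED_K) -> float:
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def p_from_expect(expect: float) -> float:
    """Poisson relation P = 1 - exp(-E); expm1 keeps precision for tiny E."""
    return -math.expm1(-expect)


for raw in (795, 157, 125, 122, 119):
    print(raw, round(bit_score(raw), 1))

print(p_from_expect(4.2e-79))
```

Under these assumptions the raw scores 795 and 157 come out at about 284.9 and 60.3 bits, which agrees with the values printed in the first two alignments. For Expect values this small, P and E are numerically indistinguishable, which is why each alignment reports the same number for both.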
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 5
No. of states in DFA: 604 (64 KB)
Total size of DFA: 179 KB (2103 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 17.29u 0.10s 17.39t Elapsed: 00:00:04
Total cpu time: 17.29u 0.10s 17.39t Elapsed: 00:00:04
Start: Fri May 10 07:24:12 2013 End: Fri May 10 07:24:16 2013