Your job contains 1 sequence.
>026796
MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG
PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK
AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE
ANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQVGSKTSYFFFNIKC
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 026796
(233 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Sequences producing High-scoring Segment Pairs:  High Score  Smallest Sum Probability P(N)  N
TAIR|locus:2202033 - symbol:AT1G12250 "AT1G12250" species... 592 1.4e-57 1
UNIPROTKB|Q74B06 - symbol:GSU2404 "Pentapeptide repeat do... 135 1.3e-07 1
TIGR_CMR|GSU_2404 - symbol:GSU_2404 "pentapeptide repeat ... 135 1.3e-07 1
UNIPROTKB|Q0BZZ2 - symbol:HNE_2256 "Pentapeptide repeat d... 132 3.2e-07 1
TAIR|locus:2160230 - symbol:FIP2 species:3702 "Arabidopsi... 112 0.00021 1
UNIPROTKB|Q2GFB5 - symbol:ECH_1083 "Pentapeptide repeat p... 115 0.00029 1
TIGR_CMR|ECH_1083 - symbol:ECH_1083 "pentapeptide repeat ... 115 0.00029 1
>TAIR|locus:2202033
symbol:AT1G12250 "AT1G12250" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005576
"extracellular region" evidence=ISM] [GO:0008150
"biological_process" evidence=ND] [GO:0009507 "chloroplast"
evidence=ISM;IDA] [GO:0009543 "chloroplast thylakoid lumen"
evidence=ISS] [GO:0009535 "chloroplast thylakoid membrane"
evidence=IDA] [GO:0009579 "thylakoid" evidence=IDA] [GO:0009534
"chloroplast thylakoid" evidence=IDA] EMBL:CP002684
GenomeReviews:CT485782_GR GO:GO:0009535 GO:GO:0009543 EMBL:AC022522
eggNOG:COG1357 EMBL:AY142640 EMBL:AY035122 IPI:IPI00519055
PIR:F86257 RefSeq:NP_563902.1 UniGene:At.19174
ProteinModelPortal:Q8H1Q1 SMR:Q8H1Q1 IntAct:Q8H1Q1 STRING:Q8H1Q1
PaxDb:Q8H1Q1 PRIDE:Q8H1Q1 ProMEX:Q8H1Q1 EnsemblPlants:AT1G12250.1
GeneID:837778 KEGG:ath:AT1G12250 TAIR:At1g12250
HOGENOM:HOG000239303 InParanoid:Q8H1Q1 OMA:GAYLEKA PhylomeDB:Q8H1Q1
ProtClustDB:CLSN2687778 Genevestigator:Q8H1Q1 Uniprot:Q8H1Q1
Length = 280
Score = 592 (213.5 bits), Expect = 1.4e-57, P = 1.4e-57
Identities = 131/204 (64%), Positives = 148/204 (72%)
Query: 18 SSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--CAGPYAKLKNWRXXXXXX 75
SS S+ PY H + L Q+SS+ S+ + D SN + C A+ W+
Sbjct: 21 SSVSRSPY--H-FQRYLLRRLQLSSR--SNLEIKDSSNTREGCCSS-AESNTWKRILSAA 74
Query: 76 XXXXXXXXXXXXXXXLADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH ENFRRANFTS
Sbjct: 75 MAAAVIASSSGVPA-MAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTS 133
Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 195
ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEANLTNAVLVR+VLTR
Sbjct: 134 ADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVRSVLTR 193
Query: 196 SDLGGAIIEGADFSDAVIDLAQKQ 219
SDLGGA IEGADFSDAVIDL QKQ
Sbjct: 194 SDLGGAKIEGADFSDAVIDLLQKQ 217
>UNIPROTKB|Q74B06
symbol:GSU2404 "Pentapeptide repeat domain protein"
species:243231 "Geobacter sulfurreducens PCA" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] Pfam:PF00805 EMBL:AE017180 GenomeReviews:AE017180_GR
InterPro:IPR001646 HOGENOM:HOG000148292 RefSeq:NP_953450.1
ProteinModelPortal:Q74B06 GeneID:2686536 KEGG:gsu:GSU2404
PATRIC:22027655 OMA:QAWALEA ProtClustDB:CLSK743141
BioCyc:GSUL243231:GH27-2385-MONOMER Uniprot:Q74B06
Length = 254
Score = 135 (52.6 bits), Expect = 1.3e-07, P = 1.3e-07
Identities = 40/113 (35%), Positives = 59/113 (52%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A F A+L A + A F +ADMR + SG AY+ A AN +GAD+
Sbjct: 76 ACNFTGANLTGAQMDGASLDEAIFDTADMRSAHCSG-----AYIHHAKFVGANLSGADMR 130
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQVGS 222
+++ ++ANLTNA L ++LGGA++ G +FS A DL+ +GS
Sbjct: 131 KVNVEKGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFA--DLSATDLGS 181
Score = 133 (51.9 bits), Expect = 2.5e-07, P = 2.5e-07
Identities = 39/112 (34%), Positives = 59/112 (52%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+F A+L A K N + NF+ A++ ++FSG+K A L AV NF+ ADLS
Sbjct: 117 AKFVGANLSGADMRKVNVEKGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSA 176
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-SDAVIDLAQKQVG 221
T + + L AN A T+L + L GA + + F S ++ D A ++G
Sbjct: 177 TDLGSLDLEGANFRGATFNGTLLRDAKLKGADLRQSRFHSVSIYDTATNRLG 228
Score = 109 (43.4 bits), Expect = 0.00034, P = 0.00034
Identities = 39/130 (30%), Positives = 59/130 (45%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQ A L +A+ + R A+ + A + + F G+ +GA + K K NF+ A+L
Sbjct: 85 TGAQMDGASLDEAIFDTADMRSAHCSGAYIHHAKFVGANLSGADMRKVNVEKGNFSQANL 144
Query: 169 SDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADFSDAVI------DLAQ 217
++ L ANL AVL T L+ +DLG +EGA+F A D
Sbjct: 145 TNANFSGAKLKYANLGGAVLRGTNFSFADLSATDLGSLDLEGANFRGATFNGTLLRDAKL 204
Query: 218 KQVGSKTSYF 227
K + S F
Sbjct: 205 KGADLRQSRF 214
>TIGR_CMR|GSU_2404
symbol:GSU_2404 "pentapeptide repeat domain protein"
species:243231 "Geobacter sulfurreducens PCA" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] Pfam:PF00805 EMBL:AE017180 GenomeReviews:AE017180_GR
InterPro:IPR001646 HOGENOM:HOG000148292 RefSeq:NP_953450.1
ProteinModelPortal:Q74B06 GeneID:2686536 KEGG:gsu:GSU2404
PATRIC:22027655 OMA:QAWALEA ProtClustDB:CLSK743141
BioCyc:GSUL243231:GH27-2385-MONOMER Uniprot:Q74B06
Length = 254
Score = 135 (52.6 bits), Expect = 1.3e-07, P = 1.3e-07
Identities = 40/113 (35%), Positives = 59/113 (52%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLS 169
A F A+L A + A F +ADMR + SG AY+ A AN +GAD+
Sbjct: 76 ACNFTGANLTGAQMDGASLDEAIFDTADMRSAHCSG-----AYIHHAKFVGANLSGADMR 130
Query: 170 DTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQVGS 222
+++ ++ANLTNA L ++LGGA++ G +FS A DL+ +GS
Sbjct: 131 KVNVEKGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFA--DLSATDLGS 181
Score = 133 (51.9 bits), Expect = 2.5e-07, P = 2.5e-07
Identities = 39/112 (34%), Positives = 59/112 (52%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+F A+L A K N + NF+ A++ ++FSG+K A L AV NF+ ADLS
Sbjct: 117 AKFVGANLSGADMRKVNVEKGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSA 176
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-SDAVIDLAQKQVG 221
T + + L AN A T+L + L GA + + F S ++ D A ++G
Sbjct: 177 TDLGSLDLEGANFRGATFNGTLLRDAKLKGADLRQSRFHSVSIYDTATNRLG 228
Score = 109 (43.4 bits), Expect = 0.00034, P = 0.00034
Identities = 39/130 (30%), Positives = 59/130 (45%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ AQ A L +A+ + R A+ + A + + F G+ +GA + K K NF+ A+L
Sbjct: 85 TGAQMDGASLDEAIFDTADMRSAHCSGAYIHHAKFVGANLSGADMRKVNVEKGNFSQANL 144
Query: 169 SDTLMDRMVLNEANLTNAVLVRTV-----LTRSDLGGAIIEGADFSDAVI------DLAQ 217
++ L ANL AVL T L+ +DLG +EGA+F A D
Sbjct: 145 TNANFSGAKLKYANLGGAVLRGTNFSFADLSATDLGSLDLEGANFRGATFNGTLLRDAKL 204
Query: 218 KQVGSKTSYF 227
K + S F
Sbjct: 205 KGADLRQSRF 214
>UNIPROTKB|Q0BZZ2
symbol:HNE_2256 "Pentapeptide repeat domain protein"
species:228405 "Hyphomonas neptunium ATCC 15444" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] Pfam:PF00805 EMBL:CP000158 GenomeReviews:CP000158_GR
eggNOG:COG1357 InterPro:IPR001646 HOGENOM:HOG000148292
RefSeq:YP_760951.1 ProteinModelPortal:Q0BZZ2 STRING:Q0BZZ2
GeneID:4289822 KEGG:hne:HNE_2256 PATRIC:32217357 OMA:GRADFDK
BioCyc:HNEP228405:GI69-2278-MONOMER Uniprot:Q0BZZ2
Length = 245
Score = 132 (51.5 bits), Expect = 3.2e-07, P = 3.2e-07
Identities = 41/107 (38%), Positives = 56/107 (52%)
Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFT 164
+A ADLR A F A F +A M++ +DFS ++ GA LEKA NF
Sbjct: 82 SANVTGADLRGADLTSARFADATFNNARMQDVLASGADFSRARLQGANLEKARLIGVNFE 141
Query: 165 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
GA L L R L A+L+ A T+L R++L G I +GA+ S+A
Sbjct: 142 GASL---LFAR--LETADLSGANCTGTILDRANLRGTIFDGANLSEA 183
Score = 110 (43.8 bits), Expect = 0.00023, P = 0.00023
Identities = 40/131 (30%), Positives = 56/131 (42%)
Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADL 168
+ A ADL A F A +DFS ++ GA LEKA NF GA L
Sbjct: 86 TGADLRGADLTSARFADATFNNARMQDVLASGADFSRARLQGANLEKARLIGVNFEGASL 145
Query: 169 SDTLMDRMVLNEANLTNAVLVR-----TV-----LTRSDLGGAIIEGADFSDAVIDLAQK 218
++ L+ AN T +L R T+ L+ + GA + GA F DA + A
Sbjct: 146 LFARLETADLSGANCTGTILDRANLRGTIFDGANLSEASFNGADLSGASFRDARLPGADL 205
Query: 219 Q--VGSKTSYF 227
GS+++ F
Sbjct: 206 SGVTGSESADF 216
>TAIR|locus:2160230
symbol:FIP2 species:3702 "Arabidopsis thaliana"
[GO:0005249 "voltage-gated potassium channel activity"
evidence=IEA;ISS] [GO:0005576 "extracellular region" evidence=ISM]
[GO:0005634 "nucleus" evidence=ISM] [GO:0006813 "potassium ion
transport" evidence=IEA;ISS] [GO:0008076 "voltage-gated potassium
channel complex" evidence=IEA;ISS] [GO:0016020 "membrane"
evidence=IEA;ISS] [GO:0005515 "protein binding" evidence=IPI]
InterPro:IPR000210 InterPro:IPR003131 Pfam:PF02214 PROSITE:PS50097
SMART:SM00225 UniPathway:UPA00143 Pfam:PF00805 EMBL:CP002688
GenomeReviews:BA000015_GR GO:GO:0051260 GO:GO:0016567
Gene3D:3.30.710.10 InterPro:IPR011333 SUPFAM:SSF54695 EMBL:AB005232
EMBL:AF174429 EMBL:AB017059 EMBL:BT030093 IPI:IPI00541675
RefSeq:NP_200311.1 UniGene:At.11758 ProteinModelPortal:Q9SE95
SMR:Q9SE95 PaxDb:Q9SE95 PRIDE:Q9SE95 EnsemblPlants:AT5G55000.2
GeneID:835591 KEGG:ath:AT5G55000 TAIR:At5g55000 eggNOG:COG1357
HOGENOM:HOG000010269 InParanoid:Q9SE95 OMA:YFTTTRI PhylomeDB:Q9SE95
ProtClustDB:CLSN2686964 Genevestigator:Q9SE95 InterPro:IPR001646
Uniprot:Q9SE95
Length = 298
Score = 112 (44.5 bits), Expect = 0.00021, P = 0.00021
Identities = 33/100 (33%), Positives = 47/100 (47%)
Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 171
+F SA+LR A+ N + AN A + F G+ A+L+ A AN GA+L
Sbjct: 192 EFTSANLRGALLAGTNLQSANLQDACLVGCSFCGADLRTAHLQNADLTNANLEGANLEGA 251
Query: 172 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 211
+ L+ AN A L R L +L A +EGA+ A
Sbjct: 252 NLKGAKLSNANFKGANLQRAYLRHVNLREAHMEGANLGGA 291
Score = 109 (43.4 bits), Expect = 0.00049, P = 0.00049
Identities = 31/103 (30%), Positives = 47/103 (45%)
Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170
A+F +AD ++ R FTSA++R + +G+ A L+ A +F GADL
Sbjct: 171 AKFRNADAEGSIFHNAILRECEFTSANLRGALLAGTNLQSANLQDACLVGCSFCGADLRT 230
Query: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVI 213
+ L ANL A L L + L A +GA+ A +
Sbjct: 231 AHLQNADLTNANLEGANLEGANLKGAKLSNANFKGANLQRAYL 273
>UNIPROTKB|Q2GFB5
symbol:ECH_1083 "Pentapeptide repeat protein"
species:205920 "Ehrlichia chaffeensis str. Arkansas" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] Pfam:PF00805 EMBL:CP000236 GenomeReviews:CP000236_GR
eggNOG:COG1357 InterPro:IPR001646 OMA:WSNSTIE RefSeq:YP_507868.1
ProteinModelPortal:Q2GFB5 STRING:Q2GFB5 GeneID:3927854
KEGG:ech:ECH_1083 PATRIC:20577512 HOGENOM:HOG000063360
ProtClustDB:CLSK749520 BioCyc:ECHA205920:GJNR-1086-MONOMER
Uniprot:Q2GFB5
Length = 607
Score = 115 (45.5 bits), Expect = 0.00029, P = 0.00029
Identities = 40/123 (32%), Positives = 58/123 (47%)
Query: 104 EFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK-- 160
EFG S A F DLR +V N ANFT A++ S F S GA A K
Sbjct: 66 EFGNNLSGADFSDLDLRGSVFDNVNLLHANFTRANLSNSTFIDSNMQGASFINANLSKSN 125
Query: 161 --------ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
ANFT ADL T + + +N N + + + ++LT ++ G++ A+FS+
Sbjct: 126 IKNSNLNFANFTSADLQKTTITQSKINNTNFSYSDMRFSILT--EVNGSL---ANFSETE 180
Query: 213 IDL 215
+ L
Sbjct: 181 LKL 183
>TIGR_CMR|ECH_1083
symbol:ECH_1083 "pentapeptide repeat protein"
species:205920 "Ehrlichia chaffeensis str. Arkansas" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] Pfam:PF00805 EMBL:CP000236 GenomeReviews:CP000236_GR
eggNOG:COG1357 InterPro:IPR001646 OMA:WSNSTIE RefSeq:YP_507868.1
ProteinModelPortal:Q2GFB5 STRING:Q2GFB5 GeneID:3927854
KEGG:ech:ECH_1083 PATRIC:20577512 HOGENOM:HOG000063360
ProtClustDB:CLSK749520 BioCyc:ECHA205920:GJNR-1086-MONOMER
Uniprot:Q2GFB5
Length = 607
Score = 115 (45.5 bits), Expect = 0.00029, P = 0.00029
Identities = 40/123 (32%), Positives = 58/123 (47%)
Query: 104 EFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK-- 160
EFG S A F DLR +V N ANFT A++ S F S GA A K
Sbjct: 66 EFGNNLSGADFSDLDLRGSVFDNVNLLHANFTRANLSNSTFIDSNMQGASFINANLSKSN 125
Query: 161 --------ANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAV 212
ANFT ADL T + + +N N + + + ++LT ++ G++ A+FS+
Sbjct: 126 IKNSNLNFANFTSADLQKTTITQSKINNTNFSYSDMRFSILT--EVNGSL---ANFSETE 180
Query: 213 IDL 215
+ L
Sbjct: 181 LKL 183
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.317 0.130 0.380 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 233 200 0.00087 111 3 11 22 0.48 32
31 0.45 35
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 7
No. of states in DFA: 599 (64 KB)
Total size of DFA: 160 KB (2095 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 15.77u 0.09s 15.86t Elapsed: 00:00:02
Total cpu time: 15.77u 0.09s 15.86t Elapsed: 00:00:02
Start: Sat May 11 06:09:43 2013 End: Sat May 11 06:09:45 2013
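
Note on the bit scores: the values in parentheses after each raw Score appear consistent with the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, applied to the "As Used" Lambda and K on the Q=9,R=2 row of the Parameters table. The report does not state which row it uses for this conversion, so that choice is an assumption here. The sketch below is a minimal check of that reading; the function and variable names are illustrative and not part of the BLAST output.

```python
import math

# "As Used" Lambda and K copied from the Q=9,R=2 row of the Parameters table.
# Assumption: this is the row the report uses when converting scores to bits.
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores paired with the bit scores printed in the report above.
REPORTED = {
    592: 213.5,
    135: 52.6,
    133: 51.9,
    132: 51.5,
    115: 45.5,
    112: 44.5,
    110: 43.8,
    109: 43.4,
}

if __name__ == "__main__":
    for raw, bits in REPORTED.items():
        print(f"{raw:4d} -> {bit_score(raw):7.2f} bits (report: {bits})")
```

Under this assumption, each computed value matches the printed bit score to within rounding of the one-decimal figures.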