Citrus Sinensis ID: 026428
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 238 | ||||||
| 255577683 | 242 | conserved hypothetical protein [Ricinus | 1.0 | 0.983 | 0.776 | 1e-105 | |
| 225449116 | 241 | PREDICTED: uncharacterized protein LOC10 | 1.0 | 0.987 | 0.763 | 1e-103 | |
| 147767150 | 241 | hypothetical protein VITISV_022485 [Viti | 1.0 | 0.987 | 0.763 | 1e-103 | |
| 224112977 | 242 | predicted protein [Populus trichocarpa] | 0.983 | 0.966 | 0.778 | 1e-103 | |
| 118488695 | 242 | unknown [Populus trichocarpa] | 0.983 | 0.966 | 0.774 | 1e-102 | |
| 224100825 | 243 | predicted protein [Populus trichocarpa] | 0.987 | 0.967 | 0.765 | 1e-100 | |
| 118484646 | 243 | unknown [Populus trichocarpa] | 0.987 | 0.967 | 0.761 | 1e-100 | |
| 449452120 | 236 | PREDICTED: uncharacterized protein LOC10 | 0.974 | 0.983 | 0.747 | 5e-95 | |
| 356563686 | 241 | PREDICTED: transmembrane protein 70 homo | 0.987 | 0.975 | 0.709 | 5e-94 | |
| 388508974 | 251 | unknown [Lotus japonicus] | 1.0 | 0.948 | 0.685 | 2e-90 |
| >gi|255577683|ref|XP_002529718.1| conserved hypothetical protein [Ricinus communis] gi|223530820|gb|EEF32684.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 188/242 (77%), Positives = 211/242 (87%), Gaps = 4/242 (1%)
Query: 1 MVRAALVHLLRSQSKQLSSRS-FHSGYQLCRLG--ARSPTPKVNFNSAIPVAAAQRRWAS 57
M R AL+HLLRSQ++ LSS S SGY++CR G + + T K N+NSAI +AA+Q+RWAS
Sbjct: 1 MARTALLHLLRSQTRNLSSPSTVQSGYRVCRFGTWSHAHTAKTNYNSAIQIAASQKRWAS 60
Query: 58 QNSAAED-NKISIGPRRGGEAVEDEKDGSVVYYGPISSTIKKVKLLSLSTCCLSVSLGPV 116
Q ED N+ISIGPRRG EA EDEK+ VVYYGPISSTIKKVKLLSLSTCCLSVSLGPV
Sbjct: 61 QARTTEDDNRISIGPRRGKEAGEDEKETGVVYYGPISSTIKKVKLLSLSTCCLSVSLGPV 120
Query: 117 ITFMTSPDMNVIVKGAVASSVIFMSATTTAALHWFVSPYIHKLKWKPGSDSFEVEMMSWL 176
ITFMTSPDMNVI+KGAVASSVIF+SA+TTAALHWFV+PYIHKL+W+PGSDSFEVEMMSW+
Sbjct: 121 ITFMTSPDMNVILKGAVASSVIFLSASTTAALHWFVTPYIHKLRWQPGSDSFEVEMMSWM 180
Query: 177 ATYIPKTIRFADIRLPETNRPFVTFKANENFYFVDAEHCHNKALLARLTPQKATQESALK 236
ATYIPKTI+FADIR ETNRPFVTFKA+ NFYFVD EHCHNKALLARLTPQK+ ESA K
Sbjct: 181 ATYIPKTIKFADIRPAETNRPFVTFKADGNFYFVDTEHCHNKALLARLTPQKSQNESAFK 240
Query: 237 NL 238
NL
Sbjct: 241 NL 242
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|225449116|ref|XP_002277030.1| PREDICTED: uncharacterized protein LOC100266780 [Vitis vinifera] gi|296086053|emb|CBI31494.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|147767150|emb|CAN69001.1| hypothetical protein VITISV_022485 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|224112977|ref|XP_002316350.1| predicted protein [Populus trichocarpa] gi|222865390|gb|EEF02521.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|118488695|gb|ABK96158.1| unknown [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|224100825|ref|XP_002312028.1| predicted protein [Populus trichocarpa] gi|222851848|gb|EEE89395.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|118484646|gb|ABK94194.1| unknown [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|449452120|ref|XP_004143808.1| PREDICTED: uncharacterized protein LOC101213359 [Cucumis sativus] gi|449525730|ref|XP_004169869.1| PREDICTED: uncharacterized protein LOC101223596 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|356563686|ref|XP_003550092.1| PREDICTED: transmembrane protein 70 homolog, mitochondrial-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|388508974|gb|AFK42553.1| unknown [Lotus japonicus] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 238 | ||||||
| TAIR|locus:2039260 | 238 | AT2G35790 "AT2G35790" [Arabido | 0.970 | 0.970 | 0.555 | 1.2e-63 |
| TAIR|locus:2039260 AT2G35790 "AT2G35790" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 649 (233.5 bits), Expect = 1.2e-63, P = 1.2e-63
Identities = 136/245 (55%), Positives = 166/245 (67%)
Query: 1 MVRAALVHXXXXXXXXXXXXXFHSGYQ---LCRLGARSPTPKVNFNSAIPVAAAQRRWAS 57
M R+AL+H F SGY + + S PKV F +P + +QR WAS
Sbjct: 1 MGRSALIHLLRSQSRRLSSSTFTSGYHHRSIAGSWSSSVNPKVRFQ--VP-SLSQRSWAS 57
Query: 58 QNSAA-EDN---KISIGPRRGGEAVEDEKDGSVVYYGPISSTIXXXXXXXXXXXXXXXXX 113
+ ED+ KISIGP+ E EKDG VVYYGPISSTI
Sbjct: 58 FGAKTREDDDEHKISIGPQEKKE----EKDGGVVYYGPISSTIKKVKLLSLSTCCLSVSL 113
Query: 114 GPVITFMTSPDMNVIVKGAVASSVIFMSATTTAALHWFVSPYIHKLKWKPGSDSFEVEMM 173
GPVITFMTSP +NVI+KGAVAS+VIF+SA+TTAALHWFVSPY+HKL+W+PGSD+FEVEMM
Sbjct: 114 GPVITFMTSPGLNVIMKGAVASTVIFLSASTTAALHWFVSPYVHKLRWQPGSDTFEVEMM 173
Query: 174 SWLATYIPKTIRFADIRLPETNRPFVTFKANENFYFVDAEHCHNKALLARLTPQKATQES 233
+WL T+ KT++F+DIR P+T RP+V+FKA+ NFYFVDA+HC NKALLARLTP K +S
Sbjct: 174 TWLGTFSAKTLKFSDIRYPDTQRPYVSFKADGNFYFVDADHCPNKALLARLTPPKDAHDS 233
Query: 234 ALKNL 238
A KNL
Sbjct: 234 AFKNL 238
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.319 0.131 0.395 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 238 208 0.00077 112 3 11 22 0.48 32
31 0.48 35
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 1
No. of states in DFA: 604 (64 KB)
Total size of DFA: 176 KB (2102 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 17.30u 0.11s 17.41t Elapsed: 00:00:01
Total cpu time: 17.30u 0.11s 17.41t Elapsed: 00:00:01
Start: Fri May 10 13:10:03 2013 End: Fri May 10 13:10:04 2013
|
|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 238 | |||
| pfam06979 | 133 | pfam06979, DUF1301, Protein of unknown function (D | 8e-05 |
| >gnl|CDD|148541 pfam06979, DUF1301, Protein of unknown function (DUF1301) | Back alignment and domain information |
|---|
Score = 40.9 bits (96), Expect = 8e-05
Identities = 26/118 (22%), Positives = 39/118 (33%), Gaps = 9/118 (7%)
Query: 99 VKLLSLSTCCLSVSLGPVITFMT-----SPDMNVIVKGAVASSVIFMSATTTAALHWFVS 153
VK S ST ++ P I S M V G F + T LH
Sbjct: 2 VKFFSYSTSLAGLAAQPYILLEQGMKVGSLPMQVAFCGIAG----FFTFVTPLLLHLITK 57
Query: 154 PYIHKLKWKPGSDSFEVEMMSWLATYIPKTIRFADIRLPETNRPFVTFKANENFYFVD 211
Y+ +L + +D++ + D+ +PE F TF A V+
Sbjct: 58 GYVIRLYYDATTDTYTATTYNVFLQEKKTVFHQEDVTIPEVPGMFTTFYAKTKSLLVN 115
|
This family contains a number of eukaryotic proteins of unknown function that are approximately 160 residues long. Length = 133 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 238 | |||
| KOG4478 | 217 | consensus Uncharacterized membrane protein [Functi | 100.0 | |
| PF06979 | 133 | DUF1301: Protein of unknown function (DUF1301); In | 100.0 | |
| PF14640 | 170 | TMEM223: Transmembrane protein 223 | 98.05 |
| >KOG4478 consensus Uncharacterized membrane protein [Function unknown] | Back alignment and domain information |
|---|
Probab=100.00 E-value=2.3e-57 Score=393.10 Aligned_cols=211 Identities=23% Similarity=0.248 Sum_probs=189.2
Q ss_pred HHHHHHhccccccccccCCcceeeecccc---CCCCCcCcCCchhhhhhhhhhhcccccccCCceeeCCCCCCCCcccCC
Q 026428 6 LVHLLRSQSKQLSSRSFHSGYQLCRLGAR---SPTPKVNFNSAIPVAAAQRRWASQNSAAEDNKISIGPRRGGEAVEDEK 82 (238)
Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~a~~q~~~~~~~~~~~~~~is~~~~~~~~~~~~~~ 82 (238)
+.|+++.|++|+|+++..+ +||+.+. +.-+++.+|+.| -.|.+|+|-...+ +|++.
T Consensus 2 ~~h~a~~~~~r~ssstf~~---~~Rsi~~~s~~s~~~v~pq~~~---~~q~~w~S~~~~k---------------~e~dd 60 (217)
T KOG4478|consen 2 SSHYALLRPIRISSSTFQK---CVRSIGTTSKSSVVEVLPQSKI---DKQDLWHSILIHK---------------GELDD 60 (217)
T ss_pred cceeeeeccceeccccchh---hheeccccccCCcccccccccc---hhhhhhhhheecc---------------ccCCC
Confidence 5799999999999998876 8888552 233777777775 6799999977766 23333
Q ss_pred CCceEEEccCccceeEEeEEeeechhhhccccceeeeccCCCch--hHHHHHHHHHHHHHHhhhHHHHHHhhcCceeEEE
Q 026428 83 DGSVVYYGPISSTIKKVKLLSLSTCCLSVSLGPVITFMTSPDMN--VIVKGAVASSVIFMSATTTAALHWFVSPYIHKLK 160 (238)
Q Consensus 83 ~~~lVY~GpLa~~Vr~vKlFSLsSs~lsl~~~Pvl~~~~~~~~s--l~~k~av~~~v~~ft~~Tp~LLHw~tKpYV~~Ly 160 (238)
....+|+|++|++|++||+||||||++|++++|+|+.++.+..+ .+.++++|.+.+|++++||+|||||+||||++||
T Consensus 61 g~Q~~yyg~vA~~vkgVK~lSlSTsl~gv~~~Pvi~~q~~~i~~~~pim~~f~~~~g~Flv~~TpllLHfitk~Yvi~L~ 140 (217)
T KOG4478|consen 61 GSQKTYYGVVAMGVKGVKILSLSTSLAGVVMVPVLSSQLWEIAAERPIMMMFAIVAGTFLVLLTPLLLHFITKRYVIDLF 140 (217)
T ss_pred cccccccchhhhhcceeEEEEehhhhhhhhhhhhHhhhcchhhcccchhhhhHhhhhhHHHHhHHHHHHHHhccceEEEE
Confidence 33559999999999999999999999999999999998887555 7888889999999999999999999999999999
Q ss_pred EeCCCCeEEEEEEeeeeeeeeeEEeeCceecCCCCCceEEEEECCeeeeEecCCCCCHHHHhhhCCCCcchhhhhccC
Q 026428 161 WKPGSDSFEVEMMSWLATYIPKTIRFADIRLPETNRPFVTFKANENFYFVDAEHCHNKALLARLTPQKATQESALKNL 238 (238)
Q Consensus 161 ~~p~tdtyta~T~s~fl~~k~t~F~~~DV~~P~t~rpFtTF~a~gkplfV~p~~F~d~~~y~kLmgyd~~~~~~~~~~ 238 (238)
|||++|+|||+||||++++..++|+++||++||+.+||++|.|++++|||||.+|||+++|.+|||||+ .|++++|.
T Consensus 141 ~np~tdtfta~tyn~Llqk~attf~~sDV~~Pdv~~~ftsF~a~~~~lfVdp~lfpdre~y~~lmgydd-~df~l~~~ 217 (217)
T KOG4478|consen 141 YNPDTDTFTAVTYNFLLQKQATTFSASDVVDPDVAPEFTSFWAPLATLFVDPLLFPDREAYLDLMGYDD-LDFNLHIP 217 (217)
T ss_pred EcCCCceeehhhHHHHHHHHhheecccceecCCCCCceeEEEecCcceeeccccCCCHHHHHhhcCCch-hhhhccCC
Confidence 999999999999999999999999999999999999999999999999999999999999999999999 99999984
|
|
| >PF06979 DUF1301: Protein of unknown function (DUF1301); InterPro: IPR009724 This family contains a number of eukaryotic proteins of unknown function that are approximately 160 residues long | Back alignment and domain information |
|---|
| >PF14640 TMEM223: Transmembrane protein 223 | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00