Citrus Sinensis ID: 029481
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 192 | ||||||
| 118485725 | 205 | unknown [Populus trichocarpa] | 0.994 | 0.931 | 0.661 | 2e-67 | |
| 351728015 | 203 | uncharacterized protein LOC100306685 pre | 0.968 | 0.916 | 0.668 | 3e-66 | |
| 359807155 | 198 | uncharacterized protein LOC100814074 pre | 0.848 | 0.823 | 0.740 | 5e-65 | |
| 351721661 | 198 | uncharacterized protein LOC100306063 pre | 0.859 | 0.833 | 0.718 | 1e-64 | |
| 388515277 | 202 | unknown [Lotus japonicus] | 1.0 | 0.950 | 0.623 | 1e-64 | |
| 351722561 | 199 | uncharacterized protein LOC100499681 pre | 0.911 | 0.879 | 0.689 | 1e-64 | |
| 388515291 | 199 | unknown [Lotus japonicus] | 0.932 | 0.899 | 0.651 | 1e-63 | |
| 255552305 | 172 | conserved hypothetical protein [Ricinus | 0.796 | 0.889 | 0.758 | 4e-63 | |
| 357480763 | 201 | GPI-anchored protein, putative [Medicago | 0.963 | 0.920 | 0.616 | 3e-62 | |
| 388504872 | 201 | unknown [Medicago truncatula] | 0.963 | 0.920 | 0.616 | 3e-62 |
| >gi|118485725|gb|ABK94712.1| unknown [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 260 bits (664), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 135/204 (66%), Positives = 159/204 (77%), Gaps = 13/204 (6%)
Query: 1 MASPWLHLLL--LVSFLSMNLL---VKCDTDEEDTLLQGLNSYRESLNLTSLTKNKNAEC 55
MAS LL V F+ + L+ V CD D+ED LLQG+N+YR S NLT+LTKN NAEC
Sbjct: 1 MASSRFSLLFPFFVFFIILCLISHPVICDGDQEDALLQGINNYRTSFNLTTLTKNDNAEC 60
Query: 56 LADELADQFKNQPCTNSTGANTVPGTEKQLSNYPDLLAKCHLNVSNTRDGIVMPACVPNL 115
LA+E+ADQFKNQPCTN+TG+NTVPGTE Q NYP LLAKCHLNVSNTRDG VMPACVP+L
Sbjct: 61 LAEEIADQFKNQPCTNTTGSNTVPGTEPQFPNYPSLLAKCHLNVSNTRDGAVMPACVPHL 120
Query: 116 EHSLVLSNFTKSQYSDSLNDTKYKGAGIGSEDNWIVVILTTSTPAGSYVP-------YNA 168
+ SLVL+NFT++ YSD+LNDTK+ GAGIGS+ NWIVV+LTTSTP GSYV YNA
Sbjct: 121 DPSLVLTNFTRTPYSDNLNDTKFTGAGIGSDGNWIVVVLTTSTPEGSYVTSKTDGSDYNA 180
Query: 169 ASLIS-NIGLIYCLLFWLISALLI 191
A+L + N GLIY LLF LI +L +
Sbjct: 181 ANLTAKNTGLIYHLLFLLIGSLFL 204
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|351728015|ref|NP_001238716.1| uncharacterized protein LOC100306685 precursor [Glycine max] gi|255629271|gb|ACU14980.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|359807155|ref|NP_001241609.1| uncharacterized protein LOC100814074 precursor [Glycine max] gi|255640671|gb|ACU20620.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|351721661|ref|NP_001236961.1| uncharacterized protein LOC100306063 precursor [Glycine max] gi|255627423|gb|ACU14056.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|388515277|gb|AFK45700.1| unknown [Lotus japonicus] | Back alignment and taxonomy information |
|---|
| >gi|351722561|ref|NP_001238272.1| uncharacterized protein LOC100499681 precursor [Glycine max] gi|255625749|gb|ACU13219.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|388515291|gb|AFK45707.1| unknown [Lotus japonicus] | Back alignment and taxonomy information |
|---|
| >gi|255552305|ref|XP_002517197.1| conserved hypothetical protein [Ricinus communis] gi|223543832|gb|EEF45360.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|357480763|ref|XP_003610667.1| GPI-anchored protein, putative [Medicago truncatula] gi|217071106|gb|ACJ83913.1| unknown [Medicago truncatula] gi|355512002|gb|AES93625.1| GPI-anchored protein, putative [Medicago truncatula] gi|388501396|gb|AFK38764.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|388504872|gb|AFK40502.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 192 | ||||||
| TAIR|locus:505006331 | 200 | AT3G06035 "AT3G06035" [Arabido | 0.895 | 0.86 | 0.638 | 1.2e-56 | |
| TAIR|locus:2182162 | 196 | AT5G19250 "AT5G19250" [Arabido | 0.895 | 0.877 | 0.611 | 2.1e-52 | |
| TAIR|locus:2011025 | 200 | AT1G54860 "AT1G54860" [Arabido | 0.859 | 0.825 | 0.461 | 2.6e-38 | |
| TAIR|locus:2182152 | 199 | AT5G19240 "AT5G19240" [Arabido | 0.848 | 0.819 | 0.491 | 5.5e-38 | |
| TAIR|locus:2182142 | 189 | AT5G19230 "AT5G19230" [Arabido | 0.828 | 0.841 | 0.462 | 2.9e-32 |
| TAIR|locus:505006331 AT3G06035 "AT3G06035" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 583 (210.3 bits), Expect = 1.2e-56, P = 1.2e-56
Identities = 113/177 (63%), Positives = 136/177 (76%)
Query: 21 VKCDTDEEDTLLQGLNSYRESLNLTSLTKNKNAECLADELADQFKNQPCTNSTGANTVPG 80
V DTDEED LL G+NSYR + NLT L+KN+NAECLADE+ADQFKN+PCTN TG+ TVPG
Sbjct: 22 VLSDTDEEDILLTGINSYRTTQNLTILSKNENAECLADEIADQFKNKPCTNDTGSATVPG 81
Query: 81 TEKQLSNYPDLLAKCHLNVSNTRDGIVMPACVPNLEHSLVLSNFTKSQYSDSLNDTKYKG 140
TE Q +NYP +LAKCHLNVS+TRDG +MPACVP LE +LVL+NFTKSQYS SLND+K+ G
Sbjct: 82 TEPQFANYPQILAKCHLNVSDTRDGSIMPACVPRLESNLVLTNFTKSQYSMSLNDSKFTG 141
Query: 141 AGIGSEDNWIVVILTTSTPAGSYVPYNAASLISN-----IGLIYCLLFWLISALLIF 192
GIG ED+WIVV+LTT+TP GSY SN IGL+ L+ ++ S+ F
Sbjct: 142 IGIGKEDDWIVVVLTTNTPEGSYSTATPTKQESNGFTFGIGLVSYLVIFMYSSFCFF 198
|
|
| TAIR|locus:2182162 AT5G19250 "AT5G19250" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2011025 AT1G54860 "AT1G54860" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2182152 AT5G19240 "AT5G19240" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2182142 AT5G19230 "AT5G19230" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| gw1.VIII.44.1 | SubName- Full=Putative uncharacterized protein; (161 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 192 | |||
| cd05379 | 122 | SCP_bacterial SCP_bacterial: SCP-like extracellula | 97.72 | |
| TIGR02909 | 127 | spore_YkwD uncharacterized protein, YkwD family. M | 97.27 | |
| PF00188 | 124 | CAP: Cysteine-rich secretory protein family; Inter | 90.69 | |
| cd05384 | 129 | SCP_PRY1_like SCP_PRY1_like: SCP-like extracellula | 85.51 | |
| cd05381 | 136 | SCP_PR-1_like SCP_PR-1_like: SCP-like extracellula | 85.2 | |
| cd00168 | 122 | SCP SCP: SCP-like extracellular protein domain, fo | 83.31 | |
| PF14412 | 109 | AHH: A nuclease family of the HNH/ENDO VII superfa | 81.43 |
| >cd05379 SCP_bacterial SCP_bacterial: SCP-like extracellular protein domain, as found in bacteria and archaea | Back alignment and domain information |
|---|
Probab=97.72 E-value=5.8e-05 Score=54.19 Aligned_cols=41 Identities=22% Similarity=0.242 Sum_probs=36.6
Q ss_pred hhHHhhhhhhhhhcCCCccccCCCCcccHHHHHHHhCCCCC
Q 029481 29 DTLLQGLNSYRESLNLTSLTKNKNAECLADELADQFKNQPC 69 (192)
Q Consensus 29 d~Ll~~iN~YR~slnLp~L~kN~kA~ClA~eiA~~~~~qpC 69 (192)
..+++.||.||+..++|+|+.+++-.+.|.+-|++.....+
T Consensus 2 ~~~~~~iN~~R~~~gl~pl~~~~~l~~~A~~~a~~~~~~~~ 42 (122)
T cd05379 2 QEALELINAYRAQNGLPPLTWDPALAAAAQAHARDMAANGY 42 (122)
T ss_pred hHHHHHHHHHHHHcCCCCCccChHHHHHHHHHHHHHHhcCc
Confidence 46899999999999999999999999999999998865555
|
The wider family of SCP containing proteins includes plant pathogenesis-related protein 1 (PR-1), CRISPs, mammalian cysteine-rich secretory proteins, and allergen 5 from vespid venom. It has been proposed that SCP domains may function as endopeptidases. Little is known about the biological roles of the bacterial and archaeal SCP domains. |
| >TIGR02909 spore_YkwD uncharacterized protein, YkwD family | Back alignment and domain information |
|---|
| >PF00188 CAP: Cysteine-rich secretory protein family; InterPro: IPR014044 The cysteine-rich secretory proteins, antigen 5, and pathogenesis-related 1 proteins (CAP) superfamily proteins are found in a wide range of organisms, including prokaryotes [] and non-vertebrate eukaryotes [], The nine subfamilies of the mammalian CAP superfamily include: the human glioma pathogenesis-related 1 (GLIPR1), Golgi associated pathogenesis related-1 (GAPR1) proteins, peptidase inhibitor 15 (PI15), peptidase inhibitor 16 (PI16), cysteine-rich secretory proteins (CRISPs), CRISP LCCL domain containing 1 (CRISPLD1), CRISP LCCL domain containing 2 (CRISPLD2), mannose receptor like and the R3H domain containing like proteins | Back alignment and domain information |
|---|
| >cd05384 SCP_PRY1_like SCP_PRY1_like: SCP-like extracellular protein domain, PRY1-like sub-family restricted to fungi | Back alignment and domain information |
|---|
| >cd05381 SCP_PR-1_like SCP_PR-1_like: SCP-like extracellular protein domain, PR-1 like subfamily | Back alignment and domain information |
|---|
| >cd00168 SCP SCP: SCP-like extracellular protein domain, found in eukaryotes and prokaryotes | Back alignment and domain information |
|---|
| >PF14412 AHH: A nuclease family of the HNH/ENDO VII superfamily with conserved AHH | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 192 | |||
| 4ifa_A | 339 | Extracellular protein containing A SCP domain; vac | 93.57 | |
| 4h0a_A | 323 | Uncharacterized protein; CAP protein family, cyste | 92.96 | |
| 1cfe_A | 135 | Pathogenesis-related protein P14A; PR-1 proteins, | 89.2 |
| >4ifa_A Extracellular protein containing A SCP domain; vaccine candi virulence, pathogenesis, center for structural genomics of infectious diseases; HET: MSE; 1.50A {Bacillus anthracis} | Back alignment and structure |
|---|
Probab=93.57 E-value=0.027 Score=49.84 Aligned_cols=117 Identities=15% Similarity=0.189 Sum_probs=66.1
Q ss_pred hhhhHHhhhhhhhhhcCCCccccCCCCcccHHHHHHHhCCCCCCCCCCCccCCCCCCCCCChhHhhhccccccccccCce
Q 029481 27 EEDTLLQGLNSYRESLNLTSLTKNKNAECLADELADQFKNQPCTNSTGANTVPGTEKQLSNYPDLLAKCHLNVSNTRDGI 106 (192)
Q Consensus 27 ~Ed~Ll~~iN~YR~slnLp~L~kN~kA~ClA~eiA~~~~~qpCtntt~~~~vPg~~pq~pnyp~~l~kC~inin~T~DG~ 106 (192)
.|.++|.-+|.||+..+||+|+-|+...-.|+.=|+++...-+-..++. -|. +..+.+++..+.......=+
T Consensus 220 ~e~~vl~lvN~~Ra~~Gl~pL~~d~~L~~aAq~hA~dMa~~~~fsH~~~---~G~-----~~~~R~~~~G~~~~~~GENI 291 (339)
T 4ifa_A 220 NMQQIFDLTNIIRSRHNLPLLAWDQQTADVAIGHSKDMKDNNYFSHDSP---TLG-----TLGDRLQRGKVGFQLAGENI 291 (339)
T ss_dssp HHHHHHHHHHHHHHHTTCCCCEECHHHHHHHHHHHHHHHHHTCCSSSBT---TTB-----CHHHHHHHTTCCCSEEEEEE
T ss_pred HHHHHHHHHHHHHHHcCCCCCccCHHHHHHHHHHHHHHhhcCCeecCCC---CCC-----CHHHHHHHcCCCcCceeEEE
Confidence 4789999999999999999999999988888777776643322111111 011 22345554433211100000
Q ss_pred eeecccCCCccchhhcccchh--hhhcccCCCCcceeeccCCCceEEEEEec
Q 029481 107 VMPACVPNLEHSLVLSNFTKS--QYSDSLNDTKYKGAGIGSEDNWIVVILTT 156 (192)
Q Consensus 107 imPvCVP~l~~~~vltNyT~S--qy~~yLNdSkytg~GiGsed~WmVvVLtT 156 (192)
-. . ...+..++..+-.| +.+. |=+..|+-+|||-...|.+.++.+
T Consensus 292 A~--G--~~s~~~av~~WmnSpGHr~N-IL~~~~t~iGvGv~~~YwtQ~F~~ 338 (339)
T 4ifa_A 292 AA--Q--HSDGVAALQGWLNSEGHRKN-LLNEQFTGLGVGVYDKFYTQNFIR 338 (339)
T ss_dssp EE--S--CSSHHHHHHHHHTSHHHHHH-HTCTTCCEEEEEEETTEEEEEEEE
T ss_pred EE--e--CCCHHHHHHHHhCCHhHHHH-HhCCCCCEEEEEEEecEEEEEEec
Confidence 00 0 01223344444445 4444 545678888888666666666653
|
| >4h0a_A Uncharacterized protein; CAP protein family, cysteine-rich secretory proteins, struct genomics, joint center for structural genomics; 1.90A {Staphylococcus aureus subsp} | Back alignment and structure |
|---|
| >1cfe_A Pathogenesis-related protein P14A; PR-1 proteins, plant defense; NMR {Solanum lycopersicum} SCOP: d.111.1.1 | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00