Citrus Sinensis ID: 031010
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 167 | ||||||
| 255542610 | 190 | conserved hypothetical protein [Ricinus | 0.958 | 0.842 | 0.585 | 4e-48 | |
| 150036258 | 196 | GPI-anchored protein-like protein II [Cu | 0.946 | 0.806 | 0.544 | 9e-41 | |
| 18405086 | 200 | Glycoprotein membrane precursor GPI-anch | 0.808 | 0.675 | 0.546 | 1e-40 | |
| 449443181 | 172 | PREDICTED: uncharacterized GPI-anchored | 0.958 | 0.930 | 0.526 | 4e-40 | |
| 297853454 | 200 | predicted protein [Arabidopsis lyrata su | 0.808 | 0.675 | 0.539 | 6e-40 | |
| 225450985 | 196 | PREDICTED: uncharacterized GPI-anchored | 0.964 | 0.821 | 0.518 | 8e-39 | |
| 224131028 | 139 | predicted protein [Populus trichocarpa] | 0.796 | 0.956 | 0.588 | 2e-38 | |
| 147801316 | 252 | hypothetical protein VITISV_015337 [Viti | 0.910 | 0.603 | 0.522 | 3e-36 | |
| 255542608 | 196 | conserved hypothetical protein [Ricinus | 0.976 | 0.831 | 0.487 | 7e-36 | |
| 351721661 | 198 | uncharacterized protein LOC100306063 pre | 0.970 | 0.818 | 0.473 | 1e-35 |
| >gi|255542610|ref|XP_002512368.1| conserved hypothetical protein [Ricinus communis] gi|223548329|gb|EEF49820.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 195 bits (496), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 96/164 (58%), Positives = 124/164 (75%), Gaps = 4/164 (2%)
Query: 4 DDEDNLLQGINSFRTSLGIPVLTKHKKAACLADEVAEDLEDQPCTSINIGVKSPPLSNYN 63
D E+++LQGINS+RTSL +P LTK+ KA CLADEVA+ L+DQPC+S G S LSNY
Sbjct: 29 DKENSVLQGINSYRTSLQLPALTKNDKAGCLADEVADKLQDQPCSSA--GASSVQLSNYP 86
Query: 64 RLVKKCDIDINTNRDGVVLPVCVHDLVPTLVLTNYTHSSYAQYLNNSKFAGAGVGSEDDW 123
L+ KC IDIN RDG VLPVCV LVPTLVLTNYTH+ +A+Y+N+S++ G G+GSEDDW
Sbjct: 87 ELLSKCGIDINHTRDGAVLPVCVPKLVPTLVLTNYTHTQHAKYINDSRYTGVGIGSEDDW 146
Query: 124 MVVVLTTNTAAGIFASGAHHSVVSEIGWKRYAVCLLLGSLVYLV 167
MVVVL+TNT +G FA +S++S+IG +C L+G ++ V
Sbjct: 147 MVVVLSTNTPSGSFAGA--NSLMSKIGCGHCLMCFLMGIMLLFV 188
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|150036258|gb|ABR67421.1| GPI-anchored protein-like protein II [Cucumis melo subsp. melo] | Back alignment and taxonomy information |
|---|
| >gi|18405086|ref|NP_564669.1| Glycoprotein membrane precursor GPI-anchored [Arabidopsis thaliana] gi|12322170|gb|AAG51126.1|AC069144_23 unknown protein [Arabidopsis thaliana] gi|3776578|gb|AAC64895.1| ESTs gb|F13915 and gb|F13916 come from this gene [Arabidopsis thaliana] gi|21553603|gb|AAM62696.1| unknown [Arabidopsis thaliana] gi|28392879|gb|AAO41876.1| unknown protein [Arabidopsis thaliana] gi|28827272|gb|AAO50480.1| unknown protein [Arabidopsis thaliana] gi|332195036|gb|AEE33157.1| Glycoprotein membrane precursor GPI-anchored [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|449443181|ref|XP_004139359.1| PREDICTED: uncharacterized GPI-anchored protein At3g06035-like [Cucumis sativus] gi|449483048|ref|XP_004156479.1| PREDICTED: uncharacterized GPI-anchored protein At3g06035-like [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|297853454|ref|XP_002894608.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297340450|gb|EFH70867.1| predicted protein [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|225450985|ref|XP_002284816.1| PREDICTED: uncharacterized GPI-anchored protein At3g06035 [Vitis vinifera] gi|296088315|emb|CBI36760.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|224131028|ref|XP_002328435.1| predicted protein [Populus trichocarpa] gi|222838150|gb|EEE76515.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|147801316|emb|CAN77026.1| hypothetical protein VITISV_015337 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|255542608|ref|XP_002512367.1| conserved hypothetical protein [Ricinus communis] gi|223548328|gb|EEF49819.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|351721661|ref|NP_001236961.1| uncharacterized protein LOC100306063 precursor [Glycine max] gi|255627423|gb|ACU14056.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 167 | ||||||
| TAIR|locus:2011025 | 200 | AT1G54860 "AT1G54860" [Arabido | 0.808 | 0.675 | 0.546 | 2.9e-39 | |
| TAIR|locus:505006331 | 200 | AT3G06035 "AT3G06035" [Arabido | 0.988 | 0.825 | 0.413 | 3.2e-33 | |
| TAIR|locus:2182162 | 196 | AT5G19250 "AT5G19250" [Arabido | 0.832 | 0.709 | 0.461 | 4.2e-31 | |
| TAIR|locus:2182142 | 189 | AT5G19230 "AT5G19230" [Arabido | 0.754 | 0.666 | 0.455 | 2.6e-24 | |
| TAIR|locus:2182152 | 199 | AT5G19240 "AT5G19240" [Arabido | 0.826 | 0.693 | 0.410 | 7.8e-23 |
| TAIR|locus:2011025 AT1G54860 "AT1G54860" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 419 (152.6 bits), Expect = 2.9e-39, P = 2.9e-39
Identities = 77/141 (54%), Positives = 105/141 (74%)
Query: 6 EDNLLQGINSFRTSLGIPVLTKHKKAACLADEVAEDLEDQPCTS------INIGVKSPPL 59
EDNLLQG+NS+RT+ +P K++KA C+ADE+A+ LEDQPCT+ + G P L
Sbjct: 32 EDNLLQGLNSYRTAQRVPPFAKNEKADCVADEIADKLEDQPCTNHTTASTVTPGSVPPRL 91
Query: 60 SNYNRLVKKCDIDINTNRDGVVLPVCVHDLVPTLVLTNYTHSSYAQYLNNSKFAGAGVGS 119
+NY ++ +C ID NT RDG++LPVC+ + +PTL LTNYT + YA+YLN+S++ GAGVGS
Sbjct: 92 TNYQDILSECKIDPNTTRDGLILPVCIPNRIPTLALTNYTQTGYARYLNDSRYVGAGVGS 151
Query: 120 EDDWMVVVLTTNTAAGIFASG 140
E +WMVVVLTT+T G F +G
Sbjct: 152 EKEWMVVVLTTSTPGGSFTAG 172
|
|
| TAIR|locus:505006331 AT3G06035 "AT3G06035" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2182162 AT5G19250 "AT5G19250" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2182142 AT5G19230 "AT5G19230" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2182152 AT5G19240 "AT5G19240" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| AT1G54860 | unknown protein; unknown protein; FUNCTIONS IN- molecular_function unknown; INVOLVED IN- biological_process unknown; LOCATED IN- anchored to membrane; EXPRESSED IN- leaf whorl, petal, sepal, flower; EXPRESSED DURING- 4 anthesis, petal differentiation and expansion stage; BEST Arabidopsis thaliana protein match is- unknown protein (TAIR-AT5G19250.1); Has 42 Blast hits to 41 proteins in 9 species- Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 42; Viruses - 0; Other Eukaryotes - 0 (source- NCBI BLink). (200 aa) | ||||||||||
(Arabidopsis thaliana) | |||||||||||
| AT1G18280 | • | • | 0.863 | ||||||||
| AT3G18590 | • | 0.782 | |||||||||
| AT1G48940 | • | 0.782 | |||||||||
| AT1G64760 | • | 0.782 | |||||||||
| AT2G42940 | • | 0.677 | |||||||||
| AT5G64790 | • | • | 0.675 | ||||||||
| AT1G33430 | • | 0.659 | |||||||||
| FLA3 | • | 0.595 | |||||||||
| SHT | • | 0.575 | |||||||||
| SBT5.4 | • | 0.567 |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 167 | |||
| cd05379 | 122 | SCP_bacterial SCP_bacterial: SCP-like extracellula | 97.59 | |
| TIGR02909 | 127 | spore_YkwD uncharacterized protein, YkwD family. M | 97.56 | |
| PF00188 | 124 | CAP: Cysteine-rich secretory protein family; Inter | 92.45 | |
| cd05381 | 136 | SCP_PR-1_like SCP_PR-1_like: SCP-like extracellula | 86.8 | |
| cd05384 | 129 | SCP_PRY1_like SCP_PRY1_like: SCP-like extracellula | 86.26 | |
| cd00168 | 122 | SCP SCP: SCP-like extracellular protein domain, fo | 84.79 | |
| PF14412 | 109 | AHH: A nuclease family of the HNH/ENDO VII superfa | 80.45 | |
| cd05382 | 132 | SCP_GAPR-1_like SCP_GAPR-1_like: SCP-like extracel | 80.07 |
| >cd05379 SCP_bacterial SCP_bacterial: SCP-like extracellular protein domain, as found in bacteria and archaea | Back alignment and domain information |
|---|
Probab=97.59 E-value=7.9e-05 Score=52.32 Aligned_cols=42 Identities=29% Similarity=0.352 Sum_probs=37.2
Q ss_pred hHHhhhhhhhhhhcCCCccccCCCCcccHHHHHHHcCCCCcc
Q 031010 7 DNLLQGINSFRTSLGIPVLTKHKKAACLADEVAEDLEDQPCT 48 (167)
Q Consensus 7 d~Ll~~iN~YR~slnLp~L~kN~kA~ClA~eiA~~~~~qpCt 48 (167)
..+++.||.||+..++|+|+.+++-.+.|.+-|++.....+.
T Consensus 2 ~~~~~~iN~~R~~~gl~pl~~~~~l~~~A~~~a~~~~~~~~~ 43 (122)
T cd05379 2 QEALELINAYRAQNGLPPLTWDPALAAAAQAHARDMAANGYF 43 (122)
T ss_pred hHHHHHHHHHHHHcCCCCCccChHHHHHHHHHHHHHHhcCcc
Confidence 568999999999999999999999999999999988665553
|
The wider family of SCP containing proteins includes plant pathogenesis-related protein 1 (PR-1), CRISPs, mammalian cysteine-rich secretory proteins, and allergen 5 from vespid venom. It has been proposed that SCP domains may function as endopeptidases. Little is known about the biological roles of the bacterial and archaeal SCP domains. |
| >TIGR02909 spore_YkwD uncharacterized protein, YkwD family | Back alignment and domain information |
|---|
| >PF00188 CAP: Cysteine-rich secretory protein family; InterPro: IPR014044 The cysteine-rich secretory proteins, antigen 5, and pathogenesis-related 1 proteins (CAP) superfamily proteins are found in a wide range of organisms, including prokaryotes [] and non-vertebrate eukaryotes [], The nine subfamilies of the mammalian CAP superfamily include: the human glioma pathogenesis-related 1 (GLIPR1), Golgi associated pathogenesis related-1 (GAPR1) proteins, peptidase inhibitor 15 (PI15), peptidase inhibitor 16 (PI16), cysteine-rich secretory proteins (CRISPs), CRISP LCCL domain containing 1 (CRISPLD1), CRISP LCCL domain containing 2 (CRISPLD2), mannose receptor like and the R3H domain containing like proteins | Back alignment and domain information |
|---|
| >cd05381 SCP_PR-1_like SCP_PR-1_like: SCP-like extracellular protein domain, PR-1 like subfamily | Back alignment and domain information |
|---|
| >cd05384 SCP_PRY1_like SCP_PRY1_like: SCP-like extracellular protein domain, PRY1-like sub-family restricted to fungi | Back alignment and domain information |
|---|
| >cd00168 SCP SCP: SCP-like extracellular protein domain, found in eukaryotes and prokaryotes | Back alignment and domain information |
|---|
| >PF14412 AHH: A nuclease family of the HNH/ENDO VII superfamily with conserved AHH | Back alignment and domain information |
|---|
| >cd05382 SCP_GAPR-1_like SCP_GAPR-1_like: SCP-like extracellular protein domain, golgi-associated plant pathogenesis related protein (GAPR)-like sub-family | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 167 | |||
| 4ifa_A | 339 | Extracellular protein containing A SCP domain; vac | 93.96 | |
| 4h0a_A | 323 | Uncharacterized protein; CAP protein family, cyste | 93.02 | |
| 1cfe_A | 135 | Pathogenesis-related protein P14A; PR-1 proteins, | 88.46 |
| >4ifa_A Extracellular protein containing A SCP domain; vaccine candi virulence, pathogenesis, center for structural genomics of infectious diseases; HET: MSE; 1.50A {Bacillus anthracis} | Back alignment and structure |
|---|
Probab=93.96 E-value=0.016 Score=50.18 Aligned_cols=117 Identities=18% Similarity=0.250 Sum_probs=65.8
Q ss_pred hhhHHhhhhhhhhhhcCCCccccCCCCcccHHHHHHHcCCCCccccCCCCCCCCCCChhHhhhhccccccccCCceeeee
Q 031010 5 DEDNLLQGINSFRTSLGIPVLTKHKKAACLADEVAEDLEDQPCTSINIGVKSPPLSNYNRLVKKCDIDINTNRDGVVLPV 84 (167)
Q Consensus 5 ~Ed~Ll~~iN~YR~slnLp~L~kN~kA~ClA~eiA~~~~~qpCt~~t~g~~~~q~pnyp~~l~kC~inin~T~DG~imPv 84 (167)
.|.++|.-+|.||+..+||+|+-|+...-.|..=|+++-..-+-..+++. -.+..+.+++..+..... |--+=.
T Consensus 220 ~e~~vl~lvN~~Ra~~Gl~pL~~d~~L~~aAq~hA~dMa~~~~fsH~~~~----G~~~~~R~~~~G~~~~~~--GENIA~ 293 (339)
T 4ifa_A 220 NMQQIFDLTNIIRSRHNLPLLAWDQQTADVAIGHSKDMKDNNYFSHDSPT----LGTLGDRLQRGKVGFQLA--GENIAA 293 (339)
T ss_dssp HHHHHHHHHHHHHHHTTCCCCEECHHHHHHHHHHHHHHHHHTCCSSSBTT----TBCHHHHHHHTTCCCSEE--EEEEEE
T ss_pred HHHHHHHHHHHHHHHcCCCCCccCHHHHHHHHHHHHHHhhcCCeecCCCC----CCCHHHHHHHcCCCcCce--eEEEEE
Confidence 47899999999999999999999999888887777765433222222211 112335555554321110 000000
Q ss_pred eccCCCccccccccchh--hhhhcccCCCcccccccCCCceEEEEEec
Q 031010 85 CVHDLVPTLVLTNYTHS--SYAQYLNNSKFAGAGVGSEDDWMVVVLTT 130 (167)
Q Consensus 85 CVP~l~~~~vltNyT~S--~y~~yLNdSkytg~GiGsed~WmVvVLtT 130 (167)
.- ..+..++..+-.| +.+. |=+..|+-+|||-...|.+.++.+
T Consensus 294 G~--~s~~~av~~WmnSpGHr~N-IL~~~~t~iGvGv~~~YwtQ~F~~ 338 (339)
T 4ifa_A 294 QH--SDGVAALQGWLNSEGHRKN-LLNEQFTGLGVGVYDKFYTQNFIR 338 (339)
T ss_dssp SC--SSHHHHHHHHHTSHHHHHH-HTCTTCCEEEEEEETTEEEEEEEE
T ss_pred eC--CCHHHHHHHHhCCHhHHHH-HhCCCCCEEEEEEEecEEEEEEec
Confidence 00 1122334444445 4444 445677777777666655666553
|
| >4h0a_A Uncharacterized protein; CAP protein family, cysteine-rich secretory proteins, struct genomics, joint center for structural genomics; 1.90A {Staphylococcus aureus subsp} | Back alignment and structure |
|---|
| >1cfe_A Pathogenesis-related protein P14A; PR-1 proteins, plant defense; NMR {Solanum lycopersicum} SCOP: d.111.1.1 | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00