Citrus Sinensis ID: 033542
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 117 | ||||||
| 356497054 | 283 | PREDICTED: uncharacterized protein LOC10 | 1.0 | 0.413 | 0.683 | 2e-42 | |
| 255647653 | 283 | unknown [Glycine max] | 1.0 | 0.413 | 0.683 | 2e-42 | |
| 356541380 | 283 | PREDICTED: uncharacterized protein LOC10 | 1.0 | 0.413 | 0.675 | 4e-42 | |
| 359474591 | 280 | PREDICTED: uncharacterized protein LOC10 | 0.957 | 0.4 | 0.716 | 1e-40 | |
| 357482319 | 294 | GMFP4 [Medicago truncatula] gi|355512780 | 0.965 | 0.384 | 0.685 | 2e-40 | |
| 356511490 | 277 | PREDICTED: uncharacterized protein LOC54 | 0.948 | 0.400 | 0.739 | 1e-39 | |
| 357482323 | 289 | GMFP4 [Medicago truncatula] gi|355512782 | 0.982 | 0.397 | 0.658 | 1e-39 | |
| 449461537 | 279 | PREDICTED: uncharacterized protein LOC10 | 0.974 | 0.408 | 0.701 | 8e-39 | |
| 358248102 | 277 | uncharacterized protein LOC100793680 [Gl | 0.948 | 0.400 | 0.721 | 4e-38 | |
| 224138040 | 280 | predicted protein [Populus trichocarpa] | 0.982 | 0.410 | 0.767 | 6e-37 |
| >gi|356497054|ref|XP_003517379.1| PREDICTED: uncharacterized protein LOC100791911 [Glycine max] | Back alignment and taxonomy information |
|---|
Score = 176 bits (445), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 82/120 (68%), Positives = 98/120 (81%), Gaps = 3/120 (2%)
Query: 1 MACRGCLECLLKLLNFLLTLAGLAMVGYGIYLFVEYKRV---DNLGTVSPVSGDEGFVQL 57
MACRGC ECLLKL NF+LTL GLA+VGYGIYLFVE+ + DN +SPVS D +QL
Sbjct: 1 MACRGCWECLLKLFNFILTLTGLAIVGYGIYLFVEFSKASDDDNTPAISPVSDDSALIQL 60
Query: 58 GRPMLMAVSLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCISATTRNGCCLTCVSLVL 117
GRP+LMAVSL++ FD LP+AWFIYLFIG+GVVLF+ISC GCI A TRNGCCL+C S+++
Sbjct: 61 GRPVLMAVSLSNDFFDNLPRAWFIYLFIGIGVVLFLISCFGCIGAATRNGCCLSCYSILV 120
|
Source: Glycine max Species: Glycine max Genus: Glycine Family: Fabaceae Order: Fabales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|255647653|gb|ACU24289.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356541380|ref|XP_003539155.1| PREDICTED: uncharacterized protein LOC100815033 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|359474591|ref|XP_002279170.2| PREDICTED: uncharacterized protein LOC100245417 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|357482319|ref|XP_003611445.1| GMFP4 [Medicago truncatula] gi|355512780|gb|AES94403.1| GMFP4 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|356511490|ref|XP_003524459.1| PREDICTED: uncharacterized protein LOC547883 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|357482323|ref|XP_003611447.1| GMFP4 [Medicago truncatula] gi|355512782|gb|AES94405.1| GMFP4 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|449461537|ref|XP_004148498.1| PREDICTED: uncharacterized protein LOC101203030 [Cucumis sativus] gi|449527337|ref|XP_004170668.1| PREDICTED: uncharacterized protein LOC101226567 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|358248102|ref|NP_001240069.1| uncharacterized protein LOC100793680 [Glycine max] gi|255641974|gb|ACU21254.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|224138040|ref|XP_002326503.1| predicted protein [Populus trichocarpa] gi|222833825|gb|EEE72302.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 117 | ||||||
| TAIR|locus:2033718 | 280 | TOM2A "tobamovirus multiplicat | 1.0 | 0.417 | 0.576 | 6.7e-33 | |
| TAIR|locus:2038997 | 270 | AT2G20230 [Arabidopsis thalian | 0.358 | 0.155 | 0.428 | 1.1e-08 | |
| TAIR|locus:2051348 | 221 | AT2G20740 "AT2G20740" [Arabido | 0.299 | 0.158 | 0.571 | 2.5e-07 | |
| UNIPROTKB|H0Y8W4 | 87 | TSPAN5 "Tetraspanin-5" [Homo s | 0.222 | 0.298 | 0.423 | 0.00095 |
| TAIR|locus:2033718 TOM2A "tobamovirus multiplication 2A" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 359 (131.4 bits), Expect = 6.7e-33, P = 6.7e-33
Identities = 68/118 (57%), Positives = 81/118 (68%)
Query: 1 MACRGCXXXXXXXXXXXXXXXXXAMVGYGIYLFVEYKRV-DNLGTVSPVSGDEGFVQLGR 59
MACRGC M+GYGIYLFVEYKRV DN T +GD+ +V GR
Sbjct: 1 MACRGCLECLLKLLNFLLAVAGLGMIGYGIYLFVEYKRVTDNSVTFDLTNGDQSYVSFGR 60
Query: 60 PMLMAVSLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCISATTRNGCCLTCVSLVL 117
P+LMAVSL+ +IFD LPKAWFIYLFIG+GV LFVISC GC+ +R+ CCL+C SL+L
Sbjct: 61 PILMAVSLSSNIFDNLPKAWFIYLFIGIGVALFVISCCGCVGTCSRSVCCLSCYSLLL 118
|
|
| TAIR|locus:2038997 AT2G20230 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2051348 AT2G20740 "AT2G20740" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|H0Y8W4 TSPAN5 "Tetraspanin-5" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| GSVIVG00015826001 | hypothetical protein LOC100245417 (410 aa) | |||||||
(Vitis vinifera) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 117 | |||
| pfam00335 | 221 | pfam00335, Tetraspannin, Tetraspanin family | 3e-06 |
| >gnl|CDD|215864 pfam00335, Tetraspannin, Tetraspanin family | Back alignment and domain information |
|---|
Score = 43.6 bits (103), Expect = 3e-06
Identities = 27/112 (24%), Positives = 41/112 (36%), Gaps = 29/112 (25%)
Query: 6 CLECLLKLLNFLLTLAGLAMVGYGIYLFVEYKRVDNLGTVSPVSGDEGFVQLGRPMLMAV 65
CL+ LL LLN L L GLA++ GI+L V
Sbjct: 1 CLKYLLFLLNLLFLLLGLALLAVGIWLLV-----------------------------IA 31
Query: 66 SLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCISATTRNGCCLTCVSLVL 117
+I L I +GV++ ++ +GC A + C L ++L
Sbjct: 32 KDYLAIALNDSIRALYILIIVLGVIILLVGFLGCCGAIKESRCLLLTYFILL 83
|
Length = 221 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 117 | |||
| KOG3882 | 237 | consensus Tetraspanin family integral membrane pro | 99.71 | |
| PF00335 | 221 | Tetraspannin: Tetraspanin family RDS_ROM1 subfamil | 99.07 |
| >KOG3882 consensus Tetraspanin family integral membrane protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=99.71 E-value=2.1e-17 Score=123.78 Aligned_cols=85 Identities=34% Similarity=0.586 Sum_probs=71.7
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHhheeEEecccccccCCcCCCCCCCcccccCcchhhhhhhhhcccccCchhhHHH
Q 033542 3 CRGCLECLLKLLNFLLTLAGLAMVGYGIYLFVEYKRVDNLGTVSPVSGDEGFVQLGRPMLMAVSLADSIFDKLPKAWFIY 82 (117)
Q Consensus 3 c~~~lK~~l~~~N~l~~l~G~~li~~Giw~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (117)
|.+|+||.++.+|+++|++|++++++|+|++.++....++... . ...++ +
T Consensus 5 ~~~~~K~~lf~~N~~~~l~G~~ll~~giw~~~~~~~~~~~~~~-------------~--------------~~~~~---~ 54 (237)
T KOG3882|consen 5 GSSCLKYLLFLLNLLFWLLGLLLLAVGIWLLADKGFLSSLLES-------------D--------------FLVPA---Y 54 (237)
T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHhhhheeEeccchhhcccc-------------c--------------hhcch---h
Confidence 4489999999999999999999999999999998766543210 0 02333 7
Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccchhhhhhhhcC
Q 033542 83 LFIGVGVVLFVISCVGCISATTRNGCCLTCVSLVL 117 (117)
Q Consensus 83 ~~i~~G~~~~~v~~~Gc~ga~~es~~lL~~y~~~l 117 (117)
+++++|.+++++|++||+||.|||+|+|.+|++++
T Consensus 55 ili~~G~v~~~v~flGc~Ga~~es~~lL~~y~~~l 89 (237)
T KOG3882|consen 55 ILIAVGGVVFLVGFLGCCGALRESRCLLLSYFILL 89 (237)
T ss_pred hhhhhhHHHHHHHHhhhhhhHhhhHHHHHHHHHHH
Confidence 99999999999999999999999999999998764
|
|
| >PF00335 Tetraspannin: Tetraspanin family RDS_ROM1 subfamily; InterPro: IPR018499 A number of eukaryotic CD antigens have been shown to be related [] | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00