Citrus Sinensis ID: 038963


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------
MACRGCLECLLKLLNFLLTIVGLAMVGYGIYLFVEYKRADNSGTVSPVSGDEGFVQLGRPMLMAVSLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCIGATTRNGCCLTCVSLVL
cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccHHcccHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHHHHHHccccEEEEEEEEcc
ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccHHcccccccccccccccHHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccEEEEEEEEc
MACRGCLECLLKLLNFLLTIVGLAMVGYGIYLFVEykradnsgtvspvsgdegfvqlgRPMLMAVSLADSIFDKLPKAWFIYLFIGVGVVLFVISCVgcigattrngccltcvslvl
MACRGCLECLLKLLNFLLTIVGLAMVGYGIYLFVEYKRADNSGTVSPVSGDEGFVQLGRPMLMAVSLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCIGATTRNGCCLTCVSLVL
MACRGClecllkllnfllTIVGLAMVGYGIYLFVEYKRADNSGTVSPVSGDEGFVQLGRPMLMAVSLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCIGATTRNGCCLTCVSLVL
**CRGCLECLLKLLNFLLTIVGLAMVGYGIYLFVEYKRADNSGTVSPVSGDEGFVQLGRPMLMAVSLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCIGATTRNGCCLTCVSLV*
***RGCLECLLKLLNFLLTIVGLAMVGYGIYLFVEYKR**********************MLMAVSLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCIGATTRNGCCLTCVSLVL
MACRGCLECLLKLLNFLLTIVGLAMVGYGIYLFVEYKRADNSGTVSPVSGDEGFVQLGRPMLMAVSLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCIGATTRNGCCLTCVSLVL
*ACRGCLECLLKLLNFLLTIVGLAMVGYGIYLFVEYKRAD************G*VQLGRPMLMAVSLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCIGATTRNGCCLTCVSLVL
iiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiii
ooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooo
oooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooo
iiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiii
iiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MACRGCLECLLKLLNFLLTIVGLAMVGYGIYLFVEYKRADNSGTVSPVSGDEGFVQLGRPMLMAVSLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCIGATTRNGCCLTCVSLVL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query117 2.2.26 [Sep-21-2011]
Q940P5 221 Tetraspanin-19 OS=Arabido no no 0.700 0.371 0.364 2e-11
Q93XY5 270 Tetraspanin-18 OS=Arabido no no 0.923 0.4 0.304 7e-09
>sp|Q940P5|TET19_ARATH Tetraspanin-19 OS=Arabidopsis thaliana GN=At2g20740 PE=2 SV=1 Back     alignment and function desciption
 Score = 67.8 bits (164), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 58/107 (54%), Gaps = 25/107 (23%)

Query: 4   RGCLECLLKLLNFLLTIVGLAMVGYGIYLFVEYKRADNSGTVSPVSGDEGFVQLGRPMLM 63
           R CL+ +LKL+N L+ +VG+AM+ Y ++L  +++                  Q+G     
Sbjct: 6   RSCLQSMLKLVNSLIGMVGIAMILYAVWLIRQWQE-----------------QMGN---- 44

Query: 64  AVSLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCIGATTRNGCCL 110
            +  ADS     P  WFIY F+G+G +L V++C G I A T NGCCL
Sbjct: 45  -LPFADS---DHPVPWFIYSFLGLGAILCVVTCAGHIAAETVNGCCL 87




May be involved in the regulation of cell differentiation.
Arabidopsis thaliana (taxid: 3702)
>sp|Q93XY5|TET18_ARATH Tetraspanin-18 OS=Arabidopsis thaliana GN=At2g20230 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query117
356497054 283 PREDICTED: uncharacterized protein LOC10 1.0 0.413 0.691 1e-43
255647653 283 unknown [Glycine max] 1.0 0.413 0.691 1e-43
356541380 283 PREDICTED: uncharacterized protein LOC10 1.0 0.413 0.683 1e-42
359474591 280 PREDICTED: uncharacterized protein LOC10 0.957 0.4 0.725 4e-42
357482319 294 GMFP4 [Medicago truncatula] gi|355512780 0.965 0.384 0.702 4e-42
357482323 289 GMFP4 [Medicago truncatula] gi|355512782 0.982 0.397 0.674 3e-41
449461537 279 PREDICTED: uncharacterized protein LOC10 0.974 0.408 0.701 2e-40
356511490 277 PREDICTED: uncharacterized protein LOC54 0.948 0.400 0.730 4e-39
224138040 280 predicted protein [Populus trichocarpa] 0.982 0.410 0.784 3e-38
358248102 277 uncharacterized protein LOC100793680 [Gl 0.948 0.400 0.704 5e-38
>gi|356497054|ref|XP_003517379.1| PREDICTED: uncharacterized protein LOC100791911 [Glycine max] Back     alignment and taxonomy information
 Score =  180 bits (456), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 83/120 (69%), Positives = 101/120 (84%), Gaps = 3/120 (2%)

Query: 1   MACRGCLECLLKLLNFLLTIVGLAMVGYGIYLFVEYKRA---DNSGTVSPVSGDEGFVQL 57
           MACRGC ECLLKL NF+LT+ GLA+VGYGIYLFVE+ +A   DN+  +SPVS D   +QL
Sbjct: 1   MACRGCWECLLKLFNFILTLTGLAIVGYGIYLFVEFSKASDDDNTPAISPVSDDSALIQL 60

Query: 58  GRPMLMAVSLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCIGATTRNGCCLTCVSLVL 117
           GRP+LMAVSL++  FD LP+AWFIYLFIG+GVVLF+ISC GCIGA TRNGCCL+C S+++
Sbjct: 61  GRPVLMAVSLSNDFFDNLPRAWFIYLFIGIGVVLFLISCFGCIGAATRNGCCLSCYSILV 120




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255647653|gb|ACU24289.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|356541380|ref|XP_003539155.1| PREDICTED: uncharacterized protein LOC100815033 [Glycine max] Back     alignment and taxonomy information
>gi|359474591|ref|XP_002279170.2| PREDICTED: uncharacterized protein LOC100245417 [Vitis vinifera] Back     alignment and taxonomy information
>gi|357482319|ref|XP_003611445.1| GMFP4 [Medicago truncatula] gi|355512780|gb|AES94403.1| GMFP4 [Medicago truncatula] Back     alignment and taxonomy information
>gi|357482323|ref|XP_003611447.1| GMFP4 [Medicago truncatula] gi|355512782|gb|AES94405.1| GMFP4 [Medicago truncatula] Back     alignment and taxonomy information
>gi|449461537|ref|XP_004148498.1| PREDICTED: uncharacterized protein LOC101203030 [Cucumis sativus] gi|449527337|ref|XP_004170668.1| PREDICTED: uncharacterized protein LOC101226567 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356511490|ref|XP_003524459.1| PREDICTED: uncharacterized protein LOC547883 [Glycine max] Back     alignment and taxonomy information
>gi|224138040|ref|XP_002326503.1| predicted protein [Populus trichocarpa] gi|222833825|gb|EEE72302.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|358248102|ref|NP_001240069.1| uncharacterized protein LOC100793680 [Glycine max] gi|255641974|gb|ACU21254.1| unknown [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query117
TAIR|locus:2033718 280 TOM2A "tobamovirus multiplicat 1.0 0.417 0.601 1.9e-35
TAIR|locus:2038997 270 AT2G20230 [Arabidopsis thalian 0.358 0.155 0.428 2.1e-09
TAIR|locus:2051348 221 AT2G20740 "AT2G20740" [Arabido 0.299 0.158 0.571 3.2e-07
RGD|130474095 Tspan9 "tetraspanin 9" [Rattus 0.299 0.368 0.371 9.2e-05
UNIPROTKB|H0Y8W487 TSPAN5 "Tetraspanin-5" [Homo s 0.222 0.298 0.461 0.00012
UNIPROTKB|J9NS67 241 TSPAN1 "Uncharacterized protei 0.307 0.149 0.388 0.00042
UNIPROTKB|P19397 219 CD53 "Leukocyte surface antige 0.401 0.214 0.297 0.00071
UNIPROTKB|E2RPE6 295 TSPAN1 "Uncharacterized protei 0.307 0.122 0.388 0.0008
TAIR|locus:2033718 TOM2A "tobamovirus multiplication 2A" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 383 (139.9 bits), Expect = 1.9e-35, P = 1.9e-35
 Identities = 71/118 (60%), Positives = 85/118 (72%)

Query:     1 MACRGCXXXXXXXXXXXXTIVGLAMVGYGIYLFVEYKRA-DNSGTVSPVSGDEGFVQLGR 59
             MACRGC             + GL M+GYGIYLFVEYKR  DNS T    +GD+ +V  GR
Sbjct:     1 MACRGCLECLLKLLNFLLAVAGLGMIGYGIYLFVEYKRVTDNSVTFDLTNGDQSYVSFGR 60

Query:    60 PMLMAVSLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCIGATTRNGCCLTCVSLVL 117
             P+LMAVSL+ +IFD LPKAWFIYLFIG+GV LFVISC GC+G  +R+ CCL+C SL+L
Sbjct:    61 PILMAVSLSSNIFDNLPKAWFIYLFIGIGVALFVISCCGCVGTCSRSVCCLSCYSLLL 118




GO:0005576 "extracellular region" evidence=ISM
GO:0016021 "integral to membrane" evidence=IEA
GO:0005515 "protein binding" evidence=IPI
GO:0016020 "membrane" evidence=ISS
GO:0046786 "viral replication complex formation and maintenance" evidence=IMP
GO:0005773 "vacuole" evidence=IDA
GO:0009506 "plasmodesma" evidence=IDA
GO:0005794 "Golgi apparatus" evidence=IDA
TAIR|locus:2038997 AT2G20230 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2051348 AT2G20740 "AT2G20740" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
RGD|1304740 Tspan9 "tetraspanin 9" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|H0Y8W4 TSPAN5 "Tetraspanin-5" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|J9NS67 TSPAN1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|P19397 CD53 "Leukocyte surface antigen CD53" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|E2RPE6 TSPAN1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query117
pfam00335 221 pfam00335, Tetraspannin, Tetraspanin family 6e-07
>gnl|CDD|215864 pfam00335, Tetraspannin, Tetraspanin family Back     alignment and domain information
 Score = 45.5 bits (108), Expect = 6e-07
 Identities = 27/112 (24%), Positives = 43/112 (38%), Gaps = 29/112 (25%)

Query: 6   CLECLLKLLNFLLTIVGLAMVGYGIYLFVEYKRADNSGTVSPVSGDEGFVQLGRPMLMAV 65
           CL+ LL LLN L  ++GLA++  GI+L V                               
Sbjct: 1   CLKYLLFLLNLLFLLLGLALLAVGIWLLV-----------------------------IA 31

Query: 66  SLADSIFDKLPKAWFIYLFIGVGVVLFVISCVGCIGATTRNGCCLTCVSLVL 117
               +I           L I +GV++ ++  +GC GA   + C L    ++L
Sbjct: 32  KDYLAIALNDSIRALYILIIVLGVIILLVGFLGCCGAIKESRCLLLTYFILL 83


Length = 221

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 117
KOG3882 237 consensus Tetraspanin family integral membrane pro 99.71
PF00335 221 Tetraspannin: Tetraspanin family RDS_ROM1 subfamil 99.09
>KOG3882 consensus Tetraspanin family integral membrane protein [General function prediction only] Back     alignment and domain information
Probab=99.71  E-value=2.1e-17  Score=122.64  Aligned_cols=85  Identities=32%  Similarity=0.569  Sum_probs=71.3

Q ss_pred             CcchHHHHHHHHHHHHHHHHHHHHHHhheeEEecccccCCCCcCCCCCCCcccccCcchhhhhhhhhhhcccCChhHHHH
Q 038963            3 CRGCLECLLKLLNFLLTIVGLAMVGYGIYLFVEYKRADNSGTVSPVSGDEGFVQLGRPMLMAVSLADSIFDKLPKAWFIY   82 (117)
Q Consensus         3 c~~~lk~~L~~~N~lf~i~G~~li~~G~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   82 (117)
                      |.+|+||.++.+|+++|++|++++++|+|++.++....+....             .              .+.+   ++
T Consensus         5 ~~~~~K~~lf~~N~~~~l~G~~ll~~giw~~~~~~~~~~~~~~-------------~--------------~~~~---~~   54 (237)
T KOG3882|consen    5 GSSCLKYLLFLLNLLFWLLGLLLLAVGIWLLADKGFLSSLLES-------------D--------------FLVP---AY   54 (237)
T ss_pred             ccchHHHHHHHHHHHHHHHHHHHHHhhhheeEeccchhhcccc-------------c--------------hhcc---hh
Confidence            4589999999999999999999999999999998765532210             0              0122   37


Q ss_pred             HHHHHHHHHHHHHHHHHHHhhhcccchhhhhhhcC
Q 038963           83 LFIGVGVVLFVISCVGCIGATTRNGCCLTCVSLVL  117 (117)
Q Consensus        83 ~li~~G~~~~~v~~~Gc~Ga~~en~~lL~~y~~~l  117 (117)
                      +++++|++++++|++||+||.|||||+|.+|++++
T Consensus        55 ili~~G~v~~~v~flGc~Ga~~es~~lL~~y~~~l   89 (237)
T KOG3882|consen   55 ILIAVGGVVFLVGFLGCCGALRESRCLLLSYFILL   89 (237)
T ss_pred             hhhhhhHHHHHHHHhhhhhhHhhhHHHHHHHHHHH
Confidence            99999999999999999999999999999998764



>PF00335 Tetraspannin: Tetraspanin family RDS_ROM1 subfamily; InterPro: IPR018499 A number of eukaryotic CD antigens have been shown to be related [] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00