Citrus Sinensis ID: 040328


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------
MGVSFNINPSSSSFAETHSRKHKLLLISNYILLAASSSFIFLTLSLRLFPSLVGFFLILLHVLTILIAVSSLSVATSSSNKCYAAHMVATVLTGIFQGSVSVLIFASTSDFLGYLKSYVREEDGAVILKLAGALSVLIFFLEWVVLALTFLLRYHAFVDGSNSGAAGASRFQEEDSKAWSRPFQVYV
ccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHccHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccEEEc
ccEEEEcccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHcccHccccccHHHHHHHHHHHHHHHHHHHHHEHEEcccHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccEEEEc
mgvsfninpssssfaethsrKHKLLLISNYILLAASSSFIFLTLSLRLFPSLVGFFLILLHVLTILIAVSSLSvatsssnkcyAAHMVATVLTGIFQGSVSVLIFASTSDFLGYLKSYVREEDGAVILKLAGALSVLIFFLEWVVLALTFLLRYHafvdgsnsgaagasrfqeedskawsrpfqvyv
mgvsfninpssssfaETHSRKHKLLLISNYILLAASSSFIFLTLSLRLFPSLVGFFLILLHVLTILIAVSSLSVATSSSNKCYAAHMVATVLTGIFQGSVSVLIFASTSDFLGYLKSYVREEDGAVILKLAGALSVLIFFLEWVVLALTFLLRYHAFVDGSNSgaagasrfqeedskawsrpfqvyv
MGVSFNINPSSSSFAETHSRKHKLLLISNYILLAASSSFIFLTLSLRLFPSLVGFFLILLHVLTILIavsslsvatsssNKCYAAHMVATVLTGIFQGSVSVLIFASTSDFLGYLKSYVREEDGAVILKLAGALSVLIFFLEWVVLALTFLLRYHAFVDGSNSGAAGASRFQEEDSKAWSRPFQVYV
*********************HKLLLISNYILLAASSSFIFLTLSLRLFPSLVGFFLILLHVLTILIAVSSLSVATSSSNKCYAAHMVATVLTGIFQGSVSVLIFASTSDFLGYLKSYVREEDGAVILKLAGALSVLIFFLEWVVLALTFLLRYHAFVD****************************
*********************HKLLLISNYILLAASSSFIFLTLSLRLFPSLVGFFLILLHVLTILIAVSSLSVATSSSNKCYAAHMVATVLTGIFQGSVSVLIFASTSDFLGYLKSYVREEDGAVILKLAGALSVLIFFLEWVVLALTFLLRYHAFV*********************SRPFQVYV
*******************RKHKLLLISNYILLAASSSFIFLTLSLRLFPSLVGFFLILLHVLTILIAVSSLSVATSSSNKCYAAHMVATVLTGIFQGSVSVLIFASTSDFLGYLKSYVREEDGAVILKLAGALSVLIFFLEWVVLALTFLLRYHAFVDGSNSGA**********************
**************A*THSRKHKLLLISNYILLAASSSFIFLTLSLRLFPSLVGFFLILLHVLTILIAVSSLSVATSSSNKCYAAHMVATVLTGIFQGSVSVLIFASTSDFLGYLKSYVREEDGAVILKLAGALSVLIFFLEWVVLALTFLLRYHAFVDGSN*************SKAWSRPFQVYV
iiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSiiHHHHHHHHHHHHHHHHHHHHHHoooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHoooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGVSFNINPSSSSFAETHSRKHKLLLISNYILLAASSSFIFLTLSLRLFPSLVGFFLILLHVLTILIAVSSLSVATSSSNKCYAAHMVATVLTGIFQGSVSVLIFASTSDFLGYLKSYVREEDGAVILKLAGALSVLIFFLEWVVLALTFLLRYHAFVDGSNSGAAGASRFQEEDSKAWSRPFQVYV
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query187
225467749189 PREDICTED: uncharacterized protein LOC10 0.978 0.968 0.633 1e-57
224109514186 predicted protein [Populus trichocarpa] 0.946 0.951 0.65 5e-57
224101255191 predicted protein [Populus trichocarpa] 0.989 0.968 0.623 1e-56
224109182191 predicted protein [Populus trichocarpa] 0.989 0.968 0.617 2e-56
317106645189 JHL23J11.5 [Jatropha curcas] 0.941 0.931 0.642 3e-56
255567776193 conserved hypothetical protein [Ricinus 0.983 0.953 0.647 4e-56
255547444190 conserved hypothetical protein [Ricinus 0.941 0.926 0.637 5e-56
224121768174 predicted protein [Populus trichocarpa] 0.882 0.948 0.668 1e-52
357507865 316 hypothetical protein MTR_7g080550 [Medic 0.967 0.572 0.597 7e-52
388497550185 unknown [Medicago truncatula] 0.967 0.978 0.597 1e-51
>gi|225467749|ref|XP_002265070.1| PREDICTED: uncharacterized protein LOC100267455 [Vitis vinifera] gi|296080863|emb|CBI18793.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  228 bits (580), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 121/191 (63%), Positives = 142/191 (74%), Gaps = 8/191 (4%)

Query: 1   MGVSFNINPSSSSFAETHSRKHKLLLISNYILLAASSSFIFLTLSLRLFPSLVGFFLILL 60
           MG+S  I+  SSS  E+H   HK+ L  NYILL A+SS IFLTLSLRL PS+ GFFLILL
Sbjct: 1   MGLS--IHNQSSSVTESHYHTHKVFLFCNYILLGAASSCIFLTLSLRLLPSVCGFFLILL 58

Query: 61  HVLTILIAVSSLSVATSSSNKCYAAHMVATVLTGIFQGSVSVLIFASTSDFLGYLKSYVR 120
           H+LTI  AVS  + A++ +++ YAAHMV TVLT IFQGSVSVLIF  TSDFLG LKSYVR
Sbjct: 59  HILTITGAVSGCAAASTGTSRWYAAHMVTTVLTAIFQGSVSVLIFTRTSDFLGDLKSYVR 118

Query: 121 EEDGAVILKLAGALSVLIFFLEWVVLALTFLLRYHAFVDGSNSGAAG------ASRFQEE 174
           EEDGAVILKLAG LS+LIF LEWVV+ L F L+Y+A+V+G   G++G      A   QEE
Sbjct: 119 EEDGAVILKLAGGLSILIFCLEWVVMILAFFLKYYAYVEGDGCGSSGSAMKKSAKVQQEE 178

Query: 175 DSKAWSRPFQV 185
           D K W  PFQV
Sbjct: 179 DLKDWPWPFQV 189




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224109514|ref|XP_002333247.1| predicted protein [Populus trichocarpa] gi|224146576|ref|XP_002326057.1| predicted protein [Populus trichocarpa] gi|118484881|gb|ABK94307.1| unknown [Populus trichocarpa] gi|222835792|gb|EEE74227.1| predicted protein [Populus trichocarpa] gi|222862932|gb|EEF00439.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224101255|ref|XP_002312204.1| predicted protein [Populus trichocarpa] gi|118483634|gb|ABK93711.1| unknown [Populus trichocarpa] gi|222852024|gb|EEE89571.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224109182|ref|XP_002315113.1| predicted protein [Populus trichocarpa] gi|222864153|gb|EEF01284.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|317106645|dbj|BAJ53150.1| JHL23J11.5 [Jatropha curcas] Back     alignment and taxonomy information
>gi|255567776|ref|XP_002524866.1| conserved hypothetical protein [Ricinus communis] gi|223535829|gb|EEF37490.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|255547444|ref|XP_002514779.1| conserved hypothetical protein [Ricinus communis] gi|223545830|gb|EEF47333.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|224121768|ref|XP_002330648.1| predicted protein [Populus trichocarpa] gi|222872252|gb|EEF09383.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|357507865|ref|XP_003624221.1| hypothetical protein MTR_7g080550 [Medicago truncatula] gi|355499236|gb|AES80439.1| hypothetical protein MTR_7g080550 [Medicago truncatula] Back     alignment and taxonomy information
>gi|388497550|gb|AFK36841.1| unknown [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query187
TAIR|locus:2181447183 AT5G16250 "AT5G16250" [Arabido 0.941 0.961 0.589 8.8e-47
TAIR|locus:2076849185 AT3G02640 "AT3G02640" [Arabido 0.914 0.924 0.567 1.5e-44
TAIR|locus:2147376183 AT5G36710 "AT5G36710" [Arabido 0.925 0.945 0.522 1.4e-39
TAIR|locus:2832155183 AT5G36800 "AT5G36800" [Arabido 0.925 0.945 0.522 1.4e-39
TAIR|locus:2181447 AT5G16250 "AT5G16250" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 490 (177.5 bits), Expect = 8.8e-47, P = 8.8e-47
 Identities = 105/178 (58%), Positives = 122/178 (68%)

Query:    10 SSSSFAETHSRKHKLLLISNYILLAASSSFIFLTLSLRLFPSLVGFFLILLHVLTILIXX 69
             SSS   E+H   HK+ L SNYILL A+SS IFLTLSLRL PS+ GF LILLH  TI    
Sbjct:     6 SSSPVEESHYHTHKIFLFSNYILLGAASSCIFLTLSLRLIPSICGFLLILLHATTIAAAV 65

Query:    70 XXXXXXXXXXNKCYAAHMVATVLTGIFQGSVSVLIFASTSDFLGYLKSYVREEDGAVILK 129
                       N+ YAAHMVATVLT IFQGSVSVLIF +TS FLG LKSYVREED AVILK
Sbjct:    66 SGCAAASCGRNRWYAAHMVATVLTAIFQGSVSVLIFTNTSKFLGSLKSYVREEDAAVILK 125

Query:   130 LAGALSVLIFFLEWVVLALTFLLRYHAFVDGSNSGAAGAS-RFQ-EEDSKAWSRPFQV 185
             L G L ++IF L+W+VL   F L+Y+A+VDG +  A   + + Q EE+ K W  PFQV
Sbjct:   126 LGGGLCIVIFCLDWIVLVCAFFLKYYAYVDGGDGVAMKRTGKVQSEENPKDWPWPFQV 183




GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
TAIR|locus:2076849 AT3G02640 "AT3G02640" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2147376 AT5G36710 "AT5G36710" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2832155 AT5G36800 "AT5G36800" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 187
KOG3882237 consensus Tetraspanin family integral membrane pro 95.75
PF00335221 Tetraspannin: Tetraspanin family RDS_ROM1 subfamil 95.6
>KOG3882 consensus Tetraspanin family integral membrane protein [General function prediction only] Back     alignment and domain information
Probab=95.75  E-value=0.023  Score=46.23  Aligned_cols=97  Identities=14%  Similarity=0.114  Sum_probs=75.7

Q ss_pred             hhHHHHHHHHHHHHhhhhhHHHHHhhhhcccc-----------hhHHHHHHHHHHHHHHHhhcCcccccCCCchHHHHHH
Q 040328           20 RKHKLLLISNYILLAASSSFIFLTLSLRLFPS-----------LVGFFLILLHVLTILIAVSSLSVATSSSNKCYAAHMV   88 (187)
Q Consensus        20 ~t~k~fl~~NYilL~aAsscIfltLsLRL~PS-----------~~G~~lIllhalTi~~avsGCs~~~~~~~~~ytaHmv   88 (187)
                      -.+..+.+.|++..-.+...+.+.+-+|..++           ...+.+|.+.+..++.+..||-++--.+......-++
T Consensus         8 ~~K~~lf~~N~~~~l~G~~ll~~giw~~~~~~~~~~~~~~~~~~~~~ili~~G~v~~~v~flGc~Ga~~es~~lL~~y~~   87 (237)
T KOG3882|consen    8 CLKYLLFLLNLLFWLLGLLLLAVGIWLLADKGFLSSLLESDFLVPAYILIAVGGVVFLVGFLGCCGALRESRCLLLSYFI   87 (237)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHhhhheeEeccchhhccccchhcchhhhhhhhHHHHHHHHhhhhhhHhhhHHHHHHHHH
Confidence            45667888999999888888887777776654           5788999999999999999987664333345566777


Q ss_pred             HHHHHHHhhcceeheehhhhhhHHhhhc
Q 040328           89 ATVLTGIFQGSVSVLIFASTSDFLGYLK  116 (187)
Q Consensus        89 ~t~LTaIfqGs~svliFtrt~dfl~~LK  116 (187)
                      ...+--|.|.+.+++.|+..+.+.++++
T Consensus        88 ~l~l~~i~e~~~~i~~~~~~~~l~~~~~  115 (237)
T KOG3882|consen   88 LLLLLFIAELAAGILAFVFRDSLRDELE  115 (237)
T ss_pred             HHHHHHHHHHHHHHHhheeHHHHHHHHH
Confidence            8888889999999999977666666554



>PF00335 Tetraspannin: Tetraspanin family RDS_ROM1 subfamily; InterPro: IPR018499 A number of eukaryotic CD antigens have been shown to be related [] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00