Citrus Sinensis ID: 021734


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------31
MVFSNSWILPLKVFFISTGVLSIALFCKSSVPFVMDFSVSRAPVIWSSFVSWLKPPYLYIIINAIIIIIAASSHLYQNDHVPSTDSTPSDVEYEMKYEQQQMIVVAEEENKATVFEEKSVVVSGDDAQVEVGNYGDAAPWTPPQRTDSLEILTDFHLLAEEEKPLVSARFGHRKPIKSSPEDGKKLRVTKPRRHETLENTWKTITEGRAMPLTRHMKKSDTWENHGRQVNVDPLLVKKSETFKDRTNFQLPPVNNSPGSAGKLRKEPSLSQDELNRRVEAFIKKFNDEMRLQRQESLRQYQEMISRGI
ccccccccHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHcccHHHHHHHHHHccccHHHHHHHHHHHHHHHHccccccccccccccccccHHHHHcccccccccHHHccccccccccccccccccccHHHccccccccccccccccccHHHcccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccc
cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccHEEEEEHHEHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccEEEcccccccccccccccccccccccccccccccHHHcccccccccccccccccccccccccccccccEEccccccHHHHHHHHHHHcccccccHHHccccccccccccccccccHHHccccccccccccccccccccccccccEEccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccc
mvfsnswilplkVFFISTGVLSIALFCkssvpfvmdfsvsrapviwssfvswlkppylyIIINAIIIIIAASshlyqndhvpstdstpsdvEYEMKYEQQQMIVVAEEEnkatvfeeksvvvsgddaqvevgnygdaapwtppqrtdslEILTDFHLLAeeekplvsarfghrkpiksspedgkklrvtkprrhetlENTWKTitegrampltrhmkksdtwenhgrqvnvdpllvkksetfkdrtnfqlppvnnspgsagklrkepslsqDELNRRVEAFIKKFNDEMRLQRQESLRQYQEMISRGI
mvfsnswilpLKVFFISTGVLSIALFCKSSVPFVMDFSVSRAPVIWSSFVSWLKPPYLYIIINAIIIIIAASSHLYQNDHVPSTDSTPSDVEYEMKYEQQQMIVVAEEENKATVFEEksvvvsgddaQVEVGNYGDAAPWTPPQRTDSLEILTDFHLLAEEEKplvsarfghrkpiksspedgkklrvtkprrhetlentwktitegrampltrhmkksdtwenhgrqvnvdpllvKKSETfkdrtnfqlppvnnspgsagklrkepslsqdelNRRVEAFIKKFNDEMRLQRQESLRQYQEMISRGI
MVFSNSWILPLKVFFISTGVLSIALFCKSSVPFVMDFSVSRAPVIWSSFVSWLKPPYLYiiinaiiiiiaaSSHLYQNDHVPSTDSTPSDVEYEMKYEQQQMIVVAEEENKATVFEEKSVVVSGDDAQVEVGNYGDAAPWTPPQRTDSLEILTDFHLLAEEEKPLVSARFGHRKPIKSSPEDGKKLRVTKPRRHETLENTWKTITEGRAMPLTRHMKKSDTWENHGRQVNVDPLLVKKSETFKDRTNFQLPPVNNSPGSAGKLRKEPSLSQDELNRRVEAFIKKFNDEMRLQRQESLRQYQEMISRGI
***SNSWILPLKVFFISTGVLSIALFCKSSVPFVMDFSVSRAPVIWSSFVSWLKPPYLYIIINAIIIIIAASSHLYQ************************MIVVA*****ATVFEEKSVVVSGDDAQVEVGNYGDAAPWT*****DSLEILTDFHLLA*****************************************************************************************************************************************************
******W*LPLKVFFISTGVLSIALFCKSSVPFVMDFSVSRAPVIWSSFVSWLKPPYLYIIINAIIIIIAA*************************************************************************************************************************************************************************************************************RVEAFIKKFNDEMRL*******QYQEMISRG*
MVFSNSWILPLKVFFISTGVLSIALFCKSSVPFVMDFSVSRAPVIWSSFVSWLKPPYLYIIINAIIIIIAASSHLYQND**********DVEYEMKYEQQQMIVVAEEENKATVFEEKSVVVSGDDAQVEVGNYGDAAPWTPPQRTDSLEILTDFHLLAEEEKPLVSARFGHRKPIKSSPEDGKKLRVTKPRRHETLENTWKTITEGRAMPLTRHMKKSDTWENHGRQVNVDPLLVKKSETFKDRTNFQLPPVNN**************SQDELNRRVEAFIKKFNDEMRLQRQESLRQYQEMISRGI
*****SWILPLKVFFISTGVLSIALFCKSSVPFVMDFSVSRAPVIWSSFVSWLKPPYLYIIINAIIIIIAASSH*******************************************KSVV********************************D*H****************************KLRVTKPRRHETLENTWKTITEGRAMPLT**MKKSDTWE********************************************SLSQDELNRRVEAFIKKFNDEMRLQRQESLRQYQEMISR**
iiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiHHHHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MVFSNSWILPLKVFFISTGVLSIALFCKSSVPFVMDFSVSRAPVIWSSFVSWLKPPYLYIIINAIIIIIAASSHLYQNDHVPSTDSTPSDVEYEMKYEQQQMIVVAEEENKATVFEEKSVVVSGDDAQVEVGNYGDAAPWTPPQRTDSLEILTDFHLLAEEEKPLVSARFGHRKPIKSSPEDGKKLRVTKPRRHETLENTWKTITEGRAMPLTRHMKKSDTWENHGRQVNVDPLLVKKSETFKDRTNFQLPPVNNSPGSAGKLRKEPSLSQDELNRRVEAFIKKFNDEMRLQRQESLRQYQEMISRGI
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query308
68131790331 fiber expressed protein [Gossypium hirsu 0.961 0.894 0.568 6e-93
255560836338 conserved hypothetical protein [Ricinus 0.961 0.875 0.540 2e-87
3264828331 cotton fiber expressed protein 1 [Gossyp 0.961 0.894 0.563 4e-86
224114045330 predicted protein [Populus trichocarpa] 0.964 0.9 0.573 6e-81
359496721330 PREDICTED: uncharacterized protein LOC10 0.957 0.893 0.551 9e-78
118481177354 unknown [Populus trichocarpa] 0.964 0.838 0.530 2e-77
147766669407 hypothetical protein VITISV_003953 [Viti 0.977 0.739 0.549 2e-76
225461324320 PREDICTED: uncharacterized protein LOC10 0.957 0.921 0.518 8e-69
302144236304 unnamed protein product [Vitis vinifera] 0.886 0.898 0.509 1e-68
3264830275 cotton fiber expressed protein 2 [Gossyp 0.785 0.88 0.528 5e-68
>gi|68131790|gb|AAY85179.1| fiber expressed protein [Gossypium hirsutum] Back     alignment and taxonomy information
 Score =  346 bits (888), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 194/341 (56%), Positives = 231/341 (67%), Gaps = 45/341 (13%)

Query: 1   MVFSNSWILPLKVFFISTGVLSIALFCKSSVPFVMDFSVSRAPVIWSSFVSWLKPPYLYI 60
           M  +++WIL LKVF ISTG+L I L  K SVP V++FSVS+AP+ WS F SWLKPPYLY+
Sbjct: 1   MATASTWILSLKVFLISTGILGIVLGLKISVPLVLEFSVSQAPLWWSGFRSWLKPPYLYV 60

Query: 61  IINAIIIIIAASSHLYQNDHVP------------STDSTPSDVEYEMK------------ 96
           +IN III IAASS   QN+               S D  P  VEYE K            
Sbjct: 61  VINGIIITIAASSRFNQNNGEKDQMEQMQPRPKISADQQPM-VEYETKSGWDSDAVESSD 119

Query: 97  --YEQQQMIVVAEEENKATVFEEKSVVVSGDDAQVEVGN--YGDAAPWTPPQRTDSLEIL 152
             YE+ Q      EE +  V EE+S V   DD     GN      + W PP RTDS EI 
Sbjct: 120 FVYEENQR----GEEVETRVSEEESNVAVKDDRD---GNEFVISKSEWIPPSRTDSSEIP 172

Query: 153 TDFHLLAEEEKPLVSARFGHRKPIKSSPEDGKKLRVTKPRRHETLENTWKTITEGRAMPL 212
            D  L+  +EKP  S+RFGHRKP+K++PE G+ L+  KP+RHETLENTWK ITEG++MPL
Sbjct: 173 LDALLI--QEKPAPSSRFGHRKPVKANPEGGRALKAAKPKRHETLENTWKMITEGKSMPL 230

Query: 213 TRHMKKSDTWENHGRQVNVDPL----LVKKSETFKDRTNFQLPP--VNNSPGSAGKLRKE 266
           +RH+KKSDTWENHGR +N++ L    L+KKSETF+DRTN+QLPP  V++ P S GKLRKE
Sbjct: 231 SRHLKKSDTWENHGRDINMEALTSSPLMKKSETFRDRTNYQLPPEQVSSFPAS-GKLRKE 289

Query: 267 PSLSQDELNRRVEAFIKKFNDEMRLQRQESLRQYQEMISRG 307
           PSLSQDELNRRVEAFIKKFNDEMRLQRQESL QY EM+ RG
Sbjct: 290 PSLSQDELNRRVEAFIKKFNDEMRLQRQESLNQYMEMVDRG 330




Source: Gossypium hirsutum

Species: Gossypium hirsutum

Genus: Gossypium

Family: Malvaceae

Order: Malvales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255560836|ref|XP_002521431.1| conserved hypothetical protein [Ricinus communis] gi|223539330|gb|EEF40921.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|3264828|gb|AAC33276.1| cotton fiber expressed protein 1 [Gossypium hirsutum] Back     alignment and taxonomy information
>gi|224114045|ref|XP_002316652.1| predicted protein [Populus trichocarpa] gi|222859717|gb|EEE97264.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|359496721|ref|XP_002273372.2| PREDICTED: uncharacterized protein LOC100244739 [Vitis vinifera] Back     alignment and taxonomy information
>gi|118481177|gb|ABK92540.1| unknown [Populus trichocarpa] Back     alignment and taxonomy information
>gi|147766669|emb|CAN65070.1| hypothetical protein VITISV_003953 [Vitis vinifera] Back     alignment and taxonomy information
>gi|225461324|ref|XP_002284551.1| PREDICTED: uncharacterized protein LOC100267532 [Vitis vinifera] Back     alignment and taxonomy information
>gi|302144236|emb|CBI23474.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|3264830|gb|AAC33277.1| cotton fiber expressed protein 2 [Gossypium hirsutum] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query308
TAIR|locus:2008445344 AT1G61260 "AT1G61260" [Arabido 0.616 0.552 0.497 2.6e-61
TAIR|locus:2202094308 AT1G11210 "AT1G11210" [Arabido 0.935 0.935 0.373 2.1e-36
TAIR|locus:2162570326 AT5G54300 "AT5G54300" [Arabido 0.493 0.466 0.476 1e-29
TAIR|locus:2202079310 AT1G11220 "AT1G11220" [Arabido 0.912 0.906 0.318 2.9e-25
TAIR|locus:2138416303 AT4G04990 "AT4G04990" [Arabido 0.129 0.132 0.65 2.5e-06
TAIR|locus:2008445 AT1G61260 "AT1G61260" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 444 (161.4 bits), Expect = 2.6e-61, Sum P(2) = 2.6e-61
 Identities = 108/217 (49%), Positives = 146/217 (67%)

Query:    98 EQQQMIVVAEEENKATVFEE-KSVVVSGDDAQVEVGNYGDAAPWTPPQRTDSLEILTDFH 156
             E+++ I+ +    +  + EE KSV++  +   VE     D  P  PP   +S E L    
Sbjct:   149 EEEKKIIDSAATAEDEIEEELKSVIMVENSDLVE----SDVIP--PPMMIES-ENLPPI- 200

Query:   157 LLAEEEKPLVSARFGHRKPIKSSPEDGKKLRVTKPRRHETLENTWKTITEGRAMPLTRHM 216
                  EKPLV++RFGHRK +K+S E G+ LRVTKP+++ETLENTWK ITEG++ PLTR +
Sbjct:   201 -----EKPLVTSRFGHRKLMKASQEGGRALRVTKPKKNETLENTWKMITEGKSTPLTRQL 255

Query:   217 -KKSDTW---ENHGRQVNVDPLLVKKSETFKDRTNF-QLPPVNNSPGSAGKLRKEPSLSQ 271
              ++SDT+   ++ G    V P+  KKS+TF+DRTN+ QL           K+RKEPSLSQ
Sbjct:   256 YRRSDTFGRGDSGGVDGEVKPVY-KKSDTFRDRTNYYQL-------AETAKVRKEPSLSQ 307

Query:   272 DELNRRVEAFIKKFNDEMRLQRQESLRQYQEMISRGI 308
             +ELNRRVEAFIKKFN+EM+LQR ESLRQY+E+ SRG+
Sbjct:   308 EELNRRVEAFIKKFNEEMKLQRMESLRQYKEITSRGV 344


GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0010200 "response to chitin" evidence=RCA
TAIR|locus:2202094 AT1G11210 "AT1G11210" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2162570 AT5G54300 "AT5G54300" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2202079 AT1G11220 "AT1G11220" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2138416 AT4G04990 "AT4G04990" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query308
pfam0555338 pfam05553, DUF761, Cotton fibre expressed protein 3e-16
pfam1436434 pfam14364, DUF4408, Domain of unknown function (DU 1e-12
>gnl|CDD|218633 pfam05553, DUF761, Cotton fibre expressed protein Back     alignment and domain information
 Score = 71.0 bits (175), Expect = 3e-16
 Identities = 23/38 (60%), Positives = 33/38 (86%)

Query: 270 SQDELNRRVEAFIKKFNDEMRLQRQESLRQYQEMISRG 307
           S DE++RR E FI +F +++RLQRQESL++YQEM++RG
Sbjct: 1   SDDEVDRRAEEFIARFREQLRLQRQESLKRYQEMLARG 38


This family consists of several plant proteins of unknown function. Three of the sequences (from Gossypium hirsutum) in this family are described as cotton fibre expressed proteins. The remaining sequences, found in Arabidopsis thaliana, are uncharacterized. Length = 38

>gnl|CDD|206532 pfam14364, DUF4408, Domain of unknown function (DUF4408) Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 308
PF0555338 DUF761: Cotton fibre expressed protein; InterPro: 99.68
PF1436434 DUF4408: Domain of unknown function (DUF4408) 99.48
>PF05553 DUF761: Cotton fibre expressed protein; InterPro: IPR008480 This family consists of several plant proteins of unknown function Back     alignment and domain information
Probab=99.68  E-value=4.8e-17  Score=113.07  Aligned_cols=38  Identities=63%  Similarity=1.069  Sum_probs=36.6

Q ss_pred             ChhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q 021734          270 SQDELNRRVEAFIKKFNDEMRLQRQESLRQYQEMISRG  307 (308)
Q Consensus       270 ~~~evd~rAE~FI~kF~~qlrLQRqeS~~~y~eml~Rg  307 (308)
                      +++|||++||+||+|||+|||||||+|+++|+||++||
T Consensus         1 ~~~evd~rAe~FI~~f~~qlrlqr~~S~~ry~eml~Rg   38 (38)
T PF05553_consen    1 SDDEVDRRAEEFIAKFREQLRLQRQESLQRYQEMLARG   38 (38)
T ss_pred             CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence            46899999999999999999999999999999999998



Three of the sequences from Gossypium hirsutum (Upland cotton) in this family are described as G. hirsutum fibre expressed proteins []. The remaining sequences, found in Arabidopsis thaliana, are uncharacterised.

>PF14364 DUF4408: Domain of unknown function (DUF4408) Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00