Citrus Sinensis ID: 022680


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290---
MSQSSQHVFTNSPPSCSSRFTFYHSATNLIKNPNNLALPPPITSRHLLIHISHNDYTPFKFIPNSKFYHSLKPNNCSNTSVVTTQTNDNFNLDSLLSISEVLWSRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQEQQQKQLELILAIGKTGKLFENRQEPSQEQDKLKTSDFIDGAKQMETQETEAFGSSRGNKNDRA
cccccccccccccccccccHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccHHHHHcccccccHHHHHHHHHHcccccccccccccc
cccccEEEEcccccccccHccHHHHHHHHcccccccccccccccEEEEEEccccccccccccccccEEEEEEEccccccccEcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccEEEEEHHHHccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccHcccccccccccccccccccccccEcccccccccc
msqssqhvftnsppscssrftFYHSATNliknpnnlalpppitsrhllihishndytpfkfipnskfyhslkpnncsntsvvttqtndnfnldsLLSISEVLWSRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQEQQQKQLELILAIGKTgklfenrqepsqeqdklktsdfidgaKQMETQEteafgssrgnkndra
msqssqhvftnsppscsSRFTFYHSATNLIKNPNNLALPPPITSRHLLIHISHNDYTPFKFIPNSKFYHSLKPNNCSNTSVVTTQTNDNFNLDSLLSISEVLWSRVLACGVVSLVCGVWigaiirrrqwrrvcgekaraegresvnlvgriEKLEEDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQEQQQKQLELILAIGKTGKLfenrqepsqeqdklktsdFIDGAKQMEtqeteafgssrgnkndra
MSQSSQHVFTNSPPSCSSRFTFYHSATNLIKNPNNLALPPPITSRHLLIHISHNDYTPFKFIPNSKFYHSLKPNNCSNTSVVTTQTNDNFNLDSLLSISEVLWSRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVllamqeqqqkqlelilaiGKTGKLFENRQEPSQEQDKLKTSDFIDGAKQMETQETEAFGSSRGNKNDRA
******************RFTFYHSATNLIKNPNNLALPPPITSRHLLIHISHNDYTPFKFIPNSKFYHSLKPNNCSNTSVVTTQTNDNFNLDSLLSISEVLWSRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEKARA*****VNLVGRIE*********ATILRVLSRQLEKLGVRFRVTRKALKDPITQ******************GDVLEKELGEIQKVLLAMQEQQQKQLELILAIGKTGK**********************************************
******H**TNSPPSCSSRFTFYHSATNLIKNPNNLALPPPITSRHLLIHISHNDYTPFKFIPNSKFYHSLK****************NFNLDSLLSISEVLWSRVLACGVVSLVCGVWIGAIIRRR**********************RIEKLEEDMKS*ATILRVLSRQLEKLGVRFRVTRKALKD*****************ALAM****************L**QEQQQKQLELILAIGKT************************************************
**************SCSSRFTFYHSATNLIKNPNNLALPPPITSRHLLIHISHNDYTPFKFIPNSKFYHSLKPNNCSNTSVVTTQTNDNFNLDSLLSISEVLWSRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQEQQQKQLELILAIGKTGKLFE************KTSDFIDGAKQMETQ****************
*****QHVFTNSPPSCSSRFTFYHSATNLIKNPNNLALPPPITSRHLLIHISHNDYTPFKFIPNSKFYHSLKPNNCSNTSVVTTQTNDNFNLDSLLSISEVLWSRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEK*******SVNLVGRIEKLEEDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQEQQQKQLELILAIGKTGKLFE*******************************************
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSQSSQHVFTNSPPSCSSRFTFYHSATNLIKNPNNLALPPPITSRHLLIHISHNDYTPFKFIPNSKFYHSLKPNNCSNTSVVTTQTNDNFNLDSLLSISEVLWSRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVNLVGRxxxxxxxxxxxxxxxxxxxxxLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVxxxxxxxxxxxxxxxxxxxxxQLELILAIGKTGKLFENRQEPSQEQDKLKTSDFIDGAKQMETQETEAFGSSRGNKNDRA
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query293
224083616307 predicted protein [Populus trichocarpa] 0.924 0.882 0.482 7e-58
356512685293 PREDICTED: uncharacterized protein LOC10 0.529 0.529 0.707 3e-53
359496689193 PREDICTED: uncharacterized protein LOC10 0.651 0.989 0.626 3e-53
356525459287 PREDICTED: uncharacterized protein LOC10 0.573 0.585 0.660 1e-52
449451421323 PREDICTED: uncharacterized protein LOC10 0.825 0.749 0.478 3e-52
449520022246 PREDICTED: uncharacterized LOC101207421, 0.552 0.658 0.664 2e-50
18424929300 uncharacterized protein [Arabidopsis tha 0.805 0.786 0.455 5e-49
224093738267 predicted protein [Populus trichocarpa] 0.699 0.767 0.5 3e-45
297797551299 hypothetical protein ARALYDRAFT_496756 [ 0.805 0.789 0.444 3e-45
296085648139 unnamed protein product [Vitis vinifera] 0.474 1.0 0.664 5e-43
>gi|224083616|ref|XP_002307074.1| predicted protein [Populus trichocarpa] gi|222856523|gb|EEE94070.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  229 bits (585), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 152/315 (48%), Positives = 199/315 (63%), Gaps = 44/315 (13%)

Query: 6   QHVFTNSPPSCSSRFTFYHSATNLIKNPNNLALPPPITSRHLLIHISHNDYTPFKFIPNS 65
            H+FTNSPP  +  F+    +   +             SRH+   +  +++      P +
Sbjct: 8   HHLFTNSPPRITLLFSSSSLSLRNLS-----------LSRHVTTSLHSSNFHFKPQTPRN 56

Query: 66  KFYHSLKPNNCSNTSVVTTQTNDNFNLDSLLSISEVL-----------------WSRVLA 108
            F  +LK      T  + TQ ++ FNLD  LSI+E+L                 +S+  A
Sbjct: 57  SFSFTLKAYQSDPT--IRTQVSNQFNLDQFLSIAELLCIISSSIITISYALNCTFSKTGA 114

Query: 109 CGVVSLVCG-VW----------IGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEED 157
            GV+    G  W          IGA IRRRQW R+C E  R EG  S+NLVGRIEKLE+D
Sbjct: 115 LGVIGSNTGFAWGMVVMVSGVVIGAWIRRRQWWRICRETGR-EG--SLNLVGRIEKLEQD 171

Query: 158 MKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKE 217
           M+SSATI+RVLSRQLEKLG+RFRVTRKALK+PI + AALAQKNSEATRALA+Q ++LEKE
Sbjct: 172 MRSSATIIRVLSRQLEKLGIRFRVTRKALKEPIVETAALAQKNSEATRALALQENILEKE 231

Query: 218 LGEIQKVLLAMQEQQQKQLELILAIGKTGKLFENRQEPSQEQDKLKTSDFIDGAKQMETQ 277
           LGE QK+LLAMQEQQQKQLELILAIGK+GK ++NR+E  +EQ+ +KTSD  +G  Q+E+ 
Sbjct: 232 LGETQKILLAMQEQQQKQLELILAIGKSGKSWDNRRERVEEQELIKTSDLTEGVNQLESH 291

Query: 278 ETEAFGSSRGNKNDR 292
           E +   +S+ + N+R
Sbjct: 292 EAQPSVTSKRSNNNR 306




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|356512685|ref|XP_003525047.1| PREDICTED: uncharacterized protein LOC100790782 [Glycine max] Back     alignment and taxonomy information
>gi|359496689|ref|XP_002269557.2| PREDICTED: uncharacterized protein LOC100244969 [Vitis vinifera] Back     alignment and taxonomy information
>gi|356525459|ref|XP_003531342.1| PREDICTED: uncharacterized protein LOC100809936 [Glycine max] Back     alignment and taxonomy information
>gi|449451421|ref|XP_004143460.1| PREDICTED: uncharacterized protein LOC101207421 [Cucumis sativus] Back     alignment and taxonomy information
>gi|449520022|ref|XP_004167033.1| PREDICTED: uncharacterized LOC101207421, partial [Cucumis sativus] Back     alignment and taxonomy information
>gi|18424929|ref|NP_569009.1| uncharacterized protein [Arabidopsis thaliana] gi|10178187|dbj|BAB11661.1| unnamed protein product [Arabidopsis thaliana] gi|15028313|gb|AAK76633.1| unknown protein [Arabidopsis thaliana] gi|19310695|gb|AAL85078.1| unknown protein [Arabidopsis thaliana] gi|332010646|gb|AED98029.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|224093738|ref|XP_002309969.1| predicted protein [Populus trichocarpa] gi|222852872|gb|EEE90419.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|297797551|ref|XP_002866660.1| hypothetical protein ARALYDRAFT_496756 [Arabidopsis lyrata subsp. lyrata] gi|297312495|gb|EFH42919.1| hypothetical protein ARALYDRAFT_496756 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|296085648|emb|CBI29447.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query293
TAIR|locus:2171865300 AT5G65250 "AT5G65250" [Arabido 0.539 0.526 0.530 2.2e-42
TAIR|locus:2171865 AT5G65250 "AT5G65250" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 388 (141.6 bits), Expect = 2.2e-42, Sum P(2) = 2.2e-42
 Identities = 88/166 (53%), Positives = 110/166 (66%)

Query:   105 RVLACGVVSLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSATI 164
             +VL+ G V LV  V  G+ +RRRQW R+C     +EG    NL+ R+EKLE+D+KSS +I
Sbjct:   129 KVLSLGFVGLVGSVATGSWLRRRQWMRICKGARESEG---TNLIRRLEKLEKDLKSSTSI 185

Query:   165 LRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKV 224
             +RVLSR LEKLG+RFRVTRKALK+PI++ AALAQKNSEATR L  Q ++LEKELGEIQKV
Sbjct:   186 VRVLSRHLEKLGIRFRVTRKALKEPISETAALAQKNSEATRVLVAQQEILEKELGEIQKV 245

Query:   225 XXXXXXXXXXXXXXXXXXGKTGKLFEN----RQEPSQEQDKLKTSD 266
                                K+ KLFE+    +Q PS EQ K K  +
Sbjct:   246 LLAMQEQQRKQLELILTIAKSSKLFESSTSSKQSPS-EQRKNKAEE 290


GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0009941 "chloroplast envelope" evidence=IDA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 293
PRK06975 656 bifunctional uroporphyrinogen-III synthetase/uropo 88.95
PF10805106 DUF2730: Protein of unknown function (DUF2730); In 85.8
cd07628185 BAR_Atg24p The Bin/Amphiphysin/Rvs (BAR) domain of 85.11
TIGR03495135 phage_LysB phage lysis regulatory protein, LysB fa 83.32
PF06295128 DUF1043: Protein of unknown function (DUF1043); In 82.43
>PRK06975 bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed Back     alignment and domain information
Probab=88.95  E-value=3.3  Score=42.39  Aligned_cols=48  Identities=17%  Similarity=0.164  Sum_probs=37.3

Q ss_pred             ccchHHHHHHHHhhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHH
Q 022680          186 LKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQEQQQ  233 (293)
Q Consensus       186 LrdPI~etaAlAqknSeatraLA~redlLEkEL~eiQ~vl~AMQeqq~  233 (293)
                      +..+..|...++++..+..+.+..|.+.||.++.+.++-..++|++-+
T Consensus       362 ~~~~~~e~~~~~~~~~~~~~~~~~~l~~le~~l~~~~~~~~~L~~~~~  409 (656)
T PRK06975        362 NDAQTAELRVKTEQAQASVHQLDSQFAQLDGKLADAQSAQQALEQQYQ  409 (656)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344555666667777778888999999999999999988888877654



>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins Back     alignment and domain information
>cd07628 BAR_Atg24p The Bin/Amphiphysin/Rvs (BAR) domain of yeast Sorting Nexin Atg24p Back     alignment and domain information
>TIGR03495 phage_LysB phage lysis regulatory protein, LysB family Back     alignment and domain information
>PF06295 DUF1043: Protein of unknown function (DUF1043); InterPro: IPR009386 This entry consists of several hypothetical bacterial proteins of unknown function Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00