Citrus Sinensis ID: 031579


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------
MGGCLGCFIRPPVNTSASNLSKELGVPGRSAGKRSISEDFWTTSTCDMDNSAVQSQGSISSLGTNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWIGNKKAENRTHQVREPKLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD
cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHccccccccccccccccccccccHHHHHcccccccccccHHHHHHHHHHHHHHccccc
cccccccccccccEEEEccccccccEcccEEEcccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHcccccccccccccccEEcccccHHHHccccccccccccHHHHHHHHHHHHHHccccc
mggclgcfirppvntsasnlskelgvpgrsagkrsisedfwttstcdmdnsavqsqgsisslgtnnlipdtnggsgsasapsefvnhgflLWNQTRQRWIGnkkaenrthqvrepklnwnATYESllgsnkpfpqpiplsEMVDFLVDIWEQEGMYD
mggclgcfirppvntsasnlskelgvpgrsagkrsISEDFWTTSTCDMDNSAVQSQGSISSLGTNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWIGNKkaenrthqvrepklNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD
MGGCLGCFIRPPVNTSASNLSKELGVPGRSAGKRSISEDFWTTSTCDMDNSAVQSQGSISSLGTNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWIGNKKAENRTHQVREPKLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD
***CLGCFIR*************************************************************************FVNHGFLLWNQTRQRWIGNK************KLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWE******
MGGCLG*FIR******************************WT********************************************HGFLLWNQTRQ**********************NATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMY*
MGGCLGCFIRPPVNTSASNLSKE**********RSISEDFWTTSTCDMDNSAVQSQGSISSLGTNNLIPDTN********PSEFVNHGFLLWNQTRQRWIGNKKAENRTHQVREPKLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD
**GCLGCFIRPPVNTSASNLSKELGVPGRSAGK**IS*******************************************PSEFVNHGFLLWNQTRQRWIGNKK*******VREPKLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGGCLGCFIRPPVNTSASNLSKELGVPGRSAGKRSISEDFWTTSTCDMDNSAVQSQGSISSLGTNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWIGNKKAENRTHQVREPKLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query157
351720932160 uncharacterized protein LOC547651 [Glyci 1.0 0.981 0.689 1e-56
255553169190 conserved hypothetical protein [Ricinus 0.980 0.810 0.703 3e-54
351721931156 uncharacterized protein LOC100500161 [Gl 0.980 0.987 0.681 4e-54
388517837159 unknown [Lotus japonicus] 1.0 0.987 0.656 7e-54
357468215156 hypothetical protein MTR_4g010330 [Medic 0.961 0.967 0.656 4e-53
224059178160 predicted protein [Populus trichocarpa] 0.987 0.968 0.690 6e-52
356547434193 PREDICTED: uncharacterized protein LOC10 0.968 0.787 0.664 1e-51
357468213 274 hypothetical protein MTR_4g010330 [Medic 0.949 0.543 0.651 2e-51
351724527174 uncharacterized protein LOC100499770 [Gl 1.0 0.902 0.610 5e-51
449436152155 PREDICTED: uncharacterized protein LOC10 0.987 1.0 0.643 6e-51
>gi|351720932|ref|NP_001235913.1| uncharacterized protein LOC547651 [Glycine max] gi|22597160|gb|AAN03467.1| hypothetical protein [Glycine max] Back     alignment and taxonomy information
 Score =  224 bits (570), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 109/158 (68%), Positives = 124/158 (78%), Gaps = 1/158 (0%)

Query: 1   MGGCLGCFIRPPVNTSASNLSKELGVPGRSAGKRSISEDFWTTSTCDMDNSAVQSQGSIS 60
           MGGC+GC+ RP ++ +    SK L   G    K S SEDFWTTST DMDNSAVQSQGSIS
Sbjct: 3   MGGCVGCYKRPTLSATVDVPSKGLATQGNGVRKPSSSEDFWTTSTHDMDNSAVQSQGSIS 62

Query: 61  SLG-TNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWIGNKKAENRTHQVREPKLNW 119
           S   TN  +    G S S++ P+EFVNHG +LWNQTRQRW+GNKK ENRT Q+REPKLNW
Sbjct: 63  SASLTNQAVVLHAGSSKSSTNPTEFVNHGVILWNQTRQRWVGNKKLENRTQQLREPKLNW 122

Query: 120 NATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD 157
           NATYESLLGSNKPF QPIPL+EMVDFLVDIWEQ+G+YD
Sbjct: 123 NATYESLLGSNKPFHQPIPLAEMVDFLVDIWEQDGLYD 160




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255553169|ref|XP_002517627.1| conserved hypothetical protein [Ricinus communis] gi|223543259|gb|EEF44791.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|351721931|ref|NP_001237738.1| uncharacterized protein LOC100500161 [Glycine max] gi|255629494|gb|ACU15093.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|388517837|gb|AFK46980.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|357468215|ref|XP_003604392.1| hypothetical protein MTR_4g010330 [Medicago truncatula] gi|355505447|gb|AES86589.1| hypothetical protein MTR_4g010330 [Medicago truncatula] gi|388496552|gb|AFK36342.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|224059178|ref|XP_002299754.1| predicted protein [Populus trichocarpa] gi|118485078|gb|ABK94402.1| unknown [Populus trichocarpa] gi|222847012|gb|EEE84559.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356547434|ref|XP_003542117.1| PREDICTED: uncharacterized protein LOC100802439 [Glycine max] Back     alignment and taxonomy information
>gi|357468213|ref|XP_003604391.1| hypothetical protein MTR_4g010330 [Medicago truncatula] gi|355505446|gb|AES86588.1| hypothetical protein MTR_4g010330 [Medicago truncatula] Back     alignment and taxonomy information
>gi|351724527|ref|NP_001235781.1| uncharacterized protein LOC100499770 [Glycine max] gi|255626433|gb|ACU13561.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|449436152|ref|XP_004135858.1| PREDICTED: uncharacterized protein LOC101208775 isoform 3 [Cucumis sativus] gi|449491036|ref|XP_004158780.1| PREDICTED: uncharacterized LOC101208775 isoform 3 [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query157
TAIR|locus:2145442169 AT5G25360 "AT5G25360" [Arabido 0.961 0.893 0.582 7.9e-46
TAIR|locus:2037878154 AT1G15350 "AT1G15350" [Arabido 0.974 0.993 0.563 8.4e-42
TAIR|locus:2093292162 AT3G15770 "AT3G15770" [Arabido 0.974 0.944 0.518 1.3e-38
TAIR|locus:4515103481161 AT4G32342 "AT4G32342" [Arabido 0.891 0.869 0.471 2.7e-29
TAIR|locus:214268998 AT5G03440 "AT5G03440" [Arabido 0.541 0.867 0.449 2e-17
TAIR|locus:2082707112 AT3G54880 "AT3G54880" [Arabido 0.611 0.857 0.387 2.5e-17
TAIR|locus:2145442 AT5G25360 "AT5G25360" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 481 (174.4 bits), Expect = 7.9e-46, P = 7.9e-46
 Identities = 92/158 (58%), Positives = 114/158 (72%)

Query:     1 MGGCLGCFIRPPVNTSASNLSKELGVPGRSAGKRSISEDFWTTSTCDMDNSAVQSQGSIS 60
             MGGC GC  +PP+  +    SK L + GR   K S+SEDFW+TSTC+MDNS +QSQ S+S
Sbjct:    18 MGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEMDNSTLQSQRSMS 77

Query:    61 SLG-TNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWIGNKKAENRTHQVREPKLNW 119
             S+  TNN        S S S P+EFVNHG  LWNQTRQ+W+ N  ++ +  +VREP ++W
Sbjct:    78 SISFTNN-----TSTSASTSNPTEFVNHGLNLWNQTRQQWLANGTSQKKA-KVREPTISW 131

Query:   120 NATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD 157
             NATYESLLG NK F +PIPL EMVDFLVD+WEQEG+YD
Sbjct:   132 NATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGLYD 169




GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
TAIR|locus:2037878 AT1G15350 "AT1G15350" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2093292 AT3G15770 "AT3G15770" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:4515103481 AT4G32342 "AT4G32342" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2142689 AT5G03440 "AT5G03440" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2082707 AT3G54880 "AT3G54880" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
estExt_fgenesh4_pm.C_LG_I0570
SubName- Full=Putative uncharacterized protein; (161 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query157
pfam13259114 pfam13259, DUF4050, Protein of unknown function (D 3e-50
>gnl|CDD|222012 pfam13259, DUF4050, Protein of unknown function (DUF4050) Back     alignment and domain information
 Score =  156 bits (397), Expect = 3e-50
 Identities = 54/119 (45%), Positives = 71/119 (59%), Gaps = 7/119 (5%)

Query: 41  WTTSTCDMDNSAVQSQGSISSLGTNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWI 100
           W+TS  D D+S   SQ SISS  ++           S+  P EFVNHG  LW + R  W 
Sbjct: 1   WSTSPYDFDSSDAVSQRSISSQSSD-----RRKRRKSSEEPEEFVNHGLRLWEERRDAWT 55

Query: 101 GNKKAENRTHQVRE--PKLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD 157
           G +K+   ++ +RE  P L++NA YE L+G NKP  QPI LS+MV FLV  WEQEG++ 
Sbjct: 56  GARKSNEESNPIRESIPPLSYNAIYEKLVGQNKPLKQPINLSDMVKFLVAGWEQEGLWP 114


This family of proteins is functionally uncharacterized. This family of proteins is found in eukaryotes. Proteins in this family are typically between 109 and 173 amino acids in length. There are two conserved sequence motifs: IPL and FLVD. Length = 114

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 157
PF13259122 DUF4050: Protein of unknown function (DUF4050) 100.0
>PF13259 DUF4050: Protein of unknown function (DUF4050) Back     alignment and domain information
Probab=100.00  E-value=5.4e-40  Score=249.16  Aligned_cols=113  Identities=53%  Similarity=0.939  Sum_probs=97.8

Q ss_pred             cccccccccCcccccccccccCCCCCCCCCCCCCCCCCCCCCcchhHHHHHHHHHHHhhhcCCcCCc-------ccccCC
Q 031579           41 WTTSTCDMDNSAVQSQGSISSLGTNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWIGNKKAEN-------RTHQVR  113 (157)
Q Consensus        41 Wsss~~emdns~~~Sq~s~ssis~~n~~~d~~~~~~~~~~~~~FVNhGl~~W~q~R~qWvg~~~~~~-------~~~~~r  113 (157)
                      ||||+++|||+++|+|+++++.   ++..+.+........ ++||||||++|+++|++|||+++...       +....|
T Consensus         1 ~s~s~~~~d~~~~~~~~~~s~~---~~~~~~~~~~~~~~~-~~~~N~Gl~lW~~rR~~W~~~~~~~~~~~~~l~~~~~~r   76 (122)
T PF13259_consen    1 WSSSPFRFDNSDAQSQRSISST---NSSSDCSKRRKSSEE-EEFVNHGLILWEQRRQQWTGPKKSNKIIEPRLQQSNSIR   76 (122)
T ss_pred             CCCCcccCcChhhhhhhccccc---cchhhhhhhhccccC-ccceeHHHHHHHHHHHHHcCCCCcccccccccccccccc
Confidence            9999999999999999999444   444444433323344 99999999999999999999988874       444688


Q ss_pred             C--CceecccchhcccCCCCCCCCCCChHHHHHHHHHHHhhcCCCC
Q 031579          114 E--PKLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD  157 (157)
Q Consensus       114 e--p~iswnatye~LLgsnkpFpqPIPL~EMVDfLVDiWE~EGLyd  157 (157)
                      +  |.++|++|||+||++++||++||||+|||+||||+|++||+||
T Consensus        77 ~~Ip~~~y~~iY~~Lv~~~~pf~~PI~L~dmv~~Lv~gW~~eg~w~  122 (122)
T PF13259_consen   77 ESIPPISYPAIYEKLVGSNKPFKQPIPLSDMVDFLVDGWEEEGLWP  122 (122)
T ss_pred             cccchhhHHHHHHHHhcCCCCCCCCcCHHHHHHHHHHHhhcCCccC
Confidence            8  9999999999999999999999999999999999999999997




Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00