Citrus Sinensis ID: 031414


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160
MHGSSGCLGCFIRPPVNTSASNLSKELGVPGRSAGKRSISEDFWTTSTCDMDNSAVQSQGSISSLGTNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWIGNKKAENRTHQVREPKLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD
ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHccccccccccccccccccccccHHHHHcccccccccccHHHHHHHHHHHHHcccccc
ccccccccccccccccEEEEccccccccccccEEEcccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHcccccccccccccccEEcccccHHHHccccccccccccHHHHHHHHHHHHHHccccc
mhgssgclgcfirppvntsasnlskelgvpgrsagkrsisedfwttstcdmdnsavqsqgsisslgtnnlipdtnggsgsasapsefvnhgflLWNQTRQRWIGnkkaenrthqvrepklnwnATYESllgsnkpfpqpiplsEMVDFLVDIWEQEGMYD
MHGSSGCLGCFIRPPVNtsasnlskelgvpgrsagkrsISEDFWTTSTCDMDNSAVQSQGSISSLGTNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWIGNKkaenrthqvrepklNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD
MHGSSGCLGCFIRPPVNTSASNLSKELGVPGRSAGKRSISEDFWTTSTCDMDNSAVQSQGSISSLGTNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWIGNKKAENRTHQVREPKLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD
******CLGCFIR*************************************************************************FVNHGFLLWNQTRQRWIGNK************KLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWE******
***SSGCLGCFIR******************************WT**T*****************************************HGFLLWNQTRQ**********************NATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMY*
MHGSSGCLGCFIRPPVNTSASNLSKE**********RSISEDFWTTSTCDMDNSAVQSQGSISSLGTNNLIPDTN********PSEFVNHGFLLWNQTRQRWIGNKKAENRTHQVREPKLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD
***SSGCLGCFIRPPVNTSASNLSKEL**PGRSAG***IS******************************************APSEFVNHGFLLWNQTRQRWIGNKK*******VREPKLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MHGSSGCLGCFIRPPVNTSASNLSKELGVPGRSAGKRSISEDFWTTSTCDMDNSAVQSQGSISSLGTNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWIGNKKAENRTHQVREPKLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query160
351720932160 uncharacterized protein LOC547651 [Glyci 0.968 0.968 0.685 2e-55
255553169190 conserved hypothetical protein [Ricinus 0.993 0.836 0.693 3e-55
224059178160 predicted protein [Populus trichocarpa] 1.0 1.0 0.7 6e-55
356547434193 PREDICTED: uncharacterized protein LOC10 0.975 0.808 0.666 2e-53
357468213 274 hypothetical protein MTR_4g010330 [Medic 0.956 0.558 0.654 3e-53
351721931156 uncharacterized protein LOC100500161 [Gl 0.95 0.974 0.677 8e-53
388517837159 unknown [Lotus japonicus] 0.968 0.974 0.651 1e-52
357468215156 hypothetical protein MTR_4g010330 [Medic 0.931 0.955 0.651 9e-52
357468217 288 hypothetical protein MTR_4g010330 [Medic 0.956 0.531 0.602 3e-50
449436148174 PREDICTED: uncharacterized protein LOC10 0.962 0.885 0.641 5e-50
>gi|351720932|ref|NP_001235913.1| uncharacterized protein LOC547651 [Glycine max] gi|22597160|gb|AAN03467.1| hypothetical protein [Glycine max] Back     alignment and taxonomy information
 Score =  219 bits (559), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 107/156 (68%), Positives = 122/156 (78%), Gaps = 1/156 (0%)

Query: 6   GCLGCFIRPPVNTSASNLSKELGVPGRSAGKRSISEDFWTTSTCDMDNSAVQSQGSISSL 65
           GC+GC+ RP ++ +    SK L   G    K S SEDFWTTST DMDNSAVQSQGSISS 
Sbjct: 5   GCVGCYKRPTLSATVDVPSKGLATQGNGVRKPSSSEDFWTTSTHDMDNSAVQSQGSISSA 64

Query: 66  G-TNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWIGNKKAENRTHQVREPKLNWNA 124
             TN  +    G S S++ P+EFVNHG +LWNQTRQRW+GNKK ENRT Q+REPKLNWNA
Sbjct: 65  SLTNQAVVLHAGSSKSSTNPTEFVNHGVILWNQTRQRWVGNKKLENRTQQLREPKLNWNA 124

Query: 125 TYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD 160
           TYESLLGSNKPF QPIPL+EMVDFLVDIWEQ+G+YD
Sbjct: 125 TYESLLGSNKPFHQPIPLAEMVDFLVDIWEQDGLYD 160




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255553169|ref|XP_002517627.1| conserved hypothetical protein [Ricinus communis] gi|223543259|gb|EEF44791.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|224059178|ref|XP_002299754.1| predicted protein [Populus trichocarpa] gi|118485078|gb|ABK94402.1| unknown [Populus trichocarpa] gi|222847012|gb|EEE84559.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356547434|ref|XP_003542117.1| PREDICTED: uncharacterized protein LOC100802439 [Glycine max] Back     alignment and taxonomy information
>gi|357468213|ref|XP_003604391.1| hypothetical protein MTR_4g010330 [Medicago truncatula] gi|355505446|gb|AES86588.1| hypothetical protein MTR_4g010330 [Medicago truncatula] Back     alignment and taxonomy information
>gi|351721931|ref|NP_001237738.1| uncharacterized protein LOC100500161 [Glycine max] gi|255629494|gb|ACU15093.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|388517837|gb|AFK46980.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|357468215|ref|XP_003604392.1| hypothetical protein MTR_4g010330 [Medicago truncatula] gi|355505447|gb|AES86589.1| hypothetical protein MTR_4g010330 [Medicago truncatula] gi|388496552|gb|AFK36342.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|357468217|ref|XP_003604393.1| hypothetical protein MTR_4g010330 [Medicago truncatula] gi|355505448|gb|AES86590.1| hypothetical protein MTR_4g010330 [Medicago truncatula] Back     alignment and taxonomy information
>gi|449436148|ref|XP_004135856.1| PREDICTED: uncharacterized protein LOC101208775 isoform 1 [Cucumis sativus] gi|449491028|ref|XP_004158778.1| PREDICTED: uncharacterized LOC101208775 isoform 1 [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query160
TAIR|locus:2145442169 AT5G25360 "AT5G25360" [Arabido 0.931 0.881 0.576 1.2e-44
TAIR|locus:2037878154 AT1G15350 "AT1G15350" [Arabido 0.943 0.980 0.557 1.2e-40
TAIR|locus:2093292162 AT3G15770 "AT3G15770" [Arabido 0.956 0.944 0.524 6.1e-39
TAIR|locus:4515103481161 AT4G32342 "AT4G32342" [Arabido 0.893 0.888 0.468 1.6e-29
TAIR|locus:214268998 AT5G03440 "AT5G03440" [Arabido 0.531 0.867 0.449 2e-17
TAIR|locus:2082707112 AT3G54880 "AT3G54880" [Arabido 0.6 0.857 0.387 2.5e-17
TAIR|locus:2145442 AT5G25360 "AT5G25360" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 470 (170.5 bits), Expect = 1.2e-44, P = 1.2e-44
 Identities = 90/156 (57%), Positives = 112/156 (71%)

Query:     6 GCLGCFIRPPVNTSASNLSKELGVPGRSAGKRSISEDFWTTSTCDMDNSAVQSQGSISSL 65
             GC GC  +PP+  +    SK L + GR   K S+SEDFW+TSTC+MDNS +QSQ S+SS+
Sbjct:    20 GCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEMDNSTLQSQRSMSSI 79

Query:    66 G-TNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWIGNKKAENRTHQVREPKLNWNA 124
               TNN        S S S P+EFVNHG  LWNQTRQ+W+ N  ++ +  +VREP ++WNA
Sbjct:    80 SFTNN-----TSTSASTSNPTEFVNHGLNLWNQTRQQWLANGTSQKKA-KVREPTISWNA 133

Query:   125 TYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD 160
             TYESLLG NK F +PIPL EMVDFLVD+WEQEG+YD
Sbjct:   134 TYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGLYD 169




GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
TAIR|locus:2037878 AT1G15350 "AT1G15350" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2093292 AT3G15770 "AT3G15770" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:4515103481 AT4G32342 "AT4G32342" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2142689 AT5G03440 "AT5G03440" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2082707 AT3G54880 "AT3G54880" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
estExt_fgenesh4_pm.C_LG_I0570
SubName- Full=Putative uncharacterized protein; (161 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query160
pfam13259114 pfam13259, DUF4050, Protein of unknown function (D 4e-50
>gnl|CDD|222012 pfam13259, DUF4050, Protein of unknown function (DUF4050) Back     alignment and domain information
 Score =  156 bits (396), Expect = 4e-50
 Identities = 54/119 (45%), Positives = 71/119 (59%), Gaps = 7/119 (5%)

Query: 44  WTTSTCDMDNSAVQSQGSISSLGTNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWI 103
           W+TS  D D+S   SQ SISS  ++           S+  P EFVNHG  LW + R  W 
Sbjct: 1   WSTSPYDFDSSDAVSQRSISSQSSD-----RRKRRKSSEEPEEFVNHGLRLWEERRDAWT 55

Query: 104 GNKKAENRTHQVRE--PKLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD 160
           G +K+   ++ +RE  P L++NA YE L+G NKP  QPI LS+MV FLV  WEQEG++ 
Sbjct: 56  GARKSNEESNPIRESIPPLSYNAIYEKLVGQNKPLKQPINLSDMVKFLVAGWEQEGLWP 114


This family of proteins is functionally uncharacterized. This family of proteins is found in eukaryotes. Proteins in this family are typically between 109 and 173 amino acids in length. There are two conserved sequence motifs: IPL and FLVD. Length = 114

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 160
PF13259122 DUF4050: Protein of unknown function (DUF4050) 100.0
>PF13259 DUF4050: Protein of unknown function (DUF4050) Back     alignment and domain information
Probab=100.00  E-value=6.1e-40  Score=249.54  Aligned_cols=113  Identities=53%  Similarity=0.939  Sum_probs=97.6

Q ss_pred             cccccccccCccccccccccCCCCCCCCCCCCCCCCCCCCCCcchhHHHHHHHHHHHhhhcCCCCCc-------cccccC
Q 031414           44 WTTSTCDMDNSAVQSQGSISSLGTNNLIPDTNGGSGSASAPSEFVNHGFLLWNQTRQRWIGNKKAEN-------RTHQVR  116 (160)
Q Consensus        44 WssS~~emdns~~~Sq~s~ssis~~n~~~~~~~~~~s~~~~~~fVNhgl~~W~q~R~qWvg~~~~~~-------~~~~~r  116 (160)
                      ||||+++|||+++|+|+++++.   ++..+.+........ .+|||||+++|+++|++|||+++...       +....+
T Consensus         1 ~s~s~~~~d~~~~~~~~~~s~~---~~~~~~~~~~~~~~~-~~~~N~Gl~lW~~rR~~W~~~~~~~~~~~~~l~~~~~~r   76 (122)
T PF13259_consen    1 WSSSPFRFDNSDAQSQRSISST---NSSSDCSKRRKSSEE-EEFVNHGLILWEQRRQQWTGPKKSNKIIEPRLQQSNSIR   76 (122)
T ss_pred             CCCCcccCcChhhhhhhccccc---cchhhhhhhhccccC-ccceeHHHHHHHHHHHHHcCCCCcccccccccccccccc
Confidence            9999999999999999999444   444444433322344 99999999999999999999988774       454588


Q ss_pred             C--CccccccchhhhcCCCCCCCCCCChHHHHHHHHHHHhhcCCCC
Q 031414          117 E--PKLNWNATYESLLGSNKPFPQPIPLSEMVDFLVDIWEQEGMYD  160 (160)
Q Consensus       117 e--P~iswnaTyE~LLgsn~pFpqPIPL~EMVDfLVDiWE~EGLyd  160 (160)
                      +  |.++|++|||+||+.++||++||||+|||+||||+|++||+||
T Consensus        77 ~~Ip~~~y~~iY~~Lv~~~~pf~~PI~L~dmv~~Lv~gW~~eg~w~  122 (122)
T PF13259_consen   77 ESIPPISYPAIYEKLVGSNKPFKQPIPLSDMVDFLVDGWEEEGLWP  122 (122)
T ss_pred             cccchhhHHHHHHHHhcCCCCCCCCcCHHHHHHHHHHHhhcCCccC
Confidence            8  9999999999999999999999999999999999999999997




Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00