Citrus Sinensis ID: 034369


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90------
MDLHNKSNNSNSIEVIFDSSLNLEQTHLKEGYDEGYTHGLATRKEEAKDVGLKHGFETGEELGFYKAVSMSGTRLFISIRPGCRPGSKRVSGKWTN
ccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccc
ccccccccccccHHHHHHHHHHHHHHHHHHHHHcccHccHcccHHHHHHHHHHcccEEHHHHccHHHHHHHHEEEEEEcccccccccEEEcccccc
mdlhnksnnsnsIEVIFDSSLNLEQTHLkegydegythglATRKEEAkdvglkhgfetgeELGFYKAVsmsgtrlfisirpgcrpgskrvsgkwtn
mdlhnksnnsnsiEVIFDSSLNLEQTHLKEGYDEGYTHGLATRKEEAKDVGLKHGFETGEELGFYKAVSMSGTRLfisirpgcrpgskrvsgkwtn
MDLHNKSNNSNSIEVIFDSSLNLEQTHLKEGYDEGYTHGLATRKEEAKDVGLKHGFETGEELGFYKAVSMSGTRLFISIRPGCRPGSKRVSGKWTN
**************VIFDSSLNLEQTHLKEGYDEGYTHGLAT******DVGLKHGFETGEELGFYKAVSMSGTRLFISIRPG**************
***************IFDSSLNLEQTHLKEGYDEGYTHGLATRKEEAKDVGLKHGFETGEELGFYKAVSMSGTRLF********************
********NSNSIEVIFDSSLNLEQTHLKEGYDEGYTHGLATRKEEAKDVGLKHGFETGEELGFYKAVSMSGTRLFISIRPGC*************
*********SNSIEVIFDSSLNLEQTHLKEGYDEGYTHGLATRKEEAKDVGLKHGFETGEELGFYKAVSMSGTRLFISIRPGCRPGSKRVS*****
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MDLHNKSNNSNSIEVIFDSSLNLEQTHLKEGYDEGYTHGLATRKEEAKDVGLKHGFETGEELGFYKAVSMSGTRLFISIRPGCRPGSKRVSGKWTN
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query96
255637974138 unknown [Glycine max] 0.697 0.485 0.647 1e-15
255639485138 unknown [Glycine max] 0.697 0.485 0.647 1e-15
255553879144 Oral cancer overexpressed protein, putat 0.604 0.402 0.689 1e-15
388511973137 unknown [Lotus japonicus] 0.697 0.489 0.617 5e-14
225447254 455 PREDICTED: formimidoyltransferase-cyclod 0.583 0.123 0.660 1e-13
297739272146 unnamed protein product [Vitis vinifera] 0.666 0.438 0.555 2e-13
224126761125 predicted protein [Populus trichocarpa] 0.531 0.408 0.725 3e-13
449444394 427 PREDICTED: formimidoyltransferase-cyclod 0.531 0.119 0.725 4e-13
358345129135 Oral cancer-overexpressed protein-like p 0.697 0.496 0.558 7e-13
388517567136 unknown [Medicago truncatula] 0.697 0.492 0.558 7e-13
>gi|255637974|gb|ACU19303.1| unknown [Glycine max] Back     alignment and taxonomy information
 Score = 87.4 bits (215), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/68 (64%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 16 IFDSSLNLEQTHLKEGYDEGYTHGLATRKEEAKDVGLKHGFETGEELGFYKAVSMSGTRL 75
          +FDSSLNLE TH KEGYDEGY+HGL T KEEA+ VGLK GFE GEELGFY+      T  
Sbjct: 4  LFDSSLNLEDTHYKEGYDEGYSHGLVTGKEEARQVGLKVGFEVGEELGFYRGCVEIWT-F 62

Query: 76 FISIRPGC 83
           I + P C
Sbjct: 63 AIRLDPTC 70




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255639485|gb|ACU20037.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|255553879|ref|XP_002517980.1| Oral cancer overexpressed protein, putative [Ricinus communis] gi|223542962|gb|EEF44498.1| Oral cancer overexpressed protein, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|388511973|gb|AFK44048.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|225447254|ref|XP_002272923.1| PREDICTED: formimidoyltransferase-cyclodeaminase-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|297739272|emb|CBI28923.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|224126761|ref|XP_002319920.1| predicted protein [Populus trichocarpa] gi|222858296|gb|EEE95843.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449444394|ref|XP_004139960.1| PREDICTED: formimidoyltransferase-cyclodeaminase-like [Cucumis sativus] gi|449475733|ref|XP_004154536.1| PREDICTED: formimidoyltransferase-cyclodeaminase-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|358345129|ref|XP_003636635.1| Oral cancer-overexpressed protein-like protein [Medicago truncatula] gi|355502570|gb|AES83773.1| Oral cancer-overexpressed protein-like protein [Medicago truncatula] Back     alignment and taxonomy information
>gi|388517567|gb|AFK46845.1| unknown [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query96
TAIR|locus:2051415 431 AT2G20830 [Arabidopsis thalian 0.583 0.129 0.5 8.6e-11
MGI|MGI:1919534137 Oraov1 "oral cancer overexpres 0.583 0.408 0.339 5e-05
UNIPROTKB|Q8WV07137 ORAOV1 "Oral cancer-overexpres 0.583 0.408 0.321 0.00057
UNIPROTKB|G4N1N8169 MGG_07505 "Uncharacterized pro 0.489 0.278 0.361 0.00068
TAIR|locus:2051415 AT2G20830 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 158 (60.7 bits), Expect = 8.6e-11, P = 8.6e-11
 Identities = 28/56 (50%), Positives = 41/56 (73%)

Query:    14 EVIFDSSLNLEQTHLKEGYDEGYTHGLATRKEEAKDVGLKHGFETGEELGFYKAVS 69
             E   D  + LE+TH+++G+DEGY  GL + +E+A+ +GLK GFETGE +GFY+  S
Sbjct:     7 EDFLDCIVRLEETHVQQGFDEGYEEGLVSGREDARHLGLKLGFETGELIGFYRGCS 62




GO:0005542 "folic acid binding" evidence=IEA
GO:0008152 "metabolic process" evidence=IEA
GO:0009507 "chloroplast" evidence=ISM
GO:0016740 "transferase activity" evidence=IEA
MGI|MGI:1919534 Oraov1 "oral cancer overexpressed 1" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|Q8WV07 ORAOV1 "Oral cancer-overexpressed protein 1" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|G4N1N8 MGG_07505 "Uncharacterized protein" [Magnaporthe oryzae 70-15 (taxid:242507)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query96
pfam0981139 pfam09811, Yae1_N, Essential protein Yae1, N termi 1e-06
>gnl|CDD|204317 pfam09811, Yae1_N, Essential protein Yae1, N terminal Back     alignment and domain information
 Score = 41.3 bits (98), Expect = 1e-06
 Identities = 13/38 (34%), Positives = 17/38 (44%)

Query: 31 GYDEGYTHGLATRKEEAKDVGLKHGFETGEELGFYKAV 68
          GY EG   G     +E  D G + G + G ELG    +
Sbjct: 1  GYREGIAEGKEEGLQEGFDEGYQEGAQLGLELGQLLGL 38


Members of this family are found in the N terminal region of the essential protein Yae1. Their exact function has not, as yet, been determined. The family DUF1715, pfam08215 has now been merged into this family. Length = 39

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 96
KOG4595139 consensus Uncharacterized conserved protein [Funct 99.92
PF0981139 Yae1_N: Essential protein Yae1, N terminal; InterP 99.37
PRK13386 236 fliH flagellar assembly protein H; Provisional 98.28
PRK13386 236 fliH flagellar assembly protein H; Provisional 98.2
PRK05687 246 fliH flagellar assembly protein H; Validated 98.15
KOG4774190 consensus Uncharacterized conserved protein [Funct 98.12
PF0981139 Yae1_N: Essential protein Yae1, N terminal; InterP 98.02
PRK05687 246 fliH flagellar assembly protein H; Validated 98.0
KOG4595139 consensus Uncharacterized conserved protein [Funct 97.15
COG1317 234 FliH Flagellar biosynthesis/type III secretory pat 96.99
KOG4774190 consensus Uncharacterized conserved protein [Funct 95.39
PRK09956308 hypothetical protein; Provisional 95.14
COG1317 234 FliH Flagellar biosynthesis/type III secretory pat 94.24
PRK06669281 fliH flagellar assembly protein H; Validated 92.31
PRK06937204 type III secretion system protein; Reviewed 88.53
TIGR03825 255 FliH_bacil flagellar assembly protein FliH. This b 87.96
COG5464289 Uncharacterized conserved protein [Function unknow 87.49
PRK09956308 hypothetical protein; Provisional 80.01
>KOG4595 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
Probab=99.92  E-value=1.2e-24  Score=156.14  Aligned_cols=76  Identities=34%  Similarity=0.534  Sum_probs=68.8

Q ss_pred             CCCCchhhhHhhhhhHHHHHHHHHHHHHHHhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCC
Q 034369            9 NSNSIEVIFDSSLNLEQTHLKEGYDEGYTHGLATRKEEAKDVGLKHGFETGEELGFYKAVSMSGTRLFISIRPGCRP   85 (96)
Q Consensus         9 ~~~d~dD~FD~il~lEe~~~~eGY~EG~~~G~~~G~~EG~~~G~~kGfe~g~ElGfy~G~~~~W~~~l~~~~p~~~~   85 (96)
                      ++-+++|+||+|++|||++|++||+||+++|.++|+.|||++|+++||++|.+||+|+||+.+|.++++ .+|..++
T Consensus         2 ~sg~~~d~fD~Iv~lEE~~~q~Gy~EG~~eGi~qG~eEg~~~Gl~~G~~~g~llG~~~Gc~~l~~~~lh-s~~~~ks   77 (139)
T KOG4595|consen    2 PSGDINDDFDDIVLLEEKEYQEGYDEGYEEGIEQGNEEGRQLGLAHGFSLGELLGKILGCVVLWQQLLH-SPKTRKS   77 (139)
T ss_pred             CcchhHhHHHHHHHHHHHHHHHhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhcc-CCchhcC
Confidence            456799999999999999999999999999999999999999999999999999999999999999653 4444444



>PF09811 Yae1_N: Essential protein Yae1, N terminal; InterPro: IPR019191 This entry represents proteins found in the N-terminal region of the essential protein Yae1 Back     alignment and domain information
>PRK13386 fliH flagellar assembly protein H; Provisional Back     alignment and domain information
>PRK13386 fliH flagellar assembly protein H; Provisional Back     alignment and domain information
>PRK05687 fliH flagellar assembly protein H; Validated Back     alignment and domain information
>KOG4774 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
>PF09811 Yae1_N: Essential protein Yae1, N terminal; InterPro: IPR019191 This entry represents proteins found in the N-terminal region of the essential protein Yae1 Back     alignment and domain information
>PRK05687 fliH flagellar assembly protein H; Validated Back     alignment and domain information
>KOG4595 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
>COG1317 FliH Flagellar biosynthesis/type III secretory pathway protein [Cell motility and secretion / Intracellular trafficking and secretion] Back     alignment and domain information
>KOG4774 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
>PRK09956 hypothetical protein; Provisional Back     alignment and domain information
>COG1317 FliH Flagellar biosynthesis/type III secretory pathway protein [Cell motility and secretion / Intracellular trafficking and secretion] Back     alignment and domain information
>PRK06669 fliH flagellar assembly protein H; Validated Back     alignment and domain information
>PRK06937 type III secretion system protein; Reviewed Back     alignment and domain information
>TIGR03825 FliH_bacil flagellar assembly protein FliH Back     alignment and domain information
>COG5464 Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
>PRK09956 hypothetical protein; Provisional Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00