Citrus Sinensis ID: 029330


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-----
MTTSFIVEQAPKSSILALSSDSESCKDSSLLREEKEFQFQGKEMDEDAALTGRDEKTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIYIFSSSLVIFQIVYDFDIAKTTVVL
ccHHHHHHcccccccEEccccccccccccccHHHHHccccccccccccccccccccccccccccccccEEEccccccEEEEEEEcccccccccccccEEEEEEccccccccEEEEEEcccEEEEEEEccccEEEEEEccccEEEEEEcccHHHHccccccEEEEEcccEEEEEEccEEEEEEEEEEEccEEEEEc
cccEEEEEccccccEEEcccccccccccccccccccccccccccccccccccccccccccccccccccEEEcccccccEEEEEEccccEEcccccccEEEEEEcccccccccEEEEccEEEEEEEEEcccEEEEEEccccccEEEEcccHHHHHHHHccEEEEEccccEEEEEEcEEEEEEEEEEcccccEEEEc
mttsfiveqapkssilalssdsesckdsslLREEKEFQFqgkemdedaaltgrdektsvkkVSTLTLPLVISEKLQLTKAFVECegesidlsgdmgavgrilvpgtaegnHVMFLDLKgtiykttlvpsrtfcivsfghpeakvccksspfslilvdfPILILLLSKSVIYIFSSSLVIFQIVYDFDIAKTTVVL
mttsfiveqapkssilalssdsesckdSSLLREEKEfqfqgkemdedaaltgrdektsvkkvstltlplvisEKLQLTKAFVECEGEsidlsgdmgAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIYIFSSSLVIFQIVYDFDIAKTTVVL
MTTSFIVEQAPKssilalssdsesckdssllREEKEFQFQGKEMDEDAALTGRDEKTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIYIFSSSLVIFQIVYDFDIAKTTVVL
*************************************************************VSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIYIFSSSLVIFQIVYDFDIAKTTV**
********************************************************************LVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVP********MFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIYIFSSSLVIFQIVYDFDIAKTTVVL
MTTSFIVEQAPKSSILA*************LREEKEFQFQGKEMDEDAALTGRDEKTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIYIFSSSLVIFQIVYDFDIAKTTVVL
*TTSFIVEQAPKSS**************************************************LTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIYIFSSSLVIFQIVYDFDIAKTTVVL
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHoooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MTTSFIVEQAPKSSILALSSDSESCKDSSLLREEKEFQFQGKEMDEDAALTGRDEKTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIYIFSSSLVIFQIVYDFDIAKTTVVL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query195 2.2.26 [Sep-21-2011]
Q9FLU1454 DNA-binding protein BIN4 yes no 0.538 0.231 0.525 9e-26
>sp|Q9FLU1|BIN4_ARATH DNA-binding protein BIN4 OS=Arabidopsis thaliana GN=BIN4 PE=1 SV=2 Back     alignment and function desciption
 Score =  116 bits (291), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 62/118 (52%), Positives = 81/118 (68%), Gaps = 13/118 (11%)

Query: 54  DEKTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVM 113
           D+K      S+  LPLV+SEK+  TK  VECEG+SIDLSGDMGAVGR++V  T  G+  M
Sbjct: 262 DQKIKPSSGSSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT-GD--M 318

Query: 114 FLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIY 171
           +LDLKGTIYK+T++PSRTFC+V+ G  EAK+         I+ DF   I L+ +S +Y
Sbjct: 319 YLDLKGTIYKSTIIPSRTFCVVNVGQTEAKI-------EAIMNDF---IQLIPQSNVY 366




Component of the DNA topoisomerase VI complex. Binds to DNA. Required for chromatin organization and progression of endoreduplication cycles. The loss of BIN4 activates the ATM- and ATR-dependent DNA damage responses in postmitotic cells and induces the ectopic expression of the mitotic G2/M-specific cyclin B1;1 gene in non-dividing cells.
Arabidopsis thaliana (taxid: 3702)

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query195
359490779 397 PREDICTED: uncharacterized protein LOC10 0.784 0.385 0.418 6e-32
224121130187 predicted protein [Populus trichocarpa] 0.502 0.524 0.611 5e-30
357464307 416 DNA-binding protein BIN4 [Medicago trunc 0.615 0.288 0.5 9e-26
356507532 380 PREDICTED: uncharacterized protein LOC10 0.533 0.273 0.525 1e-24
240256338 451 DNA-binding protein BIN4 [Arabidopsis th 0.538 0.232 0.525 4e-24
238481371 446 DNA-binding protein BIN4 [Arabidopsis th 0.538 0.235 0.525 4e-24
238481373 454 DNA-binding protein BIN4 [Arabidopsis th 0.538 0.231 0.525 4e-24
238481367 450 DNA-binding protein BIN4 [Arabidopsis th 0.538 0.233 0.525 4e-24
45736025 320 unknown protein [Oryza sativa Japonica G 0.4 0.243 0.628 5e-24
238481365 432 DNA-binding protein BIN4 [Arabidopsis th 0.538 0.243 0.525 5e-24
>gi|359490779|ref|XP_002269798.2| PREDICTED: uncharacterized protein LOC100252112 [Vitis vinifera] gi|302144012|emb|CBI23117.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 93/222 (41%), Positives = 113/222 (50%), Gaps = 69/222 (31%)

Query: 9   QAPKSSILALSSDSESCKDSSLLREE----------KEFQFQGKEMDEDAALTGRD---- 54
            AP  S+  LSSDSESC D+S +RE+          K  QF  KE D   +  G++    
Sbjct: 100 NAPNHSVWKLSSDSESCPDNSSMREDYIHHEESSVHKTTQFPDKEKDAVLSENGQESLLN 159

Query: 55  ----EKTSVKKVS-----------------------------------------TLTLPL 69
               EK+  K+V                                          +  LPL
Sbjct: 160 EASKEKSPKKRVKVEDHISSKKKKANSDMKGKGHDDGTELLEEEASDKHTEPQVSSGLPL 219

Query: 70  VISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPS 129
           V+SEK+  +KA VECEGESIDLSGDMGAVGRI++  T   NH MFLDLKGTIYKTT+VPS
Sbjct: 220 VLSEKVHRSKALVECEGESIDLSGDMGAVGRIVILDTPSRNHEMFLDLKGTIYKTTIVPS 279

Query: 130 RTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIY 171
           RTFCIVSFG  EAKV         ++ DF   I L  +S +Y
Sbjct: 280 RTFCIVSFGQSEAKV-------EAVMNDF---IQLKPQSNVY 311




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224121130|ref|XP_002318505.1| predicted protein [Populus trichocarpa] gi|222859178|gb|EEE96725.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|357464307|ref|XP_003602435.1| DNA-binding protein BIN4 [Medicago truncatula] gi|355491483|gb|AES72686.1| DNA-binding protein BIN4 [Medicago truncatula] Back     alignment and taxonomy information
>gi|356507532|ref|XP_003522518.1| PREDICTED: uncharacterized protein LOC100795660 [Glycine max] Back     alignment and taxonomy information
>gi|240256338|ref|NP_197851.5| DNA-binding protein BIN4 [Arabidopsis thaliana] gi|332005954|gb|AED93337.1| DNA-binding protein BIN4 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|238481371|ref|NP_001154738.1| DNA-binding protein BIN4 [Arabidopsis thaliana] gi|332005958|gb|AED93341.1| DNA-binding protein BIN4 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|238481373|ref|NP_001154739.1| DNA-binding protein BIN4 [Arabidopsis thaliana] gi|257050999|sp|Q9FLU1.2|BIN4_ARATH RecName: Full=DNA-binding protein BIN4; AltName: Full=Protein BRASSINOSTEROID INSENSITIVE 4; AltName: Full=Protein MIDGET gi|332005959|gb|AED93342.1| DNA-binding protein BIN4 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|238481367|ref|NP_001154736.1| DNA-binding protein BIN4 [Arabidopsis thaliana] gi|238481369|ref|NP_001154737.1| DNA-binding protein BIN4 [Arabidopsis thaliana] gi|332005956|gb|AED93339.1| DNA-binding protein BIN4 [Arabidopsis thaliana] gi|332005957|gb|AED93340.1| DNA-binding protein BIN4 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|45736025|dbj|BAD13052.1| unknown protein [Oryza sativa Japonica Group] gi|45736067|dbj|BAD13093.1| unknown protein [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|238481365|ref|NP_001154735.1| DNA-binding protein BIN4 [Arabidopsis thaliana] gi|332005955|gb|AED93338.1| DNA-binding protein BIN4 [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query195
TAIR|locus:2153939454 BIN4 "AT5G24630" [Arabidopsis 0.512 0.220 0.547 4.5e-24
TAIR|locus:2153939 BIN4 "AT5G24630" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 279 (103.3 bits), Expect = 4.5e-24, P = 4.5e-24
 Identities = 58/106 (54%), Positives = 75/106 (70%)

Query:    42 KEMDEDAALTGR---DEKTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAV 98
             K+ D D  +      D+K      S+  LPLV+SEK+  TK  VECEG+SIDLSGDMGAV
Sbjct:   247 KDTDTDTIIAEEVTTDQKIKPSSGSSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAV 306

Query:    99 GRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKV 144
             GR++V  T  G+  M+LDLKGTIYK+T++PSRTFC+V+ G  EAK+
Sbjct:   307 GRVVVSDTT-GD--MYLDLKGTIYKSTIIPSRTFCVVNVGQTEAKI 349


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.321   0.137   0.385    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      195       176   0.00081  109 3  11 22  0.38    32
                                                     31  0.49    34


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  547 (58 KB)
  Total size of DFA:  128 KB (2082 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  15.50u 0.09s 15.59t   Elapsed:  00:00:01
  Total cpu time:  15.50u 0.09s 15.59t   Elapsed:  00:00:01
  Start:  Mon May 20 22:07:53 2013   End:  Mon May 20 22:07:54 2013


GO:0009913 "epidermal cell differentiation" evidence=IMP
GO:0003690 "double-stranded DNA binding" evidence=IDA
GO:0005634 "nucleus" evidence=IDA
GO:0009330 "DNA topoisomerase complex (ATP-hydrolyzing)" evidence=IPI
GO:0010090 "trichome morphogenesis" evidence=RCA;IMP
GO:0030307 "positive regulation of cell growth" evidence=IMP
GO:0042023 "DNA endoreduplication" evidence=IMP
GO:0048364 "root development" evidence=IMP
GO:0048367 "shoot system development" evidence=IMP
GO:0051276 "chromosome organization" evidence=RCA;IMP
GO:0000003 "reproduction" evidence=RCA
GO:0000278 "mitotic cell cycle" evidence=RCA
GO:0003002 "regionalization" evidence=RCA
GO:0006259 "DNA metabolic process" evidence=RCA
GO:0006325 "chromatin organization" evidence=RCA
GO:0006355 "regulation of transcription, DNA-dependent" evidence=RCA
GO:0007062 "sister chromatid cohesion" evidence=RCA
GO:0007126 "meiosis" evidence=RCA
GO:0007131 "reciprocal meiotic recombination" evidence=RCA
GO:0007155 "cell adhesion" evidence=RCA
GO:0008284 "positive regulation of cell proliferation" evidence=RCA
GO:0009410 "response to xenobiotic stimulus" evidence=RCA
GO:0010638 "positive regulation of organelle organization" evidence=RCA
GO:0033043 "regulation of organelle organization" evidence=RCA
GO:0033044 "regulation of chromosome organization" evidence=RCA
GO:0042138 "meiotic DNA double-strand break formation" evidence=RCA
GO:0043687 "post-translational protein modification" evidence=RCA
GO:0045010 "actin nucleation" evidence=RCA
GO:0045132 "meiotic chromosome segregation" evidence=RCA
GO:0045893 "positive regulation of transcription, DNA-dependent" evidence=RCA
GO:0048449 "floral organ formation" evidence=RCA
GO:0048522 "positive regulation of cellular process" evidence=RCA
GO:0048765 "root hair cell differentiation" evidence=RCA
GO:0071555 "cell wall organization" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
gw1.XII.841.1
hypothetical protein (187 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 195
PRK06764105 hypothetical protein; Provisional 81.96
>PRK06764 hypothetical protein; Provisional Back     alignment and domain information
Probab=81.96  E-value=2.1  Score=34.04  Aligned_cols=64  Identities=25%  Similarity=0.307  Sum_probs=44.3

Q ss_pred             ccccccceeeEEeecccccceeeee------ecCceec-ccCCcccee--eEEecCCCCCCceeEEeeeceeEee
Q 029330           59 VKKVSTLTLPLVISEKLQLTKAFVE------CEGESID-LSGDMGAVG--RILVPGTAEGNHVMFLDLKGTIYKT  124 (195)
Q Consensus        59 ~~svS~g~lPLvlpekv~rsK~LvE------~eg~slD-LsGD~GAVG--Rl~V~~s~~~~~~L~LDLKG~iY~a  124 (195)
                      +|+||+.+.|-+--|+++.-.+.+-      +.|.++| ||||.-||-  ...+.=  +....-.+-..|+||+-
T Consensus        25 epsvs~ae~~q~~~enfn~i~v~mn~~e~y~lsgrsidilsgdkeaiqlnkyti~f--~kpg~yvirvngciy~d   97 (105)
T PRK06764         25 EPSVSAAESQQVKEENFNAIDVSMNINELYVLSGRSIDVLSGDKEAIQLNKYTIRF--SKPGKYVIRVNGCIYND   97 (105)
T ss_pred             ccccchhcchhhhhcccceEEEEEeccceEEEcCceeeeecCChhheEeeeeEEEe--cCCccEEEEEccEEeee
Confidence            8999999999888888875444443      3378999 799999984  222321  12234677788999974




Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00