Citrus Sinensis ID: 033891


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------11
MEKETTFDSDLIHAIFKHIWTRRSLERERNGGTDAMESEFLLHQAGAGTSKKNRPTSANANALKLSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQLLLDF
ccccccccHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHcccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHccc
ccccccccHHHHHHHHHHHHHHHHHHHccccccccccccEcccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccEEcHHHHHHHHHHHHccc
MEKETTFDSDLIHAIFKHIWTRRSLerernggtdamESEFLLhqagagtskknrptsanANALKLSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQLLLDF
mekettfdsdlihaifkhiwtrrsLERERNGGTDAMESEFLLHQagagtskknrpTSANANALKLSCELLRVFVTEAVQRAAAIAeaegvskieatHLERILPQLLLDF
MEKETTFDSDLIHAIFKHIWTRRSLERERNGGTDAMESEFLLHQAGAGTSKKNRPTSANANALKLSCELLRVFVTeavqraaaiaeaeGVSKIEATHLERILPQLLLDF
********SDLIHAIFKHIWTRR**************************************ALKLSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQLL***
*****TF**DLIHAIFKHIW******************************************LKLSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQLLLDF
********SDLIHAIFKHIWTRRSLERERNGGTDAMESEFLLHQAG***********ANANALKLSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQLLLDF
****TTFDSDLIHAIFKHIWTRRSLE*******************************ANANALKLSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQLLLDF
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEKETTFDSDLIHAIFKHIWTRRSLERERNGGTDAMESEFLLHQAGAGTSKKNRPTSANANALKLSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQLLLDF
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query109 2.2.26 [Sep-21-2011]
Q2NKU079 Centromere protein X OS=B yes no 0.532 0.734 0.465 2e-09
A8MT6981 Centromere protein X OS=H yes no 0.532 0.716 0.465 3e-08
Q5RBU181 Centromere protein X OS=P yes no 0.532 0.716 0.465 3e-08
Q8C4X178 Centromere protein X OS=M yes no 0.532 0.743 0.448 1e-07
P0DJH780 Centromere protein X OS=G yes no 0.495 0.675 0.555 5e-05
>sp|Q2NKU0|CENPX_BOVIN Centromere protein X OS=Bos taurus GN=STRA13 PE=3 SV=1 Back     alignment and function desciption
 Score = 61.2 bits (147), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 27/58 (46%), Positives = 42/58 (72%)

Query: 52  KNRPTSANANALKLSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQLLLDF 109
           K++ T  + +AL+L  ELL++FV EA  R+   A+AEG++ ++   LE++LPQLLLDF
Sbjct: 22  KDKKTKVSGDALQLVAELLKIFVVEAAIRSVRQAQAEGLAHVDVEQLEKVLPQLLLDF 79




DNA-binding component of the FA core complex involved in DNA damage repair and genome maintenance. Recruited to forks stalled by DNA interstrand cross-links, and required for cellular resistance to such lesions. Component of the heterotetrameric CENP-T-W-S-X complex that binds and supercoils DNA, and plays an important role in kinetochore assembly. Component of the APITD1/CENPS complex that is essential for the stable assembly of the outer kinetochore. Plays an important role in mitotic progression and chromosome segregation.
Bos taurus (taxid: 9913)
>sp|A8MT69|CENPX_HUMAN Centromere protein X OS=Homo sapiens GN=STRA13 PE=1 SV=1 Back     alignment and function description
>sp|Q5RBU1|CENPX_PONAB Centromere protein X OS=Pongo abelii GN=STRA13 PE=3 SV=1 Back     alignment and function description
>sp|Q8C4X1|CENPX_MOUSE Centromere protein X OS=Mus musculus GN=Stra13 PE=2 SV=1 Back     alignment and function description
>sp|P0DJH7|CENPX_CHICK Centromere protein X OS=Gallus gallus GN=STRA13 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query109
297733659179 unnamed protein product [Vitis vinifera] 0.917 0.558 0.771 5e-38
359491178 323 PREDICTED: uncharacterized protein LOC10 0.917 0.309 0.771 7e-38
297842655104 hypothetical protein ARALYDRAFT_316775 [ 0.954 1.0 0.733 2e-36
30699356104 uncharacterized protein [Arabidopsis tha 0.954 1.0 0.733 2e-36
224121862103 predicted protein [Populus trichocarpa] 0.935 0.990 0.710 1e-35
449440604106 PREDICTED: centromere protein X-like [Cu 0.954 0.981 0.694 6e-34
356513827103 PREDICTED: centromere protein X-like [Gl 0.935 0.990 0.663 5e-31
357164214110 PREDICTED: centromere protein X-like [Br 0.899 0.890 0.634 8e-30
356565256137 PREDICTED: uncharacterized protein LOC10 0.871 0.693 0.68 4e-29
255540435143 conserved hypothetical protein [Ricinus 0.853 0.650 0.693 4e-29
>gi|297733659|emb|CBI14906.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  161 bits (407), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 81/105 (77%), Positives = 88/105 (83%), Gaps = 5/105 (4%)

Query: 5   TTFDSDLIHAIFKHIWTRRSLERERNGGTDAMESEFLLHQAGAGTSKKNRPTSANANALK 64
           T FD DLIHAIFK +W+R +LERE+N G D +E E      GA TSKKNRPTSANANALK
Sbjct: 80  TRFDPDLIHAIFKLVWSRTALEREKNEGADPLECE-----VGAATSKKNRPTSANANALK 134

Query: 65  LSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQLLLDF 109
           LSCELLRVFV EAV+RAA IAEAEGV+KIEATHLERILPQLLLDF
Sbjct: 135 LSCELLRVFVIEAVERAATIAEAEGVNKIEATHLERILPQLLLDF 179




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|359491178|ref|XP_002276097.2| PREDICTED: uncharacterized protein LOC100253596 [Vitis vinifera] Back     alignment and taxonomy information
>gi|297842655|ref|XP_002889209.1| hypothetical protein ARALYDRAFT_316775 [Arabidopsis lyrata subsp. lyrata] gi|297335050|gb|EFH65468.1| hypothetical protein ARALYDRAFT_316775 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|30699356|ref|NP_178000.2| uncharacterized protein [Arabidopsis thaliana] gi|22135948|gb|AAM91556.1| unknown protein [Arabidopsis thaliana] gi|24899661|gb|AAN65045.1| unknown protein [Arabidopsis thaliana] gi|332198032|gb|AEE36153.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|224121862|ref|XP_002318691.1| predicted protein [Populus trichocarpa] gi|118488565|gb|ABK96095.1| unknown [Populus trichocarpa] gi|222859364|gb|EEE96911.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449440604|ref|XP_004138074.1| PREDICTED: centromere protein X-like [Cucumis sativus] gi|449501349|ref|XP_004161344.1| PREDICTED: centromere protein X-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|356513827|ref|XP_003525610.1| PREDICTED: centromere protein X-like [Glycine max] Back     alignment and taxonomy information
>gi|357164214|ref|XP_003579984.1| PREDICTED: centromere protein X-like [Brachypodium distachyon] Back     alignment and taxonomy information
>gi|356565256|ref|XP_003550858.1| PREDICTED: uncharacterized protein LOC100810847 [Glycine max] Back     alignment and taxonomy information
>gi|255540435|ref|XP_002511282.1| conserved hypothetical protein [Ricinus communis] gi|223550397|gb|EEF51884.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query109
TAIR|locus:2037488104 AT1G78790 "AT1G78790" [Arabido 0.954 1.0 0.623 1.9e-28
UNIPROTKB|Q2NKU079 STRA13 "Centromere protein X" 0.532 0.734 0.362 1.6e-06
UNIPROTKB|P0DJH780 STRA13 "Centromere protein X" 0.495 0.675 0.407 2.1e-06
UNIPROTKB|A8MT6981 STRA13 "Centromere protein X" 0.532 0.716 0.362 3e-05
UNIPROTKB|Q5RBU181 STRA13 "Centromere protein X" 0.532 0.716 0.362 3e-05
MGI|MGI:89432478 Stra13 "stimulated by retinoic 0.495 0.692 0.370 8.1e-05
ZFIN|ZDB-GENE-110411-7881 stra13 "stimulated by retinoic 0.495 0.666 0.351 0.00045
TAIR|locus:2037488 AT1G78790 "AT1G78790" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 317 (116.6 bits), Expect = 1.9e-28, P = 1.9e-28
 Identities = 68/109 (62%), Positives = 75/109 (68%)

Query:     1 MEKETTFDSDLIHAIFKHIWTRRSLERERNGGTDAMESEFLLHQAGAGTSKKNRPTSANA 60
             M+   TFDSDLIHAIFKHIW RR  ERER+   DA E+E  L     GT+KKNR  SANA
Sbjct:     1 MDANNTFDSDLIHAIFKHIWARRFRERERSDAIDATEAEVAL-----GTTKKNRLASANA 55

Query:    61 NALKLSCELLRVFVTXXXXXXXXXXXXXGVSKIEATHLERILPQLLLDF 109
             NALKLSCELL+ FV+             G+ KIEATHLERILPQLLLDF
Sbjct:    56 NALKLSCELLKSFVSEAVQRAAIIAEAEGMEKIEATHLERILPQLLLDF 104




GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND
GO:0006661 "phosphatidylinositol biosynthetic process" evidence=RCA
UNIPROTKB|Q2NKU0 STRA13 "Centromere protein X" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|P0DJH7 STRA13 "Centromere protein X" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|A8MT69 STRA13 "Centromere protein X" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|Q5RBU1 STRA13 "Centromere protein X" [Pongo abelii (taxid:9601)] Back     alignment and assigned GO terms
MGI|MGI:894324 Stra13 "stimulated by retinoic acid 13" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-110411-78 stra13 "stimulated by retinoic acid 13 homolog (mouse)" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query109
pfam0941571 pfam09415, CENP-X, CENP-S associating Centromere p 7e-20
>gnl|CDD|220228 pfam09415, CENP-X, CENP-S associating Centromere protein X Back     alignment and domain information
 Score = 76.2 bits (188), Expect = 7e-20
 Identities = 34/58 (58%), Positives = 39/58 (67%)

Query: 52  KNRPTSANANALKLSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQLLLDF 109
           K+  T    +AL+L  E L +FV EAV RAA IAEAEG  K+E   LERI PQLLLDF
Sbjct: 14  KDDKTRITKDALELVAEYLDIFVREAVARAAEIAEAEGDEKLEVEDLERIAPQLLLDF 71


The centromere, essential for faithful chromosome segregation during mitosis, has a network of constitutive centromere-associated (CCAN) proteins associating with it during mitosis. So far in vertebrates at least 15 centromere proteins have been identified, which are divided into several subclasses based on functional and biochemical analyses. These provide a platform for the formation of a functional kinetochore during mitosis. CENP-S is one that does not associate with the CENP-H-containing complex but rather interacts with CENP-X to form a stable assembly of outer kinetochore proteins that functions downstream of other components of the CCAN. This complex may directly allow efficient and stable formation of the outer kinetochore on the CCAN platform. Length = 71

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 109
PF0941572 CENP-X: CENP-S associating Centromere protein X; I 99.95
PF0080865 CBFD_NFYB_HMF: Histone-like transcription factor ( 97.15
cd0798172 TAF12 TATA Binding Protein (TBP) Associated Factor 96.65
PF0384768 TFIID_20kDa: Transcription initiation factor TFIID 96.54
smart0080365 TAF TATA box binding protein associated factor. TA 94.26
COG203691 HHT1 Histones H3 and H4 [Chromatin structure and d 94.15
PF0226993 TFIID-18kDa: Transcription initiation factor IID, 93.5
PF0012575 Histone: Core histone H2A/H2B/H3/H4 histone h2a si 92.41
cd0804885 TAF11 TATA Binding Protein (TBP) Associated Factor 91.59
cd0007685 H4 Histone H4, one of the four histones, along wit 89.05
cd00074115 H2A Histone 2A; H2A is a subunit of the nucleosome 88.98
KOG1142258 consensus Transcription initiation factor TFIID, s 88.43
PLN00035103 histone H4; Provisional 88.1
PTZ00015102 histone H4; Provisional 86.81
smart0041774 H4 Histone H4. 86.02
PF0471990 TAFII28: hTAFII28-like protein conserved region; I 85.17
smart0057677 BTP Bromodomain transcription factors and PHD doma 83.45
cd0797892 TAF13 The TATA Binding Protein (TBP) Associated Fa 81.32
PF0752477 Bromo_TP: Bromodomain associated; InterPro: IPR006 80.73
>PF09415 CENP-X: CENP-S associating Centromere protein X; InterPro: IPR018552 Centromere protein X (CENP-X) is a component of the CENP-S complex Back     alignment and domain information
Probab=99.95  E-value=2.5e-28  Score=163.29  Aligned_cols=61  Identities=59%  Similarity=0.785  Sum_probs=55.0

Q ss_pred             CccCCCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCc-cccchHHHhhhhhhcCC
Q 033891           49 TSKKNRPTSANANALKLSCELLRVFVTEAVQRAAAIAEAEGVSK-IEATHLERILPQLLLDF  109 (109)
Q Consensus        49 ~s~k~~~TkIs~dAl~L~aeyL~iFV~EAv~RA~~eA~~Eg~~~-levehLEKIlPQLLLDF  109 (109)
                      .+|++++||||+||++++++||++||+||++||+++|+++|++. |+++|||||+|||||||
T Consensus        11 ~~f~~~~tkIs~dal~l~~eyl~iFV~EAv~Ra~~~a~~e~~~~~le~e~LEki~pqLlLDf   72 (72)
T PF09415_consen   11 EHFKDDKTKISKDALKLSAEYLRIFVREAVARAAEQAEAEGDEGFLEVEHLEKILPQLLLDF   72 (72)
T ss_dssp             TTSSSTT-EE-CCCHHHHHHHHHHHHHHHHHHHHHHHHHTT-SSEE-HHHHHHHCHCHHCHT
T ss_pred             HHhcCCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCHHHHHHHHHHHhcCC
Confidence            36899999999999999999999999999999999999999887 99999999999999999



The CENP-S complex is composed of at least of CENP-S and CENP-X and is essential for the stable assembly of the outer kinetchore []. CENP-X is also a DNA-binding component of the Fanconi anemia (FA) core complex involved in DNA damage repair and genome maintenance. The FA complex is composed of CENPS, FANCA, FANCB, FANCC, FANCE, FANCF, FANCG, FANCL/PHF9, FANCM, FAAP24 and CENPX. Interacts with CENPS, FANCM and FAAP24 [, ].; PDB: 4DRB_L 4DRA_H 3V9R_D.

>PF00808 CBFD_NFYB_HMF: Histone-like transcription factor (CBF/NF-Y) and archaeal histone; InterPro: IPR003958 The CCAAT-binding factor (CBF) is a mammalian transcription factor that binds to a CCAAT motif in the promoters of a wide variety of genes, including type I collagen and albumin Back     alignment and domain information
>cd07981 TAF12 TATA Binding Protein (TBP) Associated Factor 12 (TAF12) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex Back     alignment and domain information
>PF03847 TFIID_20kDa: Transcription initiation factor TFIID subunit A; InterPro: IPR003228 Human transcription initiation factor TFIID is composed of the TATA-binding polypeptide (TBP) and at least 13 TBP-associated factors (TAFs) that collectively or individually are involved in activator-dependent transcription [] Back     alignment and domain information
>smart00803 TAF TATA box binding protein associated factor Back     alignment and domain information
>COG2036 HHT1 Histones H3 and H4 [Chromatin structure and dynamics] Back     alignment and domain information
>PF02269 TFIID-18kDa: Transcription initiation factor IID, 18kD subunit; InterPro: IPR003195 This family includes the Spt3 yeast transcription factors and the 18 kDa subunit from human transcription initiation factor IID (TFIID-18) Back     alignment and domain information
>PF00125 Histone: Core histone H2A/H2B/H3/H4 histone h2a signature histone h2b signature histone h3 signature histone h4 signature; InterPro: IPR007125 The core histones together with some other DNA binding proteins appear to form a superfamily defined by a common fold and distant sequence similarities [, ] Back     alignment and domain information
>cd08048 TAF11 TATA Binding Protein (TBP) Associated Factor 11 (TAF11) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex Back     alignment and domain information
>cd00076 H4 Histone H4, one of the four histones, along with H2A, H2B and H3, which forms the eukaryotic nucleosome core; along with H3, it plays a central role in nucleosome formation; histones bind to DNA and wrap the genetic material into "beads on a string" in which DNA (the string) is wrapped around small blobs of histones (the beads) at regular intervals; play a role in the inheritance of specialized chromosome structures and the control of gene activity; defects in the establishment of proper chromosome structure by histones may activate or silence genes aberrantly and thus lead to disease; the sequence of histone H4 has remained almost invariant in more than 2 billion years of evolution Back     alignment and domain information
>cd00074 H2A Histone 2A; H2A is a subunit of the nucleosome Back     alignment and domain information
>KOG1142 consensus Transcription initiation factor TFIID, subunit TAF12 (also component of histone acetyltransferase SAGA) [Transcription] Back     alignment and domain information
>PLN00035 histone H4; Provisional Back     alignment and domain information
>PTZ00015 histone H4; Provisional Back     alignment and domain information
>smart00417 H4 Histone H4 Back     alignment and domain information
>PF04719 TAFII28: hTAFII28-like protein conserved region; InterPro: IPR006809 The general transcription factor, TFIID, consists of the TATA-binding protein (TBP) associated with a series of TBP-associated factors (TAFs) that together participate in the assembly of the transcription preinitiation complex Back     alignment and domain information
>smart00576 BTP Bromodomain transcription factors and PHD domain containing proteins Back     alignment and domain information
>cd07978 TAF13 The TATA Binding Protein (TBP) Associated Factor 13 (TAF13) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex Back     alignment and domain information
>PF07524 Bromo_TP: Bromodomain associated; InterPro: IPR006565 This bromodomain is found in eukaryotic transcription factors and PHD domain containing proteins (IPR001965 from INTERPRO) Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query109
4drb_J84 The Crystal Structure Of Fancm Bound Mhf Complex Le 1e-05
3vh5_D81 Crystal Structure Of The Chicken Cenp-T Histone Fol 1e-05
4dra_E84 Crystal Structure Of Mhf Complex Length = 84 2e-05
>pdb|4DRB|J Chain J, The Crystal Structure Of Fancm Bound Mhf Complex Length = 84 Back     alignment and structure

Iteration: 1

Score = 44.7 bits (104), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 21/58 (36%), Positives = 31/58 (53%) Query: 52 KNRPTSANANALKLSCELLRVFVTXXXXXXXXXXXXXGVSKIEATHLERILPQLLLDF 109 K+ T + +AL+L ELL+VFV +++ LE++LPQLLLDF Sbjct: 27 KDDKTKVSGDALQLXVELLKVFVVEAAVRGVRQAQAEDALRVDVDQLEKVLPQLLLDF 84
>pdb|3VH5|D Chain D, Crystal Structure Of The Chicken Cenp-T Histone FoldCENP-WCENP- SCENP-X Heterotetrameric Complex, Crystal Form I Length = 81 Back     alignment and structure
>pdb|4DRA|E Chain E, Crystal Structure Of Mhf Complex Length = 84 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query109
3b0b_C81 CENP-X, centromere protein X; histone fold, DNA bi 3e-20
4dra_E84 Centromere protein X; DNA binding complex, DNA dam 1e-19
>3b0b_C CENP-X, centromere protein X; histone fold, DNA binding, DNA, nucleus, DNA binding protein; 2.15A {Gallus gallus} PDB: 3vh5_D 3vh6_D Length = 81 Back     alignment and structure
 Score = 76.9 bits (189), Expect = 3e-20
 Identities = 37/84 (44%), Positives = 48/84 (57%), Gaps = 5/84 (5%)

Query: 26  ERERNGGTDAMESEFLLHQAGAGTSKKNRPTSANANALKLSCELLRVFVTEAVQRAAAIA 85
             ER GG      E LL         ++  T  N +AL L  ELL+VFV EA  RAA  A
Sbjct: 2   YEEREGGFRKETVERLLR-----LHFRDGRTRVNGDALLLMAELLKVFVREAAARAARQA 56

Query: 86  EAEGVSKIEATHLERILPQLLLDF 109
           +AE + K++  H+E++LPQLLLDF
Sbjct: 57  QAEDLEKVDIEHVEKVLPQLLLDF 80


>4dra_E Centromere protein X; DNA binding complex, DNA damage repair, histone-fold, DNA BI protein; 2.41A {Homo sapiens} PDB: 4drb_J Length = 84 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query109
3b0b_C81 CENP-X, centromere protein X; histone fold, DNA bi 100.0
4dra_E84 Centromere protein X; DNA binding complex, DNA dam 99.97
3v9r_B88 MHF2, uncharacterized protein YDL160C-A; histone f 99.8
3b0c_W76 CENP-W, centromere protein W; histone fold, DNA bi 96.92
1n1j_A93 NF-YB; histone-like PAIR, DNA binding protein; 1.6 96.28
1n1j_B97 NF-YC; histone-like PAIR, DNA binding protein; 1.6 96.23
2byk_B128 Chrac-14; nucleosome sliding, histone fold, DNA-bi 95.97
1h3o_B76 Transcription initiation factor TFIID 20/15 kDa su 95.97
1b67_A68 Protein (histone HMFA); DNA binding protein; 1.48A 95.7
4g92_C119 HAPE; transcription factor, nucleosome, minor groo 95.58
1jfi_A98 Transcription regulator NC2 alpha chain; histone, 95.24
2byk_A140 Chrac-16; nucleosome sliding, histone fold, DNA-bi 95.21
1jfi_B179 DR1 protein, transcription regulator NC2 beta chai 94.94
1f1e_A154 Histone fold protein; archaeal histone protein, DN 94.45
2hue_C84 Histone H4; mini beta sheet, elongated beta sandwh 93.8
1id3_B102 Histone H4; nucleosome core particle, chromatin, p 93.49
1ku5_A70 HPHA, archaeal histon; histone fold, DNA binding p 93.33
2yfw_B103 Histone H4, H4; cell cycle, kinetochore, centromer 93.25
1tzy_D103 Histone H4-VI; histone-fold, tetramer-dimer-dimer, 93.2
1taf_B70 TFIID TBP associated factor 62; transcription init 92.49
1f1e_A154 Histone fold protein; archaeal histone protein, DN 91.88
3b0c_T111 CENP-T, centromere protein T; histone fold, DNA bi 91.14
2f8n_G120 Core histone macro-H2A.1; nucleosome, NCP, macroh2 90.77
2nqb_C123 Histone H2A; nucleosome, NCP, chromatin, structura 90.56
1tzy_A129 Histone H2A-IV; histone-fold, tetramer-dimer-dimer 90.14
1id3_C131 Histone H2A.1; nucleosome core particle, chromatin 89.31
1f66_C128 Histone H2A.Z; nucleosome, chromatin, histone vari 88.84
2f8n_K149 Histone H2A type 1; nucleosome, NCP, macroh2A, his 88.61
1bh9_B89 TAFII28; histone fold, tata binding protein, trans 84.03
2jss_A192 Chimera of histone H2B.1 and histone H2A.Z; histon 83.23
2v1u_A 387 Cell division control protein 6 homolog; DNA repli 81.87
3uk6_A368 RUVB-like 2; hexameric AAA+ ATP-ASE, DNA unwinding 80.66
>3b0b_C CENP-X, centromere protein X; histone fold, DNA binding, DNA, nucleus, DNA binding protein; 2.15A {Gallus gallus} PDB: 3vh5_D 3vh6_D Back     alignment and structure
Probab=100.00  E-value=7.5e-34  Score=193.01  Aligned_cols=77  Identities=43%  Similarity=0.676  Sum_probs=71.5

Q ss_pred             CCccCCCHHHHHHHHHHHhhhhhhhhhhcCCCChhHHHHHhhhccCCCccCCCCCccchHHHHHHHHHHHHHHHHHHHHH
Q 033891            2 EKETTFDSDLIHAIFKHIWTRRSLERERNGGTDAMESEFLLHQAGAGTSKKNRPTSANANALKLSCELLRVFVTEAVQRA   81 (109)
Q Consensus         2 ~~~~tf~~~li~~ifk~~w~~~~~~~~~~~~~~~~~~e~~~~~~~~~~s~k~~~TkIs~dAl~L~aeyL~iFV~EAv~RA   81 (109)
                      +..|+|+|+||++||++.|..                               ++|||++||++|++|||++||+||++||
T Consensus         4 ~~~~~~~~~lI~ril~~~f~~-------------------------------~ktrI~~dAl~l~aeyl~iFV~EAv~RA   52 (81)
T 3b0b_C            4 EREGGFRKETVERLLRLHFRD-------------------------------GRTRVNGDALLLMAELLKVFVREAAARA   52 (81)
T ss_dssp             ---CCCCHHHHHHHHHHHCCS-------------------------------TTCEECHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             ccCCCCCHHHHHHHHHHHhcc-------------------------------CcccccHHHHHHHHHHHHHHHHHHHHHH
Confidence            357899999999999999875                               3789999999999999999999999999


Q ss_pred             HHHHHhhCCCccccchHHHhhhhhhcCC
Q 033891           82 AAIAEAEGVSKIEATHLERILPQLLLDF  109 (109)
Q Consensus        82 ~~eA~~Eg~~~levehLEKIlPQLLLDF  109 (109)
                      +++|+++|+++|+++|||||+|||||||
T Consensus        53 ~~~a~~e~~~~le~~~LEki~pqLlLDf   80 (81)
T 3b0b_C           53 ARQAQAEDLEKVDIEHVEKVLPQLLLDF   80 (81)
T ss_dssp             HHHHHHTTCSEECHHHHHHHHHHHHHHC
T ss_pred             HHHHHhCCCCeecHHHHHHHHHHHHhcc
Confidence            9999999999999999999999999998



>4dra_E Centromere protein X; DNA binding complex, DNA damage repair, histone-fold, DNA BI protein; 2.41A {Homo sapiens} PDB: 4drb_J Back     alignment and structure
>3v9r_B MHF2, uncharacterized protein YDL160C-A; histone fold, fanconi anemia, DNA repair, DNA BI protein; 2.40A {Saccharomyces cerevisiae} Back     alignment and structure
>3b0c_W CENP-W, centromere protein W; histone fold, DNA binding, DNA binding protein; HET: CIT; 2.20A {Gallus gallus} PDB: 3b0d_W* 3vh5_W 3vh6_W Back     alignment and structure
>1n1j_A NF-YB; histone-like PAIR, DNA binding protein; 1.67A {Homo sapiens} SCOP: a.22.1.3 Back     alignment and structure
>1n1j_B NF-YC; histone-like PAIR, DNA binding protein; 1.67A {Homo sapiens} SCOP: a.22.1.3 Back     alignment and structure
>2byk_B Chrac-14; nucleosome sliding, histone fold, DNA-binding protein; 2.4A {Drosophila melanogaster} SCOP: a.22.1.3 PDB: 2bym_B Back     alignment and structure
>1h3o_B Transcription initiation factor TFIID 20/15 kDa subunits; transcription/TBP-associated factors, TBP-associated factors; 2.3A {Homo sapiens} SCOP: a.22.1.3 Back     alignment and structure
>1b67_A Protein (histone HMFA); DNA binding protein; 1.48A {Methanothermus fervidus} SCOP: a.22.1.2 PDB: 1hta_A 1a7w_A 1b6w_A 1bfm_A Back     alignment and structure
>4g92_C HAPE; transcription factor, nucleosome, minor groove binding, CCAA complex, histone fold motif, specific binding to the ccaat- nucleus; HET: DNA; 1.80A {Aspergillus nidulans} PDB: 4g91_C* Back     alignment and structure
>1jfi_A Transcription regulator NC2 alpha chain; histone, H2A/H2B, tata-DNA, transcription initiation, NC2, negative cofactor, structural genomics, PSI; 2.62A {Homo sapiens} SCOP: a.22.1.3 Back     alignment and structure
>2byk_A Chrac-16; nucleosome sliding, histone fold, DNA-binding protein; 2.4A {Drosophila melanogaster} SCOP: a.22.1.3 PDB: 2bym_A Back     alignment and structure
>1jfi_B DR1 protein, transcription regulator NC2 beta chain; histone, H2A/H2B, tata-DNA, transcription initiation, NC2, negative cofactor, structural genomics, PSI; 2.62A {Homo sapiens} SCOP: a.22.1.3 Back     alignment and structure
>1f1e_A Histone fold protein; archaeal histone protein, DNA binding protein; HET: MSE; 1.37A {Methanopyrus kandleri} SCOP: a.22.1.2 Back     alignment and structure
>2hue_C Histone H4; mini beta sheet, elongated beta sandwhich, DNA binding prote; 1.70A {Xenopus laevis} SCOP: a.22.1.1 PDB: 3nqj_B 1aoi_B 3kwq_B* 1hio_D 2yfv_B Back     alignment and structure
>1id3_B Histone H4; nucleosome core particle, chromatin, protein/DNA interaction, nucleoprotein, supercoiled DNA; 3.10A {Saccharomyces cerevisiae} SCOP: a.22.1.1 Back     alignment and structure
>1ku5_A HPHA, archaeal histon; histone fold, DNA binding protein; 2.30A {Pyrococcus horikoshii} SCOP: a.22.1.2 Back     alignment and structure
>2yfw_B Histone H4, H4; cell cycle, kinetochore, centromere, histone chaperone, BUDD; 2.60A {Kluyveromyces lactis nrrl y-1140} Back     alignment and structure
>1tzy_D Histone H4-VI; histone-fold, tetramer-dimer-dimer, DNA binding protein; 1.90A {Gallus gallus} SCOP: a.22.1.1 PDB: 1f66_B 1eqz_D 1hq3_D 1u35_B 2aro_D 2cv5_B* 2f8n_B 3nqu_B 3r45_B 3azg_B 3a6n_B 3an2_B 3av1_B 3av2_B 3ayw_B 3aze_B 3azf_B 3afa_B 3azh_B 3azk_B ... Back     alignment and structure
>1taf_B TFIID TBP associated factor 62; transcription initiation, histone fold, complex (TWO transcr factors); 2.00A {Drosophila melanogaster} SCOP: a.22.1.3 Back     alignment and structure
>1f1e_A Histone fold protein; archaeal histone protein, DNA binding protein; HET: MSE; 1.37A {Methanopyrus kandleri} SCOP: a.22.1.2 Back     alignment and structure
>3b0c_T CENP-T, centromere protein T; histone fold, DNA binding, DNA binding protein; HET: CIT; 2.20A {Gallus gallus} PDB: 3b0d_T* 3vh5_T 3vh6_T Back     alignment and structure
>2f8n_G Core histone macro-H2A.1; nucleosome, NCP, macroh2A, histone variant, chromatin, X- RAY structure, crystallography, structural protein/DNA complex; 2.90A {Homo sapiens} SCOP: a.22.1.1 PDB: 1u35_C Back     alignment and structure
>2nqb_C Histone H2A; nucleosome, NCP, chromatin, structural protein/DNA complex; 2.30A {Drosophila melanogaster} PDB: 2pyo_C* Back     alignment and structure
>1tzy_A Histone H2A-IV; histone-fold, tetramer-dimer-dimer, DNA binding protein; 1.90A {Gallus gallus} SCOP: a.22.1.1 PDB: 1eqz_A 1hq3_A 2aro_A 2hio_A 3c9k_A 3azg_C 3a6n_C 3an2_C 3av1_C 3av2_C 3ayw_C 3aze_C 3azf_C 3afa_C 3azh_C 3azi_C 3azj_C 3azk_C 3azl_C 3azm_C ... Back     alignment and structure
>1id3_C Histone H2A.1; nucleosome core particle, chromatin, protein/DNA interaction, nucleoprotein, supercoiled DNA; 3.10A {Saccharomyces cerevisiae} SCOP: a.22.1.1 Back     alignment and structure
>1f66_C Histone H2A.Z; nucleosome, chromatin, histone variant, protein DNA interaction, nucleoprotein, supercoiled DNA, complex (nucleosome core/DNA); 2.60A {Homo sapiens} SCOP: a.22.1.1 Back     alignment and structure
>2f8n_K Histone H2A type 1; nucleosome, NCP, macroh2A, histone variant, chromatin, X- RAY structure, crystallography, structural protein/DNA complex; 2.90A {Mus musculus} SCOP: a.22.1.1 Back     alignment and structure
>1bh9_B TAFII28; histone fold, tata binding protein, transcription regulation complex; HET: PMB; 2.60A {Homo sapiens} SCOP: a.22.1.3 PDB: 1bh8_B* Back     alignment and structure
>2jss_A Chimera of histone H2B.1 and histone H2A.Z; histone/chaperone complex, intrinsically unfolded protein, chaperone/structural protein complex; NMR {Saccharomyces cerevisiae} SCOP: a.22.1.1 a.22.1.1 Back     alignment and structure
>2v1u_A Cell division control protein 6 homolog; DNA replication, nucleotide-binding, replication, archaea; HET: ADP; 3.10A {Aeropyrum pernix} Back     alignment and structure
>3uk6_A RUVB-like 2; hexameric AAA+ ATP-ASE, DNA unwinding, hydrolase; HET: ADP; 2.95A {Homo sapiens} PDB: 2xsz_D* Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query109
d1n1jb_78 Nuclear transcription factor Y subunit gamma (Nf-Y 97.07
d1ku5a_66 Archaeal histone {Archaeon (Pyrococcus horikoshii) 96.97
d1jfia_66 Negative cofactor 2, NC2, alpha chain {Human (Homo 96.89
d1htaa_68 Archaeal histone {Archaeon Methanothermus fervidus 96.81
d1n1ja_87 Nuclear transcription factor Y subunit beta (Nf-Yb 96.59
d1h3ob_74 TAF(II)-20, (TAF(II)-15, hTAF12), histone fold dom 96.18
d2bykb189 Chrac-14 {Fruit fly (Drosophila melanogaster) [Tax 96.14
d1f1ea_151 Archaeal histone {Archaeon Methanopyrus kandleri [ 95.73
d1jfib_135 Negative cofactor 2, NC2, beta chain {Human (Homo 95.3
d2byka172 Chrac-16 {Fruit fly (Drosophila melanogaster) [Tax 94.45
d2huec182 Histone H4 {African clawed frog (Xenopus laevis) [ 93.35
d1w5sa2287 CDC6-like protein APE0152, N-terminal domain {Aero 91.05
d1f1ea_151 Archaeal histone {Archaeon Methanopyrus kandleri [ 90.61
d1fnna2276 CDC6, N-domain {Archaeon Pyrobaculum aerophilum [T 89.54
>d1n1jb_ a.22.1.3 (B:) Nuclear transcription factor Y subunit gamma (Nf-Yc2) {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: All alpha proteins
fold: Histone-fold
superfamily: Histone-fold
family: TBP-associated factors, TAFs
domain: Nuclear transcription factor Y subunit gamma (Nf-Yc2)
species: Human (Homo sapiens) [TaxId: 9606]
Probab=97.07  E-value=0.00074  Score=41.99  Aligned_cols=51  Identities=18%  Similarity=0.188  Sum_probs=46.9

Q ss_pred             CCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCccccchHHHhhhh
Q 033891           54 RPTSANANALKLSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQ  104 (109)
Q Consensus        54 ~~TkIs~dAl~L~aeyL~iFV~EAv~RA~~eA~~Eg~~~levehLEKIlPQ  104 (109)
                      +.++|++||..++++..++||.+-+..|...|+.++...|..+||-+.+-+
T Consensus        15 ~~~~is~ea~~~i~~a~E~Fi~~l~~~A~~~a~~~~rkti~~~dl~~av~~   65 (78)
T d1n1jb_          15 DVKMISAEAPVLFAKAAQIFITELTLRAWIHTEDNKRRTLQRNDIAMAITK   65 (78)
T ss_dssp             TCCCBCTHHHHHHHHHHHHHHHHHHHHHHHHHHHTTCSEECHHHHHHHHTT
T ss_pred             cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcCCHHHHHHHHhc
Confidence            567899999999999999999999999999999999999999999887654



>d1ku5a_ a.22.1.2 (A:) Archaeal histone {Archaeon (Pyrococcus horikoshii) [TaxId: 53953]} Back     information, alignment and structure
>d1jfia_ a.22.1.3 (A:) Negative cofactor 2, NC2, alpha chain {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1htaa_ a.22.1.2 (A:) Archaeal histone {Archaeon Methanothermus fervidus, histone A [TaxId: 2180]} Back     information, alignment and structure
>d1n1ja_ a.22.1.3 (A:) Nuclear transcription factor Y subunit beta (Nf-Yb3) {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1h3ob_ a.22.1.3 (B:) TAF(II)-20, (TAF(II)-15, hTAF12), histone fold domain {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d2bykb1 a.22.1.3 (B:11-99) Chrac-14 {Fruit fly (Drosophila melanogaster) [TaxId: 7227]} Back     information, alignment and structure
>d1f1ea_ a.22.1.2 (A:) Archaeal histone {Archaeon Methanopyrus kandleri [TaxId: 2320]} Back     information, alignment and structure
>d1jfib_ a.22.1.3 (B:) Negative cofactor 2, NC2, beta chain {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d2byka1 a.22.1.3 (A:29-100) Chrac-16 {Fruit fly (Drosophila melanogaster) [TaxId: 7227]} Back     information, alignment and structure
>d2huec1 a.22.1.1 (C:20-101) Histone H4 {African clawed frog (Xenopus laevis) [TaxId: 8355]} Back     information, alignment and structure
>d1w5sa2 c.37.1.20 (A:7-293) CDC6-like protein APE0152, N-terminal domain {Aeropyrum pernix [TaxId: 56636]} Back     information, alignment and structure
>d1f1ea_ a.22.1.2 (A:) Archaeal histone {Archaeon Methanopyrus kandleri [TaxId: 2320]} Back     information, alignment and structure
>d1fnna2 c.37.1.20 (A:1-276) CDC6, N-domain {Archaeon Pyrobaculum aerophilum [TaxId: 13773]} Back     information, alignment and structure