Citrus Sinensis ID: 024038


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270---
MTNSSSSKKHGPGQPEESGPTLKLQRIKMSKPEEAEKKNLNKKLKDVEISIPIVYGNVAFWLGKKASEYQSHKWTVYVRGATNEDLGVVIKRAVFQLHSSFNNPTRAVESPPFELSESGWGEFEIAITLYFHADVCDKPLNLYHHLKLYPEDESGSMSTKKPVVVESYDEIVFPEPSDSFLARVQNHPAVTLPRLPVGFTLPPPVPIEDTSKRKRGDTKDHPLAQWFMNFSEADELLQLAAARQQVQAHIAKLKRQISLIDGQQQQLRSTSDP
cccccccccccccccccccccccccccccccccHHHHHHccccccccEEEEEEEEccEEEEccccccccccEEEEEEEEccccccccccEEEEEEEEccccccccEEEEccccEEEEccEEEEEEEEEEEEEcccccccEEEEEEEEEccccccccccccccEEEEEccEEEcccccHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccc
ccccccccHcccccccccccccccccccccccccccccccccEEccEEEEEEEEEEcEEEEcccccccccccEEEEEEEccccccHHHHEEEEEEEEccccccccEEEEcccEEEEEcccEEEEEEEEEEEEcccccccEEEEEEEEEccccccccccccccEEEEEEcEEEcccccHHHHHHHHcccccccccccccccccccccccccccccccccccccHHcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcHHHHHcccccc
mtnsssskkhgpgqpeesgptlklqrikmskpeeAEKKNLNKKLKDVEISIPIVYGNVAFWLGKKaseyqshkwTVYVRGATNEDLGVVIKRAVFQLhssfnnptravesppfelsesgwgEFEIAITLYFHadvcdkplnlyhhlklypedesgsmstkkpvvvesydeivfpepsdsflarvqnhpavtlprlpvgftlpppvpiedtskrkrgdtkdhplaQWFMNFSEADELLQLAAARQQVQAHIAKLKRQISlidgqqqqlrstsdp
mtnsssskkhgpgqpeesgptlklqrikmskpeeaekknlnkklkdveisIPIVYGNVAFWLGKKASEYQSHKWTVYVRGATNEDLGVVIKRAVFQLHssfnnptravesppFELSESGWGEFEIAITLYFHADVCDKPLNLYHHLKLypedesgsmstKKPVVVESYDEIVFPEPSDSFLARVQNHPAvtlprlpvgftlpppvpiedtskrkrgdtKDHPLAQWFMNFSEADELLQLAAARQQVQAHIAKLKRQIslidgqqqqlrstsdp
MTNSSSSKKHGPGQPEESGPTLKLQRIKMSKPEEAEkknlnkklkDVEISIPIVYGNVAFWLGKKASEYQSHKWTVYVRGATNEDLGVVIKRAVFQLHSSFNNPTRAVESPPFELSESGWGEFEIAITLYFHADVCDKPLNLYHHLKLYPEDESGSMSTKKPVVVESYDEIVFPEPSDSFLARVQNHPAVTLPRLPVGFTLPPPVPIEDTSKRKRGDTKDHPLAQWFMNFSEADEllqlaaarqqvqaHIAKLKRQISLIDGQQQQLRSTSDP
*******************************************LKDVEISIPIVYGNVAFWLGKKASEYQSHKWTVYVRGATNEDLGVVIKRAVFQLHSSFN***********ELSESGWGEFEIAITLYFHADVCDKPLNLYHHLKLYP************VVVESYDEIVFPEPSDSFLARVQNHPAVTLPRLPVGFTL*********************LAQWFMNFSEADELLQLAAARQQVQAHIAKLKRQISL**************
***********************************************EISIPIVYGNVAFWLGK*ASEYQSHKWTVYVRGATNEDLGVVIKRAVFQLHSSFNNPTRAVESPPFELSESGWGEFEIAITLYFHADVCDKPLNLYHHLKLYPED************VESYDEIVFPEPSDSFL******************************************************LLQLAAA************************L******
*********************LKLQRIKMSKPEEAEKKNLNKKLKDVEISIPIVYGNVAFWLGKKASEYQSHKWTVYVRGATNEDLGVVIKRAVFQLHSSFNNPTRAVESPPFELSESGWGEFEIAITLYFHADVCDKPLNLYHHLKLYPE**********PVVVESYDEIVFPEPSDSFLARVQNHPAVTLPRLPVGFTLPPPVPIEDTSKRKRGDTKDHPLAQWFMNFSEADELLQLAAARQQVQAHIAKLKRQISLID************
****************************************NKKLKDVEISIPIVYGNVAFWLGKKASEYQSHKWTVYVRGATNEDLGVVIKRAVFQLHSSFNNPTRAVESPPFELSESGWGEFEIAITLYFHADVCDKPLNLYHHLKLYPEDESGSMSTKKPVVVESYDEIVFPEPSDSFLARVQNHPAVTL***************************DHPLAQWFMNFSEADELLQLAAARQQVQAHIAKLKRQISLIDGQQQQ*******
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MTNSSSSKKHGPGQPEESGPTLKLQRIKMSKPEEAEKKNLNKKLKDVEISIPIVYGNVAFWLGKKASEYQSHKWTVYVRGATNEDLGVVIKRAVFQLHSSFNNPTRAVESPPFELSESGWGEFEIAITLYFHADVCDKPLNLYHHLKLYPEDESGSMSTKKPVVVESYDEIVFPEPSDSFLARVQNHPAVTLPRLPVGFTLPPPVPIEDTSKRKRGDTKDHPLAQWFMNFSEADExxxxxxxxxxxxxxxxxxxxxISLIDGQQQQLRSTSDP
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query273 2.2.26 [Sep-21-2011]
Q755P0208 Protein AF-9 homolog OS=A yes no 0.556 0.730 0.439 1e-33
P53930226 Protein AF-9 homolog OS=S yes no 0.564 0.681 0.397 4e-31
Q6FXM4221 Protein AF-9 homolog OS=C yes no 0.556 0.687 0.372 1e-29
Q6CIV8220 Protein AF-9 homolog OS=K yes no 0.560 0.695 0.408 3e-29
Q10319217 Protein AF-9 homolog OS=S yes no 0.736 0.926 0.350 5e-29
O95619227 YEATS domain-containing p yes no 0.487 0.585 0.463 2e-26
Q9CR11227 YEATS domain-containing p yes no 0.487 0.585 0.463 2e-26
Q5BC71275 Protein AF-9 homolog OS=E yes no 0.509 0.505 0.417 1e-25
Q4WPM8252 Protein AF-9 homolog OS=N yes no 0.542 0.587 0.374 2e-23
Q6CF24202 Protein AF-9 homolog OS=Y yes no 0.593 0.801 0.325 9e-22
>sp|Q755P0|AF9_ASHGO Protein AF-9 homolog OS=Ashbya gossypii (strain ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056) GN=YAF9 PE=3 SV=1 Back     alignment and function desciption
 Score =  143 bits (360), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 69/157 (43%), Positives = 100/157 (63%), Gaps = 5/157 (3%)

Query: 42  KKLKDVEISIPIVYGNVAFWLGKK----ASEYQSHKWTVYVRGATNEDLGVVIKRAVFQL 97
           K++K + ++ PIVYGN A  +G      A    +H WT++VRG   ED+   IK+ VF+L
Sbjct: 7   KRIKTLSVARPIVYGNTAKKMGDVRPAIAPSEHTHMWTIFVRGPQGEDISYFIKKVVFKL 66

Query: 98  HSSFNNPTRAVESPPFELSESGWGEFEIAITLYFHADVCDKPLNLYHHLKLYP-EDESGS 156
           H ++ NP R V++PPFEL+E+GWGEFEI + ++F  +  +K LN YHHL+L+P  +E G 
Sbjct: 67  HETYPNPVRVVDAPPFELTETGWGEFEINVKVHFVDEANEKMLNFYHHLRLHPYTEEDGR 126

Query: 157 MSTKKPVVVESYDEIVFPEPSDSFLARVQNHPAVTLP 193
            S    V    YDEIVF EP+++F A++   P   LP
Sbjct: 127 RSDGDEVSSVFYDEIVFNEPNEAFFAKMIEQPGNLLP 163




Component of the SWR1 complex which mediates the ATP-dependent exchange of histone H2A for the H2A variant HZT1 leading to transcriptional regulation of selected genes by chromatin remodeling. Component of the NuA4 histone acetyltransferase complex which is involved in transcriptional activation of selected genes principally by acetylation of nucleosomal histones H4 and H2A. The NuA4 complex is also involved in DNA repair. Yaf9 may also be required for viability in conditions in which the structural integrity of the spindle is compromised.
Ashbya gossypii (strain ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056) (taxid: 284811)
>sp|P53930|AF9_YEAST Protein AF-9 homolog OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=YAF9 PE=1 SV=1 Back     alignment and function description
>sp|Q6FXM4|AF9_CANGA Protein AF-9 homolog OS=Candida glabrata (strain ATCC 2001 / CBS 138 / JCM 3761 / NBRC 0622 / NRRL Y-65) GN=YAF9 PE=3 SV=1 Back     alignment and function description
>sp|Q6CIV8|AF9_KLULA Protein AF-9 homolog OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) GN=YAF9 PE=3 SV=1 Back     alignment and function description
>sp|Q10319|AF9_SCHPO Protein AF-9 homolog OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=yaf9 PE=3 SV=1 Back     alignment and function description
>sp|O95619|YETS4_HUMAN YEATS domain-containing protein 4 OS=Homo sapiens GN=YEATS4 PE=1 SV=1 Back     alignment and function description
>sp|Q9CR11|YETS4_MOUSE YEATS domain-containing protein 4 OS=Mus musculus GN=Yeats4 PE=2 SV=1 Back     alignment and function description
>sp|Q5BC71|AF9_EMENI Protein AF-9 homolog OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=yaf9 PE=3 SV=1 Back     alignment and function description
>sp|Q4WPM8|AF9_ASPFU Protein AF-9 homolog OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=yaf9 PE=3 SV=2 Back     alignment and function description
>sp|Q6CF24|AF9_YARLI Protein AF-9 homolog OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=YAF9 PE=3 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query273
356545002273 PREDICTED: protein AF-9 homolog [Glycine 0.996 0.996 0.841 1e-135
224080520271 predicted protein [Populus trichocarpa] 0.930 0.937 0.871 1e-130
225427812273 PREDICTED: protein AF-9 homolog [Vitis v 0.989 0.989 0.819 1e-127
297794703268 hypothetical protein ARALYDRAFT_494425 [ 0.978 0.996 0.757 1e-122
15242448268 YEATS family protein [Arabidopsis thalia 0.978 0.996 0.757 1e-121
334188213267 YEATS family protein [Arabidopsis thalia 0.974 0.996 0.757 1e-121
297744708246 unnamed protein product [Vitis vinifera] 0.879 0.975 0.85 1e-120
357473621245 YEATS domain-containing protein [Medicag 0.893 0.995 0.799 1e-118
224103299282 predicted protein [Populus trichocarpa] 0.893 0.865 0.754 1e-117
255543907227 YEATS domain-containing protein, putativ 0.794 0.955 0.903 1e-115
>gi|356545002|ref|XP_003540935.1| PREDICTED: protein AF-9 homolog [Glycine max] Back     alignment and taxonomy information
 Score =  486 bits (1252), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 229/272 (84%), Positives = 248/272 (91%)

Query: 1   MTNSSSSKKHGPGQPEESGPTLKLQRIKMSKPEEAEKKNLNKKLKDVEISIPIVYGNVAF 60
           MTNSSSS KHG  QP+ SGPT K QR KM K E+ +KKNL KKLKDVEISIPIVYGNVAF
Sbjct: 1   MTNSSSSTKHGQDQPDLSGPTPKSQRTKMGKSEDNDKKNLGKKLKDVEISIPIVYGNVAF 60

Query: 61  WLGKKASEYQSHKWTVYVRGATNEDLGVVIKRAVFQLHSSFNNPTRAVESPPFELSESGW 120
           WLGKKASEYQSHKWTVYVRGATNEDLG +IK AVFQLHSSFNNPTR VESPPFELSESGW
Sbjct: 61  WLGKKASEYQSHKWTVYVRGATNEDLGTIIKHAVFQLHSSFNNPTRVVESPPFELSESGW 120

Query: 121 GEFEIAITLYFHADVCDKPLNLYHHLKLYPEDESGSMSTKKPVVVESYDEIVFPEPSDSF 180
           GEFE++ITLYFH+DVCDKPLNLYHHLKLYPEDE+ SMSTKKPVVVE YDEIVFP+PS++F
Sbjct: 121 GEFEVSITLYFHSDVCDKPLNLYHHLKLYPEDENSSMSTKKPVVVEFYDEIVFPDPSEAF 180

Query: 181 LARVQNHPAVTLPRLPVGFTLPPPVPIEDTSKRKRGDTKDHPLAQWFMNFSEADELLQLA 240
           LARVQNHPAV LPRLP G TLPP +P+ED SKR++GDTKDH L+QWFMNFSEADELLQLA
Sbjct: 181 LARVQNHPAVNLPRLPAGLTLPPSIPVEDASKRRKGDTKDHSLSQWFMNFSEADELLQLA 240

Query: 241 AARQQVQAHIAKLKRQISLIDGQQQQLRSTSD 272
           AARQQVQAHIAKL+RQISLIDGQ QQ +S+SD
Sbjct: 241 AARQQVQAHIAKLRRQISLIDGQHQQFKSSSD 272




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224080520|ref|XP_002306148.1| predicted protein [Populus trichocarpa] gi|222849112|gb|EEE86659.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225427812|ref|XP_002275472.1| PREDICTED: protein AF-9 homolog [Vitis vinifera] Back     alignment and taxonomy information
>gi|297794703|ref|XP_002865236.1| hypothetical protein ARALYDRAFT_494425 [Arabidopsis lyrata subsp. lyrata] gi|297311071|gb|EFH41495.1| hypothetical protein ARALYDRAFT_494425 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|15242448|ref|NP_199373.1| YEATS family protein [Arabidopsis thaliana] gi|10177934|dbj|BAB11199.1| unnamed protein product [Arabidopsis thaliana] gi|18175886|gb|AAL59945.1| unknown protein [Arabidopsis thaliana] gi|20465403|gb|AAM20126.1| unknown protein [Arabidopsis thaliana] gi|39545912|gb|AAR28019.1| TAF14b [Arabidopsis thaliana] gi|332007890|gb|AED95273.1| YEATS family protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|334188213|ref|NP_001190475.1| YEATS family protein [Arabidopsis thaliana] gi|332007891|gb|AED95274.1| YEATS family protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297744708|emb|CBI37970.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|357473621|ref|XP_003607095.1| YEATS domain-containing protein [Medicago truncatula] gi|355508150|gb|AES89292.1| YEATS domain-containing protein [Medicago truncatula] Back     alignment and taxonomy information
>gi|224103299|ref|XP_002313001.1| predicted protein [Populus trichocarpa] gi|222849409|gb|EEE86956.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255543907|ref|XP_002513016.1| YEATS domain-containing protein, putative [Ricinus communis] gi|223548027|gb|EEF49519.1| YEATS domain-containing protein, putative [Ricinus communis] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query273
TAIR|locus:2157156268 GAS41 "GLIOMAS 41" [Arabidopsi 0.978 0.996 0.694 3.7e-103
UNIPROTKB|Q32LE1227 YEATS4 "Uncharacterized protei 0.472 0.568 0.492 1.8e-30
UNIPROTKB|E2QSI2227 YEATS4 "Uncharacterized protei 0.472 0.568 0.492 1.8e-30
UNIPROTKB|O95619227 YEATS4 "YEATS domain-containin 0.472 0.568 0.492 1.8e-30
MGI|MGI:1927224227 Yeats4 "YEATS domain containin 0.472 0.568 0.492 1.8e-30
RGD|1305741227 Yeats4 "YEATS domain containin 0.472 0.568 0.492 1.8e-30
UNIPROTKB|Q8UVS4227 GAS41 "GAS41" [Gallus gallus ( 0.472 0.568 0.477 4.9e-30
UNIPROTKB|E1BU44224 YEATS4 "Uncharacterized protei 0.472 0.575 0.481 1.6e-29
ZFIN|ZDB-GENE-040718-252226 yeats4 "YEATS domain containin 0.468 0.566 0.477 2.1e-29
UNIPROTKB|F8W1B9211 YEATS4 "YEATS domain-containin 0.336 0.436 0.457 5.9e-29
TAIR|locus:2157156 GAS41 "GLIOMAS 41" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1022 (364.8 bits), Expect = 3.7e-103, P = 3.7e-103
 Identities = 189/272 (69%), Positives = 221/272 (81%)

Query:     1 MTNSSSSKKHGPGQPEESGPTLKLQRIKMSKPEEAEXXXXXXXXXDVEISIPIVYGNVAF 60
             MTNSSSSKK    QPE S PTLK  + KM+K +E +         D+EIS+PIVYGNVAF
Sbjct:     1 MTNSSSSKKQAQDQPETSEPTLKSLKTKMTKSDEKQKKLK-----DIEISVPIVYGNVAF 55

Query:    61 WLGKKASEYQSHKWTVYVRGATNEDLGVVIKRAVFQLHSSFNNPTRAVESPPFELSESGW 120
             WLGKKASEYQSHKW VYVRGATNED+ VV+K+ VFQLHSSFN+PTR +E PPFE+SESGW
Sbjct:    56 WLGKKASEYQSHKWAVYVRGATNEDISVVVKKVVFQLHSSFNSPTRVIEEPPFEVSESGW 115

Query:   121 GEFEIAITLYFHADVCDKPLNLYHHLKLYPEDESGSMSTKKPVVVESYDEIVFPEPSDSF 180
             GEFEIA+TL+FH+DVCDKPL+LYHHLKLYPEDESG ++ KKPVVVESYDEIVFP+PS+SF
Sbjct:   116 GEFEIAMTLHFHSDVCDKPLSLYHHLKLYPEDESGPLTMKKPVVVESYDEIVFPDPSESF 175

Query:   181 LARVQNHPAVTLPRLPVGFTLPPPVPIEDTSKRKRGDTKDHPLAQWFMNFSEADEXXXXX 240
             LARVQNHPA+T PRLP G+ LP P+ +EDT K+KRGDTKDH L QWFM+FSEADE     
Sbjct:   176 LARVQNHPALTFPRLPSGYNLPAPMQVEDTGKKKRGDTKDHSLGQWFMSFSEADELLQLA 235

Query:   241 XXXXXXXXHIAKLKRQISLIDGQQQQLRSTSD 272
                     HIAKL+RQISL++GQ Q +++ SD
Sbjct:   236 AARQQVQAHIAKLRRQISLLEGQNQTVKTGSD 267




GO:0005634 "nucleus" evidence=IEA;ISS
GO:0006355 "regulation of transcription, DNA-dependent" evidence=ISS
GO:0009507 "chloroplast" evidence=ISM
GO:0010228 "vegetative to reproductive phase transition of meristem" evidence=RCA
GO:0048510 "regulation of timing of transition from vegetative to reproductive phase" evidence=IMP
GO:0090239 "regulation of histone H4 acetylation" evidence=IMP
UNIPROTKB|Q32LE1 YEATS4 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|E2QSI2 YEATS4 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|O95619 YEATS4 "YEATS domain-containing protein 4" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
MGI|MGI:1927224 Yeats4 "YEATS domain containing 4" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
RGD|1305741 Yeats4 "YEATS domain containing 4" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|Q8UVS4 GAS41 "GAS41" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|E1BU44 YEATS4 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-040718-252 yeats4 "YEATS domain containing 4" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
UNIPROTKB|F8W1B9 YEATS4 "YEATS domain-containing protein 4" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query273
pfam0336684 pfam03366, YEATS, YEATS family 1e-35
COG5033225 COG5033, TFG3, Transcription initiation factor IIF 1e-23
>gnl|CDD|190614 pfam03366, YEATS, YEATS family Back     alignment and domain information
 Score =  122 bits (308), Expect = 1e-35
 Identities = 47/83 (56%), Positives = 59/83 (71%), Gaps = 2/83 (2%)

Query: 71  SHKWTVYVRGATNE-DLGVVIKRAVFQLHSSFNNPTRAVESPPFELSESGWGEFEIAITL 129
           +HKWTV+VRG  NE DL   IK+  F+LH SF NP R V  PPFE++E+GWGEFEI I +
Sbjct: 1   THKWTVFVRGLDNEGDLSYFIKKVTFKLHESFPNPVRTVTKPPFEVTETGWGEFEIPIKI 60

Query: 130 YFHADVCDKPLNLYHHLKLYPED 152
           YF  D  +KP+ + H LKL+PE 
Sbjct: 61  YFV-DSNEKPVTIQHDLKLHPEG 82


We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity. Length = 84

>gnl|CDD|227366 COG5033, TFG3, Transcription initiation factor IIF, auxiliary subunit [Transcription] Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 273
KOG3149249 consensus Transcription initiation factor IIF, aux 100.0
COG5033225 TFG3 Transcription initiation factor IIF, auxiliar 100.0
PF0336684 YEATS: YEATS family; InterPro: IPR005033 Named the 100.0
>KOG3149 consensus Transcription initiation factor IIF, auxiliary subunit [Transcription] Back     alignment and domain information
Probab=100.00  E-value=1.2e-42  Score=317.59  Aligned_cols=228  Identities=36%  Similarity=0.582  Sum_probs=184.4

Q ss_pred             cccceeceEEEEEEEEccceEEcCCCCCCCCeeeEEEEEeCCCCCCcccceeeeEEEeCCCCCCCcceeecCCcEEEeee
Q 024038           40 LNKKLKDVEISIPIVYGNVAFWLGKKASEYQSHKWTVYVRGATNEDLGVVIKRAVFQLHSSFNNPTRAVESPPFELSESG  119 (273)
Q Consensus        40 ~~kR~k~~~I~~pIv~Gn~A~~l~kk~~~~~tH~WtVyVr~~~~edls~~IkKV~F~LHpSF~nP~Rvv~~PPFeVtE~G  119 (273)
                      +.+|++.++|+++|+|||.|.+++++.++.+||.|+|||||.+++|++.||+||+|+||+||+||+|+|++|||+|+|+|
T Consensus         5 ~~~~~~~~~~~~~iv~G~~a~~~~~~~~~~~th~w~v~v~~~~~ed~~~~V~KV~f~LH~sf~~P~Rvv~~pPf~i~EtG   84 (249)
T KOG3149|consen    5 SIKRTKECTISVPIVPGNRAAILGKRLPDGFTHIWEVYVRGPGKEDISAFVDKVVFKLHESFPNPRRVVESPPFEITETG   84 (249)
T ss_pred             CcceeeeeeEEeeeecCccccccCCCCCcccceeeEEEecCcCccccceeeeeeeeecccccccccccccCCCceEEeec
Confidence            56899999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             EEEEEEEEEEEEeecCCCCCEEEEEEeecCCCCCC---C-----------CCcCCCCeEEEeee-EEEeeCCCHHHHHHH
Q 024038          120 WGEFEIAITLYFHADVCDKPLNLYHHLKLYPEDES---G-----------SMSTKKPVVVESYD-EIVFPEPSDSFLARV  184 (273)
Q Consensus       120 WGEFeI~IkI~F~~~~~ekp~~l~H~L~L~~~~~~---~-----------~~~~~~~V~~e~yd-EIvF~nPse~f~~~L  184 (273)
                      ||+|+|.|+|||.++.+++++.++|+|.|++++..   .           ....+.+|+++.|+ +++|++|++.++..+
T Consensus        85 wgeF~i~i~i~f~d~~~~~~v~~~~~l~l~~~~~p~~~~~~~~~~~~~~~~~~~r~~v~~~~~~~e~~f~~~~~~~~~~~  164 (249)
T KOG3149|consen   85 WGEFEIQIEIFFTDDANEKKVTLYHDLKLHSYGAPPVPHEESTKKTFVNPTISLRIPVVREGVDVEIVFPDPTESTSIEA  164 (249)
T ss_pred             cccceEEEEEEeccCCCCceeeeeeeEEeeccCCCCccchhhhcccccccchhcccccccccccceeecCCCCccccccc
Confidence            99999999999999999999999999999987431   1           12347799999999 999999999999999


Q ss_pred             hcCCCccCCCCCCCCCCCCCCCCcccccccCCCCCCCCcchhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q 024038          185 QNHPAVTLPRLPVGFTLPPPVPIEDTSKRKRGDTKDHPLAQWFMNFSEADELLQLAAARQQVQAHIAKLKRQISLIDGQQ  264 (273)
Q Consensus       185 ~~~~~~~~p~~p~~~~lp~~~~~~~~~~~~~~~~~~~~~~~~f~~~~E~~El~~l~~a~~kV~~~i~~lk~~l~~~e~~~  264 (273)
                      ...+..+..+.+....++............+.++++.-...+.-...|..+.++++.+.+.++.++.++++++..++++.
T Consensus       165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~e~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~  244 (249)
T KOG3149|consen  165 SSRPVGPGSNLAAVTDLKQVKSKTLPSLLSKESSKDVKTEKSSERPNEIDEVDRLEKKIKELKKEINKVRKDIEKLEGEI  244 (249)
T ss_pred             CCCCCcCCccccccccccccccccCcccccccccccccccccccccccchhhhhhhhhhhhhhhHHHHHHHhhhhccccc
Confidence            98875443332222222211111111111222232322334445566899999999999999999999999999999886


Q ss_pred             hhc
Q 024038          265 QQL  267 (273)
Q Consensus       265 ~~~  267 (273)
                      ...
T Consensus       245 ~~~  247 (249)
T KOG3149|consen  245 DLQ  247 (249)
T ss_pred             ccc
Confidence            543



>COG5033 TFG3 Transcription initiation factor IIF, auxiliary subunit [Transcription] Back     alignment and domain information
>PF03366 YEATS: YEATS family; InterPro: IPR005033 Named the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit', this family also contains the GAS41 protein Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query273
3rls_A175 Crystal Structure Of Yeast Af-9 Homolog Protein Yaf 2e-30
3fk3_A164 Structure Of The Yeats Domain, Yaf9 Length = 164 4e-28
3qrl_A140 Crystal Structure Of The Taf14 Yeats Domain Length 3e-05
2l7e_A131 The Structure Of A Domain From Yeast Length = 131 3e-05
>pdb|3RLS|A Chain A, Crystal Structure Of Yeast Af-9 Homolog Protein Yaf9 Length = 175 Back     alignment and structure

Iteration: 1

Score = 129 bits (323), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 67/166 (40%), Positives = 97/166 (58%), Gaps = 17/166 (10%) Query: 47 VEISIPIVYGNVAFWLGK----KASEYQSHKWTVYVRGATNEDLGVVIKRAVFQLHSSFN 102 + +S PI+YGN A +G A +H WT++VRG NED+ IK+ VF+LH ++ Sbjct: 4 LSVSRPIIYGNTAKKMGSVKPPNAPAEHTHLWTIFVRGPQNEDISYFIKKVVFKLHDTYP 63 Query: 103 NPTRAVESPPFELSESGWGEFEIAITLYFHADVCDKPLNLYHHLKLYP------------ 150 NP R++E+PPFEL+E+GWGEF+I I +YF + +K LN YH L+L+P Sbjct: 64 NPVRSIEAPPFELTETGWGEFDINIKVYFVEEANEKVLNFYHRLRLHPYANPVPNSDNGN 123 Query: 151 EDESGSMSTKKPVVVESY-DEIVFPEPSDSFLARVQNHPAVTLPRL 195 E + ++K V Y DEIVF EP++ F + + P LP L Sbjct: 124 EQNTTDHNSKDAEVSSVYFDEIVFNEPNEEFFKILMSRPGNLLPSL 169
>pdb|3FK3|A Chain A, Structure Of The Yeats Domain, Yaf9 Length = 164 Back     alignment and structure
>pdb|3QRL|A Chain A, Crystal Structure Of The Taf14 Yeats Domain Length = 140 Back     alignment and structure
>pdb|2L7E|A Chain A, The Structure Of A Domain From Yeast Length = 131 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query273
3rls_A175 YAF9, protein AF-9 homolog; yeats domain, histone, 4e-52
3qrl_A140 Transcription initiation factor TFIID subunit 14; 1e-37
>3rls_A YAF9, protein AF-9 homolog; yeats domain, histone, transcription; 1.70A {Saccharomyces cerevisiae} PDB: 3fk3_A Length = 175 Back     alignment and structure
 Score =  167 bits (423), Expect = 4e-52
 Identities = 66/170 (38%), Positives = 95/170 (55%), Gaps = 17/170 (10%)

Query: 44  LKDVEISIPIVYGNVAFWLGK----KASEYQSHKWTVYVRGATNEDLGVVIKRAVFQLHS 99
           +K + +S PI+YGN A  +G      A    +H WT++VRG  NED+   IK+ VF+LH 
Sbjct: 1   IKTLSVSRPIIYGNTAKKMGSVKPPNAPAEHTHLWTIFVRGPQNEDISYFIKKVVFKLHD 60

Query: 100 SFNNPTRAVESPPFELSESGWGEFEIAITLYFHADVCDKPLNLYHHLKLYPEDESG---- 155
           ++ NP R++E+PPFEL+E+GWGEF+I I +YF  +  +K LN YH L+L+P         
Sbjct: 61  TYPNPVRSIEAPPFELTETGWGEFDINIKVYFVEEANEKVLNFYHRLRLHPYANPVPNSD 120

Query: 156 ---------SMSTKKPVVVESYDEIVFPEPSDSFLARVQNHPAVTLPRLP 196
                      S    V    +DEIVF EP++ F   + + P   LP L 
Sbjct: 121 NGNEQNTTDHNSKDAEVSSVYFDEIVFNEPNEEFFKILMSRPGNLLPSLE 170


>3qrl_A Transcription initiation factor TFIID subunit 14; yeats domain, IG fold, nucleus, nuclear protein; HET: PGE; 1.70A {Saccharomyces cerevisiae} PDB: 2l7e_A Length = 140 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query273
3rls_A175 YAF9, protein AF-9 homolog; yeats domain, histone, 100.0
3qrl_A140 Transcription initiation factor TFIID subunit 14; 100.0
2l5g_B42 Putative uncharacterized protein NCOR2, G protein 83.09
>3rls_A YAF9, protein AF-9 homolog; yeats domain, histone, transcription; 1.70A {Saccharomyces cerevisiae} PDB: 3fk3_A Back     alignment and structure
Probab=100.00  E-value=6.2e-56  Score=385.31  Aligned_cols=151  Identities=43%  Similarity=0.833  Sum_probs=135.4

Q ss_pred             eeceEEEEEEEEccceEEcCC----CCCCCCeeeEEEEEeCCCCCCcccceeeeEEEeCCCCCCCcceeecCCcEEEeee
Q 024038           44 LKDVEISIPIVYGNVAFWLGK----KASEYQSHKWTVYVRGATNEDLGVVIKRAVFQLHSSFNNPTRAVESPPFELSESG  119 (273)
Q Consensus        44 ~k~~~I~~pIv~Gn~A~~l~k----k~~~~~tH~WtVyVr~~~~edls~~IkKV~F~LHpSF~nP~Rvv~~PPFeVtE~G  119 (273)
                      |||++|+|||||||+|+++++    +++++|||+|+|||||++++|+++||+||+|+|||||+||+|+|++|||+|+|+|
T Consensus         1 vk~v~i~kpIv~Gn~a~~l~~~~~~~~~~~~TH~WtVyVr~~~~edis~~v~KV~F~LHpSF~np~Rvv~~PPFevtE~G   80 (175)
T 3rls_A            1 IKTLSVSRPIIYGNTAKKMGSVKPPNAPAEHTHLWTIFVRGPQNEDISYFIKKVVFKLHDTYPNPVRSIEAPPFELTETG   80 (175)
T ss_dssp             CCCCCEEEEEEEEEEEEECCSCCCTTCCTTCCEEEEEEEECGGGCCCTTTEEEEEEECCTTSSSCEEEECSSSEEEEEEE
T ss_pred             CCceEEEeCEEEcceeEECCccccCCCCCCCcEEEEEEEECCCCCChhheEEEEEEEcCCCCCCCcEEEeCCCCEEEEeE
Confidence            689999999999999999996    3567899999999999999999999999999999999999999999999999999


Q ss_pred             EEEEEEEEEEEEeecCCCCCEEEEEEeecCCCCCCC-------C------CcCCCCeEEEeeeEEEeeCCCHHHHHHHhc
Q 024038          120 WGEFEIAITLYFHADVCDKPLNLYHHLKLYPEDESG-------S------MSTKKPVVVESYDEIVFPEPSDSFLARVQN  186 (273)
Q Consensus       120 WGEFeI~IkI~F~~~~~ekp~~l~H~L~L~~~~~~~-------~------~~~~~~V~~e~ydEIvF~nPse~f~~~L~~  186 (273)
                      ||||+|.|+|||++++++|+++|+|+|+||+++.+.       .      ..+++||++|+||||||+||+|.||++|++
T Consensus        81 WGeF~i~I~i~F~~~~~ek~i~i~H~L~L~~~~~~~~~~~~~~~~~~~~~~~~~~~V~se~ydEivF~ePte~f~~~L~~  160 (175)
T 3rls_A           81 WGEFDINIKVYFVEEANEKVLNFYHRLRLHPYANPVPNSDNGNEQNTTDHNSKDAEVSSVYFDEIVFNEPNEEFFKILMS  160 (175)
T ss_dssp             SSCCEEEEEEEECGGGCCCCEEEEEECCCCC-----------------------CCEEEEEEEEEEESSCCHHHHHHHHH
T ss_pred             EeeEEEEEEEEEeCCCCCccEEEEEEEEecCCCCccccccccccccccccccCCCceEEEEeccEEEeCCCHHHHHHHHh
Confidence            999999999999988899999999999999986441       1      124789999999999999999999999999


Q ss_pred             CCCccCCC
Q 024038          187 HPAVTLPR  194 (273)
Q Consensus       187 ~~~~~~p~  194 (273)
                      +|++++|.
T Consensus       161 ~p~~~lp~  168 (175)
T 3rls_A          161 RPGNLLPS  168 (175)
T ss_dssp             STTCCSCS
T ss_pred             CCCccCCC
Confidence            99988763



>3qrl_A Transcription initiation factor TFIID subunit 14; yeats domain, IG fold, nucleus, nuclear protein; HET: PGE; 1.70A {Saccharomyces cerevisiae} PDB: 2l7e_A Back     alignment and structure
>2l5g_B Putative uncharacterized protein NCOR2, G protein pathway suppressor 2; GPS2, SMRT, TBL1, CO-repressor, transcription regulator; NMR {Homo sapiens} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00