Citrus Sinensis ID: 024913


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260
MVADKGKKTKVEEENAEQIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIPNGVNHEKKGNKRPLAEESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNNEADEEEFEGDEEGKEDDDSEDDEDDQEEDDDDEDGDDEGN
cccccccccHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHcccccEEEEEccccHHHHcccHHHHHHHHHHcccEEEEcccccccEEEEEEEccccccccccEEEEEEEEccccccEEEEEEEEcccccccccccccccccccccccccccEEccccccccccccccHHHHHHHHHHcccccHHHHccccccccccccccccccccccccccccccccccccccccccc
ccccccccccccHccHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccEHHHHcccHHHHHHHcHcHHHHHHHHHccEEEEcccccccEEEEEEEcccccccccEEEEEEEEEcccccEEEEEccEEEcccccccHcccccccccccccccccHccccccccccccccHHHHHHHHHHHHHcccccHHHHccccccccccccccccccccccccEEEEEEcccccccccccc
mvadkgkktkveeENAEQIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIrkpvydkrndIIKSIPDFWLTAFishpalgellseEDQKIFRYLSslevedfkdvksgysitfnfspnpyfednkltktftfldddgsmkITATSIKwkegmgipngvnhekkgnkrplaeesfftwfsdtqekdtIDGIQDEVAEIIKedlwpnpltyfnneadeeefegdeegkedddseddeddqeeddddedgddegn
mvadkgkktkveeenaeqidselvlSIEKLQEIQDELEKINeeasekvleveqkyseirkpvydkrnDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSlevedfkdvKSGYSITfnfspnpyfedNKLTKTftfldddgsmKITATsikwkegmgipngvnhekkgnkrpLAEESFFTWFSDTQEKDTIDGIQDEVAEIIkedlwpnpLTYFNNEadeeefegdeegkedddseddeddqeeddddedgddegn
MVADKGKKTKVEEENAEQIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIPNGVNHEKKGNKRPLAEESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNNeadeeefegdeegkedddseddeddqeeddddedgddegN
******************************************************YSEIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGI*****************ESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFN*****************************************
**************************IEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGI****************EESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNN****************************************
*****************QIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIPNGVNHEKKGNKRPLAEESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNNEAD*************************************
*****************QIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIP*G*************EESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNNEAD****************EDDEDDQEEDD**********
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MVADKGKKTKVEEENxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxYSEIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIPNGVNHEKKGNKRPLAEESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNNEADEEEFEGDEEGKEDDDSEDDEDDQEEDDDDEDGDDEGN
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query260 2.2.26 [Sep-21-2011]
P53997269 Protein SET OS=Drosophila yes no 0.723 0.698 0.45 7e-45
Q01105290 Protein SET OS=Homo sapie yes no 0.719 0.644 0.448 3e-44
Q9EQU5289 Protein SET OS=Mus muscul yes no 0.719 0.647 0.448 6e-44
Q63945289 Protein SET OS=Rattus nor yes no 0.719 0.647 0.448 6e-44
Q9H2G4 693 Testis-specific Y-encoded no no 0.730 0.274 0.368 8e-35
Q7TQI8 677 Testis-specific Y-encoded no no 0.8 0.307 0.330 2e-34
Q9BE64 695 Testis-specific Y-encoded N/A no 0.730 0.273 0.364 3e-34
Q8N831410 Testis-specific Y-encoded no no 0.7 0.443 0.371 1e-33
Q9UJ04414 Testis-specific Y-encoded no no 0.703 0.442 0.357 3e-32
O88852379 Testis-specific Y-encoded no no 0.692 0.474 0.372 3e-32
>sp|P53997|SET_DROME Protein SET OS=Drosophila melanogaster GN=Set PE=1 SV=2 Back     alignment and function desciption
 Score =  181 bits (458), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 136/200 (68%), Gaps = 12/200 (6%)

Query: 26  SIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIKSIPDFWLTAFISHP 85
           ++E++   Q+E++ +NE+ASE++L+VEQKY+++RKP Y+KR++++K IP+FW+T+FI+HP
Sbjct: 32  ALEQIDACQNEIDALNEKASEEILKVEQKYNKLRKPCYEKRSELVKRIPNFWVTSFINHP 91

Query: 86  ALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDG 145
            +  +L EE+++    L+ LEVE+F+D+KSGY I F+F  NPYFE+  LTK F       
Sbjct: 92  QVSGILDEEEEECLHALNKLEVEEFEDIKSGYRINFHFDENPYFENKVLTKEFHLNSAAA 151

Query: 146 S-----MKITATSIKWKEGMGIPNGVNHEKKGNKRPLAEE--SFFTWFSDTQEKDTIDGI 198
           S        T+T IKWKEG  +   +  +  GNK+    E  +FF WFS     D  D +
Sbjct: 152 SENGDWPASTSTPIKWKEGKNLLKLLLTKPYGNKKKRNSEYKTFFDWFS-----DNTDPV 206

Query: 199 QDEVAEIIKEDLWPNPLTYF 218
            DE+AE+IK+DLWPNPL Y+
Sbjct: 207 NDEIAELIKDDLWPNPLQYY 226





Drosophila melanogaster (taxid: 7227)
>sp|Q01105|SET_HUMAN Protein SET OS=Homo sapiens GN=SET PE=1 SV=3 Back     alignment and function description
>sp|Q9EQU5|SET_MOUSE Protein SET OS=Mus musculus GN=Set PE=1 SV=1 Back     alignment and function description
>sp|Q63945|SET_RAT Protein SET OS=Rattus norvegicus GN=Set PE=1 SV=2 Back     alignment and function description
>sp|Q9H2G4|TSYL2_HUMAN Testis-specific Y-encoded-like protein 2 OS=Homo sapiens GN=TSPYL2 PE=1 SV=1 Back     alignment and function description
>sp|Q7TQI8|TSYL2_MOUSE Testis-specific Y-encoded-like protein 2 OS=Mus musculus GN=Tspyl2 PE=1 SV=1 Back     alignment and function description
>sp|Q9BE64|TSYL2_MACFA Testis-specific Y-encoded-like protein 2 OS=Macaca fascicularis GN=TSPYL2 PE=1 SV=1 Back     alignment and function description
>sp|Q8N831|TSYL6_HUMAN Testis-specific Y-encoded-like protein 6 OS=Homo sapiens GN=TSPYL6 PE=2 SV=1 Back     alignment and function description
>sp|Q9UJ04|TSYL4_HUMAN Testis-specific Y-encoded-like protein 4 OS=Homo sapiens GN=TSPYL4 PE=2 SV=2 Back     alignment and function description
>sp|O88852|TSYL1_MOUSE Testis-specific Y-encoded-like protein 1 OS=Mus musculus GN=Tspyl1 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query260
225456743268 PREDICTED: protein SET [Vitis vinifera] 0.896 0.869 0.804 1e-105
26451105228 unknown protein [Arabidopsis thaliana] g 0.869 0.991 0.794 2e-97
6730705255 Putative phospatase 2A inhibitor [Arabid 0.842 0.858 0.800 5e-96
356516792263 PREDICTED: protein SET-like isoform 1 [G 0.842 0.832 0.797 6e-96
225452785255 PREDICTED: protein SET [Vitis vinifera] 0.838 0.854 0.790 6e-96
356508564261 PREDICTED: protein SET-like [Glycine max 0.842 0.839 0.792 8e-96
296082895253 unnamed protein product [Vitis vinifera] 0.838 0.861 0.790 1e-95
255638203263 unknown [Glycine max] 0.842 0.832 0.792 5e-95
18394656256 template-activating factor I [Arabidopsi 0.842 0.855 0.797 1e-94
224119088242 nucleosome/chromatin assembly factor gro 0.884 0.950 0.764 1e-94
>gi|225456743|ref|XP_002275632.1| PREDICTED: protein SET [Vitis vinifera] gi|297733991|emb|CBI15238.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 194/241 (80%), Positives = 219/241 (90%), Gaps = 8/241 (3%)

Query: 1   MVADKGKKTKV----EEENAEQIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYS 56
           MVADKGKK K     EE N++ ID +LVLSIEKLQEIQD+LEKINEEAS+KVLEVEQKY+
Sbjct: 1   MVADKGKKLKQSEKEEEVNSDHIDGDLVLSIEKLQEIQDDLEKINEEASDKVLEVEQKYN 60

Query: 57  EIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSG 116
           EIRKP+YDKRNDIIKSIPDFWLTAF+SHPALG+LLSEEDQKIF+YLSSLEVEDFKDVKSG
Sbjct: 61  EIRKPIYDKRNDIIKSIPDFWLTAFLSHPALGDLLSEEDQKIFKYLSSLEVEDFKDVKSG 120

Query: 117 YSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIPNGVNHEKKGNKRP 176
           YSITFNF+PNP+FED KL KTFTFL D+G  KI+ TSIKWK+GMGIPNGVNHEKKGNKRP
Sbjct: 121 YSITFNFNPNPFFEDTKLKKTFTFL-DEGITKISVTSIKWKDGMGIPNGVNHEKKGNKRP 179

Query: 177 LAEESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNNEADEEEFEG---DEEG 233
           +A+ SFF+WFS+TQ+KD +D I DE+AEIIKEDLWPNPLTYFN+EADEE+F+G   DEEG
Sbjct: 180 IADASFFSWFSETQQKDIMDDIHDEIAEIIKEDLWPNPLTYFNSEADEEDFDGEDADEEG 239

Query: 234 K 234
           K
Sbjct: 240 K 240




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|26451105|dbj|BAC42657.1| unknown protein [Arabidopsis thaliana] gi|28950777|gb|AAO63312.1| At1g18800 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|6730705|gb|AAF27100.1|AC011809_9 Putative phospatase 2A inhibitor [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|356516792|ref|XP_003527077.1| PREDICTED: protein SET-like isoform 1 [Glycine max] Back     alignment and taxonomy information
>gi|225452785|ref|XP_002283325.1| PREDICTED: protein SET [Vitis vinifera] Back     alignment and taxonomy information
>gi|356508564|ref|XP_003523025.1| PREDICTED: protein SET-like [Glycine max] Back     alignment and taxonomy information
>gi|296082895|emb|CBI22196.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|255638203|gb|ACU19415.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|18394656|ref|NP_564063.1| template-activating factor I [Arabidopsis thaliana] gi|21555241|gb|AAM63812.1| putative SET protein, phospatase 2A inhibitor [Arabidopsis thaliana] gi|332191644|gb|AEE29765.1| template-activating factor I [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|224119088|ref|XP_002317982.1| nucleosome/chromatin assembly factor group [Populus trichocarpa] gi|222858655|gb|EEE96202.1| nucleosome/chromatin assembly factor group [Populus trichocarpa] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query260
TAIR|locus:2034995256 NRP2 "AT1G18800" [Arabidopsis 0.842 0.855 0.797 5.5e-93
TAIR|locus:2019075264 NRP1 "AT1G74560" [Arabidopsis 0.807 0.795 0.760 3.4e-84
ZFIN|ZDB-GENE-030131-433275 setb "SET translocation (myelo 0.757 0.716 0.444 8.8e-47
UNIPROTKB|F1RR69289 SET "Uncharacterized protein" 0.773 0.695 0.433 3e-46
UNIPROTKB|F2Z4L4277 SET "Uncharacterized protein" 0.757 0.711 0.435 6.2e-46
FB|FBgn0014879269 Set "Set" [Drosophila melanoga 0.773 0.747 0.437 7.9e-46
UNIPROTKB|Q5VXV2268 SET "Protein SET" [Homo sapien 0.746 0.723 0.438 7.9e-46
MGI|MGI:1860267289 Set "SET nuclear oncogene" [Mu 0.780 0.702 0.425 1e-45
RGD|1307467289 Set "SET nuclear oncogene" [Ra 0.780 0.702 0.425 1e-45
UNIPROTKB|Q01105290 SET "Protein SET" [Homo sapien 0.719 0.644 0.448 1.3e-45
TAIR|locus:2034995 NRP2 "AT1G18800" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 926 (331.0 bits), Expect = 5.5e-93, P = 5.5e-93
 Identities = 177/222 (79%), Positives = 200/222 (90%)

Query:     1 MVADKGKKTKVEEENAEQIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRK 60
             MV DK KK K EEEN EQID+ELVLSIEKLQEIQD+LEKINE+AS++VLEVEQKY+ IRK
Sbjct:     1 MVTDKSKKAKTEEENVEQIDAELVLSIEKLQEIQDDLEKINEKASDEVLEVEQKYNVIRK 60

Query:    61 PVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSIT 120
             PVYDKRN+IIK+IPDFWLTAF+SHPALGELL+EEDQKIF+YLSSL+VED KDVKSGYSIT
Sbjct:    61 PVYDKRNEIIKTIPDFWLTAFLSHPALGELLTEEDQKIFKYLSSLDVEDAKDVKSGYSIT 120

Query:   121 FNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIPNGVNHEKKGNKRPLAEE 180
             F+F+PNP+FED KLTKTFTFL++ G+ KITAT IKWKEG G+ NGVNHEK GNKR L EE
Sbjct:   121 FSFNPNPFFEDGKLTKTFTFLEE-GTTKITATPIKWKEGKGLANGVNHEKNGNKRALPEE 179

Query:   181 SFFTWFSDTQEKDTI-DGIQDE-VAEIIKEDLWPNPLTYFNN 220
             SFFTWFSD Q K+ + D +QDE VA+IIKEDLWPNPLTYFNN
Sbjct:   180 SFFTWFSDAQHKEDVEDEMQDEQVADIIKEDLWPNPLTYFNN 221




GO:0003677 "DNA binding" evidence=ISS
GO:0005634 "nucleus" evidence=ISM;IEA;ISS;IDA
GO:0006334 "nucleosome assembly" evidence=IEA;ISS
GO:0003682 "chromatin binding" evidence=IPI
GO:0005737 "cytoplasm" evidence=IDA
GO:0008283 "cell proliferation" evidence=IGI
GO:0010311 "lateral root formation" evidence=IGI
GO:0030154 "cell differentiation" evidence=IGI
GO:0042393 "histone binding" evidence=IPI
TAIR|locus:2019075 NRP1 "AT1G74560" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-030131-433 setb "SET translocation (myeloid leukemia-associated) B" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
UNIPROTKB|F1RR69 SET "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
UNIPROTKB|F2Z4L4 SET "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
FB|FBgn0014879 Set "Set" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
UNIPROTKB|Q5VXV2 SET "Protein SET" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
MGI|MGI:1860267 Set "SET nuclear oncogene" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
RGD|1307467 Set "SET nuclear oncogene" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|Q01105 SET "Protein SET" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00020937001
SubName- Full=Chromosome chr14 scaffold_21, whole genome shotgun sequence; (255 aa)
(Vitis vinifera)
Predicted Functional Partners:
GSVIVG00017697001
SubName- Full=Chromosome chr17 scaffold_16, whole genome shotgun sequence; (86 aa)
       0.608
GSVIVG00028481001
SubName- Full=Chromosome chr7 scaffold_44, whole genome shotgun sequence; (316 aa)
       0.595
GSVIVG00021672001
SubName- Full=Chromosome chr8 scaffold_23, whole genome shotgun sequence; (315 aa)
       0.544
GSVIVG00017910001
SubName- Full=Chromosome chr17 scaffold_16, whole genome shotgun sequence; (481 aa)
       0.516
GSVIVG00014203001
SubName- Full=Chromosome chr16 scaffold_10, whole genome shotgun sequence; (151 aa)
       0.503
GSVIVG00000534001
SubName- Full=Chromosome chr2 scaffold_105, whole genome shotgun sequence; (338 aa)
       0.501
GSVIVG00005498001
SubName- Full=Putative uncharacterized protein (Chromosome chr13 scaffold_152, whole genome sho [...] (293 aa)
       0.496
GSVIVG00021307001
SubName- Full=Chromosome chr8 scaffold_23, whole genome shotgun sequence; (586 aa)
       0.488
GSVIVG00002607001
SubName- Full=Chromosome undetermined scaffold_133, whole genome shotgun sequence; (114 aa)
      0.473
GSVIVG00014439001
RecName- Full=Diphthine synthase; EC=2.1.1.98;; Required for the methylation step in diphthamid [...] (285 aa)
       0.471

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query260
pfam00956236 pfam00956, NAP, Nucleosome assembly protein (NAP) 2e-49
PTZ00008185 PTZ00008, PTZ00008, (NAP-S) nucleosome assembly pr 3e-23
PTZ00007337 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly pr 1e-22
pfam10446 449 pfam10446, DUF2457, Protein of unknown function (D 2e-04
PRK12298390 PRK12298, obgE, GTPase CgtA; Reviewed 7e-04
pfam03153332 pfam03153, TFIIA, Transcription factor IIA, alpha/ 8e-04
pfam04147 809 pfam04147, Nop14, Nop14-like family 0.001
pfam10446 449 pfam10446, DUF2457, Protein of unknown function (D 0.002
pfam03153332 pfam03153, TFIIA, Transcription factor IIA, alpha/ 0.002
pfam04546 211 pfam04546, Sigma70_ner, Sigma-70, non-essential re 0.002
pfam10446 449 pfam10446, DUF2457, Protein of unknown function (D 0.003
pfam02724 583 pfam02724, CDC45, CDC45-like protein 0.004
>gnl|CDD|216213 pfam00956, NAP, Nucleosome assembly protein (NAP) Back     alignment and domain information
 Score =  162 bits (411), Expect = 2e-49
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 37/234 (15%)

Query: 26  SIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDII--------------- 70
            +E L+ +Q EL+++  +  E+VLE+E+KY ++ +P+YDKR +II               
Sbjct: 3   RVEALKALQKELDELEAKFQEEVLELERKYDKLYQPLYDKRREIINGAREPTEVEEEEPE 62

Query: 71  -KSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYF 129
            K IP FWLTA  +HP L E+++E D++  +YL+ + VE  +D K G+ + F+F+PNPYF
Sbjct: 63  EKGIPGFWLTALKNHPLLSEMITERDEEALKYLTDIRVEYLEDPKKGFKLIFHFAPNPYF 122

Query: 130 EDNKLTKTFTFLDDDGS--MKITATSIKWKEGM-----GIPNGVNHEKKGNKRPLAE--- 179
            +  LTKT+   D+     +K   T I+WKEG       +     ++K G  R + +   
Sbjct: 123 TNEVLTKTYHLKDEGDPFELKSEGTPIEWKEGKNLTVKTVKKKQRNKKTGQTRTITKTVP 182

Query: 180 -ESFFTWFSDTQEKDTIDG----------IQDEVAEIIKEDLWPNPLTYFNNEA 222
            ESFF +FS  +  D  D           +  E+ EIIK+DL P  L Y+  EA
Sbjct: 183 AESFFNFFSPPKVPDDDDDDDEELEEELELDYEIGEIIKDDLIPRALDYYTGEA 236


NAP proteins are involved in moving histones into the nucleus, nucleosome assembly and chromatin fluidity. They affect the transcription of many genes. Length = 236

>gnl|CDD|185394 PTZ00008, PTZ00008, (NAP-S) nucleosome assembly protein-S; Provisional Back     alignment and domain information
>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L; Provisional Back     alignment and domain information
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457) Back     alignment and domain information
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed Back     alignment and domain information
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit Back     alignment and domain information
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family Back     alignment and domain information
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457) Back     alignment and domain information
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit Back     alignment and domain information
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region Back     alignment and domain information
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457) Back     alignment and domain information
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 260
PTZ00007337 (NAP-L) nucleosome assembly protein -L; Provisiona 100.0
KOG1507358 consensus Nucleosome assembly protein NAP-1 [Chrom 100.0
PTZ00008185 (NAP-S) nucleosome assembly protein-S; Provisional 100.0
PF00956244 NAP: Nucleosome assembly protein (NAP); InterPro: 100.0
KOG1508260 consensus DNA replication factor/protein phosphata 100.0
PF1162949 Mst1_SARAH: C terminal SARAH domain of Mst1; Inter 94.71
PF09026101 CENP-B_dimeris: Centromere protein B dimerisation 92.24
PF07352149 Phage_Mu_Gam: Bacteriophage Mu Gam like protein; I 91.9
KOG1508260 consensus DNA replication factor/protein phosphata 89.37
COG4396170 Mu-like prophage host-nuclease inhibitor protein G 88.52
PF06524314 NOA36: NOA36 protein; InterPro: IPR010531 This fam 85.47
PF04931 784 DNA_pol_phi: DNA polymerase phi; InterPro: IPR0070 82.88
>PTZ00007 (NAP-L) nucleosome assembly protein -L; Provisional Back     alignment and domain information
Probab=100.00  E-value=5.1e-60  Score=437.52  Aligned_cols=217  Identities=30%  Similarity=0.591  Sum_probs=189.7

Q ss_pred             hhhhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHhhhhhhh----------cchhhHHHHH
Q 024913           12 EEENAEQIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIK----------SIPDFWLTAF   81 (260)
Q Consensus        12 ~~e~~~~~~~~v~~~i~~L~~lQ~e~~~le~~~~~e~~~le~ky~k~~~ply~kR~eiI~----------~IP~FW~~vl   81 (260)
                      .++++..||+.++.++.+|+.||.++..|+.++++++++|+++|.++++|+|++|++||+          |||+||++||
T Consensus        28 ~~~~i~~Lp~~~~~rv~aL~~lQ~e~~~le~ef~~ev~~LE~kY~~~~~Ply~kR~eII~G~~~~e~~~~gIP~FWl~vL  107 (337)
T PTZ00007         28 DDEKLSHLTDEQRETLKKLQLLQKEFDDLEVEYNAELRKLRSKYEDLYNPIYDKRKEALVQNGGAEIGTPGLPQFWLTAM  107 (337)
T ss_pred             ccchhhhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHcCCcccccccCCcccHHHHHH
Confidence            346677999999999999999999999999999999999999999999999999999999          6999999999


Q ss_pred             hhhhhhhcccChhhHHhhcCcceeEEEEccCCC-cceEEEEEecCCCcccCCeEEEEEEeeCCC---CC--ceeeecccc
Q 024913           82 ISHPALGELLSEEDQKIFRYLSSLEVEDFKDVK-SGYSITFNFSPNPYFEDNKLTKTFTFLDDD---GS--MKITATSIK  155 (260)
Q Consensus        82 ~n~~~l~~~i~~~D~~iL~~L~dI~Ve~~~d~~-~~f~i~F~F~~NpyF~N~~LtK~~~~~~~~---g~--~~~~~t~I~  155 (260)
                      +||+.|+.+|+++|++||+||++|+|++..+.. +||+|+|+|++||||+|++|||+|++....   |+  ..+++|+|+
T Consensus       108 ~Nh~~ls~~I~e~De~iL~~L~dI~ve~~~~~~~~gf~I~F~F~~NpyF~N~vLtK~y~~~~~d~~~~p~~~~~~~t~I~  187 (337)
T PTZ00007        108 KNNNTLGSAIEEHDEPILSYLSDISCEYTEPNKQEGFILVFTFAPNPFFSNTVLTKTYHMKVLDGDDEPLLSNTVATEID  187 (337)
T ss_pred             HcCccHhhhCCHHHHHHHHhhCceEEEEccCCCCCceEEEEEeCCCCCCCCCeEEEEEEeecCCCCCCceeecceeeece
Confidence            999999999999999999999999999876544 899999999999999999999999997422   22  246899999


Q ss_pred             ccCCCCCCCccccccCCCC-----c----cccccccccccccccCCCccc----------------cchHHHHHHHhhcc
Q 024913          156 WKEGMGIPNGVNHEKKGNK-----R----PLAEESFFTWFSDTQEKDTID----------------GIQDEVAEIIKEDL  210 (260)
Q Consensus       156 Wk~gk~~t~~~~~~k~~~~-----r----~~~~~SFF~~F~~~~~~~~~e----------------~~~~ei~~~i~d~i  210 (260)
                      ||+|++||++..++|++++     |    +++..|||+||+++..+...+                +.+++||++|+++|
T Consensus       188 WK~GkdlT~k~v~kKqr~K~~~~~r~v~~~~~~~SFFnfF~p~~~p~~~~~e~~~e~~~ee~~~~l~~DyeiG~~ikd~I  267 (337)
T PTZ00007        188 WKQGKDVTKKVVTKKQRHKKTKETRTVTETVDRESFFNFFTSHEVPSDEELEKMSKHEIAELEMIVETDYEIGITIRDKL  267 (337)
T ss_pred             eeCCCCchhhhcccccccccCCCceeeccCCCCCChHHhcCCCCCCcccccccccchhHHHHHHHHHHhHHHHHHHHHhc
Confidence            9999999998766544333     2    356799999999987654210                24679999999999


Q ss_pred             ccchhhhcccCCCccccc
Q 024913          211 WPNPLTYFNNEADEEEFE  228 (260)
Q Consensus       211 ~p~al~yy~~~~~~~e~~  228 (260)
                      ||+||.||+|++.+++.+
T Consensus       268 IP~AV~yftGea~d~~~~  285 (337)
T PTZ00007        268 IPYAVYWFLGEAIDEDSD  285 (337)
T ss_pred             ccccHHhhCCCccccccc
Confidence            999999999998776654



>KOG1507 consensus Nucleosome assembly protein NAP-1 [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information
>PTZ00008 (NAP-S) nucleosome assembly protein-S; Provisional Back     alignment and domain information
>PF00956 NAP: Nucleosome assembly protein (NAP); InterPro: IPR002164 It is thought that NAPs act as histone chaperones, shuttling both core and linker histones from their site of synthesis in the cytoplasm to the nucleus Back     alignment and domain information
>KOG1508 consensus DNA replication factor/protein phosphatase inhibitor SET/SPR-2 [Replication, recombination and repair] Back     alignment and domain information
>PF11629 Mst1_SARAH: C terminal SARAH domain of Mst1; InterPro: IPR024205 The SARAH (Sav/Rassf/Hpo) domain is found at the C terminus in three classes of eukaryotic tumour suppressors that give the domain its name Back     alignment and domain information
>PF09026 CENP-B_dimeris: Centromere protein B dimerisation domain; InterPro: IPR015115 Centromere protein B (CENP-B) interacts with centromeric heterochromatin in chromosomes and binds to a specific subset of alphoid satellite DNA, called the CENP-B box Back     alignment and domain information
>PF07352 Phage_Mu_Gam: Bacteriophage Mu Gam like protein; InterPro: IPR009951 The Gam protein, originally characterised in Bacteriophage Mu, protects linear double stranded DNA from exonuclease degradation in vitro and in vivo [] Back     alignment and domain information
>KOG1508 consensus DNA replication factor/protein phosphatase inhibitor SET/SPR-2 [Replication, recombination and repair] Back     alignment and domain information
>COG4396 Mu-like prophage host-nuclease inhibitor protein Gam [General function prediction only] Back     alignment and domain information
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues Back     alignment and domain information
>PF04931 DNA_pol_phi: DNA polymerase phi; InterPro: IPR007015 Proteins of this family are predominantly nucleolar Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query260
2e50_A225 Crystal Structure Of SetTAF-1betaINHAT Length = 225 4e-46
3kyp_A193 Crystal Structure Of Nucleosome Assembly Protein S 4e-24
3hfd_A276 Nucleosome Assembly Protein 1 From Plasmodium Knowl 8e-14
3gyv_A249 Crystal Structure Of Nucleosome Assembly Protein Fr 4e-12
3fs3_A359 Crystal Structure Of Malaria Parasite Nucleosome As 5e-12
2ayu_A417 The Structure Of Nucleosome Assembly Protein Sugges 7e-09
2z2r_A292 Nucleosome Assembly Proteins I (Nap-1, 74-365) Leng 7e-09
3q33_B232 Structure Of The Rtt109-AccoaVPS75 COMPLEX AND IMPL 2e-04
2zd7_A264 The Structure Of Vps75 (Vacuolar Protein Sorting-As 2e-04
3c9d_A259 Crystal Structure Of Vps75 Length = 259 2e-04
3c9b_A259 Crystal Structure Of Semet Vps75 Length = 259 2e-04
3dm7_A234 Crystal Structure Of The Vps75 Histone Chaperone Le 3e-04
>pdb|2E50|A Chain A, Crystal Structure Of SetTAF-1betaINHAT Length = 225 Back     alignment and structure

Iteration: 1

Score = 181 bits (458), Expect = 4e-46, Method: Compositional matrix adjust. Identities = 88/196 (44%), Positives = 133/196 (67%), Gaps = 9/196 (4%) Query: 26 SIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIKSIPDFWLTAFISHP 85 +IE + E+Q+E++++NE+ASE++L+VEQKY+++R+P + KR+++I IP+FW+T F++HP Sbjct: 31 AIEHIDEVQNEIDRLNEQASEEILKVEQKYNKLRQPFFQKRSELIAKIPNFWVTTFVNHP 90 Query: 86 ALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDG 145 + LL EED++ YL+ +EV +F+D+KSGY I F F NPYFE+ L+K F +++ G Sbjct: 91 QVSALLGEEDEEAMHYLTRVEVTEFEDIKSGYRIDFYFDENPYFENKVLSKEF-HMNESG 149 Query: 146 SMKITATSIKWKEG--MGIPNGVNHEKKGNKRPLAE-ESFFTWFSDTQEKDTIDGIQDEV 202 +T IKWK G M + K KR E ESFFTWF+D D DE+ Sbjct: 150 DPSSKSTEIKWKSGKDMTKRSSQTQNKASRKRQHEEPESFFTWFTDHS-----DAGADEL 204 Query: 203 AEIIKEDLWPNPLTYF 218 E+IK+D+WPNPL Y+ Sbjct: 205 GEVIKDDIWPNPLQYY 220
>pdb|3KYP|A Chain A, Crystal Structure Of Nucleosome Assembly Protein S (Pfnaps) Plasmodium Falciparum Length = 193 Back     alignment and structure
>pdb|3HFD|A Chain A, Nucleosome Assembly Protein 1 From Plasmodium Knowlesi Length = 276 Back     alignment and structure
>pdb|3GYV|A Chain A, Crystal Structure Of Nucleosome Assembly Protein From Plasmodium Falciparum Length = 249 Back     alignment and structure
>pdb|3FS3|A Chain A, Crystal Structure Of Malaria Parasite Nucleosome Assembly Protein (Nap) Length = 359 Back     alignment and structure
>pdb|2AYU|A Chain A, The Structure Of Nucleosome Assembly Protein Suggests A Mechanism For Histone Binding And Shuttling Length = 417 Back     alignment and structure
>pdb|2Z2R|A Chain A, Nucleosome Assembly Proteins I (Nap-1, 74-365) Length = 292 Back     alignment and structure
>pdb|3Q33|B Chain B, Structure Of The Rtt109-AccoaVPS75 COMPLEX AND IMPLICATIONS FOR Chaperone-Mediated Histone Acetylation Length = 232 Back     alignment and structure
>pdb|2ZD7|A Chain A, The Structure Of Vps75 (Vacuolar Protein Sorting-Associated Protein 75) Length = 264 Back     alignment and structure
>pdb|3C9D|A Chain A, Crystal Structure Of Vps75 Length = 259 Back     alignment and structure
>pdb|3C9B|A Chain A, Crystal Structure Of Semet Vps75 Length = 259 Back     alignment and structure
>pdb|3DM7|A Chain A, Crystal Structure Of The Vps75 Histone Chaperone Length = 234 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query260
2e50_A225 Protein SET; histone chaperone, inhat, PP2AI, prot 8e-52
3kyp_A193 Pfnaps, nucleosome assembly protein; histone recog 1e-46
2zd7_A264 VPS75, vacuolar protein sorting-associated protein 2e-43
2ayu_A417 Nucleosome assembly protein; histone chaperone; 3. 3e-34
3fs3_A359 Nucleosome assembly protein 1, putative; protein l 8e-31
>2e50_A Protein SET; histone chaperone, inhat, PP2AI, protein binding; HET: TRE; 2.30A {Homo sapiens} SCOP: d.305.1.1 Length = 225 Back     alignment and structure
 Score =  167 bits (424), Expect = 8e-52
 Identities = 87/226 (38%), Positives = 139/226 (61%), Gaps = 12/226 (5%)

Query: 3   ADKGKKTKVEEENAEQIDS---ELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIR 59
           A K  K ++   +    ++   E   +IE + E+Q+E++++NE+ASE++L+VEQKY+++R
Sbjct: 5   AAKVSKKELNSNHDGADETSEKEQQEAIEHIDEVQNEIDRLNEQASEEILKVEQKYNKLR 64

Query: 60  KPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSI 119
           +P + KR+++I  IP+FW+T F++HP +  LL EED++   YL+ +EV +F+D+KSGY I
Sbjct: 65  QPFFQKRSELIAKIPNFWVTTFVNHPQVSALLGEEDEEAMHYLTRVEVTEFEDIKSGYRI 124

Query: 120 TFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIPNGVNHEKKGNKR---P 176
            F F  NPYFE+  L+K F   ++ G     +T IKWK G  +    +  +    R    
Sbjct: 125 DFYFDENPYFENKVLSKEFHM-NESGDPSSKSTEIKWKSGKDMTKRSSQTQNKASRKRQH 183

Query: 177 LAEESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNNEA 222
              ESFFTWF+D  +        DE+ E+IK+D+WPNPL Y+    
Sbjct: 184 EEPESFFTWFTDHSDAG-----ADELGEVIKDDIWPNPLQYYLVPD 224


>3kyp_A Pfnaps, nucleosome assembly protein; histone recognition, chaperone; 2.80A {Plasmodium falciparum} Length = 193 Back     alignment and structure
>2zd7_A VPS75, vacuolar protein sorting-associated protein 75; histone chaperone, VPS75, NAP1, nucleus, phosphoprotein; 1.85A {Saccharomyces cerevisiae} PDB: 3q66_A* 3q68_A* 3c9d_A 3c9b_A 3q33_B* 3q35_B* 3dm7_A Length = 264 Back     alignment and structure
>2ayu_A Nucleosome assembly protein; histone chaperone; 3.00A {Saccharomyces cerevisiae} SCOP: d.305.1.1 PDB: 2z2r_A Length = 417 Back     alignment and structure
>3fs3_A Nucleosome assembly protein 1, putative; protein localization, histone recognition, structural analysis, CHA; 2.30A {Plasmodium falciparum} PDB: 3hfd_A 3gyw_A 3gyv_A Length = 359 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query260
2e50_A225 Protein SET; histone chaperone, inhat, PP2AI, prot 100.0
2ayu_A417 Nucleosome assembly protein; histone chaperone; 3. 100.0
3fs3_A359 Nucleosome assembly protein 1, putative; protein l 100.0
2zd7_A264 VPS75, vacuolar protein sorting-associated protein 100.0
3kyp_A193 Pfnaps, nucleosome assembly protein; histone recog 100.0
2p2u_A171 HOST-nuclease inhibitor protein GAM, putative; str 93.48
2jo8_A51 Serine/threonine-protein kinase 4; C-terminal doma 92.88
>2e50_A Protein SET; histone chaperone, inhat, PP2AI, protein binding; HET: TRE; 2.30A {Homo sapiens} SCOP: d.305.1.1 Back     alignment and structure
Probab=100.00  E-value=9.9e-62  Score=428.08  Aligned_cols=214  Identities=43%  Similarity=0.831  Sum_probs=174.6

Q ss_pred             CCCCccccchhhhh-----hhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHhhhhhhhcchh
Q 024913            1 MVADKGKKTKVEEE-----NAEQIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIKSIPD   75 (260)
Q Consensus         1 ~~~~~~~~~~~~~e-----~~~~~~~~v~~~i~~L~~lQ~e~~~le~~~~~e~~~le~ky~k~~~ply~kR~eiI~~IP~   75 (260)
                      |++|++||+|+..+     ....++++++++|.+|+.||.+++.+++++++++++|+++|+++++|+|++|++||++||+
T Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~~iQ~e~~~l~~e~~~ev~~lE~ky~~~~~Ply~kR~eII~~IP~   80 (225)
T 2e50_A            1 MSAQAAKVSKKELNSNHDGADETSEKEQQEAIEHIDEVQNEIDRLNEQASEEILKVEQKYNKLRQPFFQKRSELIAKIPN   80 (225)
T ss_dssp             CHHHHHHHHHHHTTC-----CTTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTSTT
T ss_pred             CCCCcccccccccccccccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHhcccc
Confidence            88999999999764     3457889999999999999999999999999999999999999999999999999999999


Q ss_pred             hHHHHHhhhhhhhcccChhhHHhhcCcceeEEEEccCCCcceEEEEEecCCCcccCCeEEEEEEeeCCCCCceeeecccc
Q 024913           76 FWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIK  155 (260)
Q Consensus        76 FW~~vl~n~~~l~~~i~~~D~~iL~~L~dI~Ve~~~d~~~~f~i~F~F~~NpyF~N~~LtK~~~~~~~~g~~~~~~t~I~  155 (260)
                      ||++||+||+.|+.+|++.|++||+||++|+|++..++..||+|+|+|++||||+|++|||+|++.. +|.+++++|+|+
T Consensus        81 FW~tal~n~~~l~~~i~e~De~iL~~L~dI~v~~~~d~~~gf~i~F~F~~N~yF~N~vLtK~y~~~~-~g~~~s~~t~I~  159 (225)
T 2e50_A           81 FWVTTFVNHPQVSALLGEEDEEAMHYLTRVEVTEFEDIKSGYRIDFYFDENPYFENKVLSKEFHMNE-SGDPSSKSTEIK  159 (225)
T ss_dssp             HHHHHHHTSHHHHTTCCHHHHHHGGGEEEEEEEECCSSCCCEEEEEEECSCSSBSCSEEEEEEC--------CEEECCCC
T ss_pred             HHHHHHhcChhhhhhccHhHHHHHHhcCeeEEEEccCCCCceEEEEEeCCCCCccCCEEEEEEEecC-CCCcccCCCcce
Confidence            9999999999999999999999999999999999988789999999999999999999999999998 888899999999


Q ss_pred             ccCCCCCCCcccccc--CCCCcc-ccccccccccccccCCCccccchHHHHHHHhhccccchhhhccc
Q 024913          156 WKEGMGIPNGVNHEK--KGNKRP-LAEESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNN  220 (260)
Q Consensus       156 Wk~gk~~t~~~~~~k--~~~~r~-~~~~SFF~~F~~~~~~~~~e~~~~ei~~~i~d~i~p~al~yy~~  220 (260)
                      ||+|+++|++..+++  +|++|. .+..|||+||+++..+     ..++||++|+++|||+||+||+.
T Consensus       160 Wk~gkd~t~~~~~kk~~~~~~r~~~~~~SFF~fF~~~~~~-----~~~eige~ikd~i~P~av~yy~~  222 (225)
T 2e50_A          160 WKSGKDMTKRSSQTQNKASRKRQHEEPESFFTWFTDHSDA-----GADELGEVIKDDIWPNPLQYYLV  222 (225)
T ss_dssp             BCSSCCC----------------------CGGGGC-----------CHHHHHHHHHTTTTCSHHHHCC
T ss_pred             ecCCCCccchhhhhcccccCcccCCCCCCcceecCCCCCC-----chhHHHHHHHhCccccHHHHhcC
Confidence            999999998865542  344454 3468999999998755     35899999999999999999984



>2ayu_A Nucleosome assembly protein; histone chaperone; 3.00A {Saccharomyces cerevisiae} SCOP: d.305.1.1 PDB: 2z2r_A Back     alignment and structure
>3fs3_A Nucleosome assembly protein 1, putative; protein localization, histone recognition, structural analysis, CHA; 2.30A {Plasmodium falciparum} PDB: 3hfd_A 3gyw_A 3gyv_A Back     alignment and structure
>2zd7_A VPS75, vacuolar protein sorting-associated protein 75; histone chaperone, VPS75, NAP1, nucleus, phosphoprotein; 1.85A {Saccharomyces cerevisiae} PDB: 3q66_A* 3q68_A* 3c9d_A 3c9b_A 3q33_B* 3q35_B* 3dm7_A Back     alignment and structure
>3kyp_A Pfnaps, nucleosome assembly protein; histone recognition, chaperone; 2.80A {Plasmodium falciparum} Back     alignment and structure
>2p2u_A HOST-nuclease inhibitor protein GAM, putative; structural genomics, unknown function, PSI-2, protein structure initiative; 2.75A {Desulfovibrio vulgaris} SCOP: h.4.18.1 Back     alignment and structure
>2jo8_A Serine/threonine-protein kinase 4; C-terminal domain, human mammalian sterIle 20-like kinase 1, dimer, transferase; NMR {Homo sapiens} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 260
d2e50a1222 d.305.1.1 (A:1-222) Protein SET {Human (Homo sapie 3e-56
d2ayua1301 d.305.1.1 (A:70-370) Nucleosome assembly protein, 3e-37
>d2e50a1 d.305.1.1 (A:1-222) Protein SET {Human (Homo sapiens) [TaxId: 9606]} Length = 222 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: NAP-like
superfamily: NAP-like
family: NAP-like
domain: Protein SET
species: Human (Homo sapiens) [TaxId: 9606]
 Score =  177 bits (450), Expect = 3e-56
 Identities = 87/224 (38%), Positives = 140/224 (62%), Gaps = 12/224 (5%)

Query: 1   MVADKGKKTKVEEENAEQIDS---ELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSE 57
             A K  K ++   +    ++   E   +IE + E+Q+E++++NE+ASE++L+VEQKY++
Sbjct: 3   AQAAKVSKKELNSNHDGADETSEKEQQEAIEHIDEVQNEIDRLNEQASEEILKVEQKYNK 62

Query: 58  IRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGY 117
           +R+P + KR+++I  IP+FW+T F++HP +  LL EED++   YL+ +EV +F+D+KSGY
Sbjct: 63  LRQPFFQKRSELIAKIPNFWVTTFVNHPQVSALLGEEDEEAMHYLTRVEVTEFEDIKSGY 122

Query: 118 SITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIPNGVNHEKKGNKRPL 177
            I F F  NPYFE+  L+K F  +++ G     +T IKWK G  +    +  +    R  
Sbjct: 123 RIDFYFDENPYFENKVLSKEF-HMNESGDPSSKSTEIKWKSGKDMTKRSSQTQNKASRKR 181

Query: 178 AE---ESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYF 218
                ESFFTWF+D  +        DE+ E+IK+D+WPNPL Y+
Sbjct: 182 QHEEPESFFTWFTDHSDAGA-----DELGEVIKDDIWPNPLQYY 220


>d2ayua1 d.305.1.1 (A:70-370) Nucleosome assembly protein, NAP {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Length = 301 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query260
d2e50a1222 Protein SET {Human (Homo sapiens) [TaxId: 9606]} 100.0
d2ayua1301 Nucleosome assembly protein, NAP {Baker's yeast (S 100.0
>d2e50a1 d.305.1.1 (A:1-222) Protein SET {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: NAP-like
superfamily: NAP-like
family: NAP-like
domain: Protein SET
species: Human (Homo sapiens) [TaxId: 9606]
Probab=100.00  E-value=5.3e-60  Score=414.44  Aligned_cols=213  Identities=42%  Similarity=0.835  Sum_probs=170.3

Q ss_pred             CCCCccccchhh-----hhhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHhhhhhhhcchh
Q 024913            1 MVADKGKKTKVE-----EENAEQIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIKSIPD   75 (260)
Q Consensus         1 ~~~~~~~~~~~~-----~e~~~~~~~~v~~~i~~L~~lQ~e~~~le~~~~~e~~~le~ky~k~~~ply~kR~eiI~~IP~   75 (260)
                      ||||++|..+..     +.+...++.+++.+|.+|+.||.++..++.++.+++++|+++|.++++|+|++|++||+|||+
T Consensus         1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~~iQ~e~~~l~~e~~~e~~~le~k~~~~~~ply~~R~~iI~gIP~   80 (222)
T d2e50a1           1 MSAQAAKVSKKELNSNHDGADETSEKEQQEAIEHIDEVQNEIDRLNEQASEEILKVEQKYNKLRQPFFQKRSELIAKIPN   80 (222)
T ss_dssp             CHHHHHHHHHHHTTC-----CTTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTSTT
T ss_pred             CCCccccccchhcccccCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhccCcc
Confidence            788766655542     266789999999999999999999999999999999999999999999999999999999999


Q ss_pred             hHHHHHhhhhhhhcccChhhHHhhcCcceeEEEEccCCCcceEEEEEecCCCcccCCeEEEEEEeeCCCCCceeeecccc
Q 024913           76 FWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIK  155 (260)
Q Consensus        76 FW~~vl~n~~~l~~~i~~~D~~iL~~L~dI~Ve~~~d~~~~f~i~F~F~~NpyF~N~~LtK~~~~~~~~g~~~~~~t~I~  155 (260)
                      ||++||+||+.++.+|++.|+++|+||+||+|++..++..+|+|+|+|++||||+|++|+|+|++.. +|.+++++|+|+
T Consensus        81 FW~~~l~n~~~l~~~I~~~D~~~L~~L~dI~v~~~~~~~~~f~i~F~F~~N~yF~N~~L~K~y~~~~-~~~~~~~~t~I~  159 (222)
T d2e50a1          81 FWVTTFVNHPQVSALLGEEDEEAMHYLTRVEVTEFEDIKSGYRIDFYFDENPYFENKVLSKEFHMNE-SGDPSSKSTEIK  159 (222)
T ss_dssp             HHHHHHHTSHHHHTTCCHHHHHHGGGEEEEEEEECCSSCCCEEEEEEECSCSSBSCSEEEEEEC--------CEEECCCC
T ss_pred             HHHHHHHcCcchhhhCCHHHHHHHHhhhheeeeeccCCCCceEEEEEeCCCCcccCCeEEEEEEecC-CCCcccccccce
Confidence            9999999999999999999999999999999999988889999999999999999999999999988 888899999999


Q ss_pred             ccCCCCCCCccccccCCC---CccccccccccccccccCCCccccchHHHHHHHhhccccchhhhcc
Q 024913          156 WKEGMGIPNGVNHEKKGN---KRPLAEESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFN  219 (260)
Q Consensus       156 Wk~gk~~t~~~~~~k~~~---~r~~~~~SFF~~F~~~~~~~~~e~~~~ei~~~i~d~i~p~al~yy~  219 (260)
                      ||+|+++|++..+++++.   .......|||+||+++..+     ...+||++|+++|||+||.||+
T Consensus       160 Wk~gkd~t~k~~~~~~~~~~~~~~~~~~SFFnfF~~~~~~-----~~deI~~~i~d~i~p~av~yyL  221 (222)
T d2e50a1         160 WKSGKDMTKRSSQTQNKASRKRQHEEPESFFTWFTDHSDA-----GADELGEVIKDDIWPNPLQYYL  221 (222)
T ss_dssp             BCSSCCC----------------------CGGGGC-----------CHHHHHHHHHTTTTCSHHHHC
T ss_pred             ecCCCccccccccccccCccccccCCCCChhhcCCCCCCC-----CchHHHHHHHhhhchhhhHhhc
Confidence            999999998765543322   1234668999999988755     4568999999999999999997



>d2ayua1 d.305.1.1 (A:70-370) Nucleosome assembly protein, NAP {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure