Citrus sinensis ID: 024913

Local Sequence Feature Prediction

Prediction (Method)	Result

Residue Number Marker

Protein Sequence

Secondary Structure (PSIPRED)

Secondary Structure Prediction (SSPRO)

Coil and Loop (DISEMBL)

Flexible Loop (DISEMBL)

Low Complexity Region (SEG)

Disordered Region (IsUnstruct)

Disordered Region (DISOPRED)

Disordered Region (DISEMBL)

Disordered Region (DISPRO)

Transmembrane Helix (TMHMM)

Transmembrane Helix (HMMTOP)

Transmembrane Helix (MEMSAT)

TM Helix, Signal Peptide (MEMSAT_SVM)

TM Helix, Signal Peptide (Phobius)

Signal Peptide (SignalP HMM Mode)

Signal Peptide (SignalP NN Mode)

Coiled Coils (COILS)

Positional Conservation

--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260

MVADKGKKTKVEEENAEQIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIPNGVNHEKKGNKRPLAEESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNNEADEEEFEGDEEGKEDDDSEDDEDDQEEDDDDEDGDDEGN

cccccccccHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHcccccEEEEEccccHHHHcccHHHHHHHHHHcccEEEEcccccccEEEEEEEccccccccccEEEEEEEEccccccEEEEEEEEcccccccccccccccccccccccccccEEccccccccccccccHHHHHHHHHHcccccHHHHccccccccccccccccccccccccccccccccccccccccccc

ccccccccccccHccHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccEHHHHcccHHHHHHHcHcHHHHHHHHHccEEEEcccccccEEEEEEEcccccccccEEEEEEEEEcccccEEEEEccEEEcccccccHcccccccccccccccccHccccccccccccccHHHHHHHHHHHHHcccccHHHHccccccccccccccccccccccccEEEEEEcccccccccccc

mvadkgkktkveeENAEQIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIrkpvydkrndIIKSIPDFWLTAFishpalgellseEDQKIFRYLSslevedfkdvksgysitfnfspnpyfednkltktftfldddgsmkITATSIKwkegmgipngvnhekkgnkrplaeesfftwfsdtqekdtIDGIQDEVAEIIKedlwpnpltyfnneadeeefegdeegkedddseddeddqeeddddedgddegn

mvadkgkktkveeenaeqidselvlSIEKLQEIQDELEKINeeasekvleveqkyseirkpvydkrnDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSlevedfkdvKSGYSITfnfspnpyfedNKLTKTftfldddgsmKITATsikwkegmgipngvnhekkgnkrpLAEESFFTWFSDTQEKDTIDGIQDEVAEIIkedlwpnpLTYFNNEadeeefegdeegkedddseddeddqeeddddedgddegn

******************************************************YSEIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGI*****************ESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFN*****************************************

**************************IEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGI****************EESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNN****************************************

*****************QIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIPNGVNHEKKGNKRPLAEESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNNEAD*************************************

*****************QIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIP*G*************EESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNNEAD****************EDDEDDQEEDD**********

oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooo

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

MVADKGKKTKVEEENxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxYSEIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIPNGVNHEKKGNKRPLAEESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNNEADEEEFEGDEEGKEDDDSEDDEDDQEEDDDDEDGDDEGN

no confident homologs detected
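
The per-residue tracks above use one character per residue, aligned to the residue number marker. A minimal sketch of how such a track could be collapsed into contiguous segments follows. The function name `segments` and the toy input are illustrative assumptions, not part of the report, and the meaning of each symbol depends on the predictor that produced the track.

```python
from itertools import groupby

def segments(track):
    """Collapse a per-residue string into (start, end, symbol) runs.

    Positions are 1-indexed and inclusive, matching the residue ruler.
    """
    runs = []
    pos = 1
    for symbol, group in groupby(track):
        length = sum(1 for _ in group)
        runs.append((pos, pos + length - 1, symbol))
        pos += length
    return runs

# Toy track for illustration only (not copied from the report).
toy = "cccHHHHccEEE"
for start, end, symbol in segments(toy):
    print(start, end, symbol)
```

The same function works unchanged on the transmembrane tracks, since it only groups identical adjacent characters.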

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST

Original result of BLAST against SWISS-PROT Database

ID	Alignment graph	Length	Definition	RBH(Q2H)	RBH(H2Q)	Q cover	H cover	Identity	E-value
Query		260	2.2.26 [Sep-21-2011]
P53997		269	Protein SET OS=Drosophila	yes	no	0.723	0.698	0.45	7e-45
Q01105		290	Protein SET OS=Homo sapie	yes	no	0.719	0.644	0.448	3e-44
Q9EQU5		289	Protein SET OS=Mus muscul	yes	no	0.719	0.647	0.448	6e-44
Q63945		289	Protein SET OS=Rattus nor	yes	no	0.719	0.647	0.448	6e-44
Q9H2G4		693	Testis-specific Y-encoded	no	no	0.730	0.274	0.368	8e-35
Q7TQI8		677	Testis-specific Y-encoded	no	no	0.8	0.307	0.330	2e-34
Q9BE64		695	Testis-specific Y-encoded	N/A	no	0.730	0.273	0.364	3e-34
Q8N831		410	Testis-specific Y-encoded	no	no	0.7	0.443	0.371	1e-33
Q9UJ04		414	Testis-specific Y-encoded	no	no	0.703	0.442	0.357	3e-32
O88852		379	Testis-specific Y-encoded	no	no	0.692	0.474	0.372	3e-32
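
The hit table above can be read programmatically. The sketch below assumes the rows are tab-separated with an empty "Alignment graph" column, and that the RBH columns flag reciprocal best hits; the column keys and the `parse_hit` helper are hypothetical names chosen for illustration.

```python
# Hypothetical column keys mirroring the table header above.
COLUMNS = ["id", "graph", "length", "definition", "rbh_q2h", "rbh_h2q",
           "q_cover", "h_cover", "identity", "evalue"]

def parse_hit(row):
    """Split one tab-separated hit row into a dict with typed numeric fields."""
    fields = row.rstrip("\n").split("\t")
    if len(fields) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} fields, got {len(fields)}")
    hit = dict(zip(COLUMNS, fields))
    hit["length"] = int(hit["length"])
    for key in ("q_cover", "h_cover", "identity", "evalue"):
        hit[key] = float(hit[key])
    return hit

# Two rows copied from the table above.
rows = [
    "P53997\t\t269\tProtein SET OS=Drosophila\tyes\tno\t0.723\t0.698\t0.45\t7e-45",
    "Q9H2G4\t\t693\tTestis-specific Y-encoded\tno\tno\t0.730\t0.274\t0.368\t8e-35",
]
hits = [parse_hit(r) for r in rows]

# Keep only hits flagged "yes" in the RBH(Q2H) column.
reciprocal = [h["id"] for h in hits if h["rbh_q2h"] == "yes"]
print(reciprocal)
```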

>sp\|P53997\|SET_DROME Protein SET OS=Drosophila melanogaster GN=Set PE=1 SV=2	Back alignment and function description

 Score =  181 bits (458), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 136/200 (68%), Gaps = 12/200 (6%)

Query: 26  SIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIKSIPDFWLTAFISHP 85
           ++E++   Q+E++ +NE+ASE++L+VEQKY+++RKP Y+KR++++K IP+FW+T+FI+HP
Sbjct: 32  ALEQIDACQNEIDALNEKASEEILKVEQKYNKLRKPCYEKRSELVKRIPNFWVTSFINHP 91

Query: 86  ALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYFEDNKLTKTFTFLDDDG 145
            +  +L EE+++    L+ LEVE+F+D+KSGY I F+F  NPYFE+  LTK F       
Sbjct: 92  QVSGILDEEEEECLHALNKLEVEEFEDIKSGYRINFHFDENPYFENKVLTKEFHLNSAAA 151

Query: 146 S-----MKITATSIKWKEGMGIPNGVNHEKKGNKRPLAEE--SFFTWFSDTQEKDTIDGI 198
           S        T+T IKWKEG  +   +  +  GNK+    E  +FF WFS     D  D +
Sbjct: 152 SENGDWPASTSTPIKWKEGKNLLKLLLTKPYGNKKKRNSEYKTFFDWFS-----DNTDPV 206

Query: 199 QDEVAEIIKEDLWPNPLTYF 218
            DE+AE+IK+DLWPNPL Y+
Sbjct: 207 NDEIAELIKDDLWPNPLQYY 226

Drosophila melanogaster (taxid: 7227)
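
The Identity value in the hit table (0.45 for P53997) is the fraction reported on the alignment's "Identities" line. A minimal sketch of extracting those counts is shown below; the regular expression and the names `parse_stats` and `fraction` are assumptions made for illustration, not part of any BLAST tooling.

```python
import re

# Matches entries such as "Identities = 90/200 (45%)".
STAT_RE = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")

def parse_stats(line):
    """Return {name: (count, alignment_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(pct))
        for name, count, length, pct in STAT_RE.findall(line)
    }

def fraction(stats, name):
    """Recompute the fraction from the raw counts rather than the rounded percent."""
    count, length, _ = stats[name]
    return count / length

line = " Identities = 90/200 (45%), Positives = 136/200 (68%), Gaps = 12/200 (6%)"
stats = parse_stats(line)
print(fraction(stats, "Identities"))
```

Recomputing from the counts avoids the rounding in the printed percentage, which matters when comparing hits with nearly equal identity.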

>sp\|Q01105\|SET_HUMAN Protein SET OS=Homo sapiens GN=SET PE=1 SV=3	Back alignment and function description

>sp\|Q9EQU5\|SET_MOUSE Protein SET OS=Mus musculus GN=Set PE=1 SV=1	Back alignment and function description

>sp\|Q63945\|SET_RAT Protein SET OS=Rattus norvegicus GN=Set PE=1 SV=2	Back alignment and function description

>sp\|Q9H2G4\|TSYL2_HUMAN Testis-specific Y-encoded-like protein 2 OS=Homo sapiens GN=TSPYL2 PE=1 SV=1	Back alignment and function description

>sp\|Q7TQI8\|TSYL2_MOUSE Testis-specific Y-encoded-like protein 2 OS=Mus musculus GN=Tspyl2 PE=1 SV=1	Back alignment and function description

>sp\|Q9BE64\|TSYL2_MACFA Testis-specific Y-encoded-like protein 2 OS=Macaca fascicularis GN=TSPYL2 PE=1 SV=1	Back alignment and function description

>sp\|Q8N831\|TSYL6_HUMAN Testis-specific Y-encoded-like protein 6 OS=Homo sapiens GN=TSPYL6 PE=2 SV=1	Back alignment and function description

>sp\|Q9UJ04\|TSYL4_HUMAN Testis-specific Y-encoded-like protein 4 OS=Homo sapiens GN=TSPYL4 PE=2 SV=2	Back alignment and function description

>sp\|O88852\|TSYL1_MOUSE Testis-specific Y-encoded-like protein 1 OS=Mus musculus GN=Tspyl1 PE=2 SV=1	Back alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST

Original result of BLAST against Nonredundant Database

GI	Alignment Graph	Length	Definition	Q cover	H cover	Identity	E-value
Query		260
225456743		268	PREDICTED: protein SET [Vitis vinifera]	0.896	0.869	0.804	1e-105
26451105		228	unknown protein [Arabidopsis thaliana] g	0.869	0.991	0.794	2e-97
6730705		255	Putative phospatase 2A inhibitor [Arabid	0.842	0.858	0.800	5e-96
356516792		263	PREDICTED: protein SET-like isoform 1 [G	0.842	0.832	0.797	6e-96
225452785		255	PREDICTED: protein SET [Vitis vinifera]	0.838	0.854	0.790	6e-96
356508564		261	PREDICTED: protein SET-like [Glycine max	0.842	0.839	0.792	8e-96
296082895		253	unnamed protein product [Vitis vinifera]	0.838	0.861	0.790	1e-95
255638203		263	unknown [Glycine max]	0.842	0.832	0.792	5e-95
18394656		256	template-activating factor I [Arabidopsi	0.842	0.855	0.797	1e-94
224119088		242	nucleosome/chromatin assembly factor gro	0.884	0.950	0.764	1e-94

>gi\|225456743\|ref\|XP_002275632.1\| PREDICTED: protein SET [Vitis vinifera] gi\|297733991\|emb\|CBI15238.3\| unnamed protein product [Vitis vinifera]	Back alignment and taxonomy information

 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 194/241 (80%), Positives = 219/241 (90%), Gaps = 8/241 (3%)

Query: 1   MVADKGKKTKV----EEENAEQIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYS 56
           MVADKGKK K     EE N++ ID +LVLSIEKLQEIQD+LEKINEEAS+KVLEVEQKY+
Sbjct: 1   MVADKGKKLKQSEKEEEVNSDHIDGDLVLSIEKLQEIQDDLEKINEEASDKVLEVEQKYN 60

Query: 57  EIRKPVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSG 116
           EIRKP+YDKRNDIIKSIPDFWLTAF+SHPALG+LLSEEDQKIF+YLSSLEVEDFKDVKSG
Sbjct: 61  EIRKPIYDKRNDIIKSIPDFWLTAFLSHPALGDLLSEEDQKIFKYLSSLEVEDFKDVKSG 120

Query: 117 YSITFNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIPNGVNHEKKGNKRP 176
           YSITFNF+PNP+FED KL KTFTFL D+G  KI+ TSIKWK+GMGIPNGVNHEKKGNKRP
Sbjct: 121 YSITFNFNPNPFFEDTKLKKTFTFL-DEGITKISVTSIKWKDGMGIPNGVNHEKKGNKRP 179

Query: 177 LAEESFFTWFSDTQEKDTIDGIQDEVAEIIKEDLWPNPLTYFNNEADEEEFEG---DEEG 233
           +A+ SFF+WFS+TQ+KD +D I DE+AEIIKEDLWPNPLTYFN+EADEE+F+G   DEEG
Sbjct: 180 IADASFFSWFSETQQKDIMDDIHDEIAEIIKEDLWPNPLTYFNSEADEEDFDGEDADEEG 239

Query: 234 K 234
           K
Sbjct: 240 K 240

Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi\|26451105\|dbj\|BAC42657.1\| unknown protein [Arabidopsis thaliana] gi\|28950777\|gb\|AAO63312.1\| At1g18800 [Arabidopsis thaliana]	Back alignment and taxonomy information

>gi\|6730705\|gb\|AAF27100.1\|AC011809_9 Putative phospatase 2A inhibitor [Arabidopsis thaliana]	Back alignment and taxonomy information

>gi\|356516792\|ref\|XP_003527077.1\| PREDICTED: protein SET-like isoform 1 [Glycine max]	Back alignment and taxonomy information

>gi\|225452785\|ref\|XP_002283325.1\| PREDICTED: protein SET [Vitis vinifera]	Back alignment and taxonomy information

>gi\|356508564\|ref\|XP_003523025.1\| PREDICTED: protein SET-like [Glycine max]	Back alignment and taxonomy information

>gi\|296082895\|emb\|CBI22196.3\| unnamed protein product [Vitis vinifera]	Back alignment and taxonomy information

>gi\|255638203\|gb\|ACU19415.1\| unknown [Glycine max]	Back alignment and taxonomy information

>gi\|18394656\|ref\|NP_564063.1\| template-activating factor I [Arabidopsis thaliana] gi\|21555241\|gb\|AAM63812.1\| putative SET protein, phospatase 2A inhibitor [Arabidopsis thaliana] gi\|332191644\|gb\|AEE29765.1\| template-activating factor I [Arabidopsis thaliana]	Back alignment and taxonomy information

>gi\|224119088\|ref\|XP_002317982.1\| nucleosome/chromatin assembly factor group [Populus trichocarpa] gi\|222858655\|gb\|EEE96202.1\| nucleosome/chromatin assembly factor group [Populus trichocarpa]	Back alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST

Original result of BLAST against Gene Ontology (AMIGO)

ID	Alignment graph	Length	Definition	Q cover	H cover	Identity	E-value
Query		260
TAIR\|locus:2034995		256	NRP2 "AT1G18800" [Arabidopsis	0.842	0.855	0.797	5.5e-93
TAIR\|locus:2019075		264	NRP1 "AT1G74560" [Arabidopsis	0.807	0.795	0.760	3.4e-84
ZFIN\|ZDB-GENE-030131-433		275	setb "SET translocation (myelo	0.757	0.716	0.444	8.8e-47
UNIPROTKB\|F1RR69		289	SET "Uncharacterized protein"	0.773	0.695	0.433	3e-46
UNIPROTKB\|F2Z4L4		277	SET "Uncharacterized protein"	0.757	0.711	0.435	6.2e-46
FB\|FBgn0014879		269	Set "Set" [Drosophila melanoga	0.773	0.747	0.437	7.9e-46
UNIPROTKB\|Q5VXV2		268	SET "Protein SET" [Homo sapien	0.746	0.723	0.438	7.9e-46
MGI\|MGI:1860267		289	Set "SET nuclear oncogene" [Mu	0.780	0.702	0.425	1e-45
RGD\|1307467		289	Set "SET nuclear oncogene" [Ra	0.780	0.702	0.425	1e-45
UNIPROTKB\|Q01105		290	SET "Protein SET" [Homo sapien	0.719	0.644	0.448	1.3e-45

TAIR\|locus:2034995 NRP2 "AT1G18800" [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

 Score = 926 (331.0 bits), Expect = 5.5e-93, P = 5.5e-93
 Identities = 177/222 (79%), Positives = 200/222 (90%)

Query:     1 MVADKGKKTKVEEENAEQIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRK 60
             MV DK KK K EEEN EQID+ELVLSIEKLQEIQD+LEKINE+AS++VLEVEQKY+ IRK
Sbjct:     1 MVTDKSKKAKTEEENVEQIDAELVLSIEKLQEIQDDLEKINEKASDEVLEVEQKYNVIRK 60

Query:    61 PVYDKRNDIIKSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSIT 120
             PVYDKRN+IIK+IPDFWLTAF+SHPALGELL+EEDQKIF+YLSSL+VED KDVKSGYSIT
Sbjct:    61 PVYDKRNEIIKTIPDFWLTAFLSHPALGELLTEEDQKIFKYLSSLDVEDAKDVKSGYSIT 120

Query:   121 FNFSPNPYFEDNKLTKTFTFLDDDGSMKITATSIKWKEGMGIPNGVNHEKKGNKRPLAEE 180
             F+F+PNP+FED KLTKTFTFL++ G+ KITAT IKWKEG G+ NGVNHEK GNKR L EE
Sbjct:   121 FSFNPNPFFEDGKLTKTFTFLEE-GTTKITATPIKWKEGKGLANGVNHEKNGNKRALPEE 179

Query:   181 SFFTWFSDTQEKDTI-DGIQDE-VAEIIKEDLWPNPLTYFNN 220
             SFFTWFSD Q K+ + D +QDE VA+IIKEDLWPNPLTYFNN
Sbjct:   180 SFFTWFSDAQHKEDVEDEMQDEQVADIIKEDLWPNPLTYFNN 221

GO:0003677 "DNA binding" evidence=ISS

GO:0005634 "nucleus" evidence=ISM;IEA;ISS;IDA

GO:0006334 "nucleosome assembly" evidence=IEA;ISS

GO:0003682 "chromatin binding" evidence=IPI

GO:0005737 "cytoplasm" evidence=IDA

GO:0008283 "cell proliferation" evidence=IGI

GO:0010311 "lateral root formation" evidence=IGI

GO:0030154 "cell differentiation" evidence=IGI

GO:0042393 "histone binding" evidence=IPI

TAIR\|locus:2019075 NRP1 "AT1G74560" [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

ZFIN\|ZDB-GENE-030131-433 setb "SET translocation (myeloid leukemia-associated) B" [Danio rerio (taxid:7955)]	Back alignment and assigned GO terms

UNIPROTKB\|F1RR69 SET "Uncharacterized protein" [Sus scrofa (taxid:9823)]	Back alignment and assigned GO terms

UNIPROTKB\|F2Z4L4 SET "Uncharacterized protein" [Gallus gallus (taxid:9031)]	Back alignment and assigned GO terms

FB\|FBgn0014879 Set "Set" [Drosophila melanogaster (taxid:7227)]	Back alignment and assigned GO terms

UNIPROTKB\|Q5VXV2 SET "Protein SET" [Homo sapiens (taxid:9606)]	Back alignment and assigned GO terms

MGI\|MGI:1860267 Set "SET nuclear oncogene" [Mus musculus (taxid:10090)]	Back alignment and assigned GO terms

RGD\|1307467 Set "SET nuclear oncogene" [Rattus norvegicus (taxid:10116)]	Back alignment and assigned GO terms

UNIPROTKB\|Q01105 SET "Protein SET" [Homo sapiens (taxid:9606)]	Back alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries

Original result of BLAST against SWISS-PROT

No confident hit for EC number transfer was detected by BLAST in SWISS-PROT

EC Number Prediction by EzyPred Server

Original result from EzyPred Server

Failed to connect to the EzyPred server

EC Number Prediction by EFICAz Software

No EC number assigned; probably not an enzyme.

Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING

Original result from the STRING server

Your Input:
	GSVIVG00020937001	SubName- Full=Chromosome chr14 scaffold_21, whole genome shotgun sequence; (255 aa)
		(Vitis vinifera)
Predicted Functional Partners:
	GSVIVG00017697001	SubName- Full=Chromosome chr17 scaffold_16, whole genome shotgun sequence; (86 aa)	•		0.608
	GSVIVG00028481001	SubName- Full=Chromosome chr7 scaffold_44, whole genome shotgun sequence; (316 aa)	•		0.595
	GSVIVG00021672001	SubName- Full=Chromosome chr8 scaffold_23, whole genome shotgun sequence; (315 aa)	•		0.544
	GSVIVG00017910001	SubName- Full=Chromosome chr17 scaffold_16, whole genome shotgun sequence; (481 aa)	•		0.516
	GSVIVG00014203001	SubName- Full=Chromosome chr16 scaffold_10, whole genome shotgun sequence; (151 aa)	•		0.503
	GSVIVG00000534001	SubName- Full=Chromosome chr2 scaffold_105, whole genome shotgun sequence; (338 aa)	•		0.501
	GSVIVG00005498001	SubName- Full=Putative uncharacterized protein (Chromosome chr13 scaffold_152, whole genome sho [...] (293 aa)	•		0.496
	GSVIVG00021307001	SubName- Full=Chromosome chr8 scaffold_23, whole genome shotgun sequence; (586 aa)	•		0.488
	GSVIVG00002607001	SubName- Full=Chromosome undetermined scaffold_133, whole genome shotgun sequence; (114 aa)	•	•	0.473
	GSVIVG00014439001	RecName- Full=Diphthine synthase; EC=2.1.1.98;; Required for the methylation step in diphthamid [...] (285 aa)	•		0.471

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST

Original result of RPS-BLAST against CDD database part I

ID	Alignment Graph	Length	Definition	E-value
Query		260
pfam00956		236	pfam00956, NAP, Nucleosome assembly protein (NAP)	2e-49
PTZ00008		185	PTZ00008, PTZ00008, (NAP-S) nucleosome assembly pr	3e-23
PTZ00007		337	PTZ00007, PTZ00007, (NAP-L) nucleosome assembly pr	1e-22
pfam10446		449	pfam10446, DUF2457, Protein of unknown function (D	2e-04
PRK12298		390	PRK12298, obgE, GTPase CgtA; Reviewed	7e-04
pfam03153		332	pfam03153, TFIIA, Transcription factor IIA, alpha/	8e-04
pfam04147		809	pfam04147, Nop14, Nop14-like family	0.001
pfam10446		449	pfam10446, DUF2457, Protein of unknown function (D	0.002
pfam03153		332	pfam03153, TFIIA, Transcription factor IIA, alpha/	0.002
pfam04546		211	pfam04546, Sigma70_ner, Sigma-70, non-essential re	0.002
pfam10446		449	pfam10446, DUF2457, Protein of unknown function (D	0.003
pfam02724		583	pfam02724, CDC45, CDC45-like protein	0.004

>gnl\|CDD\|216213 pfam00956, NAP, Nucleosome assembly protein (NAP)	Back alignment and domain information

 Score =  162 bits (411), Expect = 2e-49
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 37/234 (15%)

Query: 26  SIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDII--------------- 70
            +E L+ +Q EL+++  +  E+VLE+E+KY ++ +P+YDKR +II               
Sbjct: 3   RVEALKALQKELDELEAKFQEEVLELERKYDKLYQPLYDKRREIINGAREPTEVEEEEPE 62

Query: 71  -KSIPDFWLTAFISHPALGELLSEEDQKIFRYLSSLEVEDFKDVKSGYSITFNFSPNPYF 129
            K IP FWLTA  +HP L E+++E D++  +YL+ + VE  +D K G+ + F+F+PNPYF
Sbjct: 63  EKGIPGFWLTALKNHPLLSEMITERDEEALKYLTDIRVEYLEDPKKGFKLIFHFAPNPYF 122

Query: 130 EDNKLTKTFTFLDDDGS--MKITATSIKWKEGM-----GIPNGVNHEKKGNKRPLAE--- 179
            +  LTKT+   D+     +K   T I+WKEG       +     ++K G  R + +   
Sbjct: 123 TNEVLTKTYHLKDEGDPFELKSEGTPIEWKEGKNLTVKTVKKKQRNKKTGQTRTITKTVP 182

Query: 180 -ESFFTWFSDTQEKDTIDG----------IQDEVAEIIKEDLWPNPLTYFNNEA 222
            ESFF +FS  +  D  D           +  E+ EIIK+DL P  L Y+  EA
Sbjct: 183 AESFFNFFSPPKVPDDDDDDDEELEEELELDYEIGEIIKDDLIPRALDYYTGEA 236

NAP proteins are involved in moving histones into the nucleus, nucleosome assembly and chromatin fluidity. They affect the transcription of many genes. Length = 236

>gnl\|CDD\|185394 PTZ00008, PTZ00008, (NAP-S) nucleosome assembly protein-S; Provisional	Back alignment and domain information

>gnl\|CDD\|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L; Provisional	Back alignment and domain information

>gnl\|CDD\|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457)	Back alignment and domain information

>gnl\|CDD\|237047 PRK12298, obgE, GTPase CgtA; Reviewed	Back alignment and domain information

>gnl\|CDD\|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit	Back alignment and domain information

>gnl\|CDD\|217927 pfam04147, Nop14, Nop14-like family	Back alignment and domain information

>gnl\|CDD\|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457)	Back alignment and domain information

>gnl\|CDD\|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit	Back alignment and domain information

>gnl\|CDD\|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region	Back alignment and domain information

>gnl\|CDD\|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457)	Back alignment and domain information

>gnl\|CDD\|217203 pfam02724, CDC45, CDC45-like protein	Back alignment and domain information

Conserved Domains Detected by HHsearch

Original result of HHsearch against CDD database

ID	Alignment Graph	Length	Definition	Probability
Query		260
PTZ00007		337	(NAP-L) nucleosome assembly protein -L; Provisiona	100.0
KOG1507		358	consensus Nucleosome assembly protein NAP-1 [Chrom	100.0
PTZ00008		185	(NAP-S) nucleosome assembly protein-S; Provisional	100.0
PF00956		244	NAP: Nucleosome assembly protein (NAP); InterPro:	100.0
KOG1508		260	consensus DNA replication factor/protein phosphata	100.0
PF11629		49	Mst1_SARAH: C terminal SARAH domain of Mst1; Inter	94.71
PF09026		101	CENP-B_dimeris: Centromere protein B dimerisation	92.24
PF07352		149	Phage_Mu_Gam: Bacteriophage Mu Gam like protein; I	91.9
KOG1508		260	consensus DNA replication factor/protein phosphata	89.37
COG4396		170	Mu-like prophage host-nuclease inhibitor protein G	88.52
PF06524		314	NOA36: NOA36 protein; InterPro: IPR010531 This fam	85.47
PF04931		784	DNA_pol_phi: DNA polymerase phi; InterPro: IPR0070	82.88

>PTZ00007 (NAP-L) nucleosome assembly protein -L; Provisional	Back alignment and domain information

Probab=100.00  E-value=5.1e-60  Score=437.52  Aligned_cols=217  Identities=30%  Similarity=0.591  Sum_probs=189.7

Q ss_pred             hhhhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHhhhhhhh----------cchhhHHHHH
Q 024913           12 EEENAEQIDSELVLSIEKLQEIQDELEKINEEASEKVLEVEQKYSEIRKPVYDKRNDIIK----------SIPDFWLTAF   81 (260)
Q Consensus        12 ~~e~~~~~~~~v~~~i~~L~~lQ~e~~~le~~~~~e~~~le~ky~k~~~ply~kR~eiI~----------~IP~FW~~vl   81 (260)
                      .++++..||+.++.++.+|+.||.++..|+.++++++++|+++|.++++|+|++|++||+          |||+||++||
T Consensus        28 ~~~~i~~Lp~~~~~rv~aL~~lQ~e~~~le~ef~~ev~~LE~kY~~~~~Ply~kR~eII~G~~~~e~~~~gIP~FWl~vL  107 (337)
T PTZ00007         28 DDEKLSHLTDEQRETLKKLQLLQKEFDDLEVEYNAELRKLRSKYEDLYNPIYDKRKEALVQNGGAEIGTPGLPQFWLTAM  107 (337)
T ss_pred             ccchhhhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHcCCcccccccCCcccHHHHHH
Confidence            346677999999999999999999999999999999999999999999999999999999          6999999999


Q ss_pred             hhhhhhhcccChhhHHhhcCcceeEEEEccCCC-cceEEEEEecCCCcccCCeEEEEEEeeCCC---CC--ceeeecccc
Q 024913           82 ISHPALGELLSEEDQKIFRYLSSLEVEDFKDVK-SGYSITFNFSPNPYFEDNKLTKTFTFLDDD---GS--MKITATSIK  155 (260)
Q Consensus        82 ~n~~~l~~~i~~~D~~iL~~L~dI~Ve~~~d~~-~~f~i~F~F~~NpyF~N~~LtK~~~~~~~~---g~--~~~~~t~I~  155 (260)
                      +||+.|+.+|+++|++||+||++|+|++..+.. +||+|+|+|++||||+|++|||+|++....   |+  ..+++|+|+
T Consensus       108 ~Nh~~ls~~I~e~De~iL~~L~dI~ve~~~~~~~~gf~I~F~F~~NpyF~N~vLtK~y~~~~~d~~~~p~~~~~~~t~I~  187 (337)
T PTZ00007        108 KNNNTLGSAIEEHDEPILSYLSDISCEYTEPNKQEGFILVFTFAPNPFFSNTVLTKTYHMKVLDGDDEPLLSNTVATEID  187 (337)
T ss_pred             HcCccHhhhCCHHHHHHHHhhCceEEEEccCCCCCceEEEEEeCCCCCCCCCeEEEEEEeecCCCCCCceeecceeeece
Confidence            999999999999999999999999999876544 899999999999999999999999997422   22  246899999


Q ss_pred             ccCCCCCCCccccccCCCC-----c----cccccccccccccccCCCccc----------------cchHHHHHHHhhcc
Q 024913          156 WKEGMGIPNGVNHEKKGNK-----R----PLAEESFFTWFSDTQEKDTID----------------GIQDEVAEIIKEDL  210 (260)
Q Consensus       156 Wk~gk~~t~~~~~~k~~~~-----r----~~~~~SFF~~F~~~~~~~~~e----------------~~~~ei~~~i~d~i  210 (260)
                      ||+|++||++..++|++++     |    +++..|||+||+++..+...+                +.+++||++|+++|
T Consensus       188 WK~GkdlT~k~v~kKqr~K~~~~~r~v~~~~~~~SFFnfF~p~~~p~~~~~e~~~e~~~ee~~~~l~~DyeiG~~ikd~I  267 (337)
T PTZ00007        188 WKQGKDVTKKVVTKKQRHKKTKETRTVTETVDRESFFNFFTSHEVPSDEELEKMSKHEIAELEMIVETDYEIGITIRDKL  267 (337)
T ss_pred             eeCCCCchhhhcccccccccCCCceeeccCCCCCChHHhcCCCCCCcccccccccchhHHHHHHHHHHhHHHHHHHHHhc
Confidence            9999999998766544333     2    356799999999987654210                24679999999999


Q ss_pred             ccchhhhcccCCCccccc
Q 024913          211 WPNPLTYFNNEADEEEFE  228 (260)
Q Consensus       211 ~p~al~yy~~~~~~~e~~  228 (260)
                      ||+||.||+|++.+++.+
T Consensus       268 IP~AV~yftGea~d~~~~  285 (337)
T PTZ00007        268 IPYAVYWFLGEAIDEDSD  285 (337)
T ss_pred             ccccHHhhCCCccccccc
Confidence            999999999998776654
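
The summary line that opens the HHsearch alignment above is a run of whitespace-separated `key=value` pairs. A minimal sketch of reading it into numbers follows; the helper name `parse_hh_summary` is hypothetical, and the sketch assumes every value is numeric apart from an optional trailing percent sign.

```python
def parse_hh_summary(line):
    """Parse 'Key=value' tokens into a dict of floats, dropping any '%' suffix."""
    summary = {}
    for token in line.split():
        key, _, value = token.partition("=")
        summary[key] = float(value.rstrip("%"))
    return summary

# Summary line copied from the PTZ00007 alignment above.
line = ("Probab=100.00  E-value=5.1e-60  Score=437.52  Aligned_cols=217  "
        "Identities=30%  Similarity=0.591  Sum_probs=189.7")
summary = parse_hh_summary(line)
print(summary["Probab"], summary["Aligned_cols"])
```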

>KOG1507 consensus Nucleosome assembly protein NAP-1 [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning]	Back alignment and domain information

>PTZ00008 (NAP-S) nucleosome assembly protein-S; Provisional	Back alignment and domain information

>PF00956 NAP: Nucleosome assembly protein (NAP); InterPro: IPR002164 It is thought that NAPs act as histone chaperones, shuttling both core and linker histones from their site of synthesis in the cytoplasm to the nucleus	Back alignment and domain information

>KOG1508 consensus DNA replication factor/protein phosphatase inhibitor SET/SPR-2 [Replication, recombination and repair]	Back alignment and domain information

>PF11629 Mst1_SARAH: C terminal SARAH domain of Mst1; InterPro: IPR024205 The SARAH (Sav/Rassf/Hpo) domain is found at the C terminus in three classes of eukaryotic tumour suppressors that give the domain its name	Back alignment and domain information

>PF09026 CENP-B_dimeris: Centromere protein B dimerisation domain; InterPro: IPR015115 Centromere protein B (CENP-B) interacts with centromeric heterochromatin in chromosomes and binds to a specific subset of alphoid satellite DNA, called the CENP-B box	Back alignment and domain information

>PF07352 Phage_Mu_Gam: Bacteriophage Mu Gam like protein; InterPro: IPR009951 The Gam protein, originally characterised in Bacteriophage Mu, protects linear double stranded DNA from exonuclease degradation in vitro and in vivo	Back alignment and domain information

>KOG1508 consensus DNA replication factor/protein phosphatase inhibitor SET/SPR-2 [Replication, recombination and repair]	Back alignment and domain information

>COG4396 Mu-like prophage host-nuclease inhibitor protein Gam [General function prediction only]	Back alignment and domain information

>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues	Back alignment and domain information

>PF04931 DNA_pol_phi: DNA polymerase phi; InterPro: IPR007015 Proteins of this family are predominantly nucleolar	Back alignment and domain information

Homologous Structure Templates