Citrus Sinensis ID: 022261


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300
MEGNSGGGGAGAGGSGGVTGGGAGGGGGNDVELVCKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYYVNSDDHELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSSRDEGWAAFRNILAEINEASRLLILPNQQGSEQSEHLVGLSDDVGAGFISGHGSQPAPASELNVDRSVDLPAQDEIGNMGVSKVIRADQKRFFFDLGSNNRGHFLRISEVTGSDRSSIILPLSGLKQFHEIVGHFVEITKDRIEGMTGASVRIVDPPQR
ccccccccccccccccccccccccccccccccEEEEEEEEEEEEEEEEEEEcccccEEEEEEcccccEEEEEEccccHHHHHHHHHHHHcccccccccccccccccEEEEEEcccccccEEEEEEEEcccccEEEEEcccccccHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEccccEEEEccccccccEEEEEEEccccccEEEEccccHHHHHHHHHHHHHHHHHHHHccccccccccccccc
cccccccccccccccccccccccccccccccEEEEEEEEEEcEEEEEEEEEccccEEEEEEEEcccccEEEEEcHHHHHHHHHHHHHHcccccccccccEEEEcccEEEEEEccccccEEEEEEEEEcccccEEEEEcccccccHHHHHHHHHHHHHHHHcccEEccccccHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccEEEEEEccEEEEEccccccEEEEEEEEEccccccEEEEcHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccc
megnsggggagaggsggvtgggagggggndVELVCKTLQVEHKLFyfdlkenprgrylkisektsatrstiivpssgiSWFLDLFNYyvnsddhelFSKELQLDSkvfyfdigenrrgrflkvseasvsrnrstiivpagssrdEGWAAFRNILAEINEASRllilpnqqgseqsehlvglsddvgagfisghgsqpapaselnvdrsvdlpaqdeignmgvSKVIRADQKrfffdlgsnnrghflrisevtgsdrssiilplsglkQFHEIVGHFVEITKDRiegmtgasvrivdppqr
MEGNSGGGGAGAGGSGGVTGGGAGGGGGNDVELVCKTLQVEHKLfyfdlkenprgrylkisektsatrstiivpssgISWFLDLFNYYVNSDDHELFSKELQLDSKVFYFdigenrrgrflkvseasvsrnrstiivpagssrdegWAAFRNILAEINEASRLLILPNQQGSEQSEHLVGLSDDVGAGFISGHGSQPAPASELNVDRSVDLPAQDEIGNMGVSKVIRADQKRFffdlgsnnrgHFLRISEVTGSDRSSIILPLSGLKQFHEIVGHFVEITKDriegmtgasvrivdppqr
MEgnsggggagaggsggvtgggagggggNDVELVCKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYYVNSDDHELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSSRDEGWAAFRNILAEINEASRLLILPNQQGSEQSEHLVGLSDDVGAGFISGHGSQPAPASELNVDRSVDLPAQDEIGNMGVSKVIRADQKRFFFDLGSNNRGHFLRISEVTGSDRSSIILPLSGLKQFHEIVGHFVEITKDRIEGMTGASVRIVDPPQR
*******************************ELVCKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYYVNSDDHELFSKELQLDSKVFYFDIGENRRGRFLKVSEA******STIIVPAGSSRDEGWAAFRNILAEINEASRLLIL***************************************************GNMGVSKVIRADQKRFFFDLGSNNRGHFLRISEVTGSDRSSIILPLSGLKQFHEIVGHFVEITKDRIEGMTGA**********
*************************************LQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYYVNSDDHELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSSRDEGWAAFRNILAEI******************************************************************KVIRADQKRFFFDLGSNNRGHFLRISEV*GSDRSSIILPLSGLKQFHEIVGHFV***********************
************GGSGGVTGGGAGGGGGNDVELVCKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYYVNSDDHELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSSRDEGWAAFRNILAEINEASRLLILPNQQGSEQSEHLVGLSDDVGAGFISGH*********LNVDRSVDLPAQDEIGNMGVSKVIRADQKRFFFDLGSNNRGHFLRISEVTGSDRSSIILPLSGLKQFHEIVGHFVEITKDRIEGMTGASVRIVDPPQR
*****************************DVELVCKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYYVNSDDHELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSSRDEGWAAFRNILAEINEASRLLILPNQQGSEQSEHLVGLSDDVGAG*******************************MGVSKVIRADQKRFFFDLGSNNRGHFLRISEVTGSDRSSIILPLSGLKQFHEIVGHFVEITKDR*****************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEGNSGGGGAGAGGSGGVTGGGAGGGGGNDVELVCKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYYVNSDDHELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSSRDEGWAAFRNILAEINEASRLLILPNQQGSEQSEHLVGLSDDVGAGFISGHGSQPAPASELNVDRSVDLPAQDEIGNMGVSKVIRADQKRFFFDLGSNNRGHFLRISEVTGSDRSSIILPLSGLKQFHEIVGHFVEITKDRIEGMTGASVRIVDPPQR
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query300 2.2.26 [Sep-21-2011]
Q9SKZ1296 Transcription factor Pur- yes no 0.896 0.908 0.853 1e-132
Q6PHK6297 Transcriptional activator no no 0.743 0.750 0.277 5e-15
Q00577322 Transcriptional activator no no 0.646 0.602 0.263 1e-14
Q96QR8312 Transcriptional activator yes no 0.713 0.685 0.265 4e-14
Q9UJV8347 Purine-rich element-bindi no no 0.713 0.616 0.260 7e-14
Q8R4E6350 Purine-rich element-bindi yes no 0.71 0.608 0.261 7e-14
Q6PAC9323 Transcriptional activator N/A no 0.703 0.653 0.262 1e-13
Q68A21315 Transcriptional activator no no 0.663 0.631 0.265 6e-13
Q8AVS4328 Transcriptional activator N/A no 0.67 0.612 0.275 6e-13
O35295324 Transcriptional activator no no 0.663 0.614 0.261 7e-13
>sp|Q9SKZ1|PUR_ARATH Transcription factor Pur-alpha 1 OS=Arabidopsis thaliana GN=PURA1 PE=1 SV=2 Back     alignment and function desciption
 Score =  469 bits (1208), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 233/273 (85%), Positives = 250/273 (91%), Gaps = 4/273 (1%)

Query: 29  NDVELVCKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYY 88
           +DVELV KTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYY
Sbjct: 27  SDVELVSKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYY 86

Query: 89  VNSDDHELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSSRDEGWA 148
           VNS++HELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSS DEGWA
Sbjct: 87  VNSEEHELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSSPDEGWA 146

Query: 149 AFRNILAEINEASRLLILPNQ-QGSEQSEHLVGLSDDVGAGFISGHGSQPAPASELNVDR 207
           AFRNILAEI+EAS L ++PNQ + S+  EHLV   DDVGAGFI GHGSQ   +SE NVDR
Sbjct: 147 AFRNILAEIHEASGLFVMPNQVKPSDGQEHLV---DDVGAGFIPGHGSQQPSSSEHNVDR 203

Query: 208 SVDLPAQDEIGNMGVSKVIRADQKRFFFDLGSNNRGHFLRISEVTGSDRSSIILPLSGLK 267
           ++D P Q+E G  GVSKVIRADQKRFFFDLG+NNRGHFLRISEV GSDRSSIILPLSGLK
Sbjct: 204 TIDSPGQEETGMTGVSKVIRADQKRFFFDLGNNNRGHFLRISEVAGSDRSSIILPLSGLK 263

Query: 268 QFHEIVGHFVEITKDRIEGMTGASVRIVDPPQR 300
           QFHE++GHFVEITKD+IEGMTGA+VR VDPPQR
Sbjct: 264 QFHEVIGHFVEITKDKIEGMTGANVRTVDPPQR 296




Transcription factor that specifically binds the purine-rich double-stranded telomeric repeated sequence 5'-AAACCCTAA-3' found in promoter telo boxes.
Arabidopsis thaliana (taxid: 3702)
>sp|Q6PHK6|PURB_DANRE Transcriptional activator protein Pur-beta OS=Danio rerio GN=purb PE=2 SV=3 Back     alignment and function description
>sp|Q00577|PURA_HUMAN Transcriptional activator protein Pur-alpha OS=Homo sapiens GN=PURA PE=1 SV=2 Back     alignment and function description
>sp|Q96QR8|PURB_HUMAN Transcriptional activator protein Pur-beta OS=Homo sapiens GN=PURB PE=1 SV=3 Back     alignment and function description
>sp|Q9UJV8|PURG_HUMAN Purine-rich element-binding protein gamma OS=Homo sapiens GN=PURG PE=2 SV=1 Back     alignment and function description
>sp|Q8R4E6|PURG_MOUSE Purine-rich element-binding protein gamma OS=Mus musculus GN=Purg PE=1 SV=1 Back     alignment and function description
>sp|Q6PAC9|PURBA_XENLA Transcriptional activator protein Pur-beta-A OS=Xenopus laevis GN=purb-a PE=2 SV=3 Back     alignment and function description
>sp|Q68A21|PURB_RAT Transcriptional activator protein Pur-beta OS=Rattus norvegicus GN=Purb PE=1 SV=3 Back     alignment and function description
>sp|Q8AVS4|PURBB_XENLA Transcriptional activator protein Pur-beta-B OS=Xenopus laevis GN=purb-b PE=2 SV=3 Back     alignment and function description
>sp|O35295|PURB_MOUSE Transcriptional activator protein Pur-beta OS=Mus musculus GN=Purb PE=1 SV=3 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query300
225435836279 PREDICTED: transcription factor Pur-alph 0.923 0.992 0.916 1e-146
359807554286 uncharacterized protein LOC100782805 [Gl 0.906 0.951 0.926 1e-145
356521624283 PREDICTED: transcription factor Pur-alph 0.903 0.957 0.926 1e-144
357475063293 Transcription factor Pur-alpha [Medicago 0.896 0.918 0.911 1e-141
147774370291 hypothetical protein VITISV_041199 [Viti 0.9 0.927 0.875 1e-138
224075952269 predicted protein [Populus trichocarpa] 0.89 0.992 0.903 1e-138
449452640309 PREDICTED: transcription factor Pur-alph 0.906 0.880 0.856 1e-134
224056917317 predicted protein [Populus trichocarpa] 0.903 0.854 0.829 1e-134
255564778282 nucleic acid binding protein, putative [ 0.843 0.897 0.874 1e-132
30685174295 transcription factor Pur-alpha 1 [Arabid 0.896 0.911 0.856 1e-131
>gi|225435836|ref|XP_002283799.1| PREDICTED: transcription factor Pur-alpha 1-like [Vitis vinifera] Back     alignment and taxonomy information
 Score =  525 bits (1352), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 254/277 (91%), Positives = 266/277 (96%)

Query: 24  GGGGGNDVELVCKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLD 83
           G  GGNDVEL+CKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVP SGISWFLD
Sbjct: 3   GNSGGNDVELLCKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPFSGISWFLD 62

Query: 84  LFNYYVNSDDHELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSSR 143
           LFNYYVNSD+ ++FSKELQLD+KVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGS+R
Sbjct: 63  LFNYYVNSDEQDVFSKELQLDTKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSTR 122

Query: 144 DEGWAAFRNILAEINEASRLLILPNQQGSEQSEHLVGLSDDVGAGFISGHGSQPAPASEL 203
           DEGWAAFRNILAEINEASRL ILPNQQ SE SE LVGLSDDVGAGFISGH +QPAPASEL
Sbjct: 123 DEGWAAFRNILAEINEASRLFILPNQQSSEPSERLVGLSDDVGAGFISGHSTQPAPASEL 182

Query: 204 NVDRSVDLPAQDEIGNMGVSKVIRADQKRFFFDLGSNNRGHFLRISEVTGSDRSSIILPL 263
           NV+RSV+LPAQDEIGN+GVSKVIRADQKRFFFDLGSNNRGHFLRISEV GSDRSSIILPL
Sbjct: 183 NVERSVELPAQDEIGNLGVSKVIRADQKRFFFDLGSNNRGHFLRISEVAGSDRSSIILPL 242

Query: 264 SGLKQFHEIVGHFVEITKDRIEGMTGASVRIVDPPQR 300
           SGLKQFHE+VGHFVEITKDRIEGMTGA+VR VDPPQR
Sbjct: 243 SGLKQFHEMVGHFVEITKDRIEGMTGANVRTVDPPQR 279




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|359807554|ref|NP_001241408.1| uncharacterized protein LOC100782805 [Glycine max] gi|255640975|gb|ACU20767.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|356521624|ref|XP_003529454.1| PREDICTED: transcription factor Pur-alpha 1 isoform 1 [Glycine max] Back     alignment and taxonomy information
>gi|357475063|ref|XP_003607817.1| Transcription factor Pur-alpha [Medicago truncatula] gi|355508872|gb|AES90014.1| Transcription factor Pur-alpha [Medicago truncatula] Back     alignment and taxonomy information
>gi|147774370|emb|CAN72395.1| hypothetical protein VITISV_041199 [Vitis vinifera] Back     alignment and taxonomy information
>gi|224075952|ref|XP_002304844.1| predicted protein [Populus trichocarpa] gi|222842276|gb|EEE79823.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449452640|ref|XP_004144067.1| PREDICTED: transcription factor Pur-alpha 1-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|224056917|ref|XP_002299088.1| predicted protein [Populus trichocarpa] gi|222846346|gb|EEE83893.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255564778|ref|XP_002523383.1| nucleic acid binding protein, putative [Ricinus communis] gi|223537333|gb|EEF38962.1| nucleic acid binding protein, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|30685174|ref|NP_850182.1| transcription factor Pur-alpha 1 [Arabidopsis thaliana] gi|16930405|gb|AAL31888.1|AF419556_1 At2g32080/F22D22.17 [Arabidopsis thaliana] gi|330253538|gb|AEC08632.1| transcription factor Pur-alpha 1 [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query300
TAIR|locus:2045497296 PUR ALPHA-1 "AT2G32080" [Arabi 0.896 0.908 0.853 4.1e-120
FB|FBgn0022361275 Pur-alpha "Purine-rich binding 0.406 0.443 0.361 1e-21
WB|WBGene00004046226 plp-1 [Caenorhabditis elegans 0.363 0.482 0.338 4e-10
UNIPROTKB|Q9MYX6321 Q9MYX6 "DNA binding protein pu 0.376 0.352 0.333 1e-08
UNIPROTKB|Q96QR8312 PURB "Transcriptional activato 0.173 0.166 0.442 9.3e-14
ZFIN|ZDB-GENE-040624-8287 pura "purine-rich element bind 0.453 0.473 0.306 5.4e-09
MGI|MGI:103079321 Pura "purine rich element bind 0.376 0.352 0.333 1e-08
UNIPROTKB|E1BMW9322 PURA "Uncharacterized protein" 0.376 0.350 0.333 1e-08
UNIPROTKB|Q00577322 PURA "Transcriptional activato 0.376 0.350 0.333 1e-08
UNIPROTKB|I3LCN6322 PURA "Uncharacterized protein" 0.376 0.350 0.333 1e-08
TAIR|locus:2045497 PUR ALPHA-1 "AT2G32080" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1182 (421.1 bits), Expect = 4.1e-120, P = 4.1e-120
 Identities = 233/273 (85%), Positives = 250/273 (91%)

Query:    29 NDVELVCKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYY 88
             +DVELV KTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYY
Sbjct:    27 SDVELVSKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYY 86

Query:    89 VNSDDHELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSSRDEGWA 148
             VNS++HELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSS DEGWA
Sbjct:    87 VNSEEHELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSSPDEGWA 146

Query:   149 AFRNILAEINEASRLLILPNQ-QGSEQSEHLVGLSDDVGAGFISGHGSQPAPASELNVDR 207
             AFRNILAEI+EAS L ++PNQ + S+  EHLV   DDVGAGFI GHGSQ   +SE NVDR
Sbjct:   147 AFRNILAEIHEASGLFVMPNQVKPSDGQEHLV---DDVGAGFIPGHGSQQPSSSEHNVDR 203

Query:   208 SVDLPAQDEIGNMGVSKVIRADQKRFFFDLGSNNRGHFLRISEVTGSDRSSIILPLSGLK 267
             ++D P Q+E G  GVSKVIRADQKRFFFDLG+NNRGHFLRISEV GSDRSSIILPLSGLK
Sbjct:   204 TIDSPGQEETGMTGVSKVIRADQKRFFFDLGNNNRGHFLRISEVAGSDRSSIILPLSGLK 263

Query:   268 QFHEIVGHFVEITKDRIEGMTGASVRIVDPPQR 300
             QFHE++GHFVEITKD+IEGMTGA+VR VDPPQR
Sbjct:   264 QFHEVIGHFVEITKDKIEGMTGANVRTVDPPQR 296




GO:0003676 "nucleic acid binding" evidence=ISS
GO:0005634 "nucleus" evidence=ISM
GO:0005737 "cytoplasm" evidence=IDA
GO:0046686 "response to cadmium ion" evidence=IEP;RCA
GO:0006816 "calcium ion transport" evidence=RCA
GO:0006833 "water transport" evidence=RCA
GO:0006970 "response to osmotic stress" evidence=RCA
GO:0007030 "Golgi organization" evidence=RCA
GO:0007033 "vacuole organization" evidence=RCA
GO:0009651 "response to salt stress" evidence=RCA
GO:0009750 "response to fructose stimulus" evidence=RCA
GO:0048767 "root hair elongation" evidence=RCA
FB|FBgn0022361 Pur-alpha "Purine-rich binding protein-alpha" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
WB|WBGene00004046 plp-1 [Caenorhabditis elegans (taxid:6239)] Back     alignment and assigned GO terms
UNIPROTKB|Q9MYX6 Q9MYX6 "DNA binding protein pur-alpha" [Ovis aries (taxid:9940)] Back     alignment and assigned GO terms
UNIPROTKB|Q96QR8 PURB "Transcriptional activator protein Pur-beta" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-040624-8 pura "purine-rich element binding protein A" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
MGI|MGI:103079 Pura "purine rich element binding protein A" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|E1BMW9 PURA "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|Q00577 PURA "Transcriptional activator protein Pur-alpha" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|I3LCN6 PURA "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q9SKZ1PUR_ARATHNo assigned EC number0.85340.89660.9087yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query300
pfam04845218 pfam04845, PurA, PurA ssDNA and RNA-binding protei 5e-31
smart0071263 smart00712, PUR, DNA/RNA-binding repeats in PUR-al 7e-22
smart0071263 smart00712, PUR, DNA/RNA-binding repeats in PUR-al 1e-20
smart0071263 smart00712, PUR, DNA/RNA-binding repeats in PUR-al 2e-19
pfam04845218 pfam04845, PurA, PurA ssDNA and RNA-binding protei 1e-07
>gnl|CDD|218289 pfam04845, PurA, PurA ssDNA and RNA-binding protein Back     alignment and domain information
 Score =  115 bits (289), Expect = 5e-31
 Identities = 75/266 (28%), Positives = 116/266 (43%), Gaps = 71/266 (26%)

Query: 32  ELVCKTLQVEHKLFYFDLKENPRGRYLKISE-KTSATRSTIIVPSSGISWFLD----LFN 86
           EL  K L +++K FY D+K+N RGR++KI+E      +S I +  S  + F D       
Sbjct: 3   ELASKRLDIQNKRFYLDVKQNARGRFIKIAEVGAGGRKSRITLSMSVAAEFRDYLSKFIE 62

Query: 87  YY-----VNSDDHE------LFSKELQLDSKVFYFDIGENRRGRFLKVSEASV---SRNR 132
           +Y      N +         L S+ L  D++ +Y D+ EN+RGRFL++++  +   S  R
Sbjct: 63  HYAQLGPQNGEHLPEDGRSALKSEFLVRDNRKYYLDLKENQRGRFLRIAQTVMRGGSGQR 122

Query: 133 STIIVPAGSSRDEGWAAFRNILAEINEASRLLILPNQQGSEQSEHLVGLSDDVGAGFISG 192
             I +PA     +G   FR+ L +                        L +D G G    
Sbjct: 123 QQIALPA-----QGMIEFRDALTD------------------------LIEDYGEGDEED 153

Query: 193 HGSQPAPASELNVDRSVDLPAQDEIGNMGVSKVIRADQKRFFFDLGSNNRGHFLRISEVT 252
             +              +LP             +  D KRF+FD+GSN RG F+RISEV 
Sbjct: 154 EPA--------------ELPE---------GTSLTVDNKRFYFDVGSNKRGVFMRISEVK 190

Query: 253 GSDRSSIILPLSGLKQFHEIVGHFVE 278
            + R+SI +P     +F +I   + E
Sbjct: 191 PNYRNSITVPEKAWTRFRDIFCKYSE 216


This family represents most of the length of the protein. Length = 218

>gnl|CDD|197840 smart00712, PUR, DNA/RNA-binding repeats in PUR-alpha/beta/gamma and in hypothetical proteins from spirochetes and the Bacteroides-Cytophaga-Flexibacter bacteria Back     alignment and domain information
>gnl|CDD|197840 smart00712, PUR, DNA/RNA-binding repeats in PUR-alpha/beta/gamma and in hypothetical proteins from spirochetes and the Bacteroides-Cytophaga-Flexibacter bacteria Back     alignment and domain information
>gnl|CDD|197840 smart00712, PUR, DNA/RNA-binding repeats in PUR-alpha/beta/gamma and in hypothetical proteins from spirochetes and the Bacteroides-Cytophaga-Flexibacter bacteria Back     alignment and domain information
>gnl|CDD|218289 pfam04845, PurA, PurA ssDNA and RNA-binding protein Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 300
KOG3074263 consensus Transcriptional regulator of the PUR fam 100.0
PF04845218 PurA: PurA ssDNA and RNA-binding protein; InterPro 100.0
PF04845218 PurA: PurA ssDNA and RNA-binding protein; InterPro 100.0
KOG3074263 consensus Transcriptional regulator of the PUR fam 100.0
smart0071263 PUR DNA/RNA-binding repeats in PUR-alpha/beta/gamm 99.87
smart0071263 PUR DNA/RNA-binding repeats in PUR-alpha/beta/gamm 99.84
PF11680124 DUF3276: Protein of unknown function (DUF3276); In 97.18
PF11680124 DUF3276: Protein of unknown function (DUF3276); In 96.95
>KOG3074 consensus Transcriptional regulator of the PUR family, single-stranded-DNA-binding [Transcription] Back     alignment and domain information
Probab=100.00  E-value=9.7e-74  Score=521.46  Aligned_cols=217  Identities=49%  Similarity=0.839  Sum_probs=189.5

Q ss_pred             CCCCCCCCCCCCCCCCCccceeeeeeeeeceeEEEeccCCCCcceEEEEEecCCcceEEEeecccHHHHHHHHhhhhc--
Q 022261           13 GGSGGVTGGGAGGGGGNDVELVCKTLQVEHKLFYFDLKENPRGRYLKISEKTSATRSTIIVPSSGISWFLDLFNYYVN--   90 (300)
Q Consensus        13 ~~~~~~~~~~~~~~~~~~~eL~Sk~l~ie~KrFYlDlKeN~RGRFLKIaE~~~g~rs~I~lp~~g~~efrd~L~~y~~--   90 (300)
                      +++..++|| |||+.++++|||||+|+||+||||||||||+|||||||||++.+.+|+|+||+++++||+|+|.+|++  
T Consensus         6 ~~g~~~~gG-gGG~~~~~~eLask~l~iq~KrfYlDvkqN~rGRflKIaE~g~~~~Sri~ls~s~aaef~d~l~~f~~~y   84 (263)
T KOG3074|consen    6 GGGGAVTGG-GGGGGGEDQELASKRLQIQHKRFYLDVKQNPRGRFLKIAEVGAGGRSRIILSLSVAAEFRDILNDFIEHY   84 (263)
T ss_pred             CCCCCCcCC-CCCccchhHHHhhhhhhhhhheEEEEeccCCCcceEEEEEeccCCcceEEEehhhHHHHHHHHHHHHHHH
Confidence            334445555 44678889999999999999999999999999999999999999999999999999999999966653  


Q ss_pred             -------------CCCccccccceecccceEEEecCCCCccceeEEeeeccCC--CCce---EeecCCCCchhHHHHHHH
Q 022261           91 -------------SDDHELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSR--NRST---IIVPAGSSRDEGWAAFRN  152 (300)
Q Consensus        91 -------------~e~~~~~s~~l~vdnKrFYfDvkeN~RGrFLRISEv~~~~--~Rst---I~IPa~~~~~qGw~~FRd  152 (300)
                                   .+++.++|+.|.+|||+||||||||+|||||||+|...++  .|++   |++|+     |||++|||
T Consensus        85 a~lg~~~~~~~~~~ee~~lkSe~L~~d~RkfY~DLKEN~RGRFLRI~qt~~rg~~~~~Q~~tIalPa-----qG~aEfrd  159 (263)
T KOG3074|consen   85 AQLGPPPPELAAPSEEHELKSEELQRDNRKFYLDLKENQRGRFLRISQTPMRGRQTRTQVQTIALPA-----QGIAEFRD  159 (263)
T ss_pred             HhcCCCCcccccchhhhhHHHHHHhhcCceEEEEcccccccceEEeeecccCCCCccccccceeccc-----hhHHHHHH
Confidence                         2445688999999999999999999999999999877543  3556   99999     99999999


Q ss_pred             HHHHHHHHhhhhccCCCCCCccchhccccccccCCccccCCCCCCCCCCccccccCCCCCcccccCCcccceeEeeccce
Q 022261          153 ILAEINEASRLLILPNQQGSEQSEHLVGLSDDVGAGFISGHGSQPAPASELNVDRSVDLPAQDEIGNMGVSKVIRADQKR  232 (300)
Q Consensus       153 ~L~ei~e~~~~~~~p~~~~~~~~e~l~~l~d~~G~g~~~~~~~q~~~~~~~~~~~~~~lP~~~~~~~~~~s~~irvdnKr  232 (300)
                      +|++|.                        ++||++-.    .|             +||+         .++|+||||+
T Consensus       160 aLaell------------------------e~~G~~~~----gq-------------elpe---------g~~~tvDnk~  189 (263)
T KOG3074|consen  160 ALAELL------------------------EDFGEGDE----GQ-------------ELPE---------GTSITVDNKR  189 (263)
T ss_pred             HHHHHH------------------------HHhCCccc----cC-------------CCCC---------CCeEEEecce
Confidence            999975                        45665432    12             6787         4889999999


Q ss_pred             EEEEcCCCCceeEEEEeeecC--CCCceeEechhhHHHHHHHHHHHHHHHHhhhc
Q 022261          233 FFFDLGSNNRGHFLRISEVTG--SDRSSIILPLSGLKQFHEIVGHFVEITKDRIE  285 (300)
Q Consensus       233 FyFDlg~N~RG~FLRISEv~g--~~RssI~IP~~~~~~F~d~l~~~~d~m~~~~~  285 (300)
                      ||||+|+|+||.|||||||++  .+|++||||.++|.+|+++|++|+++|++..+
T Consensus       190 F~Fd~G~N~~G~f~riSEVk~n~~yRnsITvP~k~w~~F~~~f~~y~e~mk~~~e  244 (263)
T KOG3074|consen  190 FLFDVGKNKRGVFVRISEVKSNGSYRNSITVPFKAWKRFGEVFGKYSEEMKKISE  244 (263)
T ss_pred             eEEecCCCCccceEEEEEecCCCCcccceeeehhHhhhhhhHHHHHHHHHHHhhh
Confidence            999999999999999999997  89999999999999999999999999998755



>PF04845 PurA: PurA ssDNA and RNA-binding protein; InterPro: IPR006628 The Pur protein family consists of four known members in humans and is strongly conserved throughout evolution Back     alignment and domain information
>PF04845 PurA: PurA ssDNA and RNA-binding protein; InterPro: IPR006628 The Pur protein family consists of four known members in humans and is strongly conserved throughout evolution Back     alignment and domain information
>KOG3074 consensus Transcriptional regulator of the PUR family, single-stranded-DNA-binding [Transcription] Back     alignment and domain information
>smart00712 PUR DNA/RNA-binding repeats in PUR-alpha/beta/gamma and in hypothetical proteins from spirochetes and the Bacteroides-Cytophaga-Flexibacter bacteria Back     alignment and domain information
>smart00712 PUR DNA/RNA-binding repeats in PUR-alpha/beta/gamma and in hypothetical proteins from spirochetes and the Bacteroides-Cytophaga-Flexibacter bacteria Back     alignment and domain information
>PF11680 DUF3276: Protein of unknown function (DUF3276); InterPro: IPR021694 This bacterial family of proteins has no known function Back     alignment and domain information
>PF11680 DUF3276: Protein of unknown function (DUF3276); InterPro: IPR021694 This bacterial family of proteins has no known function Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query300
3k44_A146 Crystal Structure Of Drosophila Melanogaster Pur-Al 9e-14
3k44_A146 Crystal Structure Of Drosophila Melanogaster Pur-Al 9e-13
>pdb|3K44|A Chain A, Crystal Structure Of Drosophila Melanogaster Pur-Alpha Length = 146 Back     alignment and structure

Iteration: 1

Score = 73.9 bits (180), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 52/144 (36%), Positives = 80/144 (55%), Gaps = 22/144 (15%) Query: 32 ELVCKTLQVEHKLFYFDLKENPRGRYLKISE-KTSATRSTIIVPSSGISWFLDLF----N 86 EL K LQ++ K FY D+K+N RGR++K++E RS I + S + F D + Sbjct: 4 ELATKMLQIQSKRFYLDVKQNRRGRFIKVAEIGADGRRSQIYLALSTAAEFRDHLSSFSD 63 Query: 87 YYV-----NSD----DHELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRN--RSTI 135 YY N+D D +L S+ + D + +Y D+ EN RGRFL+VS+ +++R RS I Sbjct: 64 YYASLGPPNTDNLPEDGKLKSEMMIKDYRRYYLDLKENARGRFLRVSQ-TITRGGPRSQI 122 Query: 136 IVPAGSSRDEGWAAFRNILAEINE 159 +PA +G FR+ L ++ E Sbjct: 123 ALPA-----QGMIEFRDALTDLLE 141
>pdb|3K44|A Chain A, Crystal Structure Of Drosophila Melanogaster Pur-Alpha Length = 146 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query300
3k44_A146 Purine-rich binding protein-alpha, isoform B; PUR- 100.0
3k44_A146 Purine-rich binding protein-alpha, isoform B; PUR- 100.0
3n8b_A98 Uncharacterized protein; PUR-alpha, PUR repeat, PU 98.25
3n8b_A98 Uncharacterized protein; PUR-alpha, PUR repeat, PU 98.2
3nm7_A127 Uncharacterized protein; PUR-alpha, PUR repeat, PU 96.93
3nm7_A127 Uncharacterized protein; PUR-alpha, PUR repeat, PU 96.6
>3k44_A Purine-rich binding protein-alpha, isoform B; PUR-alpha, PUR repeat, PUR domain, whirly fold, DNA binding protein, RNA binding protein; 2.10A {Drosophila melanogaster} Back     alignment and structure
Probab=100.00  E-value=1.7e-48  Score=334.55  Aligned_cols=141  Identities=37%  Similarity=0.602  Sum_probs=126.5

Q ss_pred             CccccccceecccceEEEecCCCCccceeEEeeeccCCCCceEeecCCCCchhHHHHHHHHHHHHHHHhhhhccCCCCCC
Q 022261           93 DHELFSKELQLDSKVFYFDIGENRRGRFLKVSEASVSRNRSTIIVPAGSSRDEGWAAFRNILAEINEASRLLILPNQQGS  172 (300)
Q Consensus        93 ~~~~~s~~l~vdnKrFYfDvkeN~RGrFLRISEv~~~~~RstI~IPa~~~~~qGw~~FRd~L~ei~e~~~~~~~p~~~~~  172 (300)
                      +++++|+.|++|+|+||||||+|+|||||||||+.+++.|++|+||+     +||.+|||+|++++|+++.+.++++   
T Consensus         2 ~~eL~S~~i~ie~KrfYlDlKeN~RGrfLkIsE~~~~g~rs~I~lp~-----~~~~efrd~L~~~ie~~~~~g~~~~---   73 (146)
T 3k44_A            2 EQELATKMLQIQSKRFYLDVKQNRRGRFIKVAEIGADGRRSQIYLAL-----STAAEFRDHLSSFSDYYASLGPPNT---   73 (146)
T ss_dssp             CEEEEEEEEEETTEEEEEEEEEETTEEEEEEEEECTTCCEEEEEECH-----HHHHHHHHHHHHHHHHHHTSCSCCS---
T ss_pred             cccccceeEEEeceEEEEEecccCCeEEEEEEEecCCCcceEEEEEh-----hHHHHHHHHHHHHHHHHHhcCCCcc---
Confidence            57899999999999999999999999999999998777899999999     9999999999999998876554332   


Q ss_pred             ccchhccccccccCCccccCCCCCCCCCCccccccCCCCCcccccCCcccceeEeeccceEEEEcCCCCceeEEEEeee-
Q 022261          173 EQSEHLVGLSDDVGAGFISGHGSQPAPASELNVDRSVDLPAQDEIGNMGVSKVIRADQKRFFFDLGSNNRGHFLRISEV-  251 (300)
Q Consensus       173 ~~~e~l~~l~d~~G~g~~~~~~~q~~~~~~~~~~~~~~lP~~~~~~~~~~s~~irvdnKrFyFDlg~N~RG~FLRISEv-  251 (300)
                                                          .+||+.    +.++|++|++|||+||||||+|+||+||||||+ 
T Consensus        74 ------------------------------------~~~~~~----~~l~s~~i~~~~k~fyfDl~~N~RGrFLRIsE~~  113 (146)
T 3k44_A           74 ------------------------------------DNLPED----GKLKSEMMIKDYRRYYLDLKENARGRFLRVSQTI  113 (146)
T ss_dssp             ------------------------------------SCCCTT----CEEEEEEEEETTEEEEEEEEEETTEEEEEEEEEE
T ss_pred             ------------------------------------ccCCcc----ccccchheeccCceEEEEecccCcceEEEEEEee
Confidence                                                145553    335689999999999999999999999999999 


Q ss_pred             -cCCCCceeEechhhHHHHHHHHHHHHHHHH
Q 022261          252 -TGSDRSSIILPLSGLKQFHEIVGHFVEITK  281 (300)
Q Consensus       252 -~g~~RssI~IP~~~~~~F~d~l~~~~d~m~  281 (300)
                       ++++|++||||+++|.+|+|+|++|+|+|.
T Consensus       114 ~~~~~R~~I~iP~~g~~~Fr~~l~~~~~~~~  144 (146)
T 3k44_A          114 TRGGPRSQIALPAQGMIEFRDALTDLLEEFG  144 (146)
T ss_dssp             SSCCCCEEEEEEGGGHHHHHHHHHHHHHHHC
T ss_pred             ccCCCcceEEecHHHHHHHHHHHHHHHHHhc
Confidence             589999999999999999999999999984



>3k44_A Purine-rich binding protein-alpha, isoform B; PUR-alpha, PUR repeat, PUR domain, whirly fold, DNA binding protein, RNA binding protein; 2.10A {Drosophila melanogaster} Back     alignment and structure
>3n8b_A Uncharacterized protein; PUR-alpha, PUR repeat, PUR domain, whirly fold, RNA binding, binding, nucleic acid binding protein; HET: MSE; 1.90A {Borrelia burgdorferi} Back     alignment and structure
>3n8b_A Uncharacterized protein; PUR-alpha, PUR repeat, PUR domain, whirly fold, RNA binding, binding, nucleic acid binding protein; HET: MSE; 1.90A {Borrelia burgdorferi} Back     alignment and structure
>3nm7_A Uncharacterized protein; PUR-alpha, PUR repeat, PUR domain, PURA, whirly fold, RNA BI DNA binding, nucleic acid binding protein; 2.20A {Borrelia burgdorferi} Back     alignment and structure
>3nm7_A Uncharacterized protein; PUR-alpha, PUR repeat, PUR domain, PURA, whirly fold, RNA BI DNA binding, nucleic acid binding protein; 2.20A {Borrelia burgdorferi} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00