Citrus Sinensis ID: 033411


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120
MAPVAPLQFSLAERDIQAIVDAYKEEPTNPKYAFKHLLFSVTEPQFRVKPPGVSDIMWAEAMGKLEGMDSTDRERLWPQLVQGFKDLSNRLKVPVVYFRHVAFWLEIHVHSFVCCFHVDG
ccccccccccHHHHHHHHHHHHHHccccccccccEEEEEcccccccccccccccHHHHHHHHHHHccccccccccccccccccHHHHHHHHcccEEEEEHHHHHHHHHEEEEEEEEEEcc
ccccccccccHHHHHHHHHHHHHHcccccccHEEEEEEEEccccccccccccccHHHHHHHHHHHcccccccHHHccHHHHccHHHHHHHccccEEEEEEEEEEEEEEEEEEEEEEcccc
mapvaplqfsLAERDIQAIVDAykeeptnpkyafkhllfsvtepqfrvkppgvsdIMWAEAMgklegmdstdrerLWPQLVQGFKdlsnrlkvpvVYFRHVAFWLEIHVHSFVCCFHVDG
mapvaplqfslaeRDIQAIVDAYKEEPTNPKYAFKHLLFSVTEPQFRVKPPGVSDIMWAEAMGKLEGMDSTDRERLWPQLVQGFKDLSNRLKVPVVYFRHVAFWLEIHVHSFVCCFHVDG
MAPVAPLQFSLAERDIQAIVDAYKEEPTNPKYAFKHLLFSVTEPQFRVKPPGVSDIMWAEAMGKLEGMDSTDRERLWPQLVQGFKDLSNRLKVPVVYFRHVAFWLEIHVHSFVCCFHVDG
**********LAERDIQAIVDAYKEEPTNPKYAFKHLLFSVTEPQFRVKPPGVSDIMWAEAMGKLEGM**TDRERLWPQLVQGFKDLSNRLKVPVVYFRHVAFWLEIHVHSFVCCFHV**
***********AERDIQAIVDAYKEEPTNPKYAFKHLLFSVTEPQFRVKPPGVSDIMWAEA*************RLWPQLVQGFKDLSNRLKVPVVYFRHVAFWLEIHVHSFVCCFHVDG
MAPVAPLQFSLAERDIQAIVDAYKEEPTNPKYAFKHLLFSVTEPQFRVKPPGVSDIMWAEAMGKLEGMDSTDRERLWPQLVQGFKDLSNRLKVPVVYFRHVAFWLEIHVHSFVCCFHVDG
*****PLQFSLAERDIQAIVDAYKEEPTNPKYAFKHLLFSVTEPQFRVKPPGVSDIMWAEAMGKLEGMDSTDRERLWPQLVQGFKDLSNRLKVPVVYFRHVAFWLEIHVHSFVCCFHVD*
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAPVAPLQFSLAERDIQAIVDAYKEEPTNPKYAFKHLLFSVTEPQFRVKPPGVSDIMWAEAMGKLEGMDSTDRERLWPQLVQGFKDLSNRLKVPVVYFRHVAFWLEIHVHSFVCCFHVDG
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query120
225425478 371 PREDICTED: nuclear pore complex protein 0.775 0.250 0.849 2e-42
297738423 265 unnamed protein product [Vitis vinifera] 0.775 0.350 0.849 7e-42
255547750 395 conserved hypothetical protein [Ricinus 0.775 0.235 0.860 8e-42
356566511 391 PREDICTED: uncharacterized protein LOC10 0.775 0.237 0.860 9e-42
224107623 270 predicted protein [Populus trichocarpa] 0.775 0.344 0.870 3e-41
15221725 377 uncharacterized protein [Arabidopsis tha 0.775 0.246 0.827 3e-41
297808793 263 hypothetical protein ARALYDRAFT_910872 [ 0.775 0.353 0.827 4e-41
2829886 288 Hypothetical protein [Arabidopsis thalia 0.775 0.322 0.827 9e-41
449435057 389 PREDICTED: uncharacterized protein LOC10 0.775 0.239 0.838 1e-40
297851152 376 hypothetical protein ARALYDRAFT_472914 [ 0.775 0.247 0.817 3e-40
>gi|225425478|ref|XP_002279552.1| PREDICTED: nuclear pore complex protein Nup54-like [Vitis vinifera] Back     alignment and taxonomy information
 Score =  175 bits (444), Expect = 2e-42,   Method: Composition-based stats.
 Identities = 79/93 (84%), Positives = 88/93 (94%)

Query: 1   MAPVAPLQFSLAERDIQAIVDAYKEEPTNPKYAFKHLLFSVTEPQFRVKPPGVSDIMWAE 60
           MAPVAPL FSLA+RD+QAIV+AY++EP NPKYAFKHLLFSVT+PQFRVKP G SDIMWAE
Sbjct: 107 MAPVAPLPFSLADRDVQAIVEAYRDEPGNPKYAFKHLLFSVTDPQFRVKPAGASDIMWAE 166

Query: 61  AMGKLEGMDSTDRERLWPQLVQGFKDLSNRLKV 93
           AMGKLEGM+S+DRERLWPQLVQGFKDLS RLK+
Sbjct: 167 AMGKLEGMESSDRERLWPQLVQGFKDLSQRLKL 199




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|297738423|emb|CBI27624.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|255547750|ref|XP_002514932.1| conserved hypothetical protein [Ricinus communis] gi|223545983|gb|EEF47486.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|356566511|ref|XP_003551474.1| PREDICTED: uncharacterized protein LOC100797030 [Glycine max] Back     alignment and taxonomy information
>gi|224107623|ref|XP_002314540.1| predicted protein [Populus trichocarpa] gi|222863580|gb|EEF00711.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|15221725|ref|NP_173841.1| uncharacterized protein [Arabidopsis thaliana] gi|26450407|dbj|BAC42318.1| unknown protein [Arabidopsis thaliana] gi|28827332|gb|AAO50510.1| unknown protein [Arabidopsis thaliana] gi|332192394|gb|AEE30515.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297808793|ref|XP_002872280.1| hypothetical protein ARALYDRAFT_910872 [Arabidopsis lyrata subsp. lyrata] gi|297318117|gb|EFH48539.1| hypothetical protein ARALYDRAFT_910872 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|2829886|gb|AAC00594.1| Hypothetical protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|449435057|ref|XP_004135312.1| PREDICTED: uncharacterized protein LOC101211067 [Cucumis sativus] gi|449494920|ref|XP_004159683.1| PREDICTED: uncharacterized LOC101211067 [Cucumis sativus] Back     alignment and taxonomy information
>gi|297851152|ref|XP_002893457.1| hypothetical protein ARALYDRAFT_472914 [Arabidopsis lyrata subsp. lyrata] gi|297339299|gb|EFH69716.1| hypothetical protein ARALYDRAFT_472914 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query120
TAIR|locus:2032417 377 AT1G24310 "AT1G24310" [Arabido 0.775 0.246 0.827 1.4e-39
TAIR|locus:2032417 AT1G24310 "AT1G24310" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 422 (153.6 bits), Expect = 1.4e-39, P = 1.4e-39
 Identities = 77/93 (82%), Positives = 88/93 (94%)

Query:     1 MAPVAPLQFSLAERDIQAIVDAYKEEPTNPKYAFKHLLFSVTEPQFRVKPPGVSDIMWAE 60
             MAPVAP+ +SLA+RD+QAI++AYKE+PTNPKYAF+HLLFSVTEPQ+RVKP  VSDIMWAE
Sbjct:   114 MAPVAPIPYSLADRDVQAIIEAYKEDPTNPKYAFQHLLFSVTEPQYRVKPAAVSDIMWAE 173

Query:    61 AMGKLEGMDSTDRERLWPQLVQGFKDLSNRLKV 93
             AM KLEGMDST+RERLWPQLVQGFKDLS RLK+
Sbjct:   174 AMSKLEGMDSTERERLWPQLVQGFKDLSQRLKL 206


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.325   0.139   0.448    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      120       120   0.00091  102 3  11 22  0.44    30
                                                     29  0.48    32


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  590 (63 KB)
  Total size of DFA:  141 KB (2086 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  13.15u 0.09s 13.24t   Elapsed:  00:00:01
  Total cpu time:  13.15u 0.09s 13.24t   Elapsed:  00:00:01
  Start:  Sat May 11 02:18:00 2013   End:  Sat May 11 02:18:01 2013


GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0005730 "nucleolus" evidence=IDA
GO:0005515 "protein binding" evidence=IPI
GO:0005635 "nuclear envelope" evidence=IDA
GO:0006406 "mRNA export from nucleus" evidence=RCA
GO:0006606 "protein import into nucleus" evidence=RCA
GO:0009220 "pyrimidine ribonucleotide biosynthetic process" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query120
pfam13874141 pfam13874, Nup54, Nucleoporin complex subunit 54 1e-10
>gnl|CDD|222430 pfam13874, Nup54, Nucleoporin complex subunit 54 Back     alignment and domain information
 Score = 54.6 bits (132), Expect = 1e-10
 Identities = 15/46 (32%), Positives = 23/46 (50%), Gaps = 5/46 (10%)

Query: 48 VKPPGVSDIMWAEAMGKLEGMDSTDRERLWPQLVQGFKDLSNRLKV 93
            P G+    W +A+      D+ D  RL P  V GF++L  R+K+
Sbjct: 1  NPPAGIDPEKWEQAL-----EDNPDPNRLIPVPVIGFEELKQRVKL 41


This is the human Nup54 subunit of the nucleoporin complex, equivalent to Nup57 of yeast. Nup54, Nup58 and Nup62 all have similar affinities for importin-beta. It seems likely that they are the only FG-repeat nucleoporins of the central channel, and as such they would form a zone of equal affinity spanning the central channel. The diffusion of importin-beta import complexes through the central channel may be a stochastic process as the affinities are similar, whereas movement from cytoplasmic fibrils to the central channel and from the central channel to the nuclear basket would be facilitated by the subtle differences in affinity between them. Length = 141

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 120
KOG3091 508 consensus Nuclear pore complex, p54 component (sc 99.9
PF13874141 Nup54: Nucleoporin complex subunit 54; PDB: 3T97_B 99.7
>KOG3091 consensus Nuclear pore complex, p54 component (sc Nup57) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport] Back     alignment and domain information
Probab=99.90  E-value=2.4e-24  Score=187.84  Aligned_cols=100  Identities=27%  Similarity=0.430  Sum_probs=91.7

Q ss_pred             CCCCCCcccCchHHHHHHHHHhhccCCCCCCCcceeeeccCC---CCCCcCCCCCCCHHHHHHHHhcCCCCCCCCCCCcc
Q 033411            1 MAPVAPLQFSLAERDIQAIVDAYKEEPTNPKYAFKHLLFSVT---EPQFRVKPPGVSDIMWAEAMGKLEGMDSTDRERLW   77 (120)
Q Consensus         1 ~~~~~~~~~~~~~~qi~~Ik~aW~~DP~sp~c~Fk~~fYNvv---dp~~~~kP~gvd~~~W~eAl~k~pg~dnpdp~~li   77 (120)
                      |++++|.|-+|.| +.+++.+.|  +++.|+|++.+|+||.+   |+.+|.||+|||+++|+|||     +||||++++|
T Consensus       258 ~a~~~~~p~~~~~-~~q~~~~~~--~~n~~~t~~~afv~~~~~q~e~~L~~kP~gVd~~~W~QA~-----~dnp~s~kli  329 (508)
T KOG3091|consen  258 VAGRIPAPQSLND-QVQKTLKEW--LLNTPKTRVLAFVYLSVAQTEAYLETKPAGVDQRIWRQAM-----KDNPPSNKLI  329 (508)
T ss_pred             cccCCCcchhHHH-HHHHHHHHH--hhcCCcchhhhhhccCHHHHHHHhcCCCCCcCHHHHHHHh-----hcCCCccccc
Confidence            5677888888988 888889999  99999999999999943   56678999999999999999     6999999999


Q ss_pred             ceecccHHHHHHHhhchHHHHHHHHHHHHhh
Q 033411           78 PQLVQGFKDLSNRLKVPVVYFRHVAFWLEIH  108 (120)
Q Consensus        78 PVl~~GF~dL~~R~k~Q~~~v~~~~~~l~~~  108 (120)
                      ||+|+||+||++|+|+|++++++|+.+|++-
T Consensus       330 PVpvvGF~dL~~R~K~Q~q~~~~~r~ri~~i  360 (508)
T KOG3091|consen  330 PVPVVGFEDLRQRLKVQDQEVKQHRIRINAI  360 (508)
T ss_pred             ceeccchHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            9999999999999999999999999998863



>PF13874 Nup54: Nucleoporin complex subunit 54; PDB: 3T97_B Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00