Citrus Sinensis ID: 013065
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 450 | ||||||
| 224136205 | 533 | f-box family protein [Populus trichocarp | 0.928 | 0.784 | 0.542 | 1e-130 | |
| 359491092 | 527 | PREDICTED: F-box/LRR-repeat protein At5g | 0.94 | 0.802 | 0.537 | 1e-128 | |
| 147863571 | 527 | hypothetical protein VITISV_019403 [Viti | 0.94 | 0.802 | 0.535 | 1e-126 | |
| 255540189 | 523 | conserved hypothetical protein [Ricinus | 0.904 | 0.778 | 0.531 | 1e-124 | |
| 224122060 | 551 | predicted protein [Populus trichocarpa] | 0.928 | 0.758 | 0.512 | 1e-120 | |
| 118488987 | 533 | unknown [Populus trichocarpa x Populus d | 0.926 | 0.782 | 0.512 | 1e-114 | |
| 224118086 | 530 | predicted protein [Populus trichocarpa] | 0.926 | 0.786 | 0.515 | 1e-114 | |
| 224115878 | 558 | predicted protein [Populus trichocarpa] | 0.913 | 0.736 | 0.486 | 1e-107 | |
| 118484799 | 342 | unknown [Populus trichocarpa] | 0.744 | 0.979 | 0.573 | 1e-103 | |
| 356516535 | 540 | PREDICTED: F-box/LRR-repeat protein At5g | 0.913 | 0.761 | 0.476 | 1e-102 |
| >gi|224136205|ref|XP_002322271.1| f-box family protein [Populus trichocarpa] gi|222869267|gb|EEF06398.1| f-box family protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 469 bits (1208), Expect = e-130, Method: Compositional matrix adjust.
Identities = 241/444 (54%), Positives = 314/444 (70%), Gaps = 26/444 (5%)
Query: 7 LVKHLGLRTPIILSVVHGVMGRDALTDEFREVKFQDHEDIGVNRGICTCGQQSGIVLTVG 66
L LG +TPII+S G+MGRDA+TDE REV +++ G + +GI+LTVG
Sbjct: 111 LATKLGFQTPIIVSCTSGIMGRDAVTDEHREVMLEEYWVDGESNPC------NGIILTVG 164
Query: 67 YLPGLKVDAIPLLRRKEIVQVPVDDTYVDVYLAPPVPMIDQFVMDIQNYTTSVSGCASPV 126
+LPGLKVDAIPL + ++ + M+D FVMDI++Y TS+SGCASPV
Sbjct: 165 FLPGLKVDAIPLFQPRKGCRAT---------------MVDNFVMDIKDYATSISGCASPV 209
Query: 127 GIIMFGKEDMDQKPIIEKLDYAMSMNTVFVGDERSRFAYRSGDDLRNVCGNPAFISDAVA 186
GIIMFG ED DQKP++EKLD+AMS +T+ +GDER++F YR+G + RN + + S AVA
Sbjct: 210 GIIMFGDEDADQKPVMEKLDHAMSSDTIIIGDERAQFLYRNGVESRNDYESSEYFSAAVA 269
Query: 187 LVFASDKDKPHAGTGEIQFHLAMSEGVSAIGPRHKAVSVR--ANHAEGSTWLTAKREGHH 244
LVFA D+DKP GTGEIQFH A+S GVSA+GPR+KAVSV+ + +TWLTA+REG H
Sbjct: 270 LVFARDRDKP-CGTGEIQFHAALSSGVSAVGPRYKAVSVKKIVSGTGHTTWLTARREGEH 328
Query: 245 VILDGEQILRHID-QLENRFPQVELYVGVTKRRKCSIGSEKSRLITTLAFHGIRGGDQEY 303
I DG++IL I+ +L N+ +LY+GVT++R+C IGS+KSR++T L FHG+ GGDQEY
Sbjct: 329 EIQDGQRILDDINNELVNQVGHPDLYIGVTEQRRCFIGSQKSRVMTFLVFHGVMGGDQEY 388
Query: 304 LYVDGVGIKTGDYFQFYQPDHNAALAACRNASENIRNLKLNSSGKGFLGRRDVANSIDRK 363
L+ DGVGI+TGDYFQFY PD +AAL++C N S+N RNL L+ S + L R V +++ K
Sbjct: 389 LFADGVGIRTGDYFQFYHPDPSAALSSCSNVSKNFRNLNLDWSSRNCLHARGVYDNVCNK 448
Query: 364 EVLGGFIFSCCGRGNSFFGGLNVDSFPFFENFPSAPLAGIFCGGEIGRGKLSMTGQESQE 423
E++GGF+FSCCGRG SFF NVDS PF +NFP P+AGIFC GEIGRG E QE
Sbjct: 449 ELVGGFVFSCCGRGESFFERCNVDSSPFLDNFPGFPMAGIFCRGEIGRGFSVFNADEGQE 508
Query: 424 ESPAERRYLHVYSTAYLVISYSAA 447
E + LHVYS YL++SY+ A
Sbjct: 509 ERTS-HCCLHVYSAVYLLVSYTPA 531
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|359491092|ref|XP_002283895.2| PREDICTED: F-box/LRR-repeat protein At5g63520-like [Vitis vinifera] gi|297734433|emb|CBI15680.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|147863571|emb|CAN79767.1| hypothetical protein VITISV_019403 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|255540189|ref|XP_002511159.1| conserved hypothetical protein [Ricinus communis] gi|223550274|gb|EEF51761.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|224122060|ref|XP_002318743.1| predicted protein [Populus trichocarpa] gi|222859416|gb|EEE96963.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|118488987|gb|ABK96301.1| unknown [Populus trichocarpa x Populus deltoides] | Back alignment and taxonomy information |
|---|
| >gi|224118086|ref|XP_002331554.1| predicted protein [Populus trichocarpa] gi|222873778|gb|EEF10909.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|224115878|ref|XP_002317147.1| predicted protein [Populus trichocarpa] gi|222860212|gb|EEE97759.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|118484799|gb|ABK94267.1| unknown [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|356516535|ref|XP_003526949.1| PREDICTED: F-box/LRR-repeat protein At5g63520-like [Glycine max] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 450 | ||||||
| UNIPROTKB|H3BTR7 | 78 | FBXO22 "F-box only protein 22" | 0.111 | 0.641 | 0.442 | 6.5e-05 |
| UNIPROTKB|H3BTR7 FBXO22 "F-box only protein 22" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
Score = 104 (41.7 bits), Expect = 6.5e-05, P = 6.5e-05
Identities = 23/52 (44%), Positives = 33/52 (63%)
Query: 368 GFIFSCCGRGNSFFGGL-NVDSFPFFENFPSAPLAGIFCGGEIGRGKLSMTG 418
GF+F+C GRG ++ NV++ F + FPS PL G F GEIG ++ +TG
Sbjct: 18 GFMFACVGRGFQYYRAKGNVEADAFRKFFPSVPLFGFFGNGEIGCDRI-VTG 68
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.321 0.139 0.414 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 450 450 0.00092 118 3 11 22 0.38 34
35 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 1
No. of states in DFA: 616 (65 KB)
Total size of DFA: 261 KB (2139 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 34.10u 0.14s 34.24t Elapsed: 00:00:01
Total cpu time: 34.10u 0.14s 34.24t Elapsed: 00:00:01
Start: Thu May 9 16:18:52 2013 End: Thu May 9 16:18:53 2013
|
|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| GSVIVG00017954001 | SubName- Full=Chromosome chr17 scaffold_16, whole genome shotgun sequence; (526 aa) | |||||||
(Vitis vinifera) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 450 | |||
| pfam10442 | 137 | pfam10442, FIST_C, FIST C domain | 6e-11 | |
| COG4398 | 389 | COG4398, COG4398, Uncharacterized protein conserve | 8e-08 |
| >gnl|CDD|220757 pfam10442, FIST_C, FIST C domain | Back alignment and domain information |
|---|
Score = 59.6 bits (145), Expect = 6e-11
Identities = 26/102 (25%), Positives = 34/102 (33%), Gaps = 20/102 (19%)
Query: 311 IKTGDYFQFYQPDHNAALAACRNASENIRNLKLNSSGKGFLGRRDVANSIDRKEVLGGFI 370
+ G+ Q D + R A E R + G +
Sbjct: 56 VPEGEELQLMLRDAEDLIEDLRRALEAARE--------------------GGRPPAGALL 95
Query: 371 FSCCGRGNSFFGGLNVDSFPFFENFPSAPLAGIFCGGEIGRG 412
FSC GRG FG + + E AP+ G F GEIG G
Sbjct: 96 FSCIGRGLLLFGEPDEELEAVREVLGDAPVIGFFTYGEIGPG 137
|
The FIST C domain is a novel sensory domain, which is present in signal transduction proteins from Bacteria, Archaea and Eukarya. Chromosomal proximity of FIST-encoding genes to those coding for proteins involved in amino acid metabolism and transport suggest that FIST domains bind small ligands, such as amino acids. Length = 137 |
| >gnl|CDD|226833 COG4398, COG4398, Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 450 | |||
| COG4398 | 389 | Uncharacterized protein conserved in bacteria [Fun | 100.0 | |
| COG3287 | 379 | Uncharacterized conserved protein [Function unknow | 99.9 | |
| PF10442 | 136 | FIST_C: FIST C domain; InterPro: IPR019494 This en | 99.77 | |
| PF08495 | 198 | FIST: FIST N domain; InterPro: IPR013702 The FIST | 99.52 |
| >COG4398 Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
Probab=100.00 E-value=8.8e-50 Score=389.80 Aligned_cols=254 Identities=23% Similarity=0.307 Sum_probs=218.1
Q ss_pred cEEEEecCC-CCChHHHHHHhhhhcCCCceeeccc-ccccccccCCCccceeCCCceecCeEEEEEecCCCCCCCCCCce
Q 013065 126 VGIIMFGKE-DMDQKPIIEKLDYAMSMNTVFVGDE-RSRFAYRSGDDLRNVCGNPAFISDAVALVFASDKDKPHAGTGEI 203 (450)
Q Consensus 126 ~gi~~f~d~-~~~~~~ll~gL~~al~~~~vI~Gg~-agd~~f~~g~~~r~~~~~~~~~~~aVaL~f~gd~~~p~a~~G~i 203 (450)
...|++.|+ ++....++++|+.++|.. .++||. +|. +.+.+.|.++..+.+.++.|++.+.| +
T Consensus 126 ~~~ilL~dp~t~~~n~li~~l~~~~Pgt-tvvGG~~Sgg---~~~G~~~Lf~~~~~~~~G~vGv~L~G-----------i 190 (389)
T COG4398 126 DLHLLLPDPYTFPSNLLIEHLNTDLPGT-TVVGGVVSGG---RRRGDTRLFRDHDVLTSGVVGVRLPG-----------I 190 (389)
T ss_pred CceEEccCCcccchHHHhhccCcCCCCc-eEEccEeecC---ccCCceEEeecCCcccCceeEEeecc-----------c
Confidence 456788998 999999999999999975 566664 444 33334445555557777899998754 4
Q ss_pred EEEEEeccCCeeeCCCeEEEEeccccccCceEEcccccCccccccChhhhhhhh-------HhhccCCCcceeEEEEecc
Q 013065 204 QFHLAMSEGVSAIGPRHKAVSVRANHAEGSTWLTAKREGHHVILDGEQILRHID-------QLENRFPQVELYVGVTKRR 276 (450)
Q Consensus 204 ~~~~~vsqGcrPiGp~~~Vt~v~~T~segNvll~l~~~g~~~eLDg~pAL~~L~-------e~~~~l~~~~L~iGva~~~ 276 (450)
+....|+|||||||.+|.|| ++++|+| .||+|+|-|..|. +.+++|.+++|++|++++|
T Consensus 191 ~l~~vVsQGCRPIGeP~iVt-----~a~~niI---------tEl~gr~PL~~Lr~ii~~lsp~er~L~~~~L~iGi~~DE 256 (389)
T COG4398 191 RLVPVVSQGCRPIGEPYIVT-----GADGNII---------TELGGRPPLQRLREIVEGLSPDERALVSHGLQIGIVVDE 256 (389)
T ss_pred eecchhccCcccCCCceEee-----ccCceeE---------eecCCCChHHHHHHHhhccChhhHHHHhcCceEEEEehh
Confidence 46689999999999999995 9999999 8889998887555 4488999999999999999
Q ss_pred cccCCCCCceeEEEEEeecCCCCCceeeEeeccCCCCCCEEEEEcCCHHHHHHHHHHHHHHHHHHhhhcCCCCccccccc
Q 013065 277 KCSIGSEKSRLITTLAFHGIRGGDQEYLYVDGVGIKTGDYFQFYQPDHNAALAACRNASENIRNLKLNSSGKGFLGRRDV 356 (450)
Q Consensus 277 ~~~~~~~~~~~vR~ll~~d~~~gs~e~~i~~g~~I~~G~~vqf~~RD~~aA~~dl~~~~~~lr~l~~~~~~k~~l~~~~~ 356 (450)
.+.....++|++|.+++.|+.+|+ |.+++-|++|+++||++||++++..|++-..+.. .+++
T Consensus 257 ~~~~~~qGDFlIR~lLG~DPs~Ga----IaIgd~Vr~G~~lQF~~RD~~as~~dL~~l~er~---~~e~----------- 318 (389)
T COG4398 257 HLAAPGQGDFLIRGLLGADPSTGA----IAIGEVVRVGATLQFQVRDAAAADKDLRLLVERA---AAEL----------- 318 (389)
T ss_pred hhcCCCCCceEeeeccccCCCCCc----eeecceeccCcEEEEEEcccccchhHHHHHHHHH---HhhC-----------
Confidence 887777899999999999999999 9999999999999999999999998777655444 2232
Q ss_pred cCccCCCceeEEEEEEeCCCCcCCCCCCccchHhHHhhCCCCceeeeecccccCCCCCccCCCCCCCCCCcccccccccc
Q 013065 357 ANSIDRKEVLGGFIFSCCGRGNSFFGGLNVDSFPFFENFPSAPLAGIFCGGEIGRGKLSMTGQESQEESPAERRYLHVYS 436 (450)
Q Consensus 357 ~~~~~~~~p~gaLlFSC~GRG~~lfg~~~~E~~~v~e~lp~vPlaGFy~~GEIgP~~~~~v~g~~~~~~~~~~~~LH~yT 436 (450)
...++|||||||+|||..|||.+|+|+++|.+.||++|++||||+||||| +++ +|+||+||
T Consensus 319 -----~~~avGaLmFsC~GRG~~m~G~p~~Ds~~~~~~~~gipl~GFF~~GEIGp-----V~g---------r~~LHG~T 379 (389)
T COG4398 319 -----PGRAVGALLFTCNGRGRRMFGVPDHDASTIEELLGGIPLAGFFAAGEIGP-----VAG---------RNALHGFT 379 (389)
T ss_pred -----CCccceeEEEEecCccccccCCCCccHHHHHHHhCCCcccceeecCcccc-----ccc---------hhhhhccc
Confidence 35789999999999999999999999999999999999999999999999 999 99999999
Q ss_pred eeEeeeecC
Q 013065 437 TAYLVISYS 445 (450)
Q Consensus 437 ~v~~l~se~ 445 (450)
+++++|.++
T Consensus 380 s~~ai~~~~ 388 (389)
T COG4398 380 ASMALFVDD 388 (389)
T ss_pred eeeEEEeec
Confidence 999999875
|
|
| >COG3287 Uncharacterized conserved protein [Function unknown] | Back alignment and domain information |
|---|
| >PF10442 FIST_C: FIST C domain; InterPro: IPR019494 This entry represents a novel sensory domain, designated FIST C (short for F-box and intracellular signal transduction, C-terminal), which is present in signal transduction proteins from bacteria, archaea and eukaryotes | Back alignment and domain information |
|---|
| >PF08495 FIST: FIST N domain; InterPro: IPR013702 The FIST N domain is a novel sensory domain, which is present in signal transduction proteins from Bacteria, Archaea and Eukarya | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00