Citrus Sinensis ID: 018537
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 354 | ||||||
| 359495111 | 354 | PREDICTED: uncharacterized protein LOC10 | 0.923 | 0.923 | 0.783 | 1e-147 | |
| 359495113 | 346 | PREDICTED: uncharacterized protein LOC10 | 0.887 | 0.907 | 0.777 | 1e-140 | |
| 224125914 | 355 | predicted protein [Populus trichocarpa] | 0.898 | 0.895 | 0.741 | 1e-133 | |
| 255573521 | 372 | conserved hypothetical protein [Ricinus | 0.892 | 0.849 | 0.738 | 1e-130 | |
| 18410364 | 351 | uncharacterized protein [Arabidopsis tha | 0.983 | 0.991 | 0.664 | 1e-128 | |
| 27754453 | 350 | unknown protein [Arabidopsis thaliana] | 0.912 | 0.922 | 0.698 | 1e-127 | |
| 449439491 | 343 | PREDICTED: uncharacterized protein LOC10 | 0.946 | 0.976 | 0.666 | 1e-126 | |
| 334183893 | 357 | uncharacterized protein [Arabidopsis tha | 0.985 | 0.977 | 0.651 | 1e-126 | |
| 297839209 | 351 | hypothetical protein ARALYDRAFT_895206 [ | 0.985 | 0.994 | 0.657 | 1e-125 | |
| 11120789 | 351 | unknown protein [Arabidopsis thaliana] | 0.963 | 0.971 | 0.654 | 1e-122 |
| >gi|359495111|ref|XP_002263023.2| PREDICTED: uncharacterized protein LOC100243690 isoform 1 [Vitis vinifera] gi|296088813|emb|CBI38263.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Score = 528 bits (1359), Expect = e-147, Method: Compositional matrix adjust.
Identities = 260/332 (78%), Positives = 287/332 (86%), Gaps = 5/332 (1%)
Query: 28 TFPNY-----SSPRRRIGLSTVACTYVALSSKGQGAFDPELRSVLELATDSELYELERIL 82
T PNY SP +I L V ++ ++S GQGAFDPELR VLELATDSEL+ELERIL
Sbjct: 18 TKPNYPIFRSRSPSTKITLGLVFSSHSSISRNGQGAFDPELRPVLELATDSELFELERIL 77
Query: 83 FGPSYFSPLLKSFTKRADVDYVMIEQDLEEREDFIASLESRFLFLAADARSTLRGWRPSY 142
FGPSYFSPLLKS ++RADVDY MIE+DLEEREDFI+SLESRFLFLAADARSTLRGWRPSY
Sbjct: 78 FGPSYFSPLLKSISRRADVDYAMIEEDLEEREDFISSLESRFLFLAADARSTLRGWRPSY 137
Query: 143 RNVLLAVRKNLNIPCSSKLSTEDLEAEIFLHLLQEYASEESGVFPGSWENSEASDAQNSL 202
RNVLL VRK LN+PCSSKLSTEDLE EIFLHLLQ+Y+SEESG SWENS+AS + +L
Sbjct: 138 RNVLLGVRKKLNVPCSSKLSTEDLEVEIFLHLLQDYSSEESGALSKSWENSKASTSHGNL 197
Query: 203 ELGLSQWKVQALAAFNAGAVELKSMMLKGGGIYTLVKIYQLLMRNLSGKFFLEAANYQIK 262
E GLSQWKVQA+AA AGA EL+S++LKGG + TL KIY LL R LSGK FLEAANYQIK
Sbjct: 198 EFGLSQWKVQAVAALGAGASELRSIILKGGSMLTLGKIYHLLARRLSGKLFLEAANYQIK 257
Query: 263 KEVLKKGGQLAAINLESRAALLAAKQGFAGAATKYLGLRNMVALLGPVLWGTFLADVVIQ 322
EV+KKGGQLAAINLESRAALLAAKQGFAGAA++YLGLR+ +AL GPVLWGTFLADVVIQ
Sbjct: 258 NEVIKKGGQLAAINLESRAALLAAKQGFAGAASRYLGLRSTIALFGPVLWGTFLADVVIQ 317
Query: 323 MLGTDYARILRAIYAFAQIRITRTYRLPSDDE 354
MLGTDYARILRAIYAFAQIRITRTYRLPSD +
Sbjct: 318 MLGTDYARILRAIYAFAQIRITRTYRLPSDGD 349
|
Source: Vitis vinifera Species: Vitis vinifera Genus: Vitis Family: Vitaceae Order: Vitales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|359495113|ref|XP_003634913.1| PREDICTED: uncharacterized protein LOC100243690 isoform 2 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|224125914|ref|XP_002329748.1| predicted protein [Populus trichocarpa] gi|222870656|gb|EEF07787.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|255573521|ref|XP_002527685.1| conserved hypothetical protein [Ricinus communis] gi|223532916|gb|EEF34684.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|18410364|ref|NP_565065.1| uncharacterized protein [Arabidopsis thaliana] gi|21593462|gb|AAM65429.1| unknown [Arabidopsis thaliana] gi|124301178|gb|ABN04841.1| At1g73470 [Arabidopsis thaliana] gi|332197343|gb|AEE35464.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|27754453|gb|AAO22674.1| unknown protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|449439491|ref|XP_004137519.1| PREDICTED: uncharacterized protein LOC101204111 [Cucumis sativus] gi|449517717|ref|XP_004165891.1| PREDICTED: uncharacterized protein LOC101226231 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|334183893|ref|NP_001185391.1| uncharacterized protein [Arabidopsis thaliana] gi|332197345|gb|AEE35466.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|297839209|ref|XP_002887486.1| hypothetical protein ARALYDRAFT_895206 [Arabidopsis lyrata subsp. lyrata] gi|297333327|gb|EFH63745.1| hypothetical protein ARALYDRAFT_895206 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|11120789|gb|AAG30969.1|AC012396_5 unknown protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
No hits with e-value below 0.001 by BLAST
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 354 | |||
| COG4735 | 211 | COG4735, COG4735, Uncharacterized protein conserve | 9e-18 |
| >gnl|CDD|227078 COG4735, COG4735, Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
Score = 80.2 bits (198), Expect = 9e-18
Identities = 57/211 (27%), Positives = 90/211 (42%), Gaps = 22/211 (10%)
Query: 141 SYRNVLLAVRKNLNIPCSSKLSTEDLEAEIFLHLLQEYASEESGVFPGSWENSEASDAQN 200
YR L+ V K L +T L A FL L + + D ++
Sbjct: 12 KYREGLIQVCKYLKRIAEELQTTGGLSAANFLRDLGQLYKKIL------------CDEKD 59
Query: 201 SL------ELGLSQWKVQALAAFNAGAVELKSMMLKGGGIYTLVKIYQLLMRNLSGKFFL 254
L ELG ++ + L +E MLKGG + + + + + +
Sbjct: 60 KLKVKVQKELGTTEIEEPLLLQILRNPLEK---MLKGGKAEAVDSLDAPNVLEIEEQLQV 116
Query: 255 EAANYQIKKEVLKKGGQLAAINLESRAALLAAKQGFAGAATKYLGLRNMVALLGPVLWGT 314
A+Y + K+ L KG A+ L++ AL QG A Y +R + +L GPV W
Sbjct: 117 LIADYLVAKKALGKGSYQLAVILQNAVALQTLGQGLLRGAA-YGLVRTLFSLGGPVGWAL 175
Query: 315 FLADVVIQMLGTDYARILRAIYAFAQIRITR 345
F AD+ + +GT+Y + AI+ AQ+R+TR
Sbjct: 176 FGADLGWRAIGTNYRVTIPAIFQLAQLRLTR 206
|
Length = 211 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 354 | |||
| PRK10236 | 237 | hypothetical protein; Provisional | 100.0 | |
| COG4735 | 211 | Uncharacterized protein conserved in bacteria [Fun | 100.0 | |
| COG4735 | 211 | Uncharacterized protein conserved in bacteria [Fun | 98.14 | |
| PF03981 | 141 | Ubiq_cyt_C_chap: Ubiquinol-cytochrome C chaperone | 97.13 | |
| PF13099 | 35 | DUF3944: Domain of unknown function (DUF3944) | 96.21 |
| >PRK10236 hypothetical protein; Provisional | Back alignment and domain information |
|---|
Probab=100.00 E-value=3.6e-50 Score=374.93 Aligned_cols=214 Identities=20% Similarity=0.276 Sum_probs=168.7
Q ss_pred CChhHHHHHHhhChHHHHHHHHHhcC-----CCCCCcchhhhc-cc---CCcchhHHhhhhhHHHHHHHHHHHHHccccc
Q 018537 59 FDPELRSVLELATDSELYELERILFG-----PSYFSPLLKSFT-KR---ADVDYVMIEQDLEEREDFIASLESRFLFLAA 129 (354)
Q Consensus 59 ~~~ELr~vLelATdeEL~~L~~ILf~-----~s~fsPLlkSi~-~r---~~~Dy~~~~~~~~~R~~~i~~Ie~rfrflAa 129 (354)
.|++| +||..||||||.+|++||++ .|+++.|+++.. ++ +.+||.+ ||+.||+|||||||
T Consensus 7 ~D~DL-~fL~~cs~edL~~Lv~~Lt~d~dG~~R~te~lt~~~~yk~~~~~~~~~~~----------yw~~Ia~elq~fGg 75 (237)
T PRK10236 7 NDSDL-DFLQHCSEEQLANFARLLTHNEKGKTRLSSVLMRNELFKSMEGHPEQHRR----------NWQLIAGELQHFGG 75 (237)
T ss_pred cCccH-HHHHhCCHHHHHHHHHHHhcCCCCCEeehhhhcccHHHHhhcccchhHHH----------HHHHHHHHHHHhcc
Confidence 34555 79999999999999999999 466777776644 45 6666666 99999999999999
Q ss_pred ch-hhhhcCCCccHHHHHHHHHhhcCcCCCCCCChHHHHHHHHHHHHhhhhccccCCCCCCcccCCchhhhhHHHhhhhh
Q 018537 130 DA-RSTLRGWRPSYRNVLLAVRKNLNIPCSSKLSTEDLEAEIFLHLLQEYASEESGVFPGSWENSEASDAQNSLELGLSQ 208 (354)
Q Consensus 130 D~-~s~LRG~gpsYReVLldVckkLkV~~s~~lsT~dLEaeIflhLL~~~~~~~~~~~~~awek~~~~~e~~sl~~~l~~ 208 (354)
|+ +|+|||+||+|||||+||||||||||++++||++||+|||+|++++ +|+||+++ |+++|+..|+
T Consensus 76 nt~~n~lRG~Gv~YreIL~DVc~~LKV~y~~~~st~~iE~~il~kll~~-----------a~~kms~e-E~~~L~~~l~- 142 (237)
T PRK10236 76 DSIANKLRGHGKLYRAILLDVSKRLKLKADKEMSTFEIEQQLLEQFLRN-----------TWKKMDEE-HKQEFLHAVD- 142 (237)
T ss_pred hHHHHHHhcCCccHHHHHHHHHHHcCCCCCCCCCHHHHHHHHHHHHHHH-----------HHHHCCHH-HHHHHHHHHh-
Confidence 99 6799999999999999999999999999999999999999999999 99999999 9999999998
Q ss_pred HHHHHHHhhhcCchhhHHHHHhcchHHHHHHHHHHHHHHhHHHHHHHHhhHHHHHHHHHhhhHHHHHhHHHHHHHHHHhh
Q 018537 209 WKVQALAAFNAGAVELKSMMLKGGGIYTLVKIYQLLMRNLSGKFFLEAANYQIKKEVLKKGGQLAAINLESRAALLAAKQ 288 (354)
Q Consensus 209 ~~~~~la~~~lg~~~~~~~lLKGgsalavssi~~~l~~~ls~qla~~~A~Yq~Ak~~l~~Gg~~aa~~l~~~~al~aA~r 288 (354)
.++ + +...+.+.++..- +.+.+ .....+||++...||+ ++++++ ++
T Consensus 143 ---~~l--~--~~~~~~~~ll~~~-----~~~~~-~~gf~~y~l~~iv~~~-i~~~~l--------------------G~ 188 (237)
T PRK10236 143 ---ARV--N--ELEELLPLLMKDK-----LLAKG-VSHLLSSQLTRILRTH-AAMSVL--------------------GH 188 (237)
T ss_pred ---hhc--c--CcccchHHHHHHH-----HHHHh-hcchHHHHHHHHHHHH-HHHHHh--------------------hh
Confidence 333 1 2233555554321 22333 3344555555555553 245444 44
Q ss_pred hhhhhhhhHHHHHHHHHhHhHHHHHHHHHHHHHHhhcccchHHHHHHHHHHHHHH
Q 018537 289 GFAGAATKYLGLRNMVALLGPVLWGTFLADVVIQMLGTDYARILRAIYAFAQIRI 343 (354)
Q Consensus 289 G~~~aAarY~~~Rs~~~llGPilW~wf~aDL~~~aigT~Y~RiIpaI~alAQIRL 343 (354)
|++ |++. |+|||+|+.+.++ +..||+||+|||||++||.+|.
T Consensus 189 gL~---------r~~~-l~GPIGw~itg~w---dlagpAyRVTIPaviqIA~Lrq 230 (237)
T PRK10236 189 GLL---------RGAG-LGGPVGAALNGVK---AVSGSAYRVTIPAVLQIACLRR 230 (237)
T ss_pred hHH---------HHhh-ccCchhHHhhhhH---HhcCCeeeeecchHHHHHHHHH
Confidence 443 6665 9999999966653 5669999999999999999975
|
|
| >COG4735 Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
| >COG4735 Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
| >PF03981 Ubiq_cyt_C_chap: Ubiquinol-cytochrome C chaperone ; InterPro: IPR021150 Saccharomyces cerevisiae ubiquinol-cytochrome C chaperone is required for assembly of coenzyme QF-2-cytochrome C reductase | Back alignment and domain information |
|---|
| >PF13099 DUF3944: Domain of unknown function (DUF3944) | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 354 | |||
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 4e-08 |
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Score = 54.1 bits (129), Expect = 4e-08
Identities = 67/447 (14%), Positives = 113/447 (25%), Gaps = 140/447 (31%)
Query: 1 MAVATATFLVLQPSFSSPFAVTPSSKLTFPNYSSPRRRIGLSTVACTYVA--LSSKGQGA 58
+A+ +Q F + L N +SP + + + +S+ +
Sbjct: 166 VALDVCLSYKVQCKM--DFKI---FWLNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHS 220
Query: 59 FDPELRSVLELATDSELYELERILFGPSYFSPLL-----------KSF---------TKR 98
+ +LR S EL R+L Y + LL +F T+
Sbjct: 221 SNIKLR------IHSIQAELRRLLKSKPYENCLLVLLNVQNAKAWNAFNLSCKILLTTRF 274
Query: 99 ADV-DYV----MIEQDLEEREDFIASLESRFLF----------LAADARST--------- 134
V D++ L+ + E + L L + +T
Sbjct: 275 KQVTDFLSAATTTHISLDHHSMTLTPDEVKSLLLKYLDCRPQDLPREVLTTNPRRLSIIA 334
Query: 135 --LRGWRPSYRNVLLAVRKNLNIPCSSKLSTEDLEAEIFLHLLQEYASEESGVFPGS--- 189
+R ++ N L + S LE + + + VFP S
Sbjct: 335 ESIRDGLATWDNWKHVNCDKLTT--IIESSLNVLEPAEYRKMFDRLS-----VFPPSAHI 387
Query: 190 --------WENSEASDAQNSLELGLSQWKVQALAAFNAGAVEL--KSMMLKGG--GIYTL 237
W + SD + +L S++ K ++
Sbjct: 388 PTILLSLIWFDVIKSDVMVVVN-------------------KLHKYSLVEKQPKESTISI 428
Query: 238 VKIYQLLMRNLSGKFFLEA---ANYQIKKEVLKK---------------GGQLAAINLES 279
IY L L ++ L +Y I K G L I
Sbjct: 429 PSIYLELKVKLENEYALHRSIVDHYNIPKTFDSDDLIPPYLDQYFYSHIGHHLKNIEHPE 488
Query: 280 RAALLAAKQGFAGAATKYLGLR--------NMVALLGPVLWGTFLADVV-----IQMLGT 326
R L +L R + A L + I
Sbjct: 489 RMTLF---------RMVFLDFRFLEQKIRHDSTAWNASGSILNTLQQLKFYKPYICDNDP 539
Query: 327 DYARILRAIYAFAQIRITRTYRLPSDD 353
Y R++ AI F D
Sbjct: 540 KYERLVNAILDFLPKIEENLICSKYTD 566
|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 354 | |||
| 2jx3_A | 131 | Protein DEK; alpha helix, SAF/SAP motif, DNA bindi | 81.82 |
| >2jx3_A Protein DEK; alpha helix, SAF/SAP motif, DNA binding, chromosomal rearrangement, DNA-binding, nucleus, phosphorylation, proto oncogene; NMR {Homo sapiens} | Back alignment and structure |
|---|
Probab=81.82 E-value=4.5 Score=34.45 Aligned_cols=94 Identities=15% Similarity=0.167 Sum_probs=56.7
Q ss_pred ChhHHHHHHhhChHHHHHHHHHhcCCCCCCcc--hhhhcccCCcchhHHhhhhhHHHHHHHHHHHHHcccccchhhhhcC
Q 018537 60 DPELRSVLELATDSELYELERILFGPSYFSPL--LKSFTKRADVDYVMIEQDLEEREDFIASLESRFLFLAADARSTLRG 137 (354)
Q Consensus 60 ~~ELr~vLelATdeEL~~L~~ILf~~s~fsPL--lkSi~~r~~~Dy~~~~~~~~~R~~~i~~Ie~rfrflAaD~~s~LRG 137 (354)
.|.....+...+-++|..|.+|||++. .+.- -++|..=.+-.+ +.++.+++.-.+.|+.
T Consensus 14 Ip~V~~~i~k~k~d~Lk~LH~ilfgr~-Gk~~~~KknI~~FsGF~~---~~~~~~~~k~~e~l~K--------------- 74 (131)
T 2jx3_A 14 IERIHFFLSKKKTDELRNLHKLLYNRP-GTVSSLKKNVGQFSGFPF---EKGSVQYKKKEEMLKK--------------- 74 (131)
T ss_dssp CHHHHHHHHTCCHHHHHHHHHHTSSSC-CCSHHHHHHHHTSSCSSC---CCSSHHHHHHHHHHHH---------------
T ss_pred cHHHHHHHcccCHHHHHHHHHHHcCCC-ChHhHHHHHHHhcCCCCC---CccHHHHHHHHHHHHc---------------
Confidence 466677788888899999999999975 2221 122221111111 1223333333233321
Q ss_pred CCccHHHHHHHHHhhcCcCCCCCCChHHHHHHHHHHHHhh
Q 018537 138 WRPSYRNVLLAVRKNLNIPCSSKLSTEDLEAEIFLHLLQE 177 (354)
Q Consensus 138 ~gpsYReVLldVckkLkV~~s~~lsT~dLEaeIflhLL~~ 177 (354)
...+-|.++|+=|+|+++. +.++|-+.|+..|..=
T Consensus 75 ---~~~~~L~~~c~iL~l~~~g--~keelv~ril~FL~~P 109 (131)
T 2jx3_A 75 ---FRNAMLKSICEVLDLERSG--VNSELVKRILNFLMHP 109 (131)
T ss_dssp ---HHHHHHHHHHHTTTCCSCS--CHHHHHHHHHHTTTSC
T ss_pred ---cCHHHHHHHHHHHCCCCCC--cHHHHHHHHHHHHhCc
Confidence 2356789999999999985 4678877777666543
|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00