Citrus Sinensis ID: 022129
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
| GI | Length | Definition | Q cover | H cover | Identity | E-value |
|---|---|---|---|---|---|---|
| Query | 302 | | | | | |
| 224134150 | 401 | predicted protein [Populus trichocarpa] | 0.807 | 0.608 | 0.860 | 1e-121 |
| 356496677 | 400 | PREDICTED: UPF0183 protein At3g51130-lik | 0.801 | 0.605 | 0.847 | 1e-121 |
| 357483667 | 404 | hypothetical protein MTR_5g021520 [Medic | 0.801 | 0.599 | 0.847 | 1e-121 |
| 359476646 | 402 | PREDICTED: UPF0183 protein At3g51130-lik | 0.801 | 0.601 | 0.863 | 1e-121 |
| 224094913 | 398 | predicted protein [Populus trichocarpa] | 0.798 | 0.605 | 0.863 | 1e-121 |
| 356538264 | 403 | PREDICTED: UPF0183 protein At3g51130-lik | 0.801 | 0.600 | 0.842 | 1e-120 |
| 388499418 | 404 | unknown [Medicago truncatula] | 0.801 | 0.599 | 0.838 | 1e-120 |
| 21554302 | 409 | Putative UPF0183 protein [Arabidopsis th | 0.801 | 0.591 | 0.826 | 1e-119 |
| 449434186 | 398 | PREDICTED: UPF0183 protein At3g51130-lik | 0.801 | 0.608 | 0.834 | 1e-118 |
| 449491389 | 398 | PREDICTED: UPF0183 protein At3g51130-lik | 0.801 | 0.608 | 0.834 | 1e-118 |
>gi|224134150|ref|XP_002327768.1| predicted protein [Populus trichocarpa] gi|222836853|gb|EEE75246.1| predicted protein [Populus trichocarpa]
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 210/244 (86%), Positives = 223/244 (91%)
Query: 3 QSQKPRRRCEGTAMGAIVLDLRPGVGIGPFSLGMPICEAFASIEQQPNIYDVVHVKYFDE 62
QSQ+PRRRCEGTAMG I+LDLRPG GIGPFSLGMPICEAFA IEQQP+IYDVVHVKYFDE
Sbjct: 10 QSQRPRRRCEGTAMGVIILDLRPGNGIGPFSLGMPICEAFAQIEQQPSIYDVVHVKYFDE 69
Query: 63 EPLKLDIIISFPDHGFHLRFDPWSQRLRLIEIFDIKRLQMRYATSLIGGSSTLATFVAVY 122
EPLKLDI+ISFPDHGFHLRFDPWSQRLRLIEIFD+KRLQMRYATSLIGG S LATFVAVY
Sbjct: 70 EPLKLDIVISFPDHGFHLRFDPWSQRLRLIEIFDVKRLQMRYATSLIGGPSNLATFVAVY 129
Query: 123 ALFGPTFPGVYDKERSVYMLFYPGLSFAFPIPAQYADCCQDREAELPLEFPDGTTPVTCR 182
ALFGPTFPG+YDK+R VY LFYPGLSFAFPIP+QY DC REAELPLEFPDGTTPVTCR
Sbjct: 130 ALFGPTFPGIYDKDRGVYTLFYPGLSFAFPIPSQYTDCFHGREAELPLEFPDGTTPVTCR 189
Query: 183 VSIYDGSADKKVGVGSLFDKAIAPSLPVGSLYIEEVHAKLGEELHFTVGSQHIPFGASPQ 242
VSIYDGSADKKVGVGSL KA AP L G+LY+EEVH KLGEEL+F+VG QHIPFGASPQ
Sbjct: 190 VSIYDGSADKKVGVGSLMHKASAPPLLPGNLYMEEVHVKLGEELYFSVGGQHIPFGASPQ 249
Query: 243 VTFT 246
+T
Sbjct: 250 DVWT 253
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
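The Q cover, H cover and Identity values listed for GI 224134150 in the summary table can be reproduced from this alignment. The sketch below is a minimal illustration, assuming that coverage is the aligned span divided by the full sequence length and that values are truncated (not rounded) to three decimals. Both assumptions are inferred from the numbers shown here rather than from a documented formula, and the helper names `span` and `truncated_ratio` are hypothetical.

```python
def span(start, end):
    """Number of residues in an inclusive alignment range."""
    return end - start + 1


def truncated_ratio(numerator, denominator, places=3):
    """Ratio truncated (not rounded) to a fixed number of decimals.

    Integer arithmetic is used so that no floating-point error creeps in
    before the truncation step.
    """
    scale = 10 ** places
    return (numerator * scale // denominator) / scale


# Values read from the alignment above (query 3..246, subject 10..253).
query_len, hit_len = 302, 401
aligned_query = span(3, 246)
aligned_hit = span(10, 253)
identities, alignment_len = 210, 244

q_cover = truncated_ratio(aligned_query, query_len)
h_cover = truncated_ratio(aligned_hit, hit_len)
identity = truncated_ratio(identities, alignment_len)
print(q_cover, h_cover, identity)
```

Under these assumptions the three values agree with the 0.807, 0.608 and 0.860 reported in the table row for this hit.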
>gi|356496677|ref|XP_003517192.1| PREDICTED: UPF0183 protein At3g51130-like [Glycine max]
>gi|357483667|ref|XP_003612120.1| hypothetical protein MTR_5g021520 [Medicago truncatula] gi|355513455|gb|AES95078.1| hypothetical protein MTR_5g021520 [Medicago truncatula]
>gi|359476646|ref|XP_003631875.1| PREDICTED: UPF0183 protein At3g51130-like [Vitis vinifera]
>gi|224094913|ref|XP_002310289.1| predicted protein [Populus trichocarpa] gi|222853192|gb|EEE90739.1| predicted protein [Populus trichocarpa]
>gi|356538264|ref|XP_003537624.1| PREDICTED: UPF0183 protein At3g51130-like [Glycine max]
>gi|388499418|gb|AFK37775.1| unknown [Medicago truncatula]
>gi|21554302|gb|AAM63377.1| Putative UPF0183 protein [Arabidopsis thaliana]
>gi|449434186|ref|XP_004134877.1| PREDICTED: UPF0183 protein At3g51130-like [Cucumis sativus]
>gi|449491389|ref|XP_004158881.1| PREDICTED: UPF0183 protein At3g51130-like [Cucumis sativus]
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
| ID | Length | Definition | Q cover | H cover | Identity | E-value |
|---|---|---|---|---|---|---|
| Query | 302 | | | | | |
| TAIR\|locus:2080873 | 410 | AT3G51130 [Arabidopsis thalian | 0.798 | 0.587 | 0.825 | 5.3e-111 |
| DICTYBASE\|DDB_G0271880 | 519 | DDB_G0271880 "UPF0183 family p | 0.549 | 0.319 | 0.449 | 3.1e-35 |
| FB\|FBgn0035877 | 438 | CG7083 [Drosophila melanogaste | 0.655 | 0.452 | 0.328 | 5e-23 |
| WB\|WBGene00011344 | 422 | T01G9.2a [Caenorhabditis elega | 0.516 | 0.369 | 0.335 | 1.3e-18 |
| UNIPROTKB\|P34692 | 422 | T01G9.2 "UPF0183 protein T01G9 | 0.516 | 0.369 | 0.335 | 1.3e-18 |
| RGD\|621098 | 422 | RGD621098 "similar to RIKEN cD | 0.582 | 0.417 | 0.309 | 1e-17 |
| UNIPROTKB\|Q9BSU1 | 422 | C16orf70 "UPF0183 protein C16o | 0.582 | 0.417 | 0.309 | 2.8e-17 |
| MGI\|MGI:2443049 | 422 | D230025D16Rik "RIKEN cDNA D230 | 0.589 | 0.421 | 0.302 | 1.3e-15 |
| ASPGD\|ASPL0000038147 | 516 | AN10404 [Emericella nidulans ( | 0.427 | 0.25 | 0.310 | 1.3e-12 |
| UNIPROTKB\|G4MWK3 | 526 | MGG_01174 "Uncharacterized pro | 0.596 | 0.342 | 0.267 | 4.9e-10 |
TAIR|locus:2080873 AT3G51130 [Arabidopsis thaliana (taxid:3702)]
Score = 1096 (390.9 bits), Expect = 5.3e-111, P = 5.3e-111
Identities = 199/241 (82%), Positives = 216/241 (89%)
Query: 2 LQSQKPRRRCEGTAMGAIVLDLRPGVGIGPFSLGMPICEAFASIEQQPNIYDVVHVKYFD 61
L Q+PRRR EGTAMGA V DLRPGVGIGPFS+GMPICEAFA IEQQPNIYDVVHVKY+D
Sbjct: 11 LVMQRPRRRLEGTAMGATVFDLRPGVGIGPFSIGMPICEAFAQIEQQPNIYDVVHVKYYD 70
Query: 62 EEPLKLDIIISFPDHGFHLRFDPWSQRLRLIEIFDIKRLQMRYATSLIGGSSTLATFVAV 121
E+PLKLD++ISFPDHGFHLRFDPWSQRLRL+EIFD+KRLQMRYATS+IGG STLATFVAV
Sbjct: 71 EDPLKLDVVISFPDHGFHLRFDPWSQRLRLVEIFDVKRLQMRYATSMIGGPSTLATFVAV 130
Query: 122 YALFGPTFPGVYDKERSVYMLFYPGLSFAFPIPAQYADCCQDREAELPLEFPDGTTPVTC 181
YALFGPTFPG+YDKER +Y LFYPGLSF FPIP QY DCC D EA LPLEFPDGTTPVTC
Sbjct: 131 YALFGPTFPGIYDKERGIYSLFYPGLSFEFPIPNQYTDCCHDGEAALPLEFPDGTTPVTC 190
Query: 182 RVSIYDGSADKKVGVGSLFDKAIAPSLPVGSLYIEEVHAKLGEELHFTVGSQHIPFGASP 241
RVSIYD S+DKKVGVG L D+A P LP GSLY+EEVH K G+EL+FTVG QH+PFGASP
Sbjct: 191 RVSIYDNSSDKKVGVGKLMDRASVPPLPPGSLYMEEVHVKPGKELYFTVGGQHMPFGASP 250
Query: 242 Q 242
Q
Sbjct: 251 Q 251
|
|
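The Expect and P values reported for this hit are identical. Under the Poisson model commonly used for BLAST statistics, the probability of at least one chance hit is P = 1 - exp(-E), which is numerically indistinguishable from E when E is very small. The sketch below illustrates that relationship; whether this server derives P in exactly this way is an assumption, and the function name is hypothetical.

```python
import math


def p_value_from_expect(expect):
    """Probability of one or more chance hits given an E-value.

    -expm1(-E) evaluates 1 - exp(-E) without the precision loss that a
    direct subtraction would suffer for tiny E.
    """
    return -math.expm1(-expect)


# For the E-value reported above, P and E coincide to machine precision.
print(p_value_from_expect(5.3e-111))
# For a large E-value the two quantities differ noticeably.
print(p_value_from_expect(1.0))
```

The two quantities only diverge once E approaches 1, which is far from any hit listed in this report.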
DICTYBASE|DDB_G0271880 DDB_G0271880 "UPF0183 family protein" [Dictyostelium discoideum (taxid:44689)]
FB|FBgn0035877 CG7083 [Drosophila melanogaster (taxid:7227)]
WB|WBGene00011344 T01G9.2a [Caenorhabditis elegans (taxid:6239)]
UNIPROTKB|P34692 T01G9.2 "UPF0183 protein T01G9.2" [Caenorhabditis elegans (taxid:6239)]
RGD|621098 RGD621098 "similar to RIKEN cDNA D230025D16Rik" [Rattus norvegicus (taxid:10116)]
UNIPROTKB|Q9BSU1 C16orf70 "UPF0183 protein C16orf70" [Homo sapiens (taxid:9606)]
MGI|MGI:2443049 D230025D16Rik "RIKEN cDNA D230025D16 gene" [Mus musculus (taxid:10090)]
ASPGD|ASPL0000038147 AN10404 [Emericella nidulans (taxid:162425)]
UNIPROTKB|G4MWK3 MGG_01174 "Uncharacterized protein" [Magnaporthe oryzae 70-15 (taxid:242507)]
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Failed to connect to the Ezypred Server.
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
eugene3.00570121: hypothetical protein (401 aa) (Populus trichocarpa)
Sorry, there are no predicted associations at the current settings.
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
| ID | Length | Definition | E-value |
|---|---|---|---|
| Query | 302 | | |
| pfam03676 | 395 | pfam03676, UPF0183, Uncharacterized protein family | 5e-94 |
>gnl|CDD|217668 pfam03676, UPF0183, Uncharacterized protein family (UPF0183)
Score = 283 bits (725), Expect = 5e-94
Identities = 101/241 (41%), Positives = 130/241 (53%), Gaps = 35/241 (14%)
Query: 26 GVGIGPFSLGMPICEAFASIEQQPNIYDVVHVKYFDEEPLKLDIIISFPDHGFHLRFDPW 85
G G F LGMPI +A A I+QQP I V VKY D++PL +D++I+ P G LRFDP
Sbjct: 1 GNGQWEFVLGMPISQAIAIIQQQPRIIKNVQVKYSDKDPLSMDLVINLPQDGIRLRFDPV 60
Query: 86 SQRLRLIEIFDIKRLQMRYATSLIGGSSTLATFVAVYALFGPTFPGVYDKERSVYMLFYP 145
SQRL+LIE+FD+KR++++YA S L T VY FG T PGVYD +Y LF+
Sbjct: 61 SQRLKLIEVFDLKRVKLKYAGVYFNSPSVLPTIEQVYHSFGATHPGVYDASHQLYALFFR 120
Query: 146 GLSFAFPIPAQYADCCQDREAELPLEFPDGTTPVTCRVSIYDGSADKKVGVGSLFDKAIA 205
GLSF+FPI ++Y L+FPDG TPV R+SIYDGS +A
Sbjct: 121 GLSFSFPIDSKYTPGFGHGLGS--LKFPDGATPVVSRMSIYDGSN---------LAEAKV 169
Query: 206 PSLPV----GSLYIEEVHA------KLGEELHFTVG--------------SQHIPFGASP 241
P LP+ G+LY+E V LG +L ++HI FG S
Sbjct: 170 PPLPLSCYLGNLYLESVEVLRDSGGTLGLKLQLVTEGGPGVALEPRVRTFTRHIYFGDSC 229
Query: 242 Q 242
Q
Sbjct: 230 Q 230
|
This family of proteins includes Lin-10 from C. elegans. Length = 395 |
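The "Identities / Positives / Gaps" summary line of an alignment like the one above is easy to extract programmatically. The sketch below is a minimal illustration using a regular expression over that exact line; the function name `parse_stats` is hypothetical, and the observation that the printed percentages equal the integer (truncated) quotient is inferred from the numbers in this report, not from documentation.

```python
import re

# Matches fragments such as "Identities = 101/241 (41%)".
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {name: (count, total, percent)} for each statistic found."""
    stats = {}
    for name, count, total, percent in STAT.findall(line):
        stats[name] = (int(count), int(total), int(percent))
    return stats


line = "Identities = 101/241 (41%), Positives = 130/241 (53%), Gaps = 35/241 (14%)"
stats = parse_stats(line)

# Check that each printed percentage is the truncated integer quotient.
consistent = all(
    percent == 100 * count // total
    for count, total, percent in stats.values()
)
print(stats, consistent)
```

The same pattern also applies to the BLAST alignments earlier in this report, which print only the Identities and Positives fields.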
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
| ID | Length | Definition | Probability |
|---|---|---|---|
| Query | 302 | | |
| PF03676 | 394 | UPF0183: Uncharacterised protein family (UPF0183); | 100.0 |
| KOG2819 | 413 | consensus Uncharacterized conserved protein [Funct | 100.0 |
>PF03676 UPF0183: Uncharacterised protein family (UPF0183); InterPro: IPR005373 Members of this family are proteins of unknown function
Probab=100.00 E-value=1e-70 Score=535.84 Aligned_cols=220 Identities=40% Similarity=0.771 Sum_probs=209.9
Q ss_pred CCcccccccCCcHHHHHHHHhcCCCccceEEEEecCCCCCccceEEEcCCCceEEEecCCCceEEEEEEeeCCcceEEEc
Q 022129 26 GVGIGPFSLGMPICEAFASIEQQPNIYDVVHVKYFDEEPLKLDIIISFPDHGFHLRFDPWSQRLRLIEIFDIKRLQMRYA 105 (302)
Q Consensus 26 G~gLG~F~LG~sL~~vL~~Lk~~~~~f~~V~v~Ys~~~Pl~~pIvI~Lp~~GIrL~FD~~sQRLrlIEV~D~skv~L~Y~ 105 (302)
|.++|+|+|||||||||++||+++++||+|||+||+++|+++||+|+||++||||+|||++||||+|||+|+++++|+|+
T Consensus 1 ~~~~g~f~LG~~l~~vl~~lk~~~~~~~~v~~~Y~~~~P~~~~ivi~l~~~GirL~Fd~~~QrL~lIEv~d~~~i~L~Y~ 80 (394)
T PF03676_consen 1 GNSLGEFVLGMSLHQVLTILKSEPQTFPKVDLIYSDQDPLSSDIVINLPENGIRLRFDGPSQRLRLIEVYDFSKIKLRYK 80 (394)
T ss_pred CCccceEEcCCcHHHHHHHHHhccccCCceEEEECCCCCCcCCEEEEcCCCCeEEEECCCCcEEEEEEEecCccceEEeC
Confidence 67999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cccccCCCCcceeeeeccccCCCCCCCccCCCcEEEEeeCceEEEeeCCccCccccccccCCccccCCCCCCceeeEEEE
Q 022129 106 TSLIGGSSTLATFVAVYALFGPTFPGVYDKERSVYMLFYPGLSFAFPIPAQYADCCQDREAELPLEFPDGTTPVTCRVSI 185 (302)
Q Consensus 106 g~~~ssp~~~pTf~~Iy~~FGPTyPG~yd~~~~~YvLsYPGvaF~Fpi~s~~~~~~~~g~~~lsl~fpdg~tPvvsri~I 185 (302)
++.|++|++.|||++||++|||||||+||+++++|+||||||||+||+++++++.+.++.+ |++|++|++|+|+||+|
T Consensus 81 ~~~~~~p~~~pTf~~Iy~~FGPTyPG~yd~~~~~YvLsYpGlaF~Fpi~~~~~~~~~~~~~--sl~~~~~ssp~~tsmaI 158 (394)
T PF03676_consen 81 GSVFSSPEIGPTFRHIYRLFGPTYPGEYDKSRGTYVLSYPGLAFSFPIPSKFQSSYSDGLD--SLEFPSGSSPVATSMAI 158 (394)
T ss_pred cccccCcccCcchheeheccCCCCCCccCCCCCEEEEEECCEEEEeeCchhhcccccCCcc--eeecCCCCCcceeEEEE
Confidence 9999999999999999999999999999999999999999999999999999999988865 99999999999999999
Q ss_pred eeCCCCCccccCcccccCCCCCCC----CCCeeEEEEEEEe------CCeEEEEecc--------------EEEEcCCCH
Q 022129 186 YDGSADKKVGVGSLFDKAIAPSLP----VGSLYIEEVHAKL------GEELHFTVGS--------------QHIPFGASP 241 (302)
Q Consensus 186 y~gs~~~~~~~g~~~~~a~~P~LP----~g~~y~e~v~v~~------~~~l~f~~~~--------------~~I~FGdS~ 241 (302)
|+|+ +|++|++|++| +|++|+|+++|++ +.+++|..+. +.|+|||||
T Consensus 159 f~G~---------s~~ear~p~lp~~~~~~~~~~~~v~v~~~~~~~~~~~l~l~~~~g~~~~~~~~~~~~~~~I~fGdT~ 229 (394)
T PF03676_consen 159 FSGS---------SWAEARAPPLPLSCYCGNLYLESVEVLRDNKETVGLELSLVTEGGPGRIEEPRRSNFERWIRFGDTP 229 (394)
T ss_pred EcCC---------chhcccCCCccccccCCCcceeeEEeeccCCCCcCcEEEEEEcCCCcccccccccCceEEEEeCCCH
Confidence 9998 69999999888 4999999999954 4788887775 899999999
Q ss_pred HHHHhhcCCceeecc
Q 022129 242 QVTFTCIVVFFIVLQ 256 (302)
Q Consensus 242 QDVls~LG~p~~v~~ 256 (302)
|||+++||+|.+|..
T Consensus 230 qdv~~~lG~P~~~~~ 244 (394)
T PF03676_consen 230 QDVLSELGPPDRIFY 244 (394)
T ss_pred HHHHHhhCCccceee
Confidence 999999999999864
|
|
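The HHsearch hit header above packs its statistics into whitespace-separated `key=value` tokens. The sketch below is a minimal illustration of reading that line into a dictionary and applying the probability cutoff of 80 that this report uses for structure templates; the function name `parse_hh_header` is hypothetical, and converting percentages to fractions is a design choice made here, not something the report prescribes.

```python
def parse_hh_header(line):
    """Parse 'key=value' tokens from an HHsearch hit header into floats.

    Values ending in '%' are converted to fractions (40% -> 0.4).
    """
    fields = {}
    for token in line.split():
        key, _, value = token.partition("=")
        if value.endswith("%"):
            fields[key] = float(value[:-1]) / 100.0
        else:
            fields[key] = float(value)
    return fields


header = (
    "Probab=100.00 E-value=1e-70 Score=535.84 Aligned_cols=220 "
    "Identities=40% Similarity=0.771 Sum_probs=209.9"
)
hit = parse_hh_header(header)

# Same style of cutoff as the "probability above 80.00" filter in this report.
passes_cutoff = hit["Probab"] > 80.0
print(hit, passes_cutoff)
```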
>KOG2819 consensus Uncharacterized conserved protein [Function unknown]
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00