Citrus Sinensis ID: 021572
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 310 | ||||||
| 255582681 | 320 | conserved hypothetical protein [Ricinus | 0.767 | 0.743 | 0.742 | 3e-93 | |
| 359493882 | 315 | PREDICTED: uncharacterized membrane prot | 0.812 | 0.8 | 0.689 | 2e-90 | |
| 302142819 | 662 | unnamed protein product [Vitis vinifera] | 0.812 | 0.380 | 0.689 | 2e-90 | |
| 224116968 | 273 | predicted protein [Populus trichocarpa] | 0.683 | 0.776 | 0.798 | 4e-90 | |
| 356544132 | 270 | PREDICTED: uncharacterized membrane prot | 0.658 | 0.755 | 0.833 | 2e-88 | |
| 356549616 | 344 | PREDICTED: uncharacterized membrane prot | 0.680 | 0.613 | 0.815 | 7e-88 | |
| 21592713 | 333 | unknown [Arabidopsis thaliana] | 0.654 | 0.609 | 0.753 | 3e-83 | |
| 18412044 | 333 | uncharacterized transmembrane protein [A | 0.654 | 0.609 | 0.753 | 4e-83 | |
| 297842641 | 334 | integral membrane family protein [Arabid | 0.654 | 0.607 | 0.753 | 2e-82 | |
| 42572161 | 342 | uncharacterized transmembrane protein [A | 0.654 | 0.593 | 0.721 | 6e-81 |
| >gi|255582681|ref|XP_002532119.1| conserved hypothetical protein [Ricinus communis] gi|223528199|gb|EEF30259.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 347 bits (891), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 179/241 (74%), Positives = 199/241 (82%), Gaps = 3/241 (1%)
Query: 27 KFEFPIFRNPNISNSTSNCNSSKMLLLHRANAAP---SLQVAVSEAMNLVQLSQPTWQSA 83
+F R I SN KML + P S+Q A+S AMNL+Q + PTWQSA
Sbjct: 18 QFNKSTIRLKKILTLNSNPGLPKMLPQEKQTQTPNAVSMQGALSGAMNLIQSAPPTWQSA 77
Query: 84 LLSNVVIFVLGSPILVSGLSPSGIAAAFLLGTLTWRAFGPSGFLLVATYFIIGTAATKVK 143
L++NV+IF+LGSPIL+SGLS SGI AAFLLGTLTWRAFGPSGFLLVA YF+IGTAATKVK
Sbjct: 78 LVNNVLIFILGSPILLSGLSLSGICAAFLLGTLTWRAFGPSGFLLVACYFVIGTAATKVK 137
Query: 144 MAQKEAQGVAEKRKGRRGPGSVIGSSAAGCVCAFLSIFGVGGFEFSRLWQLGFVASFCTK 203
MAQKEAQGVAEKRKGRRGPGSVIGSSAAGCVCAFLSI+GVGG FSRLW+LGFVASFCTK
Sbjct: 138 MAQKEAQGVAEKRKGRRGPGSVIGSSAAGCVCAFLSIYGVGGEAFSRLWELGFVASFCTK 197
Query: 204 LSDTVSSEIGKAYGKTTYLVTTFKVVPRGTEGAVSVEGTFAGIFASILLAWVGCLTGQVS 263
LSDTVSSEIGKAYGK TYLVTTFK+VPRGTEGAVSVEGT AG+ ASILLA +GC G++
Sbjct: 198 LSDTVSSEIGKAYGKITYLVTTFKIVPRGTEGAVSVEGTLAGLLASILLASIGCFLGEIH 257
Query: 264 L 264
+
Sbjct: 258 I 258
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|359493882|ref|XP_002283657.2| PREDICTED: uncharacterized membrane protein sll0875-like [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|302142819|emb|CBI20114.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|224116968|ref|XP_002317441.1| predicted protein [Populus trichocarpa] gi|222860506|gb|EEE98053.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|356544132|ref|XP_003540509.1| PREDICTED: uncharacterized membrane protein sll0875-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356549616|ref|XP_003543188.1| PREDICTED: uncharacterized membrane protein sll0875-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|21592713|gb|AAM64662.1| unknown [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|18412044|ref|NP_565184.1| uncharacterized transmembrane protein [Arabidopsis thaliana] gi|4836871|gb|AAD30574.1|AC007260_5 Unknown protein [Arabidopsis thaliana] gi|14334658|gb|AAK59507.1| unknown protein [Arabidopsis thaliana] gi|17104591|gb|AAL34184.1| unknown protein [Arabidopsis thaliana] gi|332198007|gb|AEE36128.1| uncharacterized transmembrane protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|297842641|ref|XP_002889202.1| integral membrane family protein [Arabidopsis lyrata subsp. lyrata] gi|297335043|gb|EFH65461.1| integral membrane family protein [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|42572161|ref|NP_974171.1| uncharacterized transmembrane protein [Arabidopsis thaliana] gi|332198008|gb|AEE36129.1| uncharacterized transmembrane protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 310 | ||||||
| TAIR|locus:2202955 | 342 | AT1G78620 [Arabidopsis thalian | 0.664 | 0.602 | 0.720 | 1.3e-77 | |
| UNIPROTKB|G4NEB5 | 376 | MGG_00802 "Uncharacterized pro | 0.2 | 0.164 | 0.375 | 3.1e-07 | |
| UNIPROTKB|A7E3R1 | 336 | TMEM19 "Uncharacterized protei | 0.583 | 0.538 | 0.275 | 1.2e-05 | |
| UNIPROTKB|Q96HH6 | 336 | TMEM19 "Transmembrane protein | 0.612 | 0.565 | 0.271 | 9.8e-05 | |
| ZFIN|ZDB-GENE-040625-78 | 322 | tmem19 "transmembrane protein | 0.593 | 0.571 | 0.26 | 0.00026 | |
| UNIPROTKB|F1PGI2 | 297 | TMEM19 "Uncharacterized protei | 0.593 | 0.619 | 0.273 | 0.00037 | |
| DICTYBASE|DDB_G0287439 | 295 | DDB_G0287439 "Transmembrane pr | 0.464 | 0.488 | 0.285 | 0.00047 | |
| UNIPROTKB|F1SH34 | 336 | TMEM19 "Uncharacterized protei | 0.535 | 0.494 | 0.284 | 0.00061 | |
| TAIR|locus:2147705 | 288 | AT5G19930 [Arabidopsis thalian | 0.470 | 0.506 | 0.288 | 0.00098 |
| TAIR|locus:2202955 AT1G78620 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 781 (280.0 bits), Expect = 1.3e-77, P = 1.3e-77
Identities = 155/215 (72%), Positives = 183/215 (85%)
Query: 58 AAPSLQVAVSEAMNLVQLSQPTWQSALLSNVVIFVLGSPILVSGLSPSGIAAAFLLGTLT 117
AA S++ ++EAM L+Q + PTW+SA+ +N++IFVLGSP+LV+GLS SGIAAAFLLGTLT
Sbjct: 64 AAASMEGVMTEAMKLIQSASPTWKSAVANNLLIFVLGSPLLVTGLSASGIAAAFLLGTLT 123
Query: 118 WRAFGPSGFLLVATYFII---------GTAATKVKMAQKEAQGVAEKRKGRRGPGSVIGS 168
WRA+G +GFLLVA YF+I GTAATKVKM QKEAQGVAEKRKGRRGP SVIGS
Sbjct: 124 WRAYGSAGFLLVAAYFVIVSAFVINLNGTAATKVKMTQKEAQGVAEKRKGRRGPRSVIGS 183
Query: 169 SAAGCVCAFLSIFGVGGFEFSRLWQLGFVASFCTKLSDTVSSEIGKAYGKTTYLVTTFKV 228
SAAGCVCAFLSI+ VGG FS+L++LGFV+SFCTK+SDTVSSEIGKAYGKTTYL TTFK+
Sbjct: 184 SAAGCVCAFLSIYQVGGAAFSQLFRLGFVSSFCTKVSDTVSSEIGKAYGKTTYLATTFKI 243
Query: 229 VPRGTEGAVSVEGTFAGIFASILLAWVGCLTGQVS 263
VPRGTEGA+S+EGT AG+ AS LA VGC GQ++
Sbjct: 244 VPRGTEGAMSLEGTLAGLLASFFLASVGCFLGQIT 278
|
|
| UNIPROTKB|G4NEB5 MGG_00802 "Uncharacterized protein" [Magnaporthe oryzae 70-15 (taxid:242507)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|A7E3R1 TMEM19 "Uncharacterized protein" [Bos taurus (taxid:9913)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q96HH6 TMEM19 "Transmembrane protein 19" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
| ZFIN|ZDB-GENE-040625-78 tmem19 "transmembrane protein 19" [Danio rerio (taxid:7955)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|F1PGI2 TMEM19 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] | Back alignment and assigned GO terms |
|---|
| DICTYBASE|DDB_G0287439 DDB_G0287439 "Transmembrane protein 19" [Dictyostelium discoideum (taxid:44689)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|F1SH34 TMEM19 "Uncharacterized protein" [Sus scrofa (taxid:9823)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2147705 AT5G19930 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 310 | |||
| pfam01940 | 224 | pfam01940, DUF92, Integral membrane protein DUF92 | 2e-59 | |
| TIGR00297 | 237 | TIGR00297, TIGR00297, TIGR00297 family protein | 5e-43 | |
| COG1836 | 247 | COG1836, COG1836, Predicted membrane protein [Func | 1e-42 |
| >gnl|CDD|216799 pfam01940, DUF92, Integral membrane protein DUF92 | Back alignment and domain information |
|---|
Score = 189 bits (482), Expect = 2e-59
Identities = 78/185 (42%), Positives = 106/185 (57%), Gaps = 5/185 (2%)
Query: 83 ALLSNVVIFVLGSPILVSGLSPSGIAAAFLLGTLTWRAFGPSGFLLVATYFIIGTAATKV 142
A+L N+++ L + L+ SG AA LLGTL W G F L+ +F++G+ ATK+
Sbjct: 1 AVLLNLLLGALA--YRLKSLTLSGALAAVLLGTLIWGLGGWRWFALLLVFFVLGSLATKL 58
Query: 143 KMAQKEAQGVAEKRKGRRGPGSVIGSSAAGCVCAFLSIFGVGGFEFSRLWQLGFVASFCT 202
+K A G+AE R GRRGP +V+G+ + A L G G L+ GFV SF
Sbjct: 59 GYKRKAALGIAEARGGRRGPENVLGNGGVAAILALLYALGPTGAS---LFLAGFVGSFAA 115
Query: 203 KLSDTVSSEIGKAYGKTTYLVTTFKVVPRGTEGAVSVEGTFAGIFASILLAWVGCLTGQV 262
L+DT +SEIGK G+T L+TT K VP GT G VS+ GT A + S+L+A V L G +
Sbjct: 116 ALADTWASEIGKLSGRTPRLITTLKPVPPGTSGGVSLLGTLASLLGSLLIALVAWLLGLI 175
Query: 263 SLIIV 267
S V
Sbjct: 176 SPPGV 180
|
Members of this family have several predicted transmembrane helices. The function of these prokaryotic proteins is unknown. Length = 224 |
| >gnl|CDD|213522 TIGR00297, TIGR00297, TIGR00297 family protein | Back alignment and domain information |
|---|
| >gnl|CDD|224749 COG1836, COG1836, Predicted membrane protein [Function unknown] | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 310 | |||
| TIGR00297 | 237 | conserved hypothetical protein TIGR00297. | 100.0 | |
| PF01940 | 226 | DUF92: Integral membrane protein DUF92; InterPro: | 100.0 | |
| COG1836 | 247 | Predicted membrane protein [Function unknown] | 100.0 | |
| KOG4491 | 323 | consensus Predicted membrane protein [Function unk | 100.0 | |
| PF01148 | 259 | CTP_transf_1: Cytidylyltransferase family; InterPr | 95.81 | |
| COG0170 | 216 | SEC59 Dolichol kinase [Lipid metabolism] | 92.62 | |
| COG0575 | 265 | CdsA CDP-diglyceride synthetase [Lipid metabolism] | 87.66 |
| >TIGR00297 conserved hypothetical protein TIGR00297 | Back alignment and domain information |
|---|
Probab=100.00 E-value=8.8e-60 Score=434.39 Aligned_cols=202 Identities=37% Similarity=0.606 Sum_probs=178.9
Q ss_pred hhHHHHHHHHHHHHHHHHHhhcCCChHHHHHHHHHHHHHHHhhchhhHHHHHHHHhhhhhhhhcchhhhhhhhhhhhccC
Q 021572 79 TWQSALLSNVVIFVLGSPILVSGLSPSGIAAAFLLGTLTWRAFGPSGFLLVATYFIIGTAATKVKMAQKEAQGVAEKRKG 158 (310)
Q Consensus 79 ~w~~a~~~n~~l~~~~~~~~~ksLT~sGa~aA~lvG~l~~~~~G~~g~llLl~FFlssS~~TK~k~~~K~~~~~~ekkgG 158 (310)
+|.++++++.+ +...+||+|+||++|+++|+++|++++...|++|+.+++.||++||++||+|+++|++++++|||+|
T Consensus 3 ~~~~a~~~~~~--l~~~~~~~~~L~~sG~~~a~~vG~~~~~~~g~~~~~~ll~Ff~~sS~~Tk~~~~~K~~~~~~e~~gg 80 (237)
T TIGR00297 3 LWLSAVILCVF--LLALAAYAPLLDPWGYGHAWIMGVIIILFAGFRGLLVILAFFFLGSAVTRYGQEEKKAAGIAEKRSG 80 (237)
T ss_pred HHHHHHHHHHH--HHHHHHHHccCCHhHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhHHHHhccHHHHHhcccccccCC
Confidence 59999999876 5566799999999999999999999988899999999999999999999999999999999999999
Q ss_pred ccCcccccccchHHHHHHHHHHHhcCCcchhHHHHHHHHHHHHHhhchhhhhhhhcccCCCceEeeCCeecCCCCCcccc
Q 021572 159 RRGPGSVIGSSAAGCVCAFLSIFGVGGFEFSRLWQLGFVASFCTKLSDTVSSEIGKAYGKTTYLVTTFKVVPRGTEGAVS 238 (310)
Q Consensus 159 ~R~~~QVlaNg~~a~l~All~~~~~~~~~~~~~~~lafvas~Aaa~ADTwASEIG~ls~~~PrLITT~K~VppGTnGGVS 238 (310)
+||++||++||+++++|+++|.+.+. .+..+.++|++++||++|||||||||++|+++||+||||||||||||||||
T Consensus 81 ~R~~~qVlaNg~~~~~~al~~~~~~~---~~~~~~~~f~~s~A~a~aDT~ASEiG~ls~~~p~lItt~k~V~~GT~GgVS 157 (237)
T TIGR00297 81 QRGPKNVWGNGLTPALFALAIAFGPE---WDLWLALGYVASVATALSDTMASEIGKAYGKNPRLITTLQRVEPGTDGAIS 157 (237)
T ss_pred CCCHHHHHHhhHHHHHHHHHHHhccc---chHHHHHHHHHHHHHHHcchHHHhhhhccCCCCeEeecCccCCCCCCCccc
Confidence 99999999999999999999877543 256778999999999999999999999999999999999999999999999
Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcchhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcC
Q 021572 239 VEGTFAGIFASILLAWVGCLTGQVSLIIVSFHLFSDFTHALMLFTSMFSYFFLYALICLFTSTLNKLFS 307 (310)
Q Consensus 239 llGtlAsllGa~~Ia~va~llg~~~~~~~~~~~~~~~g~~g~lfdSvl~~~~~~~~aGl~GSllDSll~ 307 (310)
++||+|+++|+++|+..+++++..+... +++..++|++||++||+||
T Consensus 158 ~~Gt~As~~Ga~~I~~~~~~~~~~~~~~----------------------~~~~~~aG~~Gsl~DSlLG 204 (237)
T TIGR00297 158 VEGTLAGFAGALAIALLGYLLGLISFKG----------------------ILFSTLAAFAGTNLDSLLG 204 (237)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchHH----------------------HHHHHHHHHHHHHHHHHHh
Confidence 9999999999999999988887655322 3344556666666666655
|
|
| >PF01940 DUF92: Integral membrane protein DUF92; InterPro: IPR002794 Many members of this family have no known function and are predicted to be integral membrane proteins | Back alignment and domain information |
|---|
| >COG1836 Predicted membrane protein [Function unknown] | Back alignment and domain information |
|---|
| >KOG4491 consensus Predicted membrane protein [Function unknown] | Back alignment and domain information |
|---|
| >PF01148 CTP_transf_1: Cytidylyltransferase family; InterPro: IPR000374 Phosphatidate cytidylyltransferase (2 | Back alignment and domain information |
|---|
| >COG0170 SEC59 Dolichol kinase [Lipid metabolism] | Back alignment and domain information |
|---|
| >COG0575 CdsA CDP-diglyceride synthetase [Lipid metabolism] | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 310 | |||
| 3dh4_A | 530 | Sodium/glucose cotransporter; membrane protein, sy | 86.1 |
| >3dh4_A Sodium/glucose cotransporter; membrane protein, symporter, sugar transport, SGLT, ION TRAN membrane, sodium transport, symport; HET: GAL; 2.70A {Vibrio parahaemolyticus} | Back alignment and structure |
|---|
Probab=86.10 E-value=1.5 Score=43.12 Aligned_cols=57 Identities=5% Similarity=-0.030 Sum_probs=35.8
Q ss_pred HHHHhhcCCChHHHHHHHHHHHHHHHhh-----ch-------hhHHHHHHHHhhhhhhhhcchhhhhhh
Q 021572 94 GSPILVSGLSPSGIAAAFLLGTLTWRAF-----GP-------SGFLLVATYFIIGTAATKVKMAQKEAQ 150 (310)
Q Consensus 94 ~~~~~~ksLT~sGa~aA~lvG~l~~~~~-----G~-------~g~llLl~FFlssS~~TK~k~~~K~~~ 150 (310)
..++.-|..|..|++++.++|.+++... .+ .++.+-+..++..|++||=+.+++++.
T Consensus 414 llgl~wkr~~~~gA~~g~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~v~vSl~t~~~~~~~~~~ 482 (530)
T 3dh4_A 414 LLGLFWKKTTSKGAIIGVVASIPFALFLKFMPLSMPFMDQMLYTLLFTMVVIAFTSLSTSINDDDPKGI 482 (530)
T ss_dssp HHHHHCTTCCHHHHHHHHHHHHHHHHHHHTSSSCCCHHHHHHHHHHHHHHHHHHHHHTTCSSSSCTTCC
T ss_pred HHHHHhCCCCHHHHHHHHHHHHHHHHHHHHHcCCccHHHHHHHHHHHHHHHHHHHHhhcCCCccccccc
Confidence 3555567799999999999998764321 11 112333345666788887666555443
|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00