Citrus Sinensis ID: 022850
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 291 | ||||||
| 357520997 | 265 | hypothetical protein MTR_8g103420 [Medic | 0.903 | 0.992 | 0.781 | 1e-126 | |
| 388492226 | 265 | unknown [Medicago truncatula] | 0.903 | 0.992 | 0.778 | 1e-125 | |
| 217073912 | 265 | unknown [Medicago truncatula] | 0.903 | 0.992 | 0.774 | 1e-125 | |
| 224143553 | 264 | predicted protein [Populus trichocarpa] | 0.890 | 0.981 | 0.793 | 1e-121 | |
| 225445216 | 266 | PREDICTED: uncharacterized protein LOC10 | 0.896 | 0.981 | 0.799 | 1e-119 | |
| 449443151 | 265 | PREDICTED: uncharacterized protein LOC10 | 0.896 | 0.984 | 0.778 | 1e-118 | |
| 357520991 | 270 | Cov1 [Medicago truncatula] gi|355524806| | 0.896 | 0.966 | 0.761 | 1e-118 | |
| 224119792 | 254 | predicted protein [Populus trichocarpa] | 0.848 | 0.972 | 0.819 | 1e-118 | |
| 388517155 | 270 | unknown [Medicago truncatula] | 0.896 | 0.966 | 0.750 | 1e-116 | |
| 449464664 | 267 | PREDICTED: uncharacterized protein LOC10 | 0.903 | 0.985 | 0.744 | 1e-115 |
| >gi|357520997|ref|XP_003630787.1| hypothetical protein MTR_8g103420 [Medicago truncatula] gi|355524809|gb|AET05263.1| hypothetical protein MTR_8g103420 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
Score = 456 bits (1172), Expect = e-126, Method: Compositional matrix adjust.
Identities = 229/293 (78%), Positives = 247/293 (84%), Gaps = 30/293 (10%)
Query: 1 MGDDKSTIVMASRERDRDRELLIPVADSGDES--SKPSSSSSSSHHAGRETFSKVVRSWA 58
M DDKS + RDRDRELLIPVA+SGD++ S+P +SSSS HH+GRETFSKVV+SWA
Sbjct: 1 MVDDKSPSTVIG-NRDRDRELLIPVANSGDDTDASRPLASSSSMHHSGRETFSKVVQSWA 59
Query: 59 SKKFMTGCVILFPIAVTFYITWWFIHFVDGFFSPIYAQLGIDIFGLGFITSVTFIFLIGV 118
SKKFMTGCVIL PIA+TFYITWWFIHFVDGFFSPIYAQLGIDIFGLGFITS+TFIFL+GV
Sbjct: 60 SKKFMTGCVILLPIAITFYITWWFIHFVDGFFSPIYAQLGIDIFGLGFITSITFIFLVGV 119
Query: 119 FMSSWLGASVLSLGEWFIKRMPFVRHIYNASKQISAAISPGMYSVFNLPYYISVGRQFIF 178
FMSSWLGASVL LGEWFIKRMP VRHIYNASKQISAAISP
Sbjct: 120 FMSSWLGASVLGLGEWFIKRMPLVRHIYNASKQISAAISP-------------------- 159
Query: 179 CITFVITDQNTQAFKEVAIIRHPRIGEYAFGFITSSVVLQNYSGEEELCCVYVPTNHLYI 238
DQN+QAFKEVAIIRHPR+GEYAFGFITSSVVLQ+YSG+EELCCVYVPTNHLYI
Sbjct: 160 -------DQNSQAFKEVAIIRHPRVGEYAFGFITSSVVLQSYSGDEELCCVYVPTNHLYI 212
Query: 239 GDIFLINTKDVIRPNLSVREGIEIVVSGGMSMPQILSTLETRMPLDGSRPDRR 291
GDIFL+NTKDVIRP LSVREGIEIVVSGGMSMPQILSTLE+R+P SRPDRR
Sbjct: 213 GDIFLVNTKDVIRPTLSVREGIEIVVSGGMSMPQILSTLESRVPAQISRPDRR 265
|
Source: Medicago truncatula Species: Medicago truncatula Genus: Medicago Family: Fabaceae Order: Fabales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|388492226|gb|AFK34179.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|217073912|gb|ACJ85316.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|224143553|ref|XP_002324995.1| predicted protein [Populus trichocarpa] gi|222866429|gb|EEF03560.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|225445216|ref|XP_002284354.1| PREDICTED: uncharacterized protein LOC100262854 [Vitis vinifera] gi|297738813|emb|CBI28058.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|449443151|ref|XP_004139344.1| PREDICTED: uncharacterized protein LOC101208869 [Cucumis sativus] gi|449516298|ref|XP_004165184.1| PREDICTED: uncharacterized protein LOC101229165 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|357520991|ref|XP_003630784.1| Cov1 [Medicago truncatula] gi|355524806|gb|AET05260.1| Cov1 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|224119792|ref|XP_002331162.1| predicted protein [Populus trichocarpa] gi|222873245|gb|EEF10376.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|388517155|gb|AFK46639.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|449464664|ref|XP_004150049.1| PREDICTED: uncharacterized protein LOC101222032 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 291 | ||||||
| TAIR|locus:2061668 | 268 | COV1 "AT2G20120" [Arabidopsis | 0.683 | 0.742 | 0.613 | 1.6e-61 | |
| TAIR|locus:2061654 | 256 | LCV1 "AT2G20130" [Arabidopsis | 0.652 | 0.742 | 0.613 | 2.8e-57 | |
| TAIR|locus:2018640 | 261 | LCV2 "AT1G43130" [Arabidopsis | 0.364 | 0.406 | 0.754 | 1.1e-41 | |
| UNIPROTKB|Q3AEZ4 | 211 | CHY_0429 "Putative uncharacter | 0.357 | 0.492 | 0.321 | 1.1e-24 | |
| TIGR_CMR|CHY_0429 | 211 | CHY_0429 "conserved hypothetic | 0.357 | 0.492 | 0.321 | 1.1e-24 | |
| UNIPROTKB|Q0C0Q5 | 237 | HNE_1988 "Putative uncharacter | 0.333 | 0.409 | 0.288 | 8.1e-15 | |
| UNIPROTKB|Q5LRR3 | 224 | SPO2062 "Uncharacterized prote | 0.426 | 0.553 | 0.297 | 2.4e-12 | |
| TIGR_CMR|SPO_2062 | 224 | SPO_2062 "conserved hypothetic | 0.426 | 0.553 | 0.297 | 2.4e-12 | |
| UNIPROTKB|Q3Z7T7 | 214 | DET0989 "Putative uncharacteri | 0.518 | 0.705 | 0.203 | 8.8e-10 | |
| TIGR_CMR|DET_0989 | 214 | DET_0989 "conserved hypothetic | 0.518 | 0.705 | 0.203 | 8.8e-10 |
| TAIR|locus:2061668 COV1 "AT2G20120" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 629 (226.5 bits), Expect = 1.6e-61, P = 1.6e-61
Identities = 130/212 (61%), Positives = 147/212 (69%)
Query: 1 MGDDKSTIVMASRERDRDRELLIPVADSGDEX-------XXXXXXXXXXHHAGRETFSKV 53
MGD+K IVMA+RERDR ELLIPVADSGD+ H + ET S
Sbjct: 1 MGDEKPVIVMANRERDR--ELLIPVADSGDKDDGSSSKPSSSSSASSSSHQSSHETLSLF 58
Query: 54 VRSWASKKFMTGCVILFPIAVTFYITWWFIHFVDGFFSPIYAQLGIDIFGLGFITSVTFI 113
+R WASKKFMTGCVIL PIA+TFYITWWFIHFVDGFFSPIYAQLGI++FG GF+TS+ FI
Sbjct: 59 IRGWASKKFMTGCVILLPIAITFYITWWFIHFVDGFFSPIYAQLGINVFGFGFLTSIAFI 118
Query: 114 FLIGVFMSSWLGASVLSLGEWFIKRMPFVRHIYNASKQISAAISPGMYS-VFNLPYYIS- 171
FL+GVFMSSWLGASVL+LGEWFIKRMPFVRHIYNASKQIS AISP + F I
Sbjct: 119 FLVGVFMSSWLGASVLNLGEWFIKRMPFVRHIYNASKQISTAISPDQNTQAFKEVAIIRH 178
Query: 172 --VGRQFIFCITFVITDQNTQAFKEVAIIRHP 201
VG IT + QN +E+ + P
Sbjct: 179 PRVGEYAFGFITSTVVLQNYPTEEELCCVYVP 210
|
|
| TAIR|locus:2061654 LCV1 "AT2G20130" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2018640 LCV2 "AT1G43130" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q3AEZ4 CHY_0429 "Putative uncharacterized protein" [Carboxydothermus hydrogenoformans Z-2901 (taxid:246194)] | Back alignment and assigned GO terms |
|---|
| TIGR_CMR|CHY_0429 CHY_0429 "conserved hypothetical protein" [Carboxydothermus hydrogenoformans Z-2901 (taxid:246194)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q0C0Q5 HNE_1988 "Putative uncharacterized protein" [Hyphomonas neptunium ATCC 15444 (taxid:228405)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q5LRR3 SPO2062 "Uncharacterized protein" [Ruegeria pomeroyi DSS-3 (taxid:246200)] | Back alignment and assigned GO terms |
|---|
| TIGR_CMR|SPO_2062 SPO_2062 "conserved hypothetical protein" [Ruegeria pomeroyi DSS-3 (taxid:246200)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q3Z7T7 DET0989 "Putative uncharacterized protein" [Dehalococcoides ethenogenes 195 (taxid:243164)] | Back alignment and assigned GO terms |
|---|
| TIGR_CMR|DET_0989 DET_0989 "conserved hypothetical protein" [Dehalococcoides ethenogenes 195 (taxid:243164)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| grail3.0020005701 | hypothetical protein (265 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 291 | |||
| COG2928 | 222 | COG2928, COG2928, Uncharacterized conserved protei | 2e-26 | |
| pfam04367 | 108 | pfam04367, DUF502, Protein of unknown function (DU | 1e-23 |
| >gnl|CDD|225480 COG2928, COG2928, Uncharacterized conserved protein [Function unknown] | Back alignment and domain information |
|---|
Score = 103 bits (258), Expect = 2e-26
Identities = 59/230 (25%), Positives = 106/230 (46%), Gaps = 39/230 (16%)
Query: 60 KKFMTGCVILFPIAVTFYITWWFIHFVDGFFSP-------IYAQLGIDIFGLGFITSVTF 112
K F+TG ++L P+A+T ++ W +D F P +I GLG I ++
Sbjct: 8 KYFLTGLLVLLPLAITLWVVSWIFGLLDQFVGPLLPDRLRPAVYFPFNIPGLGVILAIIL 67
Query: 113 IFLIGVFMSSWLGASVLSLGEWFIKRMPFVRHIYNASKQISAAISPGMYSVFNLPYYISV 172
IFL+G + +G S+LSLG+ ++R+P V+ IY ++KQ+
Sbjct: 68 IFLLGFLARNMIGRSLLSLGDSLLRRIPLVKSIYKSAKQVVET----------------- 110
Query: 173 GRQFIFCITFVITDQNTQAFKEVAIIRHPRIGEYAFGFITS--SVVLQNYSGEEELCCVY 230
+ + +FK+V ++ PR G +A F+T + L+ G + V+
Sbjct: 111 -----------LLSDQSGSFKQVVLVEFPRRGIWAIAFVTGEKAGELKEKEG-RPMVAVF 158
Query: 231 VPTNHLYIGDIFLINTKDVIRP-NLSVREGIEIVVSGGMSMPQILSTLET 279
VPT L+ K+ I P +++V + ++ ++SGG+ P L +
Sbjct: 159 VPTTPNPTSGFLLLVPKEDIVPLDMTVEDALKYIISGGVVAPDSLPAKDL 208
|
Length = 222 |
| >gnl|CDD|218047 pfam04367, DUF502, Protein of unknown function (DUF502) | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 291 | |||
| COG2928 | 222 | Uncharacterized conserved protein [Function unknow | 100.0 | |
| PF04367 | 108 | DUF502: Protein of unknown function (DUF502); Inte | 99.96 | |
| PRK15350 | 88 | type III secretion system protein SsaS; Provisiona | 89.29 | |
| TIGR01402 | 88 | fliQ flagellar biosynthetic protein FliQ. This mod | 88.6 | |
| PRK05700 | 89 | fliQ flagellar biosynthesis protein FliQ; Validate | 88.36 | |
| PRK06010 | 88 | fliQ flagellar biosynthesis protein FliQ; Reviewed | 87.4 | |
| TIGR02120 | 399 | GspF general secretion pathway protein F. This mem | 86.78 | |
| TIGR01403 | 81 | fliQ_rel_III type III secretion protein, HrpO fami | 86.75 | |
| PRK12781 | 88 | fliQ flagellar biosynthesis protein FliQ; Reviewed | 86.3 | |
| cd01277 | 103 | HINT_subgroup HINT (histidine triad nucleotide-bin | 86.18 | |
| COG1987 | 89 | FliQ Flagellar biosynthesis pathway, component Fli | 82.62 | |
| cd01275 | 126 | FHIT FHIT (fragile histidine family): FHIT protein | 82.34 | |
| COG1459 | 397 | PulF Type II secretory pathway, component PulF [Ce | 82.02 | |
| PRK15333 | 86 | type III secretion system protein SpaQ; Provisiona | 81.53 | |
| cd00468 | 86 | HIT_like HIT family: HIT (Histidine triad) protein | 81.37 |
| >COG2928 Uncharacterized conserved protein [Function unknown] | Back alignment and domain information |
|---|
Probab=100.00 E-value=6.9e-52 Score=373.82 Aligned_cols=193 Identities=30% Similarity=0.655 Sum_probs=174.5
Q ss_pred HHHHHHhhhhhHhHHHHHHHHHHHHHHHHHHHHHhHhhh-------hhhccchhhhHHHHHHHHHHHHHHHHHHhhHHHH
Q 022850 56 SWASKKFMTGCVILFPIAVTFYITWWFIHFVDGFFSPIY-------AQLGIDIFGLGFITSVTFIFLIGVFMSSWLGASV 128 (291)
Q Consensus 56 ~~l~~~FltGLlvLlPIaiTi~Il~wl~~~vd~~~~pi~-------~~lg~~ipGlG~l~~i~lIfliG~l~~~~lg~~l 128 (291)
..++|+|+|||++++|+++|+|+++|+++++|+++.|.+ .+++.+++|+|+++.+++++++|+++++.+|+++
T Consensus 4 ~~lk~~fltGLlvllPlaiT~~vv~~i~~~l~~~~~~~lp~~~~~~~~~~~~i~~lg~il~iili~l~G~l~~~~ig~~l 83 (222)
T COG2928 4 KRLKKYFLTGLLVLLPLAITLWVVSWIFGLLDQFVGPLLPDRLRPAVYFPFNIPGLGVILAIILIFLLGFLARNMIGRSL 83 (222)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccchhchhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhHH
Confidence 345688999999999999999999999999999999854 2346779999999999999999999999999999
Q ss_pred HHHHHHhhccccchhHHHHHHHHHHHHhcCCcccccccccccccccceeeeeeeeeecccccccceEEEEEcCCCCeeEE
Q 022850 129 LSLGEWFIKRMPFVRHIYNASKQISAAISPGMYSVFNLPYYISVGRQFIFCITFVITDQNTQAFKEVAIIRHPRIGEYAF 208 (291)
Q Consensus 129 ~~~~E~~l~rIPlVk~IYsaiKqi~~~~~~~~~~~~~~~~~~~~g~~~kf~~~~~v~~~~~~~fk~VVlV~~P~~g~~~i 208 (291)
++++|++|.|||++|+||+++||+++++.+ +++.+||+||+|||||+|+|++
T Consensus 84 ~~~~d~~L~RiPlv~~IY~s~kqi~etll~----------------------------~~~~sfk~vvlVefP~~G~~~i 135 (222)
T COG2928 84 LSLGDSLLRRIPLVKSIYKSAKQVVETLLS----------------------------DQSGSFKQVVLVEFPRRGIWAI 135 (222)
T ss_pred HHHHHHHHccCccHHHHHHHHHHHHHHHHh----------------------------cCCccceeeEEEECCCCCcEEE
Confidence 999999999999999999999999999975 3455999999999999999999
Q ss_pred EEeccccc--ccccCCCcceEEEEecCCCC-CceeEEEEeCCCeecCCCCHHHHHHHHhhccccCCCccccc
Q 022850 209 GFITSSVV--LQNYSGEEELCCVYVPTNHL-YIGDIFLINTKDVIRPNLSVREGIEIVVSGGMSMPQILSTL 277 (291)
Q Consensus 209 GFvT~~~~--~~~~~~~e~~v~VfVPtsp~-~sG~l~lV~~~dV~~lDmSveeAmk~viSgGv~~p~~~~~~ 277 (291)
||+|++.. .+. ..++++++||+||+|| |+||+.+||+||++++|||+|||+|+++|||+..|+.....
T Consensus 136 ~fvtg~~~~e~~~-~~~~~~v~VfvPTTPNPTsGfl~~Vpkedi~~ldmtvEda~k~iIS~G~v~p~~~~~~ 206 (222)
T COG2928 136 AFVTGEKAGELKE-KEGRPMVAVFVPTTPNPTSGFLLLVPKEDIVPLDMTVEDALKYIISGGVVAPDSLPAK 206 (222)
T ss_pred EEeccCCCcchhc-ccCCceEEEEcCCCCCCCcceEEEEEHHHceeccccHHHHhheeeeccccCCCccccc
Confidence 99999953 333 3357999999999996 79999999999999999999999999999999999855443
|
|
| >PF04367 DUF502: Protein of unknown function (DUF502); InterPro: IPR007462 This entry contains proteins that are predicted to be integral membrane proteins | Back alignment and domain information |
|---|
| >PRK15350 type III secretion system protein SsaS; Provisional | Back alignment and domain information |
|---|
| >TIGR01402 fliQ flagellar biosynthetic protein FliQ | Back alignment and domain information |
|---|
| >PRK05700 fliQ flagellar biosynthesis protein FliQ; Validated | Back alignment and domain information |
|---|
| >PRK06010 fliQ flagellar biosynthesis protein FliQ; Reviewed | Back alignment and domain information |
|---|
| >TIGR02120 GspF general secretion pathway protein F | Back alignment and domain information |
|---|
| >TIGR01403 fliQ_rel_III type III secretion protein, HrpO family | Back alignment and domain information |
|---|
| >PRK12781 fliQ flagellar biosynthesis protein FliQ; Reviewed | Back alignment and domain information |
|---|
| >cd01277 HINT_subgroup HINT (histidine triad nucleotide-binding protein) subgroup: Members of this CD belong to the superfamily of histidine triad hydrolases that act on alpha-phosphate of ribonucleotides | Back alignment and domain information |
|---|
| >COG1987 FliQ Flagellar biosynthesis pathway, component FliQ [Cell motility and secretion / Intracellular trafficking and secretion] | Back alignment and domain information |
|---|
| >cd01275 FHIT FHIT (fragile histidine family): FHIT proteins, related to the HIT family carry a motif HxHxH/Qxx (x, is a hydrophobic amino acid), On the basis of sequence, substrate specificity, structure, evolution and mechanism, HIT proteins are classified into three branches: the Hint branch, which consists of adenosine 5' -monophosphoramide hydrolases, the Fhit branch, that consists of diadenosine polyphosphate hydrolases, and the GalT branch consisting of specific nucloside monophosphate transferases | Back alignment and domain information |
|---|
| >COG1459 PulF Type II secretory pathway, component PulF [Cell motility and secretion / Intracellular trafficking and secretion] | Back alignment and domain information |
|---|
| >PRK15333 type III secretion system protein SpaQ; Provisional | Back alignment and domain information |
|---|
| >cd00468 HIT_like HIT family: HIT (Histidine triad) proteins, named for a motif related to the sequence HxHxH/Qxx (x, a hydrophobic amino acid), are a superfamily of nucleotide hydrolases and transferases, which act on the alpha-phosphate of ribonucleotides | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00