Citrus Sinensis ID: 030628
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 174 | ||||||
| 224144561 | 285 | predicted protein [Populus trichocarpa] | 1.0 | 0.610 | 0.752 | 1e-74 | |
| 224123536 | 253 | predicted protein [Populus trichocarpa] | 1.0 | 0.687 | 0.758 | 5e-74 | |
| 225453038 | 285 | PREDICTED: uncharacterized protein LOC10 | 1.0 | 0.610 | 0.752 | 2e-73 | |
| 358248094 | 282 | uncharacterized protein LOC100814672 [Gl | 1.0 | 0.617 | 0.758 | 5e-73 | |
| 449465818 | 285 | PREDICTED: uncharacterized protein LOC10 | 1.0 | 0.610 | 0.735 | 6e-72 | |
| 356505987 | 282 | PREDICTED: uncharacterized protein LOC10 | 1.0 | 0.617 | 0.747 | 2e-71 | |
| 217075548 | 285 | unknown [Medicago truncatula] | 1.0 | 0.610 | 0.729 | 2e-68 | |
| 297849170 | 294 | heat shock protein binding protein [Arab | 1.0 | 0.591 | 0.701 | 2e-65 | |
| 18390922 | 294 | Chloroplast J-like domain 1-containing p | 1.0 | 0.591 | 0.695 | 3e-65 | |
| 21593057 | 294 | unknown [Arabidopsis thaliana] | 1.0 | 0.591 | 0.695 | 4e-65 |
| >gi|224144561|ref|XP_002325333.1| predicted protein [Populus trichocarpa] gi|222862208|gb|EEE99714.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 283 bits (725), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 131/174 (75%), Positives = 156/174 (89%)
Query: 1 MMEQLSKRKKGVTFGSFKVSKEIKFADKQPIFPWGPRFAKSSPQDIRINLAISAAFTAWI 60
MM QLS RKKGVT+GSFKV+K+IK+ADKQP+ PWGPRFAKSS D+RINLAISAAFTAWI
Sbjct: 112 MMAQLSNRKKGVTYGSFKVAKDIKYADKQPVVPWGPRFAKSSENDMRINLAISAAFTAWI 171
Query: 61 AIKRYAEYKPLQFLAFAFVYRFFEKLKSFEPAVSPTYTEEGDDDGRALRMGKRLLRCLAL 120
IKR AEYKPLQFLAFAFVYR FEKLK+FEP VSPTY+E+G+D+GR LR+GKR+LR LAL
Sbjct: 172 LIKRNAEYKPLQFLAFAFVYRIFEKLKTFEPPVSPTYSEDGEDEGRTLRLGKRILRALAL 231
Query: 121 VFGVIAVSSLAYTGILNLIEYAGGFIPAFLFDNQELIVTGSSAVLLFIMASYYR 174
VFG I +SLAYTGILNLIE AG +IPAFL++NQEL++T +++ +L+I+ASYYR
Sbjct: 232 VFGSITFASLAYTGILNLIEMAGSYIPAFLYNNQELLITTATSAILYILASYYR 285
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|224123536|ref|XP_002319104.1| predicted protein [Populus trichocarpa] gi|222857480|gb|EEE95027.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|225453038|ref|XP_002266453.1| PREDICTED: uncharacterized protein LOC100243269 [Vitis vinifera] gi|302143637|emb|CBI22390.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|358248094|ref|NP_001240067.1| uncharacterized protein LOC100814672 [Glycine max] gi|255638094|gb|ACU19361.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|449465818|ref|XP_004150624.1| PREDICTED: uncharacterized protein LOC101213790 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|356505987|ref|XP_003521770.1| PREDICTED: uncharacterized protein LOC100805399 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|217075548|gb|ACJ86134.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|297849170|ref|XP_002892466.1| heat shock protein binding protein [Arabidopsis lyrata subsp. lyrata] gi|297338308|gb|EFH68725.1| heat shock protein binding protein [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|18390922|ref|NP_563823.1| Chloroplast J-like domain 1-containing protein [Arabidopsis thaliana] gi|13877839|gb|AAK43997.1|AF370182_1 unknown protein [Arabidopsis thaliana] gi|16323504|gb|AAL15246.1| unknown protein [Arabidopsis thaliana] gi|332190204|gb|AEE28325.1| Chloroplast J-like domain 1-containing protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|21593057|gb|AAM65006.1| unknown [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 174 | ||||||
| TAIR|locus:2025660 | 294 | CJD1 "Chloroplast J-like domai | 1.0 | 0.591 | 0.695 | 5.5e-61 |
| TAIR|locus:2025660 CJD1 "Chloroplast J-like domain 1" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 624 (224.7 bits), Expect = 5.5e-61, P = 5.5e-61
Identities = 121/174 (69%), Positives = 141/174 (81%)
Query: 1 MMEQLSKRKKGVTFGSFKVSKEIKFADKQPIFPWGPRFAKSSPQDIRINLAISAAFTAWI 60
M QL RKKGVTFGSFKVSK+IK+ADKQPI PWGPRF++SS D+ INLAIS F+AWI
Sbjct: 121 MYAQLMNRKKGVTFGSFKVSKDIKYADKQPIIPWGPRFSRSSKNDMLINLAISVVFSAWI 180
Query: 61 AIKRYAEYKPLQFLAFAFVYRFFEKLKSFEPAVSPTYTEEGDDDGRALRMGKRLLRCLAL 120
AIKR EYKPLQF++F FVYR FEKLKSFE SP Y EEG++ GR LRMGKRLLR L+L
Sbjct: 181 AIKRNVEYKPLQFMSFVFVYRIFEKLKSFEAPSSPIYNEEGEESGRGLRMGKRLLRSLSL 240
Query: 121 VFGVIAVSSLAYTGILNLIEYAGGFIPAFLFDNQELIVTGSSAVLLFIMASYYR 174
VFG I ++SLAYTG LN IEY G IP L++NQELIVT SSA +L+++AS+YR
Sbjct: 241 VFGSILLASLAYTGFLNGIEYMGYSIPMVLYNNQELIVTASSAFMLYVIASFYR 294
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.326 0.140 0.415 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 174 174 0.00075 109 3 11 22 0.47 31
31 0.49 34
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 1
No. of states in DFA: 560 (60 KB)
Total size of DFA: 133 KB (2084 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 15.30u 0.18s 15.48t Elapsed: 00:00:00
Total cpu time: 15.30u 0.18s 15.48t Elapsed: 00:00:00
Start: Sat May 11 08:14:51 2013 End: Sat May 11 08:14:51 2013
|
|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 174 | |||
| PF11833 | 194 | DUF3353: Protein of unknown function (DUF3353); In | 99.39 |
| >PF11833 DUF3353: Protein of unknown function (DUF3353); InterPro: IPR021788 This family of proteins are functionally uncharacterised | Back alignment and domain information |
|---|
Probab=99.39 E-value=1.4e-12 Score=107.85 Aligned_cols=140 Identities=27% Similarity=0.422 Sum_probs=102.0
Q ss_pred ChhhhhhhhcCccccceeeehhhccccC-----------CCCccCCCC----CCCCCcchhhHhHHHHHHHHHHHHhhcc
Q 030628 1 MMEQLSKRKKGVTFGSFKVSKEIKFADK-----------QPIFPWGPR----FAKSSPQDIRINLAISAAFTAWIAIKRY 65 (174)
Q Consensus 1 MmaQLs~RkkG~t~Gs~~VsKdikYADk-----------qp~~PW~PR----~~~s~~kDi~IN~aIsa~~~~wi~~~~~ 65 (174)
||++|..|+|| .++|+++|||||+ ++..||.+| +...+.+|+.+..++-+++.+|......
T Consensus 38 LM~rL~~Rq~G----ki~v~~~ir~ad~~~~~~~~~~~~~~~p~wl~~~~~~~~~P~~~~l~~~~~~f~~L~~~~~~~~~ 113 (194)
T PF11833_consen 38 LMERLRQRQKG----KIKVPERIRYADREEPKPPNPKPSNPSPPWLQRLLPSFDTPSSQDLLIRAAAFGALGLWSLLFPA 113 (194)
T ss_pred HHHHHHHHHcC----CCCccHHHHHhhhccccccCCCCCCccchHHHhcccceeCCCcchHHHHHHHHHHHHHHHHHHcC
Confidence 69999999999 6899999999999 777899999 5888899999999998999999998877
Q ss_pred cccchHHHHHHHHHHHHHHHhhhcCCCCCCCCCCCCCCchhhhhhhHHHHHHHHHHHHHHHHHhhhHHhHH-HHHHHhCc
Q 030628 66 AEYKPLQFLAFAFVYRFFEKLKSFEPAVSPTYTEEGDDDGRALRMGKRLLRCLALVFGVIAVSSLAYTGIL-NLIEYAGG 144 (174)
Q Consensus 66 a~~kPl~f~af~~v~r~f~KL~~~ePp~~~~~~e~ge~~gr~~r~~KR~lRsl~LvfG~~~v~sL~yT~~l-n~ie~~g~ 144 (174)
++...+|.+ ++. ...+..|++ -++|+.|++++.+|++++ .+.....+ +.+.. .
T Consensus 114 ~~~~~l~La-l~~-~~~iyfl~~---------------------K~~~~~rA~~~~~~~L~~-G~~lGs~l~~~l~~--~ 167 (194)
T PF11833_consen 114 ASGPGLQLA-LGL-GACIYFLNR---------------------KERKLGRAFLWTLGGLVV-GLILGSLLASWLPV--D 167 (194)
T ss_pred CCCcchHHH-HHH-HHHHHHHHH---------------------hcchHHHHHHHHHHHHHH-HHHHHHHHHhhccc--c
Confidence 777777733 322 223333322 178999999999999988 44444444 33322 1
Q ss_pred ccchheeccceeeeehhhhHHHHhhhhc
Q 030628 145 FIPAFLFDNQELIVTGSSAVLLFIMASY 172 (174)
Q Consensus 145 ~iP~~~y~~Qel~vt~~~~v~L~~~~s~ 172 (174)
.+| .-.+=|.++|..+-++|....+|
T Consensus 168 ~~p--~~~s~~~~~sl~~~i~lwl~s~f 193 (194)
T PF11833_consen 168 IVP--GPWSPEQLVSLFTYILLWLVSLF 193 (194)
T ss_pred cCC--CCCCHHHHHHHHHHHHHHHHHhc
Confidence 223 34455777777777777766555
|
This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 205 to 258 amino acids in length. |
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00