Citrus Sinensis ID: 025423
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 253 | ||||||
| 224083464 | 297 | predicted protein [Populus trichocarpa] | 0.972 | 0.828 | 0.532 | 8e-78 | |
| 388519441 | 240 | unknown [Lotus japonicus] | 0.936 | 0.987 | 0.558 | 4e-72 | |
| 357519993 | 243 | hypothetical protein MTR_8g093850 [Medic | 0.948 | 0.987 | 0.568 | 1e-71 | |
| 449459256 | 240 | PREDICTED: uncharacterized protein LOC10 | 0.928 | 0.979 | 0.535 | 3e-69 | |
| 297797403 | 236 | hypothetical protein ARALYDRAFT_496595 [ | 0.913 | 0.978 | 0.487 | 1e-52 | |
| 79547434 | 236 | uncharacterized protein [Arabidopsis tha | 0.913 | 0.978 | 0.479 | 5e-51 | |
| 27754430 | 236 | unknown protein [Arabidopsis thaliana] g | 0.913 | 0.978 | 0.479 | 5e-51 | |
| 8777314 | 245 | unnamed protein product [Arabidopsis tha | 0.948 | 0.979 | 0.463 | 9e-50 | |
| 296082599 | 169 | unnamed protein product [Vitis vinifera] | 0.592 | 0.887 | 0.647 | 1e-49 | |
| 351723845 | 229 | uncharacterized protein LOC100305573 [Gl | 0.877 | 0.969 | 0.468 | 2e-49 |
| >gi|224083464|ref|XP_002307037.1| predicted protein [Populus trichocarpa] gi|222856486|gb|EEE94033.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 295 bits (756), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 156/293 (53%), Positives = 196/293 (66%), Gaps = 47/293 (16%)
Query: 6 FEPIFGEPKAEWADSRSDSLG---RFLFHVSAPDSSHLLIQVTDFRSNTWEAKRSVLQLD 62
FEPIF EP+ WA + + G +FL H+ APD +HL IQVTD+ SNT+EA +SV+QLD
Sbjct: 7 FEPIFNEPRIGWAKNSNPGSGLMDQFLMHIFAPDDNHLKIQVTDYHSNTFEAVKSVMQLD 66
Query: 63 DMRDEIGIGGSWSEFIDYVVASIKSEDVKLILEGHSNADGAAYAKIVAQKSKGMPRISIS 122
DMRD IGIGGSW+EF++Y+VAS K+EDVKL+LE S++DG AYAK+VAQKSKGMP ISIS
Sbjct: 67 DMRDCIGIGGSWAEFVEYLVASFKAEDVKLVLEKLSDSDGVAYAKLVAQKSKGMPLISIS 126
Query: 123 LTRLTGSAATEAMAKLSLELFTAFRSMQTLIVQEQERCLQLEKEAAAEKERNENIQNQPL 182
LT+L +AA +AMA +S LF AF+ + L++QE+E LQL K +AEKER+ENIQ+Q
Sbjct: 127 LTKLLDNAARDAMANMSFGLFKAFKRTKNLVLQEKEHSLQLTKVISAEKERSENIQSQ-- 184
Query: 183 YSKRQKLQKMNFSDKTDISASILSNGSQDSP----------------------------- 213
KRQKL+KMN SD+ D+S SNG+Q+SP
Sbjct: 185 LGKRQKLEKMNSSDRLDVSGPPASNGAQNSPGCVLSLGGFQIICKLSASIYITECDDTVS 244
Query: 214 -------------DKQAAQSPVASKVANRVIPAHRRAKVRGALLQDTEDDKDN 253
DK A + P ++KV NRV+PA RRAKVRGALLQD EDDKDN
Sbjct: 245 SINKIFLNQIIDADKLAGRDPASTKVTNRVVPAFRRAKVRGALLQDIEDDKDN 297
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|388519441|gb|AFK47782.1| unknown [Lotus japonicus] | Back alignment and taxonomy information |
|---|
| >gi|357519993|ref|XP_003630285.1| hypothetical protein MTR_8g093850 [Medicago truncatula] gi|355524307|gb|AET04761.1| hypothetical protein MTR_8g093850 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|449459256|ref|XP_004147362.1| PREDICTED: uncharacterized protein LOC101217609 [Cucumis sativus] gi|449526088|ref|XP_004170046.1| PREDICTED: uncharacterized protein LOC101231777 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|297797403|ref|XP_002866586.1| hypothetical protein ARALYDRAFT_496595 [Arabidopsis lyrata subsp. lyrata] gi|297312421|gb|EFH42845.1| hypothetical protein ARALYDRAFT_496595 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|79547434|ref|NP_201206.3| uncharacterized protein [Arabidopsis thaliana] gi|332010446|gb|AED97829.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|27754430|gb|AAO22663.1| unknown protein [Arabidopsis thaliana] gi|50253576|gb|AAT71990.1| At1g12020 [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|8777314|dbj|BAA96904.1| unnamed protein product [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|296082599|emb|CBI21604.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|351723845|ref|NP_001237293.1| uncharacterized protein LOC100305573 [Glycine max] gi|255625951|gb|ACU13320.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 253 | ||||||
| TAIR|locus:2160786 | 236 | AT5G64010 "AT5G64010" [Arabido | 0.913 | 0.978 | 0.437 | 1.4e-41 |
| TAIR|locus:2160786 AT5G64010 "AT5G64010" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 441 (160.3 bits), Expect = 1.4e-41, P = 1.4e-41
Identities = 108/247 (43%), Positives = 143/247 (57%)
Query: 4 EGFEPIFGEPKAEWADSRSDSLGRFLFHVSAPDSSHLLIQVTDFRSNTWEAKRSVLQLDD 63
E FEPIFGE E +D S L R LFHV A DS +L + VTDF S W SV QLDD
Sbjct: 5 EKFEPIFGEVVPERSDPGSGLLRRCLFHVYASDSYNLTVHVTDFISGVWTTILSVSQLDD 64
Query: 64 MRDEIGIGGSWSEFIDYVVASIKSEDVKLILEGHSNADGAAYAKIVAQKSKGMPRISISL 123
MRD +GIGGSWSEF+DY VAS+KS++VKL+L S ++G A++V+QK+KGMPRI++ L
Sbjct: 65 MRDTVGIGGSWSEFVDYTVASLKSDNVKLLLGETSVSNGVKTARLVSQKAKGMPRINVPL 124
Query: 124 TRLTGSAATEAMAKLSLELFTAFRSMQTLIVQEQERCLQLXXXXXXXXXRNENIQNQPLY 183
T++ S+A+EAMA LSLELF AF+S Q L Q + +E + Y
Sbjct: 125 TKMVESSASEAMANLSLELFRAFKSKQHL--QGE--------VSFSAAATDEKDKRDATY 174
Query: 184 SKRQKLQKMNFSDKTDISASILSNGSQDSPDKQAAQSPVASKVANRVIPAHRRAKVRGAL 243
++ ++ +S K D+ A +N QDSP Q+A+ RV R K L
Sbjct: 175 NQLER-----YSRKLDVMAPS-TNNRQDSPANQSAREANTKNPVKRVPAHRRTRKRGALL 228
Query: 244 LQDTEDD 250
E+D
Sbjct: 229 QDSEEED 235
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.314 0.128 0.355 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 253 244 0.00098 113 3 11 23 0.47 33
32 0.45 36
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 1
No. of states in DFA: 592 (63 KB)
Total size of DFA: 174 KB (2102 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 21.28u 0.09s 21.37t Elapsed: 00:00:01
Total cpu time: 21.28u 0.09s 21.37t Elapsed: 00:00:01
Start: Fri May 10 08:15:32 2013 End: Fri May 10 08:15:33 2013
|
|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 253 | |||
| PF15384 | 197 | DUF4610: Domain of unknown function (DUF4610) | 97.18 | |
| PF06632 | 342 | XRCC4: DNA double-strand break repair and V(D)J re | 95.39 |
| >PF15384 DUF4610: Domain of unknown function (DUF4610) | Back alignment and domain information |
|---|
Probab=97.18 E-value=0.011 Score=53.05 Aligned_cols=112 Identities=18% Similarity=0.276 Sum_probs=95.3
Q ss_pred cceEEEEeCC-----CCCCeEEEEecccccchhhccccchhhhhHhhhcCCCChHHHHHHHHHhcccccceEeecCCCCC
Q 025423 26 GRFLFHVSAP-----DSSHLLIQVTDFRSNTWEAKRSVLQLDDMRDEIGIGGSWSEFIDYVVASIKSEDVKLILEGHSNA 100 (253)
Q Consensus 26 ~pfLFh~~a~-----d~~hL~v~vTDfHS~tW~~slSv~~LeDlRD~VGIGGSwsdF~dYl~~slsS~~VkL~l~~~s~s 100 (253)
-+|+++-+-. +.+...|+|||--. -|...|+-++|+.+|+-.|.- +..||.--|.+.+..|.|.|-|-.+
T Consensus 13 ~ryvCyce~~~~~~~~~g~~~i~vTDg~d-vW~t~~t~dsL~~~k~~~~L~-~~Edy~~rfR~Ac~~~~vtvtlqed--- 87 (197)
T PF15384_consen 13 PRYVCYCEGEGSGDGDAGVWNIYVTDGAD-VWSTCFTPDSLAALKARFGLS-SAEDYFSRFRAACEQQAVTVTLQED--- 87 (197)
T ss_pred CcEEEEEeCCCCCCCCCCeeEEEecccHH-hhhhccCHHHHHHHHhhcccc-hHHHHHHHHHHHhhcCeeEEEEecC---
Confidence 3588888877 67889999999874 499999999999999999984 6889999999999999999999863
Q ss_pred CCcccceeeeeccCCCCeeEEeeccccchhHHHHHHHhhHHHHHHHHH
Q 025423 101 DGAAYAKIVAQKSKGMPRISISLTRLTGSAATEAMAKLSLELFTAFRS 148 (253)
Q Consensus 101 ~G~~~akLVa~KaKGmP~ItI~L~~l~~saa~D~ma~~sl~Lf~afrs 148 (253)
+++-+-+.|-.-|++.|.++.++-+...+-.|-|.|.+....
T Consensus 88 ------~a~Ltls~g~s~L~~dL~k~p~~Ea~~~Lq~L~f~lAe~v~~ 129 (197)
T PF15384_consen 88 ------RASLTLSGGPSALTFDLSKVPAPEAAPRLQALTFRLAERVCS 129 (197)
T ss_pred ------eEEEEecCCCccceEEhhhCCCchhhHHHHHHHHHHHHHHHH
Confidence 444455888999999999999999998888888887765533
|
|
| >PF06632 XRCC4: DNA double-strand break repair and V(D)J recombination protein XRCC4; InterPro: IPR010585 This entry represents the DNA double-strand break repair and V(D)J recombination protein XRCC4, which is found in certain Metazoans, fungi and plants | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 253 | |||
| 3w03_C | 184 | DNA repair protein XRCC4; coiled-coil, NHEJ, DSBS | 94.41 | |
| 1ik9_A | 213 | DNA repair protein XRCC4; DNA END joining, double- | 93.03 | |
| 3mud_A | 175 | DNA repair protein XRCC4, tropomyosin alpha-1 CHA; | 93.03 | |
| 3q4f_C | 186 | DNA repair protein XRCC4; DSB repair, nuclear, rec | 89.27 |
| >3w03_C DNA repair protein XRCC4; coiled-coil, NHEJ, DSBS repair, KU70/80, DNA-PKCS, DNA ligas binding protein; HET: DNA; 8.49A {Homo sapiens} | Back alignment and structure |
|---|
Probab=94.41 E-value=2 Score=37.11 Aligned_cols=131 Identities=16% Similarity=0.223 Sum_probs=86.2
Q ss_pred CCCccceEEEEeC-CC-CCCeEEEEecccccchhhccccchhhhhHhhhcCCCChHHHHHHHHHhcccccc-----eEee
Q 025423 22 SDSLGRFLFHVSA-PD-SSHLLIQVTDFRSNTWEAKRSVLQLDDMRDEIGIGGSWSEFIDYVVASIKSEDV-----KLIL 94 (253)
Q Consensus 22 s~~~~pfLFh~~a-~d-~~hL~v~vTDfHS~tW~~slSv~~LeDlRD~VGIGGSwsdF~dYl~~slsS~~V-----kL~l 94 (253)
|-+-.+++..+.= .| .+.+.|.+||-+ ++|.+..|-+++..+-+++|- .-++|++-|+.+|-+++- ++-+
T Consensus 32 s~p~~~yflk~~W~~~l~sgF~L~LTDG~-sAW~G~vSe~~I~~~A~~~~m--~~e~Yv~eL~kAl~~~~~~~~~Y~f~~ 108 (184)
T 3w03_C 32 SEPSITHFLQVSWEKTLESGFVITLTDGH-SAWTGTVSESEISQEADDMAM--EKGKYVGELRKALLSGAGPADVYTFNF 108 (184)
T ss_dssp TCTTSCEEEEEEESSCGGGEEEEEEECSS-CEEEEEEEHHHHHHHHHHTTC--CHHHHHHHHHHHHSSCCCSSCCEEEEE
T ss_pred CCCCceEEEEEEeccCCCCceEEEEECCC-ceeeEEcCHHHHHHHHHHhCC--CHHHHHHHHHHHHhCCCCCCceEEEec
Confidence 3344555555544 22 367999999999 789999999999998888654 788999999999965443 5555
Q ss_pred cCCCCCCCcccceeeeecc-CCCCe--eEEeeccccchh-HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH
Q 025423 95 EGHSNADGAAYAKIVAQKS-KGMPR--ISISLTRLTGSA-ATEAMAKLSLELFTAFRSMQTLIVQEQERCLQ 162 (253)
Q Consensus 95 ~~~s~s~G~~~akLVa~Ka-KGmP~--ItI~L~~l~~sa-a~D~ma~~sl~Lf~afrs~q~~~~~eqe~~s~ 162 (253)
.+ ...++.--|. +++-. =+++|.++.+.+ +.--|-+++++-+..++..-.++.+|-+|..+
T Consensus 109 s~-------e~~~fs~eK~l~di~frLGSv~L~~v~~p~e~i~elid~~ld~~~~L~~~n~~LqkeNeRL~~ 173 (184)
T 3w03_C 109 SK-------ESCYFFFEKNLKDVSFRLGSFNLEKVENPAEVIRELICYCLDTIAENQAKNEHLQKENERLLR 173 (184)
T ss_dssp CT-------TTCEEEEEEEETTEEEEEEEEECEECSCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred Cc-------cceEEEeeeccCceEEEEeeeeeeECCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 43 2355544444 33321 147788888755 33336667777666666555555555555443
|
| >1ik9_A DNA repair protein XRCC4; DNA END joining, double-strand break repair, V(D)J recombination, protein-protein complex, coiled coil; HET: DNA; 2.30A {Homo sapiens} SCOP: b.59.1.1 h.1.11.1 PDB: 3ii6_A* 1fu1_A* 3rwr_A* | Back alignment and structure |
|---|
| >3mud_A DNA repair protein XRCC4, tropomyosin alpha-1 CHA; tropomysoin, overlap complex, coiled-coils, contractIle PROT; HET: DNA; 2.20A {Homo sapiens} PDB: 3sr2_A* | Back alignment and structure |
|---|
| >3q4f_C DNA repair protein XRCC4; DSB repair, nuclear, recombination-recombination complex, DN protein-protein binding complex; HET: DNA; 5.50A {Homo sapiens} | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00