Citrus Sinensis ID: 022738
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 293 | ||||||
| 118483316 | 281 | unknown [Populus trichocarpa] | 0.941 | 0.982 | 0.617 | 2e-86 | |
| 224065502 | 281 | predicted protein [Populus trichocarpa] | 0.941 | 0.982 | 0.617 | 2e-86 | |
| 297836220 | 270 | hypothetical protein ARALYDRAFT_480447 [ | 0.863 | 0.937 | 0.570 | 4e-79 | |
| 297744265 | 274 | unnamed protein product [Vitis vinifera] | 0.877 | 0.937 | 0.580 | 5e-77 | |
| 388519487 | 249 | unknown [Medicago truncatula] | 0.795 | 0.935 | 0.529 | 1e-75 | |
| 217073278 | 249 | unknown [Medicago truncatula] | 0.795 | 0.935 | 0.529 | 2e-75 | |
| 449517824 | 280 | PREDICTED: uncharacterized LOC101206480 | 0.935 | 0.978 | 0.569 | 2e-73 | |
| 449468604 | 280 | PREDICTED: uncharacterized protein LOC10 | 0.935 | 0.978 | 0.566 | 3e-73 | |
| 18399283 | 270 | Tetraspanin family protein [Arabidopsis | 0.863 | 0.937 | 0.570 | 1e-71 | |
| 255560279 | 225 | conserved hypothetical protein [Ricinus | 0.723 | 0.942 | 0.630 | 3e-70 |
| >gi|118483316|gb|ABK93560.1| unknown [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 325 bits (832), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 184/298 (61%), Positives = 217/298 (72%), Gaps = 22/298 (7%)
Query: 1 MQRNCCHISFAFILKFLNFLQGFIGVSIILYSIWMLDQWNHHVPVPPLPPLAPTPDTSVS 60
M+RNCCH+ AF+LK LNFLQ FIG+SII+YS WMLDQWN+ VPV P +P S S
Sbjct: 1 MERNCCHVPLAFVLKLLNFLQAFIGISIIIYSFWMLDQWNNKVPVFPPSAPSPDSSFSSS 60
Query: 61 SSSLSLFLNSDTTQSRVLSHDHRRLTTSTGPLDVMVSGFDDVSGLGFDFNSFELPAPWFI 120
S L S + + V S +V F++ GL FD NSF+LPAPWFI
Sbjct: 61 SLLLLPGSESHSIRDLVFSD--------------VVPRFEN--GLDFDLNSFQLPAPWFI 104
Query: 121 YSFMGVGILVCCIALIGCIAAEAISGCCLCFYAILKIILFLLEAALVAFIAIDRRWEKDL 180
YSFMGVG+++CCI IGCIAAE+I+GCCLCFY ILKI+ LLEAALVAFIAIDRRWEKDL
Sbjct: 105 YSFMGVGVILCCITFIGCIAAESINGCCLCFYTILKIVFILLEAALVAFIAIDRRWEKDL 164
Query: 181 PFDPTGELDSLRSFIEDNVDICKWVGITVVIVQALSLLLAIILRAMVSTRRTNFDEEDDY 240
PFDPTGEL SLR F+E+NVDICKWVGITV+I+QAL+LLLA+ILRAMVSTRR FD EDD+
Sbjct: 165 PFDPTGELQSLRIFVEENVDICKWVGITVLIIQALALLLAMILRAMVSTRRNEFD-EDDF 223
Query: 241 ESERIRIREPLIHQ-----QPIAHSDIWSSRIREKYGLNGGGEQYTFGNNPSASMKSK 293
E+ R R REPL++Q P HSDIW+SR+REKYGL+ G + N S S KSK
Sbjct: 224 ENVRGRTREPLLNQSGQTFSPGTHSDIWTSRMREKYGLSTGDKPNLLNKNASMSTKSK 281
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|224065502|ref|XP_002301831.1| predicted protein [Populus trichocarpa] gi|118482993|gb|ABK93408.1| unknown [Populus trichocarpa] gi|222843557|gb|EEE81104.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|297836220|ref|XP_002885992.1| hypothetical protein ARALYDRAFT_480447 [Arabidopsis lyrata subsp. lyrata] gi|297331832|gb|EFH62251.1| hypothetical protein ARALYDRAFT_480447 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|297744265|emb|CBI37235.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|388519487|gb|AFK47805.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|217073278|gb|ACJ84998.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|449517824|ref|XP_004165944.1| PREDICTED: uncharacterized LOC101206480 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|449468604|ref|XP_004152011.1| PREDICTED: uncharacterized protein LOC101206480 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|18399283|ref|NP_565468.1| Tetraspanin family protein [Arabidopsis thaliana] gi|75249394|sp|Q93XY5.1|TET18_ARATH RecName: Full=Tetraspanin-18 gi|15450980|gb|AAK96761.1| Unknown protein [Arabidopsis thaliana] gi|20197855|gb|AAD21765.2| expressed protein [Arabidopsis thaliana] gi|21594467|gb|AAM66011.1| unknown [Arabidopsis thaliana] gi|330251889|gb|AEC06983.1| Tetraspanin family protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|255560279|ref|XP_002521157.1| conserved hypothetical protein [Ricinus communis] gi|223539726|gb|EEF41308.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 293 | ||||||
| TAIR|locus:2038997 | 270 | AT2G20230 [Arabidopsis thalian | 0.627 | 0.681 | 0.583 | 5.3e-69 | |
| TAIR|locus:2051348 | 221 | AT2G20740 "AT2G20740" [Arabido | 0.542 | 0.719 | 0.311 | 1.6e-30 | |
| TAIR|locus:2033718 | 280 | TOM2A "tobamovirus multiplicat | 0.587 | 0.614 | 0.353 | 1.1e-29 |
| TAIR|locus:2038997 AT2G20230 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 549 (198.3 bits), Expect = 5.3e-69, Sum P(2) = 5.3e-69
Identities = 112/192 (58%), Positives = 132/192 (68%)
Query: 91 PLDVMVSGFDDVSG--LGFDFNSFELPAPWFIYSFMGVGILVCCIALIGCIAAEAISGCC 148
P+D + S +G GF+ S +LPAPWFIYSFM VGILVC + IG IAAEAI+GCC
Sbjct: 70 PIDFVASIILGSNGGDHGFNLRSLDLPAPWFIYSFMAVGILVCIVTFIGFIAAEAINGCC 129
Query: 149 LCFYAILKIILFLLEAALVAFIAIDRRWEKDLPFDPTGELDSLRSFIEDNVDICKWVGIT 208
LCFY+ILK +L LLEAALVA+IAIDR WEKDLP+DPTGEL SLR+FIE+N+DICKWVGI
Sbjct: 130 LCFYSILKTLLILLEAALVAYIAIDRHWEKDLPYDPTGELSSLRAFIEENIDICKWVGIA 189
Query: 209 XXXXXXXXXXXXXXXXXMVSTRRTNFDEEDDYESERIRIREPLI----HQQPIAHSDI-- 262
MVST + DEE+D E+ R R +PL+ +Q P S I
Sbjct: 190 VVAVQLLSLLLAMVLRAMVSTPKPELDEEEDDENPRSRTWDPLLGPQGNQAPAGSSKIEN 249
Query: 263 WSSRIREKYGLN 274
WSSRIREKYGLN
Sbjct: 250 WSSRIREKYGLN 261
|
|
| TAIR|locus:2051348 AT2G20740 "AT2G20740" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2033718 TOM2A "tobamovirus multiplication 2A" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 293 | |||
| pfam00335 | 221 | pfam00335, Tetraspannin, Tetraspanin family | 3e-05 |
| >gnl|CDD|215864 pfam00335, Tetraspannin, Tetraspanin family | Back alignment and domain information |
|---|
Score = 43.9 bits (104), Expect = 3e-05
Identities = 17/58 (29%), Positives = 27/58 (46%)
Query: 117 PWFIYSFMGVGILVCCIALIGCIAAEAISGCCLCFYAILKIILFLLEAALVAFIAIDR 174
+ +G+++ + +GC A S C L Y IL +ILF+LE A + R
Sbjct: 44 RALYILIIVLGVIILLVGFLGCCGAIKESRCLLLTYFILLLILFILEIAAGILAFVYR 101
|
Length = 221 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 293 | |||
| KOG3882 | 237 | consensus Tetraspanin family integral membrane pro | 99.91 | |
| PF00335 | 221 | Tetraspannin: Tetraspanin family RDS_ROM1 subfamil | 99.6 | |
| KOG4433 | 526 | consensus Tweety transmembrane/cell surface protei | 94.15 | |
| PF04906 | 406 | Tweety: Tweety; InterPro: IPR006990 None of the me | 90.65 | |
| cd03164 | 86 | CD53_like_LEL Tetraspanin, extracellular domain or | 90.26 |
| >KOG3882 consensus Tetraspanin family integral membrane protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=99.91 E-value=1.4e-23 Score=190.12 Aligned_cols=163 Identities=26% Similarity=0.442 Sum_probs=136.1
Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCccCCCCCCCCCCCCCCCCCcccccccccccccccccccccc
Q 022738 3 RNCCHISFAFILKFLNFLQGFIGVSIILYSIWMLDQWNHHVPVPPLPPLAPTPDTSVSSSSLSLFLNSDTTQSRVLSHDH 82 (293)
Q Consensus 3 ~~~C~~~lk~lL~~lN~l~~l~Gl~lI~~giwml~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 82 (293)
.++|..|+|++++++|+++|++|++++++|+|++.+...... +
T Consensus 2 ~~~~~~~~K~~lf~~N~~~~l~G~~ll~~giw~~~~~~~~~~-------------------------------------~ 44 (237)
T KOG3882|consen 2 MSCGSSCLKYLLFLLNLLFWLLGLLLLAVGIWLLADKGFLSS-------------------------------------L 44 (237)
T ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHhhhheeEeccchhh-------------------------------------c
Confidence 678999999999999999999999999999999887665211 0
Q ss_pred ccccccCCCcccccccccCCCCCCcccCCCCCCcchhhhHHHhhhHHHHHHHHHHHHHHhhcccchhHHHHHHHHHHHHH
Q 022738 83 RRLTTSTGPLDVMVSGFDDVSGLGFDFNSFELPAPWFIYSFMGVGILVCCIALIGCIAAEAISGCCLCFYAILKIILFLL 162 (293)
Q Consensus 83 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~~wfIY~~i~vG~i~~iis~~GcigA~~~n~c~L~~Y~vll~lLill 162 (293)
. . . . .+..+| +++++|+++++++++||+||.+||+|+|.+|++++++++++
T Consensus 45 ~-----~-------------------~-~-~~~~~~---ili~~G~v~~~v~flGc~Ga~~es~~lL~~y~~~l~l~~i~ 95 (237)
T KOG3882|consen 45 L-----E-------------------S-D-FLVPAY---ILIAVGGVVFLVGFLGCCGALRESRCLLLSYFILLLLLFIA 95 (237)
T ss_pred c-----c-------------------c-c-hhcchh---hhhhhhHHHHHHHHhhhhhhHhhhHHHHHHHHHHHHHHHHH
Confidence 0 0 0 0 122334 89999999999999999999999999999999999999999
Q ss_pred HHHHHHhhcc-----------------------c----------------------CCCCCC----CC------------
Q 022738 163 EAALVAFIAI-----------------------D----------------------RRWEKD----LP------------ 181 (293)
Q Consensus 163 Eia~~a~i~~-----------------------n----------------------~dW~~~----iP------------ 181 (293)
|++++++.+. | .||.+. +|
T Consensus 96 e~~~~i~~~~~~~~l~~~~~~~~~~~~~~~y~~~~~~~~~~d~~Q~~~~CCG~~~~~~~~~~~~~~vP~SCC~~~~~~~~ 175 (237)
T KOG3882|consen 96 ELAAGILAFVFRDSLRDELEEQLLKSIWNNYSSDPDLGEAWDKLQRELKCCGVNGYSDYFNCSSNNVPPSCCKRTRRQKF 175 (237)
T ss_pred HHHHHHHhheeHHHHHHHHHHHHHHHHHHHhCCCccHHHHHHHHHHhccCCcCCCchHHhcCCCCCCCcccCCCcccccc
Confidence 9999966552 0 123211 55
Q ss_pred -------CCCCCchHHHHHHHHhhchhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc
Q 022738 182 -------FDPTGELDSLRSFIEDNVDICKWVGITVVIVQALSLLLAIILRAMVSTRR 231 (293)
Q Consensus 182 -------~y~tGC~~~l~~~l~~n~~ii~~v~l~v~iiQilgillA~~L~~~i~~~~ 231 (293)
.|++||.+++.+|+++|+.++++++++++++|++++++|++|...+++++
T Consensus 176 ~~~~~~~~~~~GC~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~a~~l~~~i~~~~ 232 (237)
T KOG3882|consen 176 PQDVPDNIYTEGCLEKLSSWLESNLLIIGGVGLGIAVLELLGMILACCLANAIRNQR 232 (237)
T ss_pred cccchhhhhccccHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 48899999999999999999999999999999999999999998876544
|
|
| >PF00335 Tetraspannin: Tetraspanin family RDS_ROM1 subfamily; InterPro: IPR018499 A number of eukaryotic CD antigens have been shown to be related [] | Back alignment and domain information |
|---|
| >KOG4433 consensus Tweety transmembrane/cell surface protein [General function prediction only] | Back alignment and domain information |
|---|
| >PF04906 Tweety: Tweety; InterPro: IPR006990 None of the members of the tweety (tty) family have been functionally characterised | Back alignment and domain information |
|---|
| >cd03164 CD53_like_LEL Tetraspanin, extracellular domain or large extracellular loop (LEL), CD53_Like family | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00