Citrus Sinensis ID: 032997


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------13
MNTDITASEKPQYPVIDRNPPFTAVVGNFNTLDYLRFSSITGVSVVVGYLSGIIPLHLHYLSHRVWIFNFINIDAGIKPGLKGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVARYQKRGFSS
ccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHccccccccccccccHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHccccc
ccccccccccccccEEccccccHHHHccccHHHHHHHHHHHHHHccHHHHcccccccccHHHHHHHHHcccccHccccccccccHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHccccc
mntditasekpqypvidrnppftavvgnfntldylrfSSITGVSVVVGYLSGIIPLHLHYLSHRVWIFNfinidagikpglkgpsmvtgGLIGLMGGFMYAYQNSagrlmgffpnegevaRYQKRGFSS
mntditasekpqypvidrnppFTAVVGNFNTLDYLRFSSITGVSVVVGYLSGIIPLHLHYLSHRVWIFNFINIDAGIKPGLKGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVARYQKRGFSS
MNTDITASEKPQYPVIDRNPPFTAVVGNFNTLDYLRFSSITGVSVVVGYLSGIIPLHLHYLSHRVWIFNFINIDAGIKPGLKGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVARYQKRGFSS
**************VIDRNPPFTAVVGNFNTLDYLRFSSITGVSVVVGYLSGIIPLHLHYLSHRVWIFNFINIDAGIKPGLKGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPN**************
********EKPQYPVIDRNPPFTAVVGNFNTLDYLRFSSITGVSVVVGYLSGIIPLHLHYLSHRVWIFNFINIDAGIKPGLKGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVA*********
*********KPQYPVIDRNPPFTAVVGNFNTLDYLRFSSITGVSVVVGYLSGIIPLHLHYLSHRVWIFNFINIDAGIKPGLKGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVARYQKRGFSS
****ITASEKPQYPVIDRNPPFTAVVGNFNTLDYLRFSSITGVSVVVGYLSGIIPLHLHYLSHRVWIFNFINIDAGIKPGLKGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVARYQKR****
ooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MNTDITASEKPQYPVIDRNPPFTAVVGNFNTLDYLRFSSITGVSVVVGYLSGIIPLHLHYLSHRVWIFNFINIDAGIKPGLKGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVARYQKRGFSS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query129
224089721105 predicted protein [Populus trichocarpa] 0.813 1.0 0.736 2e-44
297804546106 hypothetical protein ARALYDRAFT_915104 [ 0.790 0.962 0.722 2e-42
224137260105 predicted protein [Populus trichocarpa] 0.806 0.990 0.726 2e-42
315937234102 hypothetical protein [Jatropha curcas] 0.790 1.0 0.722 6e-42
21553735106 unknown [Arabidopsis thaliana] 0.790 0.962 0.714 7e-42
186511907106 uncharacterized protein [Arabidopsis tha 0.790 0.962 0.714 8e-42
2245001106 hypothetical protein [Arabidopsis thalia 0.790 0.962 0.714 1e-41
351720975101 uncharacterized protein LOC100306111 [Gl 0.782 1.0 0.704 8e-41
388513791101 unknown [Lotus japonicus] 0.782 1.0 0.704 1e-40
147799661116 hypothetical protein VITISV_010564 [Viti 0.775 0.862 0.701 2e-40
>gi|224089721|ref|XP_002308803.1| predicted protein [Populus trichocarpa] gi|118482707|gb|ABK93272.1| unknown [Populus trichocarpa] gi|118483785|gb|ABK93785.1| unknown [Populus trichocarpa] gi|222854779|gb|EEE92326.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  183 bits (464), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 95/129 (73%), Positives = 99/129 (76%), Gaps = 24/129 (18%)

Query: 1   MNTDITASEKPQYPVIDRNPPFTAVVGNFNTLDYLRFSSITGVSVVVGYLSGIIPLHLHY 60
           MNTDITAS KP+YPVIDRNP FT VVGNFNTLDY RF ++TGVSV VGYLSGI       
Sbjct: 1   MNTDITASAKPEYPVIDRNPEFTKVVGNFNTLDYCRFITLTGVSVTVGYLSGI------- 53

Query: 61  LSHRVWIFNFINIDAGIKPGLKGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVA 120
                            KPG+KGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVA
Sbjct: 54  -----------------KPGIKGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVA 96

Query: 121 RYQKRGFSS 129
           RYQKRGFSS
Sbjct: 97  RYQKRGFSS 105




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|297804546|ref|XP_002870157.1| hypothetical protein ARALYDRAFT_915104 [Arabidopsis lyrata subsp. lyrata] gi|297315993|gb|EFH46416.1| hypothetical protein ARALYDRAFT_915104 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|224137260|ref|XP_002322513.1| predicted protein [Populus trichocarpa] gi|222867143|gb|EEF04274.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|315937234|gb|ADU56175.1| hypothetical protein [Jatropha curcas] Back     alignment and taxonomy information
>gi|21553735|gb|AAM62828.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|186511907|ref|NP_567500.2| uncharacterized protein [Arabidopsis thaliana] gi|334186608|ref|NP_001190742.1| uncharacterized protein [Arabidopsis thaliana] gi|28416547|gb|AAO42804.1| At4g16450 [Arabidopsis thaliana] gi|110742933|dbj|BAE99362.1| hypothetical protein [Arabidopsis thaliana] gi|332658352|gb|AEE83752.1| uncharacterized protein [Arabidopsis thaliana] gi|332658353|gb|AEE83753.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|2245001|emb|CAB10421.1| hypothetical protein [Arabidopsis thaliana] gi|7268395|emb|CAB78687.1| hypothetical protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|351720975|ref|NP_001237706.1| uncharacterized protein LOC100306111 [Glycine max] gi|356555750|ref|XP_003546193.1| PREDICTED: uncharacterized protein LOC100819728 [Glycine max] gi|255627585|gb|ACU14137.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|388513791|gb|AFK44957.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|147799661|emb|CAN72866.1| hypothetical protein VITISV_010564 [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query129
TAIR|locus:2130649106 AT4G16450 "AT4G16450" [Arabido 0.418 0.509 0.909 9.7e-46
CGD|CAL0003931204 orf19.6607 [Candida albicans ( 0.713 0.450 0.296 0.00014
UNIPROTKB|Q59NK8204 CaO19.6607 "Putative uncharact 0.713 0.450 0.296 0.00014
TAIR|locus:2130649 AT4G16450 "AT4G16450" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 262 (97.3 bits), Expect = 9.7e-46, Sum P(2) = 9.7e-46
 Identities = 50/55 (90%), Positives = 53/55 (96%)

Query:    75 AGIKPGLKGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVARYQKRG-FS 128
             +GIKPG+KGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPN+GEVA YQKRG FS
Sbjct:    51 SGIKPGIKGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNDGEVASYQKRGGFS 105


GO:0003674 "molecular_function" evidence=ND
GO:0005739 "mitochondrion" evidence=ISM;IDA
GO:0009853 "photorespiration" evidence=RCA;TAS
GO:0031966 "mitochondrial membrane" evidence=IDA
GO:0045271 "respiratory chain complex I" evidence=IDA
GO:0016020 "membrane" evidence=IDA
GO:0005747 "mitochondrial respiratory chain complex I" evidence=IDA
GO:0006511 "ubiquitin-dependent protein catabolic process" evidence=RCA
GO:0051788 "response to misfolded protein" evidence=RCA
GO:0080129 "proteasome core complex assembly" evidence=RCA
CGD|CAL0003931 orf19.6607 [Candida albicans (taxid:5476)] Back     alignment and assigned GO terms
UNIPROTKB|Q59NK8 CaO19.6607 "Putative uncharacterized protein" [Candida albicans SC5314 (taxid:237561)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00060155
SubName- Full=Putative uncharacterized protein; (105 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query129
PLN0297597 PLN02975, PLN02975, complex I subunit 5e-54
pfam1078584 pfam10785, NADH-u_ox-rdase, NADH-ubiquinone oxidor 4e-24
>gnl|CDD|166616 PLN02975, PLN02975, complex I subunit Back     alignment and domain information
 Score =  164 bits (416), Expect = 5e-54
 Identities = 85/121 (70%), Positives = 93/121 (76%), Gaps = 24/121 (19%)

Query: 4   DITASEKPQYPVIDRNPPFTAVVGNFNTLDYLRFSSITGVSVVVGYLSGIIPLHLHYLSH 63
           DITAS+KP+YPV+DRNP FT VVGNF+ LDYLRF++ITGVSV VGYLSGI          
Sbjct: 1   DITASDKPEYPVVDRNPTFTKVVGNFSALDYLRFATITGVSVTVGYLSGI---------- 50

Query: 64  RVWIFNFINIDAGIKPGLKGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVARYQ 123
                         KPG++GPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVARYQ
Sbjct: 51  --------------KPGIRGPSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVARYQ 96

Query: 124 K 124
           K
Sbjct: 97  K 97


Length = 97

>gnl|CDD|192666 pfam10785, NADH-u_ox-rdase, NADH-ubiquinone oxidoreductase complex I, 21 kDa subunit Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 129
PLN0297597 complex I subunit 100.0
PF1078586 NADH-u_ox-rdase: NADH-ubiquinone oxidoreductase co 100.0
>PLN02975 complex I subunit Back     alignment and domain information
Probab=100.00  E-value=7.7e-50  Score=288.69  Aligned_cols=97  Identities=88%  Similarity=1.471  Sum_probs=93.6

Q ss_pred             ccCCCCCCCCCcccCCCCchhhhcccChhHHHHHHHhhhhhhHHHHHhcccchhhhhhhhhhhhhhhcccccCCCCCCCC
Q 032997            4 DITASEKPQYPVIDRNPPFTAVVGNFNTLDYLRFSSITGVSVVVGYLSGIIPLHLHYLSHRVWIFNFINIDAGIKPGLKG   83 (129)
Q Consensus         4 ~i~~~~~p~YPvID~dP~f~rVv~yfR~SDY~~~a~~ta~~~~~~y~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~   83 (129)
                      ||+..++|+|||||+||||+|||+|||+|||++|+++|+++|+++|+.+                        ++|++++
T Consensus         1 ~~~~~~~P~YPlId~dP~f~rVv~yfr~sDY~~~a~~ta~s~~~~~~~~------------------------~~~~~~~   56 (97)
T PLN02975          1 DITASDKPEYPVVDRNPTFTKVVGNFSALDYLRFATITGVSVTVGYLSG------------------------IKPGIRG   56 (97)
T ss_pred             CCcccCCCCCCccCCCCChHHHHHhCCHHHHHHHHHHHHHHHHHHHHHc------------------------cCccccc
Confidence            6899999999999999999999999999999999999999999999977                        6788899


Q ss_pred             chHHHHHHHHHHHHHHHHHhhccccccCcCCCHHHHHHhhh
Q 032997           84 PSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVARYQK  124 (129)
Q Consensus        84 ~amr~ag~iG~~GGfl~aYqrS~~Rf~G~~EN~rEv~ry~~  124 (129)
                      ++||++|+||++||||+||||||+|||||+||+||||||+|
T Consensus        57 ~~mr~ag~iG~~gGf~~aYq~S~~Rf~G~~EN~rEV~~~~~   97 (97)
T PLN02975         57 PSMVTGGLIGLMGGFMYAYQNSAGRLMGFFPNEGEVARYQK   97 (97)
T ss_pred             hHHHHHHHHHHhhhHHhhhcccchhhcCCCCCHHHHHhccC
Confidence            99999999999999999999999999999999999999986



>PF10785 NADH-u_ox-rdase: NADH-ubiquinone oxidoreductase complex I, 21 kDa subunit; InterPro: IPR019721 This domain is found in the N-terminal region of NADH-ubiquinone oxidoreductase 21kDa subunits from plants and fungi [] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00