Citrus Sinensis ID: 019937


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330---
MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKMDRHGIRAECFR
ccHHHHHHHHHccccccccccccccEEEEEcccccEEEEEEcccccccEEEEccccHHHHHHHHHHHHcccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccEEHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHcccccHHHHccHHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHccccccHHHHHHcccccccccccccccccccccccccccccc
ccHHHHHHHccccccccccccccccEEEEEccccEEEEEEEcccccccEEEccccccccHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHEEEEHccccccccHHHHHHHHHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHccHHHHHccHHHHHHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHHHEEEHHcccHHHHHHHHHHHHHccccccHHHHHHcccccccccccHHHccccHccccccHHHcc
MNLLEKIKKQkkkepdksmsvqeipiywfetsdsvtrhyqfqpdghlsvkvvddsrpvyHRVVDSFlnkffpsgypfsvnegylrYTQFRALQHFTSAALSVLSTQSLLFaaglrptpaQATVVSWILKDGMQHVGKLICSnlgarmdsepKRWRILADALYDLgtglevlsplcPQLFLEMAGLGNFAKGIAVVAARatrlpiyssfakegnlsdlfakGEAISTLFNVVGLGAGIQLASTVCSsmqgkmivgpLLSVIHVYSVIEEmraapvntlnpqrTAMIVADFVktgrisspadlryredllfpgrliedagnvkmdrhgiraecfr
mnllekikkqkkkepdksmsvqeipIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAapvntlnpqrTAMIVADFVKtgrisspadlryredllfpgrliedagnvkmdrhgiraecfr
MNLLEkikkqkkkepdkSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKMDRHGIRAECFR
**********************EIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVK************
*************************IYWFETSDSV*************************RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKMDRHGIRAECFR
MNLLEKI************SVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKMDRHGIRAECFR
*N******KQK********SVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKMDRHGIRA*CFR
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHoooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHoooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHoooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKMDRHGIRAECFR
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query333 2.2.26 [Sep-21-2011]
Q86K80 527 UPF0420 protein OS=Dictyo yes no 0.720 0.455 0.295 4e-26
Q499P8 466 UPF0420 protein C16orf58 yes no 0.714 0.510 0.298 1e-25
Q91W34 466 UPF0420 protein C16orf58 yes no 0.708 0.506 0.308 1e-25
Q5R8F6 468 UPF0420 protein C16orf58 yes no 0.813 0.579 0.278 3e-25
Q96GQ5 468 UPF0420 protein C16orf58 yes no 0.813 0.579 0.271 1e-24
>sp|Q86K80|U420_DICDI UPF0420 protein OS=Dictyostelium discoideum GN=DDB_G0277179 PE=3 SV=1 Back     alignment and function desciption
 Score =  119 bits (297), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 73/247 (29%), Positives = 130/247 (52%), Gaps = 7/247 (2%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P+GYP SV   Y  Y  + ++Q   S     L+T+++L   G+  + A   +    W+
Sbjct: 63  FLPNGYPDSVTTDYFGYQFWDSIQALCSTITGTLATRAILKGYGVGDSSATVASATTQWL 122

Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPL-CPQLFLEMAGL 185
           ++DGM  +G+++ +   G  +D   K+WR  AD L ++G   E++SPL   QLFL ++ +
Sbjct: 123 IRDGMGMIGRIVFAWRKGTDLDCNSKKWRYTADILNNIGMAFEMISPLFSSQLFLPLSCI 182

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV-- 243
           G  AK I  VA   T+  +   FAK  NL+D+ AK  +  T  N+VG+   + ++S +  
Sbjct: 183 GLIAKSICGVAGGCTKASLTQHFAKRDNLADVSAKDGSQETAVNLVGMLLSVIVSSFIND 242

Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPADLR 302
            +S+    +V    + +H++     + A  + ++N  R  +I   F+   G I SP+++ 
Sbjct: 243 NTSLIVTWLVFLFFTSLHLFCNYRAVSAVQLKSINRYRAYLIYDYFIHNQGSIPSPSEIS 302

Query: 303 YREDLLF 309
             E++LF
Sbjct: 303 KLENILF 309





Dictyostelium discoideum (taxid: 44689)
>sp|Q499P8|CP058_RAT UPF0420 protein C16orf58 homolog OS=Rattus norvegicus PE=2 SV=1 Back     alignment and function description
>sp|Q91W34|CP058_MOUSE UPF0420 protein C16orf58 homolog OS=Mus musculus PE=2 SV=1 Back     alignment and function description
>sp|Q5R8F6|CP058_PONAB UPF0420 protein C16orf58 homolog OS=Pongo abelii PE=2 SV=1 Back     alignment and function description
>sp|Q96GQ5|CP058_HUMAN UPF0420 protein C16orf58 OS=Homo sapiens GN=C16orf58 PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query333
224082810428 predicted protein [Populus trichocarpa] 0.975 0.759 0.860 1e-165
297744143424 unnamed protein product [Vitis vinifera] 0.948 0.745 0.867 1e-162
255567274 436 conserved hypothetical protein [Ricinus 0.969 0.740 0.838 1e-162
225438149420 PREDICTED: UPF0420 protein C16orf58 homo 0.933 0.740 0.875 1e-160
334184613432 uncharacterized protein [Arabidopsis tha 0.972 0.75 0.799 1e-156
18402585433 uncharacterized protein [Arabidopsis tha 0.966 0.743 0.801 1e-155
449462449428 PREDICTED: UPF0420 protein C16orf58 homo 0.960 0.747 0.802 1e-155
449502048428 PREDICTED: UPF0420 protein C16orf58 homo 0.960 0.747 0.802 1e-154
356545865431 PREDICTED: UPF0420 protein C16orf58 homo 0.957 0.740 0.798 1e-152
356505015415 PREDICTED: UPF0420 protein C16orf58 homo 0.918 0.737 0.813 1e-149
>gi|224082810|ref|XP_002306848.1| predicted protein [Populus trichocarpa] gi|222856297|gb|EEE93844.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  586 bits (1510), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 283/329 (86%), Positives = 305/329 (92%), Gaps = 4/329 (1%)

Query: 1   MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
           MNLL+KIK QKK EPDK+    EIP+YW ETSDSV+RH+QF+PDG LS+KVVDD+RPVY 
Sbjct: 1   MNLLDKIKMQKK-EPDKT---PEIPVYWIETSDSVSRHFQFEPDGQLSMKVVDDARPVYR 56

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
           RVV+SFLNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQ
Sbjct: 57  RVVESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQ 116

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
           AT VSWILKDGMQH GKLICSNLGARMDSEPKRWRILAD LYDLGTGLEVLSPLCP LFL
Sbjct: 117 ATAVSWILKDGMQHAGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPHLFL 176

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
           E+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+GLG GIQLA
Sbjct: 177 EVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVLGLGVGIQLA 236

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
           STVCSSMQGK + GPLLS++HV  VIEEMRA PVNTLNPQRTAM+VADFVKTG+ISSPAD
Sbjct: 237 STVCSSMQGKFVAGPLLSIVHVCCVIEEMRATPVNTLNPQRTAMVVADFVKTGKISSPAD 296

Query: 301 LRYREDLLFPGRLIEDAGNVKMDRHGIRA 329
           LRY EDLLFPGRLIE+AGNVK+ +   RA
Sbjct: 297 LRYHEDLLFPGRLIENAGNVKVGQALHRA 325




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|297744143|emb|CBI37113.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|255567274|ref|XP_002524618.1| conserved hypothetical protein [Ricinus communis] gi|223536171|gb|EEF37826.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|225438149|ref|XP_002273202.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera] Back     alignment and taxonomy information
>gi|334184613|ref|NP_001189649.1| uncharacterized protein [Arabidopsis thaliana] gi|330253412|gb|AEC08506.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|18402585|ref|NP_565718.1| uncharacterized protein [Arabidopsis thaliana] gi|20197897|gb|AAD20664.2| expressed protein [Arabidopsis thaliana] gi|21553562|gb|AAM62655.1| unknown [Arabidopsis thaliana] gi|109134157|gb|ABG25076.1| At2g31190 [Arabidopsis thaliana] gi|330253411|gb|AEC08505.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|449462449|ref|XP_004148953.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus] Back     alignment and taxonomy information
>gi|449502048|ref|XP_004161529.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus] Back     alignment and taxonomy information
>gi|356545865|ref|XP_003541354.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max] Back     alignment and taxonomy information
>gi|356505015|ref|XP_003521288.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query333
TAIR|locus:2042536433 RUS2 "AT2G31190" [Arabidopsis 0.915 0.704 0.822 1.1e-137
TAIR|locus:2155436 497 RUS6 "AT5G49820" [Arabidopsis 0.732 0.490 0.380 2.1e-38
TAIR|locus:2077197 608 RUS1 "AT3G45890" [Arabidopsis 0.735 0.402 0.314 6.8e-33
ZFIN|ZDB-GENE-070410-89 435 zgc:162613 "zgc:162613" [Danio 0.738 0.565 0.318 2.6e-31
TAIR|locus:2023885 440 RUS3 "AT1G13770" [Arabidopsis 0.750 0.568 0.292 1.2e-28
MGI|MGI:2384572 466 BC017158 "cDNA sequence BC0171 0.702 0.502 0.311 1.1e-27
RGD|1310127 466 RGD1310127 "similar to cDNA se 0.708 0.506 0.301 1.9e-27
DICTYBASE|DDB_G0278315 567 DDB_G0278315 "DUF647 family pr 0.777 0.456 0.272 2.7e-26
DICTYBASE|DDB_G0277179 527 DDB_G0277179 "DUF647 family pr 0.717 0.453 0.306 3.3e-26
UNIPROTKB|Q96GQ5 468 C16orf58 "UPF0420 protein C16o 0.813 0.579 0.271 3.7e-26
TAIR|locus:2042536 RUS2 "AT2G31190" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1348 (479.6 bits), Expect = 1.1e-137, P = 1.1e-137
 Identities = 251/305 (82%), Positives = 286/305 (93%)

Query:    20 SVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSV 79
             S ++ P+YWFETSDSV+  YQFQ DGHLS+KVVDD+RPV  ++V+SFLNKFFPSGYP+SV
Sbjct:    21 SPEDFPVYWFETSDSVSHRYQFQSDGHLSMKVVDDARPVPQKMVESFLNKFFPSGYPYSV 80

Query:    80 NEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLI 139
             NEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLI
Sbjct:    81 NEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLI 140

Query:   140 CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARA 199
             CSNLGARMDSEPKRWRILAD LYDLGTGLE++SPLCP LFLEMAGLGNFAKG+A VAARA
Sbjct:   141 CSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEMAGLGNFAKGMATVAARA 200

Query:   200 TRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV 259
             TRLPIYSSFAKEGNLSD+FAKGEAISTLFNV G+GAGIQLAST+CSSM+GK++VG +LSV
Sbjct:   201 TRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLASTICSSMEGKLVVGSILSV 260

Query:   260 IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGN 319
             +HVYSV+E+MR  P+NTLNPQRTA+IVA+F+KTG++ SP DLR++EDL+FP R I+DAGN
Sbjct:   261 VHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPDLRFQEDLMFPERPIQDAGN 320

Query:   320 VKMDR 324
             VK+ R
Sbjct:   321 VKVGR 325




GO:0009941 "chloroplast envelope" evidence=IDA
GO:0005739 "mitochondrion" evidence=IDA
GO:0009536 "plastid" evidence=IDA
GO:0010224 "response to UV-B" evidence=IMP
GO:0009926 "auxin polar transport" evidence=IMP
TAIR|locus:2155436 RUS6 "AT5G49820" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2077197 RUS1 "AT3G45890" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-070410-89 zgc:162613 "zgc:162613" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
TAIR|locus:2023885 RUS3 "AT1G13770" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
MGI|MGI:2384572 BC017158 "cDNA sequence BC017158" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
RGD|1310127 RGD1310127 "similar to cDNA sequence BC017158" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0278315 DDB_G0278315 "DUF647 family protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0277179 DDB_G0277179 "DUF647 family protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
UNIPROTKB|Q96GQ5 C16orf58 "UPF0420 protein C16orf58" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query333
pfam04884251 pfam04884, DUF647, Protein of unknown function (DU 1e-103
>gnl|CDD|191120 pfam04884, DUF647, Protein of unknown function (DUF647) Back     alignment and domain information
 Score =  302 bits (776), Expect = e-103
 Identities = 109/250 (43%), Positives = 161/250 (64%), Gaps = 3/250 (1%)

Query: 49  VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
            K+   +  +  R+V S LN F P GYP SV E YL Y  + +LQ F+S+   VL+TQ+L
Sbjct: 1   SKLDLKAGKLRSRLVSSLLNVFLPEGYPDSVTEDYLPYQLWDSLQAFSSSISGVLATQAL 60

Query: 109 LFAAGL--RPTPAQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLG 165
           L   G+        A  + WILKDG   +G+++ ++ LG+ +DSEPK+WR+LAD L DL 
Sbjct: 61  LEGVGVGSSSALPTAAAILWILKDGTGRLGRILFAHRLGSALDSEPKKWRLLADVLNDLA 120

Query: 166 TGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
            GLE+L+PL PQLFL +A   N AK +A VAA ATR  + + FA  GNL+D+ AK E+  
Sbjct: 121 MGLELLTPLFPQLFLPLASAANVAKSLAGVAAGATRASLTAHFALRGNLADVSAKDESQE 180

Query: 226 TLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
           TL N++GL  GI LASTV +S     +   LLSV+H+Y+  + +R+  + TLN QR +++
Sbjct: 181 TLVNLLGLLVGILLASTVSTSAALTWVSFGLLSVVHLYANYQAVRSVQLRTLNRQRASIV 240

Query: 286 VADFVKTGRI 295
           + +++K+G++
Sbjct: 241 LEEYLKSGKV 250


In plants, this domain plays a role in auxin-transport, plant growth and development. Length = 251

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 333
PF04884250 DUF647: Vitamin B6 photo-protection and homoeostas 100.0
KOG4249 408 consensus Uncharacterized conserved protein [Funct 100.0
>PF04884 DUF647: Vitamin B6 photo-protection and homoeostasis; InterPro: IPR006968 This is a family of proteins of unknown function, restricted to eukaryotes Back     alignment and domain information
Probab=100.00  E-value=1.8e-92  Score=662.94  Aligned_cols=241  Identities=39%  Similarity=0.627  Sum_probs=231.5

Q ss_pred             chhhhhHHHHHHHhhcCCCCCCccccCccchhHHHHhhHHHhhHHHHHHHHHHHHHhCCC--CChhHHHHHHHHHHhhhh
Q 019937           56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR--PTPAQATVVSWILKDGMQ  133 (333)
Q Consensus        56 ~~~~~~~~~~l~~~FLP~GYP~SVs~dYl~Y~~w~~lq~f~ssi~gvLstqAlL~avGV~--~a~~~AAal~WvLKDg~G  133 (333)
                      +...+++.+.++++|||+|||+||||||++||+||++|+|+|+++|+|+|||+|+|+||+  +++|+|||++||||||+|
T Consensus         7 ~~~~~~~~~~l~~~fLP~GyP~SVs~dY~~y~~w~~~q~~~s~~~~vlstqalL~~vGvG~~~a~~~aAa~~wvlKDg~G   86 (250)
T PF04884_consen    7 PRRSRSLQQRLRDVFLPAGYPDSVSPDYLRYQIWDFLQAFASSISGVLSTQALLKAVGVGDASASPLAAALNWVLKDGLG   86 (250)
T ss_pred             hhhhhHHHHHHHHHhCCCCCCCCcCchhHhHHHHHHHHHHHHHHHHHHHHHHHHHHcccCCCCCCHHHHHHHHHHHHhHH
Confidence            345678899999999999999999999999999999999999999999999999999994  677999999999999999


Q ss_pred             HHHHHHHhh-ccCccCCCCceehhhHHHHHHHHHHHHhhhccCCchhHHHHhHHHHHHHHHHHhhhcchhhhhhhhhhcC
Q 019937          134 HVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG  212 (333)
Q Consensus       134 ~lg~ilfa~-~G~~~D~d~K~wRl~Advlnd~a~~lElltP~~P~~Fl~la~lan~~Kai~gva~~aTraai~~hFA~~~  212 (333)
                      |+|+|+|++ +|++||+|||+|||+||++||+|+++|++||+||++|++++|++|++|++|++|++|||++|++|||++|
T Consensus        87 ~lg~ilfa~~~g~~~D~d~K~wRl~ad~l~d~a~~lEl~tP~~P~~fl~la~~a~~~k~v~~va~gaTraai~~hfA~~~  166 (250)
T PF04884_consen   87 MLGRILFAWRFGSSFDSDPKRWRLVADVLNDAAMILELLTPLFPQYFLPLACLANVAKAVCGVAGGATRAAITQHFARRN  166 (250)
T ss_pred             HHHHHHhhhcccccccccchhHhhHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHhhC
Confidence            999999985 7999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CcchhhccchhhHHHHHHHHHhHhhhhhhhhcccccchHHHHHHHHHHHHHHhhhhhhcccCCCccHHHHHHHHHHHHhc
Q 019937          213 NLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT  292 (333)
Q Consensus       213 NlgDV~AK~~SQet~~~LlGl~~Gi~l~~~v~~~~~~~~~~~~~ls~lHL~~nY~avr~v~l~TLN~~R~~iv~~~~l~~  292 (333)
                      |+|||+|||+||||+++|+||++|+.+++.++.+....++++.+++++||||||++||+|+|+|||+||++|++++|+++
T Consensus       167 NlaDv~AK~~sQet~~~LlG~~~Gi~l~~~~~~~~~~~~~~f~~l~~~Hl~~ny~avr~v~l~TLN~~R~~i~~~~~l~~  246 (250)
T PF04884_consen  167 NLADVTAKDGSQETAANLLGMLLGIFLSSLVSGSPSAVWIAFIILSALHLYCNYKAVRAVQLRTLNRQRLEILLEEFLRT  246 (250)
T ss_pred             cHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhccCchHHHHHHHHHHHHHHHHHHHHhhcccccCcCHHHHHHHHHHHHHc
Confidence            99999999999999999999999999999998665566778888999999999999999999999999999999999999


Q ss_pred             CCCC
Q 019937          293 GRIS  296 (333)
Q Consensus       293 g~V~  296 (333)
                      |+||
T Consensus       247 g~V~  250 (250)
T PF04884_consen  247 GRVP  250 (250)
T ss_pred             CCCC
Confidence            9998



>KOG4249 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query333
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 5e-07
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 3e-04
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 8e-04
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 50.6 bits (120), Expect = 5e-07
 Identities = 27/205 (13%), Positives = 55/205 (26%), Gaps = 52/205 (25%)

Query: 2   NLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH- 60
            LL +I        D S ++ ++ I+  +      R           + V+ +   V + 
Sbjct: 204 KLLYQIDPNWTSRSDHSSNI-KLRIHSIQAEL---RRLLKSKPYENCLLVLLN---VQNA 256

Query: 61  RVVDSFLNK-------------FFPSG---YPFSVNEGYLRYTQFRALQHFTSAALSVLS 104
           +  ++F                 F S       S++   +  T            L    
Sbjct: 257 KAWNAFNLSCKILLTTRFKQVTDFLSAATTTHISLDHHSMTLTPDEVKSLL-LKYLDC-R 314

Query: 105 TQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRW----RILADA 160
            Q L         P +    +               S +   +      W     +  D 
Sbjct: 315 PQDL---------PREVLTTN-----------PRRLSIIAESIRDGLATWDNWKHVNCDK 354

Query: 161 LYD-LGTGLEVLSP-LCPQLFLEMA 183
           L   + + L VL P    ++F  ++
Sbjct: 355 LTTIIESSLNVLEPAEYRKMFDRLS 379


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00