Citrus Sinensis ID: 028395


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------21
MQLQLRSSSSASALFSAVQVKPPSSPPVRVKALFTKNPGSGTLKNSNSNTNPNPNPMASNPRWAQKTVTLPPLRRGCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSDVRDDTETFLNKIVPEGRSASWKHTLEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQVCRNGLGFRNYSLSCFLWSKSNFHVNRFCI
cccccccccccccccccccccccccccccccccEEcccccccccccccccccccccccccccEEEEEEEEccccccEEEccHHHHHHHHHHcccccEEEEEEEEccccEEEEEcccccccHHHHHHHHHHHHcccccccccEEcccccccHHHHHHHHHccccEEEEEEcccccccccccccccccccEEEEEEEEEEEcccccccEEc
ccHHHHHHccHHHHHHHccccccccccccHHHHccccccccccccccccccccccccccccHHHEEEEEccccccccEEcHHHHHHHcHHHHHHHHHcHEEEEEHHcccEEEEcccccccHHHHHHHHHHHHcccccccccEEcccccccccHHHHHHHccccEEEEcccccEEEccEEEEEEccccccEEEEEEEEccccccEEEEcc
mqlqlrssssASALFSAvqvkppssppvrvkalftknpgsgtlknsnsntnpnpnpmasnprwaqktvtlpplrrgchlitPKIVKEIAQDLSEFKCGLAHLFLLHTSasltinenydsdvrddTETFLNkivpegrsaswkhtlegpddmpahikssmfgctltipitdgqlnmgtWQVCrnglgfrnySLSCFLwsksnfhvnrfci
MQLQLRSSSSASALFSavqvkppssppvrVKALFTknpgsgtlknsnsntnpnpnpmasnprWAQKTVTLPPLRRGCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSDVRDDTETFLNkivpegrsaswkhtlEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQVCRNGLGFRNYSLSCflwsksnfhvnrfci
MqlqlrssssasalfsavqvkppssppvrvkALFTKNPGSGTLKnsnsntnpnpnpmasnpRWAQKTVTLPPLRRGCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSDVRDDTETFLNKIVPEGRSASWKHTLEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQVCRNGLGFRNYSLSCFLWSKSNFHVNRFCI
****************************************************************QKTVTLPPLRRGCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSDVRDDTETFLNKIVP*******************HIKSSMFGCTLTIPITDGQLNMGTWQVCRNGLGFRNYSLSCFLWSKSNFHVNRFC*
*************LFSAVQVKPPSSPPVRVK*********************************QKTVTLPPLRRGCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSDVRDDTETFLNKIVPEGRSASWKHTLEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQVCRNGLGFRNYSLSCFLWSKSN********
**************************PVRVKALFTKNPGSGTLKNSNSNTNPNPNPMASNPRWAQKTVTLPPLRRGCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSDVRDDTETFLNKIVPEGRSASWKHTLEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQVCRNGLGFRNYSLSCFLWSKSNFHVNRFCI
***********************************************************NPRWAQKTVTLPPLRRGCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSDVRDDTETFLNKIVPEGRSASWKHTLEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQVCRNGLGFRNYSLSCFLWSKSNFHVNRFCI
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MQLQLRSSSSASALFSAVQVKPPSSPPVRVKALFTKNPGSGTLKNSNSNTNPNPNPMASNPRWAQKTVTLPPLRRGCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSDVRDDTETFLNKIVPEGRSASWKHTLEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQVCRNGLGFRNYSLSCFLWSKSNFHVNRFCI
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query209 2.2.26 [Sep-21-2011]
O14155142 UPF0047 protein C4A8.02c yes no 0.535 0.788 0.513 1e-27
P0AF48138 UPF0047 protein YjbQ OS=E N/A no 0.545 0.826 0.495 3e-26
P0AF49138 UPF0047 protein YjbQ OS=E N/A no 0.545 0.826 0.495 3e-26
P0A2L1138 UPF0047 protein YjbQ OS=S yes no 0.545 0.826 0.504 2e-20
P0A2L2138 UPF0047 protein YjbQ OS=S N/A no 0.545 0.826 0.504 2e-20
P74125147 UPF0047 protein sll1880 O N/A no 0.564 0.802 0.414 4e-17
Q58481138 UPF0047 protein MJ1081 OS yes no 0.440 0.666 0.455 4e-14
O05243132 UPF0047 protein YugU OS=B yes no 0.430 0.681 0.34 7e-08
O28229126 UPF0047 protein AF_2050 O yes no 0.387 0.642 0.344 3e-07
O26865143 UPF0047 protein MTH_771 O yes no 0.497 0.727 0.290 3e-06
>sp|O14155|YE72_SCHPO UPF0047 protein C4A8.02c OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=SPAC4A8.02c PE=3 SV=1 Back     alignment and function desciption
 Score =  123 bits (308), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 59/115 (51%), Positives = 84/115 (73%), Gaps = 3/115 (2%)

Query: 65  QKTVTLPPLRRGCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSDVRDD 124
           Q+ +TL    +G ++IT  +VK++ + L  F  G  + F+ HTSA+LTINEN+D+D R D
Sbjct: 5   QRIITLDRRSKGFYIITNDLVKKLPE-LKSFSSGTVNFFIQHTSAALTINENWDADTRAD 63

Query: 125 TETFLNKIVPEGRSASWKHTLEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQ 179
               L+KIVPE  SA ++HT EG DDMPAH+KSS+ G +LT+PIT+G+L++GTWQ
Sbjct: 64  MNDILDKIVPE--SAGYRHTAEGLDDMPAHVKSSLIGPSLTVPITNGKLSLGTWQ 116





Schizosaccharomyces pombe (strain 972 / ATCC 24843) (taxid: 284812)
>sp|P0AF48|YJBQ_ECOLI UPF0047 protein YjbQ OS=Escherichia coli (strain K12) GN=yjbQ PE=3 SV=1 Back     alignment and function description
>sp|P0AF49|YJBQ_ECO57 UPF0047 protein YjbQ OS=Escherichia coli O157:H7 GN=yjbQ PE=3 SV=1 Back     alignment and function description
>sp|P0A2L1|YJBQ_SALTY UPF0047 protein YjbQ OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) GN=yjbQ PE=3 SV=1 Back     alignment and function description
>sp|P0A2L2|YJBQ_SALTI UPF0047 protein YjbQ OS=Salmonella typhi GN=yjbQ PE=3 SV=1 Back     alignment and function description
>sp|P74125|Y1880_SYNY3 UPF0047 protein sll1880 OS=Synechocystis sp. (strain PCC 6803 / Kazusa) GN=sll1880 PE=3 SV=1 Back     alignment and function description
>sp|Q58481|Y1081_METJA UPF0047 protein MJ1081 OS=Methanocaldococcus jannaschii (strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC 100440) GN=MJ1081 PE=3 SV=1 Back     alignment and function description
>sp|O05243|YUGU_BACSU UPF0047 protein YugU OS=Bacillus subtilis (strain 168) GN=yugU PE=3 SV=2 Back     alignment and function description
>sp|O28229|Y2050_ARCFU UPF0047 protein AF_2050 OS=Archaeoglobus fulgidus (strain ATCC 49558 / VC-16 / DSM 4304 / JCM 9628 / NBRC 100126) GN=AF_2050 PE=3 SV=1 Back     alignment and function description
>sp|O26865|Y771_METTH UPF0047 protein MTH_771 OS=Methanothermobacter thermautotrophicus (strain ATCC 29096 / DSM 1053 / JCM 10044 / NBRC 100330 / Delta H) GN=MTH_771 PE=3 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query209
225457713236 PREDICTED: uncharacterized protein LOC10 0.822 0.728 0.661 8e-61
449527653162 PREDICTED: UPF0047 protein C4A8.02c-like 0.741 0.956 0.730 1e-60
449455657185 PREDICTED: UPF0047 protein C4A8.02c-like 0.741 0.837 0.730 3e-60
224065613163 predicted protein [Populus trichocarpa] 0.669 0.858 0.756 7e-60
255539386195 conserved hypothetical protein [Ricinus 0.789 0.846 0.698 8e-60
118481865147 unknown [Populus trichocarpa] 0.578 0.823 0.900 2e-59
297845088218 hypothetical protein ARALYDRAFT_472344 [ 0.760 0.729 0.651 8e-59
297745635148 unnamed protein product [Vitis vinifera] 0.569 0.804 0.899 9e-59
356521568175 PREDICTED: UPF0047 protein yjbQ-like [Gl 0.569 0.68 0.890 7e-58
357475211173 hypothetical protein MTR_4g084120 [Medic 0.588 0.710 0.845 2e-57
>gi|225457713|ref|XP_002277414.1| PREDICTED: uncharacterized protein LOC100252976 isoform 1 [Vitis vinifera] Back     alignment and taxonomy information
 Score =  238 bits (608), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 119/180 (66%), Positives = 136/180 (75%), Gaps = 8/180 (4%)

Query: 1   MQLQLRSSSSASALFSAVQVKPP-SSPPVRVKALFTKNPGSGTLKNSNSNTNPNPNPMAS 59
           +  + +      A   ++  KPP +S  +RVK+L+T         + N  T+        
Sbjct: 41  ITYKFKEEVPMQASLLSLGAKPPLTSHQIRVKSLYTPT-------SFNDPTDSTSMAAIP 93

Query: 60  NPRWAQKTVTLPPLRRGCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDS 119
            P+WAQKT+TLPP RRGCH ITPKI+KEI QDLS FKCGLAHLF+ HTSASLTINENYDS
Sbjct: 94  APKWAQKTITLPPQRRGCHHITPKILKEIGQDLSGFKCGLAHLFIQHTSASLTINENYDS 153

Query: 120 DVRDDTETFLNKIVPEGRSASWKHTLEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQ 179
           DVRDDTETFL+KIVPEGRSA WKHTLEGPDDMPAHIKSSMFGC+LTIPITDGQLNMGTWQ
Sbjct: 154 DVRDDTETFLSKIVPEGRSAPWKHTLEGPDDMPAHIKSSMFGCSLTIPITDGQLNMGTWQ 213




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449527653|ref|XP_004170824.1| PREDICTED: UPF0047 protein C4A8.02c-like, partial [Cucumis sativus] Back     alignment and taxonomy information
>gi|449455657|ref|XP_004145568.1| PREDICTED: UPF0047 protein C4A8.02c-like isoform 1 [Cucumis sativus] Back     alignment and taxonomy information
>gi|224065613|ref|XP_002301884.1| predicted protein [Populus trichocarpa] gi|222843610|gb|EEE81157.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255539386|ref|XP_002510758.1| conserved hypothetical protein [Ricinus communis] gi|223551459|gb|EEF52945.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|118481865|gb|ABK92869.1| unknown [Populus trichocarpa] Back     alignment and taxonomy information
>gi|297845088|ref|XP_002890425.1| hypothetical protein ARALYDRAFT_472344 [Arabidopsis lyrata subsp. lyrata] gi|297336267|gb|EFH66684.1| hypothetical protein ARALYDRAFT_472344 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|297745635|emb|CBI40800.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356521568|ref|XP_003529426.1| PREDICTED: UPF0047 protein yjbQ-like [Glycine max] Back     alignment and taxonomy information
>gi|357475211|ref|XP_003607891.1| hypothetical protein MTR_4g084120 [Medicago truncatula] gi|85719357|gb|ABC75362.1| Protein of unknown function UPF0047 [Medicago truncatula] gi|355508946|gb|AES90088.1| hypothetical protein MTR_4g084120 [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query209
TAIR|locus:505006137217 AT1G21065 "AT1G21065" [Arabido 0.564 0.543 0.822 1e-52
FB|FBgn0263355 687 CG31688 [Drosophila melanogast 0.545 0.165 0.547 1.4e-31
UNIPROTKB|Q60BB8139 MCA0559 "Putative uncharacteri 0.550 0.827 0.529 2.1e-29
UNIPROTKB|Q47W13139 CPS_4360 "Putative uncharacter 0.550 0.827 0.504 7.1e-29
TIGR_CMR|CPS_4360139 CPS_4360 "conserved hypothetic 0.550 0.827 0.504 7.1e-29
UNIPROTKB|G4MUS6210 MGG_01675 "Uncharacterized pro 0.698 0.695 0.434 5e-28
UNIPROTKB|Q9KUY5139 VC_0373 "Putative uncharacteri 0.550 0.827 0.529 5e-28
TIGR_CMR|VC_0373139 VC_0373 "conserved hypothetica 0.550 0.827 0.529 5e-28
POMBASE|SPAC4A8.02c142 SPAC4A8.02c "conserved protein 0.535 0.788 0.513 1.7e-27
ASPGD|ASPL0000014934144 AN8050 [Emericella nidulans (t 0.555 0.805 0.466 7.3e-27
TAIR|locus:505006137 AT1G21065 "AT1G21065" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 546 (197.3 bits), Expect = 1.0e-52, P = 1.0e-52
 Identities = 97/118 (82%), Positives = 109/118 (92%)

Query:    62 RWAQKTVTLPPLRRGCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSDV 121
             +WAQKT+TLPPLRRGCHLITPKI+KEI +DLS+F CGLAH+FL HTSASLTINENYD DV
Sbjct:    77 KWAQKTITLPPLRRGCHLITPKILKEIREDLSDFNCGLAHVFLQHTSASLTINENYDPDV 136

Query:   122 RDDTETFLNKIVPEGRSASWKHTLEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQ 179
             + DTETFLN+IVPEG SA W+HT+EGPDDMPAHIKSSMFGC LTIPIT G+L+MGTWQ
Sbjct:   137 QADTETFLNRIVPEGNSAPWRHTMEGPDDMPAHIKSSMFGCQLTIPITKGKLSMGTWQ 194




GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=IDA
FB|FBgn0263355 CG31688 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
UNIPROTKB|Q60BB8 MCA0559 "Putative uncharacterized protein" [Methylococcus capsulatus str. Bath (taxid:243233)] Back     alignment and assigned GO terms
UNIPROTKB|Q47W13 CPS_4360 "Putative uncharacterized protein" [Colwellia psychrerythraea 34H (taxid:167879)] Back     alignment and assigned GO terms
TIGR_CMR|CPS_4360 CPS_4360 "conserved hypothetical protein TIGR00149" [Colwellia psychrerythraea 34H (taxid:167879)] Back     alignment and assigned GO terms
UNIPROTKB|G4MUS6 MGG_01675 "Uncharacterized protein" [Magnaporthe oryzae 70-15 (taxid:242507)] Back     alignment and assigned GO terms
UNIPROTKB|Q9KUY5 VC_0373 "Putative uncharacterized protein" [Vibrio cholerae O1 biovar El Tor str. N16961 (taxid:243277)] Back     alignment and assigned GO terms
TIGR_CMR|VC_0373 VC_0373 "conserved hypothetical protein" [Vibrio cholerae O1 biovar El Tor (taxid:686)] Back     alignment and assigned GO terms
POMBASE|SPAC4A8.02c SPAC4A8.02c "conserved protein, UPF0047 family" [Schizosaccharomyces pombe (taxid:4896)] Back     alignment and assigned GO terms
ASPGD|ASPL0000014934 AN8050 [Emericella nidulans (taxid:162425)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00002972001
SubName- Full=Chromosome chr18 scaffold_137, whole genome shotgun sequence; (236 aa)
(Vitis vinifera)
Predicted Functional Partners:
GSVIVG00019349001
RecName- Full=Ribonucleoside-diphosphate reductase; EC=1.17.4.1;; Provides the precursors neces [...] (790 aa)
       0.432
GSVIVG00008868001
SubName- Full=Chromosome chr13 scaffold_210, whole genome shotgun sequence; (278 aa)
       0.427
GSVIVG00027326001
SubName- Full=Chromosome chr19 scaffold_4, whole genome shotgun sequence; (188 aa)
       0.421
GSVIVG00016152001
SubName- Full=Chromosome chr17 scaffold_12, whole genome shotgun sequence; (138 aa)
      0.406

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query209
pfam01894118 pfam01894, UPF0047, Uncharacterized protein family 3e-50
COG0432137 COG0432, COG0432, Uncharacterized conserved protei 9e-45
TIGR00149132 TIGR00149, TIGR00149_YjbQ, secondary thiamine-phos 9e-34
>gnl|CDD|216769 pfam01894, UPF0047, Uncharacterized protein family UPF0047 Back     alignment and domain information
 Score =  158 bits (403), Expect = 3e-50
 Identities = 53/102 (51%), Positives = 75/102 (73%), Gaps = 4/102 (3%)

Query: 78  HLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSDVRDDTETFLNKIVPEGR 137
             IT ++ + + +  S  K GL H+F+ HT+ASLTINEN D DVR+D E FLN++VPE  
Sbjct: 1   IDITDEVREAVEE--SGVKNGLVHVFVPHTTASLTINENADPDVREDLERFLNRLVPED- 57

Query: 138 SASWKHTLEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQ 179
              ++H  EGPD+MPAH+KSS+ G +LT+P+T+G+L +GTWQ
Sbjct: 58  -DPYRHNEEGPDNMPAHLKSSLLGPSLTVPVTNGRLALGTWQ 98


This family has no known function. The alignment contains a conserved aspartate and histidine that may be functionally important. Length = 118

>gnl|CDD|223509 COG0432, COG0432, Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
>gnl|CDD|129253 TIGR00149, TIGR00149_YjbQ, secondary thiamine-phosphate synthase enzyme Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 209
COG0432137 Uncharacterized conserved protein [Function unknow 100.0
TIGR00149132 TIGR00149_YbjQ secondary thiamine-phosphate syntha 100.0
PF01894118 UPF0047: Uncharacterised protein family UPF0047; I 100.0
KOG3267138 consensus Uncharacterized conserved protein [Funct 100.0
>COG0432 Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
Probab=100.00  E-value=2.7e-50  Score=327.56  Aligned_cols=128  Identities=41%  Similarity=0.631  Sum_probs=121.0

Q ss_pred             CeEEEEEEEEcCCCC-eEEeCcHHHHHHHhhhccCccccEEEEEecccceEEEEeecCCcchHHHHHHHHhhhCCCCCCC
Q 028395           61 PRWAQKTVTLPPLRR-GCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSDVRDDTETFLNKIVPEGRSA  139 (209)
Q Consensus        61 M~~~~~tItV~T~~r-~~~dIT~eV~~~V~~~~SgIk~Giv~Vf~~HTTAsLtInEn~DP~l~~Dl~~~L~rLVP~~~~~  139 (209)
                      |+|+|++|+|+|+++ +++|||++|+++|++  ||+++|+|+||++||||||+||| +||+|++||+++|++|+|++.  
T Consensus         1 m~~~~~~l~v~T~~r~~~vdIT~ev~~~v~e--sgv~~Gl~~vf~~HtTaal~inE-~ep~l~~Di~~~l~~lvP~~~--   75 (137)
T COG0432           1 MKVYQKELTVSTKRRIEFVDITDEVEKFVRE--SGVKNGLLLVFVPHTTAALTINE-AEPGLKEDIERFLEKLVPEGA--   75 (137)
T ss_pred             CceEEEEEEEeccCccceEEchHHHHHHHHH--cCCccceEEEEecCcceEEEEec-CCCcHHHHHHHHHHHhCCCCC--
Confidence            789999999999987 999999999999998  99999999999999999999999 699999999999999999985  


Q ss_pred             CceeCccCCCChhhhhhhhccCceEEEEeeCCeecccccceEeee----cCC-ceeeEE
Q 028395          140 SWKHTLEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQVCRNG----LGF-RNYSLS  193 (209)
Q Consensus       140 ~Y~H~~eG~dN~~AHIKSsLlG~SltIPV~dGkL~LGTWQ~I~~~----~~~-R~v~~~  193 (209)
                      .|+|+.+|+|||+|||||+|+|+|++|||.||+|.|||||+|||+    ++. |+|+|.
T Consensus        76 ~Y~H~~~~~Dn~~aHlkasllG~S~~iPv~~GrL~LGTWQ~I~~~E~dg~r~~R~v~v~  134 (137)
T COG0432          76 GYRHDEEGPDNAPAHLKASLLGPSLTIPVINGRLVLGTWQGIFLVEFDGPRHRRRVVVK  134 (137)
T ss_pred             CcccccCCCCchHHHHHHHhcCceEEEEEeCCeEceecccEEEEEEecCCCCccEEEEE
Confidence            599999999999999999999999999999999999999999874    344 777775



>TIGR00149 TIGR00149_YbjQ secondary thiamine-phosphate synthase enzyme Back     alignment and domain information
>PF01894 UPF0047: Uncharacterised protein family UPF0047; InterPro: IPR001602 This family contains small uncharacterised proteins of 14 to 16 kDa mainly from bacteria although the signatures also occur in a hypothetical protein from archaea and from yeast Back     alignment and domain information
>KOG3267 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query209
1vmh_A144 Crystal Structure Of An Uncharacterized Conserved P 2e-11
1xbf_A140 X-Ray Structure Northeast Structural Genomics Conso 6e-11
1vmj_A151 Crystal Structure Of A Putative Thiamin Phosphate S 7e-09
1vmf_A145 Crystal Structure Of A Ybjq-Like Fold Protein Of Un 9e-09
2p6c_A137 Crystal Structure Of Hypothetical Protein Aq_2013 F 2e-07
1ve0_A134 Crystal Structure Of Uncharacterized Protein St2072 2e-07
1vph_A149 Crystal Structure Of A Ybjq-Like Protein Of Unknown 5e-06
2p6h_A134 Crystal Structure Of Hypothetical Protein Ape1520 F 6e-06
2cu5_A129 Crystal Structure Of The Conserved Hypothetical Pro 3e-04
>pdb|1VMH|A Chain A, Crystal Structure Of An Uncharacterized Conserved Protein YjbqUPF0047 Family, Ortholog Yugu B.Subtilis (Ca_c0907) From Clostridium Acetobutylicum At 1.31 A Resolution Length = 144 Back     alignment and structure

Iteration: 1

Score = 65.9 bits (159), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 39/91 (42%), Positives = 55/91 (60%), Gaps = 5/91 (5%) Query: 89 AQDLSEFKCGLAHLFLLHTSASLTINENYDSDVRDDTETFLNKIVPEGRSASWKHTLEGP 148 A D S G+A +F HT+A +TINEN D DV D L+K+ P + +KH +EG Sbjct: 39 AVDESGVSDGMAVVFCPHTTAGITINENADPDVTRDILVNLDKVFP--KVGDYKH-VEG- 94 Query: 149 DDMPAHIKSSMFGCTLTIPITDGQLNMGTWQ 179 + AHIK+S+ G + I I +G+L +GTWQ Sbjct: 95 -NSHAHIKASLMGSSQQIIIENGKLKLGTWQ 124
>pdb|1XBF|A Chain A, X-Ray Structure Northeast Structural Genomics Consortium Target Car10 From C. Acetobutylicum Length = 140 Back     alignment and structure
>pdb|1VMJ|A Chain A, Crystal Structure Of A Putative Thiamin Phosphate Synthase (Tm0723) From Thermotoga Maritima Msb8 At 1.52 A Resolution Length = 151 Back     alignment and structure
>pdb|1VMF|A Chain A, Crystal Structure Of A Ybjq-Like Fold Protein Of Unknown Function (Bh3498) From Bacillus Halodurans At 1.46 A Resolution Length = 145 Back     alignment and structure
>pdb|2P6C|A Chain A, Crystal Structure Of Hypothetical Protein Aq_2013 From Aquifex Aeolicus Vf5. Length = 137 Back     alignment and structure
>pdb|1VE0|A Chain A, Crystal Structure Of Uncharacterized Protein St2072 From Sulfolobus Tokodaii Length = 134 Back     alignment and structure
>pdb|1VPH|A Chain A, Crystal Structure Of A Ybjq-Like Protein Of Unknown Function (Sso2532) From Sulfolobus Solfataricus P2 At 1.76 A Resolution Length = 149 Back     alignment and structure
>pdb|2P6H|A Chain A, Crystal Structure Of Hypothetical Protein Ape1520 From Aeropyrum Pernix K1 Length = 134 Back     alignment and structure
>pdb|2CU5|A Chain A, Crystal Structure Of The Conserved Hypothetical Protein Tt1486 From Thermus Thermophilus Hb8 Length = 129 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query209
1vmj_A151 Hypothetical protein TM0723; putative thiamin phos 3e-30
2p6c_A137 AQ_2013 protein; NPPSFA, national project on prote 5e-29
1vmf_A145 Hypothetical protein; structural genomics, joint c 1e-28
2cu5_A129 Conserved hypothetical protein TT1486; thermus the 2e-27
1vmh_A144 Uncharacterized conserved protein YJBQ/UPF0047 FA 2e-26
1ve0_A134 Hypothetical protein (ST2072); structural genomics 2e-25
1vph_A149 Hypothetical protein SSO2532; YBJQ-like fold, stru 3e-23
2p6h_A134 Hypothetical protein; structural genomics, unknown 8e-23
>1vmj_A Hypothetical protein TM0723; putative thiamin phosphate synthase, structural genomics, JO center for structural genomics, JCSG; 1.52A {Thermotoga maritima} SCOP: d.273.1.1 Length = 151 Back     alignment and structure
 Score =  107 bits (269), Expect = 3e-30
 Identities = 36/130 (27%), Positives = 66/130 (50%), Gaps = 8/130 (6%)

Query: 51  NPNPNPMASNPRWAQKTVTLP-PLRRGCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSA 109
           + + + M    +  +K +      RR    ITP + + + +  S  K GL     +H +A
Sbjct: 7   HHHHHHM----KSYRKELWFHTKRRREFINITPLLEECVRE--SGIKEGLLLCNAMHITA 60

Query: 110 SLTINENYDSDVRDDTETFLNKIVPEGRSASWKHTLEGPDDMPAHIKSSMFGCTLTIPIT 169
           S+ IN+  +  +  D E +L K+ PE   + +KH   G D+  AH+K ++ G  + I IT
Sbjct: 61  SVFIND-DEPGLHHDFEVWLEKLAPEKPYSQYKHNDTGEDNADAHLKRTIMGREVVIAIT 119

Query: 170 DGQLNMGTWQ 179
           D ++++G W+
Sbjct: 120 DRKMDLGPWE 129


>2p6c_A AQ_2013 protein; NPPSFA, national project on protein structural and functiona analyses, riken structural genomics/proteomics initiative; 2.00A {Aquifex aeolicus} Length = 137 Back     alignment and structure
>1vmf_A Hypothetical protein; structural genomics, joint center for structural genomics, J protein structure initiative, PSI, unknown function; HET: EPE; 1.46A {Bacillus halodurans} SCOP: d.273.1.1 Length = 145 Back     alignment and structure
>2cu5_A Conserved hypothetical protein TT1486; thermus thermophilus HB8, ST genomics, riken structural genomics/proteomics initiative; 1.84A {Thermus thermophilus} Length = 129 Back     alignment and structure
>1vmh_A Uncharacterized conserved protein YJBQ/UPF0047 FA ortholog YUGU B.subtilis; YJBQ-like fold, structural genomics; 1.31A {Clostridium acetobutylicum} SCOP: d.273.1.1 PDB: 1xbf_A Length = 144 Back     alignment and structure
>1ve0_A Hypothetical protein (ST2072); structural genomics, zinc binding protein, metal binding Pro; 2.00A {Sulfolobus tokodaii} Length = 134 Back     alignment and structure
>1vph_A Hypothetical protein SSO2532; YBJQ-like fold, structural genomics, joint center for struct genomics, JCSG, protein structure initiative; 1.76A {Sulfolobus solfataricus} SCOP: d.273.1.1 Length = 149 Back     alignment and structure
>2p6h_A Hypothetical protein; structural genomics, unknown function, NPPSFA, national PROJ protein structural and functional analyses; 1.95A {Aeropyrum pernix} Length = 134 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query209
2p6c_A137 AQ_2013 protein; NPPSFA, national project on prote 100.0
1vmj_A151 Hypothetical protein TM0723; putative thiamin phos 100.0
1vph_A149 Hypothetical protein SSO2532; YBJQ-like fold, stru 100.0
2p6h_A134 Hypothetical protein; structural genomics, unknown 100.0
1ve0_A134 Hypothetical protein (ST2072); structural genomics 100.0
1vmh_A144 Uncharacterized conserved protein YJBQ/UPF0047 FA 100.0
1vmf_A145 Hypothetical protein; structural genomics, joint c 100.0
2cu5_A129 Conserved hypothetical protein TT1486; thermus the 100.0
>2p6c_A AQ_2013 protein; NPPSFA, national project on protein structural and functiona analyses, riken structural genomics/proteomics initiative; 2.00A {Aquifex aeolicus} Back     alignment and structure
Probab=100.00  E-value=1.2e-50  Score=327.83  Aligned_cols=128  Identities=26%  Similarity=0.417  Sum_probs=120.8

Q ss_pred             CeEEEEEEEEcCCC-CeEEeCcHHHHHHHhhhccCccccEEEEEecccceEEEEeecCCcchHHHHHHHHhhhCCCCCCC
Q 028395           61 PRWAQKTVTLPPLR-RGCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSDVRDDTETFLNKIVPEGRSA  139 (209)
Q Consensus        61 M~~~~~tItV~T~~-r~~~dIT~eV~~~V~~~~SgIk~Giv~Vf~~HTTAsLtInEn~DP~l~~Dl~~~L~rLVP~~~~~  139 (209)
                      |+++|++|+|+|++ ++++|||++|+++|++  +|+++|+|+||++||||||+|||| ||+|++||+++|++|||++.  
T Consensus         1 M~~~~~~i~~~t~~~~~~~dIT~~V~~~v~~--sgi~~Gl~~vf~~HTTasl~inEn-dp~v~~Dl~~~l~~lvP~~~--   75 (137)
T 2p6c_A            1 MKAYTKYLTFNTKKRRELIRITDEVKKAVEE--SEVKEGLCLVSSMHLTSSVIIQDD-EEGLHEDIWEWLEKLAPYRP--   75 (137)
T ss_dssp             CEEEEEEEEECCSSSSEEEECHHHHHHHHHH--HTCSSEEEEEEESSTTEEEEEECC-CHHHHHHHHHHHHHHSCCCT--
T ss_pred             CcEEEEEEEEecCCCCeEEECHHHHHHHHHH--cCCCceEEEEEeCCCeEEEEEEcC-CccHHHHHHHHHHHHCCCCC--
Confidence            89999999999986 6999999999999998  999999999999999999999999 99999999999999999874  


Q ss_pred             CceeCccCCCChhhhhhhhccCceEEEEeeCCeecccccceEeee----cCCceeeEE
Q 028395          140 SWKHTLEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQVCRNG----LGFRNYSLS  193 (209)
Q Consensus       140 ~Y~H~~eG~dN~~AHIKSsLlG~SltIPV~dGkL~LGTWQ~I~~~----~~~R~v~~~  193 (209)
                      +|+|++||+|||+|||||+|+|+|++|||.||+|.|||||+|||.    ++.|+|++.
T Consensus        76 ~y~H~~eg~dn~~AHiks~l~G~s~tipv~~G~L~LGtWQ~Iyl~E~dg~r~R~v~v~  133 (137)
T 2p6c_A           76 DYKHHRTGEDNGDAHLKNLLTHLQVVLPITNGKLDLGPWQEIFYAEFDGQRPKRVVIK  133 (137)
T ss_dssp             TCGGGGGTCCCHHHHHHHHHHCSEEEEEECSSSBCCCSSCEEEEEESSCSSCEEEEEE
T ss_pred             CcccCcCCCCCHHHhhhhheeCCeEEEEEECCEECcCCCceEEEEECCCCCccEEEEE
Confidence            599999999999999999999999999999999999999999963    456777765



>1vmj_A Hypothetical protein TM0723; putative thiamin phosphate synthase, structural genomics, JO center for structural genomics, JCSG; 1.52A {Thermotoga maritima} SCOP: d.273.1.1 Back     alignment and structure
>1vph_A Hypothetical protein SSO2532; YBJQ-like fold, structural genomics, joint center for struct genomics, JCSG, protein structure initiative; 1.76A {Sulfolobus solfataricus} SCOP: d.273.1.1 Back     alignment and structure
>2p6h_A Hypothetical protein; structural genomics, unknown function, NPPSFA, national PROJ protein structural and functional analyses; 1.95A {Aeropyrum pernix} Back     alignment and structure
>1ve0_A Hypothetical protein (ST2072); structural genomics, zinc binding protein, metal binding Pro; 2.00A {Sulfolobus tokodaii} Back     alignment and structure
>1vmh_A Uncharacterized conserved protein YJBQ/UPF0047 FA ortholog YUGU B.subtilis; YJBQ-like fold, structural genomics; 1.31A {Clostridium acetobutylicum} SCOP: d.273.1.1 PDB: 1xbf_A Back     alignment and structure
>1vmf_A Hypothetical protein; structural genomics, joint center for structural genomics, J protein structure initiative, PSI, unknown function; HET: EPE; 1.46A {Bacillus halodurans} SCOP: d.273.1.1 Back     alignment and structure
>2cu5_A Conserved hypothetical protein TT1486; thermus thermophilus HB8, ST genomics, riken structural genomics/proteomics initiative; 1.84A {Thermus thermophilus} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 209
d1vmja_139 d.273.1.1 (A:) Hypothetical protein TM0723 {Thermo 4e-37
d1vmfa_136 d.273.1.1 (A:) Hypothetical protein BH3498 {Bacill 2e-33
d1vmha_129 d.273.1.1 (A:) B.subtilis YugU ortolog CAC0907 {Cl 1e-32
d1vpha_138 d.273.1.1 (A:) Hypothetical protein SSO2532 {Sulfo 6e-29
>d1vmja_ d.273.1.1 (A:) Hypothetical protein TM0723 {Thermotoga maritima [TaxId: 2336]} Length = 139 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: YjbQ-like
superfamily: YjbQ-like
family: YjbQ-like
domain: Hypothetical protein TM0723
species: Thermotoga maritima [TaxId: 2336]
 Score =  124 bits (312), Expect = 4e-37
 Identities = 35/119 (29%), Positives = 62/119 (52%), Gaps = 4/119 (3%)

Query: 62  RWAQKTVTLP-PLRRGCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSD 120
           +  +K +      RR    ITP + + + +  S  K GL     +H +AS+ IN++    
Sbjct: 2   KSYRKELWFHTKRRREFINITPLLEECVRE--SGIKEGLLLCNAMHITASVFINDDEP-G 58

Query: 121 VRDDTETFLNKIVPEGRSASWKHTLEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQ 179
           +  D E +L K+ PE   + +KH   G D+  AH+K ++ G  + I ITD ++++G W+
Sbjct: 59  LHHDFEVWLEKLAPEKPYSQYKHNDTGEDNADAHLKRTIMGREVVIAITDRKMDLGPWE 117


>d1vmfa_ d.273.1.1 (A:) Hypothetical protein BH3498 {Bacillus halodurans [TaxId: 86665]} Length = 136 Back     information, alignment and structure
>d1vmha_ d.273.1.1 (A:) B.subtilis YugU ortolog CAC0907 {Clostridium acetobutylicum [TaxId: 1488]} Length = 129 Back     information, alignment and structure
>d1vpha_ d.273.1.1 (A:) Hypothetical protein SSO2532 {Sulfolobus solfataricus [TaxId: 2287]} Length = 138 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query209
d1vmja_139 Hypothetical protein TM0723 {Thermotoga maritima [ 100.0
d1vmfa_136 Hypothetical protein BH3498 {Bacillus halodurans [ 100.0
d1vpha_138 Hypothetical protein SSO2532 {Sulfolobus solfatari 100.0
d1vmha_129 B.subtilis YugU ortolog CAC0907 {Clostridium aceto 100.0
>d1vmja_ d.273.1.1 (A:) Hypothetical protein TM0723 {Thermotoga maritima [TaxId: 2336]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: YjbQ-like
superfamily: YjbQ-like
family: YjbQ-like
domain: Hypothetical protein TM0723
species: Thermotoga maritima [TaxId: 2336]
Probab=100.00  E-value=3.3e-49  Score=318.37  Aligned_cols=130  Identities=28%  Similarity=0.448  Sum_probs=121.4

Q ss_pred             CeEEEEEEEEcCCCC-eEEeCcHHHHHHHhhhccCccccEEEEEecccceEEEEeecCCcchHHHHHHHHhhhCCCCCCC
Q 028395           61 PRWAQKTVTLPPLRR-GCHLITPKIVKEIAQDLSEFKCGLAHLFLLHTSASLTINENYDSDVRDDTETFLNKIVPEGRSA  139 (209)
Q Consensus        61 M~~~~~tItV~T~~r-~~~dIT~eV~~~V~~~~SgIk~Giv~Vf~~HTTAsLtInEn~DP~l~~Dl~~~L~rLVP~~~~~  139 (209)
                      |++++++|+++|+++ +++|||++|+++|++  |||++|+|+||++||||||+|||| ||+|+.||+++|++|||+++..
T Consensus         1 M~~~~~~i~~~T~~~~~~~dIT~~v~~~v~~--s~i~~Giv~vf~~HTTasl~inE~-dp~~~~Dl~~~l~~lvP~~~~~   77 (139)
T d1vmja_           1 MKSYRKELWFHTKRRREFINITPLLEECVRE--SGIKEGLLLCNAMHITASVFINDD-EPGLHHDFEVWLEKLAPEKPYS   77 (139)
T ss_dssp             CEEEEEEEEECCSSSSEEEECHHHHHHHHHH--HCCSSEEEEEEESSTTEEEEEECC-CHHHHHHHHHHHHHHSCCCCGG
T ss_pred             CCcEEEEEEEECCCCCEEEEChHHHHHHHHH--hCCceEEEEEEeCCCceEEEEecC-chhHHhhHHHHHHHhhccCCcc
Confidence            899999999999874 899999999999998  999999999999999999999999 9999999999999999987434


Q ss_pred             CceeCccCCCChhhhhhhhccCceEEEEeeCCeecccccceEeee----cCCceeeEE
Q 028395          140 SWKHTLEGPDDMPAHIKSSMFGCTLTIPITDGQLNMGTWQVCRNG----LGFRNYSLS  193 (209)
Q Consensus       140 ~Y~H~~eG~dN~~AHIKSsLlG~SltIPV~dGkL~LGTWQ~I~~~----~~~R~v~~~  193 (209)
                      .|+|+.+|+|||+|||||+|+|+|++|||.||+|.|||||+|||.    ++.|+|.+.
T Consensus        78 ~y~H~~~g~dn~~aHiks~l~g~s~tipi~~G~L~LGtWQ~I~l~E~dg~r~R~v~v~  135 (139)
T d1vmja_          78 QYKHNDTGEDNADAHLKRTIMGREVVIAITDRKMDLGPWEQVFYGEFDGMRPKRVLVK  135 (139)
T ss_dssp             GCGGGTTSCCCHHHHHHHHHHCSEEEEEEETTEECCCTTCEEEEEESSCSSCEEEEEE
T ss_pred             ccCcCCccCCCcHHHHHHhhhCCeEEEEEECCEECccCCCEEEEEECcCCCceEEEEE
Confidence            699999999999999999999999999999999999999999864    467777764



>d1vmfa_ d.273.1.1 (A:) Hypothetical protein BH3498 {Bacillus halodurans [TaxId: 86665]} Back     information, alignment and structure
>d1vpha_ d.273.1.1 (A:) Hypothetical protein SSO2532 {Sulfolobus solfataricus [TaxId: 2287]} Back     information, alignment and structure
>d1vmha_ d.273.1.1 (A:) B.subtilis YugU ortolog CAC0907 {Clostridium acetobutylicum [TaxId: 1488]} Back     information, alignment and structure