Citrus Sinensis ID: 029243


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190------
MASLRNCGSFYKYGANSFDGVSALPLLLHMRGVISKSLYKKEASLFPRKGSKKLISDRARFIIPRSSGSSNSNGSDSSDQTETPFGYTRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEEKRRLASGEQVN
cccccccccccccccccccccccHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHHEEEEEEEcccccHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHccccccc
ccccccHHHHHHcccccHHHHHHHHHHHHHcccHHHHHccccccEcccccccccccccEEEEccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHcccccccc
maslrncgsfykygansfdgvsaLPLLLHMRGVISKSLYKKeaslfprkgskklISDRarfiiprssgssnsngsdssdqtetpfgytrkdvLLIGVGVTVIGYGLksglelfgvdplqagnVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEEKRRLASGEQVN
MASLRNCGSFYKYGANSFDGVSALPLLLHMRGVISKSLykkeaslfprkgskklisdrarfiiprssgssnsngsdssdQTETPFGYTRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEekrrlasgeqvn
MASLRNCGSFYKYGANSFDGVSALPLLLHMRGVISKSLYKKEASLFPRKGSKKLISDRARFIIPRssgssnsngsdssdQTETPFGYTRKDvlligvgvtvigyglKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRleglteaelealieqveeeKRRLASGEQVN
*****NCGSFYKYGANSFDGVSALPLLLHMRGVISKSLYKKEASL***************************************FGYTRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKV**********************************
*****N*GSFYKYGANSFDGVSALPLLLHMRGVI***************************************************GYTRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLT*************************
MASLRNCGSFYKYGANSFDGVSALPLLLHMRGVISKSLYKKEASLFPRKGSKKLISDRARFIIPRS****************TPFGYTRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVE*************
**SLRNCGSFYKYGANSFDGVSALPLLLHMRGVISKSLYKKEASLFP***********************************TPFGYTRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEEK**********
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHoooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHoooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MASLRNCGSFYKYGANSFDGVSALPLLLHMRGVISKSLYKKEASLFPRKGSKKLISDRARFIIPRSSGSSNSNGSDSSDQTETPFGYTRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRLxxxxxxxxxxxxxxxxxxxxxxxxxxxxN
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query196
225456193218 PREDICTED: uncharacterized protein LOC10 0.994 0.894 0.691 2e-53
297734347210 unnamed protein product [Vitis vinifera] 0.994 0.928 0.691 2e-53
118488661204 unknown [Populus trichocarpa] 0.943 0.906 0.639 3e-53
125528076209 hypothetical protein OsI_04123 [Oryza sa 0.617 0.578 0.735 9e-48
115440557209 Os01g0805200 [Oryza sativa Japonica Grou 0.617 0.578 0.735 1e-47
242059007200 hypothetical protein SORBIDRAFT_03g03741 0.739 0.725 0.641 2e-47
18401689190 uncharacterized protein [Arabidopsis tha 0.775 0.8 0.596 1e-44
21554118190 unknown [Arabidopsis thaliana] 0.637 0.657 0.714 1e-44
297834704190 hypothetical protein ARALYDRAFT_898147 [ 0.637 0.657 0.714 2e-44
9294497130 unnamed protein product [Arabidopsis tha 0.637 0.961 0.714 2e-44
>gi|225456193|ref|XP_002278927.1| PREDICTED: uncharacterized protein LOC100245848 [Vitis vinifera] Back     alignment and taxonomy information
 Score =  214 bits (545), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 146/211 (69%), Positives = 165/211 (78%), Gaps = 16/211 (7%)

Query: 1   MASLR----NCGSFYKYGANSFDGVSALPLLLH--------MRG-VISKSLYKKEASLFP 47
           MASL     NC  F + G+ S  GVSALPLL          MRG +IS+S Y KE S FP
Sbjct: 9   MASLPLKSPNCSIFCERGSISIMGVSALPLLQQRVRAVGGLMRGHIISQSAYTKETSTFP 68

Query: 48  RKGSKKLISDRARFIIPRSSGSSNSNG--SDSSDQTETPFGYTRKDVLLIGVGVTVIGYG 105
           RKGS  +I  +  F  PRSSGSSNS+G   DS++Q + PFGY RKDVLLIGVGVTV+G G
Sbjct: 69  RKGSNSMIPKKLVFT-PRSSGSSNSDGSKDDSTNQDKVPFGYNRKDVLLIGVGVTVLGIG 127

Query: 106 LKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQK 165
           LKSGLE  GVDPLQAGNVV++++VLGLT+GW+STYIFRVSNKEMTYAQQLRDYE+KVMQK
Sbjct: 128 LKSGLEFVGVDPLQAGNVVQLVLVLGLTIGWISTYIFRVSNKEMTYAQQLRDYENKVMQK 187

Query: 166 RLEGLTEAELEALIEQVEEEKRRLASGEQVN 196
           RLEGLTEAELE L+EQVEEEKRRL SGEQVN
Sbjct: 188 RLEGLTEAELEVLLEQVEEEKRRLVSGEQVN 218




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|297734347|emb|CBI15594.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|118488661|gb|ABK96142.1| unknown [Populus trichocarpa] Back     alignment and taxonomy information
>gi|125528076|gb|EAY76190.1| hypothetical protein OsI_04123 [Oryza sativa Indica Group] Back     alignment and taxonomy information
>gi|115440557|ref|NP_001044558.1| Os01g0805200 [Oryza sativa Japonica Group] gi|20804576|dbj|BAB92268.1| unknown protein [Oryza sativa Japonica Group] gi|113534089|dbj|BAF06472.1| Os01g0805200 [Oryza sativa Japonica Group] gi|125572357|gb|EAZ13872.1| hypothetical protein OsJ_03796 [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|242059007|ref|XP_002458649.1| hypothetical protein SORBIDRAFT_03g037410 [Sorghum bicolor] gi|241930624|gb|EES03769.1| hypothetical protein SORBIDRAFT_03g037410 [Sorghum bicolor] Back     alignment and taxonomy information
>gi|18401689|ref|NP_566593.1| uncharacterized protein [Arabidopsis thaliana] gi|14517430|gb|AAK62605.1| AT3g17930/MEB5_15 [Arabidopsis thaliana] gi|20147323|gb|AAM10375.1| AT3g17930/MEB5_15 [Arabidopsis thaliana] gi|332642505|gb|AEE76026.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|21554118|gb|AAM63198.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297834704|ref|XP_002885234.1| hypothetical protein ARALYDRAFT_898147 [Arabidopsis lyrata subsp. lyrata] gi|297331074|gb|EFH61493.1| hypothetical protein ARALYDRAFT_898147 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|9294497|dbj|BAB02716.1| unnamed protein product [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query196
TAIR|locus:2088540190 DAC "AT3G17930" [Arabidopsis t 0.653 0.673 0.510 1.5e-26
TAIR|locus:2088540 DAC "AT3G17930" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 299 (110.3 bits), Expect = 1.5e-26, P = 1.5e-26
 Identities = 71/139 (51%), Positives = 81/139 (58%)

Query:    32 GVISKSLYKKEA---SLFPRKGSKKLISDRARFIIPRXXXXXXXXXXXXXXQTET-PFGY 87
             G   K LY+ E    S    KG K+    R R    R              +T+T PFGY
Sbjct:    35 GAAKKVLYRFEGHRLSSSVEKGEKQ----RMRC---RVSSNSTETEDDSATKTKTTPFGY 87

Query:    88 TRKDXXXXXXXXXXXXXXXKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNK 147
             TRKD               +SGLE  GVDPLQAGN V++I+VLGLTLGW+STYIFRV NK
Sbjct:    88 TRKDVILIGVGVTALGIGLESGLEYVGVDPLQAGNAVQLILVLGLTLGWISTYIFRVGNK 147

Query:   148 EMTYAQQLRDYEDKVMQKR 166
             EMTYAQQLRDYE +VMQKR
Sbjct:   148 EMTYAQQLRDYESQVMQKR 166


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.320   0.136   0.389    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      196       148   0.00068  104 3  11 22  0.45    31
                                                     30  0.39    34


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  567 (60 KB)
  Total size of DFA:  121 KB (2078 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  10.90u 0.12s 11.02t   Elapsed:  00:00:01
  Total cpu time:  10.90u 0.12s 11.02t   Elapsed:  00:00:01
  Start:  Fri May 10 12:02:35 2013   End:  Fri May 10 12:02:36 2013


GO:0003674 "molecular_function" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
GO:0000023 "maltose metabolic process" evidence=RCA
GO:0006098 "pentose-phosphate shunt" evidence=RCA
GO:0009902 "chloroplast relocation" evidence=RCA
GO:0010027 "thylakoid membrane organization" evidence=RCA
GO:0010207 "photosystem II assembly" evidence=RCA
GO:0019252 "starch biosynthetic process" evidence=RCA
GO:0034660 "ncRNA metabolic process" evidence=RCA
GO:0043085 "positive regulation of catalytic activity" evidence=RCA
GO:0010190 "cytochrome b6f complex assembly" evidence=IMP

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00150423
SubName- Full=Putative uncharacterized protein; (206 aa)
(Populus trichocarpa)
Predicted Functional Partners:
eugene3.00130190
SubName- Full=Putative uncharacterized protein; (339 aa)
      0.592
eugene3.00090150
hypothetical protein (129 aa)
      0.588
gw1.I.6006.1
hypothetical protein (200 aa)
      0.587
gw1.XIII.355.1
hypothetical protein (158 aa)
      0.586
gw1.70.142.1
hypothetical protein (191 aa)
      0.565
eugene3.00160848
hypothetical protein (215 aa)
      0.564
fgenesh4_pm.C_LG_II001205
RecName- Full=Photosystem II reaction center psb28 protein; (115 aa)
      0.558
estExt_Genewise1_v1.C_LG_II4195
SubName- Full=Putative uncharacterized protein; (140 aa)
      0.552
gw1.IV.4064.1
hypothetical protein (225 aa)
      0.551
gw1.123.192.1
SubName- Full=Putative uncharacterized protein; (218 aa)
      0.549

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query196
pfam11460104 pfam11460, DUF3007, Protein of unknown function (D 9e-38
>gnl|CDD|221117 pfam11460, DUF3007, Protein of unknown function (DUF3007) Back     alignment and domain information
 Score =  125 bits (317), Expect = 9e-38
 Identities = 46/105 (43%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 87  YTRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSN 146
            TR DVLLIG+GV ++G     GL+  G+D L AG   + ++VLG+ +GW  +Y+FRV  
Sbjct: 1   MTRIDVLLIGLGVFLLGGLGYGGLQAAGLDGLSAGIASQALLVLGV-VGWTGSYLFRVVT 59

Query: 147 KEMTYAQQLRDYEDKV-------MQKRLEGLTEAELEALIEQVEE 184
            +MTY QQ RDY +         +QKR + L+  ELEAL  ++EE
Sbjct: 60  GKMTYMQQRRDYREAYDALTNEELQKRFDSLSPEELEALQAELEE 104


This is a family of uncharacterized proteins found in bacteria and eukaryotes. Length = 104

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 196
PF11460104 DUF3007: Protein of unknown function (DUF3007); In 100.0
PF14163151 SieB: Superinfection exclusion protein B 94.75
PF04156191 IncA: IncA protein; InterPro: IPR007285 Chlamydia 93.92
PF12576141 DUF3754: Protein of unknown function (DUF3754); In 89.16
PF0630568 DUF1049: Protein of unknown function (DUF1049); In 86.54
PF0651893 DUF1104: Protein of unknown function (DUF1104); In 86.27
PF03250147 Tropomodulin: Tropomodulin; InterPro: IPR004934 Ac 85.91
PF0862149 RPAP1_N: RPAP1-like, N-terminal; InterPro: IPR0139 83.69
PF04120132 Iron_permease: Low affinity iron permease ; InterP 82.49
>PF11460 DUF3007: Protein of unknown function (DUF3007); InterPro: IPR021562 This is a family of uncharacterised proteins found in bacteria and eukaryotes Back     alignment and domain information
Probab=100.00  E-value=1.8e-51  Score=318.75  Aligned_cols=97  Identities=55%  Similarity=0.856  Sum_probs=95.5

Q ss_pred             ccchhhhhhhHHHHHHHHHHHHhhHHhCCCcccchhHHHHHHHhhhhhheeeeeeeeeecCCccHHHHHHHHHHHH----
Q 029243           87 YTRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKV----  162 (196)
Q Consensus        87 MtR~DVI~IGlgvfllGg~ly~gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE~av----  162 (196)
                      |||+|||+||+++|++|+++|++|+++|+|+++||||||++||+|+ |+|++||+|||+||+|||||||||||+++    
T Consensus         1 MtR~dvl~Iglgv~~~Gg~~Y~~l~~~G~d~~~AGi~sq~~lv~gl-vgW~~sYlfRV~t~~MTy~~Q~k~Ye~a~~~~~   79 (104)
T PF11460_consen    1 MTRIDVLLIGLGVFLLGGLLYGGLQAAGLDSLSAGIWSQALLVLGL-VGWVSSYLFRVVTGKMTYMQQRKDYEEAVDQLT   79 (104)
T ss_pred             CcccceeeecHHHHHHHHHHHHHHHHcCCCchhhhHHHHHHHHHHH-HHHHhHHHhhhccCCCcHHHHHHHHHHHHHHHh
Confidence            8999999999999999999999999999999999999999999986 99999999999999999999999999999    


Q ss_pred             ---HHHHHhcCCHHHHHHHHHHHHH
Q 029243          163 ---MQKRLEGLTEAELEALIEQVEE  184 (196)
Q Consensus       163 ---LQKRleeLtpEEle~L~aEIE~  184 (196)
                         ||||||||||||+|+||+||||
T Consensus        80 ~~~lqkRle~l~~eE~~~L~~eiee  104 (104)
T PF11460_consen   80 NEELQKRLEELSPEELEALQAEIEE  104 (104)
T ss_pred             HHHHHHHHHhCCHHHHHHHHHHhcC
Confidence               9999999999999999999985



>PF14163 SieB: Superinfection exclusion protein B Back     alignment and domain information
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion Back     alignment and domain information
>PF12576 DUF3754: Protein of unknown function (DUF3754); InterPro: IPR022227 This domain family is found in bacteria, archaea and eukaryotes, and is typically between 135 and 166 amino acids in length Back     alignment and domain information
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function Back     alignment and domain information
>PF06518 DUF1104: Protein of unknown function (DUF1104); InterPro: IPR009488 This family consists of several hypothetical proteins of unknown function which appear to be found exclusively in Helicobacter pylori Back     alignment and domain information
>PF03250 Tropomodulin: Tropomodulin; InterPro: IPR004934 Actin filaments have an intrinsic polarity, each with a fast-growing (barbed) end and a slow-growing (pointed) end Back     alignment and domain information
>PF08621 RPAP1_N: RPAP1-like, N-terminal; InterPro: IPR013930 Inhibition of RNA polymerase II-associated protein 1 (RPAP1) synthesis in Saccharomyces cerevisiae (Baker's yeast) results in changes in global gene expression that are similar to those caused by the loss of the RNAPII subunit Rpb11 [] Back     alignment and domain information
>PF04120 Iron_permease: Low affinity iron permease ; InterPro: IPR007251 Although originally identified as a low-affinity iron(II) permease [, ], Fet4 has since been shown to import several other transition metal ions, including copper [, ] and zinc [] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query196
1sf9_A128 YFHH hypothetical protein; structural genomics, un 90.14
3eff_K139 Voltage-gated potassium channel; FULL length KCSA, 88.61
3oop_A143 LIN2960 protein; protein structure initiative, PSI 83.64
>1sf9_A YFHH hypothetical protein; structural genomics, unknown function, PSI, protein structure initiative, midwest center for structural genomics; 1.71A {Bacillus subtilis} SCOP: b.34.15.1 Back     alignment and structure
Probab=90.14  E-value=0.21  Score=39.91  Aligned_cols=36  Identities=25%  Similarity=0.350  Sum_probs=27.6

Q ss_pred             HHHHHHHHHhcCCHHHHHHHHHHHHHHHhccccCCC
Q 029243          159 EDKVMQKRLEGLTEAELEALIEQVEEEKRRLASGEQ  194 (196)
Q Consensus       159 E~avLQKRleeLtpEEle~L~aEIE~Ek~~~~~~~~  194 (196)
                      ....|+||+.+||++||+.-.+.+.++...+.+...
T Consensus        21 ~~~~mekrySeMS~~EL~~EI~~L~EKaRKAEq~Gi   56 (128)
T 1sf9_A           21 QSNAMEKRYSQMTPHELNTEIALLSEKARKAEQHGI   56 (128)
T ss_dssp             HHHHHHHHHHTCCHHHHHHHHHHHHHHHHHHHHTTC
T ss_pred             ecchhhHHHHHcCHHHHHHHHHHHHHHHHHHHHcCC
Confidence            355689999999999999888888776655544433



>3eff_K Voltage-gated potassium channel; FULL length KCSA, bulge helix, cell membrane, ION transport, ionic channel, membrane, transmembrane; 3.80A {Streptomyces lividans} Back     alignment and structure
>3oop_A LIN2960 protein; protein structure initiative, PSI-2, structural genomics, MI center for structural genomics, MCSG, unknown function; 1.78A {Listeria innocua} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query196
d1sf9a_118 Hypothetical protein YfhH {Bacillus subtilis [TaxI 90.2
d2fgea3268 Presequence protease 1, PREP1 {Thale cress (Arabid 88.99
>d1sf9a_ b.34.15.1 (A:) Hypothetical protein YfhH {Bacillus subtilis [TaxId: 1423]} Back     information, alignment and structure
class: All beta proteins
fold: SH3-like barrel
superfamily: Hypothetical protein YfhH
family: Hypothetical protein YfhH
domain: Hypothetical protein YfhH
species: Bacillus subtilis [TaxId: 1423]
Probab=90.20  E-value=0.099  Score=39.46  Aligned_cols=37  Identities=24%  Similarity=0.386  Sum_probs=27.8

Q ss_pred             HHHHHHHHHHhcCCHHHHHHHHHHHHHHHhccccCCC
Q 029243          158 YEDKVMQKRLEGLTEAELEALIEQVEEEKRRLASGEQ  194 (196)
Q Consensus       158 YE~avLQKRleeLtpEEle~L~aEIE~Ek~~~~~~~~  194 (196)
                      +..-.|+||+.+||+.||+.-.+.+.+....+.+..+
T Consensus        10 ~~~~~m~krySeMs~~EL~~EIa~L~EKaRKAEq~Gi   46 (118)
T d1sf9a_          10 FQSNAMEKRYSQMTPHELNTEIALLSEKARKAEQHGI   46 (118)
T ss_dssp             HHHHHHHHHHHTCCHHHHHHHHHHHHHHHHHHHHTTC
T ss_pred             eecchhhHHHHhcCHHHHHHHHHHHHHHHHHHHHccC
Confidence            3466899999999999999888888775555444433



>d2fgea3 d.185.1.1 (A:272-539) Presequence protease 1, PREP1 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure