Citrus Sinensis ID: 028413


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------21
MGPDHPHYLQSFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGEWTASNFHPYLPLETYLTCRFFSARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEILALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVAKDEVASLWKICDSNSEALSYLAEFYDLSSIDKRVHEVNLKSTHGIVS
cccccHHHHHHHHHHHHHHHccccEEEEcccccHHHHHHHHHHHccccEEEEEEccccccccccccccccccccEEEEcccHHHHHHHHHHHHccccccccEEEEccccccHHHHHHHHHHHHHHccccccccccEEEEcccccHHHHHHHHHHHHHcccccHHHccccEEEEccHHHHHHHHHHHHcccccccccccHHccccccccc
cccccHHHHHHHHHHHHHHHccccEEEEccccHHHHHHHcccHHcccccccEEEccccccccccccccccccccEEEEEEEEEEEEEEEEEEcccccccccEEEEEccccccHHHHHHHHHHHHHccccccccccEEEEccHHHHHHHHHHHHHHHHHcccccHHHccEEEEEccHHHHHHHHHHHHHcccccccHHHHHHHHcccccc
mgpdhphylqsfELGGEIARLLdcttwsgagpglmDAVTKGAmqagkpvggfkvgkeagewtasnfhpylpletylTCRFfsarkhglidcavrndscdrtavvalpggvgtLDEMFEILALIQLErigselpvpflVMNYDSFYKKLLDFLgdcedwgtVAKDEVASLWKICDSNSEALSYLAEFydlssidkrVHEVNLKSTHGIVS
MGPDHPHYLQSFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGEWTASNFHPYLPLETYLTCRFFSARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEILALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVAKDEVASLWKICDSNSEALSYLAEFYDLSSIDKRVHEVNLKSTHGIVS
MGPDHPHYLQSFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGEWTASNFHPYLPLETYLTCRFFSARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEILALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVAKDEVASLWKICDSNSEALSYLAEFYDLSSIDKRVHEVNLKSTHGIVS
*******YLQSFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGEWTASNFHPYLPLETYLTCRFFSARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEILALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVAKDEVASLWKICDSNSEALSYLAEFYDLSSIDKRVHEV**********
***DHPHYLQSFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGEWTASNFHPYLPLETYLTCRFFSARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEILALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVAKDEVASLWKICDSNSEALSYLAEFYDLSS******************
MGPDHPHYLQSFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGEWTASNFHPYLPLETYLTCRFFSARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEILALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVAKDEVASLWKICDSNSEALSYLAEFYDLSSIDKRVHEVNLKSTHGIVS
*GPDHPHYLQSFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGEWTASNFHPYLPLETYLTCRFFSARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEILALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVAKDEVASLWKICDSNSEALSYLAEFYDLSS******************
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGPDHPHYLQSFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGEWTASNFHPYLPLETYLTCRFFSARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEILALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVAKDEVASLWKICDSNSEALSYLAEFYDLSSIDKRVHEVNLKSTHGIVS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query209 2.2.26 [Sep-21-2011]
Q7XDB8204 Probable cytokinin ribosi no no 0.679 0.696 0.267 7e-05
B9F166244 Probable cytokinin ribosi no no 0.746 0.639 0.258 0.0003
P46378198 LOG family protein ORF6 i N/A no 0.550 0.580 0.292 0.0004
>sp|Q7XDB8|LOGLA_ORYSJ Probable cytokinin riboside 5'-monophosphate phosphoribohydrolase LOGL10 OS=Oryza sativa subsp. japonica GN=LOGL10 PE=2 SV=1 Back     alignment and function desciption
 Score = 47.4 bits (111), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 43/161 (26%), Positives = 69/161 (42%), Gaps = 19/161 (11%)

Query: 2   GPDHPHYLQSFELGGE-IARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGE 60
           G    ++  + ELG E +AR +D   + G   GLM  V++     G+ V         G 
Sbjct: 18  GKKRSYHDAAIELGNELVARSIDLV-YGGGSIGLMGLVSQAVFDGGRHV--------IGV 68

Query: 61  WTASNFHPYLPLETYLTCRFFSARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEIL 120
              +   P +  ET    R  +       + A ++D     A +ALPGG GTL+E+ E++
Sbjct: 69  IPKTLMTPEISGETVGEVRPVADMHQRKAEMARQSD-----AFIALPGGYGTLEELLEVI 123

Query: 121 ALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTV 161
              QL         P  ++N D +Y  LL F+    + G +
Sbjct: 124 TWAQL----GIHHKPVGLLNVDGYYNSLLTFIDKAVEEGFI 160




Cytokinin-activating enzyme working in the direct activation pathway. Phosphoribohydrolase that converts inactive cytokinin nucleotides to the biologically active free-base forms.
Oryza sativa subsp. japonica (taxid: 39947)
EC: 3EC: .EC: 2EC: .EC: 2EC: .EC: nEC: 1
>sp|B9F166|LOGL2_ORYSJ Probable cytokinin riboside 5'-monophosphate phosphoribohydrolase LOGL2 OS=Oryza sativa subsp. japonica GN=LOGL2 PE=3 SV=1 Back     alignment and function description
>sp|P46378|FAS6_RHOFA LOG family protein ORF6 in fasciation locus OS=Rhodococcus fascians GN=fas6 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query209
255576280302 carboxy-lyase, putative [Ricinus communi 0.980 0.678 0.776 2e-91
156138540305 putative lysine decarbxylase [Nicotiana 0.923 0.632 0.777 1e-90
356527115305 PREDICTED: probable cytokinin riboside 5 0.980 0.672 0.760 2e-90
224100075226 predicted protein [Populus trichocarpa] 0.904 0.836 0.820 2e-90
357459923 312 hypothetical protein MTR_3g055920 [Medic 1.0 0.669 0.752 5e-89
225460791 310 PREDICTED: uncharacterized protein LOC10 1.0 0.674 0.746 1e-88
449467507298 PREDICTED: probable cytokinin riboside 5 0.980 0.687 0.726 2e-88
357459921 312 hypothetical protein MTR_3g055920 [Medic 1.0 0.669 0.742 7e-88
50399936 335 expressed protein [Oryza sativa Japonica 0.918 0.573 0.75 5e-86
242038951 329 hypothetical protein SORBIDRAFT_01g01550 0.942 0.598 0.737 9e-86
>gi|255576280|ref|XP_002529033.1| carboxy-lyase, putative [Ricinus communis] gi|223531513|gb|EEF33344.1| carboxy-lyase, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  340 bits (872), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 160/206 (77%), Positives = 178/206 (86%), Gaps = 1/206 (0%)

Query: 1   MGPDHPHYLQSFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGE 60
           MGPDHPHYLQ+ EL  E+A+LLDCT+W GAGPGLMDA  KGA+QAGKPVGGFK+ KEAGE
Sbjct: 97  MGPDHPHYLQALELSREVAKLLDCTSWMGAGPGLMDATIKGALQAGKPVGGFKIAKEAGE 156

Query: 61  WTASNFHPYLPLETYLTCRFFSARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEIL 120
           WTASNFHPYLP ETYLTCRFFSARKHGL+D AVRN   DRTAVVALPGG+GTLDEMFEIL
Sbjct: 157 WTASNFHPYLPSETYLTCRFFSARKHGLVDAAVRNTRSDRTAVVALPGGIGTLDEMFEIL 216

Query: 121 ALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVAKDEVASLWKICDSNSEAL 180
           ALIQLERIGS LPVPF+VMNYDSFY+KLLDF+ +CEDWGTV+K EV  LWKIC+SNSEAL
Sbjct: 217 ALIQLERIGSALPVPFIVMNYDSFYQKLLDFIQNCEDWGTVSKGEVTPLWKICNSNSEAL 276

Query: 181 SYLAEFYDL-SSIDKRVHEVNLKSTH 205
           +YL +FY+L SS D+  H     S H
Sbjct: 277 AYLTDFYNLHSSSDEYGHGKMPTSAH 302




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|156138540|emb|CAO91858.1| putative lysine decarbxylase [Nicotiana tabacum] Back     alignment and taxonomy information
>gi|356527115|ref|XP_003532159.1| PREDICTED: probable cytokinin riboside 5'-monophosphate phosphoribohydrolase LOGL3-like [Glycine max] Back     alignment and taxonomy information
>gi|224100075|ref|XP_002311733.1| predicted protein [Populus trichocarpa] gi|222851553|gb|EEE89100.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|357459923|ref|XP_003600243.1| hypothetical protein MTR_3g055920 [Medicago truncatula] gi|217074988|gb|ACJ85854.1| unknown [Medicago truncatula] gi|355489291|gb|AES70494.1| hypothetical protein MTR_3g055920 [Medicago truncatula] Back     alignment and taxonomy information
>gi|225460791|ref|XP_002276560.1| PREDICTED: uncharacterized protein LOC100243479 [Vitis vinifera] gi|297737525|emb|CBI26726.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|449467507|ref|XP_004151464.1| PREDICTED: probable cytokinin riboside 5'-monophosphate phosphoribohydrolase LOGL9-like [Cucumis sativus] gi|449524603|ref|XP_004169311.1| PREDICTED: probable cytokinin riboside 5'-monophosphate phosphoribohydrolase LOGL9-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|357459921|ref|XP_003600242.1| hypothetical protein MTR_3g055920 [Medicago truncatula] gi|355489290|gb|AES70493.1| hypothetical protein MTR_3g055920 [Medicago truncatula] Back     alignment and taxonomy information
>gi|50399936|gb|AAT76324.1| expressed protein [Oryza sativa Japonica Group] gi|108709566|gb|ABF97361.1| Possible lysine decarboxylase family protein, expressed [Oryza sativa Japonica Group] gi|215736866|dbj|BAG95795.1| unnamed protein product [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|242038951|ref|XP_002466870.1| hypothetical protein SORBIDRAFT_01g015500 [Sorghum bicolor] gi|241920724|gb|EER93868.1| hypothetical protein SORBIDRAFT_01g015500 [Sorghum bicolor] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query209
TAIR|locus:505006177306 AT1G50575 "AT1G50575" [Arabido 0.961 0.656 0.688 8.1e-76
UNIPROTKB|Q0BZM4250 HNE_2374 "Putative uncharacter 0.636 0.532 0.333 2.4e-12
TIGR_CMR|SPO_3342291 SPO_3342 "decarboxylase family 0.655 0.470 0.299 4.3e-08
TIGR_CMR|GSU_1904342 GSU_1904 "decarboxylase family 0.684 0.418 0.265 2.6e-05
CGD|CAL0000804222 orf19.1723 [Candida albicans ( 0.392 0.369 0.340 5.4e-05
UNIPROTKB|Q5AJ69222 CaO19.1723 "Putative uncharact 0.392 0.369 0.340 5.4e-05
TIGR_CMR|GSU_2766196 GSU_2766 "decarboxylase family 0.210 0.224 0.437 6.8e-05
TAIR|locus:2828223213 LOG1 "AT2G28305" [Arabidopsis 0.272 0.267 0.360 0.00036
TAIR|locus:2084051215 LOG4 "AT3G53450" [Arabidopsis 0.272 0.265 0.377 0.00038
UNIPROTKB|Q5ZC82242 LOG "Cytokinin riboside 5'-mon 0.282 0.243 0.412 0.00044
TAIR|locus:505006177 AT1G50575 "AT1G50575" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 764 (274.0 bits), Expect = 8.1e-76, P = 8.1e-76
 Identities = 139/202 (68%), Positives = 169/202 (83%)

Query:     3 PDHPHYLQSFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGEWT 62
             P+H HYLQ+ EL  E A LLDCTTWSGAGPGLMDAVT+GA++A KPVGG K+ KEAGEWT
Sbjct:   101 PNHSHYLQAQELSREAATLLDCTTWSGAGPGLMDAVTEGALEAEKPVGGIKIEKEAGEWT 160

Query:    63 ASNFHPYLPLETYLTCRFFSARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEILAL 122
             AS FHPYLP + Y TCRFFSARKHGL+D  +RN+  ++TA++ALPGG+GTLDEMFEILAL
Sbjct:   161 ASKFHPYLPPQNYHTCRFFSARKHGLVDAVIRNNVSEKTAIIALPGGIGTLDEMFEILAL 220

Query:   123 IQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVAKDEVASLWKICDSNSEALSY 182
             IQLERIGS LPVPF+VMNYD+FY KLL+F+  CE+ GTV+K EV++LWK+C++N EAL+Y
Sbjct:   221 IQLERIGSALPVPFIVMNYDAFYSKLLEFIETCENLGTVSKGEVSALWKVCNNNFEALTY 280

Query:   183 LAEFYDLSSIDKRVHEVNLKST 204
             LAEFYDL     ++ E  L+ T
Sbjct:   281 LAEFYDLPDGSAKL-ETELRKT 301




GO:0008152 "metabolic process" evidence=ISS
GO:0009507 "chloroplast" evidence=ISM
GO:0016831 "carboxy-lyase activity" evidence=ISS
UNIPROTKB|Q0BZM4 HNE_2374 "Putative uncharacterized protein" [Hyphomonas neptunium ATCC 15444 (taxid:228405)] Back     alignment and assigned GO terms
TIGR_CMR|SPO_3342 SPO_3342 "decarboxylase family protein" [Ruegeria pomeroyi DSS-3 (taxid:246200)] Back     alignment and assigned GO terms
TIGR_CMR|GSU_1904 GSU_1904 "decarboxylase family protein" [Geobacter sulfurreducens PCA (taxid:243231)] Back     alignment and assigned GO terms
CGD|CAL0000804 orf19.1723 [Candida albicans (taxid:5476)] Back     alignment and assigned GO terms
UNIPROTKB|Q5AJ69 CaO19.1723 "Putative uncharacterized protein" [Candida albicans SC5314 (taxid:237561)] Back     alignment and assigned GO terms
TIGR_CMR|GSU_2766 GSU_2766 "decarboxylase family protein" [Geobacter sulfurreducens PCA (taxid:243231)] Back     alignment and assigned GO terms
TAIR|locus:2828223 LOG1 "AT2G28305" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2084051 LOG4 "AT3G53450" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q5ZC82 LOG "Cytokinin riboside 5'-monophosphate phosphoribohydrolase LOG" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
gw1.VIII.588.1
hypothetical protein (226 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query209
pfam03641130 pfam03641, Lysine_decarbox, Possible lysine decarb 1e-26
COG1611205 COG1611, COG1611, Predicted Rossmann fold nucleoti 2e-19
TIGR00730178 TIGR00730, TIGR00730, TIGR00730 family protein 3e-12
>gnl|CDD|217655 pfam03641, Lysine_decarbox, Possible lysine decarboxylase Back     alignment and domain information
 Score = 98.4 bits (246), Expect = 1e-26
 Identities = 44/151 (29%), Positives = 62/151 (41%), Gaps = 24/151 (15%)

Query: 35  MDAVTKGAMQAGKPVGGFKVGKEAGEWTASNFHPYLPLETYLTCR---FFSARKHGLIDC 91
           M AV  GA++AG  V G      A E         +P               RK  +   
Sbjct: 1   MGAVADGALEAGGRVIGIIPNILAPE--------EIPNPIVTELIIVPDMHERKAAMARL 52

Query: 92  AVRNDSCDRTAVVALPGGVGTLDEMFEILALIQLERIGSELPVPFLVMNYDSFYKKLLDF 151
           A         A VALPGG GTL+E+FEIL  IQL         P +++N + +Y  LL+F
Sbjct: 53  A--------DAFVALPGGFGTLEELFEILTWIQLGIH----QKPIILLNPNGYYDPLLEF 100

Query: 152 LGDCEDWGTVAKDEVASLWKICDSNSEALSY 182
           +    + G ++      L  + D   EA+  
Sbjct: 101 IDHMVEEGFISP-AARELIIVVDDPEEAVEK 130


The members of this family share a highly conserved motif PGGXGTXXE that is probably functionally important. This family includes proteins annotated as lysine decarboxylases, although the evidence for this is not clear. Length = 130

>gnl|CDD|224527 COG1611, COG1611, Predicted Rossmann fold nucleotide-binding protein [General function prediction only] Back     alignment and domain information
>gnl|CDD|129813 TIGR00730, TIGR00730, TIGR00730 family protein Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 209
TIGR00730178 conserved hypothetical protein, DprA/Smf-related, 100.0
COG1611205 Predicted Rossmann fold nucleotide-binding protein 100.0
PF03641133 Lysine_decarbox: Possible lysine decarboxylase; In 100.0
TIGR00725159 conserved hypothetical protein, DprA/Smf-related, 100.0
TIGR00732220 dprA DNA protecting protein DprA. Disruption of th 98.94
PF02481212 DNA_processg_A: DNA recombination-mediator protein 98.13
PRK10736374 hypothetical protein; Provisional 97.42
COG0758350 Smf Predicted Rossmann fold nucleotide-binding pro 96.08
PF12694145 MoCo_carrier: Putative molybdenum carrier; InterPr 92.57
PF05014113 Nuc_deoxyrib_tr: Nucleoside 2-deoxyribosyltransfer 92.01
PRK10565508 putative carbohydrate kinase; Provisional 87.34
>TIGR00730 conserved hypothetical protein, DprA/Smf-related, family 2 Back     alignment and domain information
Probab=100.00  E-value=4.7e-47  Score=311.33  Aligned_cols=165  Identities=24%  Similarity=0.389  Sum_probs=146.0

Q ss_pred             CcHHHHH-HHHHHHHHHHcCCCEEEccCCccHHHHHHHHHHhCCCcEEEEecCCCcccccccCCCCCCCccceeeccchH
Q 028413            4 DHPHYLQ-SFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGEWTASNFHPYLPLETYLTCRFFS   82 (209)
Q Consensus         4 ~~p~y~~-A~~LG~~La~~g~~~V~~GG~~GlM~ava~ga~~~gG~viGi~~~~~~~~~~~~~~n~~l~~e~~i~~~~~~   82 (209)
                      .+|.|.+ |++||++||++|+.+|||||.+|+|+|+|+||+++||+|+||+|..-..+   +.+|++++  ..+.+++|+
T Consensus        13 ~~~~~~~~A~~lG~~la~~g~~lV~GGg~~GlM~a~a~ga~~~gG~viGi~p~~l~~~---~~~~~~~~--~~i~~~~~~   87 (178)
T TIGR00730        13 GNAAYKELAAELGAYLAGQGWGLVYGGGRVGLMGAIADAAMENGGTAVGVNPSGLFSG---EVVHQNLT--ELIEVNGMH   87 (178)
T ss_pred             CCcHHHHHHHHHHHHHHHCCCEEEECCChHhHHHHHHHHHHhcCCeEEEecchhhhhh---hccCCCCC--ceEEECCHH
Confidence            4666665 89999999999988888888899999999999999999999998532111   23599987  456779999


Q ss_pred             HHHHHhHhhhhhcCCCCccEEEEeCCCcccHHHHHHHHHHHHhhhhcCCCCccEEEEeCCccchHHHHHHHhHHHcCCCC
Q 028413           83 ARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEILALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVA  162 (209)
Q Consensus        83 ~Rk~~m~~~~~~~~~~~sDa~I~lPGG~GTLeEl~e~~t~~ql~~~~~~~~kPiilln~~g~w~~l~~~l~~~~~~gfi~  162 (209)
                      +||.+|++.|        ||||+||||+|||+|++++|+|.|++.+    +||++++|.+|||+++++|+++++++||++
T Consensus        88 ~Rk~~m~~~s--------da~I~lPGG~GTL~El~e~~~~~qlg~~----~kPiil~n~~g~~~~l~~~l~~~~~~gfi~  155 (178)
T TIGR00730        88 ERKAMMAELA--------DAFIAMPGGFGTLEELFEVLTWAQLGIH----QKPIILFNVNGHFDGLVEWLKYSIQEGFIS  155 (178)
T ss_pred             HHHHHHHHhC--------CEEEEcCCCcchHHHHHHHHHHHHcCCC----CCCEEEECCcchHHHHHHHHHHHHHCCCCC
Confidence            9999999987        9999999999999999999999999864    799999999999999999999999999999


Q ss_pred             hhcccccEEEeCCHHHHHHHHHhh
Q 028413          163 KDEVASLWKICDSNSEALSYLAEF  186 (209)
Q Consensus       163 ~~~~~~~i~~~~~~ee~~~~l~~~  186 (209)
                      +++ .+.+.+++|++|++++|++|
T Consensus       156 ~~~-~~~~~~~d~~~e~~~~i~~~  178 (178)
T TIGR00730       156 ESH-LKLIHVVSRPDELIEQVQNY  178 (178)
T ss_pred             HHH-cCcEEEcCCHHHHHHHHHhC
Confidence            975 47899999999999999764



This model represents one branch of a subfamily of proteins of unknown function. Both PSI-BLAST and weak hits by this model show a low level of similarity to and suggest an evolutionary relationship of the subfamily to the DprA/Smf family of DNA-processing proteins involved in chromosomal transformation with foreign DNA. Both Aquifex aeolicus and Mycobacterium leprae have one member in each of two branches of this subfamily, suggesting that the branches may have distinct functions.

>COG1611 Predicted Rossmann fold nucleotide-binding protein [General function prediction only] Back     alignment and domain information
>PF03641 Lysine_decarbox: Possible lysine decarboxylase; InterPro: IPR005269 This entry represents a cytokinin-activating enzyme working in the direct activation pathway Back     alignment and domain information
>TIGR00725 conserved hypothetical protein, DprA/Smf-related, family 1 Back     alignment and domain information
>TIGR00732 dprA DNA protecting protein DprA Back     alignment and domain information
>PF02481 DNA_processg_A: DNA recombination-mediator protein A; InterPro: IPR003488 The SMF family, of DNA processing chain A, dprA, are a group of bacterial proteins Back     alignment and domain information
>PRK10736 hypothetical protein; Provisional Back     alignment and domain information
>COG0758 Smf Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake [DNA replication, recombination, and repair / Intracellular trafficking and secretion] Back     alignment and domain information
>PF12694 MoCo_carrier: Putative molybdenum carrier; InterPro: IPR024755 The structure of proteins in this family contain central beta strands with flanking alpha helices Back     alignment and domain information
>PF05014 Nuc_deoxyrib_tr: Nucleoside 2-deoxyribosyltransferase; InterPro: IPR007710 Nucleoside 2-deoxyribosyltransferase (2 Back     alignment and domain information
>PRK10565 putative carbohydrate kinase; Provisional Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query209
1wek_A217 Crystal Structure Of The Conserved Hypothetical Pro 2e-07
>pdb|1WEK|A Chain A, Crystal Structure Of The Conserved Hypothetical Protein Tt1465 From Thermus Thermophilus Hb8 Length = 217 Back     alignment and structure

Iteration: 1

Score = 52.4 bits (124), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 55/184 (29%), Positives = 83/184 (45%), Gaps = 22/184 (11%) Query: 1 MGPDHPHYLQSFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGE 60 G HP Y + LG +A G G+ +AV +GA +AG G VG Sbjct: 48 FGEGHPAYEAGYRLGRALAEAGFGVVTGGGP-GVXEAVNRGAYEAG----GVSVGLNIEL 102 Query: 61 WTASNFHPYLPLETY-LTCRFFSARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEI 119 +PY +T+ L+ R+F RK + AV V LPGG GTLDE+ E+ Sbjct: 103 PHEQKPNPY---QTHALSLRYFFVRKVLFVRYAV--------GFVFLPGGFGTLDELSEV 151 Query: 120 LALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVAKDEVASLWKICDSNSEA 179 L L+Q E++ PV L Y ++ L+ +L D V +++ L+++ D E Sbjct: 152 LVLLQTEKV-HRFPVFLLDRGY---WEGLVRWLAFLRDQKAVGPEDL-QLFRLTDEPEEV 206 Query: 180 LSYL 183 + L Sbjct: 207 VQAL 210

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query209
1wek_A217 Hypothetical protein TT1465; rossman fold, structu 4e-34
1weh_A171 Conserved hypothetical protein TT1887; rossman fol 1e-28
1rcu_A195 Conserved hypothetical protein VT76; structural ge 1e-23
3gh1_A 462 Predicted nucleotide-binding protein; structural g 1e-21
2iz6_A176 Molybdenum cofactor carrier protein; metal transpo 6e-20
3bq9_A 460 Predicted rossmann fold nucleotide-binding domain 2e-19
3sbx_A189 Putative uncharacterized protein; structural genom 4e-18
1ydh_A216 AT5G11950; structural genomics, protein structure 8e-18
2a33_A215 Hypothetical protein; structural genomics, protein 1e-17
1t35_A191 Hypothetical protein YVDD, putative lysine decarbo 2e-17
3qua_A199 Putative uncharacterized protein; structural genom 2e-17
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-05
>1wek_A Hypothetical protein TT1465; rossman fold, structural genomics, riken structural genomics/proteomics initiative, RSGI, unknown function; 2.20A {Thermus thermophilus} SCOP: c.129.1.1 Length = 217 Back     alignment and structure
 Score =  120 bits (302), Expect = 4e-34
 Identities = 54/194 (27%), Positives = 84/194 (43%), Gaps = 36/194 (18%)

Query: 2   GPDHPHYLQSFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGEW 61
           G  HP Y   + LG  +A        +G GPG+M+AV +GA +AG    G          
Sbjct: 49  GEGHPAYEAGYRLGRALAEA-GFGVVTGGGPGVMEAVNRGAYEAGGVSVGL--------- 98

Query: 62  TASNFHPYLPLE----TYLT----CRFFSARKHGLIDCAVRNDSCDRTAVVALPGGVGTL 113
                +  LP E     Y T     R+F  RK   +  AV          V LPGG GTL
Sbjct: 99  -----NIELPHEQKPNPYQTHALSLRYFFVRKVLFVRYAV--------GFVFLPGGFGTL 145

Query: 114 DEMFEILALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVAKDEVASLWKIC 173
           DE+ E+L L+Q E++      P  +++   +++ L+ +L    D   V  +++  L+++ 
Sbjct: 146 DELSEVLVLLQTEKVH---RFPVFLLD-RGYWEGLVRWLAFLRDQKAVGPEDL-QLFRLT 200

Query: 174 DSNSEALSYLAEFY 187
           D   E +  L    
Sbjct: 201 DEPEEVVQALKAEA 214


>1weh_A Conserved hypothetical protein TT1887; rossman fold, structural genomics, riken structural genomics/proteomics initiative, RSGI; 1.80A {Thermus thermophilus} SCOP: c.129.1.1 Length = 171 Back     alignment and structure
>1rcu_A Conserved hypothetical protein VT76; structural genomics, PSI, protein structure initiative, northeast structural genomics consortium, NESG; 2.50A {Thermotoga maritima} SCOP: c.129.1.1 Length = 195 Back     alignment and structure
>3gh1_A Predicted nucleotide-binding protein; structural genomics, protein structure initiative; 1.90A {Vibrio cholerae o1 biovar el tor str} PDB: 2pmb_A Length = 462 Back     alignment and structure
>2iz6_A Molybdenum cofactor carrier protein; metal transport; 1.60A {Chlamydomonas reinhardtii} PDB: 2iz5_A 2iz7_A Length = 176 Back     alignment and structure
>3bq9_A Predicted rossmann fold nucleotide-binding domain containing protein; structural genomics, PSI-2, protein structure initiative; 1.80A {Idiomarina baltica} Length = 460 Back     alignment and structure
>3sbx_A Putative uncharacterized protein; structural genomics, seattle structural genomics center for infectious disease, ssgcid; HET: AMP; 2.50A {Mycobacterium marinum M} Length = 189 Back     alignment and structure
>1ydh_A AT5G11950; structural genomics, protein structure initiative, center for eukaryotic structural genomics, CESG; 2.15A {Arabidopsis thaliana} SCOP: c.129.1.1 PDB: 2q4d_A Length = 216 Back     alignment and structure
>2a33_A Hypothetical protein; structural genomics, protein structure initiative, center for eukaryotic structural genomics, CESG, AT2G37210; 1.95A {Arabidopsis thaliana} SCOP: c.129.1.1 PDB: 2q4o_A Length = 215 Back     alignment and structure
>1t35_A Hypothetical protein YVDD, putative lysine decarboxylase; structural genomics target, NYSGXRC, PSI, protein structure initiative; 2.72A {Bacillus subtilis} SCOP: c.129.1.1 Length = 191 Back     alignment and structure
>3qua_A Putative uncharacterized protein; structural genomics, seattle structural genomics center for infectious disease, ssgcid; 2.10A {Mycobacterium smegmatis str} Length = 199 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query209
3sbx_A189 Putative uncharacterized protein; structural genom 100.0
3qua_A199 Putative uncharacterized protein; structural genom 100.0
1t35_A191 Hypothetical protein YVDD, putative lysine decarbo 100.0
1ydh_A216 AT5G11950; structural genomics, protein structure 100.0
2a33_A215 Hypothetical protein; structural genomics, protein 100.0
1wek_A217 Hypothetical protein TT1465; rossman fold, structu 100.0
1weh_A171 Conserved hypothetical protein TT1887; rossman fol 100.0
3gh1_A 462 Predicted nucleotide-binding protein; structural g 100.0
3bq9_A 460 Predicted rossmann fold nucleotide-binding domain 100.0
1rcu_A195 Conserved hypothetical protein VT76; structural ge 100.0
2iz6_A176 Molybdenum cofactor carrier protein; metal transpo 100.0
3maj_A382 DNA processing chain A; MCSG, PSI-2, structural ge 98.26
3uqz_A288 DNA processing protein DPRA; SAM and rossmann fold 97.83
3imk_A158 Putative molybdenum carrier protein; YP_461806.1, 96.08
2nx2_A181 Hypothetical protein YPSA; structural genomics, un 92.95
2f62_A161 Nucleoside 2-deoxyribosyltransferase; SGPP, struct 92.68
2khz_A165 C-MYC-responsive protein RCL; flexible loop, nucle 92.1
3ehd_A162 Uncharacterized conserved protein; PSI,MCSG,PF0501 91.43
1f8y_A157 Nucleoside 2-deoxyribosyltransferase; active site, 85.08
2o6l_A170 UDP-glucuronosyltransferase 2B7; drug metabolism, 84.41
>3sbx_A Putative uncharacterized protein; structural genomics, seattle structural genomics center for infectious disease, ssgcid; HET: AMP; 2.50A {Mycobacterium marinum M} Back     alignment and structure
Probab=100.00  E-value=6e-48  Score=317.99  Aligned_cols=163  Identities=25%  Similarity=0.275  Sum_probs=144.0

Q ss_pred             CcHHHHH-HHHHHHHHHHcCCCEEEccCCccHHHHHHHHHHhCCCcEEEEecCCCcccccccCCCCCCCccceeeccchH
Q 028413            4 DHPHYLQ-SFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGEWTASNFHPYLPLETYLTCRFFS   82 (209)
Q Consensus         4 ~~p~y~~-A~~LG~~La~~g~~~V~~GG~~GlM~ava~ga~~~gG~viGi~~~~~~~~~~~~~~n~~l~~e~~i~~~~~~   82 (209)
                      .+|.|.+ |++||++||++|..+|||||..|+|+|+|+||+++||+|+||+|...+.+   +.+|++++  ..+++.+|+
T Consensus        25 ~~~~~~~~A~~lg~~la~~g~~lv~GGG~~GlM~a~~~ga~~~GG~viGv~p~~l~~~---e~~~~~~~--~~i~~~~~~   99 (189)
T 3sbx_A           25 THPELLELAGAVGAAIAARGWTLVWGGGHVSAMGAVSSAARAHGGWTVGVIPKMLVHR---ELADHDAD--ELVVTETMW   99 (189)
T ss_dssp             CCHHHHHHHHHHHHHHHHTTCEEEECCBCSHHHHHHHHHHHTTTCCEEEEEETTTTTT---TTBCTTCS--EEEEESSHH
T ss_pred             CChHHHHHHHHHHHHHHHCCCEEEECCCccCHHHHHHHHHHHcCCcEEEEcCchhhhc---ccCCCCCC--eeEEcCCHH
Confidence            4566655 99999999999887777767779999999999999999999999642221   24699997  567789999


Q ss_pred             HHHHHhHhhhhhcCCCCccEEEEeCCCcccHHHHHHHHHHHHhhhhcCCCCccEEEEeCCccchHHHHHHHhHHHcCCCC
Q 028413           83 ARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEILALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVA  162 (209)
Q Consensus        83 ~Rk~~m~~~~~~~~~~~sDa~I~lPGG~GTLeEl~e~~t~~ql~~~~~~~~kPiilln~~g~w~~l~~~l~~~~~~gfi~  162 (209)
                      +||.+|+++|        ||||+||||+|||||+||+|||.|+++|    +|||+|+|.+|||+++++|+++++++||++
T Consensus       100 ~Rk~~m~~~s--------da~IalPGG~GTLdElfe~lt~~qlg~~----~kPvvlln~~gfw~~l~~~l~~~~~~Gfi~  167 (189)
T 3sbx_A          100 ERKQVMEDRA--------NAFITLPGGVGTLDELLDVWTEGYLGMH----DKSIVVLDPWGHFDGLRAWLSELADTGYVS  167 (189)
T ss_dssp             HHHHHHHHHC--------SEEEECSCCHHHHHHHHHHHHHHHTTSC----CCCEEEECTTCTTHHHHHHHHHHHHTTSSC
T ss_pred             HHHHHHHHHC--------CEEEEeCCCcchHHHHHHHHHHHHhccc----CCCEEEecCCccchHHHHHHHHHHHCCCCC
Confidence            9999999997        9999999999999999999999999864    799999999999999999999999999999


Q ss_pred             hhcccccEEEeCCHHHHHHHHH
Q 028413          163 KDEVASLWKICDSNSEALSYLA  184 (209)
Q Consensus       163 ~~~~~~~i~~~~~~ee~~~~l~  184 (209)
                      +++ .+++.+++|++|+++.|+
T Consensus       168 ~~~-~~~i~~~d~~ee~~~~l~  188 (189)
T 3sbx_A          168 RTA-MERLIVVDNLDDALQACA  188 (189)
T ss_dssp             HHH-HHHEEEESSHHHHHHHHC
T ss_pred             HHH-cCeEEEeCCHHHHHHHhc
Confidence            975 488999999999999873



>3qua_A Putative uncharacterized protein; structural genomics, seattle structural genomics center for infectious disease, ssgcid; 2.10A {Mycobacterium smegmatis str} Back     alignment and structure
>1t35_A Hypothetical protein YVDD, putative lysine decarboxylase; structural genomics target, NYSGXRC, PSI, protein structure initiative; 2.72A {Bacillus subtilis} SCOP: c.129.1.1 Back     alignment and structure
>1ydh_A AT5G11950; structural genomics, protein structure initiative, center for eukaryotic structural genomics, CESG; 2.15A {Arabidopsis thaliana} SCOP: c.129.1.1 PDB: 2q4d_A Back     alignment and structure
>2a33_A Hypothetical protein; structural genomics, protein structure initiative, center for eukaryotic structural genomics, CESG, AT2G37210; 1.95A {Arabidopsis thaliana} SCOP: c.129.1.1 PDB: 2q4o_A Back     alignment and structure
>1wek_A Hypothetical protein TT1465; rossman fold, structural genomics, riken structural genomics/proteomics initiative, RSGI, unknown function; 2.20A {Thermus thermophilus} SCOP: c.129.1.1 Back     alignment and structure
>1weh_A Conserved hypothetical protein TT1887; rossman fold, structural genomics, riken structural genomics/proteomics initiative, RSGI; 1.80A {Thermus thermophilus} SCOP: c.129.1.1 Back     alignment and structure
>3gh1_A Predicted nucleotide-binding protein; structural genomics, protein structure initiative; 1.90A {Vibrio cholerae o1 biovar el tor str} PDB: 2pmb_A Back     alignment and structure
>3bq9_A Predicted rossmann fold nucleotide-binding domain containing protein; structural genomics, PSI-2, protein structure initiative; 1.80A {Idiomarina baltica} Back     alignment and structure
>1rcu_A Conserved hypothetical protein VT76; structural genomics, PSI, protein structure initiative, northeast structural genomics consortium, NESG; 2.50A {Thermotoga maritima} SCOP: c.129.1.1 Back     alignment and structure
>2iz6_A Molybdenum cofactor carrier protein; metal transport; 1.60A {Chlamydomonas reinhardtii} PDB: 2iz5_A 2iz7_A Back     alignment and structure
>3maj_A DNA processing chain A; MCSG, PSI-2, structural genomics, protein structure initiati midwest center for structural genomics; HET: DNA; 2.05A {Rhodopseudomonas palustris} Back     alignment and structure
>3uqz_A DNA processing protein DPRA; SAM and rossmann fold, DNA processing protein A, DNA binding; HET: DNA SO4; 2.70A {Streptococcus pneumoniae} Back     alignment and structure
>3imk_A Putative molybdenum carrier protein; YP_461806.1, structural genomics, joint center for structural genomics, JCSG; HET: MSE MES PG4 PG6; 1.45A {Syntrophus aciditrophicus SB} Back     alignment and structure
>2nx2_A Hypothetical protein YPSA; structural genomics, unknown function, PSI, protein structure initiative; 2.00A {Bacillus subtilis} SCOP: c.129.1.2 Back     alignment and structure
>2f62_A Nucleoside 2-deoxyribosyltransferase; SGPP, structural genomics, PSI, S genomics of pathogenic protozoa consortium; HET: 12M; 1.50A {Trypanosoma brucei} SCOP: c.23.14.1 PDB: 2a0k_A* 2f2t_A* 2f64_A* 2f67_A* Back     alignment and structure
>2khz_A C-MYC-responsive protein RCL; flexible loop, nucleus, phosphoprotein, nuclear protein; NMR {Rattus norvegicus} PDB: 2klh_A* Back     alignment and structure
>3ehd_A Uncharacterized conserved protein; PSI,MCSG,PF05014, structural genomics, protein structure INI midwest center for structural genomics; HET: MSE; 2.15A {Enterococcus faecalis} Back     alignment and structure
>1f8y_A Nucleoside 2-deoxyribosyltransferase; active site, alpha/beta protein, biocatalyst, X- RAY crystallography; HET: 5MD; 2.40A {Lactobacillus leichmannii} SCOP: c.23.14.1 PDB: 1f8x_A* Back     alignment and structure
>2o6l_A UDP-glucuronosyltransferase 2B7; drug metabolism, rossman, MAD, enzyme, nucleotide binding, sugar,UDP-glucuronosyltransferase, UGT; 1.80A {Homo sapiens} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 209
d1ydhb_181 c.129.1.1 (B:) Hypothetical protein At5g11950 {Tha 2e-25
d2q4oa1183 c.129.1.1 (A:8-190) Hypothetical protein At2g37210 3e-24
d1weka_208 c.129.1.1 (A:) Hypothetical protein TT1465 (TTHA16 3e-21
d1t35a_179 c.129.1.1 (A:) Hypothetical protein YvdD {Bacillus 2e-20
d1weha_171 c.129.1.1 (A:) Hypothetical protein TT1887 (TTHA02 9e-16
d1rcua_170 c.129.1.1 (A:) Hypothetical protein TM1055 {Thermo 7e-15
>d1ydhb_ c.129.1.1 (B:) Hypothetical protein At5g11950 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 181 Back     information, alignment and structure

class: Alpha and beta proteins (a/b)
fold: MCP/YpsA-like
superfamily: MCP/YpsA-like
family: MoCo carrier protein-like
domain: Hypothetical protein At5g11950
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
 Score = 95.2 bits (236), Expect = 2e-25
 Identities = 33/186 (17%), Positives = 63/186 (33%), Gaps = 18/186 (9%)

Query: 1   MGPDHPHYLQSFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGE 60
            G        + ELG E+ +      + G   GLM  +++   + G  V G         
Sbjct: 13  SGHREVFSDAAIELGNELVKRKIDLVYGGGSVGLMGLISRRVYEGGLHVLGIIPKALMPI 72

Query: 61  WTASNFHPYLPLETYLTCRFFSARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEIL 120
             +         +  +       +     +           A +ALPGG GT++E+ E++
Sbjct: 73  EISGETVG----DVRVVADMHERKAAMAQEA---------EAFIALPGGYGTMEELLEMI 119

Query: 121 ALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVAKDEVASLWKICDSNSEAL 180
              QL            ++N D +Y  LL       + G + K    ++     +  E +
Sbjct: 120 TWSQLGIH----KKTVGLLNVDGYYNNLLALFDTGVEEGFI-KPGARNIVVSAPTAKELM 174

Query: 181 SYLAEF 186
             + E+
Sbjct: 175 EKMEEY 180


>d1weka_ c.129.1.1 (A:) Hypothetical protein TT1465 (TTHA1644) {Thermus thermophilus [TaxId: 274]} Length = 208 Back     information, alignment and structure
>d1t35a_ c.129.1.1 (A:) Hypothetical protein YvdD {Bacillus subtilis [TaxId: 1423]} Length = 179 Back     information, alignment and structure
>d1weha_ c.129.1.1 (A:) Hypothetical protein TT1887 (TTHA0294) {Thermus thermophilus [TaxId: 274]} Length = 171 Back     information, alignment and structure
>d1rcua_ c.129.1.1 (A:) Hypothetical protein TM1055 {Thermotoga maritima [TaxId: 2336]} Length = 170 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query209
d1t35a_179 Hypothetical protein YvdD {Bacillus subtilis [TaxI 100.0
d1ydhb_181 Hypothetical protein At5g11950 {Thale cress (Arabi 100.0
d2q4oa1183 Hypothetical protein At2g37210/T2N18.3 {Thale cres 100.0
d1weka_208 Hypothetical protein TT1465 (TTHA1644) {Thermus th 100.0
d1rcua_170 Hypothetical protein TM1055 {Thermotoga maritima [ 100.0
d1weha_171 Hypothetical protein TT1887 (TTHA0294) {Thermus th 100.0
d1s2da_167 Purine transdeoxyribosylase {Lactobacillus helveti 90.71
d1f8ya_156 Nucleoside 2-deoxyribosyltransferase {Lactobacillu 88.06
d2f62a1152 Nucleoside 2-deoxyribosyltransferase {Trypanosoma 87.21
d1kyha_275 Hypothetical protein YxkO {Bacillus subtilis [TaxI 85.22
>d1t35a_ c.129.1.1 (A:) Hypothetical protein YvdD {Bacillus subtilis [TaxId: 1423]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: MCP/YpsA-like
superfamily: MCP/YpsA-like
family: MoCo carrier protein-like
domain: Hypothetical protein YvdD
species: Bacillus subtilis [TaxId: 1423]
Probab=100.00  E-value=3.1e-48  Score=315.33  Aligned_cols=165  Identities=19%  Similarity=0.317  Sum_probs=146.4

Q ss_pred             CcHHH-HHHHHHHHHHHHcCCCEEEccCCccHHHHHHHHHHhCCCcEEEEecCCCcccccccCCCCCCCccceeeccchH
Q 028413            4 DHPHY-LQSFELGGEIARLLDCTTWSGAGPGLMDAVTKGAMQAGKPVGGFKVGKEAGEWTASNFHPYLPLETYLTCRFFS   82 (209)
Q Consensus         4 ~~p~y-~~A~~LG~~La~~g~~~V~~GG~~GlM~ava~ga~~~gG~viGi~~~~~~~~~~~~~~n~~l~~e~~i~~~~~~   82 (209)
                      .||.| ++|++||+.||++|+.+|||||..|+|+|+++||+++||+++||+|......   +.+|++++  ..+.+++|+
T Consensus        14 ~~~~~~~~a~~lg~~la~~g~~lv~GGG~~GlMga~a~ga~~~gg~v~gv~~~~l~~~---~~~~~~~~--~~~~~~~~~   88 (179)
T d1t35a_          14 GNEAYKRKAAELGVYMAEQGIGLVYGGSRVGLMGTIADAIMENGGTAIGVMPSGLFSG---EVVHQNLT--ELIEVNGMH   88 (179)
T ss_dssp             SSTHHHHHHHHHHHHHHHTTCEEEECCCCSHHHHHHHHHHHTTTCCEEEEEETTCCHH---HHTTCCCS--EEEEESHHH
T ss_pred             cCCHHHHHHHHHHHHHHHCCCeEEECCCchHHHHHHhcchhhcCCceeccccchhhcc---ccccccce--eeeeeccHH
Confidence            44555 4599999999999999999999999999999999999999999998542211   23588886  567779999


Q ss_pred             HHHHHhHhhhhhcCCCCccEEEEeCCCcccHHHHHHHHHHHHhhhhcCCCCccEEEEeCCccchHHHHHHHhHHHcCCCC
Q 028413           83 ARKHGLIDCAVRNDSCDRTAVVALPGGVGTLDEMFEILALIQLERIGSELPVPFLVMNYDSFYKKLLDFLGDCEDWGTVA  162 (209)
Q Consensus        83 ~Rk~~m~~~~~~~~~~~sDa~I~lPGG~GTLeEl~e~~t~~ql~~~~~~~~kPiilln~~g~w~~l~~~l~~~~~~gfi~  162 (209)
                      +||..|++.|        |+||+||||+|||||+|++|+|.|++.|    +|||+|+|.+|||+++++++++++++||++
T Consensus        89 ~Rk~~m~~~s--------dafI~lPGG~GTLdEl~e~l~~~ql~~~----~kPiil~n~~gfw~~l~~~l~~~~~~gfi~  156 (179)
T d1t35a_          89 ERKAKMSELA--------DGFISMPGGFGTYEELFEVLCWAQIGIH----QKPIGLYNVNGYFEPMMKMVKYSIQEGFSN  156 (179)
T ss_dssp             HHHHHHHHHC--------SEEEECSCCHHHHHHHHHHHHTTSCSSC----CCCEEEECGGGTTHHHHHHHHHHHHTTSSC
T ss_pred             HHHHHHHHhc--------CeEEEecCccchhhHHHHHHHHHHhhcc----CCCeEeecCCccHHHHHHHHHHHHHcCCCC
Confidence            9999999987        9999999999999999999999999864    799999999999999999999999999999


Q ss_pred             hhcccccEEEeCCHHHHHHHHHhh
Q 028413          163 KDEVASLWKICDSNSEALSYLAEF  186 (209)
Q Consensus       163 ~~~~~~~i~~~~~~ee~~~~l~~~  186 (209)
                      +++ .+.+.+++||+|++++|++|
T Consensus       157 ~~~-~~~i~~~~~~~e~i~~L~~~  179 (179)
T d1t35a_         157 ESH-LKLIHSSSRPDELIEQMQNY  179 (179)
T ss_dssp             TTH-HHHEEEESSHHHHHHHHHTC
T ss_pred             hHH-hCcEEEeCCHHHHHHHHHhC
Confidence            975 48899999999999999875



>d1ydhb_ c.129.1.1 (B:) Hypothetical protein At5g11950 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
>d1weka_ c.129.1.1 (A:) Hypothetical protein TT1465 (TTHA1644) {Thermus thermophilus [TaxId: 274]} Back     information, alignment and structure
>d1rcua_ c.129.1.1 (A:) Hypothetical protein TM1055 {Thermotoga maritima [TaxId: 2336]} Back     information, alignment and structure
>d1weha_ c.129.1.1 (A:) Hypothetical protein TT1887 (TTHA0294) {Thermus thermophilus [TaxId: 274]} Back     information, alignment and structure
>d1s2da_ c.23.14.1 (A:) Purine transdeoxyribosylase {Lactobacillus helveticus [TaxId: 1587]} Back     information, alignment and structure
>d1f8ya_ c.23.14.1 (A:) Nucleoside 2-deoxyribosyltransferase {Lactobacillus leichmannii [TaxId: 28039]} Back     information, alignment and structure
>d2f62a1 c.23.14.1 (A:9-160) Nucleoside 2-deoxyribosyltransferase {Trypanosoma brucei [TaxId: 5691]} Back     information, alignment and structure
>d1kyha_ c.72.1.4 (A:) Hypothetical protein YxkO {Bacillus subtilis [TaxId: 1423]} Back     information, alignment and structure