TriCyp
BrowseH-GroupsBenchmarkDownloadsPaper

TriCyp

Three-state cysteine classification across ECOD F70 representative domains — disulfide-bonded, metal-binding, or free thiol — combining ESM2 predictions with PDB structural evidence.

Navigation

  • Dashboard
  • Browse Families
  • H-Groups
  • Benchmark
  • AF Geometric
  • Downloads & API
  • About / Methods
  • Paper

Resources

  • ECOD Database
  • RCSB PDB

© 2026 Schaeffer & Cong Labs, UT Southwestern Medical Center

data · paper-v1·refreshed 2026-05-06

Paper

Classification of cysteine fates in structure predictions using a protein language model

Yuan, Durham, Cong, Schaeffer · 2026

TriCyp is the companion deposition for this manuscript. Each main and supplementary figure maps to a navigable surface here so a reader landing from the PDF can re-find the exact panel they want to inspect interactively.

bioRxiv link pendingDownloads & API

Main figures

FigureWhat it showsTriCyp surface
Fig 1Pipeline overview. Pipeline diagram for ESM2-3state classification across ECOD F70 representative domains.About / Methods →
Fig 2Held-out benchmark. ROC + PR curves and threshold tuning for ESM2-3state vs SSBONDPredict (disulfide) and vs LMetalSite / GPSite (metal-binding).Benchmark →
Fig 3ASource-stratified rates. Stacked bars: PDB-geom / PDB-ESM / AFDB-ESM × free-thiol / disulfide / metal-binding fractions.Dashboard →
Fig 3BKingdom representation. Domain fraction vs cysteine fraction by superkingdom.Dashboard →
Fig 3CPer-kingdom rates. Stacked classification rates for Bacteria / Archaea / Eukaryota.Dashboard →
Fig 3DSubcellular gradient. Eukaryotic subcellular localisation: disulfide and metal-binding rates per compartment.Dashboard →
Fig 4AF geometric scanning. Why AlphaFold-monomer geometric scanning is fundamentally limited as a disulfide annotation source — panels A–F with downloadable PyMOL sessions.AlphaFold geometric scanning →
Fig 5A,BH-group confusion matrix. Structurally-known × ESM2-predicted cysteine fractions per H-group; click-through to the H-groups in any cell.H-group browser →
Fig 5CNovel metal H-group · 3380.1. Side-by-side PDB-source and AFDB-source representatives with ESM2-predicted metal-binding cysteines highlighted.H-group 3380.1 →
Fig 5DNovel metal H-group · 804.1. Second highlighted candidate-novel metal-binding H-group.H-group 804.1 →
Fig 5ENovel metal H-group · 3991.1. Third highlighted candidate-novel metal-binding H-group.H-group 3991.1 →

Supplementary figures

FigureWhat it showsTriCyp surface
Fig S1Iron-only ROC. Metal-type-stratified ROC. The headline iron-only finding (ESM2 0.993 / LMetalSite 0.917 / GPSite 0.877).Benchmark →
Fig S2Source-type breakdown. Source-type breakdown (PDB / AFDB / Prodigal / UniParc) × classification fractions.Dashboard →
Fig S3Confidence distribution. Distribution of max-class probability across all classified cysteines.Dashboard →

How to cite

Please cite the manuscript when reusing TriCyp data. Data is released under CC-BY 4.0; the predictor source carries its existing license.Templates below contain DOI/URL placeholders until the bioRxiv preprint is assigned — they will fill in automatically.

BibTeX

@article{yuan_tricyp_2026,
  title = {Classification of cysteine fates in structure predictions using a protein language model},
  author = {Yuan and Durham and Cong and Schaeffer},
  year = {2026},
  journal = {bioRxiv},
  note = {TriCyp companion site: https://tricyp.swmed.edu},
}

RIS

TY  - JOUR
TI  - Classification of cysteine fates in structure predictions using a protein language model
AU  - Yuan
AU  - Durham
AU  - Cong
AU  - Schaeffer, Dustin
PY  - 2026
JO  - bioRxiv
ER  - 

How to read this site

  • Every panel that mirrors a paper figure is labelled with that figure number in the top-left of the card; click Show data beside any dashboard panel to inspect the underlying counts or download the panel CSV.
  • For raw artefacts — per-cysteine TSV, per-domain TSV, per-H-group aggregates, and the manuscript's paper/figure_data/ CSVs — see the Downloads page.
  • The H-group browser replicates Fig 5A,B as click-through confusion matrices; the three highlighted novel-metal H-groups link directly to their detail pages.
  • Public REST API endpoints are documented on the Downloads & API page; every response uses the same { success, data, error } envelope.