Choose what's right

Offering both pre-made and custom dataset solutions. Train your model with general-pourpose synthetic data, fine-tune on highly-customized datasets to match your needs.

Pre-made Dataset Packages

Ready-to-use curated datasets

  • Curated MD simulation trajectories (6 replicas, 10-100 ns)
  • Self-Distillation structure sets (entire human proteome and beyond)
  • Quality-filtered and validated
  • Instant download access
  • Commercial usage

Perfect for teams looking to quickly augment their training data

Most Flexible

Custom Dataset Generation

Tailored to your specific needs

  • Focus on custom protein sequences, and protein-classes of interest
  • Custom molecular force-fields, membrane-containing systems
  • Specialized enhanced sampling methods

Ideal for organizations with specific research requirements