← Back to Innovations
Biomedical Sciences Model Scientific Experimental

BioEmu

Protein Structural Ensembles

6,700 USERS
41 CITATIONS
Try BioEmu on Microsoft Foundry → Try on Microsoft Foundry →
BioEmu

Interactive Playground

Input Protein

Provide a protein sequence to fold
Try a preset
10 characters
Paste amino acid sequence using single-letter codes (e.g., NLYIQWLKDGG...)
Conformations 10

3D Result

Generate a structure to view it here

About BioEmu

BioEmu is a diffusion-based generative model from Microsoft Research that predicts protein structural ensembles — the full landscape of conformational states a protein occupies — rather than a single static fold. It generates thousands of statistically independent conformations per hour on a single GPU and is trained on more than 200 milliseconds of molecular-dynamics simulation data alongside experimental structures and stability measurements. The model captures functional motions including cryptic-pocket formation, domain rearrangements, and local unfolding, and predicts relative free energies with roughly kcal/mol accuracy.

BioEmu shifts protein modeling from a static-structure picture to a dynamic, ensemble picture, addressing the long-standing limitation that real proteins flex and breathe in ways relevant to function and drug binding. By amortizing the cost of microsecond-scale molecular dynamics and ensemble-resolving experimental techniques, it enables researchers to design drugs that target specific conformational states inaccessible to single-structure methods. Within Microsoft’s AI for Science and Health Futures programs, BioEmu is a central piece of the push to make protein dynamics — not just structure — a computable, scalable input to therapeutic discovery.

Key capabilities

  • Generates 10,000+ structurally diverse conformations per protein in minutes on a single H100
  • Predicts full structural ensembles rather than a single fold
  • Diffusion-based generative model for conformational landscapes
  • Integrates with OpenMM for downstream simulation
  • Published in Science; open-sourced on GitHub
Technology Stack
PyTorch Diffusion Models OpenMM CUDA BioPython
Technology Stack
PyTorch Diffusion Models OpenMM CUDA BioPython