LeMat-GenBench: A Unified Benchmark for Generative Models of Crystalline Materials
Generative machine learning models hold great promise for accelerating materials discovery, particularly through the inverse design of inorganic crystals, enabling an unprecedented exploration of chemical space. Yet, the lack of standardized evaluation frameworks makes it difficult to evaluate, compare and further develop these ML models meaningfully.
LeMat-GenBench introduces a unified benchmark for generative models of crystalline materials. It provides standardized evaluation metrics for meaningful model comparison, a diverse set of tasks, and this leaderboard to encourage and track community progress.
Paper: arXiv | Code: GitHub | Contact: siddharth.betala-ext [at] entalpic.ai, alexandre.duval [at] entalpic.ai
LeMat-GenBench
By default, the leaderboard is sorted by MSUN+SUN in descending order.
| Group | Metrics | Direction |
|---|---|---|
| Validity | Valid, Charge Neutral, Distance Valid, Plausibility Valid | ↑ Higher is better |
| Uniqueness & Novelty | Unique, Novel | ↑ Higher is better |
| Energy Metrics | E Above Hull, Formation Energy, Relaxation RMSD (with std) | ↓ Lower is better |
| Stability | Stable, Unique in Stable, SUN | ↑ Higher is better |
| Metastability | Metastable, Unique in Metastable, MSUN | ↑ Higher is better |
| Distribution | JS Distance, MMD, FID | ↓ Lower is better |
| Diversity | Element, Space Group, Atomic Site, Crystal Size | ↑ Higher is better |
| HHI | HHI Production, HHI Reserve | ↓ Lower is better |
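As a rough illustration of how the count-based metrics and the default sort key fit together, the sketch below turns per-structure labels into SUN and MSUN percentages and ranks models by MSUN+SUN. It is not the benchmark's implementation. It assumes that SUN counts structures that are stable, unique, and novel, and that MSUN is the metastable analogue. The `StructureFlags` class, the helper functions, and the model names are hypothetical, and the labels themselves are assumed to be computed elsewhere.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class StructureFlags:
    """Per-structure labels; hypothetical, assumed to be computed upstream."""
    stable: bool
    metastable: bool
    unique: bool
    novel: bool


def as_percentage(count: int, total: int) -> float:
    """Express a count as a percentage of all generated structures."""
    return 100.0 * count / total if total else 0.0


def sun_msun(flags: list[StructureFlags]) -> dict[str, float]:
    """Assumed definitions: SUN = stable, unique, novel; MSUN = metastable, unique, novel."""
    total = len(flags)
    sun = sum(f.stable and f.unique and f.novel for f in flags)
    msun = sum(f.metastable and f.unique and f.novel for f in flags)
    return {"SUN": as_percentage(sun, total), "MSUN": as_percentage(msun, total)}


def rank_models(results: dict[str, dict[str, float]]) -> list[str]:
    """Order model names by MSUN+SUN, highest first (the default sort described above)."""
    return sorted(
        results,
        key=lambda name: results[name]["MSUN"] + results[name]["SUN"],
        reverse=True,
    )


# Toy data for two hypothetical models, four generated structures each.
model_a = [
    StructureFlags(stable=True, metastable=False, unique=True, novel=True),
    StructureFlags(stable=False, metastable=True, unique=True, novel=True),
    StructureFlags(stable=False, metastable=True, unique=True, novel=False),
    StructureFlags(stable=False, metastable=False, unique=True, novel=True),
]
model_b = [
    StructureFlags(stable=True, metastable=False, unique=True, novel=True),
    StructureFlags(stable=False, metastable=False, unique=True, novel=True),
    StructureFlags(stable=False, metastable=False, unique=False, novel=True),
    StructureFlags(stable=False, metastable=False, unique=True, novel=False),
]

results = {"model_b": sun_msun(model_b), "model_a": sun_msun(model_a)}
ranking = rank_models(results)
```

Dividing by the total number of generated structures, rather than by the number of valid ones, mirrors the leaderboard option that displays count-based metrics as percentages of total structures.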
GenBench Leaderboard
WyFormer-DFT ⚡ | 95.7% | 95.1% | 70.5% | 12.4% | 33.4% | 0.2% | 15.0% | 0.1834 | -0.4975 | 0.3878 |
Symbol Legend:
- ⚡ Structures were already relaxed
- โ Contributes to LeMat-Bulk reference dataset (in-distribution)
- โ Out-of-distribution relative to LeMat-Bulk reference dataset
A submission is marked as verified when its results come from a submitted model rather than from submitted CIF files.