๐Ÿ”ฌ LeMat-GenBench: A Unified Benchmark for Generative Models of Crystalline Materials

Generative machine learning models hold great promise for accelerating materials discovery, particularly through the inverse design of inorganic crystals, enabling an unprecedented exploration of chemical space. Yet, the lack of standardized evaluation frameworks makes it difficult to evaluate, compare and further develop these ML models meaningfully.

LeMat-GenBench introduces a unified benchmark for generative models of crystalline materials, with standardized evaluation metrics** for meaningful model comparison, diverse tasks, and this leaderboard to encourage and track community progress.

๐Ÿ“„ Paper: arXiv | ๐Ÿ’ป Code: GitHub | ๐Ÿ“ง Contact: siddharth.betala-ext [at] entalpic.ai, alexandre.duval [at] entalpic.ai

LeMat-GenBench

Show only key metrics

Display count-based metrics as percentages of total structures

Sort By

Select column to sort by (default: sorted by MSUN+SUN descending)

Sort Direction
Metric Families (only active when Compact View is off)

Select which metric groups to display

ColorGroupMetricsDirection
ValidityValid, Charge Neutral, Distance Valid, Plausibility Validโ†‘ Higher is better
Uniqueness & NoveltyUnique, Novelโ†‘ Higher is better
Energy MetricsE Above Hull, Formation Energy, Relaxation RMSD (with std)โ†“ Lower is better
StabilityStable, Unique in Stable, SUNโ†‘ Higher is better
MetastabilityMetastable, Unique in Metastable, MSUNโ†‘ Higher is better
DistributionJS Distance, MMD, FIDโ†“ Lower is better
DiversityElement, Space Group, Atomic Site, Crystal Sizeโ†‘ Higher is better
HHIHHI Production, HHI Reserveโ†“ Lower is better

GenBench Leaderboard

Symbol Legend:

  • โšก Structures were already relaxed
  • โ˜… Contributes to LeMat-Bulk reference dataset (in-distribution)
  • โ—† Out-of-distribution relative to LeMat-Bulk reference dataset

Verified submissions mean the results came from a model submission rather than a CIF submission.