Cosmological Field Emulation and Parameter Inference with Diffusion Models
Authors
Nayantara Mudur, Carolina Cuesta-Lazaro, Douglas P. Finkbeiner
Abstract
Cosmological simulations play a crucial role in elucidating the effect of physical parameters on the statistics of fields and on constraining parameters given information on density fields. We leverage diffusion generative models to address two tasks of importance to cosmology -- as an emulator for cold dark matter density fields conditional on input cosmological parameters $Ω_m$ and $σ_8$, and as a parameter inference model that can return constraints on the cosmological parameters of an input field. We show that the model is able to generate fields with power spectra that are consistent with those of the simulated target distribution, and capture the subtle effect of each parameter on modulations in the power spectrum. We additionally explore their utility as parameter inference models and find that we can obtain tight constraints on cosmological parameters.
Concepts
The Big Picture
Imagine trying to understand the entire universe by running a single physics experiment, except each “experiment” costs millions of CPU hours and weeks of supercomputer time. That’s the reality facing modern cosmologists. To test theories about how matter clumped together after the Big Bang, physicists run N-body simulations: calculations that track how gravity pulls dark matter into the vast cosmic web of filaments, voids, and galaxy clusters we observe today.
The problem? You can only run a handful of these simulations in a lifetime of research. And the universe has a staggering number of knobs to turn.
Enter the emulator: a shortcut model that predicts what a simulation would produce at a fraction of the cost. The cosmic structures these simulations produce aren’t smooth or regular. They carry complex, irregular patterns that encode physical signatures. If your emulator gets those patterns even slightly wrong, any scientific conclusions built on it could be misleading.
A team from Harvard and MIT’s IAIFI has shown that diffusion models, the same class of AI behind image generators like Stable Diffusion, can faithfully emulate cosmological density fields. They can also run the problem in reverse: given an observed patch of the universe, the model works backward to pin down the fundamental constants governing how matter distributes itself across the cosmos.
The punchline: a single diffusion model can both generate realistic dark matter density fields conditioned on cosmological parameters and infer those parameters from new fields. Two tasks that usually require separate pipelines, handled by one network.
How It Works
The team trained on the CAMELS Multifield Dataset, a collection of cold dark matter density fields (maps of how invisible dark matter distributes across space) drawn from IllustrisTNG simulations. The dataset spans 1,000 points in parameter space, with 15 field realizations each. Every field covers a 25 h⁻¹ Mpc patch of the universe (on the order of a hundred million light-years across) at redshift z = 0: a snapshot of the cosmos today.
The core architecture is a denoising diffusion probabilistic model (DDPM), a neural network trained to reverse a gradual noising process. Think of it like teaching a network to restore a painting from static. You corrupt the original image with noise in small, controlled steps, then train the model to undo each step. At inference time, you start from pure noise and let the model sculpt it into a realistic cosmic field.
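The forward noising step has a closed form, which is what makes training efficient. Below is a minimal sketch of that step, assuming the standard linear beta schedule from Ho et al.'s DDPM formulation; the schedule values and field shapes are illustrative, not the paper's exact settings.

```python
import numpy as np

# Standard DDPM linear beta schedule (illustrative values).
T = 1000
betas = np.linspace(1e-4, 0.02, T)
alphas = 1.0 - betas
alpha_bar = np.cumprod(alphas)  # cumulative signal retention per timestep

def noise_field(x0, t, rng):
    """Jump from a clean field x0 to its noised version x_t in one closed-form step."""
    eps = rng.standard_normal(x0.shape)
    xt = np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * eps
    return xt, eps  # the network is trained to predict eps given (xt, t)

rng = np.random.default_rng(0)
x0 = rng.standard_normal((64, 64))  # stand-in for a density field
xt, eps = noise_field(x0, T - 1, rng)
# By the final timestep almost no signal remains, so sampling can start
# from pure Gaussian noise and denoise step by step:
print(np.sqrt(alpha_bar[-1]))  # ~0.006: x_T is essentially pure noise
```

Training minimizes the mean squared error between `eps` and the network's prediction; generation runs the learned denoising steps in reverse, from noise back to a field.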

Two design choices made the physics work:
- Circular convolutions in the downsampling layers wrap computations around the map’s edges, matching the periodic boundary conditions of cosmological simulations (the field wraps around like a video game map). Standard convolutions would introduce spurious edge artifacts.
- Conditioning on cosmological parameters: the matter density Ωm (how much matter exists relative to the total energy content) and the amplitude of density fluctuations σ8 (how “lumpy” the universe is on large scales). Each ResNet block in the U-Net receives the parameter vector as input, so the model learns how large-scale structure shifts as you dial these values.
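The circular-convolution idea is easy to see in isolation. The sketch below implements a 2D convolution with wrap-around ("periodic") padding in plain numpy; it is a toy illustration of the boundary handling, not the paper's U-Net (in a framework like PyTorch this is a padding-mode option on the convolution layer).

```python
import numpy as np

def circular_conv2d(field, kernel):
    """Convolve with periodic boundaries: edge pixels see neighbors
    from the opposite side of the box, matching a periodic simulation."""
    k = kernel.shape[0] // 2
    padded = np.pad(field, k, mode="wrap")  # wrap edges instead of zero-padding
    H, W = field.shape
    out = np.empty((H, W))
    for i in range(H):
        for j in range(W):
            out[i, j] = np.sum(padded[i:i + 2*k + 1, j:j + 2*k + 1] * kernel)
    return out

field = np.arange(16.0).reshape(4, 4)
avg = np.ones((3, 3)) / 9.0
out = circular_conv2d(field, avg)
# With wrap padding, an averaging kernel preserves the field's mean exactly;
# zero padding would darken the edges and bias statistics near the boundary.
print(np.isclose(out.mean(), field.mean()))  # True
```

The conditioning pathway is separate: the parameter vector (Ωm, σ8) is fed into every ResNet block, so boundary handling and parameter dependence are independent design choices.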
Training proceeded in two stages. First, train on downsampled 64×64 images for 60,000 iterations to learn coarse structure. Then initialize a 256×256 model with those weights and continue for over 500,000 iterations. This curriculum sped up convergence on full-resolution fields.
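The first stage needs low-resolution versions of the training fields. A simple way to produce them, sketched below under the assumption of plain block averaging (the paper's exact downsampling pipeline may differ), takes each 256×256 map to 64×64:

```python
import numpy as np

def block_downsample(field, factor):
    """Average non-overlapping factor x factor blocks to shrink a 2D field."""
    H, W = field.shape
    return field.reshape(H // factor, factor, W // factor, factor).mean(axis=(1, 3))

hi = np.random.default_rng(1).standard_normal((256, 256))
lo = block_downsample(hi, 4)
print(lo.shape)  # (64, 64)
```

Because the U-Net is convolutional, weights learned at 64×64 can initialize the 256×256 model directly, which is what makes the curriculum a warm start rather than a restart.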
To verify quality, the team measured a reduced chi-squared statistic on the power spectrum (how much structure exists at each size scale) across 10 validation parameter points. The generated fields' power spectra matched the simulated distribution closely for three validation parameter sets, with z-scores near zero across nearly all wavenumber bins. At its best, the model achieved a mean reduced chi-squared of 1.30, versus 1.27 for actual simulation samples; a perfect match scores 1.0. By this measure, the generated fields are statistically indistinguishable from real simulation outputs.
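The validation statistic can be sketched in a few lines: take the 2D Fourier transform of a field, radially bin the squared amplitudes into a power spectrum, and compare generated and reference spectra with a reduced chi-squared. Bin edges, normalization, and the error model below are illustrative choices, not the paper's exact pipeline.

```python
import numpy as np

def power_spectrum(field, nbins=16):
    """Radially binned 2D power spectrum (illustrative normalization)."""
    H, W = field.shape
    pk2d = np.abs(np.fft.fftn(field)) ** 2 / (H * W)
    kx = np.fft.fftfreq(H)[:, None]
    ky = np.fft.fftfreq(W)[None, :]
    kmag = np.sqrt(kx**2 + ky**2)
    edges = np.linspace(0.0, kmag.max(), nbins + 1)
    idx = np.digitize(kmag.ravel(), edges) - 1
    flat = pk2d.ravel()
    return np.array([flat[idx == b].mean() if np.any(idx == b) else 0.0
                     for b in range(nbins)])

def reduced_chi2(pk_gen, pk_ref, sigma_ref):
    """Mean squared deviation between spectra, in units of the reference scatter."""
    return np.mean(((pk_gen - pk_ref) / sigma_ref) ** 2)

field = np.random.default_rng(3).standard_normal((64, 64))
pk = power_spectrum(field)  # white noise gives a roughly flat spectrum
```

A value near 1 means the generated spectra scatter around the reference by about as much as the simulations scatter among themselves.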

The model also learns each parameter's distinct signature. Increasing Ωm boosts power across all scales and shifts the pixel value distribution. Changes in σ8 affect only large scales. These are physically meaningful distinctions the model learned without being explicitly told.
For parameter inference, the team used the variational lower bound (VLB) on the log likelihood, a score that diffusion models compute as a byproduct of training. It measures how well a given parameter set explains an observed field. Scanning over parameter space and finding the maximum yields a posterior estimate: which parameter values best explain the data. The constraints were tight, and no additional training was required.
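The scan-and-maximize idea is independent of the diffusion machinery, so it can be illustrated with a toy stand-in: here a Gaussian log-likelihood whose variance tracks a hypothetical σ8 plays the role of the VLB, and we scan a 1D grid rather than the full (Ωm, σ8) plane.

```python
import numpy as np

def score(field, sigma8):
    """Toy stand-in for the VLB: Gaussian log-likelihood with
    parameter-dependent variance (illustrative, not the paper's model)."""
    var = sigma8**2
    return -0.5 * np.sum(field**2 / var + np.log(2 * np.pi * var))

rng = np.random.default_rng(2)
true_sigma8 = 0.8
observed = rng.normal(0.0, true_sigma8, size=(64, 64))  # mock "observed" field

# Scan the parameter grid and keep the value that best explains the data.
grid = np.linspace(0.6, 1.0, 41)
scores = np.array([score(observed, s) for s in grid])
best = grid[np.argmax(scores)]
print(best)  # close to 0.8 for this realization
```

In the paper, `score` is replaced by the diffusion model's VLB evaluated with the candidate parameters as conditioning input, so the same trained network that generates fields also scores them.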
Why It Matters
Cosmology is entering a data-rich era. Surveys like DESI, Euclid, and the Rubin Observatory’s LSST are mapping the universe at enormous scale. Extracting parameter constraints from these maps means comparing observations to theoretical predictions, and that comparison is only as good as your emulator. If the emulator misses subtle correlations, you get biased parameter estimates.
Diffusion models capture the full distribution of field statistics, which makes them a natural fit for this problem.
The real selling point is the dual use. Emulation and inference are traditionally separate pipelines with separate models. Here, one trained network handles both directions: generate fields from parameters, or infer parameters from fields. That could simplify cosmological analysis considerably, especially as field-level inference (analyzing the full density map rather than compressing it to summary statistics) becomes standard practice.
Open questions remain. How well does the model generalize beyond its training parameter range? Can it extend to three-dimensional fields or baryonic matter, which is messier to simulate? Can the inference approach compete with dedicated methods like neural posterior estimation? This paper is a starting point for all three.
The takeaway: diffusion models trained on cosmological simulations can generate statistically faithful dark matter density fields and recover tight cosmological parameter constraints, doing with one neural network what previously required two separate modeling pipelines.
IAIFI Research Highlights
This work connects generative AI and observational cosmology, showing that the diffusion model architecture powering modern image synthesis can encode the physical statistics of the cosmic web with high fidelity.
Conditional diffusion models can function simultaneously as scientific emulators and probabilistic inference engines, a use case far removed from creative image generation.
By accurately emulating cold dark matter density fields and recovering tight constraints on Ωm and σ8, this approach could enable field-level cosmological inference from next-generation surveys.
Future work could extend this framework to 3D fields, baryonic tracers, and larger parameter spaces. The paper was presented at the Machine Learning and the Physical Sciences Workshop at NeurIPS 2023. [arXiv:2312.07534](https://arxiv.org/abs/2312.07534)
Original Paper Details
Cosmological Field Emulation and Parameter Inference with Diffusion Models
arXiv:2312.07534
Nayantara Mudur, Carolina Cuesta-Lazaro, Douglas P. Finkbeiner
Cosmological simulations play a crucial role in elucidating the effect of physical parameters on the statistics of fields and on constraining parameters given information on density fields. We leverage diffusion generative models to address two tasks of importance to cosmology -- as an emulator for cold dark matter density fields conditional on input cosmological parameters $Ω_m$ and $σ_8$, and as a parameter inference model that can return constraints on the cosmological parameters of an input field. We show that the model is able to generate fields with power spectra that are consistent with those of the simulated target distribution, and capture the subtle effect of each parameter on modulations in the power spectrum. We additionally explore their utility as parameter inference models and find that we can obtain tight constraints on cosmological parameters.