← Back to Timeline

Bridging Simulations and EFT: A Hybrid Model of the Lyman-Alpha Forest Field

Theoretical Physics

Authors

Roger de Belsunce, Boryana Hadzhiyska, Mikhail M. Ivanov

Abstract

The Lyman-alpha (Lya) forest is a unique probe of cosmology and the intergalactic medium at high redshift and small scales. The statistical power of the ongoing Dark Energy Spectroscopic Instrument (DESI) demands precise theoretical tools to model the Lya forest. We present a hybrid effective field theory (HEFT) forward model in redshift space that leverages the accuracy of non-linear particle displacements computed using the N-body simulation suite AbacusSummit with the predictive power of an analytical, perturbative bias forward model in the framework of the effective field theory (EFT). The residual noise between the model and the simulated Lya field has a nearly white (scale-and orientation-independent) power spectrum on quasi-linear scales, substantially simplifying its modeling compared to a purely perturbative description. As a consequence of the improved control over the 3D Lya forest stochasticity, we find agreement between the modeled and the true power spectra at the 5 per cent level down to scales of k <= 1 h/Mpc. This procedure offers a promising path toward constructing efficient and accurate emulators to predict large-scale clustering summary statistics for full-shape cosmological analyses of Lya forest data from both DESI and its successor, DESI-II.

Concepts

effective field theory lyman-alpha forest cosmological simulation lagrangian methods bias expansion stochastic processes emulation surrogate modeling spectral methods redshift-space distortions dark energy bayesian inference dark matter

The Big Picture

The skeleton of the universe is made of vast filaments and empty voids strung across billions of light-years. One of the best tools for mapping it: ancient light from distant quasars. As this light travels toward us, it passes through enormous clouds of hydrogen gas between galaxies, leaving dark absorption lines in the spectrum. Astronomers call this the Lyman-alpha (Ly-α) forest, and it carries signatures of large-scale structure, neutrino masses, and dark matter.

The Dark Energy Spectroscopic Instrument (DESI) is gathering Ly-α forest data at a scale that outpaces our theoretical models. DESI’s precision targets demand models that are both physically accurate and computationally affordable, but current approaches force a trade-off. Full physics simulations faithfully reproduce the underlying gas dynamics but eat enormous computing resources. Simpler mathematical models break down at the small scales where the forest is richest in information.

Roger de Belsunce, Boryana Hadzhiyska, and Mikhail Ivanov have developed a hybrid approach that threads this needle, achieving 5% agreement with simulated Ly-α fields down to scales of k ≤ 1 h/Mpc (higher k means finer spatial detail). Purely perturbative methods can’t touch that.

Key Insight: Combining non-linear particle dynamics from N-body simulations with a perturbative bias model in the Effective Field Theory framework, the residual “noise” between model and reality becomes nearly white (scale- and direction-independent). That simplifies the modeling problem considerably.

How It Works

The central idea is modular. Effective Field Theory of Large-Scale Structure (EFT) describes how matter clusters by treating complex small-scale physics as a systematic set of corrections. It’s powerful and predictive, but it relies on the Zel’dovich approximation, which treats gravitational trajectories as simple straight-line extrapolations. At small scales, where the full complexity of gravitational dynamics kicks in, this breaks down.

The fix: steal the hard part from simulations. The team’s Hybrid EFT (HEFT) model takes particle trajectories computed by AbacusSummit, a large N-body simulation suite that tracks how millions of particles move under gravity. On top of those trajectories, it layers a systematic mathematical description of how the particles trace the Ly-α forest. Each particle gets a weight determined by its local environment in the initial conditions, capturing:

  • Local density bias (β₁, β₂): how the forest traces the underlying matter density
  • Tidal shear terms (βs, βt): how the shape of the local gravitational environment affects absorption
  • Velocity gradient terms (βcb, βv): corrections from redshift-space distortions, where peculiar velocities (motions beyond simple cosmic expansion) shift spectral lines and smear apparent positions along the line of sight

Once each particle has its weight, the model “paints” the Ly-α field onto a grid by depositing these weights at the particles’ final positions. The bias parameters are then fit by minimizing the difference between the modeled field and the simulated reference.

A useful trick makes the comparison sharper: cosmic variance cancellation. Because the model and reference simulation share the same initial conditions, fluctuations common to both cancel out. The comparison becomes sensitive to genuine modeling errors rather than random cosmic scatter.

The residuals tell the story. The difference between the HEFT model and the true simulated field has a nearly white power spectrum, roughly constant across all scales and directions out to k ~ 1 h/Mpc. A purely perturbative EFT approach, by contrast, produces residuals that grow rapidly at small scales and develop strong anisotropy. It misses coherent small-scale flows that the N-body simulation captures correctly.

Why does a white residual matter so much? Because it can be modeled with just one or two free parameters instead of a complicated scale- and angle-dependent function. Fitting a flat line is much easier than fitting a wiggly curve to noisy data.

Why It Matters

The most direct payoff is a cleaner path to full-shape analyses of the 3D Ly-α forest power spectrum, extracting more cosmological information rather than measuring only specific features. Current DESI Ly-α analyses rely on curve-fitting formulas calibrated on gas simulations, and these can bias BAO (Baryon Acoustic Oscillation) measurements (the “standard ruler” imprint of sound waves from the early universe) at a level comparable to DESI Year 5 statistical errors. That can’t be ignored.

The EFT framework corrects this bias, but its reach has been limited by messy stochastic residuals that vary with scale and direction. HEFT tames that stochasticity.

Fast emulators are another payoff: surrogate models that predict Ly-α forest clustering statistics across a grid of cosmological parameters at a fraction of the simulation cost. The team plans to extend the approach to cross-correlations between the Ly-α forest and other tracers like high-redshift galaxies and quasars, where multi-tracer analyses can break degeneracies between cosmological parameters. With WEAVE-QSO, the Prime Focus Spectrograph, and 4MOST all coming online this decade, these tools are needed soon.

HEFT was originally developed for galaxy clustering. This paper extends it to a fundamentally different tracer, one defined by gas absorption rather than discrete objects. The same techniques could apply to any diffuse tracer of large-scale structure where discrete-object biasing falls short.

Bottom Line: By marrying N-body simulation dynamics with perturbative EFT bias modeling, this hybrid approach achieves 5% accuracy in the Ly-α forest power spectrum down to k ≤ 1 h/Mpc, with a nearly white residual that simplifies the road to precision cosmology with DESI.


IAIFI Research Highlights

Interdisciplinary Research Achievement
This work combines N-body machinery from numerical cosmology with the theoretical rigor of EFT. Neither field gets here alone, making it a natural fit for IAIFI's cross-disciplinary environment.
Impact on Artificial Intelligence
HEFT lays the groundwork for fast ML surrogate emulators that can stand in for expensive simulations in cosmological inference pipelines, making parameter estimation over large datasets computationally tractable.
Impact on Fundamental Interactions
By controlling Ly-α forest stochasticity at the field level, this approach enables unbiased BAO and broadband power spectrum measurements from DESI, tightening constraints on the universe's expansion history and neutrino masses.
Outlook and References
The team plans to develop full-shape Ly-α forest emulators targeting DESI and DESI-II, with extensions to multi-tracer cross-correlations; full results appear at [arXiv:2512.13681](https://arxiv.org/abs/2512.13681).

Original Paper Details

Title
Bridging Simulations and EFT: A Hybrid Model of the Lyman-Alpha Forest Field
arXiv ID
2512.13681
Authors
Roger de Belsunce, Boryana Hadzhiyska, Mikhail M. Ivanov
Abstract
The Lyman-alpha (Lya) forest is a unique probe of cosmology and the intergalactic medium at high redshift and small scales. The statistical power of the ongoing Dark Energy Spectroscopic Instrument (DESI) demands precise theoretical tools to model the Lya forest. We present a hybrid effective field theory (HEFT) forward model in redshift space that leverages the accuracy of non-linear particle displacements computed using the N-body simulation suite AbacusSummit with the predictive power of an analytical, perturbative bias forward model in the framework of the effective field theory (EFT). The residual noise between the model and the simulated Lya field has a nearly white (scale-and orientation-independent) power spectrum on quasi-linear scales, substantially simplifying its modeling compared to a purely perturbative description. As a consequence of the improved control over the 3D Lya forest stochasticity, we find agreement between the modeled and the true power spectra at the 5 per cent level down to scales of k <= 1 h/Mpc. This procedure offers a promising path toward constructing efficient and accurate emulators to predict large-scale clustering summary statistics for full-shape cosmological analyses of Lya forest data from both DESI and its successor, DESI-II.