Fast Low Energy Reconstruction using Convolutional Neural Networks
Authors
IceCube Collaboration
Abstract
IceCube is a Cherenkov detector instrumenting over a cubic kilometer of glacial ice deep under the surface of the South Pole. The DeepCore sub-detector lowers the detection energy threshold to a few GeV, enabling the precise measurements of neutrino oscillation parameters with atmospheric neutrinos. The reconstruction of neutrino interactions inside the detector is essential in studying neutrino oscillations. It is particularly challenging to reconstruct sub-100 GeV events with the IceCube detectors due to the relatively sparse detection units and detection medium. Convolutional neural networks (CNNs) are broadly used in physics experiments for both classification and regression purposes. This paper discusses the CNNs developed and employed for the latest IceCube-DeepCore oscillation measurements. These CNNs estimate various properties of the detected neutrinos, such as their energy, direction of arrival, interaction vertex position, flavor-related signature, and are also used for background classification.
Concepts
The Big Picture
Imagine trying to figure out the direction, speed, and identity of a ghost, not by seeing it, but by watching how scattered candles flicker as it passes through the room. That’s roughly the challenge facing physicists at the South Pole. The IceCube Neutrino Observatory reconstructs the properties of neutrinos: subatomic particles so elusive that trillions pass through your body every second without interacting.
Neutrinos come in three “flavors” and they oscillate, spontaneously changing type as they travel. Measuring how fast and completely they oscillate pins down fundamental parameters of the Standard Model of particle physics. But to make those measurements, you first need to know where a neutrino came from, how much energy it carried, and what type it was.
Deep in the Antarctic ice, that reconstruction is hard. IceCube’s latest work uses a suite of convolutional neural networks (CNNs), deep learning models originally developed to recognize patterns in photographs, now trained to extract neutrino properties from sparse light signals in glacial ice. The result: reconstruction that’s fast and accurate enough to handle precision oscillation measurements on millions of events.
Key Insight: By treating IceCube detector data as a structured 3D image, CNNs can simultaneously estimate neutrino energy, direction, vertex position, and flavor while rejecting background, all fast enough to run the full oscillation analysis pipeline.
How It Works
The IceCube detector buries 5,160 optical sensors in a cubic kilometer of Antarctic ice, arranged in vertical strings. When a neutrino collides with a water molecule, it produces a charged particle that emits Cherenkov radiation, a faint cone of blue light picked up by the sensors. The DeepCore sub-array occupies the densest, clearest region and pushes the detection threshold to just a few GeV, right where atmospheric neutrino oscillations are most pronounced.

Below 100 GeV, events produce only a handful of photons across a small number of sensors. Traditional reconstruction methods fit particle tracks and cascades analytically, and they become slow and unreliable at these low energies. The fix: reframe the problem as computer vision.
Each event is mapped onto a hexagonally-structured 3D grid that preserves the geometric layout of the detector strings. CNNs, built to find faint patterns in spatially-organized data, turn out to be well suited to this structure.
The architecture processes events through several layers of learned filters:
- Input representation — Raw photon arrival times and total charge at each Digital Optical Module (DOM) are encoded into image-like arrays that account for the hexagonal geometry of DeepCore strings.
- Convolutional layers — Successive 3D convolutions detect local patterns like clusters of hits and timing gradients, then build up to detector-scale features.
- Output heads — Separate branches predict reconstructed energy, zenith angle (the neutrino’s arrival direction measured from directly overhead), interaction vertex coordinates, a particle identification (PID) score distinguishing track-like events from cascade-like events, and a background classifier separating genuine neutrino interactions from atmospheric muons.

Track-like events come from muon neutrinos, which produce muons that leave long trails through the ice. Cascade-like events come from electron or tau neutrinos, which produce compact, roughly spherical bursts of light. Telling these apart is essential for oscillation measurements.
Speed was a hard constraint. Oscillation analyses require processing millions of simulated and real events. Likelihood-based methods can take seconds per event; the CNN reconstructions run orders of magnitude faster.

Training used detailed Monte Carlo simulations that mimic particle interactions and detector response, including photon propagation through inhomogeneous ice with depth-dependent scattering and absorption. Validation on real data confirmed that the networks generalize from simulation to observation, a notoriously tricky step in particle physics machine learning.
For energy estimation, the CNN achieves better resolution than previous methods at low energies, and directional reconstruction similarly outperforms classical fits. The PID network cleanly separates muon neutrinos from electron and tau neutrinos, a separation that the oscillation analysis depends on for measuring both disappearance and appearance channels.
Why It Matters
Neutrino oscillation measurements probe physics the Standard Model barely anticipated. The precise values of the mixing angles (quantum mechanical parameters governing how much one neutrino flavor mixes with another) connect to some of the biggest open questions in physics. Why is there more matter than antimatter? Is grand unification real? Are there new particles or forces we haven’t seen yet?
IceCube-DeepCore sits in a sweet spot. It measures atmospheric neutrinos at energies and distances where the dominant oscillation channel, muon neutrino disappearance, hits maximum effect. That gives it competitive sensitivity to the mixing angle θ₂₃ and mass-squared splitting Δm²₃₂, two of the most important parameters in the neutrino sector.
This CNN framework is not a minor upgrade. It is the backbone of IceCube’s current-generation oscillation analysis. Fast, accurate reconstruction makes full likelihood analyses feasible, working with the complete reconstructed phase space rather than a handful of binned observables.
As the proposed IceCube Upgrade adds densely-instrumented low-energy strings, CNN-based reconstruction will become even more central. The same approach could transfer to other sparse 3D Cherenkov detectors, from ORCA in the Mediterranean to future water-based neutrino telescopes. When detector geometry can be mapped to a structured grid, the full toolkit of computer vision is available for particle physics reconstruction.
Bottom Line: IceCube’s CNN-based reconstruction turns sparse light patterns in Antarctic ice into precise neutrino properties at speeds that make large-scale oscillation analyses practical. The paper is available at arXiv:2505.16777.
IAIFI Research Highlights
This work applies convolutional neural network architectures, originally developed for image recognition, directly to 3D particle detector data, making precision neutrino oscillation measurements possible where classical methods could not keep up.
Mapping irregular hexagonal detector geometry to structured CNN-compatible inputs, and training simultaneous multi-task networks for both regression and classification, shows how deep learning can handle non-standard spatial domains beyond natural images.
The CNNs power IceCube-DeepCore's latest measurements of mixing parameters θ₂₃ and Δm²₃₂, tightening constraints on one of the least understood sectors of the Standard Model.
Future work will extend these methods to the denser IceCube Upgrade geometry and deeper statistical analyses of the full reconstructed phase space. The paper is available at [arXiv:2505.16777](https://arxiv.org/abs/2505.16777).
Original Paper Details
Fast Low Energy Reconstruction using Convolutional Neural Networks
2505.16777
IceCube Collaboration
IceCube is a Cherenkov detector instrumenting over a cubic kilometer of glacial ice deep under the surface of the South Pole. The DeepCore sub-detector lowers the detection energy threshold to a few GeV, enabling the precise measurements of neutrino oscillation parameters with atmospheric neutrinos. The reconstruction of neutrino interactions inside the detector is essential in studying neutrino oscillations. It is particularly challenging to reconstruct sub-100 GeV events with the IceCube detectors due to the relatively sparse detection units and detection medium. Convolutional neural networks (CNNs) are broadly used in physics experiments for both classification and regression purposes. This paper discusses the CNNs developed and employed for the latest IceCube-DeepCore oscillation measurements. These CNNs estimate various properties of the detected neutrinos, such as their energy, direction of arrival, interaction vertex position, flavor-related signature, and are also used for background classification.