AllShowers: One model for all calorimeter showers
- URL: http://arxiv.org/abs/2601.11716v1
- Date: Fri, 16 Jan 2026 19:09:57 GMT
- Title: AllShowers: One model for all calorimeter showers
- Authors: Thorsten Buss, Henry Day-Hall, Frank Gaede, Gregor Kasieczka, Katja Krüger,
- Abstract summary: AllShowers is a unified generative model that simulates calorimeter showers across multiple particle types.<n>It is trained on a diverse dataset of simulated showers in the highly granular ILD detector.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate and efficient detector simulation is essential for modern collider experiments. To reduce the high computational cost, various fast machine learning surrogate models have been proposed. Traditional surrogate models for calorimeter shower modeling train separate networks for each particle species, limiting scalability and reuse. We introduce AllShowers, a unified generative model that simulates calorimeter showers across multiple particle types using a single generative model. AllShowers is a continuous normalizing flow model with a Transformer architecture, enabling it to generate complex spatial and energy correlations in variable-length point cloud representations of showers. Trained on a diverse dataset of simulated showers in the highly granular ILD detector, the model demonstrates the ability to generate realistic showers for electrons, photons, and charged and neutral hadrons across a wide range of incident energies and angles without retraining. In addition to unifying shower generation for multiple particle types, AllShowers surpasses the fidelity of previous single-particle-type models for hadronic showers. Key innovations include the use of a layer embedding, allowing the model to learn all relevant calorimeter layer properties; a custom attention masking scheme to reduce computational demands and introduce a helpful inductive bias; and a shower- and layer-wise optimal transport mapping to improve training convergence and sample quality. AllShowers marks a significant step towards a universal model for calorimeter shower simulations in collider experiments.
Related papers
- CaloHadronic: a diffusion model for the generation of hadronic showers [0.0]
Simulating showers of particles in highly-granular calorimeters is a key frontier in the application of machine learning to particle physics.<n>Recent developments have shown how diffusion based generative shower simulation approaches are very efficient.<n>This is the first time that machine learning methods are used to holistically generate showers across the electromagnetic and hadronic calorimeter.
arXiv Detail & Related papers (2025-06-26T19:12:44Z) - Models of Heavy-Tailed Mechanistic Universality [62.107333654304014]
We propose a family of random matrix models to explore attributes that give rise to heavy-tailed behavior in trained neural networks.<n>Under this model, spectral densities with power laws on tails arise through a combination of three independent factors.<n> Implications of our model on other appearances of heavy tails, including neural scaling laws, trajectories, and the five-plus-one phases of neural network training, are discussed.
arXiv Detail & Related papers (2025-06-04T00:55:01Z) - CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation [22.42342223406944]
We present the results of the "Fast Calorimeter Simulation Challenge 2022" - the CaloChallenge.
We study state-of-the-art generative models on four calorimeter shower datasets of increasing dimensionality.
arXiv Detail & Related papers (2024-10-28T23:28:07Z) - Energy-Based Diffusion Language Models for Text Generation [126.23425882687195]
Energy-based Diffusion Language Model (EDLM) is an energy-based model operating at the full sequence level for each diffusion step.<n>Our framework offers a 1.3$times$ sampling speedup over existing diffusion models.
arXiv Detail & Related papers (2024-10-28T17:25:56Z) - Comparison of Point Cloud and Image-based Models for Calorimeter Fast
Simulation [48.26243807950606]
Two state-of-the-art score based models are trained on the same set of calorimeter simulation and directly compared.
generative models are a new class of generative models that have been shown to accurately generate high dimensional calorimeter datasets.
arXiv Detail & Related papers (2023-07-10T08:20:45Z) - CaloClouds: Fast Geometry-Independent Highly-Granular Calorimeter
Simulation [0.0]
Simulating showers of particles in highly-granular detectors is a key frontier in the application of machine learning to particle physics.
This work achieves a major breakthrough by, for the first time, directly generating a point cloud of a few thousand space points with energy depositions in the detector in 3D space without relying on a fixed-grid structure.
arXiv Detail & Related papers (2023-05-08T16:44:15Z) - Geometry-aware Autoregressive Models for Calorimeter Shower Simulations [6.01665219244256]
We develop a geometry-aware autoregressive model on a range of calorimeter geometries.
This is a key proof-of-concept step towards building a model that can generalize to new unseen calorimeter geometries.
Such a model can replace the hundreds of generative models used for calorimeter simulation in a Large Hadron Collider experiment.
arXiv Detail & Related papers (2022-12-16T01:45:17Z) - CaloMan: Fast generation of calorimeter showers with density estimation
on learned manifolds [10.089611750812391]
Most computationally expensive simulations involve calorimeter showers.
Deep generative modelling has opened the possibility of generating realistic calorimeter showers orders of magnitude more quickly than physics-based simulation.
We propose modelling calorimeter showers first by learning their manifold structure, and then estimating the density of data across this manifold.
arXiv Detail & Related papers (2022-11-23T19:00:03Z) - Conditional Generative Models for Simulation of EMG During Naturalistic
Movements [45.698312905115955]
We present a conditional generative neural network trained adversarially to generate motor unit activation potential waveforms.
We demonstrate the ability of such a model to predictively interpolate between a much smaller number of numerical model's outputs with a high accuracy.
arXiv Detail & Related papers (2022-11-03T14:49:02Z) - MoEfication: Conditional Computation of Transformer Models for Efficient
Inference [66.56994436947441]
Transformer-based pre-trained language models can achieve superior performance on most NLP tasks due to large parameter capacity, but also lead to huge computation cost.
We explore to accelerate large-model inference by conditional computation based on the sparse activation phenomenon.
We propose to transform a large model into its mixture-of-experts (MoE) version with equal model size, namely MoEfication.
arXiv Detail & Related papers (2021-10-05T02:14:38Z) - Physics-informed CoKriging model of a redox flow battery [68.8204255655161]
Redox flow batteries (RFBs) offer the capability to store large amounts of energy cheaply and efficiently.
There is a need for fast and accurate models of the charge-discharge curve of a RFB to potentially improve the battery capacity and performance.
We develop a multifidelity model for predicting the charge-discharge curve of a RFB.
arXiv Detail & Related papers (2021-06-17T00:49:55Z) - Machine learning for rapid discovery of laminar flow channel wall
modifications that enhance heat transfer [56.34005280792013]
We present a combination of accurate numerical simulations of arbitrary, flat, and non-flat channels and machine learning models predicting drag coefficient and Stanton number.
We show that convolutional neural networks (CNN) can accurately predict the target properties at a fraction of the time of numerical simulations.
arXiv Detail & Related papers (2021-01-19T16:14:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.