HEP-JEPA: A foundation model for collider physics using joint embedding predictive architecture
- URL: http://arxiv.org/abs/2502.03933v1
- Date: Thu, 06 Feb 2025 10:16:27 GMT
- Title: HEP-JEPA: A foundation model for collider physics using joint embedding predictive architecture
- Authors: Jai Bardhan, Radhikesh Agrawal, Abhiram Tilak, Cyrin Neeraj, Subhadip Mitra
- Abstract summary: We present a transformer architecture-based foundation model for tasks at high-energy particle colliders.
We train the model to classify jets using a self-supervised strategy inspired by the Joint Embedding Predictive Architecture.
Our model performs well on standard classification benchmark tasks across other datasets.
- Abstract: We present a transformer architecture-based foundation model for tasks at high-energy particle colliders such as the Large Hadron Collider. We train the model to classify jets using a self-supervised strategy inspired by the Joint Embedding Predictive Architecture. We use the JetClass dataset containing 100M jets of various known particles to pre-train the model with a data-centric approach -- the model uses a fraction of the jet constituents as the context to predict the embeddings of the unseen target constituents. Our pre-trained model performs well on standard classification benchmark tasks across other datasets. We test our model on two additional downstream tasks: top tagging and differentiating light-quark jets from gluon jets. We also evaluate our model with task-specific metrics and baselines and compare it with state-of-the-art models in high-energy physics. Project site: https://hep-jepa.github.io/
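The data-centric pre-training described in the abstract (use a fraction of the jet constituents as context, predict the embeddings of the held-out target constituents) can be sketched as follows. This is a toy illustration, not the paper's implementation: plain random linear projections stand in for the transformer encoder, target encoder, and predictor, and the function name `jepa_step` and the fraction `ctx_frac` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def jepa_step(constituents, ctx_frac=0.5, emb_dim=8):
    """One toy JEPA-style step: a random fraction of jet constituents serves
    as context, and the objective is to predict the *embeddings* of the
    unseen target constituents (prediction in latent space, not input space)."""
    n, d = constituents.shape
    n_ctx = max(1, int(ctx_frac * n))
    perm = rng.permutation(n)
    ctx, tgt = constituents[perm[:n_ctx]], constituents[perm[n_ctx:]]

    # Toy "networks": fixed random linear maps standing in for transformers.
    W_enc = rng.normal(size=(d, emb_dim))         # context encoder
    W_tgt = rng.normal(size=(d, emb_dim))         # target encoder
    W_pred = rng.normal(size=(emb_dim, emb_dim))  # predictor head

    ctx_summary = (ctx @ W_enc).mean(axis=0)           # pooled context embedding
    pred = np.tile(ctx_summary @ W_pred, (len(tgt), 1))
    target_emb = tgt @ W_tgt                           # embeddings to be predicted
    return float(np.mean((pred - target_emb) ** 2))    # embedding-space MSE

jet = rng.normal(size=(32, 4))  # 32 constituents with 4 features each
loss = jepa_step(jet)
```

In an actual JEPA setup the target encoder is typically an exponential moving average of the context encoder and receives no gradients; here both are frozen random maps purely to show the data flow.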
Related papers
- Aspen Open Jets: Unlocking LHC Data for Foundation Models in Particle Physics [0.5055815271772576]
We introduce the AspenOpenJets dataset, consisting of approximately 180M high $p_T$ jets derived from CMS 2016 Open Data.
We show how pre-training the OmniJet-$\alpha$ foundation model on AspenOpenJets improves performance on generative tasks with significant domain shift.
In addition to demonstrating the power of pre-training of a jet-based foundation model on actual proton-proton collision data, we provide the ML-ready derived AspenOpenJets dataset for further public use.
arXiv Detail & Related papers (2024-12-13T19:00:03Z) - Decomposing and Editing Predictions by Modeling Model Computation [75.37535202884463]
We introduce a task called component modeling.
The goal of component modeling is to decompose an ML model's prediction in terms of its components.
We present COAR, a scalable algorithm for estimating component attributions.
arXiv Detail & Related papers (2024-04-17T16:28:08Z) - OmniJet-$α$: The first cross-task foundation model for particle physics [0.0]
Foundation models are multi-dataset and multi-task machine learning methods that, once pre-trained, can be fine-tuned for a variety of downstream applications.
We report significant progress on this challenge on several fronts.
We demonstrate transfer learning between an unsupervised problem (jet generation) and a classic supervised task (jet tagging) with our new OmniJet-$\alpha$ model.
arXiv Detail & Related papers (2024-03-08T19:00:01Z) - FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects [55.77542145604758]
FoundationPose is a unified foundation model for 6D object pose estimation and tracking.
Our approach can be instantly applied at test-time to a novel object without fine-tuning.
arXiv Detail & Related papers (2023-12-13T18:28:09Z) - Flow Matching Beyond Kinematics: Generating Jets with Particle-ID and Trajectory Displacement Information [0.0]
We introduce the first generative model trained on the JetClass dataset.
Our model generates jets at the constituent level, and it is a permutation-equivariant continuous normalizing flow (CNF) trained with the flow matching technique.
For the first time, we also introduce a generative model that goes beyond the kinematic features of jet constituents.
arXiv Detail & Related papers (2023-11-30T19:00:02Z) - Towards Efficient Task-Driven Model Reprogramming with Foundation Models [52.411508216448716]
Vision foundation models exhibit impressive power, benefiting from the extremely large model capacity and broad training data.
However, in practice, downstream scenarios may only support a small model due to the limited computational resources or efficiency considerations.
This brings a critical challenge for the real-world application of foundation models: one has to transfer the knowledge of a foundation model to the downstream task.
arXiv Detail & Related papers (2023-04-05T07:28:33Z) - Particle Transformer for Jet Tagging [4.604003661048267]
We present JetClass, a new comprehensive dataset for jet tagging.
The dataset consists of 100M jets, about two orders of magnitude larger than existing public datasets.
We propose a new Transformer-based architecture for jet tagging, called Particle Transformer (ParT).
arXiv Detail & Related papers (2022-02-08T10:36:29Z) - Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency [114.02182755620784]
We present an end-to-end joint training framework that explicitly models 6-DoF motion of multiple dynamic objects, ego-motion and depth in a monocular camera setup without supervision.
Our framework is shown to outperform the state-of-the-art depth and motion estimation methods.
arXiv Detail & Related papers (2021-02-04T14:26:42Z) - Object Rearrangement Using Learned Implicit Collision Functions [61.90305371998561]
We propose a learned collision model that accepts scene and query object point clouds and predicts collisions for 6DOF object poses within the scene.
We leverage the learned collision model as part of a model predictive path integral (MPPI) policy in a tabletop rearrangement task.
The learned model outperforms both traditional pipelines and learned ablations by 9.8% in accuracy on a dataset of simulated collision queries.
arXiv Detail & Related papers (2020-11-21T05:36:06Z) - Model Reuse with Reduced Kernel Mean Embedding Specification [70.044322798187]
We present a two-phase framework for finding helpful models for a current application.
In the upload phase, when a model is uploading into the pool, we construct a reduced kernel mean embedding (RKME) as a specification for the model.
Then in the deployment phase, the relatedness of the current task and pre-trained models will be measured based on the value of the RKME specification.
arXiv Detail & Related papers (2020-01-20T15:15:07Z)
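The two-phase RKME framework in the last entry above can be illustrated with a simplified sketch. A caveat on fidelity: the actual RKME specification is a *reduced* set obtained by a weighted reduced-set optimization; this sketch substitutes plain empirical kernel mean embeddings and uses squared MMD as the relatedness measure. The names `mmd2`, `model_a`, and `model_b` are hypothetical.

```python
import numpy as np

def rbf(X, Y, gamma=1.0):
    """RBF kernel matrix: k(x, y) = exp(-gamma * ||x - y||^2)."""
    d2 = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def mmd2(X, Y, gamma=1.0):
    """Biased estimate of squared MMD between the empirical kernel mean
    embeddings of samples X and Y; small values mean related distributions."""
    return (rbf(X, X, gamma).mean() + rbf(Y, Y, gamma).mean()
            - 2.0 * rbf(X, Y, gamma).mean())

rng = np.random.default_rng(1)

# Upload phase: each pre-trained model in the pool is summarized by a small
# sample acting as its (here, non-reduced) kernel mean embedding specification.
specs = {
    "model_a": rng.normal(0.0, 1.0, size=(50, 3)),  # trained on data like the task
    "model_b": rng.normal(5.0, 1.0, size=(50, 3)),  # trained on shifted data
}

# Deployment phase: measure relatedness of the current task to each
# specification and pick the most related pre-trained model.
task = rng.normal(0.0, 1.0, size=(200, 3))
best = min(specs, key=lambda name: mmd2(task, specs[name]))
```

Here `best` selects `model_a`, whose specification sample comes from the same distribution as the current task, so its squared MMD to the task data is much smaller than `model_b`'s.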
This list is automatically generated from the titles and abstracts of the papers on this site.
This site does not guarantee the quality of the listed information and is not responsible for any consequences arising from its use.