Information theory and discriminative sampling for model discovery
- URL: http://arxiv.org/abs/2512.16000v1
- Date: Wed, 17 Dec 2025 22:08:21 GMT
- Title: Information theory and discriminative sampling for model discovery
- Authors: Yuxuan Bao, J. Nathan Kutz
- Abstract summary: We leverage the Fisher Information Matrix (FIM) within the data-driven framework of sparse identification of nonlinear dynamics (SINDy). We visualize information patterns in chaotic and non-chaotic systems for both single trajectories and multiple initial conditions.
- Score: 2.47593085771929
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Fisher information and Shannon entropy are fundamental tools for understanding and analyzing dynamical systems from complementary perspectives. They can characterize unknown parameters by quantifying the information contained in variables, or measure how different initial trajectories or temporal segments of a trajectory contribute to learning or inferring system dynamics. In this work, we leverage the Fisher Information Matrix (FIM) within the data-driven framework of sparse identification of nonlinear dynamics (SINDy). We visualize information patterns in chaotic and non-chaotic systems for both single trajectories and multiple initial conditions, demonstrating how information-based analysis can improve sampling efficiency and enhance model performance by prioritizing more informative data. The benefits of statistical bagging are further elucidated through spectral analysis of the FIM. We also illustrate how Fisher information and entropy metrics can promote data efficiency in three scenarios: when only a single trajectory is available, when a tunable control parameter exists, and when multiple trajectories can be freely initialized. As data-driven model discovery continues to gain prominence, principled sampling strategies guided by quantifiable information metrics offer a powerful approach for improving learning efficiency and reducing data requirements.
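The core idea in the abstract can be illustrated with a minimal sketch (not the authors' code): for a SINDy-style regression dx/dt ≈ Θ(x)ξ with i.i.d. Gaussian measurement noise of variance σ², the Fisher Information Matrix of the coefficients ξ reduces to FIM = ΘᵀΘ/σ², and its eigenvalue spectrum indicates how well a data segment pins down the model parameters. The `sindy_fim` helper, the toy polynomial library, and the two example segments below are all hypothetical choices for illustration.

```python
# Hedged sketch: FIM of SINDy library coefficients under Gaussian noise,
# comparing an informative trajectory segment against an uninformative one.
import numpy as np

def sindy_fim(x, sigma=0.1):
    """FIM = theta^T theta / sigma^2 for a toy polynomial library (hypothetical helper)."""
    theta = np.column_stack([np.ones_like(x), x, x**2])  # library terms: [1, x, x^2]
    return theta.T @ theta / sigma**2

# Two segments of equal length: one sweeps the state space, one sits near a fixed point.
informative = np.linspace(-2.0, 2.0, 200)   # wide coverage of the state space
uninformative = np.full(200, 0.5)           # nearly constant state

fim_a = sindy_fim(informative)
fim_b = sindy_fim(uninformative)

# The smallest FIM eigenvalue bounds the worst-identified parameter direction
# (Cramér-Rao); the constant segment yields a rank-deficient library, so its
# smallest eigenvalue collapses toward zero.
print(np.linalg.eigvalsh(fim_a).min(), np.linalg.eigvalsh(fim_b).min())
```

Under this reading, a sampling strategy that prioritizes segments with a well-conditioned FIM spectrum is exactly the kind of "more informative data" selection the abstract describes.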
Related papers
- Flexible Gravitational-Wave Parameter Estimation with Transformers [73.44614054040267]
We introduce a flexible transformer-based architecture paired with a training strategy that enables adaptation to diverse analysis settings at inference time. We demonstrate that a single flexible model -- called Dingo-T1 -- can analyze 48 gravitational-wave events from the third LIGO-Virgo-KAGRA Observing Run.
arXiv Detail & Related papers (2025-12-02T17:49:08Z) - Fisher information flow in artificial neural networks [3.0053910105391264]
We present a method to monitor the flow of Fisher information through an ANN performing a parameter estimation task. We show that optimal estimation performance corresponds to the maximal transmission of Fisher information. This provides a model-free stopping criterion for network training, eliminating the need for a separate validation dataset.
arXiv Detail & Related papers (2025-09-02T15:17:42Z) - Laplace Sample Information: Data Informativeness Through a Bayesian Lens [13.319283849678234]
We propose Laplace Sample Information (LSI), a measure of sample informativeness grounded in information theory. We experimentally show that LSI is effective in ordering the data with respect to typicality, detecting mislabeled samples, measuring class-wise informativeness, and assessing dataset difficulty.
arXiv Detail & Related papers (2025-05-21T09:34:27Z) - Meta-Statistical Learning: Supervised Learning of Statistical Inference [59.463430294611626]
This work demonstrates that the tools and principles driving the success of large language models (LLMs) can be repurposed to tackle distribution-level tasks. We propose meta-statistical learning, a framework inspired by multi-instance learning that reformulates statistical inference tasks as supervised learning problems.
arXiv Detail & Related papers (2025-02-17T18:04:39Z) - Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining [55.262510814326035]
Existing reweighting strategies primarily focus on group-level data importance. We introduce novel algorithms for dynamic, instance-level data reweighting. Our framework allows us to devise reweighting strategies that deprioritize redundant or uninformative data.
arXiv Detail & Related papers (2025-02-10T17:57:15Z) - Capturing the Temporal Dependence of Training Data Influence [100.91355498124527]
We formalize the concept of trajectory-specific leave-one-out (LOO) influence, which quantifies the impact of removing a data point during training. We propose data value embedding, a novel technique enabling efficient approximation of trajectory-specific LOO. As data value embedding captures training data ordering, it offers valuable insights into model training dynamics.
arXiv Detail & Related papers (2024-12-12T18:28:55Z) - Dynamic Importance Learning using Fisher Information Matrix (FIM) for Nonlinear Dynamic Mapping [0.2455468619225742]
This work presents a methodology for dynamically determining relevance scores in black-box models. The proposed method leverages a gradient-based framework to uncover the importance of variance-driven features. The practical utility of this approach is showcased through its application to an industrial pH neutralization process.
arXiv Detail & Related papers (2024-06-08T08:12:41Z) - InVAErt networks: a data-driven framework for model synthesis and
identifiability analysis [0.0]
inVAErt is a framework for data-driven analysis and synthesis of physical systems.
It uses a deterministic decoder to represent the forward and inverse maps, a normalizing flow to capture the probabilistic distribution of system outputs, and a variational encoder to learn a compact latent representation that accounts for the lack of bijectivity between inputs and outputs.
arXiv Detail & Related papers (2023-07-24T07:58:18Z) - Neural FIM for learning Fisher Information Metrics from point cloud data [71.07939200676199]
We propose neural FIM, a method for computing the Fisher information metric (FIM) from point cloud data.
We demonstrate its utility in selecting parameters for the PHATE visualization method, as well as its ability to obtain local volume information that illuminates branching points and cluster centers in embeddings of a toy dataset and two single-cell datasets (iPSC reprogramming and PBMCs, i.e., immune cells).
arXiv Detail & Related papers (2023-06-01T17:36:13Z) - Using machine-learning modelling to understand macroscopic dynamics in a system of coupled maps [0.0]
We consider, as a case study, the macroscopic motion emerging from a system of globally coupled maps.
We build a coarse-grained Markov process for the macroscopic dynamics both with a machine learning approach and with a direct numerical computation of the transition probability of the coarse-grained process.
We are able to infer important information about the effective dimension of the attractor, the persistence of memory effects and the multi-scale structure of the dynamics.
arXiv Detail & Related papers (2020-11-08T15:38:12Z) - How Training Data Impacts Performance in Learning-based Control [67.7875109298865]
This paper derives an analytical relationship between the density of the training data and the control performance.
We formulate a quality measure for the data set, which we refer to as the $\rho$-gap.
We show how the $\rho$-gap can be applied to a feedback linearizing control law.
arXiv Detail & Related papers (2020-05-25T12:13:49Z)
This list is automatically generated from the titles and abstracts of the papers on this site.
This site does not guarantee the quality of its content (including all information) and is not responsible for any consequences of its use.