Local Function Complexity for Active Learning via Mixture of Gaussian
Processes
- URL: http://arxiv.org/abs/1902.10664v6
- Date: Tue, 12 Dec 2023 09:24:32 GMT
- Title: Local Function Complexity for Active Learning via Mixture of Gaussian
Processes
- Authors: Danny Panknin, Stefan Chmiela, Klaus-Robert Müller, Shinichi
Nakajima
- Abstract summary: Inhomogeneities in real-world data, due to changes in the observation noise level or variations in the structural complexity of the source function, pose a unique set of challenges for statistical inference.
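In this paper, we draw on recent theoretical results on the estimation of local function complexity (LFC), derived from the domain of local polynomial smoothing (LPS), to establish a notion of local structural complexity, which is used to develop a model-agnostic active learning (AL) framework.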
We derive and estimate the Gaussian process regression (GPR)-based analog of the LPS-based LFC and use it as a substitute in the above framework to make it robust and scalable.
- Score: 5.382740428160009
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Inhomogeneities in real-world data, e.g., due to changes in the observation
noise level or variations in the structural complexity of the source function,
pose a unique set of challenges for statistical inference. Accounting for them
can greatly improve predictive power when physical resources or computation
time is limited. In this paper, we draw on recent theoretical results on the
estimation of local function complexity (LFC), derived from the domain of local
polynomial smoothing (LPS), to establish a notion of local structural
complexity, which is used to develop a model-agnostic active learning (AL)
framework. Due to its reliance on pointwise estimates, the LPS model class is
not robust and scalable concerning large input space dimensions that typically
come along with real-world problems. Here, we derive and estimate the Gaussian
process regression (GPR)-based analog of the LPS-based LFC and use it as a
substitute in the above framework to make it robust and scalable. We assess the
effectiveness of our LFC estimate in an AL application on a prototypical
low-dimensional synthetic dataset, before taking on the challenging real-world
task of reconstructing a quantum chemical force field for a small organic
molecule and demonstrating state-of-the-art performance with a significantly
reduced training demand.
Related papers
- Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization [15.898378661128334]
Reinforcement Learning (RL) algorithms are known to suffer from the curse of dimensionality.
We propose overcoming the curse of dimensionality by approximately factorizing the original Markov decision processes (MDPs) into smaller, independently evolving MDPs.
We provide improved sample complexity guarantees for both proposed algorithms.
arXiv Detail & Related papers (2024-11-12T07:08:00Z)
- A domain decomposition-based autoregressive deep learning model for unsteady and nonlinear partial differential equations [2.7755345520127936]
We propose a domain-decomposition-based deep learning (DL) framework, named CoMLSim, for accurately modeling unsteady and nonlinear partial differential equations (PDEs).
The framework consists of two key components: (a) a convolutional neural network (CNN)-based autoencoder architecture and (b) an autoregressive model composed of fully connected layers.
arXiv Detail & Related papers (2024-08-26T17:50:47Z)
- Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement [67.1393112206885]
Large Language Models (LLMs) have shown promise as intelligent agents in interactive decision-making tasks.
We introduce Entropy-Regularized Token-level Policy Optimization (ETPO), an entropy-augmented RL method tailored for optimizing LLMs at the token level.
We assess the effectiveness of ETPO within a simulated environment that models data science code generation as a series of multi-step interactive tasks.
arXiv Detail & Related papers (2024-02-09T07:45:26Z)
- Efficient Model-Free Exploration in Low-Rank MDPs [76.87340323826945]
Low-Rank Markov Decision Processes offer a simple, yet expressive framework for RL with function approximation.
Existing algorithms are either (1) computationally intractable, or (2) reliant upon restrictive statistical assumptions.
We propose the first provably sample-efficient algorithm for exploration in Low-Rank MDPs.
arXiv Detail & Related papers (2023-07-08T15:41:48Z)
- A Neuromorphic Architecture for Reinforcement Learning from Real-Valued Observations [0.34410212782758043]
Reinforcement Learning (RL) provides a powerful framework for decision-making in complex environments.
This paper presents a novel Spiking Neural Network (SNN) architecture for solving RL problems with real-valued observations.
arXiv Detail & Related papers (2023-07-06T12:33:34Z)
- Complex-valued Adaptive System Identification via Low-Rank Tensor Decomposition [3.268878947476012]
In this work we derive two new architectures to allow the processing of complex-valued signals.
We show that these extensions are able to surpass the trivial, complex-valued extension of the original architecture in terms of performance.
arXiv Detail & Related papers (2023-06-28T07:01:08Z)
- ETLP: Event-based Three-factor Local Plasticity for online learning with neuromorphic hardware [105.54048699217668]
We show competitive accuracy with a clear advantage in computational complexity for Event-Based Three-factor Local Plasticity (ETLP).
We also show that, when using local plasticity, threshold adaptation in spiking neurons and a recurrent topology are necessary to learn temporal patterns with a rich temporal structure.
arXiv Detail & Related papers (2023-01-19T19:45:42Z)
- Clustered Federated Learning via Generalized Total Variation Minimization [83.26141667853057]
We study optimization methods to train local (or personalized) models for local datasets with a decentralized network structure.
Our main conceptual contribution is to formulate federated learning as generalized total variation (GTV) minimization.
Our main algorithmic contribution is a fully decentralized federated learning algorithm.
arXiv Detail & Related papers (2021-05-26T18:07:19Z)
- Reinforcement Learning for Adaptive Mesh Refinement [63.7867809197671]
We propose a novel formulation of AMR as a Markov decision process and apply deep reinforcement learning to train refinement policies directly from simulation.
The model sizes of these policy architectures are independent of the mesh size and hence scale to arbitrarily large and complex simulations.
arXiv Detail & Related papers (2021-03-01T22:55:48Z)
- Multiplicative noise and heavy tails in stochastic optimization [62.993432503309485]
Empirical optimization is central to modern machine learning, but its role in the success of machine learning is still unclear.
We show that multiplicative noise, as it commonly arises due to variance, results in heavy-tailed behaviour in the parameters.
A detailed analysis is conducted in which we describe key factors, including step size and data, all of which exhibit similar results on state-of-the-art neural network models.
arXiv Detail & Related papers (2020-06-11T09:58:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.