Active recursive Bayesian inference using R\'enyi information measures
- URL: http://arxiv.org/abs/2004.03139v2
- Date: Wed, 10 Mar 2021 16:35:34 GMT
- Title: Active recursive Bayesian inference using R\'enyi information measures
- Authors: Yeganeh M. Marghi, Aziz Kocanaogullari, Murat Akcakaya, Deniz Erdogmus
- Abstract summary: We propose an active Bayesian inference framework with unified inference and query selection steps.
We analytically demonstrate that the proposed approach outperforms conventional methods such as mutual information.
We present empirical and experimental performance evaluations on two applications: restaurant recommendation and brain-computer interface (BCI) typing systems.
- Score: 11.1748531496641
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recursive Bayesian inference (RBI) provides optimal Bayesian latent variable
estimates in real-time settings with streaming noisy observations. Active RBI
attempts to effectively select queries that lead to more informative
observations to rapidly reduce uncertainty until a confident decision is made.
However, typically the optimality objectives of inference and query mechanisms
are not jointly selected. Furthermore, conventional active querying methods
stagger due to misleading prior information. Motivated by information theoretic
approaches, we propose an active RBI framework with unified inference and query
selection steps through Renyi entropy and $\alpha$-divergence. We also propose
a new objective based on Renyi entropy and its changes called Momentum that
encourages exploration for misleading prior cases. The proposed active RBI
framework is applied to the trajectory of the posterior changes in the
probability simplex that provides a coordinated active querying and decision
making with specified confidence. Under certain assumptions, we analytically
demonstrate that the proposed approach outperforms conventional methods such as
mutual information by allowing the selections of unlikely events. We present
empirical and experimental performance evaluations on two applications:
restaurant recommendation and brain-computer interface (BCI) typing systems.
Related papers
- Provably Efficient UCB-type Algorithms For Learning Predictive State
Representations [55.00359893021461]
The sequential decision-making problem is statistically learnable if it admits a low-rank structure modeled by predictive state representations (PSRs)
This paper proposes the first known UCB-type approach for PSRs, featuring a novel bonus term that upper bounds the total variation distance between the estimated and true models.
In contrast to existing approaches for PSRs, our UCB-type algorithms enjoy computational tractability, last-iterate guaranteed near-optimal policy, and guaranteed model accuracy.
arXiv Detail & Related papers (2023-07-01T18:35:21Z) - Inferring Preferences from Demonstrations in Multi-objective
Reinforcement Learning: A Dynamic Weight-based Approach [0.0]
In multi-objective decision-making, preference inference is the process of inferring the preferences of a decision-maker for different objectives.
This research proposes a Dynamic Weight-based Preference Inference algorithm that can infer the preferences of agents acting in multi-objective decision-making problems.
arXiv Detail & Related papers (2023-04-27T11:55:07Z) - On Pitfalls of $\textit{RemOve-And-Retrain}$: Data Processing Inequality
Perspective [5.8010446129208155]
This study scrutinizes the dependability of the RemOve-And-Retrain (ROAR) procedure, which is prevalently employed for gauging the performance of feature importance estimates.
The insights gleaned from our theoretical foundation and empirical investigations reveal that attributions containing lesser information about the decision function may yield superior results in ROAR benchmarks.
arXiv Detail & Related papers (2023-04-26T21:43:42Z) - STEERING: Stein Information Directed Exploration for Model-Based
Reinforcement Learning [111.75423966239092]
We propose an exploration incentive in terms of the integral probability metric (IPM) between a current estimate of the transition model and the unknown optimal.
Based on KSD, we develop a novel algorithm algo: textbfSTEin information dirtextbfEcted exploration for model-based textbfReinforcement LearntextbfING.
arXiv Detail & Related papers (2023-01-28T00:49:28Z) - Active Inference for Autonomous Decision-Making with Contextual
Multi-Armed Bandits [1.3670071336891754]
In autonomous robotic decision-making under uncertainty, the tradeoff between exploitation and exploration of available options must be considered.
In this study, we apply active inference, which has been actively studied in the field of neuroscience in recent years, as an alternative action selection strategy for CMABs.
arXiv Detail & Related papers (2022-09-19T17:11:21Z) - ReAct: Temporal Action Detection with Relational Queries [84.76646044604055]
This work aims at advancing temporal action detection (TAD) using an encoder-decoder framework with action queries.
We first propose a relational attention mechanism in the decoder, which guides the attention among queries based on their relations.
Lastly, we propose to predict the localization quality of each action query at inference in order to distinguish high-quality queries.
arXiv Detail & Related papers (2022-07-14T17:46:37Z) - Meta-Wrapper: Differentiable Wrapping Operator for User Interest
Selection in CTR Prediction [97.99938802797377]
Click-through rate (CTR) prediction, whose goal is to predict the probability of the user to click on an item, has become increasingly significant in recommender systems.
Recent deep learning models with the ability to automatically extract the user interest from his/her behaviors have achieved great success.
We propose a novel approach under the framework of the wrapper method, which is named Meta-Wrapper.
arXiv Detail & Related papers (2022-06-28T03:28:15Z) - Limitations of a proposed correction for slow drifts in decision
criterion [0.0]
We propose a model-based approach for disambiguating systematic updates from random drifts.
We show that this approach accurately recovers the latent trajectory of drifts in decision criterion.
Our results highlight the advantages of incorporating assumptions about the generative process directly into models of decision-making.
arXiv Detail & Related papers (2022-05-22T19:33:19Z) - Learning Bias-Invariant Representation by Cross-Sample Mutual
Information Minimization [77.8735802150511]
We propose a cross-sample adversarial debiasing (CSAD) method to remove the bias information misused by the target task.
The correlation measurement plays a critical role in adversarial debiasing and is conducted by a cross-sample neural mutual information estimator.
We conduct thorough experiments on publicly available datasets to validate the advantages of the proposed method over state-of-the-art approaches.
arXiv Detail & Related papers (2021-08-11T21:17:02Z) - Continuous Mean-Covariance Bandits [39.820490484375156]
We propose a novel Continuous Mean-Covariance Bandit model to take into account option correlation.
In CMCB, there is a learner who sequentially chooses weight vectors on given options and observes random feedback according to the decisions.
We propose novel algorithms with optimal regrets (within logarithmic factors) and provide matching lower bounds to validate their optimalities.
arXiv Detail & Related papers (2021-02-24T06:37:05Z) - BSN++: Complementary Boundary Regressor with Scale-Balanced Relation
Modeling for Temporal Action Proposal Generation [85.13713217986738]
We present BSN++, a new framework which exploits complementary boundary regressor and relation modeling for temporal proposal generation.
Not surprisingly, the proposed BSN++ ranked 1st place in the CVPR19 - ActivityNet challenge leaderboard on temporal action localization task.
arXiv Detail & Related papers (2020-09-15T07:08:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.