Optimal and Private Learning from Human Response Data
- URL: http://arxiv.org/abs/2303.06234v2
- Date: Sat, 11 Nov 2023 00:16:00 GMT
- Title: Optimal and Private Learning from Human Response Data
- Authors: Duc Nguyen and Anderson Y. Zhang
- Abstract summary: Recently, Nguyen and Zhang proposed a new spectral estimation algorithm that is efficient and accurate.
We extend their results in two important ways.
We develop a private extension of the spectral algorithm, leveraging its unique Markov chain formulation and the discrete Gaussian mechanism.
- Score: 13.869502085838452
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Item response theory (IRT) is the study of how people make probabilistic
decisions, with diverse applications in education testing, recommendation
systems, among others. The Rasch model of binary response data, one of the most
fundamental models in IRT, remains an active area of research with important
practical significance. Recently, Nguyen and Zhang (2022) proposed a new
spectral estimation algorithm that is efficient and accurate. In this work, we
extend their results in two important ways. Firstly, we obtain a refined
entrywise error bound for the spectral algorithm, complementing the `average
error' $\ell_2$ bound in their work. Notably, under mild sampling conditions,
the spectral algorithm achieves the minimax optimal error bound (modulo a log
factor). Building on the refined analysis, we also show that the spectral
algorithm enjoys optimal sample complexity for top-$K$ recovery (e.g.,
identifying the best $K$ items from approval/disapproval response data),
explaining the empirical findings in the previous work. Our second contribution
addresses an important but understudied topic in IRT: privacy. Despite the
human-centric applications of IRT, there has not been any proposed
privacy-preserving mechanism in the literature. We develop a private extension
of the spectral algorithm, leveraging its unique Markov chain formulation and
the discrete Gaussian mechanism (Canonne et al., 2020). Experiments show that
our approach is significantly more accurate than the baselines in the
low-to-moderate privacy regime.
Related papers
- Discretize Relaxed Solution of Spectral Clustering via a Non-Heuristic
Algorithm [77.53604156112144]
We develop a first-order term to bridge the original problem and discretization algorithm.
Since the non-heuristic method is aware of the original graph cut problem, the final discrete solution is more reliable.
arXiv Detail & Related papers (2023-10-19T13:57:38Z) - Efficient Model-Free Exploration in Low-Rank MDPs [76.87340323826945]
Low-Rank Markov Decision Processes offer a simple, yet expressive framework for RL with function approximation.
Existing algorithms are either (1) computationally intractable, or (2) reliant upon restrictive statistical assumptions.
We propose the first provably sample-efficient algorithm for exploration in Low-Rank MDPs.
arXiv Detail & Related papers (2023-07-08T15:41:48Z) - High-dimensional Contextual Bandit Problem without Sparsity [8.782204980889077]
We propose an explore-then-commit (EtC) algorithm to address this problem and examine its performance.
We derive the optimal rate of the ETC algorithm in terms of $T$ and show that this rate can be achieved by balancing exploration and exploitation.
We introduce an adaptive explore-then-commit (AEtC) algorithm that adaptively finds the optimal balance.
arXiv Detail & Related papers (2023-06-19T15:29:32Z) - Representation Learning with Multi-Step Inverse Kinematics: An Efficient
and Optimal Approach to Rich-Observation RL [106.82295532402335]
Existing reinforcement learning algorithms suffer from computational intractability, strong statistical assumptions, and suboptimal sample complexity.
We provide the first computationally efficient algorithm that attains rate-optimal sample complexity with respect to the desired accuracy level.
Our algorithm, MusIK, combines systematic exploration with representation learning based on multi-step inverse kinematics.
arXiv Detail & Related papers (2023-04-12T14:51:47Z) - GEC: A Unified Framework for Interactive Decision Making in MDP, POMDP,
and Beyond [101.5329678997916]
We study sample efficient reinforcement learning (RL) under the general framework of interactive decision making.
We propose a novel complexity measure, generalized eluder coefficient (GEC), which characterizes the fundamental tradeoff between exploration and exploitation.
We show that RL problems with low GEC form a remarkably rich class, which subsumes low Bellman eluder dimension problems, bilinear class, low witness rank problems, PO-bilinear class, and generalized regular PSR.
arXiv Detail & Related papers (2022-11-03T16:42:40Z) - Human-in-the-loop: Provably Efficient Preference-based Reinforcement
Learning with General Function Approximation [107.54516740713969]
We study human-in-the-loop reinforcement learning (RL) with trajectory preferences.
Instead of receiving a numeric reward at each step, the agent only receives preferences over trajectory pairs from a human overseer.
We propose the first optimistic model-based algorithm for PbRL with general function approximation.
arXiv Detail & Related papers (2022-05-23T09:03:24Z) - Understanding the Generalization Performance of Spectral Clustering
Algorithms [11.025579607812167]
We study the excess risk bounds of the popular spectral clustering algorithms: emphrelaxed RatioCut and emphrelaxed NCut.
We propose two novel algorithms that can not only penalize this quantity, but also cluster the out-of-sample data without re-eigendecomposition on the overall sample.
arXiv Detail & Related papers (2022-04-30T14:21:56Z) - Iterative Methods for Private Synthetic Data: Unifying Framework and New
Methods [18.317488965846636]
We study private synthetic data generation for query release.
The goal is to construct a sanitized version of a sensitive dataset subject to differential privacy.
Under this framework, we propose two new methods.
arXiv Detail & Related papers (2021-06-14T04:19:35Z) - Spectral Methods for Data Science: A Statistical Perspective [37.2486912080998]
Spectral methods have emerged as a simple yet surprisingly effective approach for extracting information from massive, noisy and incomplete data.
This book aims to present a systematic, comprehensive, yet accessible introduction to spectral methods from a modern statistical perspective.
arXiv Detail & Related papers (2020-12-15T18:40:56Z) - Adaptive Sampling for Best Policy Identification in Markov Decision
Processes [79.4957965474334]
We investigate the problem of best-policy identification in discounted Markov Decision (MDPs) when the learner has access to a generative model.
The advantages of state-of-the-art algorithms are discussed and illustrated.
arXiv Detail & Related papers (2020-09-28T15:22:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.