Sample-efficient proper PAC learning with approximate differential
privacy
- URL: http://arxiv.org/abs/2012.03893v1
- Date: Mon, 7 Dec 2020 18:17:46 GMT
- Title: Sample-efficient proper PAC learning with approximate differential
privacy
- Authors: Badih Ghazi, Noah Golowich, Ravi Kumar, Pasin Manurangsi
- Abstract summary: We prove that the sample complexity of properly learning a class of Littlestone dimension $d$ with approximate differential privacy is $tilde O(d6)$, ignoring privacy and accuracy parameters.
This result answers a question of Bun et al. (FOCS 2020) by improving upon their upper bound of $2O(d)$ on the sample complexity.
- Score: 51.09425023771246
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper we prove that the sample complexity of properly learning a
class of Littlestone dimension $d$ with approximate differential privacy is
$\tilde O(d^6)$, ignoring privacy and accuracy parameters. This result answers
a question of Bun et al. (FOCS 2020) by improving upon their upper bound of
$2^{O(d)}$ on the sample complexity. Prior to our work, finiteness of the
sample complexity for privately learning a class of finite Littlestone
dimension was only known for improper private learners, and the fact that our
learner is proper answers another question of Bun et al., which was also asked
by Bousquet et al. (NeurIPS 2020). Using machinery developed by Bousquet et
al., we then show that the sample complexity of sanitizing a binary hypothesis
class is at most polynomial in its Littlestone dimension and dual Littlestone
dimension. This implies that a class is sanitizable if and only if it has
finite Littlestone dimension. An important ingredient of our proofs is a new
property of binary hypothesis classes that we call irreducibility, which may be
of independent interest.
Related papers
- Private PAC Learning May be Harder than Online Learning [14.180842831990999]
We show that any concept class of Littlestone dimension $d$ can be privately PAC learned using $mathrmpoly(d)$ samples.
This raises the natural question of whether there might be a generic conversion from online learners to private PAC learners.
arXiv Detail & Related papers (2024-02-16T22:44:52Z) - Optimal Multi-Distribution Learning [88.3008613028333]
Multi-distribution learning seeks to learn a shared model that minimizes the worst-case risk across $k$ distinct data distributions.
We propose a novel algorithm that yields an varepsilon-optimal randomized hypothesis with a sample complexity on the order of (d+k)/varepsilon2.
arXiv Detail & Related papers (2023-12-08T16:06:29Z) - Applications of Littlestone dimension to query learning and to
compression [1.9336815376402723]
We extend the model of citeangluin 2017power for learning by equivalence queries with random counterexamples.
Second, we extend that model to infinite concept classes with an additional source of randomness.
Third, we give improved results on the relationship of Littlestone dimension to classes with extended $d$-compression schemes.
arXiv Detail & Related papers (2023-10-07T14:04:18Z) - Find a witness or shatter: the landscape of computable PAC learning [63.26015736148707]
This paper contributes to the study of CPAC learnability by solving three open questions from recent papers.
Firstly, we prove that every CPAC learnable class is contained in a class which is properly CPAC learnable with a sample complexity.
Secondly, we show that there exists a decidable class of hypothesis which is properly learnable, but only with uncomputably fast growing sample complexity.
arXiv Detail & Related papers (2023-02-06T02:52:36Z) - Comparative Learning: A Sample Complexity Theory for Two Hypothesis
Classes [5.194264506657145]
We introduce comparative learning as a combination of the realizable and agnostic settings in PAC learning.
Even when both $S$ and $B$ have infinite VC dimensions, comparative learning can still have a small sample complexity.
We show that the sample complexity of comparative learning is characterized by the mutual VC dimension $mathsfVC(S,B)$.
arXiv Detail & Related papers (2022-11-16T18:38:24Z) - Improved Inapproximability of VC Dimension and Littlestone's Dimension
via (Unbalanced) Biclique [28.57552551316786]
We give a simple reduction from Maximum (Unbalanced) Biclique problem to approximating VC Dimension and Littlestone's Dimension.
With this connection, we derive a range of hardness of approximation results and running time lower bounds.
arXiv Detail & Related papers (2022-11-02T19:23:42Z) - Learning versus Refutation in Noninteractive Local Differential Privacy [133.80204506727526]
We study two basic statistical tasks in non-interactive local differential privacy (LDP): learning and refutation.
Our main result is a complete characterization of the sample complexity of PAC learning for non-interactive LDP protocols.
arXiv Detail & Related papers (2022-10-26T03:19:24Z) - Sample Complexity Bounds for Robustly Learning Decision Lists against
Evasion Attacks [25.832511407411637]
A fundamental problem in adversarial machine learning is to quantify how much training data is needed in the presence of evasion attacks.
We work with probability distributions on the input data that satisfy a Lipschitz condition: nearby points have similar probability.
For every fixed $k$ the class of $k$-decision lists has sample complexity against a $log(n)$-bounded adversary.
arXiv Detail & Related papers (2022-05-12T14:40:18Z) - Near-tight closure bounds for Littlestone and threshold dimensions [59.245101116396555]
We study closure properties for the Littlestone and threshold dimensions of binary hypothesis classes.
Our upper bounds give an exponential (in $k$) improvement upon analogous bounds shown by Alon et al.
arXiv Detail & Related papers (2020-07-07T17:56:06Z) - Private Query Release Assisted by Public Data [96.6174729958211]
We study the problem of differentially private query release assisted by access to public data.
We show that we can solve the problem for any query class $mathcalH$ of finite VC-dimension using only $d/alpha$ public samples and $sqrtpd3/2/alpha2$ private samples.
arXiv Detail & Related papers (2020-04-23T02:46:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.