Online Learning of Smooth Functions
- URL: http://arxiv.org/abs/2301.01434v1
- Date: Wed, 4 Jan 2023 04:05:58 GMT
- Title: Online Learning of Smooth Functions
- Authors: Jesse Geneson and Ethan Zhou
- Abstract summary: We study the online learning of real-valued functions where the hidden function is known to have certain smoothness properties.
We find new bounds for $textopt_p(mathcal F_q)$ that are sharp up to a constant factor.
In the multi-variable setup, we establish inequalities relating $textopt_p(mathcal F_q,d)$ to $textopt_p(mathcal F_q,d)$ and show that $textopt_p(mathcal F
- Score: 0.35534933448684125
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we study the online learning of real-valued functions where
the hidden function is known to have certain smoothness properties.
Specifically, for $q \ge 1$, let $\mathcal F_q$ be the class of absolutely
continuous functions $f: [0,1] \to \mathbb R$ such that $\|f'\|_q \le 1$. For
$q \ge 1$ and $d \in \mathbb Z^+$, let $\mathcal F_{q,d}$ be the class of
functions $f: [0,1]^d \to \mathbb R$ such that any function $g: [0,1] \to
\mathbb R$ formed by fixing all but one parameter of $f$ is in $\mathcal F_q$.
For any class of real-valued functions $\mathcal F$ and $p>0$, let
$\text{opt}_p(\mathcal F)$ be the best upper bound on the sum of
$p^{\text{th}}$ powers of absolute prediction errors that a learner can
guarantee in the worst case. In the single-variable setup, we find new bounds
for $\text{opt}_p(\mathcal F_q)$ that are sharp up to a constant factor. We
show for all $\varepsilon \in (0, 1)$ that
$\text{opt}_{1+\varepsilon}(\mathcal{F}_{\infty}) =
\Theta(\varepsilon^{-\frac{1}{2}})$ and
$\text{opt}_{1+\varepsilon}(\mathcal{F}_q) =
\Theta(\varepsilon^{-\frac{1}{2}})$ for all $q \ge 2$. We also show for
$\varepsilon \in (0,1)$ that $\text{opt}_2(\mathcal
F_{1+\varepsilon})=\Theta(\varepsilon^{-1})$. In addition, we obtain new exact
results by proving that $\text{opt}_p(\mathcal F_q)=1$ for $q \in (1,2)$ and $p
\ge 2+\frac{1}{q-1}$. In the multi-variable setup, we establish inequalities
relating $\text{opt}_p(\mathcal F_{q,d})$ to $\text{opt}_p(\mathcal F_q)$ and
show that $\text{opt}_p(\mathcal F_{\infty,d})$ is infinite when $p<d$ and
finite when $p>d$. We also obtain sharp bounds on learning $\mathcal
F_{\infty,d}$ for $p < d$ when the number of trials is bounded.
Related papers
- Efficient Continual Finite-Sum Minimization [52.5238287567572]
We propose a key twist into the finite-sum minimization, dubbed as continual finite-sum minimization.
Our approach significantly improves upon the $mathcalO(n/epsilon)$ FOs that $mathrmStochasticGradientDescent$ requires.
We also prove that there is no natural first-order method with $mathcalOleft(n/epsilonalpharight)$ complexity gradient for $alpha 1/4$, establishing that the first-order complexity of our method is nearly tight.
arXiv Detail & Related papers (2024-06-07T08:26:31Z) - Coresets for Multiple $\ell_p$ Regression [47.790126028106734]
We construct coresets of size $tilde O(varepsilon-2d)$ for $p2$ and $tilde O(varepsilon-pdp/2)$ for $p>2$.
For $1p2$, every matrix has a subset of $tilde O(varepsilon-1k)$ rows which spans a $(varepsilon-1k)$-approximately optimal $k$-dimensional subspace for $ell_p$ subspace approximation
arXiv Detail & Related papers (2024-06-04T15:50:42Z) - Fast $(1+\varepsilon)$-Approximation Algorithms for Binary Matrix
Factorization [54.29685789885059]
We introduce efficient $(1+varepsilon)$-approximation algorithms for the binary matrix factorization (BMF) problem.
The goal is to approximate $mathbfA$ as a product of low-rank factors.
Our techniques generalize to other common variants of the BMF problem.
arXiv Detail & Related papers (2023-06-02T18:55:27Z) - $L^p$ sampling numbers for the Fourier-analytic Barron space [0.0]
We study functions that can be written as [ f(x) = int_mathbbRd F(xi), e2 pi i langle x, xi rangle, d xi quad textwith quad int_mathbbRd |F(xi)| cdot (1 + |xi|)sigma, d xi infty.
For $
arXiv Detail & Related papers (2022-08-16T08:41:48Z) - On Outer Bi-Lipschitz Extensions of Linear Johnson-Lindenstrauss
Embeddings of Low-Dimensional Submanifolds of $\mathbb{R}^N$ [0.24366811507669117]
Let $mathcalM$ be a compact $d$-dimensional submanifold of $mathbbRN$ with reach $tau$ and volume $V_mathcal M$.
We prove that a nonlinear function $f: mathbbRN rightarrow mathbbRmm exists with $m leq C left(d / epsilon2right) log left(fracsqrt[d]V_math
arXiv Detail & Related papers (2022-06-07T15:10:46Z) - Low-Rank Approximation with $1/\epsilon^{1/3}$ Matrix-Vector Products [58.05771390012827]
We study iterative methods based on Krylov subspaces for low-rank approximation under any Schatten-$p$ norm.
Our main result is an algorithm that uses only $tildeO(k/sqrtepsilon)$ matrix-vector products.
arXiv Detail & Related papers (2022-02-10T16:10:41Z) - Learning low-degree functions from a logarithmic number of random
queries [77.34726150561087]
We prove that for any integer $ninmathbbN$, $din1,ldots,n$ and any $varepsilon,deltain(0,1)$, a bounded function $f:-1,1nto[-1,1]$ of degree at most $d$ can be learned.
arXiv Detail & Related papers (2021-09-21T13:19:04Z) - Sharper bounds for online learning of smooth functions of a single
variable [0.0]
We show that $opt_1+epsilon(mathcalF_q) = Theta(epsilon-frac12)$, where the constants in the bound do not depend on $q$.
We also show that $opt_1+epsilon(mathcalF_q) = Theta(epsilon-frac12)$, where the constants in the bound do not depend on $q$.
arXiv Detail & Related papers (2021-05-30T23:06:21Z) - Linear Bandits on Uniformly Convex Sets [88.3673525964507]
Linear bandit algorithms yield $tildemathcalO(nsqrtT)$ pseudo-regret bounds on compact convex action sets.
Two types of structural assumptions lead to better pseudo-regret bounds.
arXiv Detail & Related papers (2021-03-10T07:33:03Z) - Some convergent results for Backtracking Gradient Descent method on
Banach spaces [0.0]
bf Theorem. Let $X$ be a Banach space and $f:Xrightarrow mathbbR$ be a $C2$ function.
Let $mathcalC$ be the set of critical points of $f$.
arXiv Detail & Related papers (2020-01-16T12:49:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.