Approximation of Nonlinear Functionals Using Deep ReLU Networks
- URL: http://arxiv.org/abs/2304.04443v1
- Date: Mon, 10 Apr 2023 08:10:11 GMT
- Title: Approximation of Nonlinear Functionals Using Deep ReLU Networks
- Authors: Linhao Song, Jun Fan, Di-Rong Chen and Ding-Xuan Zhou
- Abstract summary: We investigate the approximation power of functional deep neural networks associated with the rectified linear unit (ReLU) activation function.
In addition, we establish rates of approximation of the proposed functional deep ReLU networks under mild regularity conditions.
- Score: 7.876115370275732
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recent years, functional neural networks have been proposed and studied in
order to approximate nonlinear continuous functionals defined on $L^p([-1,
1]^s)$ for integers $s\ge1$ and $1\le p<\infty$. However, their theoretical
properties are largely unknown beyond universality of approximation or the
existing analysis does not apply to the rectified linear unit (ReLU) activation
function. To fill in this void, we investigate here the approximation power of
functional deep neural networks associated with the ReLU activation function by
constructing a continuous piecewise linear interpolation under a simple
triangulation. In addition, we establish rates of approximation of the proposed
functional deep ReLU networks under mild regularity conditions. Finally, our
study may also shed some light on the understanding of functional data learning
algorithms.
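As a rough, hedged illustration of the setup described above (not the paper's exact construction), the Python sketch below discretizes an input function on a uniform grid of $[-1,1]^s$ and feeds the resulting vector through a deep fully connected ReLU network; the helper names `discretize` and `FunctionalReLUNet`, the grid size, and the architecture are illustrative assumptions, and the weights are left untrained.

```python
import numpy as np

# Hedged sketch, not the paper's exact construction: approximate a nonlinear
# functional F on L^p([-1,1]^s) by (1) discretizing the input function into
# samples on a uniform grid and (2) passing that vector through a deep ReLU
# network. `discretize` and `FunctionalReLUNet` are illustrative names.

def discretize(f, s=1, n=32):
    """Sample f on a uniform n^s grid of [-1,1]^s and flatten to a vector."""
    axes = [np.linspace(-1.0, 1.0, n)] * s
    grid = np.meshgrid(*axes, indexing="ij")
    return f(*grid).reshape(-1)

class FunctionalReLUNet:
    """A plain fully connected deep ReLU network acting on the discretization."""

    def __init__(self, widths, seed=0):
        rng = np.random.default_rng(seed)
        self.weights = [rng.normal(0.0, np.sqrt(2.0 / m), size=(m, k))
                        for m, k in zip(widths[:-1], widths[1:])]
        self.biases = [np.zeros(k) for k in widths[1:]]

    def __call__(self, v):
        for W, b in zip(self.weights[:-1], self.biases[:-1]):
            v = np.maximum(v @ W + b, 0.0)              # ReLU hidden layers
        return v @ self.weights[-1] + self.biases[-1]   # linear output layer

# Untrained example: evaluate the network on the discretization of f(x) = x^2.
# In practice the weights would be fitted to pairs (f_i, F(f_i)).
v = discretize(lambda x: x ** 2, s=1, n=32)
net = FunctionalReLUNet([v.size, 64, 64, 1], seed=0)
print(net(v))
```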
Related papers
- ReLU neural network approximation to piecewise constant functions [3.5928501649873326]
We show that a three-layer ReLU NN is sufficient to accurately approximate any piecewise constant function.
If the discontinuity interface is convex, an analytical formula of the ReLU NN approximation with exact weights and biases is provided.
arXiv Detail & Related papers (2024-10-21T20:58:34Z) - Spherical Analysis of Learning Nonlinear Functionals [10.785977740158193]
In this paper, we consider functionals defined on sets of functions on spheres.
The approximation ability of deep ReLU neural networks is investigated using an encoder-decoder framework.
arXiv Detail & Related papers (2024-10-01T20:10:00Z) - A Mean-Field Analysis of Neural Stochastic Gradient Descent-Ascent for Functional Minimax Optimization [90.87444114491116]
This paper studies minimax optimization problems defined over infinite-dimensional function classes of overparameterized two-layer neural networks.
We address (i) the convergence of the gradient descent-ascent algorithm and (ii) the representation learning of the neural networks.
Results show that the feature representation induced by the neural networks is allowed to deviate from the initial one by the magnitude of $O(\alpha^{-1})$, measured in terms of the Wasserstein distance.
arXiv Detail & Related papers (2024-04-18T16:46:08Z) - Approximation of RKHS Functionals by Neural Networks [30.42446856477086]
We study the approximation of functionals on kernel reproducing Hilbert spaces (RKHS's) using neural networks.
We derive explicit error bounds for those induced by inverse multiquadric, Gaussian, and Sobolev kernels.
We apply our findings to functional regression, proving that neural networks can accurately approximate the regression maps.
arXiv Detail & Related papers (2024-03-18T18:58:23Z) - Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning [53.97335841137496]
We propose an oracle-efficient algorithm, dubbed Pessimistic Nonlinear Least-Squares Value Iteration (PNLSVI), for offline RL with non-linear function approximation.
Our algorithm enjoys a regret bound that has a tight dependency on the function class complexity and achieves minimax optimal instance-dependent regret when specialized to linear function approximation.
arXiv Detail & Related papers (2023-10-02T17:42:01Z) - Approximation and interpolation of deep neural networks [0.0]
In the overparametrized regime, deep neural networks provide universal approximation and can interpolate any data set.
In the last section, we provide a practical probabilistic method of finding such an interpolation point under general conditions on the activation function.
arXiv Detail & Related papers (2023-04-20T08:45:16Z) - Optimal Approximation Complexity of High-Dimensional Functions with
Neural Networks [3.222802562733787]
We investigate properties of neural networks that use both ReLU and $x^2$ as activation functions (a minimal sketch of the square activation's role appears after this list).
We show how to leverage low local dimensionality in some contexts to overcome the curse of dimensionality, obtaining approximation rates that are optimal for unknown lower-dimensional subspaces.
arXiv Detail & Related papers (2023-01-30T17:29:19Z) - Exploring Linear Feature Disentanglement For Neural Networks [63.20827189693117]
Non-linear activation functions, e.g., Sigmoid, ReLU, and Tanh, have achieved great success in neural networks (NNs).
Due to the complex non-linear characteristics of samples, the objective of those activation functions is to project samples from their original feature space to a linearly separable feature space.
This phenomenon ignites our interest in exploring whether all features need to be transformed by all non-linear functions in current typical NNs.
arXiv Detail & Related papers (2022-03-22T13:09:17Z) - Going Beyond Linear RL: Sample Efficient Neural Function Approximation [76.57464214864756]
We study function approximation with two-layer neural networks.
Our results significantly improve upon what can be attained with linear (or eluder dimension) methods.
arXiv Detail & Related papers (2021-07-14T03:03:56Z) - Deep neural network approximation of analytic functions [91.3755431537592]
An entropy bound is provided for the spaces of neural networks with piecewise linear activation functions.
We derive an oracle inequality for the expected error of the considered penalized deep neural network estimators.
arXiv Detail & Related papers (2021-04-05T18:02:04Z) - Measuring Model Complexity of Neural Networks with Curve Activation Functions [100.98319505253797]
We propose the linear approximation neural network (LANN) to approximate a given deep model with curve activation function.
We experimentally explore the training process of neural networks and detect overfitting.
We find that the $L^1$ and $L^2$ regularizations suppress the increase of model complexity.
arXiv Detail & Related papers (2020-06-16T07:38:06Z)
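Referring back to the "Optimal Approximation Complexity of High-Dimensional Functions with Neural Networks" entry above: a standard observation (not stated in that abstract snippet itself) is that the $x^2$ activation realizes exact multiplication via the polarization identity $xy = ((x+y)^2 - x^2 - y^2)/2$, which is one reason combining it with ReLU units is powerful. The sketch below is a minimal illustration with hypothetical helper names.

```python
import numpy as np

def relu(x):
    """Piecewise linear unit, as in standard ReLU networks."""
    return np.maximum(x, 0.0)

def square(x):
    """The x^2 activation considered alongside ReLU."""
    return x ** 2

def product_unit(x, y):
    """Exact multiplication from three square units via polarization:
    x*y = ((x + y)^2 - x^2 - y^2) / 2."""
    return 0.5 * (square(x + y) - square(x) - square(y))

# Sanity check: the identity holds exactly (up to floating point).
x, y = 1.3, -0.7
assert np.isclose(product_unit(x, y), x * y)
print(product_unit(x, y), relu(x - y))
```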
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of the information presented and is not responsible for any consequences arising from its use.