Paper overview: Vocabulary for Universal Approximation: A Linguistic Perspective of Mapping Compositions
- arxiv url: http://arxiv.org/abs/2305.12205v2
- Date: Thu, 23 May 2024 08:38:10 GMT
- Title: Vocabulary for Universal Approximation: A Linguistic Perspective of Mapping Compositions
- Title (reference translation): Vocabulary for Universal Approximation: A Linguistic Perspective on Mapping Compositions
- Authors: Yongqiang Cai
- Abstract summary: The paper constructively proves that a finite vocabulary $V=\{\phi_i: \mathbb{R}^d \to \mathbb{R}^d | i=1,...,n\}$ with $n=O(d^2)$ suffices for universal approximation: compositions of mappings drawn from $V$ can approximate any continuous mapping $f: \mathbb{R}^d \to \mathbb{R}^d$ on a compact domain to within any error $\varepsilon>0$.
- Abstract: In recent years, deep learning-based sequence modelings, such as language models, have received much attention and success, which pushes researchers to explore the possibility of transforming non-sequential problems into a sequential form. Following this thought, deep neural networks can be represented as composite functions of a sequence of mappings, linear or nonlinear, where each composition can be viewed as a \emph{word}. However, the weights of linear mappings are undetermined and hence require an infinite number of words. In this article, we investigate the finite case and constructively prove the existence of a finite \emph{vocabulary} $V=\{\phi_i: \mathbb{R}^d \to \mathbb{R}^d | i=1,...,n\}$ with $n=O(d^2)$ for the universal approximation. That is, for any continuous mapping $f: \mathbb{R}^d \to \mathbb{R}^d$, compact domain $\Omega$ and $\varepsilon>0$, there is a sequence of mappings $\phi_{i_1}, ..., \phi_{i_m} \in V, m \in \mathbb{Z}_+$, such that the composition $\phi_{i_m} \circ ... \circ \phi_{i_1} $ approximates $f$ on $\Omega$ with an error less than $\varepsilon$. Our results demonstrate an unusual approximation power of mapping compositions and motivate a novel compositional model for regular languages.
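- Abstract (reference translation): In recent years, deep learning-based sequence modeling, such as language models, has attracted much attention and achieved considerable success.
Following this line of thought, a deep neural network can be represented as a composite function of a sequence of mappings, linear or nonlinear, where each composition can be viewed as a \emph{word}.
This article investigates the finite case and constructively proves the existence of a finite \emph{vocabulary} $V=\{\phi_i: \mathbb{R}^d \to \mathbb{R}^d | i=1,...,n\}$ with $n=O(d^2)$ for universal approximation.
That is, for any continuous mapping $f: \mathbb{R}^d \to \mathbb{R}^d$, compact domain $\Omega$, and $\varepsilon>0$, there is a sequence of mappings $\phi_{i_1}, ..., \phi_{i_m} \in V, m \in \mathbb{Z}_+$, such that the composition $\phi_{i_m} \circ ... \circ \phi_{i_1}$ approximates $f$ on $\Omega$ with an error less than $\varepsilon$.
The results demonstrate an unusual approximation power of mapping compositions and motivate a novel compositional model for regular languages.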
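
The idea of a "sentence" of mappings can be made concrete with a small sketch. The Python code below is only an illustration of the notation $\phi_{i_m} \circ ... \circ \phi_{i_1}$: a finite vocabulary is a dictionary of maps on $\mathbb{R}^d$, and a sentence is a list of word names applied left to right. The names `compose_sentence`, `sup_error`, and `TOY_VOCABULARY`, as well as the four toy maps on $\mathbb{R}^2$, are assumptions made for this example. They are not the vocabulary constructed in the paper, and this toy set is not claimed to be a universal approximator.

```python
from typing import Callable, Dict, List, Sequence

Vector = List[float]
Mapping = Callable[[Vector], Vector]


def compose_sentence(vocabulary: Dict[str, Mapping],
                     sentence: Sequence[str]) -> Mapping:
    """Return phi_{i_m} o ... o phi_{i_1} for sentence = [i_1, ..., i_m].

    The first word in the sentence is applied first, matching the
    composition order used in the abstract.
    """
    def composed(x: Vector) -> Vector:
        y = list(x)
        for word in sentence:
            y = vocabulary[word](y)
        return y
    return composed


def sup_error(f: Mapping, g: Mapping, points: Sequence[Vector]) -> float:
    """Largest coordinate-wise gap between f and g over sample points.

    This is a sampled stand-in for the uniform error on a compact domain.
    """
    return max(
        max(abs(a - b) for a, b in zip(f(p), g(p)))
        for p in points
    )


# Hypothetical toy vocabulary on R^2 (an assumption for illustration only).
TOY_VOCABULARY: Dict[str, Mapping] = {
    "shift": lambda x: [x[0] + 1.0, x[1]],
    "swap": lambda x: [x[1], x[0]],
    "relu": lambda x: [max(0.0, v) for v in x],
    "halve": lambda x: [0.5 * v for v in x],
}

if __name__ == "__main__":
    sentence = ["shift", "swap", "halve"]
    phi = compose_sentence(TOY_VOCABULARY, sentence)

    # A target that this particular sentence happens to reproduce exactly.
    def target(x: Vector) -> Vector:
        return [0.5 * x[1], 0.5 * (x[0] + 1.0)]

    # Sample grid on the compact domain [-1, 1]^2.
    grid = [[i / 4.0, j / 4.0] for i in range(-4, 5) for j in range(-4, 5)]
    print(phi([1.0, -2.0]))
    print(sup_error(phi, target, grid))
```

In this picture, the paper's result says that a suitably chosen vocabulary of size $n=O(d^2)$ admits, for every continuous target, compact domain, and tolerance $\varepsilon$, some finite sentence whose error is below $\varepsilon$. The sketch only shows how a sentence is evaluated and how its error could be measured on sample points.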
