Fugu-MT 論文翻訳(概要): A two-step sequential approach for hyperparameter selection in finite context models

論文の概要: A two-step sequential approach for hyperparameter selection in finite context models

arxiv url: http://arxiv.org/abs/2603.19736v1
Date: Fri, 20 Mar 2026 08:20:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-23 19:48:39.050551
Title: A two-step sequential approach for hyperparameter selection in finite context models
Title（参考訳）: 有限文脈モデルにおけるハイパーパラメータ選択のための2段階逐次的アプローチ
Authors: José Contente, Ana Martins, Armando J. Pinho, Sónia Gouveia,
Abstract要約: 本稿では,FCMにおける効率的なハイパーパラメータ選択のための統計的2段階逐次手法を提案する。平均(シンボル当たりのビット)で網羅的なグリッド探索に匹敵する圧縮性能を実現する
参考スコア（独自算出の注目度）: 0.08155575318208629
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Finite-context models (FCMs) are widely used for compressing symbolic sequences such as DNA, where predictive performance depends critically on the context length k and smoothing parameter α. In practice, these hyperparameters are typically selected through exhaustive search, which is computationally expensive and scales poorly with model complexity. This paper proposes a statistically grounded two-step sequential approach for efficient hyperparameter selection in FCMs. The key idea is to decompose the joint optimization problem into two independent stages. First, the context length k is estimated using categorical serial dependence measures, including Cramér's ν, Cohen's \k{appa} and partial mutual information (pami). Second, the smoothing parameter α is estimated via maximum likelihood conditional on the selected context length k. Simulation experiments were conducted on synthetic symbolic sequences generated by FCMs across multiple (k, α) configurations, considering a four-letter alphabet and different sample sizes. Results show that the dependence measures are substantially more sensitive to variations in k than in α, supporting the sequential estimation strategy. As expected, the accuracy of the hyperparameter estimation improves with increasing sample size. Furthermore, the proposed method achieves compression performance comparable to exhaustive grid search in terms of average bitrate (bits per symbol), while substantially reducing computational cost. Overall, the results on simulated data show that the proposed sequential approach is a practical and computationally efficient alternative to exhaustive hyperparameter tuning in FCMs.
Abstract（参考訳）: 有限コンテキストモデル(FCM)は、文脈長kと平滑化パラメータαに重要な予測性能が依存するDNAのようなシンボリックシーケンスの圧縮に広く用いられている。実際には、これらのハイパーパラメータは通常、網羅的な探索によって選択される。本稿では,FCMにおける効率的なハイパーパラメータ選択のための統計的2段階逐次手法を提案する。鍵となる考え方は、共同最適化問題を2つの独立した段階に分解することである。まず、文脈長さ k はクラメールのν、コーエンの \k{appa} および部分的相互情報(パミ)を含むカテゴリー的直列依存度を用いて推定される。第2に、選択した文脈長さkの最大極大条件により、平滑化パラメータαを推定する。シミュレーション実験は、複数の(k, α)構成のFCMによって生成された合成記号列について、4文字のアルファベットと異なるサンプルサイズを考慮して行った。その結果, 依存度はαよりもkの変動にかなり敏感であり, 逐次推定戦略を支持することが明らかとなった。予想通り, 試料径の増大に伴い, ハイパーパラメータ推定の精度が向上する。さらに,提案手法は,平均ビットレート(シンボル毎のビット)で網羅的グリッド探索に匹敵する圧縮性能を実現し,計算コストを大幅に削減する。シミュレーションデータから得られた結果から,提案手法はFCMにおける指数関数型ハイパーパラメータチューニングに代わる,実用的で効率的な手法であることが示唆された。

論文の概要: A two-step sequential approach for hyperparameter selection in finite context models

関連論文リスト