Fugu-MT 論文翻訳(概要): Fast Exact Nearest-Neighbor Learning for High-Frequency Financial Time Series

論文の概要: Fast Exact Nearest-Neighbor Learning for High-Frequency Financial Time Series

arxiv url: http://arxiv.org/abs/2606.10219v1
Date: Mon, 08 Jun 2026 22:17:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-10 15:40:58.214146
Title: Fast Exact Nearest-Neighbor Learning for High-Frequency Financial Time Series
Title（参考訳）: 高速近接学習による高頻度金融時系列の学習
Authors: Henry Han, Diane Li,
Abstract要約: 市場データ量が株式、FX、オプション、高周波トレーディングストリームにまたがって急増する中、大規模なAI効率は金融において重要になりつつある。この成長は、成熟した金融AIシステムにとって、中核的な課題を生み出す。モデルは、取引、リスク管理、デリバティブ価格のリアルタイム遅延制約を満たしながら、より大きな歴史的コーパスから学ぶ必要がある。私たちは、Mojoベースの金融AIがこの課題に対処できることを示す具体的なケーススタディとして、高周波金融時系列の正確な隣り合う学習を使用します。
参考スコア（独自算出の注目度）: 3.158346511479111
License: http://creativecommons.org/licenses/by/4.0/
Abstract: AI efficiency at scale is becoming critical in finance as market data volumes surge across equities, ETFs, FX, options, and high-frequency trading streams. This growth creates a core challenge for mature financial AI systems: models must learn from larger historical corpora while still meeting real-time latency constraints in trading, risk management, and derivative pricing. We use exact nearest-neighbor learning for high-frequency financial time series as a concrete case study to show that Mojo-based financial AI can address this challenge. We introduce a Mojo SIMD k-d tree with variance-based splitting, contiguous flat-buffer storage, and compile-time vectorized distance computation. We also provide a runtime result showing that, under standard pruning and implementation-cost assumptions, the Mojo SIMD k-d tree asymptotically dominates Mojo SIMD brute force and scikit-learn's k-d tree in the fixed-stock, large-$n$, moderate-dimensional regime. Empirically, across eight financial datasets on x86 and ARM64 with up to 277K training samples, the method achieves 17.5--21.6$\times$ speedup over scikit-learn's k-d tree on x86 and 28.1--43.5$\times$ over scikit-learn brute force on ARM64 equity/ETF datasets, while preserving exact outputs. Beyond nearest-neighbor inference, Mojo's compiled execution enables an Extra Trees-based implied-volatility pricing model to train on $10\times$ more options data, reducing put-IV RMSE by 8.0\%. These results position Mojo as a scalable, production-ready stack for financial AI and a promising foundation for efficient AI in other data-intensive fields. \keywords{Financial AI \and AI Efficiency \and Mojo \and SIMD \and K-D Trees \and KNN \and High-Frequency Trading \and Financial Time Series \and Scaling}
Abstract（参考訳）: 市場データボリュームが株式、ETF、FX、オプション、高周波トレーディングストリームにわたって急増する中、大規模なAI効率は金融業界で重要になりつつある。この成長は、成熟した金融AIシステムにとって、中核的な課題を生み出す。モデルは、取引、リスク管理、デリバティブ価格のリアルタイム遅延制約を満たしながら、より大きな歴史的コーパスから学ぶ必要がある。私たちは、Mojoベースの金融AIがこの課題に対処できることを示す具体的なケーススタディとして、高周波金融時系列の正確な隣り合う学習を使用します。分散に基づく分割、連続したフラットバッファストレージ、コンパイル時ベクトル化距離計算を備えたMojo SIMD k-d木を提案する。また,標準プルーニングおよび実装コストの仮定の下では,モジョSIMD k-d木がモジョSIMDブルート力とシキトラーンのk-d木を漸近的に支配することを示す。経験的には、x86とARM64上の8つの金融データセットのうち最大277Kのトレーニングサンプルで17.5--21.6$\times$x86上のシキト・ラーンのk-dツリーのスピードアップと、ARM64のエクイティ/ETFデータセット上のシキト・ラーンのブルート力に対する28.1--43.5$\times$を達成した。最寄りの推論の他に、Mojoのコンパイルされた実行では、Extra Treesベースのインプリード・ボラティリティの価格モデルで、より多くのオプションデータを10\times$でトレーニングすることが可能で、 put-IV RMSEを8.0\%削減できる。これらの結果は、Mojoを金融AIのためのスケーラブルでプロダクション対応のスタックとして位置づけ、他のデータ集約分野における効率的なAIの基礎として有望である。 \keywords{Financial AI \and AI efficiency \and Mojo \and SIMD \and K-D Trees \and KNN \and High-Frequency Trading \and Financial Time Series \and Scaling}

論文の概要: Fast Exact Nearest-Neighbor Learning for High-Frequency Financial Time Series

関連論文リスト