Fugu-MT 論文翻訳(概要): Athena: Synergizing Data Prefetching and Off-Chip Prediction via Online Reinforcement Learning

論文の概要: Athena: Synergizing Data Prefetching and Off-Chip Prediction via Online Reinforcement Learning

arxiv url: http://arxiv.org/abs/2601.17615v1
Date: Sat, 24 Jan 2026 22:39:07 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-27 15:23:08.09261
Title: Athena: Synergizing Data Prefetching and Off-Chip Prediction via Online Reinforcement Learning
Title（参考訳）: Athena: オンライン強化学習によるデータのプレフェッチとオフチップ予測の同期化
Authors: Rahul Bera, Zhenrong Lang, Caroline Hengartner, Konstantinos Kanellopoulos, Rakesh Kumar, Mohammad Sadrosadati, Onur Mutlu,
Abstract要約: プリフェッチとオフチップ予測は、高性能プロセッサで長いメモリアクセスレイテンシを隠すための2つの手法である。我々のゴールは、オフチップ予測器を様々なキャッシュレベルで採用した複数のプリフェッチと協調させることを自律的に学習できる総合的なフレームワークを設計することである。我々は,プレフェッチとオフチップ予測器(OCP)の協調を強化学習(RL)問題としてモデル化する,Athenaと呼ばれる新しい手法を提案する。
参考スコア（独自算出の注目度）: 7.760107607264796
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Prefetching and off-chip prediction are two techniques proposed to hide long memory access latencies in high-performance processors. In this work, we demonstrate that: (1) prefetching and off-chip prediction often provide complementary performance benefits, yet (2) naively combining them often fails to realize their full performance potential, and (3) existing prefetcher control policies leave significant room for performance improvement behind. Our goal is to design a holistic framework that can autonomously learn to coordinate an off-chip predictor with multiple prefetchers employed at various cache levels. To this end, we propose a new technique called Athena, which models the coordination between prefetchers and off-chip predictor (OCP) as a reinforcement learning (RL) problem. Athena acts as the RL agent that observes multiple system-level features (e.g., prefetcher/OCP accuracy, bandwidth usage) over an epoch of program execution, and uses them as state information to select a coordination action (i.e., enabling the prefetcher and/or OCP, and adjusting prefetcher aggressiveness). At the end of every epoch, Athena receives a numerical reward that measures the change in multiple system-level metrics (e.g., number of cycles taken to execute an epoch). Athena uses this reward to autonomously and continuously learn a policy to coordinate prefetchers with OCP. Our extensive evaluation using a diverse set of memory-intensive workloads shows that Athena consistently outperforms prior state-of-the-art coordination policies across a wide range of system configurations with various combinations of underlying prefetchers, OCPs, and main memory bandwidths, while incurring only modest storage overhead. Athena is freely available at https://github.com/CMU-SAFARI/Athena.
Abstract（参考訳）: プリフェッチとオフチップ予測は、高性能プロセッサで長いメモリアクセスレイテンシを隠すための2つの手法である。本研究では,(1)プリフェッチとオフチップ予測は相補的な性能向上をもたらすことが多いが,(2)ネーティブな組み合わせは完全な性能向上を実現するのに失敗することが多く,(3)既存のプリフェッチャーコントロールポリシーは,性能改善のための大きな余地を残していることを実証する。我々のゴールは、オフチップ予測器を様々なキャッシュレベルで採用した複数のプリフェッチ器と協調させることを自律的に学習できる全体論的なフレームワークを設計することである。そこで本研究では,プレフェッチとオフチップ予測器(OCP)の協調を強化学習(RL)問題としてモデル化する,Athenaと呼ばれる新しい手法を提案する。アテナは、プログラム実行のエポック上で複数のシステムレベルの特徴(例えば、プレフェッチャー/OCP精度、帯域幅使用量)を観察するRLエージェントとして機能し、それらを状態情報として、調整アクション(すなわち、プレフェッチャーおよび/またはOCPを有効にし、プレフェッチャー攻撃性を調整する)を選択する。各エポックの終わりに、Athenaは複数のシステムレベルのメトリクス(例えば、エポックを実行するのに要するサイクルの数)の変化を測定する数値的な報酬を受け取る。 Athenaはこの報酬を自律的かつ継続的な学習に利用し、プレフェッチをOCPと協調させるポリシーを学習する。多様なメモリ集約型ワークロードを使用した大規模な評価では、Athenaは、基礎となるプリフェッチ、OCP、メインメモリ帯域幅の様々な組み合わせで、幅広いシステム構成における、最先端のコーディネーションポリシを一貫して上回っており、ストレージオーバーヘッドはわずかである。 Athenaはhttps://github.com/CMU-SAFARI/Athenaで無料で入手できる。

論文の概要: Athena: Synergizing Data Prefetching and Off-Chip Prediction via Online Reinforcement Learning

関連論文リスト