Fugu-MT 論文翻訳(概要): CacheMind: From Miss Rates to Why -- Natural-Language, Trace-Grounded Reasoning for Cache Replacement

論文の概要: CacheMind: From Miss Rates to Why -- Natural-Language, Trace-Grounded Reasoning for Cache Replacement

arxiv url: http://arxiv.org/abs/2602.12422v1
Date: Thu, 12 Feb 2026 21:28:23 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-16 23:37:53.761123
Title: CacheMind: From Miss Rates to Why -- Natural-Language, Trace-Grounded Reasoning for Cache Replacement
Title（参考訳）: CacheMind: ミス率から理由へ -- キャッシュ置換のための自然な言語、トレースを取り巻く推論
Authors: Kaushal Mhapsekar, Azam Ghanbari, Bita Aslrousta, Samira Mirbagher-Ajorpaz,
Abstract要約: Retrieval-Augmented Generation(RAG)とLarge Language Models(LLM)を使用してキャッシュトレースに対するセマンティック推論を可能にするツールであるCacheMindを紹介する。アーキテクトは、"なぜPC Xに関連付けられたメモリアクセスが、より多くの排除を引き起こすのか? キャッシュ置換問題に対するLCMベースの推論のための最初の検証済みベンチマークスイートであるCacheMindBenchを紹介する。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Cache replacement remains a challenging problem in CPU microarchitecture, often addressed using hand-crafted heuristics, limiting cache performance. Cache data analysis requires parsing millions of trace entries with manual filtering, making the process slow and non-interactive. To address this, we introduce CacheMind, a conversational tool that uses Retrieval-Augmented Generation (RAG) and Large Language Models (LLMs) to enable semantic reasoning over cache traces. Architects can now ask natural language questions like, "Why is the memory access associated with PC X causing more evictions?", and receive trace-grounded, human-readable answers linked to program semantics for the first time. To evaluate CacheMind, we present CacheMindBench, the first verified benchmark suite for LLM-based reasoning for the cache replacement problem. Using the SIEVE retriever, CacheMind achieves 66.67% on 75 unseen trace-grounded questions and 84.80% on 25 unseen policy-specific reasoning tasks; with RANGER, it achieves 89.33% and 64.80% on the same evaluations. Additionally, with RANGER, CacheMind achieves 100% accuracy on 4 out of 6 categories in the trace-grounded tier of CacheMindBench. Compared to LlamaIndex (10% retrieval success), SIEVE achieves 60% and RANGER achieves 90%, demonstrating that existing Retrieval-Augmented Generation (RAGs) are insufficient for precise, trace-grounded microarchitectural reasoning. We provided four concrete actionable insights derived using CacheMind, wherein bypassing use case improved cache hit rate by 7.66% and speedup by 2.04%, software fix use case gives speedup of 76%, and Mockingjay replacement policy use case gives speedup of 0.7%; showing the utility of CacheMind on non-trivial queries that require a natural-language interface.
Abstract（参考訳）: キャッシュ置換はCPUマイクロアーキテクチャにおいて依然として困難な問題であり、しばしば手作りのヒューリスティックを使って対処し、キャッシュ性能を制限している。キャッシュデータ分析では、数百万のトレースエントリを手動によるフィルタリングで解析する必要があります。これを解決するために、キャッシュトレース上のセマンティック推論を可能にするために、Retrieval-Augmented Generation (RAG)とLarge Language Models (LLM)を使用する対話ツールであるCacheMindを紹介した。アーキテクトは、"なぜPC Xに関連付けられたメモリアクセスがより多くの消去を引き起こすのか?"といった自然言語の質問をすることで、プログラムセマンティクスにリンクした、トレースされた人間可読な回答を初めて受け取ることができる。 CacheMindを評価するために,キャッシュ置換問題に対するLCMベースの推論のための最初のベンチマークスイートであるCacheMindBenchを提案する。 SIEVEレトリバーを用いて、CacheMindは75の見当たらないトレースグラウンドの質問に対して66.67%、25の見当たらないポリシー固有の推論タスクで84.80%、RANGERでは89.33%、64.80%を同じ評価で達成している。さらにRANGERでは、CacheMindはCacheMindBenchのトレースグラウンド層にある6つのカテゴリのうち4つで100%の精度を実現している。 LlamaIndex (10%の検索成功)と比較して、SIEVEは60%を達成し、RANGERは90%を達成している。我々は、CacheMindを使った具体的な実行可能な4つの洞察を提供し、キャッシュヒット率7.66%、スピードアップ2.04%、ソフトウェア修正ユースケース76%、モッキンジェイ代替ポリシーユースケース0.7%、自然言語インターフェースを必要とする非自明なクエリに対するCacheMindの有用性を示す。

論文の概要: CacheMind: From Miss Rates to Why -- Natural-Language, Trace-Grounded Reasoning for Cache Replacement

関連論文リスト