Fugu-MT paper translation (abstract): AutoLALA: Automatic Loop Algebraic Locality Analysis for AI and HPC Kernels

Paper overview: AutoLALA: Automatic Loop Algebraic Locality Analysis for AI and HPC Kernels

arxiv url: http://arxiv.org/abs/2604.05066v1
Date: Mon, 06 Apr 2026 18:12:39 GMT
Status: translation complete
Last updated in system: 2026-04-08 17:42:09.431004
Title: AutoLALA: Automatic Loop Algebraic Locality Analysis for AI and HPC Kernels
Authors: Yifan Zhu, Yekai Pan, Yanghui Wu, Chen Ding
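Abstract summary: AutoLALA is an open-source tool that analyzes data locality in affine loop programs. It produces closed-form symbolic formulas for reuse distance and data movement complexity.
Reference score (attention score computed independently by this site): 6.223124502234209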
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Data movement is the primary bottleneck in modern computing systems. For loop-based programs common in high-performance computing (HPC) and AI workloads, including matrix multiplication, tensor contraction, stencil computation, and einsum operations, the cost of moving data through the memory hierarchy often exceeds the cost of arithmetic. This paper presents AutoLALA, an open-source tool that analyzes data locality in affine loop programs. The tool accepts programs written in a small domain-specific language (DSL), lowers them to polyhedral sets and maps, and produces closed-form symbolic formulas for reuse distance and data movement complexity. AutoLALA implements the fully symbolic locality analysis of Zhu et al. together with the data movement distance (DMD) framework of Smith et al. In particular, it computes reuse distance as the image of the access space under the access map, avoiding both stack simulation and Denning's recursive working-set formulation. We describe the DSL syntax and its formal semantics, the polyhedral lowering pipeline that constructs timestamp spaces and access maps via affine transformations, and the sequence of Barvinok counting operations used to derive symbolic reuse-interval and reuse-distance distributions. The system is implemented in Rust as a modular library spanning three crates, with safe bindings to the Barvinok library. We provide both a command-line interface and an interactive web playground with LaTeX rendering of the output formulas. The tool handles arbitrary affine loop nests, covering workloads such as tensor contractions, einsum expressions, stencil computations, and general polyhedral programs.
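Illustrative sketch (not from the paper): the abstract states that AutoLALA derives reuse distance symbolically and avoids stack simulation. To make the measured quantity concrete, the Python snippet below shows the brute-force baseline that a symbolic tool avoids, namely an LRU stack simulation over the address trace of a naive matrix multiplication. The function names `matmul_trace` and `reuse_distance_histogram` are hypothetical, and the convention used here (distance = number of distinct other addresses touched since the previous access to the same address) is an assumption; the paper may define it slightly differently.

```python
from collections import Counter


def matmul_trace(n):
    """Yield the address trace of a naive n x n matmul: C[i][j] += A[i][k] * B[k][j]."""
    for i in range(n):
        for j in range(n):
            for k in range(n):
                yield ("A", i, k)
                yield ("B", k, j)
                yield ("C", i, j)


def reuse_distance_histogram(trace):
    """Count reuse distances by LRU stack simulation.

    The distance of an access is the number of distinct other addresses
    touched since the previous access to the same address. A first-time
    access (cold miss) is recorded under the key None.
    """
    stack = []  # most recently used address sits at the end
    hist = Counter()
    for addr in trace:
        if addr in stack:
            pos = stack.index(addr)
            hist[len(stack) - 1 - pos] += 1
            stack.pop(pos)
        else:
            hist[None] += 1
        stack.append(addr)
    return hist


# Histogram for a 2 x 2 problem; the simulation cost grows with trace length,
# which is the overhead a closed-form symbolic formula sidesteps.
hist = reuse_distance_histogram(matmul_trace(2))
```

The simulation has to be rerun for every problem size, whereas a closed-form formula in the loop bounds, as the abstract describes, would cover all sizes at once.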
