Fugu-MT 論文翻訳(概要): OpenZL: A Graph-Based Model for Compression

論文の概要: OpenZL: A Graph-Based Model for Compression

arxiv url: http://arxiv.org/abs/2510.03203v1
Date: Fri, 03 Oct 2025 17:40:29 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-06 16:35:52.515626
Title: OpenZL: A Graph-Based Model for Compression
Title（参考訳）: OpenZL: 圧縮のためのグラフベースのモデル
Authors: Yann Collet, Nick Terrell, W. Felix Handte, Danielle Rozenblit, Victor Zhang, Kevin Zhang, Yaelle Goldschlag, Jennifer Lee, Daniel Riegel, Stan Angelov, Nadav Rotem,
Abstract要約: モジュラーコーデックの有向非巡回グラフとして圧縮を表現するための新しい理論的枠組みを提案する。これは、データを自己記述型のワイヤフォーマットに圧縮する、このモデルの実装であるOpenZLを動機付けている。 OpenZLは最先端の汎用圧縮機に比べて圧縮率と速度が優れている。
参考スコア（独自算出の注目度）: 2.043259170549106
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Research in general-purpose lossless compression over the last decade has largely found improvements in compression ratio that come at great cost to resource utilization and processing throughput. However, most production workloads require high throughput and low resource utilization, so most research systems have seen little adoption. Instead, real world improvements in compression are increasingly often realized by building application-specific compressors which can exploit knowledge about the structure and semantics of the data being compressed. These systems easily outperform even the best generic compressors, but application-specific compression schemes are not without drawbacks. They are inherently limited in applicability and are difficult to maintain and deploy. We show that these challenges can be overcome with a new way of thinking about compression. We propose the ``graph model'' of compression, a new theoretical framework for representing compression as a directed acyclic graph of modular codecs. This motivates OpenZL, an implementation of this model that compresses data into a self-describing wire format, any configuration of which can be decompressed by a universal decoder. OpenZL's design enables rapid development of tailored compressors with minimal code, its universal decoder eliminates deployment lag, and its investment in a well-vetted standard component library minimizes security risks. Experimental results demonstrate that OpenZL achieves superior compression ratios and speeds compared to state-of-the-art general-purpose compressors on a variety of real-world datasets. Internal deployments at Meta have also shown consistent improvements in size and/or speed, with development timelines reduced from months to days. OpenZL thus represents an advance in practical, scalable, and maintainable data compression for modern data-intensive applications.
Abstract（参考訳）: 過去10年間の汎用的なロスレス圧縮の研究は、資源利用と処理のスループットに大きなコストがかかる圧縮率の改善を大いに見出している。しかし、ほとんどの運用ワークロードは高いスループットと低いリソース利用を必要とするため、ほとんどの研究システムはほとんど採用されていない。代わりに、圧縮されるデータの構造とセマンティクスに関する知識を活用できるアプリケーション固有の圧縮機を構築することで、圧縮の現実的な改善がますます実現される。これらのシステムは、最高の汎用圧縮機でも容易に性能が向上するが、アプリケーション固有の圧縮スキームには欠点がない。適用性には本質的に制限があり、メンテナンスやデプロイが難しい。圧縮に関する新たな考え方によって,これらの課題を克服できることが示される。モジュラーコーデックの有向非巡回グラフとして圧縮を表現するための新しい理論フレームワークである圧縮の「グラフモデル」を提案する。これは、データを自己記述型のワイヤフォーマットに圧縮するOpenZLの実装であり、任意の構成はユニバーサルデコーダによって圧縮される。 OpenZLの設計は、最小限のコードで調整された圧縮機を迅速に開発することを可能にし、ユニバーサルデコーダはデプロイメント遅延を排除し、十分に検証された標準コンポーネントライブラリへの投資は、セキュリティリスクを最小限にする。実験により,OpenZLは様々な実世界のデータセット上の最先端汎用圧縮機と比較して,圧縮率と速度が優れていることが示された。 Metaの社内展開ではサイズや速度が一貫した改善が見られ、開発タイムラインは数ヶ月から数日に短縮された。したがって、OpenZLは、現代のデータ集約型アプリケーションのための実用的でスケーラブルで保守可能なデータ圧縮の進歩を表している。

論文の概要: OpenZL: A Graph-Based Model for Compression

関連論文リスト