Fugu-MT 論文翻訳(概要): OpenZL: A Graph-Based Model for Compression

論文の概要: OpenZL: A Graph-Based Model for Compression

arxiv url: http://arxiv.org/abs/2510.03203v2
Date: Thu, 30 Oct 2025 19:30:14 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-03 19:57:09.714289
Title: OpenZL: A Graph-Based Model for Compression
Title（参考訳）: OpenZL: 圧縮のためのグラフベースのモデル
Authors: Yann Collet, Nick Terrell, W. Felix Handte, Danielle Rozenblit, Victor Zhang, Kevin Zhang, Yaelle Goldschlag, Jennifer Lee, Elliot Gorokhovsky, Yonatan Komornik, Daniel Riegel, Stan Angelov, Nadav Rotem,
Abstract要約: アプリケーション固有の圧縮機システムは、最高の汎用圧縮機よりも優れている。これらの課題は,新たな圧縮戦略によって克服可能であることを示す。 OpenZLはデータを自己記述型のワイヤフォーマットに圧縮し、任意の構成はユニバーサルデコーダによって圧縮される。
参考スコア（独自算出の注目度）: 1.9508265730898475
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Research techniques in the last decade have improved lossless compression ratios by significantly increasing processing time. These techniques have remained obscure because production systems require high throughput and low resource utilization. In practice, application-specific compression algorithms that leverage knowledge of the data structure and semantics are more popular. Application-specific compressor systems outperform even the best generic compressors, but these techniques have some drawbacks. Application-specific compressors are inherently limited in applicability, have high development costs, and are difficult to maintain and deploy. In this work, we show that these challenges can be overcome with a new compression strategy. We propose the "graph model" of compression, a new theoretical framework for representing compression as a directed acyclic graph of modular codecs. OpenZL compresses data into a self-describing wire format, any configuration of which can be decompressed by a universal decoder. OpenZL's design enables rapid development of tailored compressors with minimal code; its universal decoder eliminates deployment lag; and its investment in a well-vetted standard component library minimizes security risks. Experimental results demonstrate that OpenZL achieves superior compression ratios and speeds compared to state-of-the-art general-purpose compressors on a variety of real-world datasets. Internal deployments at Meta have also shown consistent improvements in size and/or speed, with development timelines reduced from months to days. OpenZL thus represents a significant advance in practical, scalable, and maintainable data compression for modern data-intensive applications.
Abstract（参考訳）: 過去10年間の研究技術は、処理時間を大幅に増加させ、ロスレス圧縮比を改善してきた。生産システムは高いスループットと低い資源利用を必要とするため、これらの技術はいまだに不明である。実際には、データ構造とセマンティクスの知識を活用するアプリケーション固有の圧縮アルゴリズムが一般的である。アプリケーション固有の圧縮機システムは、最高の汎用圧縮機よりも優れているが、これらの技術にはいくつかの欠点がある。アプリケーション固有の圧縮機は、本質的には適用性に制限があり、開発コストが高く、メンテナンスやデプロイが困難である。本研究では,新しい圧縮戦略により,これらの課題を克服できることを示す。モジュラーコーデックの有向非巡回グラフとして圧縮を表現するための新しい理論フレームワークである圧縮の「グラフモデル」を提案する。 OpenZLはデータを自己記述型のワイヤフォーマットに圧縮し、任意の構成はユニバーサルデコーダによって圧縮される。 OpenZLの設計は、最小限のコードで調整された圧縮機を迅速に開発することを可能にし、ユニバーサルデコーダはデプロイメントの遅延をなくし、十分に検証された標準コンポーネントライブラリへの投資は、セキュリティリスクを最小限に抑える。実験により,OpenZLは様々な実世界のデータセット上の最先端汎用圧縮機と比較して,圧縮率と速度が優れていることが示された。 Metaの社内展開ではサイズや速度が一貫した改善が見られ、開発タイムラインは数ヶ月から数日に短縮された。したがって、OpenZLは、現代的なデータ集約型アプリケーションのための実用的でスケーラブルで保守性のあるデータ圧縮において、大きな進歩を見せている。

論文の概要: OpenZL: A Graph-Based Model for Compression

関連論文リスト