Fugu-MT 論文翻訳(概要): Towards Observation Lakehouses: Living, Interactive Archives of Software Behavior

論文の概要: Towards Observation Lakehouses: Living, Interactive Archives of Software Behavior

arxiv url: http://arxiv.org/abs/2512.02795v1
Date: Tue, 02 Dec 2025 14:12:36 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-03 21:04:45.913383
Title: Towards Observation Lakehouses: Living, Interactive Archives of Software Behavior
Title（参考訳）: レイクハウスの観察に向けて - ソフトウェア行動のインタラクティブなアーカイブ
Authors: Marcus Kessel,
Abstract要約: 先行研究では,Sequence Sheets,StimulusResponse Matrices,StimulusResponse Cubesを用いて表現を行った。本稿では,連続SRCを運用する観測用レイクハウスについて紹介する。制御パイプライン(SOLAS)とCILASからデータを取り込み、n-versionアセスメント、行動クラスタリング、コンセンサスオラクルを可能にします。
参考スコア（独自算出の注目度）: 2.6397379133308214
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Code-generating LLMs are trained largely on static artifacts (source, comments, specifications) and rarely on materializations of run-time behavior. As a result, they readily internalize buggy or mislabeled code. Since non-trivial semantic properties are undecidable in general, the only practical way to obtain ground-truth functionality is by dynamic observation of executions. In prior work, we addressed representation with Sequence Sheets, Stimulus-Response Matrices (SRMs), and Stimulus-Response Cubes (SRCs) to capture and compare behavior across tests, implementations, and contexts. These structures make observation data analyzable offline and reusable, but they do not by themselves provide persistence, evolution, or interactive analytics at scale. In this paper, therefore, we introduce observation lakehouses that operationalize continual SRCs: a tall, append-only observations table storing every actuation (stimulus, response, context) and SQL queries that materialize SRC slices on demand. Built on Apache Parquet + Iceberg + DuckDB, the lakehouse ingests data from controlled pipelines (LASSO) and CI pipelines (e.g., unit test executions), enabling n-version assessment, behavioral clustering, and consensus oracles without re-execution. On a 509-problem benchmark, we ingest $\approx$8.6M observation rows ($<$51MiB) and reconstruct SRM/SRC views and clusters in $<$100ms on a laptop, demonstrating that continual behavior mining is practical without a distributed cluster of machines. This makes behavioral ground truth first-class alongside other run-time data and provides an infrastructure path toward behavior-aware evaluation and training. The Observation Lakehouse, together with the accompanying dataset, is publicly available as an open-source project on GitHub: https://github.com/SoftwareObservatorium/observation-lakehouse
Abstract（参考訳）: コード生成 LLM は主に静的アーティファクト(ソース、コメント、仕様)に基づいて訓練され、実行時の振る舞いの実体化はめったにない。結果として、バグだらけのコードやラベルが間違えたコードを簡単に内部化できる。非自明なセマンティックな性質は一般に決定不可能であるため、基底真実性を得るための唯一の実践方法は実行の動的観察である。先行研究では、SRM(Stimulus-Response Matrices)、SRM(Stimulus-Response Cubes)を用いて、テスト、実装、コンテキスト間の振る舞いをキャプチャし比較した。これらの構造は、観測データをオフラインで、再利用可能なものにしますが、それ自体は、持続性、進化性、大規模でインタラクティブな分析を提供していません。そこで本研究では,SRCスライスを要求に応じて生成する,すべてのアクティベーション(刺激,応答,コンテキスト)とSQLクエリを格納した高高度で付加のみの観測テーブルである,連続的なSRCを運用する観測用レイクハウスを紹介する。 Apache Parquet + Iceberg + DuckDB上に構築されたLakehouseは、コントロールパイプライン(LASSO)とCIパイプライン(ユニットテスト実行など)からデータを取り込み、n-versionアセスメント、振る舞いクラスタリング、再実行不要のコンセンサスオラクルを可能にする。 509プロブレムのベンチマークでは、$\approx$8.6Mの観察行($<51MiB)を取り込み、ラップトップ上のSRM/SRCビューとクラスタを$<100msで再構築し、マシンの分散クラスタなしで連続的な行動マイニングが実用的なことを実証した。これにより、行動基盤の真理を他の実行時データとともに第一級にし、行動認識評価とトレーニングに向けたインフラストラクチャパスを提供する。 Observation Lakehouseは関連するデータセットとともに、GitHub上のオープンソースプロジェクトとして公開されている。

論文の概要: Towards Observation Lakehouses: Living, Interactive Archives of Software Behavior

関連論文リスト