Fugu-MT 論文翻訳(概要): A Benchmark for Open-Domain Numerical Fact-Checking Enhanced by Claim Decomposition

論文の概要: A Benchmark for Open-Domain Numerical Fact-Checking Enhanced by Claim Decomposition

arxiv url: http://arxiv.org/abs/2510.22055v1
Date: Fri, 24 Oct 2025 22:37:13 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-28 15:28:14.804143
Title: A Benchmark for Open-Domain Numerical Fact-Checking Enhanced by Claim Decomposition
Title（参考訳）: クレーム分解による開領域数値Fact-Checkingのベンチマーク
Authors: V Venktesh, Deepali Prabhu, Avishek Anand,
Abstract要約: QuanTemp++は、自然の数値的クレーム、オープンドメインコーパスからなるデータセットであり、各クレームに関する関連する証拠がある。キークレーム分解パラダイムの検索性能を特徴付ける。
参考スコア（独自算出の注目度）: 7.910984819642885
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Fact-checking numerical claims is critical as the presence of numbers provide mirage of veracity despite being fake potentially causing catastrophic impacts on society. The prior works in automatic fact verification do not primarily focus on natural numerical claims. A typical human fact-checker first retrieves relevant evidence addressing the different numerical aspects of the claim and then reasons about them to predict the veracity of the claim. Hence, the search process of a human fact-checker is a crucial skill that forms the foundation of the verification process. Emulating a real-world setting is essential to aid in the development of automated methods that encompass such skills. However, existing benchmarks employ heuristic claim decomposition approaches augmented with weakly supervised web search to collect evidences for verifying claims. This sometimes results in less relevant evidences and noisy sources with temporal leakage rendering a less realistic retrieval setting for claim verification. Hence, we introduce QuanTemp++: a dataset consisting of natural numerical claims, an open domain corpus, with the corresponding relevant evidence for each claim. The evidences are collected through a claim decomposition process approximately emulating the approach of human fact-checker and veracity labels ensuring there is no temporal leakage. Given this dataset, we also characterize the retrieval performance of key claim decomposition paradigms. Finally, we observe their effect on the outcome of the verification pipeline and draw insights. The code for data pipeline along with link to data can be found at https://github.com/VenkteshV/QuanTemp_Plus
Abstract（参考訳）: ファクトチェックの数値的な主張は、社会に破滅的な影響をもたらす可能性のある偽物であるにもかかわらず、数字の存在が真正さを誇示しているため、非常に重要である。自動事実検証における以前の研究は、主に自然数値的な主張に焦点を当てていない。典型的な人間の事実チェックは、まずクレームの異なる数値的な側面に対処する関連する証拠を回収し、クレームの正確性を予測する理由を導出する。したがって、人間のファクトチェッカーの検索プロセスは、検証プロセスの基礎となる重要なスキルである。現実世界の設定をエミュレートすることは、そのようなスキルを含む自動化された手法の開発を支援するために不可欠である。しかし、既存のベンチマークでは、弱い教師付きWeb検索で強化されたヒューリスティックなクレーム分解アプローチを使用して、クレームを検証するための証拠を収集している。この結果、時限リークによる証拠やノイズの少ない情報源は、クレーム検証のためのより現実的でない検索設定をレンダリングすることがある。そこで我々は,自然数値クレームとオープンドメインコーパスからなるデータセットであるQuanTemp++を紹介した。証拠は、人間の事実チェックと正確性ラベルのアプローチをエミュレートしたクレーム分解プロセスを通じて収集され、時間的漏洩がないことが保証される。このデータセットから,キークレーム分解パラダイムの検索性能を特徴付ける。最後に、検証パイプラインの結果に対するそれらの影響を観察し、洞察を引き出す。データパイプラインのコードとデータへのリンクはhttps://github.com/VenkteshV/QuanTemp_Plusにある。

論文の概要: A Benchmark for Open-Domain Numerical Fact-Checking Enhanced by Claim Decomposition

関連論文リスト