Fugu-MT 論文翻訳(概要): Is It Novel and Why? Fine-Grained Patent Novelty Prediction Based on Passage Retrieval

論文の概要: Is It Novel and Why? Fine-Grained Patent Novelty Prediction Based on Passage Retrieval

arxiv url: http://arxiv.org/abs/2605.02392v1
Date: Mon, 04 May 2026 09:34:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:50.221869
Title: Is It Novel and Why? Fine-Grained Patent Novelty Prediction Based on Passage Retrieval
Title（参考訳）: 新規でなぜか?パス検索に基づくファイングレード特許の新規性予測
Authors: Valentin Knappich, Anna Hätty, Simon Razniewski, Annemarie Friedrich,
Abstract要約: このデータセットは3,658個の特許クレームと,細粒度で特徴レベルの先行技術を用いた新規なデータセットである。請求項を特徴に分解し,各特徴を先行技術に対して解析し,最終的に請求項レベルの新規性予測を導出するLCMに基づく分類器の実装と評価を行う。
参考スコア（独自算出の注目度）: 12.986519085613635
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Novelty assessment is a critical yet complex task in the examination process for patent acceptance, requiring examiners to determine whether an invention is disclosed in a prior art document. The process involves intricate matching between specific features of a patent claim and passages in the prior art. While prior work has approached novelty prediction primarily as a binary classification task at the claim level, we argue that this formulation is susceptible to spurious correlations and lacks the granularity required for practical application. In this work, we introduce FiNE-Patents (Fine-grained Novelty Examination of Patents), a novel dataset comprising 3,658 first patent claims annotated with fine-grained, feature-level prior art references extracted from European Search Opinion (ESOP) documents. We propose shifting the evaluation paradigm from simple binary classification to a joint retrieval and abstract reasoning task at the feature level, requiring models to identify specific passages from a prior art document that disclose individual claim features, and to identify which features of a claim make it novel. We implement and evaluate LLM-based workflows that decompose claims into features, analyze each feature against prior art, and finally derive a claim-level novelty prediction. Our experiments demonstrate that these workflows outperform embedding-based baselines on passage retrieval and novel feature identification. Furthermore, we show that unlike trained classifiers, LLMs are robust against spurious correlations present in the claim-level novelty classification task. We release the dataset and code to foster further research into transparent and granular patent analysis.
Abstract（参考訳）: 新規性評価は、特許受理試験プロセスにおいて重要かつ複雑な課題であり、先行技術文書に発明が開示されているかどうかを審査官が判断する必要がある。このプロセスは、特許クレームの特定の特徴と、先行技術におけるパスとの複雑なマッチングを含む。従来の研究は、主にクレームレベルでのバイナリ分類タスクとして、新規性予測にアプローチしてきたが、この定式化は、素早い相関の影響を受ける可能性があり、実用的な応用に必要な粒度を欠いていると論じている。本研究では,欧州検索オピニオン(ESOP)文書から抽出した細粒度で特徴レベルの先行技術参照を付加した3,658個の特許請求書からなる新規データセットであるFiNE-Patents(Fine-fine Novelty Examination of Patents)を紹介する。本稿では,評価パラダイムを,単純な二項分類から特徴レベルの共同検索・抽象的推論タスクにシフトし,個別のクレームの特徴を開示する先行技術文書からの特定のパスをモデルで識別し,クレームのどの特徴が新規かを特定することを提案する。請求項を特徴に分解し,各特徴を先行技術に対して分析し,最終的に請求項レベルのノベルティ予測を導出するLCMベースのワークフローを実装し,評価する。実験により, これらのワークフローは, 通過経路の検索と特徴同定において, 埋め込みベースラインよりも優れていることを示した。さらに, 訓練された分類器とは異なり, LLMは, クレームレベルの新規性分類タスクに存在する突発的相関に対して頑健であることを示す。我々は、透明できめ細かい特許分析のさらなる研究を促進するために、データセットとコードを公開します。

論文の概要: Is It Novel and Why? Fine-Grained Patent Novelty Prediction Based on Passage Retrieval

関連論文リスト