Fugu-MT 論文翻訳(概要): BoostTaxo: Zero-Shot Taxonomy Induction via Boosting-Style Agentic Reasoning and Constraint-Aware Calibration

論文の概要: BoostTaxo: Zero-Shot Taxonomy Induction via Boosting-Style Agentic Reasoning and Constraint-Aware Calibration

arxiv url: http://arxiv.org/abs/2605.12520v1
Date: Fri, 03 Apr 2026 15:48:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-25 12:34:33.779126
Title: BoostTaxo: Zero-Shot Taxonomy Induction via Boosting-Style Agentic Reasoning and Constraint-Aware Calibration
Title（参考訳）: BoostTaxo:Boosting-Style Agentic ReasoningとConstraint-Aware Calibrationによるゼロショット分類誘導
Authors: Yancheng Ling, Zhenlin Qin, Leizhen Wang, Zhenliang Ma,
Abstract要約: BoostTaxoはゼロショット分類誘導のためのブースティングスタイルのLLMフレームワークである。一連のドメイン用語を入力として取り、粗い方法で親識別を行う。検索強化された定義修正、ハイブリッドな親候補選択、候補評価、構造対応スコアキャリブレーションを採用している。ゼロショット分類誘導における最先端の手法よりも優れた、あるいは同等のパフォーマンスを達成する。
参考スコア（独自算出の注目度）: 6.719751155411076
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Taxonomy induction is crucial for organizing concepts into explicit and interpretable semantic hierarchies. While existing methods have achieved promising results, their generalization, structural reliability, and efficiency remain limited, hindering their performance in zero-shot and large-scale scenarios. To overcome these limitations, we introduce BoostTaxo, a boosting-style LLM framework for zero-shot taxonomy induction. It takes a set of domain terms as inputs and performs parent identification in a coarse-to-fine manner, employing retrieval-augmented definition refinement, hybrid parent candidate selection, candidate rating, and structure-aware score calibration to improve taxonomy construction. Specifically, a lightweight LLM is used to efficiently filter candidate parents, while a large-scale LLM is employed to rank and score candidate parents for fine-grained parent selection. Structural features are further incorporated to calibrate candidate edge weights and enhance the reliability of the induced taxonomy. The unified BoostTaxo is evaluated on three public benchmark datasets, namely WordNet, DBLP, and SemEval-Sci, and achieves superior or comparable performance to state-of-the-art methods in zero-shot taxonomy induction. The ablation study validates the contribution of the hybrid parent candidate selection and the structure-aware score calibration to the overall performance. Further analysis investigates the impact of candidate selection size on taxonomy quality and presents representative case and failure studies, providing deeper insights into the effectiveness and limitations of the proposed framework.
Abstract（参考訳）: 分類誘導は、概念を明示的で解釈可能な意味的階層にまとめるのに不可欠である。既存の手法は有望な結果を得たが、その一般化、構造的信頼性、効率性は限定的であり、ゼロショットや大規模シナリオのパフォーマンスを妨げている。これらの制限を克服するため、ゼロショット分類誘導のためのブースティングスタイルのLLMフレームワークBoostTaxoを紹介した。ドメイン用語のセットを入力として取り、検索強化定義修正、ハイブリッド親候補選択、候補格付け、構造認識スコアキャリブレーションを活用して、粗大な方法で親識別を行う。具体的には、軽量LLMを用いて候補親を効率よくフィルタリングし、大規模LLMを用いて候補親のランク付けとスコア付けを行い、きめ細かい親選択を行う。構造的特徴は、候補のエッジウェイトを校正し、誘導された分類の信頼性を高めるためにさらに組み込まれている。統合BoostTaxoは、WordNet、DBLP、SemEval-Sciという3つの公開ベンチマークデータセットで評価され、ゼロショット分類誘導における最先端の手法よりも優れた、あるいは同等のパフォーマンスを達成する。 Ablation study is confirmed the contribute of the hybrid parent candidate selection and the structure-aware score calibration to the overall performance。さらに, 候補選択サイズが分類品質に及ぼす影響について検討し, 代表的な事例と失敗事例を提示し, 提案フレームワークの有効性と限界についてより深い知見を提供する。

論文の概要: BoostTaxo: Zero-Shot Taxonomy Induction via Boosting-Style Agentic Reasoning and Constraint-Aware Calibration

関連論文リスト