Fugu-MT 論文翻訳(概要): DESIGNER: Design-Logic-Guided Multidisciplinary Data Synthesis for LLM Reasoning

論文の概要: DESIGNER: Design-Logic-Guided Multidisciplinary Data Synthesis for LLM Reasoning

arxiv url: http://arxiv.org/abs/2508.12726v3
Date: Wed, 08 Oct 2025 17:57:43 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-09 14:21:18.095262
Title: DESIGNER: Design-Logic-Guided Multidisciplinary Data Synthesis for LLM Reasoning
Title（参考訳）: DESIGNER:LLM推論のための設計論理型多分野データ合成
Authors: Weize Liu, Yongchi Zhao, Yijia Luo, Mingyu Xu, Jiaheng Liu, Yanan Li, Xiguo Hu, Zhiqi Bai, Yuchi Xu, Wenbo Su, Bo Zheng,
Abstract要約: 本稿では,「設計論理」の概念を導入し,人間教育者の質問作成過程を模倣するようにLCMに指示する。 LLMを使って、さまざまな分野にわたる既存の質問から12万以上の設計ロジックをリバースエンジニアリングし、抽象化します。これらの設計ロジックをソースドキュメントとマッチングすることで、既存のデータセットの難しさや多様性をはるかに超える推論的な質問を作成できるのです。
参考スコア（独自算出の注目度）: 31.744811175188442
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) have achieved remarkable success in many natural language tasks but still struggle with complex, multi-step reasoning, particularly across diverse disciplines. Existing reasoning datasets often lack disciplinary breadth, reasoning depth, and diversity, and lack guiding principles for question synthesis. We propose DESIGNER: a DESIGN-logic-guidEd Reasoning data synthesis pipeline that leverages naturally available, extensive raw documents (e.g., book corpus and web corpus) to generate multidisciplinary challenging questions. We introduce the concept of "design logic" and instruct LLMs to mimic human educators' question-creation process, enabling automated synthesis of large-scale, high-difficulty questions. We use LLMs to reverse-engineer and abstract over 120,000 design logics from existing questions across various disciplines. By matching these design logics with source documents, we are able to create reasoning questions that far surpass the difficulty and diversity of existing datasets. Using this pipeline, we synthesized two large-scale reasoning datasets that span 75 disciplines: DLR-Book (3.04 million questions from the book corpus) and DLR-Web (1.66 million questions from the web corpus). Data analysis indicates that the questions synthesized by our method exhibit greater difficulty and diversity compared to those in the baseline datasets. We validate our synthesized data through supervised fine-tuning (SFT) on the Qwen3 and Llama3 model families. Our data substantially enhances their multidisciplinary reasoning capabilities, outperforming existing datasets. Notably, after SFT on our datasets, the base versions of these models even surpass their official instruction-tuned counterparts.
Abstract（参考訳）: 大規模言語モデル(LLM)は多くの自然言語処理において顕著な成功を収めてきたが、それでも複雑で多段階の推論に苦戦している。既存の推論データセットは、しばしば学際的な幅、推論の深さ、多様性を欠き、質問合成の指針を欠いている。我々はDESIGNER:DESIGN-logic-guidEd Reasoningデータ合成パイプラインを提案する。我々は「設計論理」の概念を導入し、LLMに人間の教育者の質問作成プロセスを模倣するよう指示し、大規模で難解な質問の自動合成を可能にした。 LLMを使って、さまざまな分野にわたる既存の質問から12万以上の設計ロジックをリバースエンジニアリングし、抽象化します。これらの設計ロジックをソースドキュメントとマッチングすることで、既存のデータセットの難しさや多様性をはるかに超える推論的な質問を作成できるのです。このパイプラインを用いて、DLR-Book(本コーパスから3.04万質問)とDLR-Web(ウェブコーパスから1.66万質問)という、75の分野にわたる大規模推論データセットを合成した。データ分析により,本手法によって合成された質問は,ベースラインデータセットよりも難易度や多様性が高いことが示された。我々は,Qwen3およびLlama3モデルファミリ上の教師付き微調整(SFT)により合成データを検証した。我々のデータは、その多分野推論能力を大幅に向上させ、既存のデータセットよりも優れています。特に、データセットのSFT後、これらのモデルのベースバージョンは、公式の命令指定モデルを超えています。

論文の概要: DESIGNER: Design-Logic-Guided Multidisciplinary Data Synthesis for LLM Reasoning

関連論文リスト