Fugu-MT 論文翻訳(概要): Pramana: Fine-Tuning Large Language Models for Epistemic Reasoning through Navya-Nyaya

論文の概要: Pramana: Fine-Tuning Large Language Models for Epistemic Reasoning through Navya-Nyaya

arxiv url: http://arxiv.org/abs/2604.04937v1
Date: Sat, 14 Feb 2026 23:45:29 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-19 19:09:11.356623
Title: Pramana: Fine-Tuning Large Language Models for Epistemic Reasoning through Navya-Nyaya
Title（参考訳）: Pramana: Navya-Nyayaを通したてんかん治療のための微調整大言語モデル
Authors: Sharath Sathish,
Abstract要約: 大規模な言語モデルは、流動的なテキストを生成するが、体系的な推論に苦労する。 Appleの研究者が無関係なコンテキストを追加すると、LLMのパフォーマンスは65%低下した。トレーサブルエビデンスにおけるこの主張を根拠にできないことは、正当化を必要とする領域におけるAIの信頼性を制限します。 2500年前のインドの推論フレームワークであるPramanaの微調整について紹介する。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models produce fluent text but struggle with systematic reasoning, often hallucinating confident but unfounded claims. When Apple researchers added irrelevant context to mathematical problems, LLM performance degraded by 65% Apple Machine Learning Research, exposing brittle pattern-matching beneath apparent reasoning. This epistemic gap, the inability to ground claims in traceable evidence, limits AI reliability in domains requiring justification. We introduce Pramana, a novel approach that teaches LLMs explicit epistemological methodology by fine-tuning on Navya-Nyaya logic, a 2,500-year-old Indian reasoning framework. Unlike generic chain-of-thought prompting, Navya-Nyaya enforces structured 6-phase reasoning: SAMSHAYA (doubt analysis), PRAMANA (evidence source identification), PANCHA AVAYAVA (5-member syllogism with universal rules), TARKA (counterfactual verification), HETVABHASA (fallacy detection), and NIRNAYA (ascertainment distinguishing knowledge from hypothesis). This integration of logic and epistemology provides cognitive scaffolding absent from standard reasoning approaches. We fine-tune Llama 3.2-3B and DeepSeek-R1-Distill-Llama-8B on 55 Nyaya-structured logical problems (constraint satisfaction, Boolean SAT, multi-step deduction). Stage 1 achieves 100% semantic correctness on held-out evaluation despite only 40% strict format adherence revealing that models internalize reasoning content even when structural enforcement is imperfect. Ablation studies show format prompting and temperature critically affect performance, with optimal configurations differing by stage. We release all models, datasets, and training infrastructure on Hugging Face to enable further research on epistemic frameworks for AI reasoning.
Abstract（参考訳）: 大規模な言語モデルは、流動的なテキストを生成するが、体系的な推論に苦慮し、しばしば自信あるが根拠のない主張を幻覚させる。 Appleの研究者たちが数学的問題に無関係なコンテキストを追加すると、LLMのパフォーマンスは65%低下し、明らかな推論の下にある脆いパターンマッチングが明らかになった。この疫学的なギャップ、トレーサブルな証拠の主張を根拠にできないことは、正当化を必要とする領域におけるAIの信頼性を制限する。 2500年前のインドの推論フレームワークであるNavala-Nyaya論理を微調整し,LLMの明示的な認識論的方法論を教える新しいアプローチであるPranaを紹介した。一般的なチェーン・オブ・シンセサイティングとは異なり、Navala-Nyayaは、SAMSHAYA(疑似分析)、PRAMANA(証拠情報源同定)、PANCHA AVAYAVA(普遍規則付き5員のシロジズム)、TARKA(偽検証)、HETVABHASA(誤検出)、NIRNAYA(仮説と知識を区別する確認)という、構造化された6相推論を施行している。この論理学と認識学の統合は、標準的な推論アプローチを欠いた認知的足場を提供する。 Llama 3.2-3BとDeepSeek-R1-Distill-Llama-8Bを55のニヤヤ構造論理問題(制約満足度,ブールSAT,マルチステップ推論)で微調整する。ステージ1は、構造的強制が不完全である場合でも、モデルが推論内容を内部化することを示す厳密な形式順守のわずか40%にもかかわらず、ホールトアウト評価において100%の意味的正当性を達成する。アブレーション研究は、フォーマットのプロンプトと温度がパフォーマンスに重大な影響を与え、最適な構成はステージによって異なることを示している。私たちはHugging Face上ですべてのモデル、データセット、トレーニングインフラストラクチャをリリースし、AI推論のための疫学フレームワークに関するさらなる研究を可能にします。

論文の概要: Pramana: Fine-Tuning Large Language Models for Epistemic Reasoning through Navya-Nyaya

関連論文リスト