Fugu-MT 論文翻訳(概要): Language Native Lightly Structured Databases for Large Language Model Driven Composite Materials Research

論文の概要: Language Native Lightly Structured Databases for Large Language Model Driven Composite Materials Research

arxiv url: http://arxiv.org/abs/2509.06093v2
Date: Sat, 01 Nov 2025 11:07:08 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-04 16:14:22.219784
Title: Language Native Lightly Structured Databases for Large Language Model Driven Composite Materials Research
Title（参考訳）: 大規模言語モデル駆動複合材料研究のための言語ネイティブ軽量構造化データベース
Authors: Yuze Liu, Zhaoyuan Zhang, Xiangsheng Zeng, Yihe Zhang, Leping Yu, Lejia Wang, Xi Yu,
Abstract要約: 材料の準備手順は、しばしば実験的なプロトコル、研究論文、特許、実験ノートに物語的に埋め込まれる。我々はこの課題を,テキストファーストで軽量に構造化された材料データベースを中心としたフレームワークを通じて,テキスト推論問題に再構築する。 LLMに基づく推論と言語固有のデータが組み合わさることで,実用的材料準備が大幅に促進されることを示す。
参考スコア（独自算出の注目度）: 6.31777560888658
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: The preparation procedures of materials are often embedded narratively in experimental protocols, research articles, patents, and laboratory notes, and are structured around procedural sequences, causal relationships, and conditional logic. The synthesis of boron nitride nanosheet (BNNS) polymer composites exemplifies this linguistically encoded decision-making system, where the practical experiments involve interdependent multistage and path-dependent processes such as exfoliation, functionalization, and dispersion, each governed by heterogeneous parameters and contextual contingencies, challenging conventional numerical optimization paradigms for experiment design. We reformulate this challenge into a text-reasoning problem through a framework centered on a text-first, lightly structured materials database and large language models (LLMs) as text reasoning engines. We constructed a database that captures evidence-linked narrative excerpts from the literature while normalizing only the minimum necessary entities, attributes, and relations to enable composite retrieval that unifies semantic matching, lexical cues, and explicit value filters. Building on this language-native, provenance-preserving foundation, the LLM operates in two complementary modes: retrieval-augmented generation (RAG), grounding outputs in retrieved evidence modules from the database, and experience-augmented reasoning (EAR), which leverages iteratively trained text guides derived from multi-source literature-based narrative data as external references to inform reasoning and decision-making. Applying this integration-and-reasoning framework, we demonstrate rapid, laboratory-scale optimization of BNNS preparation, highlighting how language-native data combined with LLM-based reasoning can significantly accelerate practical material preparation.
Abstract（参考訳）: 材料の準備手順は、しばしば実験的なプロトコル、研究論文、特許、実験ノートに物語的に埋め込まれ、手続き的なシーケンス、因果関係、条件論理を中心に構成されている。窒化ホウ素ナノシート(BNNS)ポリマー複合化合物の合成は、この言語的に符号化された意思決定システムを例示し、実際的な実験は、剥離、官能化、分散といった、相互依存的な多段階および経路依存のプロセスを含み、それぞれが異種パラメータと文脈的一致によって支配され、実験設計のための従来の数値最適化パラダイムに挑戦する。我々は,この課題をテキスト推論エンジンとして,テキストファーストで軽量に構造化された材料データベースと大規模言語モデル(LLM)を中心としたフレームワークを用いて,テキスト推論問題に再構成する。文献からエビデンスにリンクされた物語の抜粋を抽出するデータベースを構築し,必要最低限のエンティティ,属性,関係性のみを正規化し,セマンティックマッチング,語彙的手がかり,明示的な値フィルタを統一する複合検索を可能にする。この言語固有の証明保存基盤に基づいて、LLMは、検索強化生成(RAG)、データベースから取得したエビデンスモジュールの出力のグルーピング、複数ソースの文献ベースの物語データから得られた反復的に訓練されたテキストガイドを外部参照として活用し、情報推論と意思決定を行う2つの補完モードで機能する。この統合・推論フレームワークを応用して,言語ネイティブデータとLLMに基づく推論を組み合わせることで,BNNS作成の迅速かつ大規模な最適化が実現可能であることを示す。

論文の概要: Language Native Lightly Structured Databases for Large Language Model Driven Composite Materials Research

関連論文リスト