Fugu-MT 論文翻訳(概要): MatFormBench: A Benchmarking Evaluation Framework for Target-Driven Materials Formulation

論文の概要: MatFormBench: A Benchmarking Evaluation Framework for Target-Driven Materials Formulation

arxiv url: http://arxiv.org/abs/2605.26741v1
Date: Tue, 26 May 2026 09:15:11 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-27 17:51:41.783914
Title: MatFormBench: A Benchmarking Evaluation Framework for Target-Driven Materials Formulation
Title（参考訳）: MatFormBench: ターゲット駆動の材料定式化のためのベンチマーク評価フレームワーク
Authors: Linhan Wu, Chenxi Wang, Chuhan Yang, Zhengwei Yang, Yuyang Liu,
Abstract要約: MatFormBenchは、ターゲット駆動型定式化のための生成戦略の評価とガイドに適した、新しいベンチマークエコシステムである。 MatFormBenchは、物理駆動の定式化生成スキームを統合し、材料構造-不純物応答関係を忠実にエミュレートする合成サンプルを生成する。我々は、39種類の逆設計アルゴリズムを評価し、古典的なサロゲート支援ブラックボックス探索、最先端の深層生成モデル、そしてより人気のあるLarge Language Model(LLM)ベースのレコメンデーション戦略を網羅して、MatFormBenchを検証する。
参考スコア（独自算出の注目度）: 14.90227472199598
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Inverse design of materials has significantly advanced target-driven formulation optimization, yet existing materials machine learning benchmarks remain limited to forward property prediction, failing to systematically evaluate inverse optimization and generation algorithms, a critical gap that hinders the progress of target-driven materials design. To address this limitation, we propose MatFormBench, a novel benchmarking ecosystem tailored to evaluate and guide generative strategies for target-driven formulation. MatFormBench integrates a physics-driven formulation generation scheme to generate synthetic samples that faithfully emulate realistic materials structure-property response relationships, complemented by five escalating difficulty levels to quantify the complexity of these relationships. To rigorously assess algorithm performance, we further propose MatFormScore, a multi-dimensional metric that comprehensively quantifies performance across five critical axes: target success, search efficiency, exploratory capacity, robustness, and stability. We validate MatFormBench by evaluating 39 diverse inverse design algorithms, covering classical surrogate-assisted black-box search, state-of-the-art deep generative models, and increasingly popular Large Language Model (LLM)-based recommendation strategies. Across 1170 standardized algorithm-task evaluations, diffusion-based models demonstrate the strongest overall performance, while Variational Autoencoder (VAE)-based and Genetic Algorithm (GA)-based methods exhibit distinct advantages in specific scenarios. By establishing a unified evaluation standard for target-driven materials formulation, MatFormBench enables reproducible benchmarking, principled algorithm comparison, and diagnostic analysis of inverse design strategies, providing a foundational tool for advancing materials inverse design.
Abstract（参考訳）: 素材の逆設計は、ターゲット駆動型定式化の最適化が著しく進んでいるが、既存の機械学習ベンチマークは、対象駆動型材料設計の進行を妨げる重要なギャップである逆最適化と生成アルゴリズムの体系的評価に失敗し、前進特性予測に限られている。この制限に対処するために,ターゲット駆動型定式化のための生成戦略の評価とガイドを目的とした,新しいベンチマークエコシステムであるMatFormBenchを提案する。 MatFormBenchは、物理駆動の定式化生成スキームを統合し、現実的な物質構造-不適切な応答関係を忠実にエミュレートする合成サンプルを生成する。さらに,アルゴリズムの性能を厳密に評価するために,ターゲット成功,探索効率,探索能力,堅牢性,安定性の5つの重要な軸にまたがる性能を総合的に定量化する多次元計量MatFormScoreを提案する。我々は、39種類の逆設計アルゴリズムを評価し、古典的なサロゲート支援ブラックボックス探索、最先端の深層生成モデル、そしてより人気のあるLarge Language Model(LLM)ベースのレコメンデーション戦略を網羅して、MatFormBenchを検証する。 1170を超える標準化されたアルゴリズムタスク評価では、拡散ベースモデルが最も高い性能を示し、変分オートエンコーダ(VAE)ベースのGAベースの手法は特定のシナリオにおいて異なる優位性を示す。 MatFormBenchは、ターゲット駆動型材料定式化のための統一的な評価基準を確立することにより、再現可能なベンチマーク、アルゴリズム比較、逆設計戦略の診断分析を可能にし、材料逆設計を進めるための基本的なツールを提供する。

論文の概要: MatFormBench: A Benchmarking Evaluation Framework for Target-Driven Materials Formulation

関連論文リスト