Fugu-MT 論文翻訳(概要): MolDeTox: Evaluating Language Model's Stepwise Fragment Editing for Molecular Detoxification

論文の概要: MolDeTox: Evaluating Language Model's Stepwise Fragment Editing for Molecular Detoxification

arxiv url: http://arxiv.org/abs/2605.12181v1
Date: Tue, 12 May 2026 14:24:52 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-13 21:48:56.921339
Title: MolDeTox: Evaluating Language Model's Stepwise Fragment Editing for Molecular Detoxification
Title（参考訳）: MolDeTox: 分子デトキシ化のための言語モデルの段階的フラグメント編集の評価
Authors: Jueon Park, Wonjune Jang, Jiwoo Lee, Yein Park, Jaewoo Kang,
Abstract要約: 大規模言語モデル(LLM)とビジョン言語モデル(VLM)は、最近、有望な機能を示している。 MolDeToxは分子デトキシ化のための新しいベンチマークである。
参考スコア（独自算出の注目度）: 17.99639229055483
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large Language Models (LLMs) and Vision Language Models (VLMs) have recently shown promising capabilities in various scientific domain. In particular, these advances have opened new opportunities in drug discovery, where the ability to understand and modify molecular structures is critical for optimizing drug properties such as efficacy and toxicity. However, existing models and benchmarks often overlook toxicity-related challenges, focusing primarily on general property optimization without adequately addressing safety concerns. In addition, even existing toxicity repair benchmarks suffer from limited data diversity, low structural validity of generated molecules, and heavy reliance on proxy models for toxicity assessment. To address these limitations, we propose MolDeTox, a novel benchmark for molecular detoxification, designed to enable fine-grained and reliable evaluation of toxicity-aware molecular optimization across stepwise tasks. We evaluate a wide range of general-purpose LLMs and VLMs under diverse settings, and demonstrate that understanding and generating molecules at the fragment-level improves structural validity and enhances the quality of generated molecules. Moreover, through detailed task-level performance analysis, MolDeTox provides an interpretable benchmark that enables a deeper understanding of the detoxification process. Our dataset is available at : https://huggingface.co/datasets/MolDeTox/MolDeTox
Abstract（参考訳）: 大規模言語モデル (LLM) と視覚言語モデル (VLM) は、最近、様々な科学領域で有望な能力を示している。特に、これらの進歩は、分子構造を理解し、修正する能力が、有効性や毒性などの薬物特性を最適化するために重要である薬物発見の新しい機会を開いた。しかし、既存のモデルやベンチマークはしばしば毒性に関連する問題を見落とし、安全上の問題に適切に対処することなく、主に一般財産の最適化に焦点を当てている。さらに、既存の毒性修復ベンチマークでさえ、限られたデータ多様性、生成分子の構造的妥当性の低いこと、毒性評価のためのプロキシモデルに大きく依存している。これらの制約に対処するため,段階的タスク間の毒性を意識した分子最適化のきめ細やかで信頼性の高い評価を可能にするため,分子デトキシ化のための新しいベンチマークである MolDeTox を提案する。多様な環境下で汎用LLMとVLMを多岐にわたって評価し, フラグメントレベルでの分子の理解と生成が構造的妥当性を向上し, 生成分子の品質を向上させることを示す。さらに、タスクレベルの詳細なパフォーマンス分析を通じて、MoldDeToxは、解毒プロセスのより深い理解を可能にする解釈可能なベンチマークを提供する。私たちのデータセットはhttps://huggingface.co/datasets/MolDeTox/MolDeToxで利用可能です。

論文の概要: MolDeTox: Evaluating Language Model's Stepwise Fragment Editing for Molecular Detoxification

関連論文リスト