Fugu-MT 論文翻訳(概要): Bioalignment: Measuring and Improving LLM Disposition Toward Biological Systems for AI Safety

論文の概要: Bioalignment: Measuring and Improving LLM Disposition Toward Biological Systems for AI Safety

arxiv url: http://arxiv.org/abs/2603.09154v1
Date: Tue, 10 Mar 2026 03:44:58 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-11 15:25:24.00453
Title: Bioalignment: Measuring and Improving LLM Disposition Toward Biological Systems for AI Safety
Title（参考訳）: バイオアライメント:AI安全のためのバイオシステムに向けたLCM配置の測定と改善
Authors: Trent R Northen, Mingxun Wang,
Abstract要約: インターネット規模のコーパスで訓練された大規模言語モデル(LLM)は、望ましくない行動の確率を増加させる体系的なバイアスを示す。 4つの領域(材料、エネルギー、製造、アルゴリズム)にわたる合成と生物の技術的ソリューションに対する潜在的なバイアスについて検討した。その結果,QLoRAファインチューニングは,一般性能を低下させることなく,両モデルの生体溶液のスコアリングを著しく向上させることがわかった。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) trained on internet-scale corpora can exhibit systematic biases that increase the probability of unwanted behavior. In this study, we examined potential biases towards synthetic vs. biological technological solutions across four domains (materials, energy, manufacturing, and algorithms). A sample of 5 frontier and 5 open-weight models were measured using 50 curated Bioalignment prompts with a Kelly criterion-inspired evaluation framework. According to this metric, most models were not bioaligned in that they exhibit biases in favor of synthetic (non-biological) solutions. We next examined if fine-tuning could increase the preferences of two open-weight models, Llama 3.2-3B-Instruct and Qwen2.5-3B-Instruct, for biological-based approaches. A curated corpus of ~22M tokens from 6,636 PMC articles emphasizing biological problem-solving was used first to fine-tune Llama 3B with a mixed corpus of continued training and instruction-formatted. This was then extended to Qwen 3B using instruction-formatted only. We found that QLoRA fine-tuning significantly increased the scoring of biological solutions for both models without degrading general capabilities (Holm-Bonferroni-corrected p < 0.001 and p < 0.01, respectively). This suggests that even a small amount of fine-tuning can change how models weigh the relative value of biological and bioinspired vs. synthetic approaches. Although this work focused on small open-weight LLMs, it may be extensible to much larger models and could be used to develop models that favor bio-based approaches. We release the benchmark, corpus, code, and adapter weights.
Abstract（参考訳）: インターネット規模のコーパスで訓練された大規模言語モデル(LLM)は、望ましくない行動の確率を増加させる体系的なバイアスを示す。本研究では, 4つの領域(材料, エネルギー, 製造, アルゴリズム)にまたがる, 合成と生物の技術的ソリューションに対する潜在的なバイアスについて検討した。 5つのフロンティアと5つのオープンウェイトモデルのサンプルをケリー基準にインスパイアされた評価フレームワークを用いて50個のキュレートされたバイオアライメントプロンプトを用いて測定した。この測定値によると、ほとんどのモデルは、合成(非生物学的)解に有利なバイアスを示すというバイオアライメントを持っていなかった。次に,2種類のオープンウェイトモデルであるLlama 3.2-3B-インストラクトとQwen2.5-3B-インストラクトの生物学的アプローチに対する嗜好を高めることができるかを検討した。 6,636のMCC論文から得られた約22Mのトークンのキュレートコーパスを用いて,Llama 3Bの微調整を行った。その後、命令形式のみを使用してQwen 3Bに拡張された。その結果,QLoRA微調整は,一般性能を低下させることなく,両モデルの生物学的解のスコアリングを著しく向上させた(ホルム・ボンフェロニ補正 p < 0.001 および p < 0.01 )。これは、少量の微調整でさえ、モデルが生物や生物にインスパイアされた相対的価値と合成アプローチの相対的価値をどう評価するかを変える可能性があることを示唆している。この研究は、小さなオープンウェイト LLM に焦点を当てたものの、より大きなモデルに拡張可能であり、バイオベースのアプローチを好むモデルの開発に使用できる可能性がある。ベンチマーク、コーパス、コード、アダプタウェイトをリリースしています。

論文の概要: Bioalignment: Measuring and Improving LLM Disposition Toward Biological Systems for AI Safety

関連論文リスト