Fugu-MT Paper Translation (Summary): BioAlchemy: Distilling Biological Literature into Reasoning-Ready Reinforcement Learning Training Data

Paper summary: BioAlchemy: Distilling Biological Literature into Reasoning-Ready Reinforcement Learning Training Data

arxiv url: http://arxiv.org/abs/2604.03506v1
Date: Fri, 03 Apr 2026 23:06:59 GMT
Status: Translation complete
Last updated in system: 2026-04-07 15:49:18.615484
Title: BioAlchemy: Distilling Biological Literature into Reasoning-Ready Reinforcement Learning Training Data
Title (reference translation): BioAlchemy: Distilling Biological Literature into Reasoning-Ready Reinforcement Learning Data
Authors: Brian Hsu, Ozan Gökdemir, Carlo Siebenschuh, Bruce Parrello, Neil Getty, Thomas S. Brettin, Rick L. Stevens, Ian T. Foster, Nicholas Chia, Arvind Ramanathan
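Abstract summary: The authors show that biology questions from current large-scale reasoning datasets do not align well with the distribution of modern research topics in biology. They introduce BioAlchemy, a pipeline for sourcing a diverse set of verifiable question-and-answer pairs from biology research text. They demonstrate how aligning a dataset to the topic distribution of modern scientific biology can be used with reinforcement learning to improve reasoning performance.
Reference score (independently computed attention score): 5.668472223629237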
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Despite the large corpus of biology training text, the impact of reasoning models on biological research generally lags behind math and coding. In this work, we show that biology questions from current large-scale reasoning datasets do not align well with modern research topic distributions in biology, and that this topic imbalance may negatively affect performance. In addition, we find that methods for extracting challenging and verifiable research problems from biology research text are a critical yet underdeveloped ingredient in applying reinforcement learning for better performance on biology research tasks. We introduce BioAlchemy, a pipeline for sourcing a diverse set of verifiable question-and-answer pairs from a scientific corpus of biology research text. We curate BioAlchemy-345K, a training dataset containing over 345K scientific reasoning problems in biology. Then, we demonstrate how aligning our dataset to the topic distribution of modern scientific biology can be used with reinforcement learning to improve reasoning performance. Finally, we present BioAlchemist-8B, which improves over its base reasoning model by 9.12% on biology benchmarks. These results demonstrate the efficacy of our approach for developing stronger scientific reasoning capabilities in biology. The BioAlchemist-8B model is available at: https://huggingface.co/BioAlchemy.
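Abstract (reference translation): Despite the vast corpus of biology training text, the impact of reasoning models on biological research generally lags behind their impact on math and coding. In this work, we show that biology questions from current large-scale reasoning datasets do not align well with the distribution of modern research topics in biology, and that this topic imbalance may negatively affect performance. In addition, we find that methods for extracting challenging and verifiable research problems from biology research text are a critical yet underdeveloped ingredient in applying reinforcement learning to biology research tasks. We introduce BioAlchemy, a pipeline for sourcing a diverse set of verifiable question-and-answer pairs from a scientific corpus of biology research text. We curate BioAlchemy-345K, a training dataset containing over 345K scientific reasoning problems in biology. We then demonstrate how aligning the dataset to the topic distribution of modern scientific biology can be used with reinforcement learning to improve reasoning performance. Finally, we present BioAlchemist-8B, which improves over its base reasoning model by 9.12% on biology benchmarks. These results demonstrate the efficacy of the approach for developing stronger scientific reasoning capabilities in biology. The BioAlchemist-8B model is available at https://huggingface.co/BioAlchemy.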
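The abstract does not say how the dataset is aligned to the topic distribution of modern biology research. As a purely illustrative sketch, one common way to do this kind of alignment is importance resampling: each example is weighted by its topic's target share divided by that topic's count in the source data. The function name, the `(topic, question)` schema, and the topic labels below are all hypothetical and are not taken from the paper.

```python
import random
from collections import Counter


def align_to_topic_distribution(examples, target_dist, n_samples, seed=0):
    """Resample examples so topic frequencies approximate target_dist.

    examples: list of (topic, question) tuples (hypothetical schema).
    target_dist: dict mapping topic -> desired probability.
    Sampling is done with replacement, so examples from rare topics
    may appear more than once in the result.
    """
    rng = random.Random(seed)
    source_counts = Counter(topic for topic, _ in examples)
    # Per-example weight = target share / source count of its topic,
    # so the weights within each topic sum to that topic's target share.
    weights = [
        target_dist.get(topic, 0.0) / source_counts[topic]
        for topic, _ in examples
    ]
    return rng.choices(examples, weights=weights, k=n_samples)


if __name__ == "__main__":
    # Toy source data skewed 90/10 toward one topic (assumed labels).
    data = [("genetics", f"q{i}") for i in range(90)]
    data += [("ecology", f"q{i}") for i in range(10)]
    balanced = align_to_topic_distribution(
        data, {"genetics": 0.5, "ecology": 0.5}, n_samples=1000
    )
    print(Counter(topic for topic, _ in balanced))
```

Topics missing from `target_dist` get weight zero and are never sampled. Sampling without replacement, or capping how often an example may repeat, would be a reasonable alternative when duplicates are undesirable.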

Related papers