Fugu-MT 論文翻訳(概要): Divide-Prompt-Refine: a Training-Free, Structure-Aware Framework for Biomedical Abstract Generation

論文の概要: Divide-Prompt-Refine: a Training-Free, Structure-Aware Framework for Biomedical Abstract Generation

arxiv url: http://arxiv.org/abs/2605.20628v1
Date: Wed, 20 May 2026 02:25:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-21 19:19:56.442956
Title: Divide-Prompt-Refine: a Training-Free, Structure-Aware Framework for Biomedical Abstract Generation
Title（参考訳）: Divide-Prompt-Refine: バイオメディカル抽象化のためのトレーニング不要な構造認識フレームワーク
Authors: Sylvey Lin, Joe Menke, Shufan Ming, Dongin Nam, Neil Smalheiser, Halil Kilicoglu,
Abstract要約: DPR-BAG (Divide, Prompt, Refine for Biomedical Abstract Generation) を提案する。 DPR-BAGは、全文文書をBOMRCスキーマに従って構造化された修辞面に分解する。厳密な抽出と微調整のベースラインよりも抽象的ノベルティを向上し、事実整合性を維持している。
参考スコア（独自算出の注目度）: 0.8774270519266251
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Biomedical abstracts play a critical role in downstream NLP applications, such as information retrieval, biocuration, and biomedical knowledge discovery. However, a non-trivial number of biomedical articles do not have abstracts, diminishing the utility of these articles for downstream tasks. We propose DPR-BAG (Divide, Prompt, and Refine for Biomedical Abstract Generation), a training-free, zero-shot framework that generates coherent and factually grounded abstracts for biomedical articles with full text but no abstract. DPR-BAG decomposes full-text documents into structured rhetorical facets following the Background-Objective-Methods-Results-Conclusions (BOMRC) schema, performs parallel LLM-based summarization for each facet, and applies a final refinement stage to restore global discourse coherence. On PMC-MAD, a distribution-aligned dataset of 46,309 biomedical articles, DPR-BAG improves abstractive novelty over strong extractive and fine-tuned baselines, while maintaining factual consistency. Our ablation study reveals a counterintuitive finding: increasing prompt complexity or explicitly injecting entity-level guidance can degrade factual alignment, highlighting the importance of controlled prompting strategies. These findings underscore the potential of training-free, structure-aware frameworks for scalable biomedical abstract generation in low-resource settings. Our data and code are available at https://huggingface.co/datasets/pmc-mad/PMC-MAD and https://github.com/ScienceNLP-Lab/MultiTagger-v2/tree/main/DPR-BAG.
Abstract（参考訳）: バイオメディカル抽象化は、情報検索、バイオキュレーション、バイオメディカル知識発見など、下流のNLPアプリケーションにおいて重要な役割を果たす。しかし、非自明な数のバイオメディカル記事は抽象概念を持たず、下流業務におけるこれらの記事の有用性を低下させる。本稿では,DPR-BAG (Divide, Prompt, Refine for Biomedical Abstract Generation) を提案する。 DPR-BAGは、全文文書をBOMRCスキーマに従って構造化された修辞系ファセットに分解し、各ファセットに対して並列LLMベースの要約を行い、グローバルな談話コヒーレンスを復元するために最終改良段階を適用する。 PMC-MADでは、46,309のバイオメディカルな記事の分布に整合したデータセットとして、DPR-BAGは、厳密な抽出と微調整によるベースラインよりも抽象的ノベルティを向上し、事実整合性を維持している。迅速な複雑性の増大や、エンティティレベルのガイダンスを明示的に注入することは、現実的なアライメントを低下させ、コントロールされたプロンプト戦略の重要性を浮き彫りにする。これらの知見は、低リソース環境下でのスケーラブルなバイオメディカル抽象生成のための、トレーニングフリーで構造対応のフレームワークの可能性を示している。私たちのデータとコードはhttps://huggingface.co/datasets/pmc-mad/PMC-MADとhttps://github.com/ScienceNLP-Lab/MultiTagger-v2/tree/DPR-BAGで利用可能です。

論文の概要: Divide-Prompt-Refine: a Training-Free, Structure-Aware Framework for Biomedical Abstract Generation

関連論文リスト