Fugu-MT 論文翻訳(概要): CML-Bench: A Framework for Evaluating and Enhancing LLM-Powered Movie Scripts Generation

論文の概要: CML-Bench: A Framework for Evaluating and Enhancing LLM-Powered Movie Scripts Generation

arxiv url: http://arxiv.org/abs/2510.06231v1
Date: Wed, 01 Oct 2025 08:03:26 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-09 16:41:20.036427
Title: CML-Bench: A Framework for Evaluating and Enhancing LLM-Powered Movie Scripts Generation
Title（参考訳）: CML-Bench: LLMによる映画スクリプト生成の評価と改善のためのフレームワーク
Authors: Mingzhe Zheng, Dingjie Song, Guanyu Zhou, Jun You, Jiahao Zhan, Xuran Ma, Xinyuan Song, Ser-Nam Lim, Qifeng Chen, Harry Yang,
Abstract要約: 対話コヒーレンス(DC)、キャラクタ一貫性(CC)、Plot Reasonableness(PR)にまたがる定量的メトリクスを特徴とするCML-Benchを提案する。 CML-Benchは、熟練した人書きのスクリプトに効果的に高いスコアを割り当てる。ベンチマークをさらに検証するために,文字対話とイベントロジックに関する詳細な指示を取り入れたプロンプト戦略であるCML-Instructionを導入する。
参考スコア（独自算出の注目度）: 66.24195604846103
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) have demonstrated remarkable proficiency in generating highly structured texts. However, while exhibiting a high degree of structural organization, movie scripts demand an additional layer of nuanced storytelling and emotional depth-the 'soul' of compelling cinema-that LLMs often fail to capture. To investigate this deficiency, we first curated CML-Dataset, a dataset comprising (summary, content) pairs for Cinematic Markup Language (CML), where 'content' consists of segments from esteemed, high-quality movie scripts and 'summary' is a concise description of the content. Through an in-depth analysis of the intrinsic multi-shot continuity and narrative structures within these authentic scripts, we identified three pivotal dimensions for quality assessment: Dialogue Coherence (DC), Character Consistency (CC), and Plot Reasonableness (PR). Informed by these findings, we propose the CML-Bench, featuring quantitative metrics across these dimensions. CML-Bench effectively assigns high scores to well-crafted, human-written scripts while concurrently pinpointing the weaknesses in screenplays generated by LLMs. To further validate our benchmark, we introduce CML-Instruction, a prompting strategy with detailed instructions on character dialogue and event logic, to guide LLMs to generate more structured and cinematically sound scripts. Extensive experiments validate the effectiveness of our benchmark and demonstrate that LLMs guided by CML-Instruction generate higher-quality screenplays, with results aligned with human preferences.
Abstract（参考訳）: 大規模言語モデル (LLM) は高度に構造化されたテキストを生成するのに顕著な能力を示した。しかし、高い構造構造を誇示する一方で、映画脚本は微妙なストーリーテリングと感情的な深さの付加層を必要とする。この欠陥を調査するために,まずCML-DatasetというCinematic Markup Language (CML) のペアからなるデータセットをキュレートした。これらの本文中の本質的なマルチショット連続性と物語構造を詳細に分析することにより,対話コヒーレンス(DC),キャラクタ一貫性(CC),Plot Reasonableness(PR)という,品質評価の3つの重要な要素を同定した。これらの知見からCML-Benchを提案する。 CML-Benchは、LLMが生成するスクリーンプレイの弱点を同時に特定しながら、十分に製作された人手書きのスクリプトに高いスコアを割り当てる。 CML-Instructionは文字対話とイベントロジックの詳細な説明によるプロンプト戦略であり、LCMを誘導してより構造化され、映画的な音声スクリプトを生成する。 CML-Instruction で指導された LLM が高品質なスクリーンプレイを生成し,その結果が人間の嗜好に合致することを示した。

論文の概要: CML-Bench: A Framework for Evaluating and Enhancing LLM-Powered Movie Scripts Generation

関連論文リスト