MARCUS: An Event-Centric NLP Pipeline that generates Character Arcs from Narratives
- URL: http://arxiv.org/abs/2510.18201v1
- Date: Tue, 21 Oct 2025 01:03:48 GMT
- Title: MARCUS: An Event-Centric NLP Pipeline that generates Character Arcs from Narratives
- Authors: Sriharsh Bhyravajjula, Ujwal Narayan, Manish Shrivastava,
- Abstract summary: We present MARCUS, an NLP pipeline that extracts events, participant characters, implied emotion, and sentiment to model inter-character relations.<n>We generate character arcs from two extended fantasy series, Harry Potter and Lord of the Rings.
- Score: 3.0765811485120182
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Character arcs are important theoretical devices employed in literary studies to understand character journeys, identify tropes across literary genres, and establish similarities between narratives. This work addresses the novel task of computationally generating event-centric, relation-based character arcs from narratives. Providing a quantitative representation for arcs brings tangibility to a theoretical concept and paves the way for subsequent applications. We present MARCUS (Modelling Arcs for Understanding Stories), an NLP pipeline that extracts events, participant characters, implied emotion, and sentiment to model inter-character relations. MARCUS tracks and aggregates these relations across the narrative to generate character arcs as graphical plots. We generate character arcs from two extended fantasy series, Harry Potter and Lord of the Rings. We evaluate our approach before outlining existing challenges, suggesting applications of our pipeline, and discussing future work.
Related papers
- Computational Representations of Character Significance in Novels [10.538161193756666]
We present a new literary theory proposing a six-component structural model of character.<n>This model accounts for the narrator-character distinction and includes a component neglected by prior methods, discussion by other characters.<n>We then demonstrate that these representations allow us to approach literary questions at scale from a new computational lens.
arXiv Detail & Related papers (2026-01-21T22:29:41Z) - Living the Novel: A System for Generating Self-Training Timeline-Aware Conversational Agents from Novels [50.43968216132018]
We present an end-to-end system that transforms any literary work into an immersive, multi-character conversational experience.<n>This system is designed to solve two fundamental challenges for LLM-driven characters.
arXiv Detail & Related papers (2025-12-08T11:57:46Z) - Story Ribbons: Reimagining Storyline Visualizations with Large Language Models [39.0439095287205]
Large language models (LLMs) are being used to augment and reimagine existing storyline visualization techniques.<n>We introduce an LLM-driven data parsing pipeline that automatically extracts relevant narrative information from novels and scripts.<n>We then apply this pipeline to create Story Ribbons, an interactive visualization system that helps novice and expert literary analysts explore detailed character and theme trajectories.
arXiv Detail & Related papers (2025-08-09T01:49:30Z) - BookWorm: A Dataset for Character Description and Analysis [59.186325346763184]
We define two tasks: character description, which generates a brief factual profile, and character analysis, which offers an in-depth interpretation.
We introduce the BookWorm dataset, pairing books from the Gutenberg Project with human-written descriptions and analyses.
Our findings show that retrieval-based approaches outperform hierarchical ones in both tasks.
arXiv Detail & Related papers (2024-10-14T10:55:58Z) - Agents' Room: Narrative Generation through Multi-step Collaboration [54.98886593802834]
We propose a generation framework inspired by narrative theory that decomposes narrative writing into subtasks tackled by specialized agents.<n>We show that Agents' Room generates stories preferred by expert evaluators over those produced by baseline systems.
arXiv Detail & Related papers (2024-10-03T15:44:42Z) - Generating Visual Stories with Grounded and Coreferent Characters [63.07511918366848]
We present the first model capable of predicting visual stories with consistently grounded and coreferent character mentions.<n>Our model is finetuned on a new dataset which we build on top of the widely used VIST benchmark.<n>We also propose new evaluation metrics to measure the richness of characters and coreference in stories.
arXiv Detail & Related papers (2024-09-20T14:56:33Z) - Are Large Language Models Capable of Generating Human-Level Narratives? [114.34140090869175]
This paper investigates the capability of LLMs in storytelling, focusing on narrative development and plot progression.
We introduce a novel computational framework to analyze narratives through three discourse-level aspects.
We show that explicit integration of discourse features can enhance storytelling, as is demonstrated by over 40% improvement in neural storytelling.
arXiv Detail & Related papers (2024-07-18T08:02:49Z) - Fine-Grained Modeling of Narrative Context: A Coherence Perspective via Retrospective Questions [48.18584733906447]
This work introduces an original and practical paradigm for narrative comprehension, stemming from the characteristics that individual passages within narratives tend to be more cohesively related than isolated.
We propose a fine-grained modeling of narrative context, by formulating a graph dubbed NarCo, which explicitly depicts task-agnostic coherence dependencies.
arXiv Detail & Related papers (2024-02-21T06:14:04Z) - Understanding Social Structures from Contemporary Literary Fiction using
Character Interaction Graph -- Half Century Chronology of Influential Bengali
Writers [2.103087897983347]
Social structures and real-world incidents often influence contemporary literary fiction.
We use character interaction graphs to explore societal inquiries about contemporary culture's impact on the landscape of literary fiction.
arXiv Detail & Related papers (2023-10-25T20:09:14Z) - "Let Your Characters Tell Their Story": A Dataset for Character-Centric
Narrative Understanding [31.803481510886378]
We present LiSCU -- a new dataset of literary pieces and their summaries paired with descriptions of characters that appear in them.
We also introduce two new tasks on LiSCU: Character Identification and Character Description Generation.
Our experiments with several pre-trained language models adapted for these tasks demonstrate that there is a need for better models of narrative comprehension.
arXiv Detail & Related papers (2021-09-12T06:12:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.