What time is it? Temporal Analysis of Novels
- URL: http://arxiv.org/abs/2011.04124v1
- Date: Mon, 9 Nov 2020 01:11:55 GMT
- Title: What time is it? Temporal Analysis of Novels
- Authors: Allen Kim, Charuta Pethe, Steven Skiena
- Abstract summary: We construct a data set of hourly time phrases from 52,183 fictional books.
We then construct a time-of-day classification model that achieves an average error of 2.27 hours.
We show that by analyzing a book in whole using dynamic programming of breakpoints, we can roughly partition a book into segments that each correspond to a particular time-of-day.
- Score: 10.481474734742486
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recognizing the flow of time in a story is a crucial aspect of understanding
it. Prior work related to time has primarily focused on identifying temporal
expressions or relative sequencing of events, but here we propose
computationally annotating each line of a book with wall clock times, even in
the absence of explicit time-descriptive phrases. To do so, we construct a data
set of hourly time phrases from 52,183 fictional books. We then construct a
time-of-day classification model that achieves an average error of 2.27 hours.
Furthermore, we show that by analyzing a book in whole using dynamic
programming of breakpoints, we can roughly partition a book into segments that
each correspond to a particular time-of-day. This approach improves upon
baselines by over two hours. Finally, we apply our model to a corpus of
literature categorized by different periods in history, to show interesting
trends of hourly activity throughout the past. Among several observations we
find that the fraction of events taking place past 10 P.M jumps past 1880 -
coincident with the advent of the electric light bulb and city lights.
Related papers
- Deep Time Series Models: A Comprehensive Survey and Benchmark [74.28364194333447]
Time series data is of great significance in real-world scenarios.
Recent years have witnessed remarkable breakthroughs in the time series community.
We release Time Series Library (TSLib) as a fair benchmark of deep time series models for diverse analysis tasks.
arXiv Detail & Related papers (2024-07-18T08:31:55Z) - Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding [57.62275091656578]
We refer to the complex events composed of many news articles over an extended period as Temporal Complex Event (TCE)
This paper proposes a novel approach using Large Language Models (LLMs) to systematically extract and analyze the event chain within TCE.
arXiv Detail & Related papers (2024-06-04T16:42:17Z) - TimeSiam: A Pre-Training Framework for Siamese Time-Series Modeling [67.02157180089573]
Time series pre-training has recently garnered wide attention for its potential to reduce labeling expenses and benefit various downstream tasks.
This paper proposes TimeSiam as a simple but effective self-supervised pre-training framework for Time series based on Siamese networks.
arXiv Detail & Related papers (2024-02-04T13:10:51Z) - Time is Encoded in the Weights of Finetuned Language Models [65.71926562424795]
We present time vectors, a simple tool to customize language models to new time periods.
Time vectors are created by finetuning a language model on data from a single time.
This vector specifies a direction in weight space that, as our experiments show, improves performance on text from that time period.
arXiv Detail & Related papers (2023-12-20T20:04:45Z) - A decomposition of book structure through ousiometric fluctuations in
cumulative word-time [1.181206257787103]
We look at how words change over the course of a book as a function of the number of words, rather than the fraction of the book.
We find that shorter books exhibit only a general trend, while longer books have fluctuations in addition to the general trend.
Our findings suggest that, in the ousiometric sense, longer books are not expanded versions of shorter books, but are more similar in structure to a concatenation of shorter texts.
arXiv Detail & Related papers (2022-08-19T18:17:27Z) - Towards Spatio-Temporal Aware Traffic Time Series Forecasting--Full
Version [37.09531298150374]
Traffic series forecasting is challenging due to complex time series patterns for the same time series patterns may vary across time, where, for example, there exist periods across a day showing stronger temporal correlations.
Such-temporal models employ a shared parameter space irrespective of the time locations and the time periods and they assume that the temporal correlations are similar across locations and do not always hold across time which may not always be the case.
We propose a framework that aims at turning ICD-temporal aware models to encode sub-temporal models.
arXiv Detail & Related papers (2022-03-29T16:44:56Z) - AutoFITS: Automatic Feature Engineering for Irregular Time Series [0.44198435146063353]
In irregular time series, the time at which each observation is collected may be helpful to summarise the dynamics of the data and improve forecasting performance.
We develop a novel automatic feature engineering framework, which focuses on extracting information from this point of view when each instance is collected.
We study how valuable this information is by integrating it in a time series forecasting workflow and investigate how it compares to or complements state-of-the-art methods for regular time series forecasting.
arXiv Detail & Related papers (2021-12-29T19:42:48Z) - Changepoint Analysis of Topic Proportions in Temporal Text Data [1.8262547855491456]
We build a specialised temporal topic model with provisions for changepoints in the distribution of topic proportions.
We use sample splitting to estimate topic polytopes first and then apply a likelihood ratio statistic.
We obtain some historically well-known changepoints and discover some new ones.
arXiv Detail & Related papers (2021-11-29T17:20:51Z) - Time Series Analysis via Network Science: Concepts and Algorithms [62.997667081978825]
This review provides a comprehensive overview of existing mapping methods for transforming time series into networks.
We describe the main conceptual approaches, provide authoritative references and give insight into their advantages and limitations in a unified notation and language.
Although still very recent, this research area has much potential and with this survey we intend to pave the way for future research on the topic.
arXiv Detail & Related papers (2021-10-11T13:33:18Z) - Instance-wise Graph-based Framework for Multivariate Time Series
Forecasting [69.38716332931986]
We propose a simple yet efficient instance-wise graph-based framework to utilize the inter-dependencies of different variables at different time stamps.
The key idea of our framework is aggregating information from the historical time series of different variables to the current time series that we need to forecast.
arXiv Detail & Related papers (2021-09-14T07:38:35Z) - Human-like Time Series Summaries via Trend Utility Estimation [13.560018516096754]
We propose a model to create human-like text descriptions for time series.
Our system finds patterns in time series data and ranks these patterns based on empirical observations of human behavior.
The output of our system is a natural language description of time series that attempts to match a human's summary of the same data.
arXiv Detail & Related papers (2020-01-16T06:09:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.