Probabilistic Querying of Continuous-Time Event Sequences
- URL: http://arxiv.org/abs/2211.08499v1
- Date: Tue, 15 Nov 2022 20:58:00 GMT
- Title: Probabilistic Querying of Continuous-Time Event Sequences
- Authors: Alex Boyd, Yuxin Chang, Stephan Mandt, Padhraic Smyth
- Abstract summary: This paper introduces a new typology of query types and a framework for addressing them using importance sampling.
Example queries include predicting the $ntextth$ event type in a sequence and the hitting time distribution of one or more event types.
We prove theoretically that our estimation method is effectively always better than naive simulation and show empirically based on three real-world datasets that it is on average 1,000 times more efficient than existing approaches.
- Score: 23.85426261235507
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Continuous-time event sequences, i.e., sequences consisting of continuous
time stamps and associated event types ("marks"), are an important type of
sequential data with many applications, e.g., in clinical medicine or user
behavior modeling. Since these data are typically modeled autoregressively
(e.g., using neural Hawkes processes or their classical counterparts), it is
natural to ask questions about future scenarios such as "what kind of event
will occur next" or "will an event of type $A$ occur before one of type $B$".
Unfortunately, some of these queries are notoriously hard to address since
current methods are limited to naive simulation, which can be highly
inefficient. This paper introduces a new typology of query types and a
framework for addressing them using importance sampling. Example queries
include predicting the $n^\text{th}$ event type in a sequence and the hitting
time distribution of one or more event types. We also leverage these findings
further to be applicable for estimating general "$A$ before $B$" type of
queries. We prove theoretically that our estimation method is effectively
always better than naive simulation and show empirically based on three
real-world datasets that it is on average 1,000 times more efficient than
existing approaches.
Related papers
- A Simple Baseline for Predicting Events with Auto-Regressive Tabular Transformers [70.20477771578824]
Existing approaches to event prediction include time-aware positional embeddings, learned row and field encodings, and oversampling methods for addressing class imbalance.
We propose a simple but flexible baseline using standard autoregressive LLM-style transformers with elementary positional embeddings and a causal language modeling objective.
Our baseline outperforms existing approaches across popular datasets and can be employed for various use-cases.
arXiv Detail & Related papers (2024-10-14T15:59:16Z) - Probabilistic Modeling for Sequences of Sets in Continuous-Time [14.423456635520084]
We develop a general framework for modeling set-valued data in continuous-time.
We also develop inference methods that can use such models to answer probabilistic queries.
arXiv Detail & Related papers (2023-12-22T20:16:10Z) - Contrastive Difference Predictive Coding [79.74052624853303]
We introduce a temporal difference version of contrastive predictive coding that stitches together pieces of different time series data to decrease the amount of data required to learn predictions of future events.
We apply this representation learning method to derive an off-policy algorithm for goal-conditioned RL.
arXiv Detail & Related papers (2023-10-31T03:16:32Z) - A method for incremental discovery of financial event types based on
anomaly detection [7.035546119642223]
Event datasets in the financial domain are often constructed based on actual application scenarios.
New financial big data cannot be limited to the event types defined for specific scenarios.
A three-stage approach is proposed to accomplish incremental discovery of event types.
arXiv Detail & Related papers (2023-02-16T10:37:19Z) - Towards Out-of-Distribution Sequential Event Prediction: A Causal
Treatment [72.50906475214457]
The goal of sequential event prediction is to estimate the next event based on a sequence of historical events.
In practice, the next-event prediction models are trained with sequential data collected at one time.
We propose a framework with hierarchical branching structures for learning context-specific representations.
arXiv Detail & Related papers (2022-10-24T07:54:13Z) - Predictive Querying for Autoregressive Neural Sequence Models [23.85426261235507]
We introduce a general typology for predictive queries in neural autoregressive sequence models.
We show that such queries can be systematically represented by sets of elementary building blocks.
We leverage this typology to develop new query estimation methods.
arXiv Detail & Related papers (2022-10-12T17:59:36Z) - HyperHawkes: Hypernetwork based Neural Temporal Point Process [5.607676459156789]
Temporal point process serves as an essential tool for modeling time-to-event data in continuous time space.
It is not generalizable to predict events from unseen sequences in dynamic environment.
We propose textitHyperHawkes, a hypernetwork based temporal point process framework.
arXiv Detail & Related papers (2022-10-01T07:14:19Z) - Variational Neural Temporal Point Process [22.396329275957996]
A temporal point process is a process that predicts which type of events is likely to happen and when the event will occur.
We introduce the inference and the generative networks, and train a distribution of latent variable to deal with property on deep neural network.
We empirically demonstrate that our model can generalize the representations of various event types.
arXiv Detail & Related papers (2022-02-17T13:34:30Z) - COHORTNEY: Deep Clustering for Heterogeneous Event Sequences [9.811178291117496]
Clustering of event sequences is widely applicable in domains such as healthcare, marketing, and finance.
We propose COHORTNEY as a novel deep learning method for clustering heterogeneous event sequences.
Our results show that COHORTNEY vastly outperforms in speed and cluster quality the state-of-the-art algorithm for clustering event sequences.
arXiv Detail & Related papers (2021-04-03T16:12:21Z) - Unsupervised Label-aware Event Trigger and Argument Classification [73.86358632937372]
We propose an unsupervised event extraction pipeline, which first identifies events with available tools (e.g., SRL) and then automatically maps them to pre-defined event types.
We leverage pre-trained language models to contextually represent pre-defined types for both event triggers and arguments.
We successfully map 83% of the triggers and 54% of the arguments to the correct types, almost doubling the performance of previous zero-shot approaches.
arXiv Detail & Related papers (2020-12-30T17:47:24Z) - Ambiguity in Sequential Data: Predicting Uncertain Futures with
Recurrent Models [110.82452096672182]
We propose an extension of the Multiple Hypothesis Prediction (MHP) model to handle ambiguous predictions with sequential data.
We also introduce a novel metric for ambiguous problems, which is better suited to account for uncertainties.
arXiv Detail & Related papers (2020-03-10T09:15:42Z) - A Multi-Channel Neural Graphical Event Model with Negative Evidence [76.51278722190607]
Event datasets are sequences of events of various types occurring irregularly over the time-line.
We propose a non-parametric deep neural network approach in order to estimate the underlying intensity functions.
arXiv Detail & Related papers (2020-02-21T23:10:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.