MAILEX: Email Event and Argument Extraction
- URL: http://arxiv.org/abs/2305.13469v2
- Date: Sat, 21 Oct 2023 02:15:22 GMT
- Title: MAILEX: Email Event and Argument Extraction
- Authors: Saurabh Srivastava, Gaurav Singh, Shou Matsumoto, Ali Raz, Paulo
Costa, Joshua Poore, Ziyu Yao
- Abstract summary: We present the first dataset, MailEx, for performing event extraction from conversational email threads.
Our dataset includes 1.5K email threads and 4K emails, which are annotated with totally 8K event instances.
- Score: 7.1087833991544125
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this work, we present the first dataset, MailEx, for performing event
extraction from conversational email threads. To this end, we first proposed a
new taxonomy covering 10 event types and 76 arguments in the email domain. Our
final dataset includes 1.5K email threads and ~4K emails, which are annotated
with totally ~8K event instances. To understand the task challenges, we
conducted a series of experiments comparing three types of approaches, i.e.,
fine-tuned sequence labeling, fine-tuned generative extraction, and few-shot
in-context learning. Our results showed that the task of email event extraction
is far from being addressed, due to challenges lying in, e.g., extracting
non-continuous, shared trigger spans, extracting non-named entity arguments,
and modeling the email conversational history. Our work thus suggests more
future investigations in this domain-specific event extraction task.
Related papers
- Towards Event Extraction from Speech with Contextual Clues [61.164413398231254]
We introduce the Speech Event Extraction (SpeechEE) task and construct three synthetic training sets and one human-spoken test set.
Compared to event extraction from text, SpeechEE poses greater challenges mainly due to complex speech signals that are continuous and have no word boundaries.
Our method brings significant improvements on all datasets, achieving a maximum F1 gain of 10.7%.
arXiv Detail & Related papers (2024-01-27T11:07:19Z) - MAVEN-Arg: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation [104.6065882758648]
MAVEN-Arg is the first all-in-one dataset supporting event detection, event argument extraction, and event relation extraction.
As an EAE benchmark, MAVEN-Arg offers three main advantages: (1) a comprehensive schema covering 162 event types and 612 argument roles, all with expert-written definitions and examples; (2) a large data scale, containing 98,591 events and 290,613 arguments obtained with laborious human annotation; and (3) the exhaustive annotation supporting all task variants of EAE.
arXiv Detail & Related papers (2023-11-15T16:52:14Z) - LED: A Dataset for Life Event Extraction from Dialogs [57.390999707053915]
Lifelogging has gained more attention due to its wide applications, such as personalized recommendations or memory assistance.
Life Event Dialog is a dataset containing fine-grained life event annotations on conversational data.
We explore three information extraction (IE) frameworks to address the conversational life event extraction task.
arXiv Detail & Related papers (2023-04-17T14:46:59Z) - EmailSum: Abstractive Email Thread Summarization [105.46012304024312]
We develop an abstractive Email Thread Summarization (EmailSum) dataset.
This dataset contains human-annotated short (30 words) and long (100 words) summaries of 2549 email threads.
Our results reveal the key challenges of current abstractive summarization models in this task.
arXiv Detail & Related papers (2021-07-30T15:13:14Z) - Document-Level Event Argument Extraction by Conditional Generation [75.73327502536938]
Event extraction has long been treated as a sentence-level task in the IE community.
We propose a document-level neural event argument extraction model by formulating the task as conditional generation following event templates.
We also compile a new document-level event extraction benchmark dataset WikiEvents.
arXiv Detail & Related papers (2021-04-13T03:36:38Z) - Learning with Weak Supervision for Email Intent Detection [56.71599262462638]
We propose to leverage user actions as a source of weak supervision to detect intents in emails.
We develop an end-to-end robust deep neural network model for email intent identification.
arXiv Detail & Related papers (2020-05-26T23:41:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.