Title2Event: Benchmarking Open Event Extraction with a Large-scale
Chinese Title Dataset
- URL: http://arxiv.org/abs/2211.00869v1
- Date: Wed, 2 Nov 2022 04:39:36 GMT
- Title: Title2Event: Benchmarking Open Event Extraction with a Large-scale
Chinese Title Dataset
- Authors: Haolin Deng, Yanan Zhang, Yangfan Zhang, Wangyang Ying, Changlong Yu,
Jun Gao, Wei Wang, Xiaoling Bai, Nan Yang, Jin Ma, Xiang Chen, Tianhua Zhou
- Abstract summary: We present Title2Event, a large-scale sentence-level dataset benchmarking Open Event Extraction without restricting event types.
Title2Event contains more than 42,000 news titles in 34 topics collected from Chinese web pages.
To the best of our knowledge, it is currently the largest manually-annotated Chinese dataset for open event extraction.
- Score: 19.634367718707857
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Event extraction (EE) is crucial to downstream tasks such as new aggregation
and event knowledge graph construction. Most existing EE datasets manually
define fixed event types and design specific schema for each of them, failing
to cover diverse events emerging from the online text. Moreover, news titles,
an important source of event mentions, have not gained enough attention in
current EE research. In this paper, We present Title2Event, a large-scale
sentence-level dataset benchmarking Open Event Extraction without restricting
event types. Title2Event contains more than 42,000 news titles in 34 topics
collected from Chinese web pages. To the best of our knowledge, it is currently
the largest manually-annotated Chinese dataset for open event extraction. We
further conduct experiments on Title2Event with different models and show that
the characteristics of titles make it challenging for event extraction,
addressing the significance of advanced study on this problem. The dataset and
baseline codes are available at https://open-event-hub.github.io/title2event.
Related papers
- EXCEEDS: Extracting Complex Events as Connecting the Dots to Graphs in Scientific Domain [57.56639626657212]
We construct SciEvents, a large-scale multi-event document-level dataset with a schema tailored for scientific domain.
Then, we propose EXCEEDS, a novel end-to-end scientific event extraction framework by storing dense nuggets in a grid matrix.
Experimental results demonstrate state-of-the-art performances of EXCEEDS on SciEvents.
arXiv Detail & Related papers (2024-06-20T07:50:37Z) - Improving Event Definition Following For Zero-Shot Event Detection [66.27883872707523]
Existing approaches on zero-shot event detection usually train models on datasets annotated with known event types.
We aim to improve zero-shot event detection by training models to better follow event definitions.
arXiv Detail & Related papers (2024-03-05T01:46:50Z) - MAVEN-Arg: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation [104.6065882758648]
MAVEN-Arg is the first all-in-one dataset supporting event detection, event argument extraction, and event relation extraction.
As an EAE benchmark, MAVEN-Arg offers three main advantages: (1) a comprehensive schema covering 162 event types and 612 argument roles, all with expert-written definitions and examples; (2) a large data scale, containing 98,591 events and 290,613 arguments obtained with laborious human annotation; and (3) the exhaustive annotation supporting all task variants of EAE.
arXiv Detail & Related papers (2023-11-15T16:52:14Z) - Rich Event Modeling for Script Event Prediction [60.67635412135682]
We propose the Rich Event Prediction (REP) framework for script event prediction.
REP contains an event extractor to extract such information from texts.
The core component of the predictor is a transformer-based event encoder to flexibly deal with an arbitrary number of arguments.
arXiv Detail & Related papers (2022-12-16T05:17:59Z) - MEE: A Novel Multilingual Event Extraction Dataset [62.80569691825534]
Event Extraction aims to recognize event mentions and their arguments from text.
The lack of high-quality multilingual EE datasets for model training and evaluation has been the main hindrance.
We propose a novel Multilingual Event Extraction dataset (EE) that provides annotation for more than 50K event mentions in 8 typologically different languages.
arXiv Detail & Related papers (2022-11-11T02:01:41Z) - EventGraph: Event Extraction as Semantic Graph Parsing [5.21480688623047]
Event extraction involves the detection and extraction of both the event triggers and corresponding event arguments.
We propose EventGraph, a joint framework for event extraction, which encodes events as graphs.
Our code and models are released as open-source.
arXiv Detail & Related papers (2022-10-16T22:11:46Z) - Beyond Grounding: Extracting Fine-Grained Event Hierarchies Across
Modalities [43.048896440009784]
We propose the task of extracting event hierarchies from multimodal (video and text) data.
This reveals the structure of events and is critical to understanding them.
We show the limitations of state-of-the-art unimodal and multimodal baselines on this task.
arXiv Detail & Related papers (2022-06-14T23:24:15Z) - PILED: An Identify-and-Localize Framework for Few-Shot Event Detection [79.66042333016478]
In our study, we employ cloze prompts to elicit event-related knowledge from pretrained language models.
We minimize the number of type-specific parameters, enabling our model to quickly adapt to event detection tasks for new types.
arXiv Detail & Related papers (2022-02-15T18:01:39Z) - Document-level Event Extraction with Efficient End-to-end Learning of
Cross-event Dependencies [37.96254956540803]
We propose an end-to-end model leveraging Deep Value Networks (DVN), a structured prediction algorithm, to efficiently capture cross-event dependencies for document-level event extraction.
Our approach achieves comparable performance to CRF-based models on ACE05, while enjoys significantly higher computational efficiency.
arXiv Detail & Related papers (2020-10-24T05:28:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.