Event Detection Explorer: An Interactive Tool for Event Detection
Exploration
- URL: http://arxiv.org/abs/2204.12456v1
- Date: Tue, 26 Apr 2022 17:22:37 GMT
- Title: Event Detection Explorer: An Interactive Tool for Event Detection
Exploration
- Authors: Wenlong Zhang, Bhagyashree Ingale, Hamza Shabir, Tianyi Li, Tian Shi,
Ping Wang
- Abstract summary: Event Detection (ED) is an important task in natural language processing.
In this paper, we present an interactive and easy-to-use tool, namely ED Explorer, for ED dataset and model exploration.
- Score: 15.673794190575295
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Event Detection (ED) is an important task in natural language processing. In
the past few years, many datasets have been introduced for advancing ED machine
learning models. However, most of these datasets are under-explored because not
many tools are available for people to study events, trigger words, and event
mention instances systematically and efficiently. In this paper, we present an
interactive and easy-to-use tool, namely ED Explorer, for ED dataset and model
exploration. ED Explorer consists of an interactive web application, an API,
and an NLP toolkit, which can help both domain experts and non-experts to
better understand the ED task. We use ED Explorer to analyze a recent proposed
large-scale ED datasets (referred to as MAVEN), and discover several underlying
problems, including sparsity, label bias, label imbalance, and debatable
annotations, which provide us with directions to improve the MAVEN dataset. The
ED Explorer can be publicly accessed through http://edx.leafnlp.org/. The
demonstration video is available here
https://www.youtube.com/watch?v=6QPnxPwxg50.
Related papers
- MultiADE: A Multi-domain Benchmark for Adverse Drug Event Extraction [11.458594744457521]
Active adverse event surveillance monitors Adverse Drug Events (ADE) from different data sources.
One unanswered question is how far we are from having a single ADE extraction model that are effective on various types of text.
We contribute to answering this question by building a multi-domain benchmark for adverse drug event extraction, which we named MultiADE.
arXiv Detail & Related papers (2024-05-28T09:57:28Z) - ADELIE: Aligning Large Language Models on Information Extraction [55.60192044049083]
Large language models (LLMs) usually fall short on information extraction tasks.
In this paper, we introduce ADELIE, an aligned LLM that effectively solves various IE tasks.
We show that our models achieve state-of-the-art (SoTA) performance among open-source models.
arXiv Detail & Related papers (2024-05-08T12:24:52Z) - Large Language Models for Generative Information Extraction: A Survey [89.71273968283616]
Large Language Models (LLMs) have demonstrated remarkable capabilities in text understanding and generation.
We present an extensive overview by categorizing these works in terms of various IE subtasks and techniques.
We empirically analyze the most advanced methods and discover the emerging trend of IE tasks with LLMs.
arXiv Detail & Related papers (2023-12-29T14:25:22Z) - MINION: a Large-Scale and Diverse Dataset for Multilingual Event
Detection [65.46122357928041]
Event Detection (ED) is the task of identifying and classifying trigger words of event mentions in text.
Main questions include how well existing ED models perform on different languages, how challenging ED is in other languages, and how well ED knowledge and annotation can be transferred across languages.
We introduce a new large-scale multilingual dataset for ED (called MINION) that consistently annotates events for 8 different languages.
arXiv Detail & Related papers (2022-11-11T02:09:51Z) - MEE: A Novel Multilingual Event Extraction Dataset [62.80569691825534]
Event Extraction aims to recognize event mentions and their arguments from text.
The lack of high-quality multilingual EE datasets for model training and evaluation has been the main hindrance.
We propose a novel Multilingual Event Extraction dataset (EE) that provides annotation for more than 50K event mentions in 8 typologically different languages.
arXiv Detail & Related papers (2022-11-11T02:01:41Z) - SciREX: A Challenge Dataset for Document-Level Information Extraction [56.83748634747753]
It is challenging to create a large-scale information extraction dataset at the document level.
We introduce SciREX, a document level IE dataset that encompasses multiple IE tasks.
We develop a neural model as a strong baseline that extends previous state-of-the-art IE models to document-level IE.
arXiv Detail & Related papers (2020-05-01T17:30:10Z) - MAVEN: A Massive General Domain Event Detection Dataset [56.00401399384715]
Event detection (ED) is the first and most fundamental step for extracting event knowledge from plain text.
Existing datasets exhibit issues that limit further development of ED.
We present a MAssive eVENt detection dataset (MAVEN), which contains 4,480 Wikipedia documents, 118,732 event mention instances, and 168 event types.
arXiv Detail & Related papers (2020-04-28T15:25:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.