SciDr at SDU-2020: IDEAS -- Identifying and Disambiguating Everyday
Acronyms for Scientific Domain
- URL: http://arxiv.org/abs/2102.08818v1
- Date: Wed, 17 Feb 2021 15:24:50 GMT
- Title: SciDr at SDU-2020: IDEAS -- Identifying and Disambiguating Everyday
Acronyms for Scientific Domain
- Authors: Aadarsh Singh and Priyanshu Kumar
- Abstract summary: We present our systems submitted for the shared tasks of Acronym Identification (AI) and Acronym Disambiguation (AD)
We mainly experiment with BERT and SciBERT.
For AD, we formulate the problem as a span prediction task, experiment with different training techniques and also leverage the use of external data.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We present our systems submitted for the shared tasks of Acronym
Identification (AI) and Acronym Disambiguation (AD) held under Workshop on SDU.
We mainly experiment with BERT and SciBERT. In addition, we assess the
effectiveness of "BIOless" tagging and blending along with the prowess of
ensembling in AI. For AD, we formulate the problem as a span prediction task,
experiment with different training techniques and also leverage the use of
external data. Our systems rank 11th and 3rd in AI and AD tasks respectively.
Related papers
- Semi-Supervised One-Shot Imitation Learning [83.94646047695412]
One-shot Imitation Learning aims to imbue AI agents with the ability to learn a new task from a single demonstration.
We introduce the semi-supervised OSIL problem setting, where the learning agent is presented with a large dataset of trajectories.
We develop an algorithm specifically applicable to this semi-supervised OSIL setting.
arXiv Detail & Related papers (2024-08-09T18:11:26Z) - Generative AI [20.57872238271025]
"generative AI" refers to computational techniques that are capable of generating seemingly new, meaningful content.
The widespread diffusion of this technology with examples such as Dall-E 2, GPT-4, and Copilot is currently revolutionizing the way we work and communicate with each other.
arXiv Detail & Related papers (2023-09-13T08:21:59Z) - IM-IAD: Industrial Image Anomaly Detection Benchmark in Manufacturing [88.35145788575348]
Image anomaly detection (IAD) is an emerging and vital computer vision task in industrial manufacturing.
The lack of a uniform IM benchmark is hindering the development and usage of IAD methods in real-world applications.
We construct a comprehensive image anomaly detection benchmark (IM-IAD), which includes 19 algorithms on seven major datasets.
arXiv Detail & Related papers (2023-01-31T01:24:45Z) - Human-Centered AI for Data Science: A Systematic Approach [48.71756559152512]
Human-Centered AI (HCAI) refers to the research effort that aims to design and implement AI techniques to support various human tasks.
We illustrate how we approach HCAI using a series of research projects around Data Science (DS) works as a case study.
arXiv Detail & Related papers (2021-10-03T21:47:13Z) - The MineRL BASALT Competition on Learning from Human Feedback [58.17897225617566]
The MineRL BASALT competition aims to spur forward research on this important class of techniques.
We design a suite of four tasks in Minecraft for which we expect it will be hard to write down hardcoded reward functions.
We provide a dataset of human demonstrations on each of the four tasks, as well as an imitation learning baseline.
arXiv Detail & Related papers (2021-07-05T12:18:17Z) - BERT-based Acronym Disambiguation with Multiple Training Strategies [8.82012912690778]
Acronym disambiguation (AD) task aims to find the correct expansions of an ambiguous ancronym in a given sentence.
We propose a binary classification model incorporating BERT and several training strategies including dynamic negative sample selection.
Experiments on SciAD show the effectiveness of our proposed model and our score ranks 1st in SDU@AAAI-21 shared task 2: Acronym Disambiguation.
arXiv Detail & Related papers (2021-02-25T05:40:21Z) - AT-BERT: Adversarial Training BERT for Acronym Identification Winning
Solution for SDU@AAAI-21 [5.478126869836199]
Acronym identification focuses on finding the acronyms and the phrases that have been abbreviated.
Recent breakthroughs of language models pre-trained on large corpora clearly show that unsupervised pre-training can vastly improve the performance of downstream tasks.
We present an Adversarial Training BERT method named AT-BERT, our winning solution to acronym identification task for Scientific Document Understanding (SDU) Challenge of AAAI 2021.
arXiv Detail & Related papers (2021-01-11T05:02:34Z) - Acronym Identification and Disambiguation Shared Tasks for Scientific
Document Understanding [41.63345823743157]
Acronyms are short forms of longer phrases frequently used in writing.
Every text understanding tool should be capable of recognizing acronyms in text.
To push forward research in this direction, we have organized two shared task for acronym identification and acronym disambiguation in scientific documents.
arXiv Detail & Related papers (2020-12-22T00:29:15Z) - Primer AI's Systems for Acronym Identification and Disambiguation [0.0]
We introduce new methods for acronym identification and disambiguation.
Our systems achieve significant performance gains over previously suggested methods.
Both of our systems perform competitively on the SDU@AAAI-21 shared task leaderboard.
arXiv Detail & Related papers (2020-12-14T23:59:05Z) - What Does This Acronym Mean? Introducing a New Dataset for Acronym
Identification and Disambiguation [74.42107665213909]
Acronyms are the short forms of phrases that facilitate conveying lengthy sentences in documents and serve as one of the mainstays of writing.
Due to their importance, identifying acronyms and corresponding phrases (AI) and finding the correct meaning of each acronym (i.e., acronym disambiguation (AD)) are crucial for text understanding.
Despite the recent progress on this task, there are some limitations in the existing datasets which hinder further improvement.
arXiv Detail & Related papers (2020-10-28T00:12:36Z) - Semantics-Guided Clustering with Deep Progressive Learning for
Semi-Supervised Person Re-identification [58.01834972099855]
Person re-identification (re-ID) requires one to match images of the same person across camera views.
We propose a novel framework of Semantics-Guided Clustering with Deep Progressive Learning (SGC-DPL) to jointly exploit the above data.
Our approach is able to augment the labeled training data in the semi-supervised setting.
arXiv Detail & Related papers (2020-10-02T18:02:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.