Word Sense Disambiguation as a Game of Neurosymbolic Darts
- URL: http://arxiv.org/abs/2307.16663v1
- Date: Tue, 25 Jul 2023 07:22:57 GMT
- Title: Word Sense Disambiguation as a Game of Neurosymbolic Darts
- Authors: Tiansi Dong, Rafet Sifa
- Abstract summary: We propose a novel neurosymbolic methodology to push the F1 score above 90%.
The core of our methodology is a neurosymbolic sense embedding, in terms of a configuration of nested balls in n-dimensional space.
We trained a Transformer to learn the mapping from a contextualized word embedding to its sense ball embedding, just like playing the game of darts.
- Score: 3.0572129477925727
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Word Sense Disambiguation (WSD) is one of the hardest tasks in natural
language understanding and knowledge engineering. The glass ceiling of 80% F1
score is recently achieved through supervised deep-learning, enriched by a
variety of knowledge graphs. Here, we propose a novel neurosymbolic methodology
that is able to push the F1 score above 90%. The core of our methodology is a
neurosymbolic sense embedding, in terms of a configuration of nested balls in
n-dimensional space. The centre point of a ball well-preserves word embedding,
which partially fix the locations of balls. Inclusion relations among balls
precisely encode symbolic hypernym relations among senses, and enable simple
logic deduction among sense embeddings, which cannot be realised before. We
trained a Transformer to learn the mapping from a contextualized word embedding
to its sense ball embedding, just like playing the game of darts (a game of
shooting darts into a dartboard). A series of experiments are conducted by
utilizing pre-training n-ball embeddings, which have the coverage of around 70%
training data and 75% testing data in the benchmark WSD corpus. The F1 scores
in experiments range from 90.1% to 100.0% in all six groups of test data-sets
(each group has 4 testing data with different sizes of n-ball embeddings). Our
novel neurosymbolic methodology has the potential to break the ceiling of
deep-learning approaches for WSD. Limitations and extensions of our current
works are listed.
Related papers
- Composing Diffusion Policies for Few-shot Learning of Movement Trajectories [1.2576113481317526]
Humans can perform various combinations of physical skills without having to relearn skills from scratch every single time.
We propose a novel compositional approach called DSE that enables few-shot learning for novel skills.
We show that we are able to achieve a reduction of over 30% in Maximum Mean Discrepancy on the Forward Kinematics Kernel (MMD-FK)
arXiv Detail & Related papers (2024-10-22T23:57:37Z) - Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment [126.34547428473968]
Large language models (LLMs) are still struggling in aligning with human preference in complex tasks and scenarios.
We propose a low-redundant alignment method named textbfALLO, focusing on optimizing the most related neurons with the most useful supervised signals.
Experimental results on 10 datasets have shown the effectiveness of ALLO.
arXiv Detail & Related papers (2024-06-18T13:34:40Z) - UrduFake@FIRE2020: Shared Track on Fake News Identification in Urdu [62.6928395368204]
This paper gives the overview of the first shared task at FIRE 2020 on fake news detection in the Urdu language.
The goal is to identify fake news using a dataset composed of 900 annotated news articles for training and 400 news articles for testing.
The dataset contains news in five domains: (i) Health, (ii) Sports, (iii) Showbiz, (iv) Technology, and (v) Business.
arXiv Detail & Related papers (2022-07-25T03:46:51Z) - Overview of the Shared Task on Fake News Detection in Urdu at FIRE 2020 [62.6928395368204]
Task was posed as a binary classification task, in which the goal is to differentiate between real and fake news.
We provided a dataset divided into 900 annotated news articles for training and 400 news articles for testing.
42 teams from 6 different countries (India, China, Egypt, Germany, Pakistan, and the UK) registered for the task.
arXiv Detail & Related papers (2022-07-25T03:41:32Z) - Neuro-Symbolic Learning of Answer Set Programs from Raw Data [54.56905063752427]
Neuro-Symbolic AI aims to combine interpretability of symbolic techniques with the ability of deep learning to learn from raw data.
We introduce Neuro-Symbolic Inductive Learner (NSIL), an approach that trains a general neural network to extract latent concepts from raw data.
NSIL learns expressive knowledge, solves computationally complex problems, and achieves state-of-the-art performance in terms of accuracy and data efficiency.
arXiv Detail & Related papers (2022-05-25T12:41:59Z) - End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge
Distillation [86.41437210485932]
We aim at advancing zero-shot HOI detection to detect both seen and unseen HOIs simultaneously.
We propose a novel end-to-end zero-shot HOI Detection framework via vision-language knowledge distillation.
Our method outperforms the previous SOTA by 8.92% on unseen mAP and 10.18% on overall mAP.
arXiv Detail & Related papers (2022-04-01T07:27:19Z) - Encoding priors in the brain: a reinforcement learning model for mouse
decision making [1.14219428942199]
We study the International Brain Laboratory task, in which a grating appears on either the right or left side of a screen, and a mouse has to move a wheel to bring the grating to the center.
We model this as a reinforcement learning task, using a feedforward neural network to map states to actions, and adjust the weights of the network to maximize reward, learning via policy gradient.
Our model reproduces the main experimental finding - that the psychometric curve with respect to contrast shifts after a block switch in about 10 trials.
arXiv Detail & Related papers (2021-12-10T20:16:36Z) - Open Vocabulary Electroencephalography-To-Text Decoding and Zero-shot
Sentiment Classification [78.120927891455]
State-of-the-art brain-to-text systems have achieved great success in decoding language directly from brain signals using neural networks.
In this paper, we extend the problem to open vocabulary Electroencephalography(EEG)-To-Text Sequence-To-Sequence decoding and zero-shot sentence sentiment classification on natural reading tasks.
Our model achieves a 40.1% BLEU-1 score on EEG-To-Text decoding and a 55.6% F1 score on zero-shot EEG-based ternary sentiment classification, which significantly outperforms supervised baselines.
arXiv Detail & Related papers (2021-12-05T21:57:22Z) - Non-Parametric Few-Shot Learning for Word Sense Disambiguation [11.175893018731712]
MetricWSD is a non-parametric few-shot learning approach to mitigate this data imbalance issue.
By learning to compute distances among the senses of a given word through episodic training, MetricWSD transfers knowledge from high-frequency words to infrequent ones.
arXiv Detail & Related papers (2021-04-26T16:08:46Z) - Can a Fruit Fly Learn Word Embeddings? [16.280120177501733]
The fruit fly brain is one of the best studied systems in neuroscience.
We show that a network motif can learn semantic representations of words and can generate both static and context-dependent word embeddings.
It is shown that not only can the fruit fly network motif achieve performance comparable to existing methods in NLP, but, additionally, it uses only a fraction of the computational resources.
arXiv Detail & Related papers (2021-01-18T05:41:50Z) - Voice@SRIB at SemEval-2020 Task 9 and 12: Stacked Ensembling method for
Sentiment and Offensiveness detection in Social Media [2.9008108937701333]
We train embeddings, ensembling methods for Sentimix, and OffensEval tasks.
We evaluate our models on macro F1-score, precision, accuracy, and recall on the datasets.
arXiv Detail & Related papers (2020-07-20T11:54:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.