Tackling Morphological Analogies Using Deep Learning -- Extended Version
- URL: http://arxiv.org/abs/2111.05147v1
- Date: Tue, 9 Nov 2021 13:45:23 GMT
- Title: Tackling Morphological Analogies Using Deep Learning -- Extended Version
- Authors: Safa Alsaidi, Amandine Decker, Esteban Marquer, Pierre-Alexandre
Murena, Miguel Couceiro
- Abstract summary: Analogical proportions are statements of the form "A is to B as C is to D"
We propose an approach using deep learning to detect and solve morphological analogies.
We demonstrate our model's competitive performance on analogy detection and resolution over multiple languages.
- Score: 8.288496996031684
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Analogical proportions are statements of the form "A is to B as C is to D".
They constitute an inference tool that provides a logical framework to address
learning, transfer, and explainability concerns and that finds useful
applications in artificial intelligence and natural language processing. In
this paper, we address two problems, namely, analogy detection and resolution
in morphology. Multiple symbolic approaches tackle the problem of analogies in
morphology and achieve competitive performance. We show that it is possible to
use a data-driven strategy to outperform those models. We propose an approach
using deep learning to detect and solve morphological analogies. It encodes
structural properties of analogical proportions and relies on a specifically
designed embedding model capturing morphological characteristics of words. We
demonstrate our model's competitive performance on analogy detection and
resolution over multiple languages. We provide an empirical study to analyze
the impact of balancing training data and evaluate the robustness of our
approach to input perturbation.
Related papers
- Explaining Text Similarity in Transformer Models [52.571158418102584]
Recent advances in explainable AI have made it possible to mitigate limitations by leveraging improved explanations for Transformers.
We use BiLRP, an extension developed for computing second-order explanations in bilinear similarity models, to investigate which feature interactions drive similarity in NLP models.
Our findings contribute to a deeper understanding of different semantic similarity tasks and models, highlighting how novel explainable AI methods enable in-depth analyses and corpus-level insights.
arXiv Detail & Related papers (2024-05-10T17:11:31Z) - ANALOGYKB: Unlocking Analogical Reasoning of Language Models with A Million-scale Knowledge Base [51.777618249271725]
ANALOGYKB is a million-scale analogy knowledge base derived from existing knowledge graphs (KGs)
It identifies two types of analogies from the KGs: 1) analogies of the same relations, which can be directly extracted from the KGs, and 2) analogies of analogous relations, which are identified with a selection and filtering pipeline enabled by large language models (LLMs)
arXiv Detail & Related papers (2023-05-10T09:03:01Z) - Solving morphological analogies: from retrieval to generation [4.834203844100681]
Analogical inference is a capability of human reasoning, and has been used to solve hard reasoning tasks.
We propose a deep learning (DL) framework to address and tackle two key tasks in AR: analogy detection and solving.
The framework is thoroughly tested on the Siganalogies dataset of morphological analogical proportions (APs) between words, and shown to outperform symbolic approaches in many languages.
arXiv Detail & Related papers (2023-03-30T12:36:46Z) - A Causal Framework to Quantify the Robustness of Mathematical Reasoning
with Language Models [81.15974174627785]
We study the behavior of language models in terms of robustness and sensitivity to direct interventions in the input space.
Our analysis shows that robustness does not appear to continuously improve as a function of size, but the GPT-3 Davinci models (175B) achieve a dramatic improvement in both robustness and sensitivity compared to all other GPT variants.
arXiv Detail & Related papers (2022-10-21T15:12:37Z) - Amortized Inference for Causal Structure Learning [72.84105256353801]
Learning causal structure poses a search problem that typically involves evaluating structures using a score or independence test.
We train a variational inference model to predict the causal structure from observational/interventional data.
Our models exhibit robust generalization capabilities under substantial distribution shift.
arXiv Detail & Related papers (2022-05-25T17:37:08Z) - A Neural Approach for Detecting Morphological Analogies [7.89271130004391]
Analogical proportions are statements of the form "A is to B as C is to D"
We propose a deep learning approach to detect morphological analogies.
arXiv Detail & Related papers (2021-08-09T11:21:55Z) - On the Transferability of Neural Models of Morphological Analogies [7.89271130004391]
In this paper, we focus on morphological tasks and we propose a deep learning approach to detect morphological analogies.
We present an empirical study to see how our framework transfers across languages, and that highlights interesting similarities and differences between these languages.
In view of these results, we also discuss the possibility of building a multilingual morphological model.
arXiv Detail & Related papers (2021-08-09T11:08:33Z) - Model-agnostic multi-objective approach for the evolutionary discovery
of mathematical models [55.41644538483948]
In modern data science, it is more interesting to understand the properties of the model, which parts could be replaced to obtain better results.
We use multi-objective evolutionary optimization for composite data-driven model learning to obtain the algorithm's desired properties.
arXiv Detail & Related papers (2021-07-07T11:17:09Z) - Modelling Verbal Morphology in Nen [4.6877729174041605]
We use state-of-the-art machine learning models for morphological reinflection to model Nen verbal morphology.
Our results show sensitivity to training data composition; different distributions of verb type yield different accuracies.
We also demonstrate the types of patterns that can be inferred from the training data through the case study of syncretism.
arXiv Detail & Related papers (2020-11-30T01:22:05Z) - Discriminative and Generative Models for Anatomical Shape Analysison
Point Clouds with Deep Neural Networks [3.7814216736076434]
We introduce deep neural networks for the analysis of anatomical shapes that learn a low-dimensional shape representation from the given task.
Our framework is modular and consists of several computing blocks that perform fundamental shape processing tasks.
We propose a discriminative model for disease classification and age regression, as well as a generative model for the accruate reconstruction of shapes.
arXiv Detail & Related papers (2020-10-02T07:37:40Z) - Few-shot Visual Reasoning with Meta-analogical Contrastive Learning [141.2562447971]
We propose to solve a few-shot (or low-shot) visual reasoning problem, by resorting to analogical reasoning.
We extract structural relationships between elements in both domains, and enforce them to be as similar as possible with analogical learning.
We validate our method on RAVEN dataset, on which it outperforms state-of-the-art method, with larger gains when the training data is scarce.
arXiv Detail & Related papers (2020-07-23T14:00:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.