FST Morphological Analyser and Generator for Mapud\"ungun
- URL: http://arxiv.org/abs/2109.09176v1
- Date: Sun, 19 Sep 2021 17:32:17 GMT
- Title: FST Morphological Analyser and Generator for Mapud\"ungun
- Authors: Andr\'es Chand\'ia
- Abstract summary: This article describes the main morphophonological aspects of Mapud"ungun, explaining what triggers them and the contexts where they arise.
We present a computational approach producing a finite state morphological analyser capable of classifying and appropriately tagging all the components that interact in a Mapuche word form.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Following the Mapuche grammar by Smeets, this article describes the main
morphophonological aspects of Mapud\"ungun, explaining what triggers them and
the contexts where they arise. We present a computational approach producing a
finite state morphological analyser (and generator) capable of classifying and
appropriately tagging all the components (roots and suffixes) that interact in
a Mapuche word form. The bulk of the article focuses on presenting details
about the morphology of Mapud\"ungun verb and its formalisation using FOMA. A
system evaluation process and its results are also present in this article.
Related papers
- Lexical categories of stem-forming roots in Mapudüngun verb forms [0.0]
The primary focus is on the lexical category classification of Mapud"ungun roots recognised as verbal in the source.
Results of this lexical category revision directly benefit the computational analyser, as they are implemented as soon as they are verified.
It is hoped that these results will help clarify some uncertainties about lexical categories in the Mapuche language.
arXiv Detail & Related papers (2025-02-11T15:10:23Z) - Unsupervised Morphological Tree Tokenizer [36.584680344291556]
We introduce morphological structure guidance to tokenization and propose a deep model to induce character-level structures of words.
Specifically, the deep model jointly encodes internal structures and representations of words with a mechanism named $textitOverriding$ to ensure the indecomposability of morphemes.
Based on the induced structures, our algorithm tokenizes words through vocabulary matching in a top-down manner.
arXiv Detail & Related papers (2024-06-21T15:35:49Z) - Labeled Morphological Segmentation with Semi-Markov Models [127.69031138022534]
We present labeled morphological segmentation, an alternative view of morphological processing that unifies several tasks.
We additionally introduce a new hierarchy of morphotactic tagsets.
We develop modelname, a discriminative morphological segmentation system that explicitly models morphotactics.
arXiv Detail & Related papers (2024-04-13T12:51:53Z) - Conversational Semantic Parsing using Dynamic Context Graphs [68.72121830563906]
We consider the task of conversational semantic parsing over general purpose knowledge graphs (KGs) with millions of entities, and thousands of relation-types.
We focus on models which are capable of interactively mapping user utterances into executable logical forms.
arXiv Detail & Related papers (2023-05-04T16:04:41Z) - Classifying text using machine learning models and determining
conversation drift [4.785406121053965]
An analysis of various types of texts is invaluable to understanding both their semantic meaning, as well as their relevance.
Text classification is a method of categorising documents.
It combines computer text classification and natural language processing to analyse text in aggregate.
arXiv Detail & Related papers (2022-11-15T18:09:45Z) - Uzbek affix finite state machine for stemming [0.0]
The proposed methodology is a morphologic analysis of Uzbek words by using an affix to find a root and without including any lexicon.
This method helps to perform morphological analysis of words from a large amount of text at high speed as well as it is not required using of memory for keeping vocabulary.
arXiv Detail & Related papers (2022-05-20T10:46:53Z) - Graph Adaptive Semantic Transfer for Cross-domain Sentiment
Classification [68.06496970320595]
Cross-domain sentiment classification (CDSC) aims to use the transferable semantics learned from the source domain to predict the sentiment of reviews in the unlabeled target domain.
We present Graph Adaptive Semantic Transfer (GAST) model, an adaptive syntactic graph embedding method that is able to learn domain-invariant semantics from both word sequences and syntactic graphs.
arXiv Detail & Related papers (2022-05-18T07:47:01Z) - Understanding Synonymous Referring Expressions via Contrastive Features [105.36814858748285]
We develop an end-to-end trainable framework to learn contrastive features on the image and object instance levels.
We conduct extensive experiments to evaluate the proposed algorithm on several benchmark datasets.
arXiv Detail & Related papers (2021-04-20T17:56:24Z) - Neural disambiguation of lemma and part of speech in morphologically
rich languages [0.6346772579930928]
We consider the problem of disambiguating the lemma and part of speech of ambiguous words in morphologically rich languages.
We propose a method for disambiguating ambiguous words in context, using a large un-annotated corpus of text, and a morphological analyser.
arXiv Detail & Related papers (2020-07-12T21:48:52Z) - Graph-Structured Referring Expression Reasoning in The Wild [105.95488002374158]
Grounding referring expressions aims to locate in an image an object referred to by a natural language expression.
We propose a scene graph guided modular network (SGMN) to perform reasoning over a semantic graph and a scene graph.
We also propose Ref-Reasoning, a large-scale real-world dataset for structured referring expression reasoning.
arXiv Detail & Related papers (2020-04-19T11:00:30Z) - A Simple Joint Model for Improved Contextual Neural Lemmatization [60.802451210656805]
We present a simple joint neural model for lemmatization and morphological tagging that achieves state-of-the-art results on 20 languages.
Our paper describes the model in addition to training and decoding procedures.
arXiv Detail & Related papers (2019-04-04T02:03:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.