Related papers: NLP for The Greek Language: A Longer Survey

NLP for The Greek Language: A Longer Survey

URL: http://arxiv.org/abs/2408.10962v1
Date: Tue, 20 Aug 2024 15:57:18 GMT
Title: NLP for The Greek Language: A Longer Survey
Authors: Katerina Papantoniou, Yannis Tzitzikas,
Abstract summary: We list and briefly discuss related works, resources and tools, categorized according to various processing layers and contexts. This survey can be useful for researchers and students interested in NLP tasks, Information Retrieval and Knowledge Management for the Greek language.
Score: 1.6114012813668932
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: English language is in the spotlight of the Natural Language Processing (NLP) community with other languages, like Greek, lagging behind in terms of offered methods, tools and resources. Due to the increasing interest in NLP, in this paper we try to condense research efforts for the automatic processing of Greek language covering the last three decades. In particular, we list and briefly discuss related works, resources and tools, categorized according to various processing layers and contexts. We are not restricted to the modern form of Greek language but also cover Ancient Greek and various Greek dialects. This survey can be useful for researchers and students interested in NLP tasks, Information Retrieval and Knowledge Management for the Greek language.

Related papers

The Nature of NLP: Analyzing Contributions in NLP Papers [77.31665252336157]
We quantitatively investigate what constitutes NLP research by examining research papers. Our findings reveal a rising involvement of machine learning in NLP since the early nineties. In post-2020, there has been a resurgence of focus on language and people.
arXiv Detail & Related papers (2024-09-29T01:29:28Z)
Towards Systematic Monolingual NLP Surveys: GenA of Greek NLP [2.3499129784547663]
This study fills the gap by introducing a method for creating systematic and comprehensive monolingual NLP surveys. Characterized by a structured search protocol, it can be used to select publications and organize them through a taxonomy of NLP tasks. By applying our method, we conducted a systematic literature review of Greek NLP from 2012 to 2022.
arXiv Detail & Related papers (2024-07-13T12:01:52Z)
EthioMT: Parallel Corpus for Low-resource Ethiopian Languages [49.80726355048843]
We introduce EthioMT -- a new parallel corpus for 15 languages. We also create a new benchmark by collecting a dataset for better-researched languages in Ethiopia. We evaluate the newly collected corpus and the benchmark dataset for 23 Ethiopian languages using transformer and fine-tuning approaches.
arXiv Detail & Related papers (2024-03-28T12:26:45Z)
Natural Language Processing for Dialects of a Language: A Survey [56.93337350526933]
State-of-the-art natural language processing (NLP) models are trained on massive training corpora, and report a superlative performance on evaluation datasets. This survey delves into an important attribute of these datasets: the dialect of a language. Motivated by the performance degradation of NLP models for dialectic datasets and its implications for the equity of language technologies, we survey past research in NLP for dialects in terms of datasets, and approaches.
arXiv Detail & Related papers (2024-01-11T03:04:38Z)
Teacher Perception of Automatically Extracted Grammar Concepts for L2 Language Learning [66.79173000135717]
We apply this work to teaching two Indian languages, Kannada and Marathi, which do not have well-developed resources for second language learning. We extract descriptions from a natural text corpus that answer questions about morphosyntax (learning of word order, agreement, case marking, or word formation) and semantics (learning of vocabulary). We enlist the help of language educators from schools in North America to perform a manual evaluation, who find the materials have potential to be used for their lesson preparation and learner evaluation.
arXiv Detail & Related papers (2023-10-27T18:17:29Z)
OYXOY: A Modern NLP Test Suite for Modern Greek [2.059776592203642]
This paper serves as a foundational step towards the development of a linguistically motivated evaluation suite for Greek NLP. We introduce four expert-verified evaluation tasks, specifically targeted at natural language inference, word sense disambiguation and metaphor detection. More than language-resourced replicas of existing tasks, we contribute two innovations which will resonate with the broader resource and evaluation community.
arXiv Detail & Related papers (2023-09-13T15:00:56Z)
Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities [3.6328558641172553]
This survey delves into the current state of natural language processing (NLP) for four Ethiopian languages: Amharic, Afaan Oromo, Tigrinya, and Wolaytta.
arXiv Detail & Related papers (2023-03-25T09:04:29Z)
A Survey of Knowledge Enhanced Pre-trained Language Models [78.56931125512295]
We present a comprehensive review of Knowledge Enhanced Pre-trained Language Models (KE-PLMs) For NLU, we divide the types of knowledge into four categories: linguistic knowledge, text knowledge, knowledge graph (KG) and rule knowledge. The KE-PLMs for NLG are categorized into KG-based and retrieval-based methods.
arXiv Detail & Related papers (2022-11-11T04:29:02Z)
Teacher Perception of Automatically Extracted Grammar Concepts for L2 Language Learning [91.49622922938681]
We present an automatic framework that automatically discovers and visualizing descriptions of different aspects of grammar. Specifically, we extract descriptions from a natural text corpus that answer questions about morphosyntax and semantics. We apply this method for teaching the Indian languages, Kannada and Marathi, which, unlike English, do not have well-developed pedagogical resources.
arXiv Detail & Related papers (2022-06-10T14:52:22Z)
How can NLP Help Revitalize Endangered Languages? A Case Study and Roadmap for the Cherokee Language [91.79339725967073]
More than 43% of the languages spoken in the world are endangered. In this work, we focus on discussing how NLP can help revitalize endangered languages. We take Cherokee, a severely-endangered Native American language, as a case study.
arXiv Detail & Related papers (2022-04-25T18:25:57Z)
Multi-granular Legal Topic Classification on Greek Legislation [4.09134848993518]
We study the task of classifying legal texts written in the Greek language. This is the first time the task of Greek legal text classification is considered in an open research project.
arXiv Detail & Related papers (2021-09-30T17:43:00Z)
Transfer Learning for Multi-lingual Tasks -- a Survey [11.596820548674266]
Cross languages content and multilingualism in natural language processing (NLP) are hot topics. We provide a comprehensive overview of the existing literature with a focus on transfer learning techniques in multilingual tasks.
arXiv Detail & Related papers (2021-08-28T20:29:43Z)
PENELOPIE: Enabling Open Information Extraction for the Greek Language through Machine Translation [0.30938904602244344]
We present our submission for the EACL 2021 SRW; a methodology that aims at bridging the gap between high and low-resource languages. We build Neural Machine Translation (NMT) models for English-to-Greek and Greek-to-English based on the Transformer architecture. We leverage these NMT models to produce English translations of Greek text as input for our NLP pipeline, to which we apply a series of pre-processing and triple extraction tasks.
arXiv Detail & Related papers (2021-03-28T08:01:58Z)

This list is automatically generated from the titles and abstracts of the papers in this site.