Cross-lingual Editing in Multilingual Language Models
- URL: http://arxiv.org/abs/2401.10521v2
- Date: Sat, 3 Feb 2024 05:59:49 GMT
- Title: Cross-lingual Editing in Multilingual Language Models
- Authors: Himanshu Beniwal, Kowsik Nandagopan D, Mayank Singh
- Abstract summary: This paper introduces the cross-lingual model editing (XME) paradigm, wherein a fact is edited in one language, and the subsequent update propagation is observed across other languages.
The results reveal notable performance limitations of state-of-the-art METs under the XME setting, mainly when the languages involved belong to two distinct script families.
- Score: 1.3062731746155414
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The training of large language models (LLMs) necessitates substantial data
and computational resources, and updating outdated LLMs entails significant
effort and resources. While numerous model editing techniques (METs) have
emerged to efficiently update model outputs without retraining, their
effectiveness in multilingual LLMs, where knowledge is stored in diverse
languages, remains an underexplored research area. This research paper
introduces the cross-lingual model editing (XME) paradigm, wherein a fact is
edited in one language, and the subsequent update propagation is observed
across other languages. To investigate the XME paradigm, we conducted
experiments with BLOOM, mBERT, and XLM-RoBERTa across two writing scripts:
Latin (English, French, and Spanish) and Indic (Hindi, Gujarati, and Bengali).
The results reveal notable performance limitations of
state-of-the-art METs under the XME setting, mainly when the languages involved
belong to two distinct script families. These findings highlight the need for
further research and development of XME techniques to address these challenges.
For more comprehensive information, the dataset used in this research and the
associated code are publicly available at the following URL:
https://github.com/lingo-iitgn/XME.
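To make the XME setting concrete, below is a minimal sketch of a cross-lingual probe. It assumes a fill-mask probe with XLM-RoBERTa and a hypothetical `apply_fact_edit` routine standing in for a model editing technique; the model choice, prompts, and routine are illustrative assumptions, not the authors' released code.

```python
# Minimal sketch of an XME-style probe. Assumptions: the base model, the cloze
# prompts, and the placeholder edit routine are illustrative, not the paper's code.
from transformers import pipeline

# Cloze-style probes for the same fact in a Latin-script and an Indic-script language.
probes = {
    "en": "The capital of France is <mask>.",
    "hi": "फ़्रांस की राजधानी <mask> है।",
}

fill = pipeline("fill-mask", model="xlm-roberta-base")

def top_prediction(prompt: str) -> str:
    """Return the most likely token for the masked slot."""
    return fill(prompt)[0]["token_str"].strip()

# 1) Record pre-edit predictions in every language.
before = {lang: top_prediction(p) for lang, p in probes.items()}

# 2) Apply a model editing technique (MET) to the fact in English only.
#    `apply_fact_edit` is a hypothetical stand-in for whichever MET is evaluated.
# apply_fact_edit(fill.model, subject="France", relation="capital", target="Lyon")

# 3) Re-probe all languages: the XME question is whether an edit made in English
#    also changes the prediction for the Hindi probe.
after = {lang: top_prediction(p) for lang, p in probes.items()}
print(before, after)
```

In the paper's setting, a MET would update the fact in one language and the same probes would show whether the update propagates to languages in the other script family.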
Related papers
- Crosslingual Capabilities and Knowledge Barriers in Multilingual Large Language Models [62.91524967852552]
Large language models (LLMs) are typically multilingual due to pretraining on diverse multilingual corpora.
But can these models relate corresponding concepts across languages, effectively being crosslingual?
This study evaluates six state-of-the-art LLMs on inherently crosslingual tasks.
arXiv Detail & Related papers (2024-06-23T15:15:17Z)
- CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages [86.90220551111096]
Training datasets for large language models (LLMs) are often not fully disclosed.
We present CulturaX, a substantial multilingual dataset with 6.3 trillion tokens in 167 languages.
arXiv Detail & Related papers (2023-09-17T23:49:10Z)
- Adapters for Enhanced Modeling of Multilingual Knowledge and Text [54.02078328453149]
Language models have been extended to multilingual language models (MLLMs).
Knowledge graphs contain facts in an explicit triple format, which require careful curation and are only available in a few high-resource languages.
We propose to enhance MLLMs with knowledge from multilingual knowledge graphs (MLKGs) so as to tackle language and knowledge graph tasks across many languages.
arXiv Detail & Related papers (2022-10-24T21:33:42Z)
- XF2T: Cross-lingual Fact-to-Text Generation for Low-Resource Languages [11.581072296148031]
We conduct an extensive study using popular Transformer-based text generation models on our extended multi-lingual dataset.
Our experiments show that a multilingual mT5 model that uses fact-aware embeddings with structure-aware input encoding leads to the best results on average across the twelve languages.
arXiv Detail & Related papers (2022-09-22T18:01:27Z)
- UNKs Everywhere: Adapting Multilingual Language Models to New Scripts [103.79021395138423]
Massively multilingual language models such as multilingual BERT (mBERT) and XLM-R offer state-of-the-art cross-lingual transfer performance on a range of NLP tasks.
Due to their limited capacity and large differences in pretraining data, there is a profound performance gap between resource-rich and resource-poor target languages.
We propose novel data-efficient methods that enable quick and effective adaptation of pretrained multilingual models to such low-resource languages and unseen scripts.
arXiv Detail & Related papers (2020-12-31T11:37:28Z)
- X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models [103.75890012041366]
Language models (LMs) have proven surprisingly successful at capturing factual knowledge.
However, studies on LMs' factual representation ability have almost invariably been performed on English.
We create a benchmark of cloze-style probes for 23 typologically diverse languages.
arXiv Detail & Related papers (2020-10-13T05:29:56Z)
- FILTER: An Enhanced Fusion Method for Cross-lingual Language Understanding [85.29270319872597]
We propose an enhanced fusion method that takes cross-lingual data as input for XLM finetuning.
During inference, the model makes predictions based on the text input in the target language and its translation in the source language.
We additionally propose a KL-divergence self-teaching loss for model training, based on auto-generated soft pseudo-labels for the translated text in the target language (a generic sketch of such a loss follows this list).
arXiv Detail & Related papers (2020-09-10T22:42:15Z)
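As a minimal, hedged sketch of the kind of KL-divergence self-teaching loss mentioned in the FILTER entry above (the function and variable names are illustrative assumptions, not the paper's implementation):

```python
# Illustrative sketch, not FILTER's actual code: soft pseudo-labels (a teacher
# distribution) supervise the model's predictions on translated target-language text.
import torch
import torch.nn.functional as F

def self_teaching_kl_loss(student_logits: torch.Tensor,
                          pseudo_label_logits: torch.Tensor,
                          temperature: float = 1.0) -> torch.Tensor:
    """KL divergence between soft pseudo-labels and student predictions, batch-averaged."""
    log_p_student = F.log_softmax(student_logits / temperature, dim=-1)
    p_pseudo = F.softmax(pseudo_label_logits / temperature, dim=-1)
    # F.kl_div expects log-probabilities as input and probabilities as target.
    return F.kl_div(log_p_student, p_pseudo, reduction="batchmean")

# Example usage with random logits for a batch of 4 examples and 3 classes.
if __name__ == "__main__":
    student = torch.randn(4, 3)
    pseudo = torch.randn(4, 3)
    print(self_teaching_kl_loss(student, pseudo).item())
```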