Fugu-MT 論文翻訳(概要): Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation

論文の概要: Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation

arxiv url: http://arxiv.org/abs/2205.01133v1
Date: Mon, 2 May 2022 18:05:35 GMT
ステータス: 翻訳完了
システム内更新日: 2022-05-05 03:29:43.904382
Title: Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation
Title（参考訳）: Hausa Visual Genome: Hausa 機械翻訳のためのマルチモーダル英語データセット
Authors: Idris Abdulmumin, Satya Ranjan Dash, Musa Abdullahi Dawud, Shantipriya Parida, Shamsuddeen Hassan Muhammad, Ibrahim Sa'id Ahmad, Subhadarshi Panda, Ond\v{r}ej Bojar, Bashir Shehu Galadanci, Bello Shehu Bello
Abstract要約: この研究は、Hausa Visual Genome (HaVG) を提示する。データセットは32,923の画像とそれらの記述からなり、トレーニング、開発、テスト、チャレンジテストセットに分けられる。 HaVGはその種類の最初のデータセットであり、ハウサ・イングリッシュ機械翻訳、マルチモーダル・リサーチ、画像記述に使用することができる。
参考スコア（独自算出の注目度）: 0.7536909803290599
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Multi-modal Machine Translation (MMT) enables the use of visual information to enhance the quality of translations. The visual information can serve as a valuable piece of context information to decrease the ambiguity of input sentences. Despite the increasing popularity of such a technique, good and sizeable datasets are scarce, limiting the full extent of their potential. Hausa, a Chadic language, is a member of the Afro-Asiatic language family. It is estimated that about 100 to 150 million people speak the language, with more than 80 million indigenous speakers. This is more than any of the other Chadic languages. Despite a large number of speakers, the Hausa language is considered low-resource in natural language processing (NLP). This is due to the absence of sufficient resources to implement most NLP tasks. While some datasets exist, they are either scarce, machine-generated, or in the religious domain. Therefore, there is a need to create training and evaluation data for implementing machine learning tasks and bridging the research gap in the language. This work presents the Hausa Visual Genome (HaVG), a dataset that contains the description of an image or a section within the image in Hausa and its equivalent in English. To prepare the dataset, we started by translating the English description of the images in the Hindi Visual Genome (HVG) into Hausa automatically. Afterward, the synthetic Hausa data was carefully post-edited considering the respective images. The dataset comprises 32,923 images and their descriptions that are divided into training, development, test, and challenge test set. The Hausa Visual Genome is the first dataset of its kind and can be used for Hausa-English machine translation, multi-modal research, and image description, among various other natural language processing and generation tasks.
Abstract（参考訳）: マルチモーダル機械翻訳(mmt)は、視覚情報を使用して翻訳の質を高めることを可能にする。視覚情報は、入力文の曖昧さを減少させる貴重な文脈情報として機能することができる。このような技術の人気が高まっているにもかかわらず、良質でスケール可能なデータセットは乏しく、その潜在能力を最大限に制限している。ハウサ語(Hausa)は、アフロ・アジア語族に属する言語である。約1億から1億5000万人がこの言語を話し、8000万人以上の先住民が話すと推定されている。これは他のどのチャド語よりも多い。話者数が多いにもかかわらず、Hausa言語は自然言語処理(NLP)において低リソースであると考えられている。これは、ほとんどのNLPタスクを実装するのに十分なリソースがないためである。いくつかのデータセットは存在するが、それらは希少、機械生成、または宗教領域にある。したがって、機械学習タスクを実装し、言語における研究ギャップを埋めるために、トレーニングと評価データを作成する必要がある。 hausa visual genome (havg)は、hausaの画像内の画像またはセクションの記述を含むデータセットであり、英語で等価である。データセットを作成するために、Hindi Visual Genome(HVG)の画像の英語記述をHausaに自動的に翻訳することから始めた。その後, 合成ハウサデータを各画像から慎重に後編集した。データセットは32,923の画像とその記述からなり、トレーニング、開発、テスト、チャレンジテストセットに分けられる。 hausa visual genomeはその種の最初のデータセットであり、様々な自然言語処理や生成タスクの中で、hausa- english machine translation、multi-modal research、image descriptionに使用できる。

関連論文リスト

Who Wrote This? Identifying Machine vs Human-Generated Text in Hausa [2.303135660004888]
ハウサで人間と機械が生成するコンテンツを区別できる最初の大規模検出器を開発した。 AfriXLMRは99.23%、F1スコア99.21%で最高性能を達成した。
論文参考訳（メタデータ） (2025-03-17T12:13:37Z)
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines [74.25764182510295]
視覚言語モデル(VLM)は、特に英語以外の言語において、文化特有の知識に苦しむことが多い。我々は多言語および多文化の視覚的理解のための大規模ベンチマークであるWorld Cuisinesを紹介した。このベンチマークには、30の言語と方言にまたがるテキストイメージペアを備えた、視覚的質問応答(VQA)データセットが含まれている。
論文参考訳（メタデータ） (2024-10-16T16:11:49Z)
Multilingual Diversity Improves Vision-Language Representations [66.41030381363244]
このデータセットの事前トレーニングは、ImageNet上で英語のみまたは英語が支配するデータセットを使用してパフォーマンスが向上する。 GeoDEのような地理的に多様なタスクでは、アフリカから得られる最大の利益とともに、すべての地域における改善も観察します。
論文参考訳（メタデータ） (2024-05-27T08:08:51Z)
The First Swahili Language Scene Text Detection and Recognition Dataset [55.83178123785643]
低リソース言語、特にスワヒリ語には大きなギャップがある。スワヒリ語は東アフリカ諸国で広く話されているが、依然としてシーンテキスト認識において未発見言語である。本研究では,スワヒリシーンのテキスト画像の包括的データセットを提案し,異なるシーンのテキスト検出および認識モデルに基づくデータセットの評価を行う。
論文参考訳（メタデータ） (2024-05-19T03:55:02Z)
An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance [53.974497865647336]
われわれは、画像の翻訳を文化的に意味のあるものにするための第一歩を踏み出した。タスクを行うために、最先端の生成モデルからなる3つのパイプラインを構築します。我々は,翻訳画像の人間による評価を行い,文化的意義と保存の意味を評価する。
論文参考訳（メタデータ） (2024-04-01T17:08:50Z)
NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages [54.808217147579036]
インドネシアの地方言語について事例研究を行う。データセット構築におけるオンラインスクラップ,人文翻訳,および母語話者による段落作成の有効性を比較した。本研究は,母語話者による段落作成によって生成されたデータセットが,語彙的多様性と文化的内容の点で優れた品質を示すことを示す。
論文参考訳（メタデータ） (2023-09-19T14:42:33Z)
Ngambay-French Neural Machine Translation (sba-Fr) [16.55378462843573]
アフリカや世界全体では、言語障壁を克服するニューラルネットワーク翻訳(NMT)システムの開発に注目が集まっている。このプロジェクトでは,Ngambay-to- French翻訳のコーパスである,最初のsba-Frデータセットを作成しました。実験の結果,M2M100モデルは,オリジナルとオリジナルの両方の合成データに対して,BLEUスコアの高い他のモデルよりも優れていた。
論文参考訳（メタデータ） (2023-08-25T17:13:20Z)
The first large scale collection of diverse Hausa language datasets [0.0]
ハウサ語はサハラ以南のアフリカ諸言語の中でよく研究され文書化された言語と考えられている。 1億人以上がこの言語を話すと推定されている。言語の公式な形式と非公式な形式の両方からなる、拡張されたデータセットのコレクションを提供する。
論文参考訳（メタデータ） (2021-02-13T19:34:20Z)
TextMage: The Automated Bangla Caption Generator Based On Deep Learning [1.2330326247154968]
TextMageはバングラデシュの地理的文脈に属する視覚シーンを理解することができるシステムである。このデータセットには、9,154のイメージと、各イメージに対する2つのアノテーションが含まれている。
論文参考訳（メタデータ） (2020-10-15T23:24:15Z)
Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision [110.66085917826648]
我々は,言語トークンを関連画像に文脈的にマッピングすることで,言語のみのデータに対するマルチモーダルアライメントを補間する手法を開発した。語彙化」は比較的小さな画像キャプションデータセットに基づいて訓練され、それを大規模言語コーパスのための語彙生成に適用する。これらの文脈的に生成された語彙を用いて学習し、視覚的に制御された言語モデルにより、複数の純粋言語タスクにおいて、自己教師による代替よりも一貫した改善が示される。
論文参考訳（メタデータ） (2020-10-14T02:11:51Z)
HausaMT v1.0: Towards English-Hausa Neural Machine Translation [0.012691047660244334]
英語・ハウサ語機械翻訳のベースラインモデルを構築した。ハーサ語は、アラビア語に次いで世界で2番目に大きいアフロ・アジア語である。
論文参考訳（メタデータ） (2020-06-09T02:08:03Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。