Machine Intelligence in Africa: a survey
- URL: http://arxiv.org/abs/2402.02218v1
- Date: Sat, 3 Feb 2024 17:27:14 GMT
- Title: Machine Intelligence in Africa: a survey
- Authors: Allahsera Auguste Tapo and Ali Traore and Sidy Danioko and Hamidou
Tembine
- Abstract summary: This paper summarizes recent developments in machine intelligence in Africa from a multi-layer multiscale and culture-aware ethics perspective.
It showcases MI use cases in 54 African countries through 400 articles on MI research, industry, government actions, as well as uses in art, music, the informal economy, and small businesses in Africa.
- Score: 1.511305953975385
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In the last 5 years, the availability of large audio datasets in African
countries has opened unlimited opportunities to build machine intelligence (MI)
technologies that are closer to the people and speak, learn, understand, and do
businesses in local languages, including for those who cannot read and write.
Unfortunately, these audio datasets are not fully exploited by current MI
tools, leaving several Africans out of MI business opportunities. Additionally,
many state-of-the-art MI models are not culture-aware, and the ethics of their
adoption indexes are questionable. The lack thereof is a major drawback in many
applications in Africa. This paper summarizes recent developments in machine
intelligence in Africa from a multi-layer multiscale and culture-aware ethics
perspective, showcasing MI use cases in 54 African countries through 400
articles on MI research, industry, government actions, as well as uses in art,
music, the informal economy, and small businesses in Africa. The survey also
opens discussions on the reliability of MI rankings and indexes in the African
continent as well as algorithmic definitions of unclear terms used in MI.
Related papers
- Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking Dataset [74.74686464187474]
Emotion and Intent Joint Understanding in Multimodal Conversation (MC-EIU) aims to decode the semantic information manifested in a multimodal conversational history.
MC-EIU is enabling technology for many human-computer interfaces.
We propose an MC-EIU dataset, which features 7 emotion categories, 9 intent categories, 3 modalities, i.e., textual, acoustic, and visual content, and two languages, English and Mandarin.
arXiv Detail & Related papers (2024-07-03T01:56:00Z) - The State of Computer Vision Research in Africa [5.047087421666734]
This study analyzes 63,000 Scopus-indexed computer vision publications from Africa.
We utilize large language models to automatically parse their abstracts, to identify and categorize topics and datasets.
We also conduct a large-scale questionnaire among African computer vision researchers to identify the structural barriers they believe require urgent attention.
arXiv Detail & Related papers (2024-01-21T22:50:44Z) - A Survey on Multimodal Large Language Models [71.63375558033364]
Multimodal Large Language Model (MLLM) represented by GPT-4V has been a new rising research hotspot.
This paper aims to trace and summarize the recent progress of MLLMs.
arXiv Detail & Related papers (2023-06-23T15:21:52Z) - AfroDigits: A Community-Driven Spoken Digit Dataset for African
Languages [32.23306825605942]
AfroDigits is a minimalist dataset of spoken digits for African languages.
We conduct audio digit classification experiments on six African languages.
AfroDigits is the first published audio digit dataset for African languages.
arXiv Detail & Related papers (2023-03-22T14:09:20Z) - AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages [45.88640066767242]
Africa is home to over 2,000 languages from more than six language families and has the highest linguistic diversity among all continents.
Yet, there is little NLP research conducted on African languages. Crucial to enabling such research is the availability of high-quality annotated datasets.
In this paper, we introduce AfriSenti, a sentiment analysis benchmark that contains a total of >110,000 tweets in 14 African languages.
arXiv Detail & Related papers (2023-02-17T15:40:12Z) - MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity
Recognition [55.95128479289923]
African languages are spoken by over a billion people, but are underrepresented in NLP research and development.
We create the largest human-annotated NER dataset for 20 African languages.
We show that choosing the best transfer language improves zero-shot F1 scores by an average of 14 points.
arXiv Detail & Related papers (2022-10-22T08:53:14Z) - Audio-Oriented Multimodal Machine Comprehension: Task, Dataset and Model [51.42415340921237]
We propose a Dynamic Inter- and Intra-modality Attention (DIIA) model to fuse the two modalities (audio and textual)
We further develop a Multimodal Knowledge Distillation (MKD) module to enable our multimodal MC model to accurately predict the answers based only on either the text or the audio.
arXiv Detail & Related papers (2021-07-04T08:35:20Z) - MasakhaNER: Named Entity Recognition for African Languages [48.34339599387944]
We create the first large publicly available high-quality dataset for named entity recognition in ten African languages.
We detail characteristics of the languages to help researchers understand the challenges that these languages pose for NER.
arXiv Detail & Related papers (2021-03-22T13:12:44Z) - Narratives and Counternarratives on Data Sharing in Africa [4.334357692599945]
Data sharing can support research and policy design to alleviate poverty, inequality, and derivative effects in Africa.
Many argue that data sharing can support research and policy design to alleviate poverty, inequality, and derivative effects in Africa.
These perspectives frequently employ a deficit narratives, often focusing on lack of education, training, and technological resources in the continent.
We argue that these narratives obfuscate and distort the full complexity of the African data sharing landscape.
arXiv Detail & Related papers (2021-03-01T18:07:35Z) - Lanfrica: A Participatory Approach to Documenting Machine Translation
Research on African Languages [0.012691047660244334]
Africa has the highest language diversity, with 1500-2000 documented languages and many more undocumented or extinct languages.
This makes it hard to keep track of the MT research, models and dataset that have been developed for some of them.
Online platforms can be useful creating accessibility to researches, benchmarks and datasets in these African languages.
arXiv Detail & Related papers (2020-08-03T18:14:04Z) - Masakhane -- Machine Translation For Africa [16.66010516114378]
MASAKHANE is an open-source, continent-wide, distributed, online research effort for machine translation for African languages.
We discuss our methodology for building the community and spurring research from the African continent.
arXiv Detail & Related papers (2020-03-13T09:01:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.