Related papers: Banknote Recognition for Visually Impaired People (Case of Ethiopian note)

Banknote Recognition for Visually Impaired People (Case of Ethiopian note)

URL: http://arxiv.org/abs/2209.03236v1
Date: Thu, 25 Aug 2022 19:46:34 GMT
Title: Banknote Recognition for Visually Impaired People (Case of Ethiopian note)
Authors: Nuredin Ali Abdelkadir
Abstract summary: We developed an Android and IOS compatible mobile application with a model that achieved 98.9% classification accuracy on our dataset. The application has a voice integrated feature that tells the type of the scanned currency in Amharic, the working language of Ethiopia.
Score: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Currency is used almost everywhere to facilitate business. In most developing countries, especially the ones in Africa, tangible notes are predominantly used in everyday financial transactions. One of these countries, Ethiopia, is believed to have one of the world highest rates of blindness (1.6%) and low vision (3.7%). There are around 4 million visually impaired people; With 1.7 million people being in complete vision loss. Those people face a number of challenges when they are in a bus station, in shopping centers, or anywhere which requires the physical exchange of money. In this paper, we try to provide a solution to this issue using AI/ML applications. We developed an Android and IOS compatible mobile application with a model that achieved 98.9% classification accuracy on our dataset. The application has a voice integrated feature that tells the type of the scanned currency in Amharic, the working language of Ethiopia. The application is developed to be easily accessible by its users. It is build to reduce the burden of visually impaired people in Ethiopia.

Related papers

Real-Time Currency Detection and Voice Feedback for Visually Impaired Individuals [0.0]
This paper presents a real-time currency detection system designed to assist visually impaired individuals.<n>The proposed model is trained on a dataset containing 30 classes of notes and coins, representing 3 types of currency: US dollar (USD), Euro (EUR), and Bangladeshi taka (BDT)
arXiv Detail & Related papers (2025-10-23T06:48:04Z)
Money Recognition for the Visually Impaired: A Case Study on Sri Lankan Banknotes [0.0]
This research proposes a user-friendly stand-alone system for the identification of Sri Lankan currency notes. A custom-created dataset of images of Sri Lankan currency notes was used to fine-tune an EfficientDet model. The model achieved 0.9847 AP on the validation dataset and performs exceptionally well in real-world scenarios.
arXiv Detail & Related papers (2025-02-20T05:07:46Z)
Real-time Yemeni Currency Detection [0.49109372384514843]
Banknote recognition is a major problem faced by visually Challenged people. This paper presents a real-time Yemeni currency detection system for visually impaired persons.
arXiv Detail & Related papers (2024-06-18T19:57:15Z)
Improve accessibility for Low Vision and Blind people using Machine Learning and Computer Vision [0.0]
This project explores how machine learning and computer vision could be utilized to improve accessibility for people with visual impairments. This project will concentrate on building a mobile application that helps blind people to orient in space by receiving audio and haptic feedback.
arXiv Detail & Related papers (2024-03-24T21:19:17Z)
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World [71.52132776748628]
We present the All-Seeing (AS) project: a large-scale data and model for recognizing and understanding everything in the open world. We create a new dataset (AS-1B) with over 1 billion regions annotated with semantic tags, question-answering pairs, and detailed captions. We develop the All-Seeing model (ASM), a unified framework for panoptic visual recognition and understanding.
arXiv Detail & Related papers (2023-08-03T17:59:47Z)
Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models [68.29126169579132]
API vendors charge their users based on usage, more specifically on the number of tokens'' processed or generated by the underlying language models. What constitutes a token, however, is training data and model dependent with a large variance in the number of tokens required to convey the same information in different languages. We conduct a systematic analysis of the cost and utility of OpenAI's language model API on multilingual benchmarks in 22 typologically diverse languages.
arXiv Detail & Related papers (2023-05-23T05:46:45Z)
AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages [45.88640066767242]
Africa is home to over 2,000 languages from more than six language families and has the highest linguistic diversity among all continents. Yet, there is little NLP research conducted on African languages. Crucial to enabling such research is the availability of high-quality annotated datasets. In this paper, we introduce AfriSenti, a sentiment analysis benchmark that contains a total of >110,000 tweets in 14 African languages.
arXiv Detail & Related papers (2023-02-17T15:40:12Z)
Complex Daily Activities, Country-Level Diversity, and Smartphone Sensing: A Study in Denmark, Italy, Mongolia, Paraguay, and UK [6.52702503779308]
Smartphones enable understanding human behavior with activity recognition to support people's daily lives. People are more sedentary in the post-pandemic world with the prevalence of remote/hybrid work/study settings. We analyzed in-the-wild smartphone data and over 216K self-reports from 637 college students in five countries.
arXiv Detail & Related papers (2023-02-16T21:34:55Z)
Comprehensive Benchmark Datasets for Amharic Scene Text Detection and Recognition [56.048783994698425]
Ethiopic/Amharic script is one of the oldest African writing systems, which serves at least 23 languages in East Africa. The Amharic writing system, Abugida, has 282 syllables, 15 punctuation marks, and 20 numerals. We presented the first comprehensive public datasets named HUST-ART, HUST-AST, ABE, and Tana for Amharic script detection and recognition in the natural scene.
arXiv Detail & Related papers (2022-03-23T03:19:35Z)
Using Radio Archives for Low-Resource Speech Recognition: Towards an Intelligent Virtual Assistant for Illiterate Users [3.3946853660795884]
In many countries, illiterate people tend to speak only low-resource languages. We investigate the effectiveness of unsupervised speech representation learning on noisy radio broadcasting archives. Our contributions offer a path forward for ethical AI research to serve the needs of those most disadvantaged by the digital divide.
arXiv Detail & Related papers (2021-04-27T10:09:34Z)
Phoneme Recognition through Fine Tuning of Phonetic Representations: a Case Study on Luhya Language Varieties [77.2347265289855]
We focus on phoneme recognition using Allosaurus, a method for multilingual recognition based on phonetic annotation. To evaluate in a challenging real-world scenario, we curate phone recognition datasets for Bukusu and Saamia, two varieties of the Luhya language cluster of western Kenya and eastern Uganda. We find that fine-tuning of Allosaurus, even with just 100 utterances, leads to significant improvements in phone error rates.
arXiv Detail & Related papers (2021-04-04T15:07:55Z)
Skeleton Based Sign Language Recognition Using Whole-body Keypoints [71.97020373520922]
Sign language is used by deaf or speech impaired people to communicate. Skeleton-based recognition is becoming popular that it can be further ensembled with RGB-D based method to achieve state-of-the-art performance. Inspired by the recent development of whole-body pose estimation citejin 2020whole, we propose recognizing sign language based on the whole-body key points and features.
arXiv Detail & Related papers (2021-03-16T03:38:17Z)
SqueezeFacePoseNet: Lightweight Face Verification Across Different Poses for Mobile Platforms [55.84746218227712]
Face verification technologies can provide reliable and robust user authentication, given the availability of cameras in mobile devices. Deep Convolutional Neural Networks have resulted in many accurate face verification architectures, but their typical size (hundreds of megabytes) makes them infeasible to be incorporated in downloadable mobile applications. We develop a lightweight face recognition network of just a few megabytes that can operate with sufficient accuracy in comparison to much larger models.
arXiv Detail & Related papers (2020-07-16T19:02:38Z)
Unblind Your Apps: Predicting Natural-Language Labels for Mobile GUI Components by Deep Learning [21.56849865328527]
More than 77% apps have issues of missing labels, according to our analysis of 10,408 Android apps. We develop a deep-learning based model, called LabelDroid, to automatically predict the labels of image-based buttons. The experimental results show that our model can make accurate predictions and the generated labels are of higher quality than that from real Android developers.
arXiv Detail & Related papers (2020-03-01T02:31:26Z)

This list is automatically generated from the titles and abstracts of the papers in this site.