Brilla AI: AI Contestant for the National Science and Maths Quiz
- URL: http://arxiv.org/abs/2403.01699v3
- Date: Tue, 30 Apr 2024 19:22:36 GMT
- Title: Brilla AI: AI Contestant for the National Science and Maths Quiz
- Authors: George Boateng, Jonathan Abrefah Mensah, Kevin Takyi Yeboah, William Edor, Andrew Kojo Mensah-Onumah, Naafi Dasana Ibrahim, Nana Sam Yeboah,
- Abstract summary: This work describes and evaluates the first key output for the NSMQ AI Grand Challenge.
It proposes a robust, real-world benchmark for such an AI: "Build an AI to compete live in Ghana's National Science and Maths Quiz (NSMQ) competition and win"
In its debut, our AI answered one of the 4 riddles ahead of the 3 human contesting teams, unofficially placing second (tied)
- Score: 0.7329200485567825
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The African continent lacks enough qualified teachers which hampers the provision of adequate learning support. An AI could potentially augment the efforts of the limited number of teachers, leading to better learning outcomes. Towards that end, this work describes and evaluates the first key output for the NSMQ AI Grand Challenge, which proposes a robust, real-world benchmark for such an AI: "Build an AI to compete live in Ghana's National Science and Maths Quiz (NSMQ) competition and win - performing better than the best contestants in all rounds and stages of the competition". The NSMQ is an annual live science and mathematics competition for senior secondary school students in Ghana in which 3 teams of 2 students compete by answering questions across biology, chemistry, physics, and math in 5 rounds over 5 progressive stages until a winning team is crowned for that year. In this work, we built Brilla AI, an AI contestant that we deployed to unofficially compete remotely and live in the Riddles round of the 2023 NSMQ Grand Finale, the first of its kind in the 30-year history of the competition. Brilla AI is currently available as a web app that livestreams the Riddles round of the contest, and runs 4 machine learning systems: (1) speech to text (2) question extraction (3) question answering and (4) text to speech that work together in real-time to quickly and accurately provide an answer, and then say it with a Ghanaian accent. In its debut, our AI answered one of the 4 riddles ahead of the 3 human contesting teams, unofficially placing second (tied). Improvements and extensions of this AI could potentially be deployed to offer science tutoring to students and eventually enable millions across Africa to have one-on-one learning interactions, democratizing science education.
Related papers
- OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI [73.75520820608232]
We introduce OlympicArena, which includes 11,163 bilingual problems across both text-only and interleaved text-image modalities.
These challenges encompass a wide range of disciplines spanning seven fields and 62 international Olympic competitions, rigorously examined for data leakage.
Our evaluations reveal that even advanced models like GPT-4o only achieve a 39.97% overall accuracy, illustrating current AI limitations in complex reasoning and multimodal integration.
arXiv Detail & Related papers (2024-06-18T16:20:53Z) - Leveraging AI to Advance Science and Computing Education across Africa: Challenges, Progress and Opportunities [1.2691047660244332]
We describe our work developing and deploying AI in Education tools in Africa for science and computing education.
SuaCode is an AI-powered app that enables Africans to learn to code using their smartphones.
AutoGrad is an automated grading, and feedback tool for graphical and interactive coding assignments.
Kwame for Science is a web-based AI teaching assistant that provides instant answers to students' science questions.
arXiv Detail & Related papers (2024-02-12T04:10:09Z) - DanZero+: Dominating the GuanDan Game through Reinforcement Learning [95.90682269990705]
We develop an AI program for an exceptionally complex and popular card game called GuanDan.
We first put forward an AI program named DanZero for this game.
In order to further enhance the AI's capabilities, we apply policy-based reinforcement learning algorithm to GuanDan.
arXiv Detail & Related papers (2023-12-05T08:07:32Z) - Towards an AI to Win Ghana's National Science and Maths Quiz [1.4777718769290527]
The NSMQ is an annual live science and mathematics competition for senior secondary school students in Ghana.
The NSMQ is an exciting live quiz competition with interesting technical challenges across speech-to-text, text-to-speech, question-answering, and human-computer interaction.
An AI that conquers this grand challenge can have real-world impact on education such as enabling millions of students across Africa to have one-on-one learning support from this AI.
arXiv Detail & Related papers (2023-08-08T15:26:58Z) - Stimulating student engagement with an AI board game tournament [0.0]
We present a project-based and competition-based bachelor course that gives second-year students an introduction to search methods applied to board games.
In groups of two, students have to use network programming and AI methods to build an AI agent to compete in a board game tournament-othello was this year's game.
arXiv Detail & Related papers (2023-04-22T11:22:00Z) - A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to
GPT-5 All You Need? [112.12974778019304]
generative AI (AIGC, a.k.a AI-generated content) has made headlines everywhere because of its ability to analyze and create text, images, and beyond.
In the era of AI transitioning from pure analysis to creation, it is worth noting that ChatGPT, with its most recent language model GPT-4, is just a tool out of numerous AIGC tasks.
This work focuses on the technological development of various AIGC tasks based on their output type, including text, images, videos, 3D content, etc.
arXiv Detail & Related papers (2023-03-21T10:09:47Z) - Can an AI Win Ghana's National Science and Maths Quiz? An AI Grand
Challenge for Education [2.0625936401496237]
We propose the NSMQ AI Grand Challenge, an AI Grand Challenge for Education using Ghana's National Science and Maths Quiz competition (NSMQ) as a case study.
Our proposed grand challenge is to "Build an AI to compete live in Ghana's National Science and Maths Quiz (NSMQ) competition and win - performing better than the best contestants in all rounds and stages of the competition"
arXiv Detail & Related papers (2023-01-30T17:28:33Z) - Deep Q-Network for AI Soccer [6.417982603606359]
Deep Q-Network is designed to implement our original rewards, the state space, and the action space to train each agent.
Our algorithm was able to successfully train the agents, and its performance was preliminarily proven through the mini-competition.
With our algorithm, we got the achievement of advancing to the round of 16 in this international competition with 130 teams from 39 countries.
arXiv Detail & Related papers (2022-09-20T06:04:26Z) - Retrospective on the 2021 BASALT Competition on Learning from Human
Feedback [92.37243979045817]
The goal of the competition was to promote research towards agents that use learning from human feedback (LfHF) techniques to solve open-world tasks.
Rather than mandating the use of LfHF techniques, we described four tasks in natural language to be accomplished in the video game Minecraft.
Teams developed a diverse range of LfHF algorithms across a variety of possible human feedback types.
arXiv Detail & Related papers (2022-04-14T17:24:54Z) - The MineRL BASALT Competition on Learning from Human Feedback [58.17897225617566]
The MineRL BASALT competition aims to spur forward research on this important class of techniques.
We design a suite of four tasks in Minecraft for which we expect it will be hard to write down hardcoded reward functions.
We provide a dataset of human demonstrations on each of the four tasks, as well as an imitation learning baseline.
arXiv Detail & Related papers (2021-07-05T12:18:17Z) - The 5th AI City Challenge [51.83023045451549]
The fifth AI City Challenge attracted 305 participating teams across 38 countries.
The evaluation was conducted on both algorithmic effectiveness and computational efficiency.
Results show the promise of AI in Smarter Transportation.
arXiv Detail & Related papers (2021-04-25T19:15:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.