Auto-KWS 2021 Challenge: Task, Datasets, and Baselines
- URL: http://arxiv.org/abs/2104.00513v1
- Date: Wed, 31 Mar 2021 14:56:48 GMT
- Title: Auto-KWS 2021 Challenge: Task, Datasets, and Baselines
- Authors: Jingsong Wang, Yuxuan He, Chunyu Zhao, Qijie Shao, Wei-Wei Tu, Tom Ko,
Hung-yi Lee, Lei Xie
- Abstract summary: Auto-KWS 2021 challenge calls for automated machine learning (AutoML) solutions to automate the process of applying machine learning to a customized keyword spotting task.
The challenge focuses on the problem of customized keyword spotting, where the target device can only be awakened by an enrolled speaker with his specified keyword.
- Score: 63.82759886293636
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Auto-KWS 2021 challenge calls for automated machine learning (AutoML)
solutions to automate the process of applying machine learning to a customized
keyword spotting task. Compared with other keyword spotting tasks, Auto-KWS
challenge has the following three characteristics: 1) The challenge focuses on
the problem of customized keyword spotting, where the target device can only be
awakened by an enrolled speaker with his specified keyword. The speaker can use
any language and accent to define his keyword. 2) All dataset of the challenge
is recorded in realistic environment. It is to simulate different user
scenarios. 3) Auto-KWS is a "code competition", where participants need to
submit AutoML solutions, then the platform automatically runs the enrollment
and prediction steps with the submitted code.This challenge aims at promoting
the development of a more personalized and flexible keyword spotting system.
Two baseline systems are provided to all participants as references.
Related papers
- Responsible Task Automation: Empowering Large Language Models as
Responsible Task Automators [17.991044940694778]
Large Language Models (LLMs) have shown a promising prospect in automatically completing tasks upon user instructions.
A big question emerges: how can we make machines behave responsibly when helping humans automate tasks as personal copilots?
We present Responsible Task Automation (ResponsibleTA) as a fundamental framework to facilitate responsible collaboration between LLM-based coordinators and executors for task automation.
arXiv Detail & Related papers (2023-06-02T02:42:58Z) - AutoML-GPT: Automatic Machine Learning with GPT [74.30699827690596]
We propose developing task-oriented prompts and automatically utilizing large language models (LLMs) to automate the training pipeline.
We present the AutoML-GPT, which employs GPT as the bridge to diverse AI models and dynamically trains models with optimized hyper parameters.
This approach achieves remarkable results in computer vision, natural language processing, and other challenging areas.
arXiv Detail & Related papers (2023-05-04T02:09:43Z) - Personalized Keyword Spotting through Multi-task Learning [6.4423565043274795]
We design two personalized KWS tasks; (1) Target user Biased KWS (TB-KWS) and (2) Target user Only KWS (TO-KWS)
To solve the tasks, we propose personalized keyword spotting through multi-task learning (PK-MTL) that consists of multi-task learning and task-adaptation.
We evaluate our framework on conventional and personalized scenarios, and the results show that PK-MTL can dramatically reduce the false alarm rate.
arXiv Detail & Related papers (2022-06-28T02:48:34Z) - The VoicePrivacy 2022 Challenge Evaluation Plan [46.807999940446294]
Training, development and evaluation datasets are provided.
Participants apply their developed anonymization systems.
Results will be presented at a workshop held in conjunction with INTERSPEECH 2022.
arXiv Detail & Related papers (2022-03-23T15:05:18Z) - Connecting Language and Vision for Natural Language-Based Vehicle
Retrieval [77.88818029640977]
In this paper, we apply one new modality, i.e., the language description, to search the vehicle of interest.
To connect language and vision, we propose to jointly train the state-of-the-art vision models with the transformer-based language model.
Our proposed method has achieved the 1st place on the 5th AI City Challenge, yielding competitive performance 18.69% MRR accuracy.
arXiv Detail & Related papers (2021-05-31T11:42:03Z) - AutoSpeech 2020: The Second Automated Machine Learning Challenge for
Speech Classification [31.22181821515342]
The AutoSpeech challenge calls for automated machine learning (AutoML) solutions to automate the process of applying machine learning to speech processing tasks.
This paper outlines the challenge protocol, datasets, evaluation metric, starting kit, and baseline systems.
arXiv Detail & Related papers (2020-10-25T15:01:41Z) - ConvAI3: Generating Clarifying Questions for Open-Domain Dialogue
Systems (ClariQ) [64.60303062063663]
This document presents a detailed description of the challenge on clarifying questions for dialogue systems (ClariQ)
The challenge is organized as part of the Conversational AI challenge series (ConvAI3) at Search Oriented Conversational AI (SCAI) EMNLP workshop in 2020.
arXiv Detail & Related papers (2020-09-23T19:48:02Z) - The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning
with Keywords and Sentence Length Estimation [49.41766997393417]
This report describes the system participating to the Detection and Classification of Acoustic Scenes and Events (DCASE) 2020 Challenge, Task 6.
Our submission focuses on solving two indeterminacy problems in automated audio captioning: word selection indeterminacy and sentence length indeterminacy.
We simultaneously solve the main caption generation and sub indeterminacy problems by estimating keywords and sentence length through multi-task learning.
arXiv Detail & Related papers (2020-07-01T04:26:27Z) - A Baseline for the Commands For Autonomous Vehicles Challenge [7.430057056425165]
The challenge is based on the recent textttTalk2Car dataset.
This document provides a technical overview of a model that we released to help participants get started in the competition.
arXiv Detail & Related papers (2020-04-20T13:35:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.