Few-shot bioacoustic event detection at the DCASE 2023 challenge
- URL: http://arxiv.org/abs/2306.09223v1
- Date: Thu, 15 Jun 2023 15:59:26 GMT
- Title: Few-shot bioacoustic event detection at the DCASE 2023 challenge
- Authors: Ines Nolasco, Burooj Ghani, Shubhr Singh, Ester Vida\~na-Vila, Helen
Whitehead, Emily Grout, Michael Emmerson, Frants Jensen, Ivan Kiskin, Joe
Morford, Ariana Strandburg-Peshkin, Lisa Gill, Hanna Pamu{\l}a, Vincent
Lostanlen, Dan Stowell
- Abstract summary: This task ran as part of the DCASE challenge for the third time this year with an evaluation set expanded to include new animal species.
The 2023 few shot task received submissions from 6 different teams with F-scores reaching as high as 63% on the evaluation set.
Not only have the F-score results steadily improved (40% to 60% to 63%), but the type of systems proposed have also become more complex.
- Score: 5.769642475512074
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Few-shot bioacoustic event detection consists in detecting sound events of
specified types, in varying soundscapes, while having access to only a few
examples of the class of interest. This task ran as part of the DCASE challenge
for the third time this year with an evaluation set expanded to include new
animal species, and a new rule: ensemble models were no longer allowed. The
2023 few shot task received submissions from 6 different teams with F-scores
reaching as high as 63% on the evaluation set. Here we describe the task,
focusing on describing the elements that differed from previous years. We also
take a look back at past editions to describe how the task has evolved. Not
only have the F-score results steadily improved (40% to 60% to 63%), but the
type of systems proposed have also become more complex. Sound event detection
systems are no longer simple variations of the baselines provided: multiple
few-shot learning methodologies are still strong contenders for the task.
Related papers
- Double Mixture: Towards Continual Event Detection from Speech [60.33088725100812]
Speech event detection is crucial for multimedia retrieval, involving the tagging of both semantic and acoustic events.
This paper tackles two primary challenges in speech event detection: the continual integration of new events without forgetting previous ones, and the disentanglement of semantic from acoustic events.
We propose a novel method, 'Double Mixture,' which merges speech expertise with robust memory mechanisms to enhance adaptability and prevent forgetting.
arXiv Detail & Related papers (2024-04-20T06:32:00Z) - Multitask frame-level learning for few-shot sound event detection [46.32294691870714]
This paper focuses on few-shot Sound Event Detection (SED), which aims to automatically recognize and classify sound events with limited samples.
We introduce an innovative multitask frame-level SED framework and TimeFilterAug, a linear timing mask for data augmentation.
The proposed method achieves a F-score of 63.8%, securing the 1st rank in the few-shot bioacoustic event detection category.
arXiv Detail & Related papers (2024-03-17T05:00:40Z) - Regularized Contrastive Pre-training for Few-shot Bioacoustic Sound
Detection [10.395255631261458]
We regularize supervised contrastive pre-training to learn features that can transfer well on new target tasks with animal sounds unseen during training.
This work aims to lower the entry bar to few-shot bioacoustic sound event detection by proposing a simple and yet effective framework for this task.
arXiv Detail & Related papers (2023-09-16T12:11:11Z) - Pretraining Representations for Bioacoustic Few-shot Detection using
Supervised Contrastive Learning [10.395255631261458]
In bioacoustic applications, most tasks come with few labelled training data, because annotating long recordings is time consuming and costly.
We show that learning a rich feature extractor from scratch can be achieved by leveraging data augmentation using a supervised contrastive learning framework.
We obtain an F-score of 63.46% on the validation set and 42.7% on the test set, ranking second in the DCASE challenge.
arXiv Detail & Related papers (2023-09-02T09:38:55Z) - Segment-level Metric Learning for Few-shot Bioacoustic Event Detection [56.59107110017436]
We propose a segment-level few-shot learning framework that utilizes both the positive and negative events during model optimization.
Our system achieves an F-measure of 62.73 on the DCASE 2022 challenge task 5 (DCASE2022-T5) validation set, outperforming the performance of the baseline prototypical network 34.02 by a large margin.
arXiv Detail & Related papers (2022-07-15T22:41:30Z) - Few-shot bioacoustic event detection at the DCASE 2022 challenge [0.0]
Few-shot sound event detection is the task of detecting sound events despite having only a few labelled examples.
This paper presents an overview of the second edition of the few-shot bioacoustic sound event detection task included in the DCASE 2022 challenge.
The highest F-score was of 60% on the evaluation set, which leads to a huge improvement over last year's edition.
arXiv Detail & Related papers (2022-07-14T09:33:47Z) - Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video
Parsing [52.2231419645482]
This paper focuses on the weakly-supervised audio-visual video parsing task.
It aims to recognize all events belonging to each modality and localize their temporal boundaries.
arXiv Detail & Related papers (2022-04-25T11:41:17Z) - Extensively Matching for Few-shot Learning Event Detection [66.31312496170139]
Event detection models under super-vised learning settings fail to transfer to new event types.
Few-shot learning has not beenexplored in event detection.
We propose two novelloss factors that matching examples in the sup-port set to provide more training signals to themodel.
arXiv Detail & Related papers (2020-06-17T18:30:30Z) - Any-Shot Object Detection [81.88153407655334]
'Any-shot detection' is where totally unseen and few-shot categories can simultaneously co-occur during inference.
We propose a unified any-shot detection model, that can concurrently learn to detect both zero-shot and few-shot object classes.
Our framework can also be used solely for Zero-shot detection and Few-shot detection tasks.
arXiv Detail & Related papers (2020-03-16T03:43:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.