Deep Neural Decision Forest for Acoustic Scene Classification
- URL: http://arxiv.org/abs/2203.03436v1
- Date: Mon, 7 Mar 2022 14:39:42 GMT
- Title: Deep Neural Decision Forest for Acoustic Scene Classification
- Authors: Jianyuan Sun, Xubo Liu, Xinhao Mei, Jinzheng Zhao, Mark D. Plumbley,
Volkan K{\i}l{\i}\c{c}, Wenwu Wang
- Abstract summary: Acoustic scene classification (ASC) aims to classify an audio clip based on the characteristic of the recording environment.
We propose a novel approach for ASC using deep neural decision forest (DNDF)
- Score: 45.886356124352226
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Acoustic scene classification (ASC) aims to classify an audio clip based on
the characteristic of the recording environment. In this regard, deep learning
based approaches have emerged as a useful tool for ASC problems. Conventional
approaches to improving the classification accuracy include integrating
auxiliary methods such as attention mechanism, pre-trained models and ensemble
multiple sub-networks. However, due to the complexity of audio clips captured
from different environments, it is difficult to distinguish their categories
without using any auxiliary methods for existing deep learning models using
only a single classifier. In this paper, we propose a novel approach for ASC
using deep neural decision forest (DNDF). DNDF combines a fixed number of
convolutional layers and a decision forest as the final classifier. The
decision forest consists of a fixed number of decision tree classifiers, which
have been shown to offer better classification performance than a single
classifier in some datasets. In particular, the decision forest differs
substantially from traditional random forests as it is stochastic,
differentiable, and capable of using the back-propagation to update and learn
feature representations in neural network. Experimental results on the
DCASE2019 and ESC-50 datasets demonstrate that our proposed DNDF method
improves the ASC performance in terms of classification accuracy and shows
competitive performance as compared with state-of-the-art baselines.
Related papers
- Informed deep hierarchical classification: a non-standard analysis inspired approach [0.0]
It consists in a multi-output deep neural network equipped with specific projection operators placed before each output layer.
The design of such an architecture, called lexicographic hybrid deep neural network (LH-DNN), has been possible by combining tools from different and quite distant research fields.
To assess the efficacy of the approach, the resulting network is compared against the B-CNN, a convolutional neural network tailored for hierarchical classification tasks.
arXiv Detail & Related papers (2024-09-25T14:12:50Z) - Confidence-aware Contrastive Learning for Selective Classification [20.573658672018066]
This work provides a generalization bound for selective classification, disclosing that optimizing feature layers helps improve the performance of selective classification.
Inspired by this theory, we propose to explicitly improve the selective classification model at the feature level for the first time, leading to a novel Confidence-aware Contrastive Learning method for Selective Classification, CCL-SC.
arXiv Detail & Related papers (2024-06-07T08:43:53Z) - Domain Adaptive Nuclei Instance Segmentation and Classification via
Category-aware Feature Alignment and Pseudo-labelling [65.40672505658213]
We propose a novel deep neural network, namely Category-Aware feature alignment and Pseudo-Labelling Network (CAPL-Net) for UDA nuclei instance segmentation and classification.
Our approach outperforms state-of-the-art UDA methods with a remarkable margin.
arXiv Detail & Related papers (2022-07-04T07:05:06Z) - Making CNNs Interpretable by Building Dynamic Sequential Decision
Forests with Top-down Hierarchy Learning [62.82046926149371]
We propose a generic model transfer scheme to make Convlutional Neural Networks (CNNs) interpretable.
We achieve this by building a differentiable decision forest on top of CNNs.
We name the transferred model deep Dynamic Sequential Decision Forest (dDSDF)
arXiv Detail & Related papers (2021-06-05T07:41:18Z) - An evidential classifier based on Dempster-Shafer theory and deep
learning [6.230751621285322]
We propose a new classification system based on Dempster-Shafer (DS) theory and a convolutional neural network (CNN) architecture for set-valued classification.
Experiments on image recognition, signal processing, and semantic-relationship classification tasks demonstrate that the proposed combination of deep CNN, DS layer, and expected utility layer makes it possible to improve classification accuracy.
arXiv Detail & Related papers (2021-03-25T01:29:05Z) - Relational Teacher Student Learning with Neural Label Embedding for
Device Adaptation in Acoustic Scene Classification [49.0621360050418]
We propose a domain adaptation framework to address the device mismatch issue in acoustic scene classification.
Taking into account the structural relationships between acoustic scene classes, our proposed framework captures such relationships which are intrinsically device-independent.
In the training stage, transferable knowledge is condensed in NLE from the source domain.
In the adaptation stage, a novel RTSL strategy is adopted to learn adapted target models without using paired source-target data.
arXiv Detail & Related papers (2020-07-31T23:07:20Z) - Capturing scattered discriminative information using a deep architecture
in acoustic scene classification [49.86640645460706]
In this study, we investigate various methods to capture discriminative information and simultaneously mitigate the overfitting problem.
We adopt a max feature map method to replace conventional non-linear activations in a deep neural network.
Two data augment methods and two deep architecture modules are further explored to reduce overfitting and sustain the system's discriminative power.
arXiv Detail & Related papers (2020-07-09T08:32:06Z) - Fine-Grained Visual Classification with Efficient End-to-end
Localization [49.9887676289364]
We present an efficient localization module that can be fused with a classification network in an end-to-end setup.
We evaluate the new model on the three benchmark datasets CUB200-2011, Stanford Cars and FGVC-Aircraft.
arXiv Detail & Related papers (2020-05-11T14:07:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.