An Ontology-Aware Framework for Audio Event Classification
- URL: http://arxiv.org/abs/2001.10048v1
- Date: Mon, 27 Jan 2020 20:07:39 GMT
- Title: An Ontology-Aware Framework for Audio Event Classification
- Authors: Yiwei Sun and Shabnam Ghaffarzadegan
- Abstract summary: Recent advancements in audio event classification often ignore the structure and relation between the label classes available as prior information.
We propose an ontology-aware neural network containing two components: feed-forward ontology layers and graph convolutional networks (GCN)
The framework is evaluated on two benchmark datasets for single-label and multi-label audio event classification tasks.
- Score: 19.11706899266862
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent advancements in audio event classification often ignore the structure
and relation between the label classes available as prior information. This
structure can be defined by ontology and augmented in the classifier as a form
of domain knowledge. To capture such dependencies between the labels, we
propose an ontology-aware neural network containing two components:
feed-forward ontology layers and graph convolutional networks (GCN). The
feed-forward ontology layers capture the intra-dependencies of labels between
different levels of ontology. On the other hand, GCN mainly models
inter-dependency structure of labels within an ontology level. The framework is
evaluated on two benchmark datasets for single-label and multi-label audio
event classification tasks. The results demonstrate the proposed solutions
efficacy to capture and explore the ontology relations and improve the
classification performance.
Related papers
- Data-driven Coreference-based Ontology Building [48.995395445597225]
Coreference resolution is traditionally used as a component in individual document understanding.
We take a more global view and explore what can we learn about a domain from the set of all document-level coreference relations.
We release the coreference chains resulting under a creative-commons license, along with the code.
arXiv Detail & Related papers (2024-10-22T14:30:40Z) - Text2Tree: Aligning Text Representation to the Label Tree Hierarchy for
Imbalanced Medical Classification [9.391704905671476]
This paper aims to rethink the data challenges in medical texts and present a novel framework-agnostic algorithm called Text2Tree.
We embed the ICD code tree structure of labels into cascade attention modules for learning hierarchy-aware label representations.
Two new learning schemes, Similarity Surrogate Learning (SSL) and Dissimilarity Mixup Learning (DML), are devised to boost text classification by reusing and distinguishing samples of other labels.
arXiv Detail & Related papers (2023-11-28T10:02:08Z) - Learning Disentangled Label Representations for Multi-label
Classification [39.97251974500034]
One-shared-Feature-for-Multiple-Labels (OFML) is not conducive to learning discriminative label features.
We introduce the One-specific-Feature-for-One-Label (OFOL) mechanism and propose a novel disentangled label feature learning framework.
We achieve state-of-the-art performance on eight datasets.
arXiv Detail & Related papers (2022-12-02T21:49:34Z) - Domain Adaptive Nuclei Instance Segmentation and Classification via
Category-aware Feature Alignment and Pseudo-labelling [65.40672505658213]
We propose a novel deep neural network, namely Category-Aware feature alignment and Pseudo-Labelling Network (CAPL-Net) for UDA nuclei instance segmentation and classification.
Our approach outperforms state-of-the-art UDA methods with a remarkable margin.
arXiv Detail & Related papers (2022-07-04T07:05:06Z) - Semantic Representation and Dependency Learning for Multi-Label Image
Recognition [76.52120002993728]
We propose a novel and effective semantic representation and dependency learning (SRDL) framework to learn category-specific semantic representation for each category.
Specifically, we design a category-specific attentional regions (CAR) module to generate channel/spatial-wise attention matrices to guide model.
We also design an object erasing (OE) module to implicitly learn semantic dependency among categories by erasing semantic-aware regions.
arXiv Detail & Related papers (2022-04-08T00:55:15Z) - The Overlooked Classifier in Human-Object Interaction Recognition [82.20671129356037]
We encode the semantic correlation among classes into the classification head by initializing the weights with language embeddings of HOIs.
We propose a new loss named LSE-Sign to enhance multi-label learning on a long-tailed dataset.
Our simple yet effective method enables detection-free HOI classification, outperforming the state-of-the-arts that require object detection and human pose by a clear margin.
arXiv Detail & Related papers (2022-03-10T23:35:00Z) - Label Hierarchy Transition: Delving into Class Hierarchies to Enhance
Deep Classifiers [40.993137740456014]
We propose a unified probabilistic framework based on deep learning to address the challenges of hierarchical classification.
The proposed framework can be readily adapted to any existing deep network with only minor modifications.
We extend our proposed LHT framework to the skin lesion diagnosis task and validate its great potential in computer-aided diagnosis.
arXiv Detail & Related papers (2021-12-04T14:58:36Z) - MMF: Multi-Task Multi-Structure Fusion for Hierarchical Image
Classification [10.713537820833665]
We consider that different label structures provide a variety of prior knowledge for category recognition.
We propose a multi-task multi-structure fusion model to integrate different label structures.
arXiv Detail & Related papers (2021-07-02T02:53:35Z) - LA-HCN: Label-based Attention for Hierarchical Multi-label
TextClassification Neural Network [16.12197413284402]
We propose a Label-based Attention for Hierarchical Mutlti-label Text Classification Neural Network (LA-HCN)
LA-HCN is designed to hierarchically extract important information from the text based on the labels from different hierarchy levels.
Separate local and global document embeddings are obtained and used to facilitate the respective local and global classifications.
arXiv Detail & Related papers (2020-09-23T06:18:25Z) - Knowledge-Guided Multi-Label Few-Shot Learning for General Image
Recognition [75.44233392355711]
KGGR framework exploits prior knowledge of statistical label correlations with deep neural networks.
It first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence.
Then, it introduces the label semantics to guide learning semantic-specific features.
It exploits a graph propagation network to explore graph node interactions.
arXiv Detail & Related papers (2020-09-20T15:05:29Z) - Hierarchical Image Classification using Entailment Cone Embeddings [68.82490011036263]
We first inject label-hierarchy knowledge into an arbitrary CNN-based classifier.
We empirically show that availability of such external semantic information in conjunction with the visual semantics from images boosts overall performance.
arXiv Detail & Related papers (2020-04-02T10:22:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.