"Seeing Sound": Audio Classification with the Wigner-Wille Distribution
and Convolutional Neural Networks
- URL: http://arxiv.org/abs/2211.03202v1
- Date: Sun, 6 Nov 2022 19:01:02 GMT
- Title: "Seeing Sound": Audio Classification with the Wigner-Wille Distribution
and Convolutional Neural Networks
- Authors: Antonios Marios Christonasis, Stef van Eijndhoven, Peter Duin
- Abstract summary: Data coming from sensor networks are combined with sensor fusion and AI algorithms to drive innovation in fields such as self-driving cars.
This paper investigates the potential of the utilization of sound-sensor data in an urban context.
We propose a novel approach of classifying sound data using the Wigner-Ville distribution and Convolutional Neural Networks.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: With big data becoming increasingly available, IoT hardware becoming widely
adopted, and AI capabilities becoming more powerful, organizations are
continuously investing in sensing. Data coming from sensor networks are
currently combined with sensor fusion and AI algorithms to drive innovation in
fields such as self-driving cars. Data from these sensors can be utilized in
numerous use cases, including alerts in safety systems of urban settings, for
events such as gun shots and explosions. Moreover, diverse types of sensors,
such as sound sensors, can be utilized in low-light conditions or at locations
where a camera is not available. This paper investigates the potential of the
utilization of sound-sensor data in an urban context. Technically, we propose a
novel approach of classifying sound data using the Wigner-Ville distribution
and Convolutional Neural Networks. In this paper, we report on the performance
of the approach on open-source datasets. The concept and work presented is
based on my doctoral thesis, which was performed as part of the Engineering
Doctorate program in Data Science at the University of Eindhoven, in
collaboration with the Dutch National Police. Additional work on real-world
datasets was performed during the thesis, which are not presented here due to
confidentiality.
Related papers
- Multi-Source Urban Traffic Flow Forecasting with Drone and Loop Detector Data [61.9426776237409]
Drone-captured data can create an accurate multi-sensor mobility observatory for large-scale urban networks.
A simple yet effective graph-based model HiMSNet is proposed to integrate multiple data modalities and learn-temporal correlations.
arXiv Detail & Related papers (2025-01-07T03:23:28Z) - Denoising and Adaptive Online Vertical Federated Learning for Sequential Multi-Sensor Data in Industrial Internet of Things [22.208830538149606]
This study focuses on an industrial assembly line scenario where multiple sensors sequentially collect real-time data.
We propose the Denoising and Adaptive Online Vertical Federated Learning (DAO-VFL) algorithm.
arXiv Detail & Related papers (2025-01-03T08:22:15Z) - Training a Distributed Acoustic Sensing Traffic Monitoring Network With Video Inputs [0.0]
We present a novel concept that integrates DAS data with co-located visual information.
Our model achieves a performance exceeding 94% for detection and classification, and about 1.2% false alarm rate.
arXiv Detail & Related papers (2024-12-17T10:06:42Z) - Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case Study [59.780800481241066]
Generative Artificial Intelligence (GenAI) can be an effective alternative to wireless data augmentation.
This article explores the potential and effectiveness of GenAI-driven data augmentation in wireless networks.
We propose a general generative diffusion model-based data augmentation framework for Wi-Fi gesture recognition.
arXiv Detail & Related papers (2024-11-13T05:15:25Z) - Networking Systems for Video Anomaly Detection: A Tutorial and Survey [55.28514053969056]
Video Anomaly Detection (VAD) is a fundamental research task within the Artificial Intelligence (AI) community.
In this article, we delineate the foundational assumptions, learning frameworks, and applicable scenarios of various deep learning-driven VAD routes.
We showcase our latest NSVAD research in industrial IoT and smart cities, along with an end-cloud collaborative architecture for deployable NSVAD.
arXiv Detail & Related papers (2024-05-16T02:00:44Z) - Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future [130.87142103774752]
This review systematically assesses over seventy open-source autonomous driving datasets.
It offers insights into various aspects, such as the principles underlying the creation of high-quality datasets.
It also delves into the scientific and technical challenges that warrant resolution.
arXiv Detail & Related papers (2023-12-06T10:46:53Z) - Smart Infrastructure: A Research Junction [5.172393727004225]
We introduce an intelligent research infrastructure equipped with visual sensor technology, located at a public inner-city junction in Aschaffenburg, Germany.
A multiple-view camera system monitors the traffic situation to perceive road users' behavior.
The system is used for research in data generation, evaluating new HAD sensors systems, algorithms, and Artificial Intelligence (AI) training strategies.
arXiv Detail & Related papers (2023-07-12T14:04:12Z) - Robustness of Physics-Informed Neural Networks to Noise in Sensor Data [0.0]
PINNs have been shown to be an effective way of incorporating physics-based domain knowledge into neural network models.
In this work, we conduct a preliminary investigation of the robustness of physics-informed neural networks to the magnitude of noise in the data.
arXiv Detail & Related papers (2022-11-22T06:24:43Z) - Federated Deep Learning Meets Autonomous Vehicle Perception: Design and
Verification [168.67190934250868]
Federated learning empowered connected autonomous vehicle (FLCAV) has been proposed.
FLCAV preserves privacy while reducing communication and annotation costs.
It is challenging to determine the network resources and road sensor poses for multi-stage training.
arXiv Detail & Related papers (2022-06-03T23:55:45Z) - Data Heterogeneity-Robust Federated Learning via Group Client Selection
in Industrial IoT [57.67687126339891]
FedGS is a hierarchical cloud-edge-end FL framework for 5G empowered industries.
Taking advantage of naturally clustered factory devices, FedGS uses a gradient-based binary permutation algorithm.
Experiments show that FedGS improves accuracy by 3.5% and reduces training rounds by 59% on average.
arXiv Detail & Related papers (2022-02-03T10:48:17Z) - Integrate-and-Fire Neurons for Low-Powered Pattern Recognition [0.0]
We introduce a low-powered neuron model called Integrate-and-Fire which exploits the charge and discharge properties of the capacitor.
Using parallel and series RC circuits, we developed a trainable neuron model that can be expressed in a recurrent form.
This paper is the full text of the research, presented at the 20th International Conference on Artificial Intelligence and Soft Computing Web System (ICAISC 2021)
arXiv Detail & Related papers (2021-06-28T12:08:00Z) - Heterogeneous Noisy Short Signal Camouflage in Multi-Domain Environment
Decision-Making [0.0]
We propose an approach to hide information (sensor signal) by transforming it to an image or an audio signal.
In one of the latest attempts to the military modernization, we investigate the challenges of enabling an intelligent identification and detection operation.
arXiv Detail & Related papers (2021-06-02T22:59:58Z) - Occupancy Detection in Room Using Sensor Data [0.0]
This paper provides a solution to detect occupancy using sensor data by using and testing several variables.
Seven famous algorithms in Machine Learning, namely as Decision Tree, Random Forest, Gradient Boosting Machine, Logistic Regression, Naive Bayes, Kernelized SVM and K-Nearest Neighbors are tested and compared.
arXiv Detail & Related papers (2021-01-10T19:53:57Z) - Synthetic Data: Opening the data floodgates to enable faster, more
directed development of machine learning methods [96.92041573661407]
Many ground-breaking advancements in machine learning can be attributed to the availability of a large volume of rich data.
Many large-scale datasets are highly sensitive, such as healthcare data, and are not widely available to the machine learning community.
Generating synthetic data with privacy guarantees provides one such solution.
arXiv Detail & Related papers (2020-12-08T17:26:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.