Multimodal Multi-User Surface Recognition with the Kernel Two-Sample
Test
- URL: http://arxiv.org/abs/2303.04930v1
- Date: Wed, 8 Mar 2023 22:58:55 GMT
- Title: Multimodal Multi-User Surface Recognition with the Kernel Two-Sample
Test
- Authors: Behnam Khojasteh, Friedrich Solowjow, Sebastian Trimpe, Katherine J.
Kuchenbecker
- Abstract summary: We propose a framework that can handle heterogeneous data sources for classification tasks.
Our data-versus-data approach automatically quantifies distinctive differences in distributions in a high-dimensional space.
We achieve 97.2% accuracy on a standard multi-user dataset with 108 surface classes, outperforming the state-of-the-art machine-learning algorithm by 6% on a more difficult version of the task.
- Score: 15.051737123188174
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Machine learning and deep learning have been used extensively to classify
physical surfaces through images and time-series contact data. However, these
methods rely on human expertise and entail the time-consuming processes of data
and parameter tuning. To overcome these challenges, we propose an easily
implemented framework that can directly handle heterogeneous data sources for
classification tasks. Our data-versus-data approach automatically quantifies
distinctive differences in distributions in a high-dimensional space via kernel
two-sample testing between two sets extracted from multimodal data (e.g.,
images, sounds, haptic signals). We demonstrate the effectiveness of our
technique by benchmarking against expertly engineered classifiers for
visual-audio-haptic surface recognition due to the industrial relevance,
difficulty, and competitive baselines of this application; ablation studies
confirm the utility of key components of our pipeline. As shown in our
open-source code, we achieve 97.2% accuracy on a standard multi-user dataset
with 108 surface classes, outperforming the state-of-the-art machine-learning
algorithm by 6% on a more difficult version of the task. The fact that our
classifier obtains this performance with minimal data processing in the
standard algorithm setting reinforces the powerful nature of kernel methods for
learning to recognize complex patterns.
Related papers
- Leveraging Mixture of Experts for Improved Speech Deepfake Detection [53.69740463004446]
Speech deepfakes pose a significant threat to personal security and content authenticity.
We introduce a novel approach for enhancing speech deepfake detection performance using a Mixture of Experts architecture.
arXiv Detail & Related papers (2024-09-24T13:24:03Z) - Continual Learning for Multimodal Data Fusion of a Soft Gripper [1.0589208420411014]
A model trained on one data modality often fails when tested with a different modality.
We introduce a continual learning algorithm capable of incrementally learning different data modalities.
We evaluate the algorithm's effectiveness on a challenging custom multimodal dataset.
arXiv Detail & Related papers (2024-09-20T09:53:27Z) - Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution [62.71425232332837]
We show that training amortized models with noisy labels is inexpensive and surprisingly effective.
This approach significantly accelerates several feature attribution and data valuation methods, often yielding an order of magnitude speedup over existing approaches.
arXiv Detail & Related papers (2024-01-29T03:42:37Z) - Learning from the Best: Active Learning for Wireless Communications [9.523381807291049]
Active learning algorithms identify the most critical and informative samples in an unlabeled dataset and label only those samples, instead of the complete set.
We present a case study of deep learning-based mmWave beam selection, where labeling is performed by a compute-intensive algorithm based on exhaustive search.
Our results show that using an active learning algorithm for class-imbalanced datasets can reduce labeling overhead by up to 50% for this dataset.
arXiv Detail & Related papers (2024-01-23T12:21:57Z) - A Weighted K-Center Algorithm for Data Subset Selection [70.49696246526199]
Subset selection is a fundamental problem that can play a key role in identifying smaller portions of the training data.
We develop a novel factor 3-approximation algorithm to compute subsets based on the weighted sum of both k-center and uncertainty sampling objective functions.
arXiv Detail & Related papers (2023-12-17T04:41:07Z) - Understanding learning from EEG data: Combining machine learning and
feature engineering based on hidden Markov models and mixed models [0.0]
Frontal theta oscillations are thought to play an important role in spatial navigation and memory.
EEG datasets are very complex, making changes in the neural signal related to behaviour difficult to interpret.
This paper proposes using hidden Markov and linear mixed effects models to extract features from EEG data.
arXiv Detail & Related papers (2023-11-14T12:24:12Z) - A Dimensional Structure based Knowledge Distillation Method for
Cross-Modal Learning [15.544134849816528]
We discover the correlation between feature discriminability and dimensional structure (DS) by analyzing and observing features extracted from simple and hard tasks.
We propose a novel cross-modal knowledge distillation (CMKD) method for better supervised cross-modal learning (CML) performance.
The proposed method enforces output features to be channel-wise independent and intermediate ones to be uniformly distributed, thereby learning semantically irrelevant features from the hard task to boost its accuracy.
arXiv Detail & Related papers (2023-06-28T07:29:26Z) - Decision Forest Based EMG Signal Classification with Low Volume Dataset
Augmented with Random Variance Gaussian Noise [51.76329821186873]
We produce a model that can classify six different hand gestures with a limited number of samples that generalizes well to a wider audience.
We appeal to a set of more elementary methods such as the use of random bounds on a signal, but desire to show the power these methods can carry in an online setting.
arXiv Detail & Related papers (2022-06-29T23:22:18Z) - Meta Two-Sample Testing: Learning Kernels for Testing with Limited Data [21.596650236820377]
We introduce the problem of meta two-sample testing (M2ST)
M2ST aims to exploit (abundant) auxiliary data on related tasks to find an algorithm that can quickly identify a powerful test on new target tasks.
We provide both theoretical justification and empirical evidence that our proposed meta-testing schemes out-perform learning kernel-based tests directly from scarce observations.
arXiv Detail & Related papers (2021-06-14T17:52:50Z) - Fast accuracy estimation of deep learning based multi-class musical
source separation [79.10962538141445]
We propose a method to evaluate the separability of instruments in any dataset without training and tuning a neural network.
Based on the oracle principle with an ideal ratio mask, our approach is an excellent proxy to estimate the separation performances of state-of-the-art deep learning approaches.
arXiv Detail & Related papers (2020-10-19T13:05:08Z) - Federated Doubly Stochastic Kernel Learning for Vertically Partitioned
Data [93.76907759950608]
We propose a doubly kernel learning algorithm for vertically partitioned data.
We show that FDSKL is significantly faster than state-of-the-art federated learning methods when dealing with kernels.
arXiv Detail & Related papers (2020-08-14T05:46:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.