Optimizing Speech Emotion Recognition using Manta-Ray Based Feature
Selection
- URL: http://arxiv.org/abs/2009.08909v1
- Date: Fri, 18 Sep 2020 16:09:34 GMT
- Title: Optimizing Speech Emotion Recognition using Manta-Ray Based Feature
Selection
- Authors: Soham Chattopadhyay, Arijit Dey, Hritam Basak
- Abstract summary: We show that concatenation of features, extracted by using different existing feature extraction methods can boost the classification accuracy.
We also perform a novel application of Manta Ray optimization in speech emotion recognition tasks that resulted in a state-of-the-art result.
- Score: 1.4502611532302039
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Emotion recognition from audio signals has been regarded as a challenging
task in signal processing as it can be considered as a collection of static and
dynamic classification tasks. Recognition of emotions from speech data has been
heavily relied upon end-to-end feature extraction and classification using
machine learning models, though the absence of feature selection and
optimization have restrained the performance of these methods. Recent studies
have shown that Mel Frequency Cepstral Coefficients (MFCC) have been emerged as
one of the most relied feature extraction methods, though it circumscribes the
accuracy of classification with a very small feature dimension. In this paper,
we propose that the concatenation of features, extracted by using different
existing feature extraction methods can not only boost the classification
accuracy but also expands the possibility of efficient feature selection. We
have used Linear Predictive Coding (LPC) apart from the MFCC feature extraction
method, before feature merging. Besides, we have performed a novel application
of Manta Ray optimization in speech emotion recognition tasks that resulted in
a state-of-the-art result in this field. We have evaluated the performance of
our model using SAVEE and Emo-DB, two publicly available datasets. Our proposed
method outperformed all the existing methods in speech emotion analysis and
resulted in a decent result in these two datasets with a classification
accuracy of 97.06% and 97.68% respectively.
Related papers
- EMG-Based Hand Gesture Recognition through Diverse Domain Feature Enhancement and Machine Learning-Based Approach [1.8796659304823702]
Surface electromyography (EMG) serves as a pivotal tool in hand gesture recognition and human-computer interaction.
This study presents a novel methodology for classifying hand gestures using EMG signals.
arXiv Detail & Related papers (2024-08-25T04:55:42Z) - Iterative Feature Boosting for Explainable Speech Emotion Recognition [17.568724398229232]
We present a new supervised SER method based on an efficient feature engineering approach.
We pay particular attention to the explainability of results to evaluate feature relevance and refine feature sets.
The proposed method outperforms human-level performance (HLP) and state-of-the-art machine learning methods in emotion recognition on the TESS dataset.
arXiv Detail & Related papers (2024-05-30T15:44:27Z) - Feature Selection as Deep Sequential Generative Learning [50.00973409680637]
We develop a deep variational transformer model over a joint of sequential reconstruction, variational, and performance evaluator losses.
Our model can distill feature selection knowledge and learn a continuous embedding space to map feature selection decision sequences into embedding vectors associated with utility scores.
arXiv Detail & Related papers (2024-03-06T16:31:56Z) - Convolutional autoencoder-based multimodal one-class classification [80.52334952912808]
One-class classification refers to approaches of learning using data from a single class only.
We propose a deep learning one-class classification method suitable for multimodal data.
arXiv Detail & Related papers (2023-09-25T12:31:18Z) - Tuning Pre-trained Model via Moment Probing [62.445281364055795]
We propose a novel Moment Probing (MP) method to explore the potential of LP.
MP performs a linear classification head based on the mean of final features.
Our MP significantly outperforms LP and is competitive with counterparts at less training cost.
arXiv Detail & Related papers (2023-07-21T04:15:02Z) - Decision Forest Based EMG Signal Classification with Low Volume Dataset
Augmented with Random Variance Gaussian Noise [51.76329821186873]
We produce a model that can classify six different hand gestures with a limited number of samples that generalizes well to a wider audience.
We appeal to a set of more elementary methods such as the use of random bounds on a signal, but desire to show the power these methods can carry in an online setting.
arXiv Detail & Related papers (2022-06-29T23:22:18Z) - Compactness Score: A Fast Filter Method for Unsupervised Feature
Selection [66.84571085643928]
We propose a fast unsupervised feature selection method, named as, Compactness Score (CSUFS) to select desired features.
Our proposed algorithm seems to be more accurate and efficient compared with existing algorithms.
arXiv Detail & Related papers (2022-01-31T13:01:37Z) - Automated Feature Extraction on AsMap for Emotion Classification using
EEG [0.0]
The asymmetry in the different brain regions are captured in a 2-D vector, termed as AsMap from the differential entropy (DE) features of EEG signals.
AsMaps are then used to extract features automatically using Convolutional Neural Network (CNN) model.
Highest classification accuracy of 97.10% is achieved on 3-class classification problem using SEED dataset.
arXiv Detail & Related papers (2022-01-28T11:38:29Z) - A concise method for feature selection via normalized frequencies [0.0]
In this paper, a concise method is proposed for universal feature selection.
The proposed method uses a fusion of the filter method and the wrapper method, rather than a combination of them.
The evaluation results show that the proposed method outperformed several state-of-the-art related works in terms of accuracy, precision, recall, F-score and AUC.
arXiv Detail & Related papers (2021-06-10T15:29:54Z) - Noise-resistant Deep Metric Learning with Ranking-based Instance
Selection [59.286567680389766]
We propose a noise-resistant training technique for DML, which we name Probabilistic Ranking-based Instance Selection with Memory (PRISM)
PRISM identifies noisy data in a minibatch using average similarity against image features extracted from several previous versions of the neural network.
To alleviate the high computational cost brought by the memory bank, we introduce an acceleration method that replaces individual data points with the class centers.
arXiv Detail & Related papers (2021-03-30T03:22:17Z) - Effect of Analysis Window and Feature Selection on Classification of
Hand Movements Using EMG Signal [0.20999222360659603]
Recently, research on myoelectric control based on pattern recognition (PR) shows promising results with the aid of machine learning classifiers.
By offering multiple class movements and intuitive control, this method has the potential to power an amputated subject to perform everyday life movements.
We show that effective data preprocessing and optimum feature selection helps to improve the classification accuracy of hand movements.
arXiv Detail & Related papers (2020-02-02T19:03:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.