Bayesian Convolutional Neural Networks for Seven Basic Facial Expression
Classifications
- URL: http://arxiv.org/abs/2107.04834v2
- Date: Tue, 13 Jul 2021 13:05:36 GMT
- Title: Bayesian Convolutional Neural Networks for Seven Basic Facial Expression
Classifications
- Authors: Yuan Tai, Yihua Tan, Wei Gong, Hailan Huang
- Abstract summary: Seven basic facial expression classifications are a basic way to express complex human emotions.
Based on the traditional Bayesian neural network framework, the ResNet18_BNN network constructed in this paper has been improved.
- Score: 5.365808418695478
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The seven basic facial expression classifications are a basic way to express
complex human emotions and are an important part of artificial intelligence
research. Based on the traditional Bayesian neural network framework, the
ResNet18_BNN network constructed in this paper has been improved in the
following three aspects: (1) A new objective function is proposed, which is
composed of the KL loss of uncertain parameters and the intersection of
specific parameters. Entropy loss composition. (2) Aiming at a special
objective function, a training scheme for alternately updating these two
parameters is proposed. (3) Only model the parameters of the last convolution
group. Through testing on the FER2013 test set, we achieved 71.5% and 73.1%
accuracy in PublicTestSet and PrivateTestSet, respectively. Compared with
traditional Bayesian neural networks, our method brings the highest
classification accuracy gain.
Related papers
- Entanglement Classification of Arbitrary Three-Qubit States via Artificial Neural Networks [2.715284063484557]
We design and implement artificial neural networks (ANNs) to detect and classify entanglement for three-qubit systems.
The models are trained and validated on a simulated dataset of randomly generated states.
Remarkably, we find that feeding only 7 diagonal elements of the density matrix into the ANN results in an accuracy greater than 94% for both tasks.
arXiv Detail & Related papers (2024-11-18T06:50:10Z) - Sparse Deep Learning Models with the $\ell_1$ Regularization [6.268040192346312]
Sparse neural networks are highly desirable in deep learning.
We study how choices of regularization parameters influence the sparsity level of learned neural networks.
arXiv Detail & Related papers (2024-08-05T19:38:45Z) - Robust Localization of Key Fob Using Channel Impulse Response of Ultra
Wide Band Sensors for Keyless Entry Systems [12.313730356985019]
Using neural networks for localization of key fob within and surrounding a car as a security feature for keyless entry is fast emerging.
The model's performance improved by 67% at certain ranges of adversarial magnitude for fast gradient sign method and 37% each for basic iterative method and projected gradient descent method.
arXiv Detail & Related papers (2024-01-16T22:35:14Z) - Permutation Equivariant Neural Functionals [92.0667671999604]
This work studies the design of neural networks that can process the weights or gradients of other neural networks.
We focus on the permutation symmetries that arise in the weights of deep feedforward networks because hidden layer neurons have no inherent order.
In our experiments, we find that permutation equivariant neural functionals are effective on a diverse set of tasks.
arXiv Detail & Related papers (2023-02-27T18:52:38Z) - Hybrid machine-learned homogenization: Bayesian data mining and
convolutional neural networks [0.0]
This study aims to improve the machine learned prediction by developing novel feature descriptors.
The iterative development of feature descriptors resulted in 37 novel features, being able to reduce the prediction error by roughly one third.
A combination of the feature based approach and the convolutional neural network leads to a hybrid neural network.
arXiv Detail & Related papers (2023-02-24T09:59:29Z) - Towards Better Out-of-Distribution Generalization of Neural Algorithmic
Reasoning Tasks [51.8723187709964]
We study the OOD generalization of neural algorithmic reasoning tasks.
The goal is to learn an algorithm from input-output pairs using deep neural networks.
arXiv Detail & Related papers (2022-11-01T18:33:20Z) - Learning to Learn with Generative Models of Neural Network Checkpoints [71.06722933442956]
We construct a dataset of neural network checkpoints and train a generative model on the parameters.
We find that our approach successfully generates parameters for a wide range of loss prompts.
We apply our method to different neural network architectures and tasks in supervised and reinforcement learning.
arXiv Detail & Related papers (2022-09-26T17:59:58Z) - Modeling from Features: a Mean-field Framework for Over-parameterized
Deep Neural Networks [54.27962244835622]
This paper proposes a new mean-field framework for over- parameterized deep neural networks (DNNs)
In this framework, a DNN is represented by probability measures and functions over its features in the continuous limit.
We illustrate the framework via the standard DNN and the Residual Network (Res-Net) architectures.
arXiv Detail & Related papers (2020-07-03T01:37:16Z) - Grafted network for person re-identification [14.372506245952383]
Convolutional neural networks have shown outstanding effectiveness in person re-identification (re-ID)
We propose a novel grafted network (GraftedNet), which is designed by grafting a high-accuracy rootstock and a light-weighted scion.
Experimental results show that the proposed GraftedNet achieves 93.02%, 85.3% and 76.2% in Rank-1 and 81.6%, 74.7% and 71.6% in mAP, with only 4.6M parameters.
arXiv Detail & Related papers (2020-06-02T22:33:44Z) - Beyond Dropout: Feature Map Distortion to Regularize Deep Neural
Networks [107.77595511218429]
In this paper, we investigate the empirical Rademacher complexity related to intermediate layers of deep neural networks.
We propose a feature distortion method (Disout) for addressing the aforementioned problem.
The superiority of the proposed feature map distortion for producing deep neural network with higher testing performance is analyzed and demonstrated.
arXiv Detail & Related papers (2020-02-23T13:59:13Z) - Learn to Predict Sets Using Feed-Forward Neural Networks [63.91494644881925]
This paper addresses the task of set prediction using deep feed-forward neural networks.
We present a novel approach for learning to predict sets with unknown permutation and cardinality.
We demonstrate the validity of our set formulations on relevant vision problems.
arXiv Detail & Related papers (2020-01-30T01:52:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.