Loss Function Considering Dead Zone for Neural Networks
- URL: http://arxiv.org/abs/2402.00393v1
- Date: Thu, 1 Feb 2024 07:28:55 GMT
- Title: Loss Function Considering Dead Zone for Neural Networks
- Authors: Koki Inami, Koki Yamane, Sho Sakaino
- Abstract summary: We propose a new loss function that considers only errors of joints not in dead zones.
Experiments on actual equipment using a three-degree-of-freedom (DOF) manipulator showed higher accuracy than conventional methods.
- Score: 0.8287206589886879
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: It is important to reveal the inverse dynamics of manipulators to improve
control performance of model-based control. Neural networks (NNs) are promising
techniques to represent complicated inverse dynamics while they require a large
amount of motion data. However, motion data in dead zones of actuators is not
suitable for training models decreasing the number of useful training data. In
this study, based on the fact that the manipulator joint does not work
irrespective of input torque in dead zones, we propose a new loss function that
considers only errors of joints not in dead zones. The proposed method enables
to increase in the amount of motion data available for training and the
accuracy of the inverse dynamics computation. Experiments on actual equipment
using a three-degree-of-freedom (DOF) manipulator showed higher accuracy than
conventional methods. We also confirmed and discussed the behavior of the model
of the proposed method in dead zones.
Related papers
- 3D Multi-Object Tracking with Semi-Supervised GRU-Kalman Filter [6.13623925528906]
3D Multi-Object Tracking (MOT) is essential for intelligent systems like autonomous driving and robotic sensing.
We propose a GRU-based MOT method, which introduces a learnable Kalman filter into the motion module.
This approach is able to learn object motion characteristics through data-driven learning, thereby avoiding the need for manual model design and model error.
arXiv Detail & Related papers (2024-11-13T08:34:07Z) - Analysis of Truncated Singular Value Decomposition for Koopman Operator-Based Lane Change Model [0.0]
Singular Value Decomposition (SVD) is employed to approximate Koopman operators from extensive datasets efficiently.
This study evaluates different basis functions used in EDMD and ranks for truncated SVD for representing lane change behavior models.
The findings, however, suggest that the technique of truncated SVD does not necessarily achieve substantial reductions in computational training time and results in significant information loss.
arXiv Detail & Related papers (2024-09-27T09:45:21Z) - Unsupervised Discovery of Interpretable Directions in h-space of
Pre-trained Diffusion Models [63.1637853118899]
We propose the first unsupervised and learning-based method to identify interpretable directions in h-space of pre-trained diffusion models.
We employ a shift control module that works on h-space of pre-trained diffusion models to manipulate a sample into a shifted version of itself.
By jointly optimizing them, the model will spontaneously discover disentangled and interpretable directions.
arXiv Detail & Related papers (2023-10-15T18:44:30Z) - DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions [52.63323657077447]
We propose DNMOT, an end-to-end trainable DeNoising Transformer for multiple object tracking.
Specifically, we augment the trajectory with noises during training and make our model learn the denoising process in an encoder-decoder architecture.
We conduct extensive experiments on the MOT17, MOT20, and DanceTrack datasets, and the experimental results show that our method outperforms previous state-of-the-art methods by a clear margin.
arXiv Detail & Related papers (2023-09-09T04:40:01Z) - Value function estimation using conditional diffusion models for control [62.27184818047923]
We propose a simple algorithm called Diffused Value Function (DVF)
It learns a joint multi-step model of the environment-robot interaction dynamics using a diffusion model.
We show how DVF can be used to efficiently capture the state visitation measure for multiple controllers.
arXiv Detail & Related papers (2023-06-09T18:40:55Z) - Real-to-Sim: Predicting Residual Errors of Robotic Systems with Sparse
Data using a Learning-based Unscented Kalman Filter [65.93205328894608]
We learn the residual errors between a dynamic and/or simulator model and the real robot.
We show that with the learned residual errors, we can further close the reality gap between dynamic models, simulations, and actual hardware.
arXiv Detail & Related papers (2022-09-07T15:15:12Z) - Truncated tensor Schatten p-norm based approach for spatiotemporal
traffic data imputation with complicated missing patterns [77.34726150561087]
We introduce four complicated missing patterns, including missing and three fiber-like missing cases according to the mode-drivenn fibers.
Despite nonity of the objective function in our model, we derive the optimal solutions by integrating alternating data-mputation method of multipliers.
arXiv Detail & Related papers (2022-05-19T08:37:56Z) - Convolutional generative adversarial imputation networks for
spatio-temporal missing data in storm surge simulations [86.5302150777089]
Generative Adversarial Imputation Nets (GANs) and GAN-based techniques have attracted attention as unsupervised machine learning methods.
We name our proposed method as Con Conval Generative Adversarial Imputation Nets (Conv-GAIN)
arXiv Detail & Related papers (2021-11-03T03:50:48Z) - Unsupervised Model Drift Estimation with Batch Normalization Statistics
for Dataset Shift Detection and Model Selection [0.0]
We propose a novel method of model drift estimation by exploiting statistics of batch normalization layer on unlabeled test data.
We show the effectiveness of our method not only on dataset shift detection but also on model selection when there are multiple candidate models among model zoo or training trajectories in an unsupervised way.
arXiv Detail & Related papers (2021-07-01T03:04:47Z) - Unsupervised MR Motion Artifact Deep Learning using Outlier-Rejecting
Bootstrap Aggregation [37.41561581618164]
We propose a novel unsupervised deep learning scheme through outlier-rejecting bootstrap subsampling and aggregation.
Our method does not require any paired data because the training step only requires artifact-free images.
We verify that our method can be applied for artifact correction from simulated motion as well as real motion from TSM successfully.
arXiv Detail & Related papers (2020-11-12T12:10:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.