Tactile Grasp Refinement using Deep Reinforcement Learning and Analytic
Grasp Stability Metrics
- URL: http://arxiv.org/abs/2109.11234v1
- Date: Thu, 23 Sep 2021 09:20:19 GMT
- Title: Tactile Grasp Refinement using Deep Reinforcement Learning and Analytic
Grasp Stability Metrics
- Authors: Alexander Koenig, Zixi Liu, Lucas Janson, Robert Howe
- Abstract summary: We show that analytic grasp stability metrics constitute powerful optimization objectives for reinforcement learning algorithms.
We show that a combination of geometric and force-agnostic grasp stability metrics yields the highest average success rates of 95.4% for cuboids.
In a second experiment, we show that grasp refinement algorithms trained with contact feedback perform up to 6.6% better than a baseline that receives no tactile information.
- Score: 70.65363356763598
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Reward functions are at the heart of every reinforcement learning (RL)
algorithm. In robotic grasping, rewards are often complex and manually
engineered functions that do not rely on well-justified physical models from
grasp analysis. This work demonstrates that analytic grasp stability metrics
constitute powerful optimization objectives for RL algorithms that refine
grasps on a three-fingered hand using only tactile and joint position
information. We outperform a binary-reward baseline by 42.9% and find that a
combination of geometric and force-agnostic grasp stability metrics yields the
highest average success rates of 95.4% for cuboids, 93.1% for cylinders, and
62.3% for spheres across wrist position errors between 0 and 7 centimeters and
rotational errors between 0 and 14 degrees. In a second experiment, we show
that grasp refinement algorithms trained with contact feedback (contact
positions, normals, and forces) perform up to 6.6% better than a baseline that
receives no tactile information.
Related papers
- INTACT: Inducing Noise Tolerance through Adversarial Curriculum Training for LiDAR-based Safety-Critical Perception and Autonomy [0.4124847249415279]
We present a novel framework designed to enhance the robustness of deep neural networks (DNNs) against noisy LiDAR data.
IntACT combines meta-learning with adversarial curriculum training (ACT) to address challenges posed by data corruption and sparsity in 3D point clouds.
IntACT's effectiveness is demonstrated through comprehensive evaluations on object detection, tracking, and classification benchmarks.
arXiv Detail & Related papers (2025-02-04T00:02:16Z) - Kimi k1.5: Scaling Reinforcement Learning with LLMs [84.2229964736678]
We report on the training practice of Kimi k1.5, our latest multi-modal language model trained with reinforcement learning.
Long context scaling and improved policy optimization methods are key ingredients of our approach.
Our system achieves state-of-the-art reasoning performance across multiple benchmarks and modalities.
arXiv Detail & Related papers (2025-01-22T02:48:14Z) - Efficient Gravitational Wave Parameter Estimation via Knowledge Distillation: A ResNet1D-IAF Approach [2.4184866684341473]
This study presents a novel approach using knowledge distillation techniques to enhance computational efficiency in gravitational wave analysis.
We develop a framework combining ResNet1D and Inverse Autoregressive Flow (IAF) architectures, where knowledge from a complex teacher model is transferred to a lighter student model.
Our experimental results show that the student model achieves a validation loss of 3.70 with optimal configuration (40,100,0.75), compared to the teacher model's 4.09, while reducing the number of parameters by 43%.
arXiv Detail & Related papers (2024-12-11T03:56:46Z) - SLYKLatent: A Learning Framework for Gaze Estimation Using Deep Facial Feature Learning [0.0]
We present SLYKLatent, a novel approach for enhancing gaze estimation by addressing appearance instability challenges in datasets.
SLYKLatent utilizes Self-Supervised Learning for initial training with facial expression datasets, followed by refinement with a patch-based tri-branch network.
Our evaluation on benchmark datasets achieves a 10.9% improvement on Gaze360, supersedes top MPIIFaceGaze results with 3.8%, and leads on a subset of ETH-XGaze by 11.6%.
arXiv Detail & Related papers (2024-02-02T16:47:18Z) - Tool-Augmented Reward Modeling [58.381678612409]
We propose a tool-augmented preference modeling approach, named Themis, to address limitations by empowering RMs with access to external environments.
Our study delves into the integration of external tools into RMs, enabling them to interact with diverse external sources.
In human evaluations, RLHF trained with Themis attains an average win rate of 32% when compared to baselines.
arXiv Detail & Related papers (2023-10-02T09:47:40Z) - Improving Machine Learning Robustness via Adversarial Training [3.7942983866014073]
We investigate ML robustness using adversarial training in centralized and decentralized environments.
In the centralized environment, we achieve a test accuracy of 65.41% and 83.0% when classifying adversarial examples.
In the decentralized environment, we study Federated learning (FL) robustness by using adversarial training with independent and identically distributed (IID) and non-IID data.
arXiv Detail & Related papers (2023-09-22T02:43:04Z) - A Lightweight and Accurate Face Detection Algorithm Based on Retinaface [0.5076419064097734]
We propose a lightweight and accurate face detection algorithm LAFD (Light and accurate face detection) based on Retinaface.
Backbone network in the algorithm is a modified MobileNetV3 network which adjusts the size of the convolution kernel.
If the input image is pre-processed and scaled to 1560px in length or 1200px in width, the model achieves an average accuracy of 86.2%.
arXiv Detail & Related papers (2023-08-08T15:36:57Z) - Robust Learning with Progressive Data Expansion Against Spurious
Correlation [65.83104529677234]
We study the learning process of a two-layer nonlinear convolutional neural network in the presence of spurious features.
Our analysis suggests that imbalanced data groups and easily learnable spurious features can lead to the dominance of spurious features during the learning process.
We propose a new training algorithm called PDE that efficiently enhances the model's robustness for a better worst-group performance.
arXiv Detail & Related papers (2023-06-08T05:44:06Z) - Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary
Data [100.33096338195723]
We focus on Few-shot Learning with Auxiliary Data (FLAD)
FLAD assumes access to auxiliary data during few-shot learning in hopes of improving generalization.
We propose two algorithms -- EXP3-FLAD and UCB1-FLAD -- and compare them with prior FLAD methods that either explore or exploit.
arXiv Detail & Related papers (2023-02-01T18:59:36Z) - 3D Pose Based Feedback for Physical Exercises [87.35086507661227]
We introduce a learning-based framework that identifies the mistakes made by a user.
Our framework does not rely on hard-coded rules, instead, it learns them from data.
Our approach yields 90.9% mistake identification accuracy and successfully corrects 94.2% of the mistakes.
arXiv Detail & Related papers (2022-08-05T16:15:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.