DexDLO: Learning Goal-Conditioned Dexterous Policy for Dynamic
Manipulation of Deformable Linear Objects
- URL: http://arxiv.org/abs/2312.15204v1
- Date: Sat, 23 Dec 2023 09:26:20 GMT
- Title: DexDLO: Learning Goal-Conditioned Dexterous Policy for Dynamic
Manipulation of Deformable Linear Objects
- Authors: Sun Zhaole, Jihong Zhu, Robert B. Fisher
- Abstract summary: We present DexDLO, a model-free framework that learns dexterous dynamic manipulation policies for deformable linear objects with a fixed-base dexterous hand.
We show that our framework can efficiently learn five different DLO manipulation tasks with the same framework parameters.
- Score: 7.72979328949568
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Deformable linear object (DLO) manipulation is needed in many fields.
Previous research on deformable linear object (DLO) manipulation has primarily
involved parallel jaw gripper manipulation with fixed grasping positions.
However, the potential for dexterous manipulation of DLOs using an
anthropomorphic hand is under-explored. We present DexDLO, a model-free
framework that learns dexterous dynamic manipulation policies for deformable
linear objects with a fixed-base dexterous hand in an end-to-end way. By
abstracting several common DLO manipulation tasks into goal-conditioned tasks,
our DexDLO can perform these tasks, such as DLO grabbing, DLO pulling, DLO
end-tip position controlling, etc. Using the Mujoco physics simulator, we
demonstrate that our framework can efficiently and effectively learn five
different DLO manipulation tasks with the same framework parameters. We further
provide a thorough analysis of learned policies, reward functions, and reduced
observations for a comprehensive understanding of the framework.
Related papers
- Certifiably Safe Manipulation of Deformable Linear Objects via Joint Shape and Tension Prediction [0.11510009152620666]
We propose a certifiably safe motion planning and framework for DLO manipulation.<n>At the core of our method is a predictive model that jointly estimates the DLO's future shape and tension.<n>Compared to state-of-theart methods, our approach achieves a higher task success rate while avoiding all safety violations.
arXiv Detail & Related papers (2025-05-20T03:50:29Z) - Latent Diffusion Planning for Imitation Learning [78.56207566743154]
Latent Diffusion Planning (LDP) is a modular approach consisting of a planner and inverse dynamics model.
By separating planning from action prediction, LDP can benefit from the denser supervision signals of suboptimal and action-free data.
On simulated visual robotic manipulation tasks, LDP outperforms state-of-the-art imitation learning approaches.
arXiv Detail & Related papers (2025-04-23T17:53:34Z) - Deformable Linear Object Surface Placement Using Elastica Planning and Local Shape Control [0.0]
This paper describes a two-layered approach for placing deformable linear objects (DLOs) on a flat surface using a single robot hand.
The high-level layer is a novel DLO surface placement method based on's elastica solutions.
The low-level layer forms a pipeline controller.
arXiv Detail & Related papers (2025-03-11T15:33:36Z) - Dynamic Path Navigation for Motion Agents with LLM Reasoning [69.5875073447454]
Large Language Models (LLMs) have demonstrated strong generalizable reasoning and planning capabilities.
We explore the zero-shot navigation and path generation capabilities of LLMs by constructing a dataset and proposing an evaluation protocol.
We demonstrate that, when tasks are well-structured in this manner, modern LLMs exhibit substantial planning proficiency in avoiding obstacles while autonomously refining navigation with the generated motion to reach the target.
arXiv Detail & Related papers (2025-03-10T13:39:09Z) - A Distributional Treatment of Real2Sim2Real for Vision-Driven Deformable Linear Object Manipulation [16.26835655544884]
We present an integrated framework for manipulating deformable linear objects (DLOs) based on visual perception.
We compute the posterior distributions for the physical parameters using which we can approximately simulate the behaviour of each specific DLO.
We then study the implications of the resulting domain distributions in sim-based policy learning and real-world performance.
arXiv Detail & Related papers (2025-02-25T20:01:06Z) - DEFT: Differentiable Branched Discrete Elastic Rods for Modeling Furcated DLOs in Real-Time [10.058145238101655]
Differentiable discrete branched Elastic rods for modeling Furcated DLOs in real-Time (DEFT)
This paper presents Differentiable discrete branched Elastic rods for modeling Furcated DLOs in real-Time (DEFT)
arXiv Detail & Related papers (2025-02-20T20:46:09Z) - Scaling Autonomous Agents via Automatic Reward Modeling And Planning [52.39395405893965]
Large language models (LLMs) have demonstrated remarkable capabilities across a range of tasks.
However, they still struggle with problems requiring multi-step decision-making and environmental feedback.
We propose a framework that can automatically learn a reward model from the environment without human annotations.
arXiv Detail & Related papers (2025-02-17T18:49:25Z) - MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation [52.739500459903724]
Large Language Models (LLMs) have demonstrated remarkable planning abilities across various domains, including robotics manipulation and navigation.
We propose a novel multi-agent LLM framework that distributes high-level planning and low-level control code generation across specialized LLM agents.
We evaluate our approach on nine RLBench tasks, including long-horizon tasks, and demonstrate its ability to solve robotics manipulation in a zero-shot setting.
arXiv Detail & Related papers (2024-11-26T17:53:44Z) - Modulating Reservoir Dynamics via Reinforcement Learning for Efficient Robot Skill Synthesis [0.0]
A random recurrent neural network, called a reservoir, can be used to learn robot movements conditioned on context inputs.
In this work, we propose a novel RC-based Learning from Demonstration (LfD) framework.
arXiv Detail & Related papers (2024-11-17T07:25:54Z) - Deep Learning for Koopman-based Dynamic Movement Primitives [0.0]
We propose a novel approach by joining the theories of Koopman Operators and Dynamic Movement Primitives to Learning from Demonstration.
Our approach, named glsadmd, projects nonlinear dynamical systems into linear latent spaces such that a solution reproduces the desired complex motion.
Our results are comparable to the Extended Dynamic Mode Decomposition on the LASA Handwriting dataset but with training on only a small fractions of the letters.
arXiv Detail & Related papers (2023-12-06T07:33:22Z) - Learning Quasi-Static 3D Models of Markerless Deformable Linear Objects
for Bimanual Robotic Manipulation [6.212335606641129]
This paper analyzes several learning-based 3D models of the Deformable Linear Objects (DLOs)
We propose a new one based on the Transformer architecture that achieves superior accuracy, even on the DLOs of different lengths.
We also introduce a data augmentation technique, which improves the prediction performance of almost all considered DLO data-driven models.
arXiv Detail & Related papers (2023-09-14T11:17:43Z) - Model-Based Reinforcement Learning with Multi-Task Offline Pretraining [59.82457030180094]
We present a model-based RL method that learns to transfer potentially useful dynamics and action demonstrations from offline data to a novel task.
The main idea is to use the world models not only as simulators for behavior learning but also as tools to measure the task relevance.
We demonstrate the advantages of our approach compared with the state-of-the-art methods in Meta-World and DeepMind Control Suite.
arXiv Detail & Related papers (2023-06-06T02:24:41Z) - DLOFTBs -- Fast Tracking of Deformable Linear Objects with B-splines [1.3836565669337055]
This paper proposes an algorithm for fast-tracking the shape of a DLO based on the masked image.
Experiments show that our solution outperforms the State-of-the-Art approaches in DLO's shape reconstruction accuracy and algorithm running time.
arXiv Detail & Related papers (2023-02-27T11:54:04Z) - In-Distribution Barrier Functions: Self-Supervised Policy Filters that
Avoid Out-of-Distribution States [84.24300005271185]
We propose a control filter that wraps any reference policy and effectively encourages the system to stay in-distribution with respect to offline-collected safe demonstrations.
Our method is effective for two different visuomotor control tasks in simulation environments, including both top-down and egocentric view settings.
arXiv Detail & Related papers (2023-01-27T22:28:19Z) - Visual Foresight With a Local Dynamics Model [1.370633147306388]
We propose the Local Dynamics Model (LDM) which efficiently learns the state-transition function for single-step manipulation primitives.
By combining the LDM with model-free policy learning, we can learn policies which can solve complex manipulation tasks using one-step lookahead planning.
arXiv Detail & Related papers (2022-06-29T17:58:14Z) - Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and
Heuristic Rule-based Methods for Object Manipulation [118.27432851053335]
This paper presents an overview and comparative analysis of our systems designed for the following two tracks in SAPIEN ManiSkill Challenge 2021: No Interaction Track.
The No Interaction track targets for learning policies from pre-collected demonstration trajectories.
In this track, we design a Heuristic Rule-based Method (HRM) to trigger high-quality object manipulation by decomposing the task into a series of sub-tasks.
For each sub-task, the simple rule-based controlling strategies are adopted to predict actions that can be applied to robotic arms.
arXiv Detail & Related papers (2022-06-13T16:20:42Z) - Learning Multi-Objective Curricula for Deep Reinforcement Learning [55.27879754113767]
Various automatic curriculum learning (ACL) methods have been proposed to improve the sample efficiency and final performance of deep reinforcement learning (DRL)
In this paper, we propose a unified automatic curriculum learning framework to create multi-objective but coherent curricula.
In addition to existing hand-designed curricula paradigms, we further design a flexible memory mechanism to learn an abstract curriculum.
arXiv Detail & Related papers (2021-10-06T19:30:25Z) - Neural Dynamic Policies for End-to-End Sensorimotor Learning [51.24542903398335]
The current dominant paradigm in sensorimotor control, whether imitation or reinforcement learning, is to train policies directly in raw action spaces.
We propose Neural Dynamic Policies (NDPs) that make predictions in trajectory distribution space.
NDPs outperform the prior state-of-the-art in terms of either efficiency or performance across several robotic control tasks.
arXiv Detail & Related papers (2020-12-04T18:59:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.