Real-Time Polygonal Semantic Mapping for Humanoid Robot Stair Climbing
- URL: http://arxiv.org/abs/2411.01919v1
- Date: Mon, 04 Nov 2024 09:34:55 GMT
- Title: Real-Time Polygonal Semantic Mapping for Humanoid Robot Stair Climbing
- Authors: Teng Bin, Jianming Yao, Tin Lun Lam, Tianwei Zhang,
- Abstract summary: We present a novel algorithm for real-time planar semantic mapping tailored for humanoid robots navigating complex terrains such as staircases.
We utilize an anisotropic diffusion filter on depth images to effectively minimize noise from gradient jumps while preserving essential edge details.
Our approach achieves real-time performance, processing single frames at rates exceeding $30Hz$, which facilitates detailed plane extraction and map management swiftly and efficiently.
- Score: 19.786955745157453
- License:
- Abstract: We present a novel algorithm for real-time planar semantic mapping tailored for humanoid robots navigating complex terrains such as staircases. Our method is adaptable to any odometry input and leverages GPU-accelerated processes for planar extraction, enabling the rapid generation of globally consistent semantic maps. We utilize an anisotropic diffusion filter on depth images to effectively minimize noise from gradient jumps while preserving essential edge details, enhancing normal vector images' accuracy and smoothness. Both the anisotropic diffusion and the RANSAC-based plane extraction processes are optimized for parallel processing on GPUs, significantly enhancing computational efficiency. Our approach achieves real-time performance, processing single frames at rates exceeding $30~Hz$, which facilitates detailed plane extraction and map management swiftly and efficiently. Extensive testing underscores the algorithm's capabilities in real-time scenarios and demonstrates its practical application in humanoid robot gait planning, significantly improving its ability to navigate dynamic environments.
Related papers
- EDCSSM: Edge Detection with Convolutional State Space Model [3.649463841174485]
Edge detection in images is the foundation of many complex tasks in computer graphics.
Due to the feature loss caused by multi-layer convolution and pooling architectures, learning-based edge detection models often produce thick edges.
This paper presents an edge detection algorithm which effectively addresses the aforementioned issues.
arXiv Detail & Related papers (2024-09-03T05:13:25Z) - Gaussian Splatting to Real World Flight Navigation Transfer with Liquid Networks [93.38375271826202]
We present a method to improve generalization and robustness to distribution shifts in sim-to-real visual quadrotor navigation tasks.
We first build a simulator by integrating Gaussian splatting with quadrotor flight dynamics, and then, train robust navigation policies using Liquid neural networks.
In this way, we obtain a full-stack imitation learning protocol that combines advances in 3D Gaussian splatting radiance field rendering, programming of expert demonstration training data, and the task understanding capabilities of Liquid networks.
arXiv Detail & Related papers (2024-06-21T13:48:37Z) - FlowIE: Efficient Image Enhancement via Rectified Flow [71.6345505427213]
FlowIE is a flow-based framework that estimates straight-line paths from an elementary distribution to high-quality images.
Our contributions are rigorously validated through comprehensive experiments on synthetic and real-world datasets.
arXiv Detail & Related papers (2024-06-01T17:29:29Z) - Edge-Enabled Real-time Railway Track Segmentation [0.0]
We propose an edge-enabled real-time railway track segmentation algorithm.
It is optimized to be suitable for edge applications by optimizing the network structure and quantizing the model after training.
Experimental results demonstrate that our enhanced algorithm achieves an accuracy level of 83.3%.
arXiv Detail & Related papers (2024-01-21T13:45:52Z) - Unlocking the Performance of Proximity Sensors by Utilizing Transient
Histograms [20.994250740256458]
We provide methods which recover planar scene geometry by utilizing the transient histograms captured by a class of close-range time-of-flight (ToF) distance sensor.
A transient histogram is a one dimensional temporal waveform which encodes the arrival time of photons incident on the ToF sensor.
We demonstrate a simple robotics application which uses our method to sense the distance to and slope of a planar surface from a sensor mounted on the end effector of a robot arm.
arXiv Detail & Related papers (2023-08-25T16:20:41Z) - Improving Gradient Methods via Coordinate Transformations: Applications to Quantum Machine Learning [0.0]
Machine learning algorithms heavily rely on optimization algorithms based on gradients, such as gradient descent and alike.
The overall performance is dependent on the appearance of local minima and barren plateaus, which slow-down calculations and lead to non-optimal solutions.
In this paper we introduce a generic strategy to accelerate and improve the overall performance of such methods, allowing to alleviate the effect of barren plateaus and local minima.
arXiv Detail & Related papers (2023-04-13T18:26:05Z) - Globally Optimal Event-Based Divergence Estimation for Ventral Landing [55.29096494880328]
Event sensing is a major component in bio-inspired flight guidance and control systems.
We explore the usage of event cameras for predicting time-to-contact with the surface during ventral landing.
This is achieved by estimating divergence (inverse TTC), which is the rate of radial optic flow, from the event stream generated during landing.
Our core contributions are a novel contrast maximisation formulation for event-based divergence estimation, and a branch-and-bound algorithm to exactly maximise contrast and find the optimal divergence value.
arXiv Detail & Related papers (2022-09-27T06:00:52Z) - DiffSkill: Skill Abstraction from Differentiable Physics for Deformable
Object Manipulations with Tools [96.38972082580294]
DiffSkill is a novel framework that uses a differentiable physics simulator for skill abstraction to solve deformable object manipulation tasks.
In particular, we first obtain short-horizon skills using individual tools from a gradient-based simulator.
We then learn a neural skill abstractor from the demonstration trajectories which takes RGBD images as input.
arXiv Detail & Related papers (2022-03-31T17:59:38Z) - CUDA-Optimized real-time rendering of a Foveated Visual System [5.260841516691153]
We present a technique that exploits the GPU to efficiently generate Gaussian-based foveated images at high definition (1920x1080) in real-time (165 Hz)
Our algorithm can meet demand for spatially-varying processing across biological artificial agents so that foveation can be added easily on top of existing systems.
arXiv Detail & Related papers (2020-12-15T22:43:04Z) - AOT: Appearance Optimal Transport Based Identity Swapping for Forgery
Detection [76.7063732501752]
We provide a new identity swapping algorithm with large differences in appearance for face forgery detection.
The appearance gaps mainly arise from the large discrepancies in illuminations and skin colors.
A discriminator is introduced to distinguish the fake parts from a mix of real and fake image patches.
arXiv Detail & Related papers (2020-11-05T06:17:04Z) - Real-time Semantic Segmentation with Fast Attention [94.88466483540692]
We propose a novel architecture for semantic segmentation of high-resolution images and videos in real-time.
The proposed architecture relies on our fast spatial attention, which is a simple yet efficient modification of the popular self-attention mechanism.
We show that results on multiple datasets demonstrate superior performance with better accuracy and speed compared to existing approaches.
arXiv Detail & Related papers (2020-07-07T22:37:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.