Deteksi Sampah di Permukaan dan Dalam Perairan pada Objek Video dengan
Metode Robust and Efficient Post-Processing dan Tubelet-Level Bounding Box
Linking
- URL: http://arxiv.org/abs/2307.10039v1
- Date: Fri, 14 Jul 2023 04:04:15 GMT
- Title: Deteksi Sampah di Permukaan dan Dalam Perairan pada Objek Video dengan
Metode Robust and Efficient Post-Processing dan Tubelet-Level Bounding Box
Linking
- Authors: Bryan Tjandra, Made S. N. Negara, Nyoo S. C. Handoko
- Abstract summary: This paper provides an explanation of the methods that can be applied to perform video object detection in an automated trash-collecting robot.
The study utilizes the YOLOv5 model and the Robust & Efficient Post Processing (REPP) method, along with tubelet-level bounding box linking on the FloW and Roboflow datasets.
The results show that the post-processing stage and tubelet-level bounding box linking can improve the quality of detection, achieving approximately 3% better performance compared to YOLOv5 alone.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Indonesia, as a maritime country, has a significant portion of its territory
covered by water. Ineffective waste management has resulted in a considerable
amount of trash in Indonesian waters, leading to various issues. The
development of an automated trash-collecting robot can be a solution to address
this problem. The robot requires a system capable of detecting objects in
motion, such as in videos. However, using naive object detection methods in
videos has limitations, particularly when image focus is reduced and the target
object is obstructed by other objects. This paper's contribution provides an
explanation of the methods that can be applied to perform video object
detection in an automated trash-collecting robot. The study utilizes the YOLOv5
model and the Robust & Efficient Post Processing (REPP) method, along with
tubelet-level bounding box linking on the FloW and Roboflow datasets. The
combination of these methods enhances the performance of naive object detection
from YOLOv5 by considering the detection results in adjacent frames. The
results show that the post-processing stage and tubelet-level bounding box
linking can improve the quality of detection, achieving approximately 3% better
performance compared to YOLOv5 alone. The use of these methods has the
potential to detect surface and underwater trash and can be applied to a
real-time image-based trash-collecting robot. Implementing this system is
expected to mitigate the damage caused by trash in the past and improve
Indonesia's waste management system in the future.
Related papers
- Optimizing Waste Management with Advanced Object Detection for Garbage Classification [1.3499500088995462]
This paper reviews the implementation of AI models for classifying trash through object detection.
The study demonstrates how YOLO V5 can effectively identify various types of waste, including plastic, paper, glass, metal, cardboard, and biodegradables.
arXiv Detail & Related papers (2024-10-13T19:32:01Z) - Underwater Object Detection Enhancement via Channel Stabilization [12.994898879803642]
Marine trash endangers the aquatic ecosystem, presenting a persistent challenge.
We use Detectron2's backbone with various base models and configurations for this task.
We propose a novel channel stabilization technique alongside a simplified image enhancement model.
arXiv Detail & Related papers (2024-08-02T14:28:49Z) - Designing A Sustainable Marine Debris Clean-up Framework without Human Labels [0.0]
Marine debris poses a significant ecological threat to birds, fish, and other animal life.
Traditional methods for assessing debris accumulation involve labor-intensive and costly manual surveys.
This study introduces a framework that utilizes aerial imagery captured by drones to conduct remote trash surveys.
arXiv Detail & Related papers (2024-05-23T17:28:23Z) - Motion-adaptive Separable Collaborative Filters for Blind Motion Deblurring [71.60457491155451]
Eliminating image blur produced by various kinds of motion has been a challenging problem.
We propose a novel real-world deblurring filtering model called the Motion-adaptive Separable Collaborative Filter.
Our method provides an effective solution for real-world motion blur removal and achieves state-of-the-art performance.
arXiv Detail & Related papers (2024-04-19T19:44:24Z) - Diffusion-Based Particle-DETR for BEV Perception [94.88305708174796]
Bird-Eye-View (BEV) is one of the most widely-used scene representations for visual perception in Autonomous Vehicles (AVs)
Recent diffusion-based methods offer a promising approach to uncertainty modeling for visual perception but fail to effectively detect small objects in the large coverage of the BEV.
Here, we address this problem by combining the diffusion paradigm with current state-of-the-art 3D object detectors in BEV.
arXiv Detail & Related papers (2023-12-18T09:52:14Z) - Efficient Real-time Smoke Filtration with 3D LiDAR for Search and Rescue
with Autonomous Heterogeneous Robotic Systems [56.838297900091426]
Smoke and dust affect the performance of any mobile robotic platform due to their reliance on onboard perception systems.
This paper proposes a novel modular computation filtration pipeline based on intensity and spatial information.
arXiv Detail & Related papers (2023-08-14T16:48:57Z) - Implementing Edge Based Object Detection For Microplastic Debris [0.0]
Plastic has imbibed itself as an indispensable part of our day to day activities.
Plastic debris levels continue to rise with the accumulation of waste in garbage patches in landfills.
The project has been able to produce workable models that can perform on time detection of sampled images.
arXiv Detail & Related papers (2023-07-30T17:55:03Z) - Neural Scene Representation for Locomotion on Structured Terrain [56.48607865960868]
We propose a learning-based method to reconstruct the local terrain for a mobile robot traversing urban environments.
Using a stream of depth measurements from the onboard cameras and the robot's trajectory, the estimates the topography in the robot's vicinity.
We propose a 3D reconstruction model that faithfully reconstructs the scene, despite the noisy measurements and large amounts of missing data coming from the blind spots of the camera arrangement.
arXiv Detail & Related papers (2022-06-16T10:45:17Z) - FOVEA: Foveated Image Magnification for Autonomous Navigation [53.69803081925454]
We propose an attentional approach that elastically magnifies certain regions while maintaining a small input canvas.
Our proposed method boosts the detection AP over standard Faster R-CNN, with and without finetuning.
On the autonomous driving datasets Argoverse-HD and BDD100K, we show our proposed method boosts the detection AP over standard Faster R-CNN, with and without finetuning.
arXiv Detail & Related papers (2021-08-27T03:07:55Z) - ZeroWaste Dataset: Towards Automated Waste Recycling [51.053682077915546]
We present the first in-the-wild industrial-grade waste detection and segmentation dataset, ZeroWaste.
This dataset contains over1800fully segmented video frames collected from a real waste sorting plant.
We show that state-of-the-art segmentation methods struggle to correctly detect and classify target objects.
arXiv Detail & Related papers (2021-06-04T22:17:09Z) - Robotic Waste Sorter with Agile Manipulation and Quickly Trainable
Detector [3.3073775218038883]
The goal of automating the waste-sorting is to replace the human role of robust detection and agile manipulation of the waste items by robots.
First, we propose a combined manipulation method using graspless push-and-drop and pick-and-release manipulation.
Second, we propose a robotic system that can automatically collect object images to quickly train a deep neural network model.
arXiv Detail & Related papers (2021-04-02T22:19:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.