Related papers: Training Deep Learning Algorithms on Synthetic Forest Images for Tree Detection

Training Deep Learning Algorithms on Synthetic Forest Images for Tree Detection

URL: http://arxiv.org/abs/2210.04104v1
Date: Sat, 8 Oct 2022 20:49:40 GMT
Title: Training Deep Learning Algorithms on Synthetic Forest Images for Tree Detection
Authors: Vincent Grondin, Fran\c{c}ois Pomerleau, Philippe Gigu\`ere,
Abstract summary: We propose to use simulated forest environments to automatically generate 43 k realistic synthetic images with pixel-level annotations. We also report the promising transfer learning capability of features learned on our synthetic dataset by directly predicting bounding box, segmentation masks and keypoints on real images.
Score: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Vision-based segmentation in forested environments is a key functionality for autonomous forestry operations such as tree felling and forwarding. Deep learning algorithms demonstrate promising results to perform visual tasks such as object detection. However, the supervised learning process of these algorithms requires annotations from a large diversity of images. In this work, we propose to use simulated forest environments to automatically generate 43 k realistic synthetic images with pixel-level annotations, and use it to train deep learning algorithms for tree detection. This allows us to address the following questions: i) what kind of performance should we expect from deep learning in harsh synthetic forest environments, ii) which annotations are the most important for training, and iii) what modality should be used between RGB and depth. We also report the promising transfer learning capability of features learned on our synthetic dataset by directly predicting bounding box, segmentation masks and keypoints on real images. Code available on GitHub (https://github.com/norlab-ulaval/PercepTreeV1).

Related papers

Tree-Mamba: A Tree-Aware Mamba for Underwater Monocular Depth Estimation [85.17735565146106]
Underwater Monocular Depth Estimation (UMDE) is a critical task that aims to estimate high-precision depth maps from underwater degraded images.<n>We develop a novel tree-aware Mamba method, dubbed Tree-Mamba, for estimating accurate monocular depth maps from underwater degraded images.<n>We construct an underwater depth estimation benchmark (called BlueDepth), which consists of 38,162 underwater image pairs with reliable depth labels.
arXiv Detail & Related papers (2025-07-10T12:10:51Z)
Zero-Shot Tree Detection and Segmentation from Aerial Forest Imagery [1.2770132985501168]
Current RGB tree segmentation methods rely on training specialized machine learning models with labeled tree datasets.<n>In this paper, we investigate the efficacy of using a state-of-the-art image segmentation model, Segment Anything Model 2 (SAM2) in a zero-shot manner for individual tree detection and segmentation.<n>Our results suggest that SAM2 not only has impressive generalization capabilities, but also can form a natural synergy with specialized methods trained on in-domain labeled data.
arXiv Detail & Related papers (2025-06-03T17:44:43Z)
Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis [0.0]
We present a novel pipeline for generating semantically segmented point clouds of forest environments.<n>We generate realistic RGB images of diverse forest scenes along with their corresponding semantic segmentation masks.<n>The resulting point clouds provide both geometric and semantic detail, offering a valuable resource for training and evaluating deep learning models.
arXiv Detail & Related papers (2025-05-15T23:34:55Z)
On the Learning with Augmented Class via Forests [17.606415934443554]
We focus on learning with augmented class via forests, where an augmented class may appear in testing data yet not in training data.<n>We develop the Learning with Augmented Class via Forests approach, which constructs shallow forests according to the augmented Gini impurity.<n>We also develop deep neural forests via an optimization objective based on our augmented Gini impurity.
arXiv Detail & Related papers (2025-05-14T11:22:22Z)
Forest Inspection Dataset for Aerial Semantic Segmentation and Depth Estimation [6.635604919499181]
We introduce a new large aerial dataset for forest inspection. It contains both real-world and virtual recordings of natural environments. We develop a framework to assess the deforestation degree of an area.
arXiv Detail & Related papers (2024-03-11T11:26:44Z)
TreeLearn: A deep learning method for segmenting individual trees from ground-based LiDAR forest point clouds [40.46280139210502]
TreeLearn is a deep learning approach for tree instance segmentation of forest point clouds. TreeLearn is trained on already segmented point clouds in a data-driven manner. We trained TreeLearn on forest point clouds of 6665 trees, labeled using the Lidar360 software.
arXiv Detail & Related papers (2023-09-15T15:20:16Z)
Improving Human-Object Interaction Detection via Virtual Image Learning [68.56682347374422]
Human-Object Interaction (HOI) detection aims to understand the interactions between humans and objects. In this paper, we propose to alleviate the impact of such an unbalanced distribution via Virtual Image Leaning (VIL) A novel label-to-image approach, Multiple Steps Image Creation (MUSIC), is proposed to create a high-quality dataset that has a consistent distribution with real images.
arXiv Detail & Related papers (2023-08-04T10:28:48Z)
MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments [72.6405488990753]
Self-supervised learning can be used for mitigating the greedy needs of Vision Transformer networks. We propose a single-stage and standalone method, MOCA, which unifies both desired properties. We achieve new state-of-the-art results on low-shot settings and strong experimental results in various evaluation protocols.
arXiv Detail & Related papers (2023-07-18T15:46:20Z)
Tree Detection and Diameter Estimation Based on Deep Learning [0.0]
Tree perception is an essential building block toward autonomous forestry operations. Deep neural network models trained on our datasets achieve a precision of 90.4% for tree detection. Results offer promising avenues toward autonomous tree felling operations.
arXiv Detail & Related papers (2022-10-31T15:51:32Z)
Pixel-wise classification in graphene-detection with tree-based machine learning algorithms [0.0]
We introduce four different tree-based machine learning algorithms -- decision tree, random forest, extreme boost gradient, and light gradient boosting machine. We train them with five optical microscopy images of graphene, and evaluate their performances with multiple metrics and indices. The code developed in this paper will be released at indices.com/gjung-group/Graphene_segmentation.
arXiv Detail & Related papers (2022-08-24T08:10:27Z)
Learning Co-segmentation by Segment Swapping for Retrieval and Discovery [67.6609943904996]
The goal of this work is to efficiently identify visually similar patterns from a pair of images. We generate synthetic training pairs by selecting object segments in an image and copy-pasting them into another image. We show our approach provides clear improvements for artwork details retrieval on the Brueghel dataset.
arXiv Detail & Related papers (2021-10-29T16:51:16Z)
Exploiting the relationship between visual and textual features in social networks for image classification with zero-shot deep learning [0.0]
In this work, we propose a classifier ensemble based on the transferable learning capabilities of the CLIP neural network architecture. Our experiments, based on image classification tasks according to the labels of the Places dataset, are performed by first considering only the visual part. Considering the associated texts to the images can help to improve the accuracy depending on the goal.
arXiv Detail & Related papers (2021-07-08T10:54:59Z)
HistoTransfer: Understanding Transfer Learning for Histopathology [9.231495418218813]
We compare the performance of features extracted from networks trained on ImageNet and histopathology data. We investigate if features learned using more complex networks lead to gain in performance.
arXiv Detail & Related papers (2021-06-13T18:55:23Z)
Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation [49.90178055521207]
This work addresses weakly supervised semantic segmentation (WSSS), with the goal of bridging the gap between image-level annotations and pixel-level segmentation. We formulate WSSS as a novel group-wise learning task that explicitly models semantic dependencies in a group of images to estimate more reliable pseudo ground-truths. In particular, we devise a graph neural network (GNN) for group-wise semantic mining, wherein input images are represented as graph nodes.
arXiv Detail & Related papers (2020-12-09T12:40:13Z)
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning [60.75687261314962]
We introduce pixel-level pretext tasks for learning dense feature representations. A pixel-to-propagation consistency task produces better results than state-of-the-art approaches. Results demonstrate the strong potential of defining pretext tasks at the pixel level.
arXiv Detail & Related papers (2020-11-19T18:59:45Z)
Learning RGB-D Feature Embeddings for Unseen Object Instance Segmentation [67.88276573341734]
We propose a new method for unseen object instance segmentation by learning RGB-D feature embeddings from synthetic data. A metric learning loss function is utilized to learn to produce pixel-wise feature embeddings. We further improve the segmentation accuracy with a new two-stage clustering algorithm.
arXiv Detail & Related papers (2020-07-30T00:23:07Z)

This list is automatically generated from the titles and abstracts of the papers in this site.