BAWSeg: A UAV Multispectral Benchmark for Barley Weed Segmentation
- URL: http://arxiv.org/abs/2603.01932v1
- Date: Mon, 02 Mar 2026 14:49:05 GMT
- Title: BAWSeg: A UAV Multispectral Benchmark for Barley Weed Segmentation
- Authors: Haitian Wang, Xinyu Wang, Muhammad Ibrahim, Dustin Severtson, Ajmal Mian
- Abstract summary: We propose VISA (Vegetation-Index and Spectral Attention), a two-stream segmentation network that fuses radiance cues and normalized index cues at native resolution. Its index stream operates on vegetation-index maps with windowed self-attention. VISA achieves 75.6% mIoU and 63.5% weed IoU with 22.8M parameters, outperforming a multispectral SegFormer-B1 baseline by 1.2 mIoU and 1.9 weed IoU.
- Score: 31.004130414489698
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Accurate weed mapping in cereal fields requires pixel-level segmentation from UAV imagery that remains reliable across fields, seasons, and illumination. Existing multispectral pipelines often depend on thresholded vegetation indices, which are brittle under radiometric drift and mixed crop–weed pixels, or on single-stream CNN and Transformer backbones that ingest stacked bands and indices, where radiance cues and normalized index cues interfere and reduce sensitivity to small weed clusters embedded in crop canopies. We propose VISA (Vegetation-Index and Spectral Attention), a two-stream segmentation network that decouples these cues and fuses them at native resolution. The radiance stream learns from calibrated five-band reflectance using residual spectral-spatial attention to preserve fine textures and row boundaries that are attenuated by ratio indices. The index stream operates on vegetation-index maps with windowed self-attention to model local structure efficiently, state-space layers to propagate field-scale context without quadratic attention cost, and Slot Attention to form stable region descriptors that improve discrimination of sparse weeds under canopy mixing. To support supervised training and deployment-oriented evaluation, we introduce BAWSeg, a four-year UAV multispectral dataset collected over commercial barley paddocks in Western Australia, providing radiometrically calibrated blue, green, red, red edge, and near-infrared orthomosaics, derived vegetation indices, and dense crop, weed, and other labels with leakage-free block splits. On BAWSeg, VISA achieves 75.6% mIoU and 63.5% weed IoU with 22.8M parameters, outperforming a multispectral SegFormer-B1 baseline by 1.2 mIoU and 1.9 weed IoU. Under cross-plot and cross-year protocols, VISA maintains 71.2% and 69.2% mIoU, respectively. The BAWSeg data, VISA code, and trained models will be released upon publication.
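The abstract refers to normalized vegetation indices and reports mIoU and weed IoU. As a rough illustration only (not the authors' code), the sketch below computes NDVI, a common normalized-difference index, from near-infrared and red reflectance, and per-class IoU over flattened label maps. The function names, the class ids (0 = crop, 1 = weed, 2 = other), and the toy values are assumptions made for this example; the paper's actual index set and evaluation code are not given here.

```python
from typing import Dict, List, Sequence


def ndvi(nir: Sequence[float], red: Sequence[float], eps: float = 1e-6) -> List[float]:
    """Per-pixel NDVI = (NIR - Red) / (NIR + Red) over flattened reflectance bands.

    eps is a small constant (an assumption of this sketch) that avoids
    division by zero on pixels where both bands are zero.
    """
    if len(nir) != len(red):
        raise ValueError("nir and red must have the same number of pixels")
    return [(n - r) / (n + r + eps) for n, r in zip(nir, red)]


def per_class_iou(
    pred: Sequence[int], target: Sequence[int], classes: Sequence[int]
) -> Dict[int, float]:
    """Intersection over union for each class id, over flattened label maps.

    Classes absent from both prediction and target are skipped so they
    do not distort the mean.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same number of pixels")
    ious: Dict[int, float] = {}
    for c in classes:
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:
            ious[c] = inter / union
    return ious


def mean_iou(ious: Dict[int, float]) -> float:
    """Unweighted mean of the per-class IoU values (mIoU)."""
    return sum(ious.values()) / len(ious)


# Toy example with hypothetical class ids: 0 = crop, 1 = weed, 2 = other.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
ious = per_class_iou(pred, target, classes=[0, 1, 2])
print("weed IoU:", ious[1])
print("mIoU:", mean_iou(ious))
print("NDVI:", ndvi([0.5, 0.3], [0.1, 0.3]))
```

Reporting weed IoU alongside mIoU matters because weeds are typically the sparse class: a model can score well on the mean while still missing small weed clusters.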
Related papers
- LeafInst - Unified Instance Segmentation Network for Fine-Grained Forestry Leaf Phenotype Analysis: A New UAV based Benchmark [10.61947524568352]
LeafInst is a novel segmentation framework tailored for irregular and multi-scale leaf structures.
It achieves 68.4 mAP, outperforming YOLOv11 by 7.1 percent and MaskDINO by 6.5 percent.
arXiv Detail & Related papers (2026-03-04T01:01:57Z)
- Manual Labelling Artificially Inflates Deep Learning-Based Segmentation Performance on RGB Images of Closed Canopy: Validation Using TLS [0.0]
Traditional methods relying on field-based forest inventories are labor-intensive and limited in spatial coverage.
We generate high-fidelity validation labels from co-located Terrestrial Laser Scanning (TLS) data for drone imagery of boreal and Mediterranean forests.
We evaluate the performance of two widely used deep learning ITC segmentation models - DeepForest (RetinaNet) and Detectree2 (Mask R-CNN).
Both models showed very poor localisation accuracy at stricter IoU thresholds, even when restricted to canopy trees.
arXiv Detail & Related papers (2025-03-18T14:09:00Z)
- Multi-Domain Biometric Recognition using Body Embeddings [51.36007967653781]
We show that body embeddings perform better than face embeddings in medium-wave infrared (MWIR) and long-wave infrared (LWIR) domains.
We leverage a vision transformer architecture to establish benchmark results on the IJB-MDF dataset.
We also show that finetuning a body model, pretrained exclusively on VIS data, with a simple combination of cross-entropy and triplet losses achieves state-of-the-art mAP scores.
arXiv Detail & Related papers (2025-03-13T22:38:18Z)
- Multispectral Remote Sensing for Weed Detection in West Australian Agricultural Lands [3.6284577335311563]
The Kondinin region in Western Australia faces significant agricultural challenges due to pervasive weed infestations, causing economic losses and ecological impacts.
This study constructs a tailored multispectral remote sensing framework for weed detection to advance precision agriculture practices.
Unmanned aerial vehicles were used to collect raw multispectral data from two experimental areas over four years, covering 0.6046 km², and ground truth annotations were created with GPS-enabled vehicles to manually label weeds and crops.
arXiv Detail & Related papers (2025-02-12T07:01:42Z)
- Unsupervised deep learning for semantic segmentation of multispectral LiDAR forest point clouds [1.6633665061166945]
This study proposes a fully unsupervised deep learning method for leaf-wood separation of high-density laser scanning point clouds.
GrowSP-ForMS achieved a mean accuracy of 84.3% and a mean intersection over union (mIoU) of 69.6% on our MS test set.
arXiv Detail & Related papers (2025-02-10T07:58:49Z)
- Vision Transformers, a new approach for high-resolution and large-scale mapping of canopy heights [50.52704854147297]
We present a new vision transformer (ViT) model optimized with a classification (discrete) and a continuous loss function.
This model achieves better accuracy than previously used convolutional based approaches (ConvNets) optimized with only a continuous loss function.
arXiv Detail & Related papers (2023-04-22T22:39:03Z)
- Evaluation of the potential of Near Infrared Hyperspectral Imaging for monitoring the invasive brown marmorated stink bug [53.682955739083056]
The brown marmorated stink bug (BMSB), Halyomorpha halys, is an invasive insect pest of global importance that damages several crops.
The present study consists in a preliminary evaluation at the laboratory level of Near Infrared Hyperspectral Imaging (NIR-HSI) as a possible technology to detect BMSB specimens.
arXiv Detail & Related papers (2023-01-19T11:37:20Z)
- A multiscale spatiotemporal approach for smallholder irrigation detection [0.0]
This paper presents an irrigation detection methodology that leverages multiscale satellite imagery of vegetation abundance.
The methodology is applied to detect smallholder irrigation in two states in the Ethiopian highlands, Tigray and Amhara.
arXiv Detail & Related papers (2022-02-09T02:50:42Z)
- A Multi-Stage model based on YOLOv3 for defect detection in PV panels based on IR and Visible Imaging by Unmanned Aerial Vehicle [65.99880594435643]
We propose a novel model to detect panel defects on aerial images captured by unmanned aerial vehicle.
The model combines detections of panels and defects to refine its accuracy.
The proposed model has been validated on two big PV plants in the south of Italy.
arXiv Detail & Related papers (2021-11-23T08:04:32Z)
- A CNN Approach to Simultaneously Count Plants and Detect Plantation-Rows from UAV Imagery [56.10033255997329]
We propose a novel deep learning method based on a Convolutional Neural Network (CNN).
It simultaneously detects and geolocates plantation-rows while counting their plants, even in highly dense plantation configurations.
The proposed method achieved state-of-the-art performance for counting and geolocating plants and plant-rows in UAV images from different types of crops.
arXiv Detail & Related papers (2020-12-31T18:51:17Z)
- Ensemble Hyperspectral Band Selection for Detecting Nitrogen Status in Grape Leaves [0.22499166814992436]
This study aimed to identify the optimal set of spectral bands for nitrogen detection in grape leaves using ensemble feature selection on hyperspectral data.
The pipeline identified less than 0.45% of the bands as most informative about grape nitrogen status.
The proposed pipeline may also be used for application-specific multispectral sensor design in domains other than agriculture.
arXiv Detail & Related papers (2020-10-08T19:09:10Z)