Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble
- URL: http://arxiv.org/abs/2406.12271v1
- Date: Tue, 18 Jun 2024 04:59:04 GMT
- Title: Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble
- Authors: Wang Liu, Zhiyu Wang, Puhong Duan, Xudong Kang, Shutao Li,
- Abstract summary: The Agriculture-Vision Challenge at CVPR 2024 aims at leveraging semantic segmentation models to produce pixel level semantic segmentation labels.
We propose a mosaic data augmentation with a rare class sampling strategy to enrich long-tail class samples.
Our methodology achieved a mean Intersection over Union (mIoU) score of 0.547 on the test set, securing second place in this challenge.
- Score: 20.631333392618327
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The Agriculture-Vision Challenge at CVPR 2024 aims at leveraging semantic segmentation models to produce pixel level semantic segmentation labels within regions of interest for multi-modality satellite images. It is one of the most famous and competitive challenges for global researchers to break the boundary between computer vision and agriculture sectors. However, there is a serious class imbalance problem in the agriculture-vision dataset, which hinders the semantic segmentation performance. To solve this problem, firstly, we propose a mosaic data augmentation with a rare class sampling strategy to enrich long-tail class samples. Secondly, we employ an adaptive class weight scheme to suppress the contribution of the common classes while increasing the ones of rare classes. Thirdly, we propose a probability post-process to increase the predicted value of the rare classes. Our methodology achieved a mean Intersection over Union (mIoU) score of 0.547 on the test set, securing second place in this challenge.
Related papers
- A SAM-based Solution for Hierarchical Panoptic Segmentation of Crops and
Weeds Competition [2.7624021966289605]
We propose an approach that combines the effectiveness of the Segment AnyThing Model (SAM) for instance segmentation with prompt input from object detection models.
Our best-performing model achieved a PQ+ score of 81.33 based on the evaluation metrics of the competition.
arXiv Detail & Related papers (2023-09-24T08:34:12Z) - Food Image Classification and Segmentation with Attention-based Multiple
Instance Learning [51.279800092581844]
The paper presents a weakly supervised methodology for training food image classification and semantic segmentation models.
The proposed methodology is based on a multiple instance learning approach in combination with an attention-based mechanism.
We conduct experiments on two meta-classes within the FoodSeg103 data set to verify the feasibility of the proposed approach.
arXiv Detail & Related papers (2023-08-22T13:59:47Z) - Extended Agriculture-Vision: An Extension of a Large Aerial Image
Dataset for Agricultural Pattern Analysis [11.133807938044804]
We release an improved version of the Agriculture-Vision dataset (Chiu et al., 2020b)
We extend this dataset with the release of 3600 large, high-resolution (10cm/pixel), full-field, red-green-blue and near-infrared images for pre-training.
We demonstrate the usefulness of this data by benchmarking different contrastive learning approaches on both downstream classification and semantic segmentation tasks.
arXiv Detail & Related papers (2023-03-04T17:35:24Z) - Fine-Grained Hard Negative Mining: Generalizing Mitosis Detection with a
Fifth of the MIDOG 2022 Dataset [1.2183405753834562]
We describe a candidate deep learning solution for the Mitosis Domain Generalization Challenge 2022 (MIDOG)
Our approach consists in training a rotation-invariant deep learning model using aggressive data augmentation.
Our model ensemble achieved a F1-score of.697 on the final test set after automated evaluation.
arXiv Detail & Related papers (2023-01-03T13:06:44Z) - Universal Object Detection with Large Vision Model [79.06618136217142]
This study focuses on the large-scale, multi-domain universal object detection problem.
To address these challenges, we introduce our approach to label handling, hierarchy-aware design, and resource-efficient model training.
Our method has demonstrated remarkable performance, securing a prestigious second-place ranking in the object detection track of the Robust Vision Challenge 2022.
arXiv Detail & Related papers (2022-12-19T12:40:13Z) - Rethinking Clustering-Based Pseudo-Labeling for Unsupervised
Meta-Learning [146.11600461034746]
Method for unsupervised meta-learning, CACTUs, is a clustering-based approach with pseudo-labeling.
This approach is model-agnostic and can be combined with supervised algorithms to learn from unlabeled data.
We prove that the core reason for this is lack of a clustering-friendly property in the embedding space.
arXiv Detail & Related papers (2022-09-27T19:04:36Z) - Agriculture-Vision Challenge 2022 -- The Runner-Up Solution for
Agricultural Pattern Recognition via Transformer-based Models [11.615548490321123]
We propose our solution to the third Agriculture-Vision Challenge in CVPR 2022.
We leverage a data pre-processing scheme and several Transformer-based models as well as data augmentation techniques to achieve a mIoU of 0.582.
arXiv Detail & Related papers (2022-06-23T18:02:12Z) - A Boundary Based Out-of-Distribution Classifier for Generalized
Zero-Shot Learning [83.1490247844899]
Generalized Zero-Shot Learning (GZSL) is a challenging topic that has promising prospects in many realistic scenarios.
We propose a boundary based Out-of-Distribution (OOD) classifier which classifies the unseen and seen domains by only using seen samples for training.
We extensively validate our approach on five popular benchmark datasets including AWA1, AWA2, CUB, FLO and SUN.
arXiv Detail & Related papers (2020-08-09T11:27:19Z) - The 1st Agriculture-Vision Challenge: Methods and Results [144.57794061346974]
The first Agriculture-Vision Challenge aims to encourage research in developing novel and effective algorithms for agricultural pattern recognition from aerial images.
Around 57 participating teams from various countries compete to achieve state-of-the-art in aerial agriculture semantic segmentation.
This paper provides a summary of notable methods and results in the challenge.
arXiv Detail & Related papers (2020-04-21T05:02:31Z) - Alleviating Semantic-level Shift: A Semi-supervised Domain Adaptation
Method for Semantic Segmentation [97.8552697905657]
A key challenge of this task is how to alleviate the data distribution discrepancy between the source and target domains.
We propose Alleviating Semantic-level Shift (ASS), which can successfully promote the distribution consistency from both global and local views.
We apply our ASS to two domain adaptation tasks, from GTA5 to Cityscapes and from Synthia to Cityscapes.
arXiv Detail & Related papers (2020-04-02T03:25:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.