Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for Visual Insect Understanding
- URL: http://arxiv.org/abs/2311.15206v2
- Date: Fri, 15 Mar 2024 16:15:00 GMT
- Title: Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for Visual Insect Understanding
- Authors: Hoang-Quan Nguyen, Thanh-Dat Truong, Xuan Bac Nguyen, Ashley Dowling, Xin Li, Khoa Luu,
- Abstract summary: Current machine vision model requires a large volume of data to achieve high performance.
We introduce a novel "Insect-1M" dataset, a game-changing resource poised to revolutionize insect-related foundation model training.
Covering a vast spectrum of insect species, our dataset, including 1 million images with dense identification labels of taxonomy hierarchy and insect descriptions, offers a panoramic view of entomology.
- Score: 15.383106771910274
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In precision agriculture, the detection and recognition of insects play an essential role in the ability of crops to grow healthy and produce a high-quality yield. The current machine vision model requires a large volume of data to achieve high performance. However, there are approximately 5.5 million different insect species in the world. None of the existing insect datasets can cover even a fraction of them due to varying geographic locations and acquisition costs. In this paper, we introduce a novel "Insect-1M" dataset, a game-changing resource poised to revolutionize insect-related foundation model training. Covering a vast spectrum of insect species, our dataset, including 1 million images with dense identification labels of taxonomy hierarchy and insect descriptions, offers a panoramic view of entomology, enabling foundation models to comprehend visual and semantic information about insects like never before. Then, to efficiently establish an Insect Foundation Model, we develop a micro-feature self-supervised learning method with a Patch-wise Relevant Attention mechanism capable of discerning the subtle differences among insect images. In addition, we introduce Description Consistency loss to improve micro-feature modeling via insect descriptions. Through our experiments, we illustrate the effectiveness of our proposed approach in insect modeling and achieve State-of-the-Art performance on standard benchmarks of insect-related tasks. Our Insect Foundation Model and Dataset promise to empower the next generation of insect-related vision models, bringing them closer to the ultimate goal of precision agriculture.
Related papers
- Deep-Wide Learning Assistance for Insect Pest Classification [1.9912919001438378]
We present DeWi, novel learning assistance for insect pest classification.
With a one-stage and alternating training strategy, DeWi simultaneously improves several Convolutional Neural Networks.
Experimental results show that DeWi achieves the highest performances on two insect pest classification benchmarks.
arXiv Detail & Related papers (2024-09-16T16:29:41Z) - Artificial Immune System of Secure Face Recognition Against Adversarial Attacks [67.31542713498627]
optimisation is required for insect production to realise its full potential.
This can be by targeted improvement of traits of interest through selective breeding.
This review combines knowledge from diverse disciplines, bridging the gap between animal breeding, quantitative genetics, evolutionary biology, and entomology.
arXiv Detail & Related papers (2024-06-26T07:50:58Z) - Unleashing the Power of Transfer Learning Model for Sophisticated Insect Detection: Revolutionizing Insect Classification [0.520707246175575]
This study uses different models like MobileNetV2, ResNet152V2, Xecption, Custom CNN.
A Convolutional Neural Network (CNN) based on the ResNet152V2 architecture is constructed and evaluated in this work.
The results highlight its potential for real-world applications in insect classification and entomology studies.
arXiv Detail & Related papers (2024-06-11T20:52:42Z) - Multimodal Foundation Models for Zero-shot Animal Species Recognition in
Camera Trap Images [57.96659470133514]
Motion-activated camera traps constitute an efficient tool for tracking and monitoring wildlife populations across the globe.
Supervised learning techniques have been successfully deployed to analyze such imagery, however training such techniques requires annotations from experts.
Reducing the reliance on costly labelled data has immense potential in developing large-scale wildlife tracking solutions with markedly less human labor.
arXiv Detail & Related papers (2023-11-02T08:32:00Z) - Automated Visual Monitoring of Nocturnal Insects with Light-based Camera
Traps [9.274371635733836]
We present two datasets of nocturnal insects, especially moths as a subset of Lepidoptera, photographed in Central Europe.
One dataset, the EU-Moths dataset, was captured manually by citizen scientists and contains species annotations for 200 different species.
The second dataset consists of more than 27,000 images captured on 95 nights.
arXiv Detail & Related papers (2023-07-28T09:31:36Z) - Deep learning powered real-time identification of insects using citizen
science data [17.13608307250744]
InsectNet can identify invasive species, provide fine-grained insect species identification, and work effectively in challenging backgrounds.
It can also abstain from making predictions when uncertain, facilitating seamless human intervention and making it a practical and trustworthy tool.
arXiv Detail & Related papers (2023-06-04T23:56:53Z) - Advancing Plain Vision Transformer Towards Remote Sensing Foundation
Model [97.9548609175831]
We resort to plain vision transformers with about 100 million parameters and make the first attempt to propose large vision models customized for remote sensing tasks.
Specifically, to handle the large image size and objects of various orientations in RS images, we propose a new rotated varied-size window attention.
Experiments on detection tasks demonstrate the superiority of our model over all state-of-the-art models, achieving 81.16% mAP on the DOTA-V1.0 dataset.
arXiv Detail & Related papers (2022-08-08T09:08:40Z) - An Efficient Insect Pest Classification Using Multiple Convolutional
Neural Network Based Models [0.3222802562733786]
Insect pest classification is a difficult task because of various kinds, scales, shapes, complex backgrounds in the field, and high appearance similarity among insect species.
We present different convolutional neural network-based models in this work, including attention, feature pyramid, and fine-grained models.
The experimental results show that combining these convolutional neural network-based models can better perform than the state-of-the-art methods on these two datasets.
arXiv Detail & Related papers (2021-07-26T12:53:28Z) - Two-View Fine-grained Classification of Plant Species [66.75915278733197]
We propose a novel method based on a two-view leaf image representation and a hierarchical classification strategy for fine-grained recognition of plant species.
A deep metric based on Siamese convolutional neural networks is used to reduce the dependence on a large number of training samples and make the method scalable to new plant species.
arXiv Detail & Related papers (2020-05-18T21:57:47Z) - Automatic image-based identification and biomass estimation of
invertebrates [70.08255822611812]
Time-consuming sorting and identification of taxa pose strong limitations on how many insect samples can be processed.
We propose to replace the standard manual approach of human expert-based sorting and identification with an automatic image-based technology.
We use state-of-the-art Resnet-50 and InceptionV3 CNNs for the classification task.
arXiv Detail & Related papers (2020-02-05T21:38:57Z) - Deformation-aware Unpaired Image Translation for Pose Estimation on
Laboratory Animals [56.65062746564091]
We aim to capture the pose of neuroscience model organisms, without using any manual supervision, to study how neural circuits orchestrate behaviour.
Our key contribution is the explicit and independent modeling of appearance, shape and poses in an unpaired image translation framework.
We demonstrate improved pose estimation accuracy on Drosophila melanogaster (fruit fly), Caenorhabditis elegans (worm) and Danio rerio (zebrafish)
arXiv Detail & Related papers (2020-01-23T15:34:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.