AI-Augmented Pollen Recognition in Optical and Holographic Microscopy for Veterinary Imaging
- URL: http://arxiv.org/abs/2512.12101v1
- Date: Sat, 13 Dec 2025 00:26:45 GMT
- Title: AI-Augmented Pollen Recognition in Optical and Holographic Microscopy for Veterinary Imaging
- Authors: Swarn S. Warshaneyan, Maksims Ivanovs, Blaž Cugmas, Inese Bērziņa, Laura Goldberga, Mindaugas Tamosiunas, Roberts Kadiķis,
- Abstract summary: We present a comprehensive study on fully automated pollen recognition across conventional optical and digital in-line holographic microscopy (DIHM) images.<n>Visually recognizing pollen in unreconstructed holographic images remains challenging due to speckle noise, twin-image artifacts and substantial divergence from bright-field appearances.<n>We employ a Wasserstein GAN with spectral normalization to create synthetic DIHM images, yielding an FID score of 58.246.
- Score: 0.1563562770254469
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: We present a comprehensive study on fully automated pollen recognition across both conventional optical and digital in-line holographic microscopy (DIHM) images of sample slides. Visually recognizing pollen in unreconstructed holographic images remains challenging due to speckle noise, twin-image artifacts and substantial divergence from bright-field appearances. We establish the performance baseline by training YOLOv8s for object detection and MobileNetV3L for classification on a dual-modality dataset of automatically annotated optical and affinely aligned DIHM images. On optical data, detection mAP50 reaches 91.3% and classification accuracy reaches 97%, whereas on DIHM data, we achieve only 8.15% for detection mAP50 and 50% for classification accuracy. Expanding the bounding boxes of pollens in DIHM images over those acquired in aligned optical images achieves 13.3% for detection mAP50 and 54% for classification accuracy. To improve object detection in DIHM images, we employ a Wasserstein GAN with spectral normalization (WGAN-SN) to create synthetic DIHM images, yielding an FID score of 58.246. Mixing real-world and synthetic data at the 1.0 : 1.5 ratio for DIHM images improves object detection up to 15.4%. These results demonstrate that GAN-based augmentation can reduce the performance divide, bringing fully automated DIHM workflows for veterinary imaging a small but important step closer to practice.
Related papers
- Automated Pollen Recognition in Optical and Holographic Microscopy Images [0.1563562770254469]
This study explores the application of deep learning to improve and automate pollen grain detection and classification in both optical and holographic microscopy images.<n>We used YOLOv8s for object detection and MobileNetV3L for the classification task, evaluating their performance across imaging modalities.<n>The models achieved 91.3% mAP50 for detection and 97% overall accuracy for classification on optical images, whereas the initial performance on greyscale holographic images was substantially lower.
arXiv Detail & Related papers (2025-12-09T13:31:02Z) - Diffusion-Based Synthetic Brightfield Microscopy Images for Enhanced Single Cell Detection [0.0]
We investigate the use of unconditional models to generate synthetic brightfield microscopy images.<n>A U-Net based diffusion model was trained and used to create datasets with varying ratios of synthetic and real images.<n>Experiments with YOLOv8, YOLOv9 and RT-DETR reveal that training with synthetic data can achieve improved detection accuracies.
arXiv Detail & Related papers (2025-11-25T08:57:23Z) - DeepAf: One-Shot Spatiospectral Auto-Focus Model for Digital Pathology [37.648157065553185]
We introduce a novel automated microscopic system powered by DeepAf.<n>DeepAf combines spatial and spectral features through a hybrid architecture for single-shot focus prediction.<n>System achieves 0.90 AUC in cancer classification at 4x magnification, a significant achievement at lower magnification than typical 20x WSI scans.
arXiv Detail & Related papers (2025-10-06T19:28:08Z) - Ensemble YOLO Framework for Multi-Domain Mitotic Figure Detection in Histopathology Images [0.7541656202645494]
Two modern one-stage detectors, YOLOv5 and YOLOv8, were trained on MIDOG++, CMC, and CCMCT datasets.<n>YOLOv5 achieved higher precision (84.3%), while YOLOv8 offered improved recall (82.6%)<n>Our ensemble ranked 5th with an F1 score of 79.2%, precision of 73.6%, and recall of 85.8%, confirming that the proposed strategy generalizes effectively across unseen test data.
arXiv Detail & Related papers (2025-09-03T02:43:02Z) - HistoART: Histopathology Artifact Detection and Reporting Tool [37.31105955164019]
Whole Slide Imaging (WSI) is widely used to digitize tissue specimens for detailed, high-resolution examination.<n>WSI remains vulnerable to artifacts introduced during slide preparation and scanning.<n>We propose and compare three robust artifact detection approaches for WSIs.
arXiv Detail & Related papers (2025-06-23T17:22:19Z) - Label-efficient Single Photon Images Classification via Active Learning [3.213673540843068]
Single-photon LiDAR achieves high-precision 3D imaging in extreme environments through quantum-level photon detection technology.<n>Current research focuses on reconstructing 3D scenes from sparse photon events, whereas the semantic interpretation of single-photon images remains underexplored.<n>This paper presents the first active learning framework for single-photon image classification.
arXiv Detail & Related papers (2025-05-07T12:57:40Z) - Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging [41.446379453352534]
Latent Diffusion Autoencoder (LDAE) is a novel encoder-decoder diffusion-based framework for efficient and meaningful unsupervised learning in medical imaging.<n>This study focuses on Alzheimer disease (AD) using brain MR from the ADNI database as a case study.
arXiv Detail & Related papers (2025-04-11T15:37:46Z) - MaskTerial: A Foundation Model for Automated 2D Material Flake Detection [48.73213960205105]
We present a deep learning model, called MaskTerial, that uses an instance segmentation network to reliably identify 2D material flakes.<n>The model is extensively pre-trained using a synthetic data generator, that generates realistic microscopy images from unlabeled data.<n>We demonstrate significant improvements over existing techniques in the detection of low-contrast materials such as hexagonal boron nitride.
arXiv Detail & Related papers (2024-12-12T15:01:39Z) - Generalist Segmentation Algorithm for Photoreceptors Analysis in Adaptive Optics Imaging [3.9111016990170286]
Confocal adaptive optics scanning light ophthalmoscope (AOSLO) imaging enables visualization of the cones from reflections of waveguiding cone photoreceptors.
This paper introduces a method based on deep learning (DL) for detecting and segmenting cones in AOSLO images.
arXiv Detail & Related papers (2024-08-27T06:49:21Z) - Direct Zernike Coefficient Prediction from Point Spread Functions and Extended Images using Deep Learning [36.136619420474766]
Existing adaptive optics systems rely on iterative search algorithm to correct for aberrations and improve images.
This study demonstrates the application of convolutional neural networks to characterise the optical aberration.
arXiv Detail & Related papers (2024-04-23T17:03:53Z) - Whole-body Detection, Recognition and Identification at Altitude and
Range [57.445372305202405]
We propose an end-to-end system evaluated on diverse datasets.
Our approach involves pre-training the detector on common image datasets and fine-tuning it on BRIAR's complex videos and images.
We conduct thorough evaluations under various conditions, such as different ranges and angles in indoor, outdoor, and aerial scenarios.
arXiv Detail & Related papers (2023-11-09T20:20:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.