ADA-YOLO: Dynamic Fusion of YOLOv8 and Adaptive Heads for Precise Image
Detection and Diagnosis
- URL: http://arxiv.org/abs/2312.10099v1
- Date: Thu, 14 Dec 2023 18:27:13 GMT
- Title: ADA-YOLO: Dynamic Fusion of YOLOv8 and Adaptive Heads for Precise Image
Detection and Diagnosis
- Authors: Shun Liu, Jianan Zhang, Ruocheng Song, Teik Toe Teoh
- Abstract summary: We propose ADA-YOLO, a light-weight yet effective method for medical object detection that integrates attention-based mechanisms with the YOLOv8 architecture.
Our proposed method leverages the dynamic feature localisation and parallel regression for computer vision tasks through textitadaptive head module.
- Score: 0.9804179673817571
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object detection and localization are crucial tasks for biomedical image
analysis, particularly in the field of hematology where the detection and
recognition of blood cells are essential for diagnosis and treatment decisions.
While attention-based methods have shown significant progress in object
detection in various domains, their application in medical object detection has
been limited due to the unique challenges posed by medical imaging datasets. To
address this issue, we propose ADA-YOLO, a light-weight yet effective method
for medical object detection that integrates attention-based mechanisms with
the YOLOv8 architecture. Our proposed method leverages the dynamic feature
localisation and parallel regression for computer vision tasks through
\textit{adaptive head} module. Empirical experiments were conducted on the
Blood Cell Count and Detection (BCCD) dataset to evaluate the effectiveness of
ADA-YOLO. The results showed that ADA-YOLO outperforms the YOLOv8 model in mAP
(mean average precision) on the BCCD dataset by using more than 3 times less
space than YOLOv8. This indicates that our proposed method is effective.
Moreover, the light-weight nature of our proposed method makes it suitable for
deployment in resource-constrained environments such as mobile devices or edge
computing systems. which could ultimately lead to improved diagnosis and
treatment outcomes in the field of hematology.
Related papers
- MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models [49.765466293296186]
Recent progress in Medical Large Vision-Language Models (Med-LVLMs) has opened up new possibilities for interactive diagnostic tools.
Med-LVLMs often suffer from factual hallucination, which can lead to incorrect diagnoses.
We propose a versatile multimodal RAG system, MMed-RAG, designed to enhance the factuality of Med-LVLMs.
arXiv Detail & Related papers (2024-10-16T23:03:27Z) - LSM-YOLO: A Compact and Effective ROI Detector for Medical Detection [8.812471041082105]
We propose a novel model named Lightweight Shunt Matching-YOLO (LSM-YOLO), with Lightweight Adaptive Extraction (LAE) and Multipath Shunt Feature Matching (MSFM)
Experimental results demonstrate that LSM-YOLO achieves 48.6% AP on a private dataset of pancreatic tumors, 65.1% AP on the BCCD blood cell detection public dataset, and 73.0% AP on the Br35h brain tumor detection public dataset.
arXiv Detail & Related papers (2024-08-26T08:16:58Z) - CAF-YOLO: A Robust Framework for Multi-Scale Lesion Detection in Biomedical Imagery [0.0682074616451595]
CAF-YOLO is a nimble yet robust method for medical object detection that leverages the strengths of convolutional neural networks (CNNs) and transformers.
ACFM module enhances the modeling of both global and local features, enabling the capture of long-term feature dependencies.
MSNN improves multi-scale information aggregation by extracting features across diverse scales.
arXiv Detail & Related papers (2024-08-04T01:44:44Z) - Exploring Machine Learning Algorithms for Infection Detection Using GC-IMS Data: A Preliminary Study [2.4961885884659987]
Our research aims to tackle the ongoing issue of precise infection identification.
By utilizing Gas Chromatography-Ion Mobility Spectrometry (GC-IMS) data and incorporating machine learning algorithms into one platform, our research aims to tackle the issue.
arXiv Detail & Related papers (2024-04-24T09:25:16Z) - Super-resolution of biomedical volumes with 2D supervision [84.5255884646906]
Masked slice diffusion for super-resolution exploits the inherent equivalence in the data-generating distribution across all spatial dimensions of biological specimens.
We focus on the application of SliceR to stimulated histology (SRH), characterized by its rapid acquisition of high-resolution 2D images but slow and costly optical z-sectioning.
arXiv Detail & Related papers (2024-04-15T02:41:55Z) - Optimizing Skin Lesion Classification via Multimodal Data and Auxiliary
Task Integration [54.76511683427566]
This research introduces a novel multimodal method for classifying skin lesions, integrating smartphone-captured images with essential clinical and demographic information.
A distinctive aspect of this method is the integration of an auxiliary task focused on super-resolution image prediction.
The experimental evaluations have been conducted using the PAD-UFES20 dataset, applying various deep-learning architectures.
arXiv Detail & Related papers (2024-02-16T05:16:20Z) - YOLO-World: Real-Time Open-Vocabulary Object Detection [87.08732047660058]
We introduce YOLO-World, an innovative approach that enhances YOLO with open-vocabulary detection capabilities.
Our method excels in detecting a wide range of objects in a zero-shot manner with high efficiency.
YOLO-World achieves 35.4 AP with 52.0 FPS on V100, which outperforms many state-of-the-art methods in terms of both accuracy and speed.
arXiv Detail & Related papers (2024-01-30T18:59:38Z) - Explainable AI in Diagnosing and Anticipating Leukemia Using Transfer
Learning Method [0.0]
This research paper focuses on Acute Lymphoblastic Leukemia (ALL), a form of blood cancer prevalent in children and teenagers.
It proposes an automated detection approach using computer-aided diagnostic (CAD) models, leveraging deep learning techniques.
The proposed method achieved an impressive 98.38% accuracy, outperforming other tested models.
arXiv Detail & Related papers (2023-12-01T10:37:02Z) - ArSDM: Colonoscopy Images Synthesis with Adaptive Refinement Semantic
Diffusion Models [69.9178140563928]
Colonoscopy analysis is essential for assisting clinical diagnosis and treatment.
The scarcity of annotated data limits the effectiveness and generalization of existing methods.
We propose an Adaptive Refinement Semantic Diffusion Model (ArSDM) to generate colonoscopy images that benefit the downstream tasks.
arXiv Detail & Related papers (2023-09-03T07:55:46Z) - LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical
Imaging via Second-order Graph Matching [59.01894976615714]
We introduce LVM-Med, the first family of deep networks trained on large-scale medical datasets.
We have collected approximately 1.3 million medical images from 55 publicly available datasets.
LVM-Med empirically outperforms a number of state-of-the-art supervised, self-supervised, and foundation models.
arXiv Detail & Related papers (2023-06-20T22:21:34Z) - TE-YOLOF: Tiny and efficient YOLOF for blood cell detection [26.463853328783962]
Blood cell detection in microscopic images is an essential branch of medical image processing research.
In this work, an object detector based on YOLOF has been proposed to detect blood cell objects such as red blood cells, white blood cells and platelets.
For increasing efficiency and flexibility, the EfficientNet Convolutional Neural Network is utilized as the backbone for the proposed object detector.
arXiv Detail & Related papers (2021-08-27T14:45:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.