VisionCAD: An Integration-Free Radiology Copilot Framework
- URL: http://arxiv.org/abs/2511.00381v1
- Date: Sat, 01 Nov 2025 03:29:50 GMT
- Title: VisionCAD: An Integration-Free Radiology Copilot Framework
- Authors: Jiaming Li, Junlei Wu, Sheng Wang, Honglin Xiong, Jiangdong Cai, Zihao Zhao, Yitao Zhu, Yuan Yin, Dinggang Shen, Qian Wang,
- Abstract summary: VisionCAD is a vision-based radiological assistance framework.<n>It captures medical images directly from displays using a camera system.<n>System achieves diagnostic performance comparable to conventional CAD systems.
- Score: 42.29535854844036
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Widespread clinical deployment of computer-aided diagnosis (CAD) systems is hindered by the challenge of integrating with existing hospital IT infrastructure. Here, we introduce VisionCAD, a vision-based radiological assistance framework that circumvents this barrier by capturing medical images directly from displays using a camera system. The framework operates through an automated pipeline that detects, restores, and analyzes on-screen medical images, transforming camera-captured visual data into diagnostic-quality images suitable for automated analysis and report generation. We validated VisionCAD across diverse medical imaging datasets, demonstrating that our modular architecture can flexibly utilize state-of-the-art diagnostic models for specific tasks. The system achieves diagnostic performance comparable to conventional CAD systems operating on original digital images, with an F1-score degradation typically less than 2\% across classification tasks, while natural language generation metrics for automated reports remain within 1\% of those derived from original images. By requiring only a camera device and standard computing resources, VisionCAD offers an accessible approach for AI-assisted diagnosis, enabling the deployment of diagnostic capabilities in diverse clinical settings without modifications to existing infrastructure.
Related papers
- OnSight Pathology: A real-time platform-agnostic computational pathology companion for histopathology [0.1866854739883167]
We introduce OnSight Pathology, a platform-agnostic computer vision software that uses continuous custom screen captures.<n>OnSight Pathology operates locally on consumer-grade personal computers without complex software integration.<n>We show how OnSight Pathology can deliver real-time AI inferences across a broad range of pathology pipelines.
arXiv Detail & Related papers (2025-12-03T19:02:08Z) - Intelligent Healthcare Imaging Platform: A VLM-Based Framework for Automated Medical Image Analysis and Clinical Report Generation [0.0]
This work presents an intelligent multimodal framework for medical image analysis that leverages Vision-Language Models (VLMs)<n>The framework integrates Google Gemini 2.5 Flash for automated tumor detection and clinical report generation across multiple imaging modalities including CT, MRI, X-ray, and Ultrasound.
arXiv Detail & Related papers (2025-09-16T23:15:44Z) - UniCAD: Efficient and Extendable Architecture for Multi-Task Computer-Aided Diagnosis System [48.83716673786449]
We propose UniCAD, a unified architecture that seamlessly handles both 2D and 3D medical images.<n>A low-rank adaptation strategy is employed to adapt a pre-trained visual model to the medical image domain, achieving performance on par with fully fine-tuned counterparts.<n>Building on this unified CAD architecture, we establish an open-source platform where researchers can share and access lightweight CAD experts.
arXiv Detail & Related papers (2025-05-14T06:21:27Z) - Clinical Evaluation of Medical Image Synthesis: A Case Study in Wireless Capsule Endoscopy [63.39037092484374]
Synthetic Data Generation based on Artificial Intelligence (AI) can transform the way clinical medicine is delivered.<n>This study focuses on the clinical evaluation of medical SDG, with a proof-of-concept investigation on diagnosing Inflammatory Bowel Disease (IBD) using Wireless Capsule Endoscopy (WCE) images.<n>The results show that TIDE-II generates clinically plausible, very realistic WCE images, of improved quality compared to relevant state-of-the-art generative models.
arXiv Detail & Related papers (2024-10-31T19:48:50Z) - COVID-19 Computer-aided Diagnosis through AI-assisted CT Imaging
Analysis: Deploying a Medical AI System [16.1664846590467]
We showcase the integration and reliable and fast deployment of a state-of-the-art AI system designed to automatically analyze CT images.
The suggested system is anticipated to reduce physicians' detection time and enhance the overall efficiency of COVID-19 detection.
arXiv Detail & Related papers (2024-03-10T15:54:45Z) - ChatCAD: Interactive Computer-Aided Diagnosis on Medical Image using
Large Language Models [53.73049253535025]
Large language models (LLMs) have recently demonstrated their potential in clinical applications.
This paper presents a method for integrating LLMs into medical-image CAD networks.
The goal is to merge the strengths of LLMs' medical domain knowledge and logical reasoning with the vision understanding capability of existing medical-image CAD models.
arXiv Detail & Related papers (2023-02-14T18:54:06Z) - Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation [116.87918100031153]
We propose a Cross-modal clinical Graph Transformer (CGT) for ophthalmic report generation (ORG)
CGT injects clinical relation triples into the visual features as prior knowledge to drive the decoding procedure.
Experiments on the large-scale FFA-IR benchmark demonstrate that the proposed CGT is able to outperform previous benchmark methods.
arXiv Detail & Related papers (2022-06-04T13:16:30Z) - Medical Imaging and Machine Learning [16.240472115235253]
The National Institutes of Health in 2018 identified key focus areas for the future of artificial intelligence in medical imaging.
Data availability, need for novel computing architectures and explainable AI algorithms, are still relevant.
In this paper we explore challenges unique to high dimensional clinical imaging data, in addition to highlighting some of the technical and ethical considerations.
arXiv Detail & Related papers (2021-03-02T18:53:39Z) - Cognitive Visual Inspection Service for LCD Manufacturing Industry [80.63336968475889]
This paper discloses a novel visual inspection system for liquid crystal display (LCD), which is currently a dominant type in the FPD industry.
System is based on two cornerstones: robust/high-performance defect recognition model and cognitive visual inspection service architecture.
arXiv Detail & Related papers (2021-01-11T08:14:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.