MedPerf: Open Benchmarking Platform for Medical Artificial Intelligence
using Federated Evaluation
- URL: http://arxiv.org/abs/2110.01406v1
- Date: Wed, 29 Sep 2021 18:09:41 GMT
- Title: MedPerf: Open Benchmarking Platform for Medical Artificial Intelligence
using Federated Evaluation
- Authors: Alexandros Karargyris, Renato Umeton, Micah J. Sheller, Alejandro
Aristizabal, Johnu George, Srini Bala, Daniel J. Beutel, Victor Bittorf,
Akshay Chaudhari, Alexander Chowdhury, Cody Coleman, Bala Desinghu, Gregory
Diamos, Debo Dutta, Diane Feddema, Grigori Fursin, Junyi Guo, Xinyuan Huang,
David Kanter, Satyananda Kashyap, Nicholas Lane, Indranil Mallick, Pietro
Mascagni, Virendra Mehta, Vivek Natarajan, Nikola Nikolov, Nicolas Padoy,
Gennady Pekhimenko, Vijay Janapa Reddi, G Anthony Reina, Pablo Ribalta, Jacob
Rosenthal, Abhishek Singh, Jayaraman J. Thiagarajan, Anna Wuest, Maria
Xenochristou, Daguang Xu, Poonam Yadav, Michael Rosenthal, Massimo Loda,
Jason M. Johnson, Peter Mattson
- Abstract summary: We argue that unlocking this potential requires a systematic way to measure the performance of medical AI models on large-scale heterogeneous data.
We are building MedPerf, an open framework for benchmarking machine learning in the medical domain.
- Score: 110.31526448744096
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Medical AI has tremendous potential to advance healthcare by supporting the
evidence-based practice of medicine, personalizing patient treatment, reducing
costs, and improving provider and patient experience. We argue that unlocking
this potential requires a systematic way to measure the performance of medical
AI models on large-scale heterogeneous data. To meet this need, we are building
MedPerf, an open framework for benchmarking machine learning in the medical
domain. MedPerf will enable federated evaluation in which models are securely
distributed to different facilities for evaluation, thereby empowering
healthcare organizations to assess and verify the performance of AI models in
an efficient and human-supervised process, while prioritizing privacy. We
describe the current challenges healthcare and AI communities face, the need
for an open platform, the design philosophy of MedPerf, its current
implementation status, and our roadmap. We call for researchers and
organizations to join us in creating the MedPerf open benchmarking platform.
Related papers
- STLLaVA-Med: Self-Training Large Language and Vision Assistant for Medical Question-Answering [58.79671189792399]
STLLaVA-Med is designed to train a policy model capable of auto-generating medical visual instruction data.
We validate the efficacy and data efficiency of STLLaVA-Med across three major medical Visual Question Answering (VQA) benchmarks.
arXiv Detail & Related papers (2024-06-28T15:01:23Z) - Open Challenges and Opportunities in Federated Foundation Models Towards Biomedical Healthcare [14.399086205317358]
Foundation models (FMs) are trained on vast datasets through methods including unsupervised pretraining, self-supervised learning, instructed fine-tuning, and reinforcement learning from human feedback.
These models are crucial for biomedical applications that require processing diverse data forms such as clinical reports, diagnostic images, and multimodal patient interactions.
The incorporation of FL with these sophisticated models presents a promising strategy to harness their analytical power while safeguarding the privacy of sensitive medical data.
arXiv Detail & Related papers (2024-05-10T19:22:24Z) - Medical Vision-Language Pre-Training for Brain Abnormalities [96.1408455065347]
We show how to automatically collect medical image-text aligned data for pretraining from public resources such as PubMed.
In particular, we present a pipeline that streamlines the pre-training process by initially collecting a large brain image-text dataset.
We also investigate the unique challenge of mapping subfigures to subcaptions in the medical domain.
arXiv Detail & Related papers (2024-04-27T05:03:42Z) - Data-Centric Foundation Models in Computational Healthcare: A Survey [21.53211505568379]
Foundation models (FMs) as an emerging suite of AI techniques have struck a wave of opportunities in computational healthcare.
We discuss key perspectives in AI security, assessment, and alignment with human values.
We offer a promising outlook of FM-based analytics to enhance the performance of patient outcome and clinical workflow.
arXiv Detail & Related papers (2024-01-04T08:00:32Z) - Zero-Shot Medical Information Retrieval via Knowledge Graph Embedding [27.14794371879541]
This paper introduces MedFusionRank, a novel approach to zero-shot medical information retrieval (MIR)
The proposed approach leverages a pre-trained BERT-style model to extract compact yet informative keywords.
These keywords are then enriched with domain knowledge by linking them to conceptual entities within a medical knowledge graph.
arXiv Detail & Related papers (2023-10-31T16:26:33Z) - Applying Bayesian Ridge Regression AI Modeling in Virus Severity
Prediction [0.0]
We review the strengths and weaknesses of Bayesian Ridge Regression, an AI model that can be used to bring cutting edge virus analysis to healthcare professionals.
The model's accuracy assessment revealed promising results, with room for improvement.
In addition, the severity index serves as a valuable tool to gain a broad overview of patient care needs.
arXiv Detail & Related papers (2023-10-14T04:17:00Z) - PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering [56.25766322554655]
Medical Visual Question Answering (MedVQA) presents a significant opportunity to enhance diagnostic accuracy and healthcare delivery.
We propose a generative-based model for medical visual understanding by aligning visual information from a pre-trained vision encoder with a large language model.
We train the proposed model on PMC-VQA and then fine-tune it on multiple public benchmarks, e.g., VQA-RAD, SLAKE, and Image-Clef 2019.
arXiv Detail & Related papers (2023-05-17T17:50:16Z) - The Medkit-Learn(ing) Environment: Medical Decision Modelling through
Simulation [81.72197368690031]
We present a new benchmarking suite designed specifically for medical sequential decision making.
The Medkit-Learn(ing) Environment is a publicly available Python package providing simple and easy access to high-fidelity synthetic medical data.
arXiv Detail & Related papers (2021-06-08T10:38:09Z) - Privacy-preserving medical image analysis [53.4844489668116]
We present PriMIA, a software framework designed for privacy-preserving machine learning (PPML) in medical imaging.
We show significantly better classification performance of a securely aggregated federated learning model compared to human experts on unseen datasets.
We empirically evaluate the framework's security against a gradient-based model inversion attack.
arXiv Detail & Related papers (2020-12-10T13:56:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.