Cardea: An Open Automated Machine Learning Framework for Electronic
Health Records
- URL: http://arxiv.org/abs/2010.00509v1
- Date: Thu, 1 Oct 2020 15:58:13 GMT
- Title: Cardea: An Open Automated Machine Learning Framework for Electronic
Health Records
- Authors: Sarah Alnegheimish, Najat Alrashed, Faisal Aleissa, Shahad Althobaiti,
Dongyu Liu, Mansour Alsaleh and Kalyan Veeramachaneni
- Abstract summary: Cardea is an open-source automated machine learning framework.
It allows users to build predictive models with their own data.
We demonstrate our framework via 5 prediction tasks on MIMIC-III and Kaggle datasets.
- Score: 11.170152156043336
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: An estimated 180 papers focusing on deep learning and EHR were published
between 2010 and 2018. Despite the common workflow structure appearing in these
publications, no trusted and verified software framework exists, forcing
researchers to arduously repeat previous work. In this paper, we propose
Cardea, an extensible open-source automated machine learning framework
encapsulating common prediction problems in the health domain and allows users
to build predictive models with their own data. This system relies on two
components: Fast Healthcare Interoperability Resources (FHIR) -- a standardized
data structure for electronic health systems -- and several AUTOML frameworks
for automated feature engineering, model selection, and tuning. We augment
these components with an adaptive data assembler and comprehensive data- and
model- auditing capabilities. We demonstrate our framework via 5 prediction
tasks on MIMIC-III and Kaggle datasets, which highlight Cardea's human
competitiveness, flexibility in problem definition, extensive feature
generation capability, adaptable automatic data assembler, and its usability.
Related papers
- AIDE: An Automatic Data Engine for Object Detection in Autonomous Driving [68.73885845181242]
We propose an Automatic Data Engine (AIDE) that automatically identifies issues, efficiently curates data, improves the model through auto-labeling, and verifies the model through generation of diverse scenarios.
We further establish a benchmark for open-world detection on AV datasets to comprehensively evaluate various learning paradigms, demonstrating our method's superior performance at a reduced cost.
arXiv Detail & Related papers (2024-03-26T04:27:56Z) - Quantitative Assurance and Synthesis of Controllers from Activity
Diagrams [4.419843514606336]
Probabilistic model checking is a widely used formal verification technique to automatically verify qualitative and quantitative properties.
This makes it not accessible for researchers and engineers who may not have the required knowledge.
We propose a comprehensive verification framework for ADs, including a new profile for probability time, quality annotations, a semantics interpretation of ADs in three Markov models, and a set of transformation rules from activity diagrams to the PRISM language.
Most importantly, we developed algorithms for transformation and implemented them in a tool, called QASCAD, using model-based techniques, for fully automated verification.
arXiv Detail & Related papers (2024-02-29T22:40:39Z) - TSGM: A Flexible Framework for Generative Modeling of Synthetic Time Series [61.436361263605114]
Time series data are often scarce or highly sensitive, which precludes the sharing of data between researchers and industrial organizations.
We introduce Time Series Generative Modeling (TSGM), an open-source framework for the generative modeling of synthetic time series.
arXiv Detail & Related papers (2023-05-19T10:11:21Z) - ORKG-Leaderboards: A Systematic Workflow for Mining Leaderboards as a
Knowledge Graph [0.0]
Orkg-Leaderboard is designed to extract leaderboards from large collections of empirical research papers in Artificial Intelligence (AI)
The system is integrated with the Open Research Knowledge Graph (ORKG) platform, which fosters the machine-actionable publishing of findings.
Our best model performs above 90% F1 on the textitleaderboard extraction task, thus proving Orkg-Leaderboards a practically viable tool for real-world usage.
arXiv Detail & Related papers (2023-05-10T13:19:18Z) - SOLIS -- The MLOps journey from data acquisition to actionable insights [62.997667081978825]
In this paper we present a unified deployment pipeline and freedom-to-operate approach that supports all requirements while using basic cross-platform tensor framework and script language engines.
This approach however does not supply the needed procedures and pipelines for the actual deployment of machine learning capabilities in real production grade systems.
arXiv Detail & Related papers (2021-12-22T14:45:37Z) - Merging Subject Matter Expertise and Deep Convolutional Neural Network
for State-Based Online Machine-Part Interaction Classification [5.216662889312795]
Machine-part interaction classification is a key capability required by Cyber-Physical Systems (CPS), a pivotal enabler of Smart Manufacturing (SM)
In this work, we address point detection and time series classification for machine-part interactions with a deep Convolutional Neural Network (CNN) based framework.
arXiv Detail & Related papers (2021-12-08T20:23:13Z) - AEFE: Automatic Embedded Feature Engineering for Categorical Features [4.310748698480341]
We propose an automatic feature engineering framework for representing categorical features, which consists of various components including custom paradigm feature construction and multiple feature selection.
Experiments conducted on some typical e-commerce datasets indicate that our method outperforms the classical machine learning models and state-of-the-art deep learning models.
arXiv Detail & Related papers (2021-10-19T07:22:59Z) - CateCom: a practical data-centric approach to categorization of
computational models [77.34726150561087]
We present an effort aimed at organizing the landscape of physics-based and data-driven computational models.
We apply object-oriented design concepts and outline the foundations of an open-source collaborative framework.
arXiv Detail & Related papers (2021-09-28T02:59:40Z) - Families In Wild Multimedia: A Multimodal Database for Recognizing
Kinship [63.27052967981546]
We introduce the first publicly available multi-task MM kinship dataset.
To build FIW MM, we developed machinery to automatically collect, annotate, and prepare the data.
Results highlight edge cases to inspire future research with different areas of improvement.
arXiv Detail & Related papers (2020-07-28T22:36:57Z) - Towards Automated Neural Interaction Discovery for Click-Through Rate
Prediction [64.03526633651218]
Click-Through Rate (CTR) prediction is one of the most important machine learning tasks in recommender systems.
We propose an automated interaction architecture discovering framework for CTR prediction named AutoCTR.
arXiv Detail & Related papers (2020-06-29T04:33:01Z) - A Unified Architecture for Data-Driven Metadata Tagging of Building
Automation Systems [0.0]
This article presents a Unified Architecture for automated point tagging of Building Automation System data.
We propose a UA that automates the process of point tagging by leveraging the data accessible through connection to the BAS.
The proposed methodology correctly applied 85-90 percent and 70-75 percent of the tags in each of these test scenarios.
arXiv Detail & Related papers (2020-02-27T00:35:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.