Multi-Instance Multi-Label Learning for Gene Mutation Prediction in
Hepatocellular Carcinoma
- URL: http://arxiv.org/abs/2005.04073v1
- Date: Fri, 8 May 2020 14:47:25 GMT
- Title: Multi-Instance Multi-Label Learning for Gene Mutation Prediction in
Hepatocellular Carcinoma
- Authors: Kaixin Xu, Ziyuan Zhao, Jiapan Gu, Zeng Zeng, Chan Wan Ying, Lim Kheng
Choon, Thng Choon Hua, Pierce KH Chow
- Abstract summary: Gene mutation prediction in carcinoma (HCC) is of great diagnostic and prognostic value for personalized treatments and precision medicine.
In this paper, we tackle this problem with multi-instance multi-label learning to address the difficulties on label correlations, label representations, etc.
- Score: 5.234375382973767
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Gene mutation prediction in hepatocellular carcinoma (HCC) is of great
diagnostic and prognostic value for personalized treatments and precision
medicine. In this paper, we tackle this problem with multi-instance multi-label
learning to address the difficulties on label correlations, label
representations, etc. Furthermore, an effective oversampling strategy is
applied for data imbalance. Experimental results have shown the superiority of
the proposed approach.
Related papers
- Survey and Improvement Strategies for Gene Prioritization with Large Language Models [61.24568051916653]
Large language models (LLMs) have performed well in medical exams, but their effectiveness in diagnosing rare genetic diseases has not been assessed.
We used multi-agent and Human Phenotype Ontology (HPO) classification to categorized patients based on phenotypes and solvability levels.
At baseline, GPT-4 outperformed other LLMs, achieving near 30% accuracy in ranking causal genes correctly.
arXiv Detail & Related papers (2025-01-30T23:03:03Z) - Multivariate Feature Selection and Autoencoder Embeddings of Ovarian Cancer Clinical and Genetic Data [2.973561339858947]
This study explores a data-driven approach to discovering novel clinical and genetic markers in ovarian cancer (OC)
In the autoencoder analysis, a clearer pattern emerged when using clinical features and the combination of clinical and genetic data.
Key clinical variables (such as type of surgery and neoadjuvant chemotherapy) and certain gene mutations showed strong relevance, along with low-risk genetic factors.
arXiv Detail & Related papers (2025-01-27T09:07:07Z) - MMIL: A novel algorithm for disease associated cell type discovery [58.044870442206914]
Single-cell datasets often lack individual cell labels, making it challenging to identify cells associated with disease.
We introduce Mixture Modeling for Multiple Learning Instance (MMIL), an expectation method that enables the training and calibration of cell-level classifiers.
arXiv Detail & Related papers (2024-06-12T15:22:56Z) - Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification [119.13058298388101]
We develop a Biological-knowledge enhanced PathGenomic multi-label Transformer to improve genetic mutation prediction performances.
BPGT first establishes a novel gene encoder that constructs gene priors by two carefully designed modules.
BPGT then designs a label decoder that finally performs genetic mutation prediction by two tailored modules.
arXiv Detail & Related papers (2024-06-05T06:42:27Z) - scBeacon: single-cell biomarker extraction via identifying paired cell
clusters across biological conditions with contrastive siamese networks [0.9591674293850556]
scBeacon is a framework built upon a deep contrastive siamese network.
scBeacon adeptly identifies matched cell populations across varied conditions.
Comprehensive evaluations validate scBeacon's superiority over existing single-cell differential gene analysis tools.
arXiv Detail & Related papers (2023-11-05T08:27:24Z) - Functional Integrative Bayesian Analysis of High-dimensional
Multiplatform Genomic Data [0.8029049649310213]
We propose a framework called Functional Integrative Bayesian Analysis of High-dimensional Multiplatform Genomic Data (fiBAG)
fiBAG allows simultaneous identification of upstream functional evidence of proteogenomic biomarkers.
We demonstrate the profitability of fiBAG via a pan-cancer analysis of 14 cancer types.
arXiv Detail & Related papers (2022-12-29T03:31:45Z) - Benchmarking Machine Learning Robustness in Covid-19 Genome Sequence
Classification [109.81283748940696]
We introduce several ways to perturb SARS-CoV-2 genome sequences to mimic the error profiles of common sequencing platforms such as Illumina and PacBio.
We show that some simulation-based approaches are more robust (and accurate) than others for specific embedding methods to certain adversarial attacks to the input sequences.
arXiv Detail & Related papers (2022-07-18T19:16:56Z) - Cancer Gene Profiling through Unsupervised Discovery [49.28556294619424]
We introduce a novel, automatic and unsupervised framework to discover low-dimensional gene biomarkers.
Our method is based on the LP-Stability algorithm, a high dimensional center-based unsupervised clustering algorithm.
Our signature reports promising results on distinguishing immune inflammatory and immune desert tumors.
arXiv Detail & Related papers (2021-02-11T09:04:45Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Multi-Phase Cross-modal Learning for Noninvasive Gene Mutation
Prediction in Hepatocellular Carcinoma [7.621860963237023]
We propose an end-to-end deep learning framework for mutation prediction in APOB, COL11A1 and ATRX genes using multiphasic CT scans.
arXiv Detail & Related papers (2020-05-08T14:36:59Z) - Two-Stage Penalized Regression Screening to Detect Biomarker-Treatment
Interactions in Randomized Clinical Trials [0.0]
High-dimensional biomarkers such as genomics are increasingly being measured in randomized clinical trials.
We adapt recently proposed two-stage interaction detecting procedures in the setting of randomized clinical trials.
We also propose a new stage 1 multivariate screening strategy using ridge regression to account for correlations among biomarkers.
arXiv Detail & Related papers (2020-04-25T00:50:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.