Identifying multi-omics interactions for lung cancer drug targets discovery using Kernel Machine Regression
- URL: http://arxiv.org/abs/2510.16093v1
- Date: Fri, 17 Oct 2025 17:13:39 GMT
- Title: Identifying multi-omics interactions for lung cancer drug targets discovery using Kernel Machine Regression
- Authors: Md. Imtyaz Ahmed, Md. Delwar Hossain, Md Mostafizer Rahman, Md. Ahsan Habib, Md. Mamunur Rashid, Md. Selim Reza, Md Ashad Alam,
- Abstract summary: Cancer exhibits diverse and complex phenotypes driven by multifaceted molecular interactions.<n>It is challenging to comprehend complex interactions among the features of multi-omics datasets compared to single omics.<n>We identified 38 genes significantly associated with lung cancer.
- Score: 0.41825304104307054
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Cancer exhibits diverse and complex phenotypes driven by multifaceted molecular interactions. Recent biomedical research has emphasized the comprehensive study of such diseases by integrating multi-omics datasets (genome, proteome, transcriptome, epigenome). This approach provides an efficient method for identifying genetic variants associated with cancer and offers a deeper understanding of how the disease develops and spreads. However, it is challenging to comprehend complex interactions among the features of multi-omics datasets compared to single omics. In this paper, we analyze lung cancer multi-omics datasets from The Cancer Genome Atlas (TCGA). Using four statistical methods, LIMMA, the T test, Canonical Correlation Analysis (CCA), and the Wilcoxon test, we identified differentially expressed genes across gene expression, DNA methylation, and miRNA expression data. We then integrated these multi-omics data using the Kernel Machine Regression (KMR) approach. Our findings reveal significant interactions among the three omics: gene expression, miRNA expression, and DNA methylation in lung cancer. From our data analysis, we identified 38 genes significantly associated with lung cancer. From our data analysis, we identified 38 genes significantly associated with lung cancer. Among these, eight genes of highest ranking (PDGFRB, PDGFRA, SNAI1, ID1, FGF11, TNXB, ITGB1, ZIC1) were highlighted by rigorous statistical analysis. Furthermore, in silico studies identified three top-ranked potential candidate drugs (Selinexor, Orapred, and Capmatinib) that could play a crucial role in the treatment of lung cancer. These proposed drugs are also supported by the findings of other independent studies, which underscore their potential efficacy in the fight against lung cancer.
Related papers
- A Novel Approach to Linking Histology Images with DNA Methylation [8.947503179743167]
Abnormal methylation patterns can disrupt gene expression and have been linked to cancer development.<n>We propose an end-to-end graph neural network based weakly supervised learning framework to predict the methylation state of gene groups exhibiting coherent patterns across samples.<n>We conduct gene set enrichment analyses on the gene groups and show that majority of the gene groups are significantly enriched in important hallmarks and pathways.
arXiv Detail & Related papers (2025-04-07T18:19:01Z) - Precision Cancer Classification and Biomarker Identification from mRNA Gene Expression via Dimensionality Reduction and Explainable AI [0.9423257767158634]
This research presents a comprehensive pipeline designed to accurately identify 33 distinct cancer types and their corresponding gene sets.
It incorporates a combination of normalization and feature selection techniques to reduce dataset dimensionality effectively.
We leverage Explainable AI to elucidate the biological significance of the identified cancer-specific genes.
arXiv Detail & Related papers (2024-10-08T18:56:31Z) - LASSO-MOGAT: A Multi-Omics Graph Attention Framework for Cancer Classification [41.94295877935867]
This paper introduces LASSO-MOGAT, a graph-based deep learning framework that integrates messenger RNA, microRNA, and DNA methylation data to classify 31 cancer types.
arXiv Detail & Related papers (2024-08-30T16:26:04Z) - Self-Normalizing Foundation Model for Enhanced Multi-Omics Data Analysis in Oncology [0.0]
SeNMo is a foundation model that has been trained on multi-omics data across 33 cancer types.
We trained SeNMo for the task of overall survival of patients using pan-cancer multi-omics data involving 33 cancer sites.
SeNMo was validated on two independent cohorts: Moffitt Cancer Center and CPTAC lung squamous cell carcinoma.
arXiv Detail & Related papers (2024-05-13T22:45:44Z) - Exploring Gene Regulatory Interaction Networks and predicting
therapeutic molecules for Hypopharyngeal Cancer and EGFR-mutated lung
adenocarcinoma [5.178086150698542]
In this study, we select EGFR-mutated lung adenocarcinoma and Hypopharyngeal cancer by finding the metastases in hypopharyngeal cancer.
Our research findings have suggested common therapeutic molecules for the selected diseases based on 10 hub genes.
arXiv Detail & Related papers (2024-02-27T11:29:36Z) - Gene-MOE: A sparsely gated prognosis and classification framework
exploiting pan-cancer genomic information [13.57379781623848]
We introduce a novel sparsely gated RNA-seq analysis framework called Gene-MOE.
Gene-MOE exploits the potential of the MOE layers and the proposed mixture of attention expert layers to enhance the analysis accuracy.
It addresses overfitting challenges by integrating pan-cancer information from 33 distinct cancer types through pre-training.
arXiv Detail & Related papers (2023-11-29T07:09:25Z) - A new algorithm for Subgroup Set Discovery based on Information Gain [58.720142291102135]
Information Gained Subgroup Discovery (IGSD) is a new SD algorithm for pattern discovery.
We compare IGSD with two state-of-the-art SD algorithms: FSSD and SSD++.
IGSD provides better OR values than FSSD and SSD++, stating a higher dependence between patterns and targets.
arXiv Detail & Related papers (2023-07-26T21:42:34Z) - Pathology-and-genomics Multimodal Transformer for Survival Outcome
Prediction [43.1748594898772]
We propose a multimodal transformer (PathOmics) integrating pathology and genomics insights into colon-related cancer survival prediction.
We emphasize the unsupervised pretraining to capture the intrinsic interaction between tissue microenvironments in gigapixel whole slide images.
We evaluate our approach on both TCGA colon and rectum cancer cohorts, showing that the proposed approach is competitive and outperforms state-of-the-art studies.
arXiv Detail & Related papers (2023-07-22T00:59:26Z) - Machine Learning Methods for Cancer Classification Using Gene Expression
Data: A Review [77.34726150561087]
Cancer is the second major cause of death after cardiovascular diseases.
Gene expression can play a fundamental role in the early detection of cancer.
This study reviews recent progress in gene expression analysis for cancer classification using machine learning methods.
arXiv Detail & Related papers (2023-01-28T15:03:03Z) - Unsupervised ensemble-based phenotyping helps enhance the
discoverability of genes related to heart morphology [57.25098075813054]
We propose a new framework for gene discovery entitled Un Phenotype Ensembles.
It builds a redundant yet highly expressive representation by pooling a set of phenotypes learned in an unsupervised manner.
These phenotypes are then analyzed via (GWAS), retaining only highly confident and stable associations.
arXiv Detail & Related papers (2023-01-07T18:36:44Z) - Cancer Gene Profiling through Unsupervised Discovery [49.28556294619424]
We introduce a novel, automatic and unsupervised framework to discover low-dimensional gene biomarkers.
Our method is based on the LP-Stability algorithm, a high dimensional center-based unsupervised clustering algorithm.
Our signature reports promising results on distinguishing immune inflammatory and immune desert tumors.
arXiv Detail & Related papers (2021-02-11T09:04:45Z) - Topological Data Analysis of copy number alterations in cancer [70.85487611525896]
We explore the potential to capture information contained in cancer genomic information using a novel topology-based approach.
We find that this technique has the potential to extract meaningful low-dimensional representations in cancer somatic genetic data.
arXiv Detail & Related papers (2020-11-22T17:31:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.