Classification of Long Noncoding RNA Elements Using Deep Convolutional
Neural Networks and Siamese Networks
- URL: http://arxiv.org/abs/2102.05582v1
- Date: Wed, 10 Feb 2021 17:26:38 GMT
- Title: Classification of Long Noncoding RNA Elements Using Deep Convolutional
Neural Networks and Siamese Networks
- Authors: Brian McClannahan, Cucong Zhong, Guanghui Wang
- Abstract summary: This thesis proposes a new methodemploying deep convolutional neural networks (CNNs) to classifyncRNA sequences.
As a result, clas-sifying RNA sequences is converted to an image classificationproblem that can be efficiently solved by CNN-basedclassification models.
- Score: 17.8181080354116
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: In the last decade, the discovery of noncoding RNA(ncRNA) has exploded.
Classifying these ncRNA is critical todetermining their function. This thesis
proposes a new methodemploying deep convolutional neural networks (CNNs) to
classifyncRNA sequences. To this end, this paper first proposes anefficient
approach to convert the RNA sequences into imagescharacterizing their
base-pairing probability. As a result, clas-sifying RNA sequences is converted
to an image classificationproblem that can be efficiently solved by available
CNN-basedclassification models. This research also considers the
foldingpotential of the ncRNAs in addition to their primary sequence.Based on
the proposed approach, a benchmark image classifi-cation dataset is generated
from the RFAM database of ncRNAsequences. In addition, three classical CNN
models and threeSiamese network models have been implemented and comparedto
demonstrate the superior performance and efficiency of theproposed approach.
Extensive experimental results show thegreat potential of using deep learning
approaches for RNAclassification.
Related papers
- RNACG: A Universal RNA Sequence Conditional Generation model based on Flow-Matching [0.0]
We develop a universal RNA sequence generation model based on flow matching, namely RNACG.
RNACG can accommodate various conditional inputs and is portable, enabling users to customize the encoding network for conditional inputs.
RNACG exhibits extensive applicability in sequence generation and property prediction tasks.
arXiv Detail & Related papers (2024-07-29T09:46:46Z) - BEACON: Benchmark for Comprehensive RNA Tasks and Language Models [60.02663015002029]
We introduce the first comprehensive RNA benchmark BEACON (textbfBEnchmtextbfArk for textbfCOmprehensive RtextbfNA Task and Language Models).
First, BEACON comprises 13 distinct tasks derived from extensive previous work covering structural analysis, functional studies, and engineering applications.
Second, we examine a range of models, including traditional approaches like CNNs, as well as advanced RNA foundation models based on language models, offering valuable insights into the task-specific performances of these models.
Third, we investigate the vital RNA language model components
arXiv Detail & Related papers (2024-06-14T19:39:19Z) - RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching [7.600990806121113]
RNAFlow is a flow matching model for protein-conditioned RNA sequence-structure design.
Its denoising network integrates an RNA inverse folding model and a pre-trained RosettaFold2NA network for generation of RNA sequences and structures.
arXiv Detail & Related papers (2024-05-29T05:10:25Z) - RDesign: Hierarchical Data-efficient Representation Learning for
Tertiary Structure-based RNA Design [65.41144149958208]
This study aims to systematically construct a data-driven RNA design pipeline.
We crafted a benchmark dataset and designed a comprehensive structural modeling approach to represent the complex RNA tertiary structure.
We incorporated extracted secondary structures with base pairs as prior knowledge to facilitate the RNA design process.
arXiv Detail & Related papers (2023-01-25T17:19:49Z) - Deciphering RNA Secondary Structure Prediction: A Probabilistic K-Rook Matching Perspective [63.3632827588974]
We introduce RFold, a method that learns to predict the most matching K-Rook solution from the given sequence.
RFold achieves competitive performance and about eight times faster inference efficiency than state-of-the-art approaches.
arXiv Detail & Related papers (2022-12-02T16:34:56Z) - E2Efold-3D: End-to-End Deep Learning Method for accurate de novo RNA 3D
Structure Prediction [46.38735421190187]
We develop the first end-to-end deep learning approach, E2Efold-3D, to accurately perform the textitde novo RNA structure prediction.
Several novel components are proposed to overcome the data scarcity, such as a fully-differentiable end-to-end pipeline, secondary structure-assisted self-distillation, and parameter-efficient backbone formulation.
arXiv Detail & Related papers (2022-07-04T17:15:35Z) - Improving RNA Secondary Structure Design using Deep Reinforcement
Learning [69.63971634605797]
We propose a new benchmark of applying reinforcement learning to RNA sequence design, in which the objective function is defined to be the free energy in the sequence's secondary structure.
We show results of the ablation analysis that we do for these algorithms, as well as graphs indicating the algorithm's performance across batches.
arXiv Detail & Related papers (2021-11-05T02:54:06Z) - TargetNet: Functional microRNA Target Prediction with Deep Neural
Networks [16.94702979550834]
We introduce TargetNet, a novel deep learning-based algorithm for functional target prediction.
The proposed model was trained with components-CTS pair datasets and evaluated with residual-mRNA pair datasets.
arXiv Detail & Related papers (2021-07-23T07:31:23Z) - Classification of Noncoding RNA Elements Using Deep Convolutional Neural
Networks [16.650383406063956]
The paper proposes to employ deep convolutional neural networks (CNNs) to classify noncoding RNA (ncRNA) sequences.
We first propose an efficient approach to convert the RNA sequences into images characterizing their base-pairing probability.
As a result, classifying RNA sequences is converted to an image classification problem that can be efficiently solved by available CNN-based classification models.
arXiv Detail & Related papers (2020-08-24T17:43:50Z) - Deep Belief Network based representation learning for lncRNA-disease
association prediction [0.0]
Accurately identifying lncRNA-disease association is essential in understanding lncRNA functionality and disease mechanism.
Deep belief networks (DBN) are recently used in biological network analysis to learn the latent representations of network features.
In this paper, we propose a DBN based lncRNA-disease association prediction model (DBNLDA) from lncRNA, disease and interactions.
arXiv Detail & Related papers (2020-06-22T18:05:28Z) - RNA Secondary Structure Prediction By Learning Unrolled Algorithms [70.09461537906319]
In this paper, we propose an end-to-end deep learning model, called E2Efold, for RNA secondary structure prediction.
The key idea of E2Efold is to directly predict the RNA base-pairing matrix, and use an unrolled algorithm for constrained programming as the template for deep architectures to enforce constraints.
With comprehensive experiments on benchmark datasets, we demonstrate the superior performance of E2Efold.
arXiv Detail & Related papers (2020-02-13T23:21:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.