Review of Machine-Learning Methods for RNA Secondary Structure
Prediction
- URL: http://arxiv.org/abs/2009.08868v1
- Date: Tue, 1 Sep 2020 03:17:15 GMT
- Title: Review of Machine-Learning Methods for RNA Secondary Structure
Prediction
- Authors: Qi Zhao, Zheng Zhao, Xiaoya Fan, Zhengwei Yuan, Qian Mao, Yudong Yao
- Abstract summary: We provide a comprehensive overview of RNA secondary structure prediction methods based on machine-learning technologies.
The current pending issues in the field of RNA secondary structure prediction and future trends are also discussed.
- Score: 21.3539253580504
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Secondary structure plays an important role in determining the function of
non-coding RNAs. Hence, identifying RNA secondary structures is of great value
to research. Computational prediction is a mainstream approach for predicting
RNA secondary structure. Unfortunately, even though new methods have been
proposed over the past 40 years, the performance of computational prediction
methods has stagnated in the last decade. Recently, with the increasing
availability of RNA structure data, new methods based on machine-learning
technologies, especially deep learning, have alleviated the issue. In this
review, we provide a comprehensive overview of RNA secondary structure
prediction methods based on machine-learning technologies and a tabularized
summary of the most important methods in this field. The current pending issues
in the field of RNA secondary structure prediction and future trends are also
discussed.
Related papers
- Predicting Distance matrix with large language models [1.8855270809505869]
RNA structure prediction remains a significant challenge due to data limitations.
Traditional methods such as nuclear magnetic resonance spectroscopy, Xray crystallography, and electron microscopy are expensive and time consuming.
Distance maps provide a simplified representation of spatial constraints between nucleotides, capturing essential relationships without requiring a full 3D model.
arXiv Detail & Related papers (2024-09-24T10:28:55Z) - BEACON: Benchmark for Comprehensive RNA Tasks and Language Models [60.02663015002029]
We introduce the first comprehensive RNA benchmark BEACON (textbfBEnchmtextbfArk for textbfCOmprehensive RtextbfNA Task and Language Models).
First, BEACON comprises 13 distinct tasks derived from extensive previous work covering structural analysis, functional studies, and engineering applications.
Second, we examine a range of models, including traditional approaches like CNNs, as well as advanced RNA foundation models based on language models, offering valuable insights into the task-specific performances of these models.
Third, we investigate the vital RNA language model components
arXiv Detail & Related papers (2024-06-14T19:39:19Z) - RNA Secondary Structure Prediction Using Transformer-Based Deep Learning Models [13.781096813376145]
The Human Genome Project has led to an exponential increase in data related to the sequence, structure, and function of biomolecules.
This paper discusses the fundamental concepts of RNA, RNA secondary structure, and its prediction.
The application of machine learning technologies in predicting the structure of biological macromolecules is explored.
arXiv Detail & Related papers (2024-04-14T08:36:14Z) - Rethinking Performance Measures of RNA Secondary Structure Problems [42.25267871026153]
Deep learning methods have surpassed traditional algorithms by predicting complex features like pseudoknots and multi-interacting base pairs.
We propose the Weisfeiler-Lehman graph kernel (WL) as an alternative metric.
arXiv Detail & Related papers (2023-12-04T08:46:24Z) - RDesign: Hierarchical Data-efficient Representation Learning for
Tertiary Structure-based RNA Design [65.41144149958208]
This study aims to systematically construct a data-driven RNA design pipeline.
We crafted a benchmark dataset and designed a comprehensive structural modeling approach to represent the complex RNA tertiary structure.
We incorporated extracted secondary structures with base pairs as prior knowledge to facilitate the RNA design process.
arXiv Detail & Related papers (2023-01-25T17:19:49Z) - Deciphering RNA Secondary Structure Prediction: A Probabilistic K-Rook Matching Perspective [63.3632827588974]
We introduce RFold, a method that learns to predict the most matching K-Rook solution from the given sequence.
RFold achieves competitive performance and about eight times faster inference efficiency than state-of-the-art approaches.
arXiv Detail & Related papers (2022-12-02T16:34:56Z) - E2Efold-3D: End-to-End Deep Learning Method for accurate de novo RNA 3D
Structure Prediction [46.38735421190187]
We develop the first end-to-end deep learning approach, E2Efold-3D, to accurately perform the textitde novo RNA structure prediction.
Several novel components are proposed to overcome the data scarcity, such as a fully-differentiable end-to-end pipeline, secondary structure-assisted self-distillation, and parameter-efficient backbone formulation.
arXiv Detail & Related papers (2022-07-04T17:15:35Z) - Computational prediction of RNA tertiary structures using machine
learning methods [14.35527588241679]
Computational prediction approaches can help to understand RNA structures and their stabilizing factors.
Although their usage in protein-related fields has a long history, the use of machine learning methods in predicting RNA tertiary structures is new and rare.
arXiv Detail & Related papers (2020-09-03T04:01:43Z) - RNA Secondary Structure Prediction By Learning Unrolled Algorithms [70.09461537906319]
In this paper, we propose an end-to-end deep learning model, called E2Efold, for RNA secondary structure prediction.
The key idea of E2Efold is to directly predict the RNA base-pairing matrix, and use an unrolled algorithm for constrained programming as the template for deep architectures to enforce constraints.
With comprehensive experiments on benchmark datasets, we demonstrate the superior performance of E2Efold.
arXiv Detail & Related papers (2020-02-13T23:21:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.