A field guide to cultivating computational biology
- URL: http://arxiv.org/abs/2104.11364v1
- Date: Fri, 23 Apr 2021 01:24:21 GMT
- Title: A field guide to cultivating computational biology
- Authors: Anne E Carpenter, Casey S Greene, Piero Carnici, Benilton S Carvalho,
Michiel de Hoon, Stacey Finley, Kim-Anh Le Cao, Jerry SH Lee, Luigi
Marchionni, Suzanne Sindi, Fabian J Theis, Gregory P Way, Jean YH Yang, Elana
J Fertig
- Abstract summary: Biomedical research centers can empower basic discovery and novel therapeutic strategies by leveraging their large-scale datasets from experiments and patients.
This data, together with new technologies to create and analyze it, has ushered in an era of data-driven discovery which requires moving beyond the traditional individual, single-discipline investigator research model.
We propose solutions for individual scientists, institutions, journal publishers, funding agencies, and educators.
- Score: 1.040598660564506
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Biomedical research centers can empower basic discovery and novel therapeutic
strategies by leveraging their large-scale datasets from experiments and
patients. This data, together with new technologies to create and analyze it,
has ushered in an era of data-driven discovery which requires moving beyond the
traditional individual, single-discipline investigator research model. This
interdisciplinary niche is where computational biology thrives. It has matured
over the past three decades and made major contributions to scientific
knowledge and human health, yet researchers in the field often languish in
career advancement, publication, and grant review. We propose solutions for
individual scientists, institutions, journal publishers, funding agencies, and
educators.
Related papers
- Two Heads Are Better Than One: A Multi-Agent System Has the Potential to Improve Scientific Idea Generation [48.29699224989952]
VirSci organizes a team of agents to collaboratively generate, evaluate, and refine research ideas.
We show that this multi-agent approach outperforms the state-of-the-art method in producing novel and impactful scientific ideas.
arXiv Detail & Related papers (2024-10-12T07:16:22Z) - A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery [68.48094108571432]
Large language models (LLMs) have revolutionized the way text and other modalities of data are handled.
We aim to provide a more holistic view of the research landscape by unveiling cross-field and cross-modal connections between scientific LLMs.
arXiv Detail & Related papers (2024-06-16T08:03:24Z) - ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models [56.08917291606421]
ResearchAgent is a large language model-powered research idea writing agent.
It generates problems, methods, and experiment designs while iteratively refining them based on scientific literature.
We experimentally validate our ResearchAgent on scientific publications across multiple disciplines.
arXiv Detail & Related papers (2024-04-11T13:36:29Z) - Progress and Opportunities of Foundation Models in Bioinformatics [77.74411726471439]
Foundations models (FMs) have ushered in a new era in computational biology, especially in the realm of deep learning.
Central to our focus is the application of FMs to specific biological problems, aiming to guide the research community in choosing appropriate FMs for their research needs.
Review analyses challenges and limitations faced by FMs in biology, such as data noise, model explainability, and potential biases.
arXiv Detail & Related papers (2024-02-06T02:29:17Z) - SciOps: Achieving Productivity and Reliability in Data-Intensive Research [0.8414742293641504]
Scientists are increasingly leveraging advances in instruments, automation, and collaborative tools to scale up their experiments and research goals.
Various scientific disciplines, including neuroscience, have adopted key technologies to enhance collaboration, inspiration and automation.
We introduce a five-level Capability Maturity Model describing the principles of rigorous scientific operations.
arXiv Detail & Related papers (2023-12-29T21:37:22Z) - ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab [67.24684071577211]
The challenge of replicating research results has posed a significant impediment to the field of molecular biology.
We first curate a comprehensive multimodal dataset, named ProBio, as an initial step towards this objective.
Next, we devise two challenging benchmarks, transparent solution tracking and multimodal action recognition, to emphasize the unique characteristics and difficulties associated with activity understanding in BioLab settings.
arXiv Detail & Related papers (2023-11-01T14:44:01Z) - How Data Scientists Review the Scholarly Literature [4.406926847270567]
We examine the literature review practices of data scientists.
Data science represents a field seeing an exponential rise in papers.
No prior work has examined the specific practices and challenges faced by these scientists.
arXiv Detail & Related papers (2023-01-10T03:53:05Z) - Research on Domain Information Mining and Theme Evolution of Scientific
Papers [5.747583451398117]
Cross-disciplinary research results have gradually become an emerging frontier research direction.
How to effectively use the huge number of scientific papers to help researchers becomes a challenge.
arXiv Detail & Related papers (2022-04-18T14:36:17Z) - Diversifying the Genomic Data Science Research Community [22.633385577446617]
We have formed the Genomic Data Science Community Network to identify opportunities and support broadening access to cloud-enabled genomic data science.
Here, we provide a summary of the priorities for faculty members at UIs, as well as administrators, funders, and R1 researchers to consider as we create a more diverse genomic data science community.
arXiv Detail & Related papers (2022-01-20T20:36:18Z) - A Search Engine for Discovery of Biomedical Challenges and Directions [38.72769142277108]
We construct and release an expert-annotated corpus of texts sampled from full-length papers.
We focus on a large corpus of interdisciplinary work relating to the COVID-19 pandemic.
We apply a model trained on our data to identify challenges and directions across the corpus and build a dedicated search engine for this information.
arXiv Detail & Related papers (2021-08-31T11:08:20Z) - Machine Learning Applications for Therapeutic Tasks with Genomics Data [49.98249191161107]
We review the literature on machine learning applications for genomics through the lens of therapeutic development.
We identify twenty-two machine learning in genomics applications across the entire therapeutics pipeline.
We pinpoint seven important challenges in this field with opportunities for expansion and impact.
arXiv Detail & Related papers (2021-05-03T21:20:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.