A field guide to cultivating computational biology
- URL: http://arxiv.org/abs/2104.11364v1
- Date: Fri, 23 Apr 2021 01:24:21 GMT
- Title: A field guide to cultivating computational biology
- Authors: Anne E Carpenter, Casey S Greene, Piero Carninci, Benilton S Carvalho,
Michiel de Hoon, Stacey Finley, Kim-Anh Le Cao, Jerry SH Lee, Luigi
Marchionni, Suzanne Sindi, Fabian J Theis, Gregory P Way, Jean YH Yang, Elana
J Fertig
- Abstract summary: Biomedical research centers can empower basic discovery and novel therapeutic strategies by leveraging their large-scale datasets from experiments and patients.
This data, together with new technologies to create and analyze it, has ushered in an era of data-driven discovery which requires moving beyond the traditional individual, single-discipline investigator research model.
We propose solutions for individual scientists, institutions, journal publishers, funding agencies, and educators.
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Biomedical research centers can empower basic discovery and novel therapeutic
strategies by leveraging their large-scale datasets from experiments and
patients. This data, together with new technologies to create and analyze it,
has ushered in an era of data-driven discovery which requires moving beyond the
traditional individual, single-discipline investigator research model. This
interdisciplinary niche is where computational biology thrives. It has matured
over the past three decades and made major contributions to scientific
knowledge and human health, yet researchers in the field often languish in
career advancement, publication, and grant review. We propose solutions for
individual scientists, institutions, journal publishers, funding agencies, and
educators.
Related papers
- Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation [58.064940977804596]
A plethora of new AI models and tools has been proposed, promising to empower researchers and academics worldwide to conduct their research more effectively and efficiently.
Ethical concerns regarding shortcomings of these tools and potential for misuse take a particularly prominent place in our discussion.
arXiv Detail & Related papers (2025-02-07T18:26:45Z)
- Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System [62.832818186789545]
Virtual Scientists (VirSci) is a multi-agent system designed to mimic the teamwork inherent in scientific research.
VirSci organizes a team of agents to collaboratively generate, evaluate, and refine research ideas.
We show that this multi-agent approach outperforms the state-of-the-art method in producing novel scientific ideas.
arXiv Detail & Related papers (2024-10-12T07:16:22Z)
- A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery [68.48094108571432]
Large language models (LLMs) have revolutionized the way text and other modalities of data are handled.
We aim to provide a more holistic view of the research landscape by unveiling cross-field and cross-modal connections between scientific LLMs.
arXiv Detail & Related papers (2024-06-16T08:03:24Z)
- ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models [56.08917291606421]
ResearchAgent is an AI-based system for ideation and operationalization of novel work.
ResearchAgent automatically defines novel problems, proposes methods and designs experiments, while iteratively refining them.
We experimentally validate our ResearchAgent on scientific publications across multiple disciplines.
arXiv Detail & Related papers (2024-04-11T13:36:29Z)
- SciOps: Achieving Productivity and Reliability in Data-Intensive Research [0.8414742293641504]
Scientists are increasingly leveraging advances in instruments, automation, and collaborative tools to scale up their experiments and research goals.
Various scientific disciplines, including neuroscience, have adopted key technologies to enhance collaboration, inspiration and automation.
We introduce a five-level Capability Maturity Model describing the principles of rigorous scientific operations.
arXiv Detail & Related papers (2023-12-29T21:37:22Z)
- ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab [67.24684071577211]
The challenge of replicating research results has posed a significant impediment to the field of molecular biology.
We first curate a comprehensive multimodal dataset, named ProBio, as an initial step towards this objective.
Next, we devise two challenging benchmarks, transparent solution tracking and multimodal action recognition, to emphasize the unique characteristics and difficulties associated with activity understanding in BioLab settings.
arXiv Detail & Related papers (2023-11-01T14:44:01Z)
- How Data Scientists Review the Scholarly Literature [4.406926847270567]
We examine the literature review practices of data scientists.
Data science represents a field seeing an exponential rise in papers.
No prior work has examined the specific practices and challenges faced by these scientists.
arXiv Detail & Related papers (2023-01-10T03:53:05Z)
- Research on Domain Information Mining and Theme Evolution of Scientific Papers [5.747583451398117]
Cross-disciplinary research results have gradually become an emerging frontier research direction.
How to effectively use the huge number of scientific papers to help researchers becomes a challenge.
arXiv Detail & Related papers (2022-04-18T14:36:17Z)
- Diversifying the Genomic Data Science Research Community [22.633385577446617]
We have formed the Genomic Data Science Community Network to identify opportunities and support broadening access to cloud-enabled genomic data science.
Here, we provide a summary of the priorities for faculty members at UIs, as well as administrators, funders, and R1 researchers to consider as we create a more diverse genomic data science community.
arXiv Detail & Related papers (2022-01-20T20:36:18Z)
- A Search Engine for Discovery of Biomedical Challenges and Directions [38.72769142277108]
We construct and release an expert-annotated corpus of texts sampled from full-length papers.
We focus on a large corpus of interdisciplinary work relating to the COVID-19 pandemic.
We apply a model trained on our data to identify challenges and directions across the corpus and build a dedicated search engine for this information.
arXiv Detail & Related papers (2021-08-31T11:08:20Z)
- Machine Learning Applications for Therapeutic Tasks with Genomics Data [49.98249191161107]
We review the literature on machine learning applications for genomics through the lens of therapeutic development.
We identify twenty-two machine learning in genomics applications across the entire therapeutics pipeline.
We pinpoint seven important challenges in this field with opportunities for expansion and impact.
arXiv Detail & Related papers (2021-05-03T21:20:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.