Developing an NLP-based Recommender System for the Ethical, Legal, and
Social Implications of Synthetic Biology
- URL: http://arxiv.org/abs/2207.06360v1
- Date: Sun, 10 Jul 2022 20:18:53 GMT
- Title: Developing an NLP-based Recommender System for the Ethical, Legal, and
Social Implications of Synthetic Biology
- Authors: Damien Dablain, Lilian Huang and Brandon Sepulvado
- Abstract summary: Synthetic biology involves the engineering and re-design of organisms for purposes such as food security, health, and environmental protection.
It poses numerous ethical, legal, and social implications (ELSI) for researchers and policy makers.
Various efforts have sought to embed social scientists and ethicists on synthetic biology projects.
This text proposes a different approach, asking is it possible to develop a well-performing recommender model based upon natural language processing (NLP) to connect synthetic biologists with information on the ELSI of their specific research?
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Synthetic biology is an emerging field that involves the engineering and
re-design of organisms for purposes such as food security, health, and
environmental protection. As such, it poses numerous ethical, legal, and social
implications (ELSI) for researchers and policy makers. Various efforts to
ensure socially responsible synthetic biology are underway. Policy making is
one regulatory avenue, and other initiatives have sought to embed social
scientists and ethicists on synthetic biology projects. However, given the
nascency of synthetic biology, the number of heterogeneous domains it spans,
and the open nature of many ethical questions, it has proven challenging to
establish widespread concrete policies, and including social scientists and
ethicists on synthetic biology teams has met with mixed success.
This text proposes a different approach, asking instead is it possible to
develop a well-performing recommender model based upon natural language
processing (NLP) to connect synthetic biologists with information on the ELSI
of their specific research? This recommender was developed as part of a larger
project building a Synthetic Biology Knowledge System (SBKS) to accelerate
discovery and exploration of the synthetic biology design space. Our approach
aims to distill for synthetic biologists relevant ethical and social scientific
information and embed it into synthetic biology research workflows.
Related papers
- Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation [58.064940977804596]
A plethora of new AI models and tools has been proposed, promising to empower researchers and academics worldwide to conduct their research more effectively and efficiently.
Ethical concerns regarding shortcomings of these tools and potential for misuse take a particularly prominent place in our discussion.
arXiv Detail & Related papers (2025-02-07T18:26:45Z) - Mining for Species, Locations, Habitats, and Ecosystems from Scientific Papers in Invasion Biology: A Large-Scale Exploratory Study with Large Language Models [6.364723262453785]
This paper harnesses the capabilities of large language models (LLMs) to mine key ecological entities from invasion biology literature.
Specifically, we focus on extracting species names, their locations, associated habitats, and ecosystems, information that is critical for understanding species spread.
This study lays the groundwork for more advanced, automated knowledge extraction tools that can aid researchers and practitioners in understanding and managing biological invasions.
arXiv Detail & Related papers (2025-01-30T11:55:44Z) - Applications and Challenges of AI and Microscopy in Life Science Research: A Review [7.771558261139913]
This paper explores the intersection of AI and microscopy in life sciences, emphasizing their potential applications and associated challenges.
We provide a detailed review of how various biological systems can benefit from AI, highlighting the types of data and labeling requirements unique to this domain.
Specifically attention is given to microscopy data, exploring the specific AI techniques required to process and interpret this information.
arXiv Detail & Related papers (2025-01-22T08:32:36Z) - Toward AI-Driven Digital Organism: Multiscale Foundation Models for Predicting, Simulating and Programming Biology at All Levels [37.64318681235737]
We present an approach of using AI to model and simulate biology and life.
An AI-Driven Digital Organism (AIDO) is a system of integrated multiscale foundation models.
We envision that an AIDO is poised to trigger a new wave of better-guided wet-lab experimentation.
arXiv Detail & Related papers (2024-12-09T20:59:59Z) - Automating Exploratory Proteomics Research via Language Models [22.302672656499315]
PROTEUS is a fully automated system for scientific discovery from raw data.
It produces a comprehensive set of research objectives, analysis results and novel biological hypotheses without human intervention.
arXiv Detail & Related papers (2024-11-06T08:16:56Z) - Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System [62.832818186789545]
Virtual Scientists (VirSci) is a multi-agent system designed to mimic the teamwork inherent in scientific research.
VirSci organizes a team of agents to collaboratively generate, evaluate, and refine research ideas.
We show that this multi-agent approach outperforms the state-of-the-art method in producing novel scientific ideas.
arXiv Detail & Related papers (2024-10-12T07:16:22Z) - BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task Tuning [77.90250740041411]
This paper introduces BioT5+, an extension of the BioT5 framework, tailored to enhance biological research and drug discovery.
BioT5+ incorporates several novel features: integration of IUPAC names for molecular understanding, inclusion of extensive bio-text and molecule data from sources like bioRxiv and PubChem, the multi-task instruction tuning for generality across tasks, and a numerical tokenization technique for improved processing of numerical data.
arXiv Detail & Related papers (2024-02-27T12:43:09Z) - An Evaluation of Large Language Models in Bioinformatics Research [52.100233156012756]
We study the performance of large language models (LLMs) on a wide spectrum of crucial bioinformatics tasks.
These tasks include the identification of potential coding regions, extraction of named entities for genes and proteins, detection of antimicrobial and anti-cancer peptides, molecular optimization, and resolution of educational bioinformatics problems.
Our findings indicate that, given appropriate prompts, LLMs like GPT variants can successfully handle most of these tasks.
arXiv Detail & Related papers (2024-02-21T11:27:31Z) - The whack-a-mole governance challenge for AI-enabled synthetic biology:
literature review and emerging frameworks [0.0]
AI-enabled synthetic biology has tremendous potential but also significantly increases biorisks.
How to achieve early warning systems that enable prevention and mitigation of future AI-enabled biohazards from the lab will constantly need to evolve.
Recent advances in chatbots enabled by generative AI have revived fears that advanced biological insight can more easily get into the hands of malignant individuals or organizations.
arXiv Detail & Related papers (2024-02-01T03:53:13Z) - Scientific Large Language Models: A Survey on Biological & Chemical Domains [47.97810890521825]
Large Language Models (LLMs) have emerged as a transformative power in enhancing natural language comprehension.
The application of LLMs extends beyond conventional linguistic boundaries, encompassing specialized linguistic systems developed within various scientific disciplines.
As a burgeoning area in the community of AI for Science, scientific LLMs warrant comprehensive exploration.
arXiv Detail & Related papers (2024-01-26T05:33:34Z) - BioT5: Enriching Cross-modal Integration in Biology with Chemical
Knowledge and Natural Language Associations [54.97423244799579]
$mathbfBioT5$ is a pre-training framework that enriches cross-modal integration in biology with chemical knowledge and natural language associations.
$mathbfBioT5$ distinguishes between structured and unstructured knowledge, leading to more effective utilization of information.
arXiv Detail & Related papers (2023-10-11T07:57:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.