Science and Technology Ontology: A Taxonomy of Emerging Topics
- URL: http://arxiv.org/abs/2305.04055v1
- Date: Sat, 6 May 2023 14:04:24 GMT
- Title: Science and Technology Ontology: A Taxonomy of Emerging Topics
- Authors: Mahender Kumar, Ruby Rani, Mirko Botarelli, Gregory Epiophaniou, and
Carsten Maple
- Abstract summary: We present an automatic Science and Technology Ontology (S&TO) that covers unconventional topics in different science and technology.
The proposed S&TO can promote the discovery of new research areas and collaborations across disciplines.
- Score: 6.4429045013848185
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Ontologies play a critical role in Semantic Web technologies by providing a
structured and standardized way to represent knowledge and enabling machines to
understand the meaning of data. Several taxonomies and ontologies have been
generated, but individuals target one domain, and only some of those have been
found expensive in time and manual effort. Also, they need more coverage of
unconventional topics representing a more holistic and comprehensive view of
the knowledge landscape and interdisciplinary collaborations. Thus, there needs
to be an ontology covering Science and Technology and facilitate
multidisciplinary research by connecting topics from different fields and
domains that may be related or have commonalities. To address these issues, we
present an automatic Science and Technology Ontology (S&TO) that covers
unconventional topics in different science and technology domains. The proposed
S&TO can promote the discovery of new research areas and collaborations across
disciplines. The ontology is constructed by applying BERTopic to a dataset of
393,991 scientific articles collected from Semantic Scholar from October 2021
to August 2022, covering four fields of science. Currently, S&TO includes 5,153
topics and 13,155 semantic relations. S&TO model can be updated by running
BERTopic on more recent datasets
Related papers
- Ontology Embedding: A Survey of Methods, Applications and Resources [54.3453925775069]
Ontologies are widely used for representing domain knowledge and meta data.
One straightforward solution is to integrate statistical analysis and machine learning.
Numerous papers have been published on embedding, but a lack of systematic reviews hinders researchers from gaining a comprehensive understanding of this field.
arXiv Detail & Related papers (2024-06-16T14:49:19Z) - DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery
through Sophisticated AI System Technologies [116.09762105379241]
DeepSpeed4Science aims to build unique capabilities through AI system technology innovations.
We showcase the early progress we made with DeepSpeed4Science in addressing two of the critical system challenges in structural biology research.
arXiv Detail & Related papers (2023-10-06T22:05:15Z) - Knowledge Graphs for the Life Sciences: Recent Developments, Challenges
and Opportunities [11.35513523308132]
We discuss developments and advances in the use of graph-based technologies in life sciences.
We focus on three broad topics: the construction and management of Knowledge Graphs (KGs), the use of KGs and associated technologies in the discovery of new knowledge, and the use of KGs in artificial intelligence applications to support explanations.
arXiv Detail & Related papers (2023-09-29T14:03:34Z) - Artificial Intelligence for Science in Quantum, Atomistic, and Continuum Systems [268.585904751315]
New area of research known as AI for science (AI4Science)
Areas aim at understanding the physical world from subatomic (wavefunctions and electron density), atomic (molecules, proteins, materials, and interactions), to macro (fluids, climate, and subsurface) scales.
Key common challenge is how to capture physics first principles, especially symmetries, in natural systems by deep learning methods.
arXiv Detail & Related papers (2023-07-17T12:14:14Z) - AI for Science: An Emerging Agenda [30.260160661295682]
This report documents the programme and the outcomes of Dagstuhl Seminar 22382 "Machine Learning for Science: Bridging Data-Driven and Mechanistic Modelling"
The transformative potential of AI stems from its widespread applicability across disciplines, and will only be achieved through integration across research domains.
Alongside technical advances, the next wave of progress in the field will come from building a community of machine learning researchers, domain experts, citizen scientists, and engineers.
arXiv Detail & Related papers (2023-03-07T20:21:43Z) - The Semantic Scholar Open Data Platform [79.4493235243312]
Semantic Scholar (S2) is an open data platform and website aimed at accelerating science by helping scholars discover and understand scientific literature.
We combine public and proprietary data sources using state-of-the-art techniques for scholarly PDF content extraction and automatic knowledge graph construction.
The graph includes advanced semantic features such as structurally parsed text, natural language summaries, and vector embeddings.
arXiv Detail & Related papers (2023-01-24T17:13:08Z) - Modeling Information Change in Science Communication with Semantically
Matched Paraphrases [50.67030449927206]
SPICED is the first paraphrase dataset of scientific findings annotated for degree of information change.
SPICED contains 6,000 scientific finding pairs extracted from news stories, social media discussions, and full texts of original papers.
Models trained on SPICED improve downstream performance on evidence retrieval for fact checking of real-world scientific claims.
arXiv Detail & Related papers (2022-10-24T07:44:38Z) - Foundations and Recent Trends in Multimodal Machine Learning:
Principles, Challenges, and Open Questions [68.6358773622615]
This paper provides an overview of the computational and theoretical foundations of multimodal machine learning.
We propose a taxonomy of 6 core technical challenges: representation, alignment, reasoning, generation, transference, and quantification.
Recent technical achievements will be presented through the lens of this taxonomy, allowing researchers to understand the similarities and differences across new approaches.
arXiv Detail & Related papers (2022-09-07T19:21:19Z) - Biological Robots: Perspectives on an Emerging Interdisciplinary Field [0.0]
We discuss issues at the intersection of developmental biology, computer science, and robotics.
In the context of biological robots, we explore changes across concepts and previously distinct fields.
We hope new fields can emerge as boundaries arising from technological limitations are overcome.
arXiv Detail & Related papers (2022-07-02T17:06:43Z) - A Search Engine for Scientific Publications: a Cybersecurity Case Study [0.7734726150561086]
This work proposes a new search engine for scientific publications which combines both information retrieval and reading comprehension algorithms.
The proposed solution although being applied to the context of cybersecurity exhibited great generalization capabilities and can be easily adapted to perform under other distinct knowledge domains.
arXiv Detail & Related papers (2021-06-30T20:10:04Z) - Semantic and Relational Spaces in Science of Science: Deep Learning
Models for Article Vectorisation [4.178929174617172]
We focus on document-level embeddings based on the semantic and relational aspects of articles, using Natural Language Processing (NLP) and Graph Neural Networks (GNNs)
Our results show that using NLP we can encode a semantic space of articles, while with GNN we are able to build a relational space where the social practices of a research community are also encoded.
arXiv Detail & Related papers (2020-11-05T14:57:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.