ATEM: A Topic Evolution Model for the Detection of Emerging Topics in
Scientific Archives
- URL: http://arxiv.org/abs/2306.02221v1
- Date: Sun, 4 Jun 2023 00:32:45 GMT
- Title: ATEM: A Topic Evolution Model for the Detection of Emerging Topics in
Scientific Archives
- Authors: Hamed Rahimi, Hubert Naacke, Camelia Constantin, Bernd Amann
- Abstract summary: ATEM is based on dynamic topic modeling and dynamic graph embedding techniques.
ATEM can efficiently detect emerging cross-disciplinary topics within the DBLP archive of over five million computer science articles.
- Score: 1.854328133293073
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper presents ATEM, a novel framework for studying topic evolution in
scientific archives. ATEM is based on dynamic topic modeling and dynamic graph
embedding techniques that explore the dynamics of content and citations of
documents within a scientific corpus. ATEM explores a new notion of contextual
emergence for the discovery of emerging interdisciplinary research topics based
on the dynamics of citation links in topic clusters. Our experiments show that
ATEM can efficiently detect emerging cross-disciplinary topics within the DBLP
archive of over five million computer science articles.
Related papers
- A Survey of Decomposition-Based Evolutionary Multi-Objective Optimization: Part II -- A Data Science Perspective [4.322038460697958]
We build a knowledge graph that encapsulates more than 5,400 papers, 10,000 authors, 400 venues, and 1,600 institutions for MOEA/D research.
We also explore the collaboration and citation networks of MOEA/D, uncovering hidden patterns in the growth of literature.
arXiv Detail & Related papers (2024-04-22T14:38:58Z) - MIReAD: Simple Method for Learning High-quality Representations from
Scientific Documents [77.34726150561087]
We propose MIReAD, a simple method that learns high-quality representations of scientific papers.
We train MIReAD on more than 500,000 PubMed and arXiv abstracts across over 2,000 journal classes.
arXiv Detail & Related papers (2023-05-07T03:29:55Z) - Topic Taxonomy Expansion via Hierarchy-Aware Topic Phrase Generation [58.3921103230647]
We propose a novel framework for topic taxonomy expansion, named TopicExpan.
TopicExpan directly generates topic-related terms belonging to new topics.
Experimental results on two real-world text corpora show that TopicExpan significantly outperforms other baseline methods in terms of the quality of output.
arXiv Detail & Related papers (2022-10-18T22:38:49Z) - Knowledge-Aware Bayesian Deep Topic Model [50.58975785318575]
We propose a Bayesian generative model for incorporating prior domain knowledge into hierarchical topic modeling.
Our proposed model efficiently integrates the prior knowledge and improves both hierarchical topic discovery and document representation.
arXiv Detail & Related papers (2022-09-20T09:16:05Z) - Overview of STEM Science as Process, Method, Material, and Data Named
Entities [0.0]
We develop and analyze a large-scale structured dataset of STEM articles across 10 different disciplines.
Our analysis is defined over a large-scale corpus comprising 60K abstracts structured as four scientific entities process, method, material, and data.
The STEM-NER-60k corpus, created in this work, comprises over 1M extracted entities from 60k STEM articles obtained from a major publishing platform.
arXiv Detail & Related papers (2022-05-24T07:35:24Z) - Constructing Neural Network-Based Models for Simulating Dynamical
Systems [59.0861954179401]
Data-driven modeling is an alternative paradigm that seeks to learn an approximation of the dynamics of a system using observations of the true system.
This paper provides a survey of the different ways to construct models of dynamical systems using neural networks.
In addition to the basic overview, we review the related literature and outline the most significant challenges from numerical simulations that this modeling paradigm must overcome.
arXiv Detail & Related papers (2021-11-02T10:51:42Z) - The CSO Classifier: Ontology-Driven Detection of Research Topics in
Scholarly Articles [0.0]
We present a new unsupervised approach for automatically classifying research papers according to the Computer Science Ontology (CSO)
The CSO takes as input the metadata associated with a research paper (title, abstract, keywords) and returns a selection of research concepts drawn from the ontology.
The approach was evaluated on a gold standard of manually annotated articles yielding a significant improvement over alternative methods.
arXiv Detail & Related papers (2021-04-02T09:02:32Z) - A New Neural Search and Insights Platform for Navigating and Organizing
AI Research [56.65232007953311]
We introduce a new platform, AI Research Navigator, that combines classical keyword search with neural retrieval to discover and organize relevant literature.
We give an overview of the overall architecture of the system and of the components for document analysis, question answering, search, analytics, expert search, and recommendations.
arXiv Detail & Related papers (2020-10-30T19:12:25Z) - Extracting a Knowledge Base of Mechanisms from COVID-19 Papers [50.17242035034729]
We pursue the construction of a knowledge base (KB) of mechanisms.
We develop a broad, unified schema that strikes a balance between relevance and breadth.
Experiments demonstrate the utility of our KB in supporting interdisciplinary scientific search over COVID-19 literature.
arXiv Detail & Related papers (2020-10-08T07:54:14Z) - The STEM-ECR Dataset: Grounding Scientific Entity References in STEM
Scholarly Content to Authoritative Encyclopedic and Lexicographic Sources [8.54082916181163]
The STEM-ECR v1.0 dataset has been developed to provide a benchmark for the evaluation of scientific entity extraction, classification, and resolution tasks.
It comprises abstracts in 10 STEM disciplines that were found to be the most prolific ones on a major publishing platform.
arXiv Detail & Related papers (2020-03-02T16:35:17Z) - GASP! Generating Abstracts of Scientific Papers from Abstracts of Cited
Papers [9.472227971923672]
This paper introduces the novel, scientifically and philosophically challenging task of Generating Abstracts of Scientific Papers from abstracts of cited papers (GASP) as a text-to-text task.
arXiv Detail & Related papers (2020-02-28T14:58:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.