Caveats for the use of Web of Science Core Collection in old literature
retrieval and historical bibliometric analysis
- URL: http://arxiv.org/abs/2107.11521v1
- Date: Sat, 24 Jul 2021 03:39:19 GMT
- Title: Caveats for the use of Web of Science Core Collection in old literature
retrieval and historical bibliometric analysis
- Authors: Weishu Liu
- Abstract summary: Data demonstrated in Fosso Wamba's study implied that the year 1991 seemed to be a "watershed" of AI research.
This research note tries to uncover the 1991 phenomenon from the perspective of database limitation.
- Score: 0.43784114427842946
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: By using publications from Web of Science Core Collection (WoSCC), Fosso
Wamba and his colleagues published an interesting and comprehensive paper in
Technological Forecasting and Social Change to explore the structure and
dynamics of artificial intelligence (AI) scholarship. Data demonstrated in
Fosso Wamba's study implied that the year 1991 seemed to be a "watershed" of AI
research. This research note tried to uncover the 1991 phenomenon from the
perspective of database limitation by probing the limitations of search in
abstract/author keywords/keywords plus fields of WoSCC empirically. The low
availability rates of abstract/author keywords/keywords plus information in
WoSCC found in this study can explain the "watershed" phenomenon of AI
scholarship in 1991 to a large extent. Some other caveats for the use of WoSCC
in old literature retrieval and historical bibliometric analysis were also
mentioned in the discussion section. This research note complements Fosso Wamba
and his colleagues' study and also helps avoid improper interpretation in the
use of WoSCC in old literature retrieval and historical bibliometric analysis.
Related papers
- NLP-Powered Repository and Search Engine for Academic Papers: A Case Study on Cyber Risk Literature with CyLit [9.621564860645513]
We propose a novel framework that leverages Natural Language Processing (NLP) techniques.
This framework automates the retrieval, summarization, and clustering of academic literature within a specific research domain.
We introduce CyLit, an NLP-powered repository specifically designed for the cyber risk literature.
arXiv Detail & Related papers (2024-09-10T05:41:40Z) - Curatr: A Platform for Semantic Analysis and Curation of Historical
Literary Texts [5.075506385456811]
This paper presents Curatr, an online platform for the exploration and curation of literature with machine learning-supported semantic search.
The platform combines neural word embeddings with expert domain knowledge to enable the generation of thematic lexicons.
arXiv Detail & Related papers (2023-06-13T15:15:31Z) - A Bibliographic View on Constrained Clustering [4.705291741591329]
This paper presents general trends of the constrained clustering area and its sub-topics.
We list available software and analyse the experimental sections of our reference collection.
Among the topics we reviewed, applications studies were most abundant recently.
arXiv Detail & Related papers (2022-09-22T16:11:47Z) - Artificial Intelligence in Concrete Materials: A Scientometric View [77.34726150561087]
This chapter aims to uncover the main research interests and knowledge structure of the existing literature on AI for concrete materials.
To begin with, a total of 389 journal articles published from 1990 to 2020 were retrieved from the Web of Science.
Scientometric tools such as keyword co-occurrence analysis and documentation co-citation analysis were adopted to quantify features and characteristics of the research field.
arXiv Detail & Related papers (2022-09-17T18:24:56Z) - Searching for chromate replacements using natural language processing
and machine learning algorithms [0.0]
This study demonstrates it is possible to extract knowledge from the automated interpretation of the scientific literature and achieve expert human level insights.
We have employed the Word2Vec model, previously explored by others, and the BERT model - applying them towards a unique challenge in materials engineering.
arXiv Detail & Related papers (2022-08-11T07:21:18Z) - Research Trends and Applications of Data Augmentation Algorithms [77.34726150561087]
We identify the main areas of application of data augmentation algorithms, the types of algorithms used, significant research trends, their progression over time and research gaps in data augmentation literature.
We expect readers to understand the potential of data augmentation, as well as identify future research directions and open questions within data augmentation research.
arXiv Detail & Related papers (2022-07-18T11:38:32Z) - Deep Graph Learning for Anomalous Citation Detection [55.81334139806342]
We propose a novel deep graph learning model, namely GLAD (Graph Learning for Anomaly Detection), to identify anomalies in citation networks.
Within the GLAD framework, we propose an algorithm called CPU (Citation PUrpose) to discover the purpose of citation based on citation texts.
arXiv Detail & Related papers (2022-02-23T09:05:28Z) - Enhancing Scientific Papers Summarization with Citation Graph [78.65955304229863]
We redefine the task of scientific papers summarization by utilizing their citation graph.
We construct a novel scientific papers summarization dataset Semantic Scholar Network (SSN) which contains 141K research papers in different domains.
Our model can achieve competitive performance when compared with the pretrained models.
arXiv Detail & Related papers (2021-04-07T11:13:35Z) - Semantic and Relational Spaces in Science of Science: Deep Learning
Models for Article Vectorisation [4.178929174617172]
We focus on document-level embeddings based on the semantic and relational aspects of articles, using Natural Language Processing (NLP) and Graph Neural Networks (GNNs)
Our results show that using NLP we can encode a semantic space of articles, while with GNN we are able to build a relational space where the social practices of a research community are also encoded.
arXiv Detail & Related papers (2020-11-05T14:57:41Z) - A New Neural Search and Insights Platform for Navigating and Organizing
AI Research [56.65232007953311]
We introduce a new platform, AI Research Navigator, that combines classical keyword search with neural retrieval to discover and organize relevant literature.
We give an overview of the overall architecture of the system and of the components for document analysis, question answering, search, analytics, expert search, and recommendations.
arXiv Detail & Related papers (2020-10-30T19:12:25Z) - A Survey on Text Classification: From Shallow to Deep Learning [83.47804123133719]
The last decade has seen a surge of research in this area due to the unprecedented success of deep learning.
This paper fills the gap by reviewing the state-of-the-art approaches from 1961 to 2021.
We create a taxonomy for text classification according to the text involved and the models used for feature extraction and classification.
arXiv Detail & Related papers (2020-08-02T00:09:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.