pyBibX -- A Python Library for Bibliometric and Scientometric Analysis
Powered with Artificial Intelligence Tools
- URL: http://arxiv.org/abs/2304.14516v1
- Date: Thu, 27 Apr 2023 20:06:07 GMT
- Title: pyBibX -- A Python Library for Bibliometric and Scientometric Analysis
Powered with Artificial Intelligence Tools
- Authors: Valdecy Pereira, Marcio Pereira Basilio, Carlos Henrique Tarjano
Santos
- Abstract summary: pyBibX is a python library devised to conduct comprehensive bibliometric and scientometric analyses on raw data files sourced from Scopus, Web of Science, and PubMed.
The library executes a comprehensive EDA, presenting outcomes via visually appealing graphical illustrations.
It incorporates AI capabilities, including Embedding, Topic Modeling, Text Summarization, and other general language processing tasks.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Bibliometric and Scientometric analyses offer invaluable perspectives on the
complex research terrain and collaborative dynamics spanning diverse academic
disciplines. This paper presents pyBibX, a python library devised to conduct
comprehensive bibliometric and scientometric analyses on raw data files sourced
from Scopus, Web of Science, and PubMed, seamlessly integrating state of the
art AI capabilities into its core functionality. The library executes a
comprehensive EDA, presenting outcomes via visually appealing graphical
illustrations. Network capabilities have been deftly integrated, encompassing
Citation, Collaboration, and Similarity Analysis. Furthermore, the library
incorporates AI capabilities, including Embedding vectors, Topic Modeling, Text
Summarization, and other general Natural Language Processing tasks, employing
models such as Sentence-BERT, BerTopic, BERT, chatGPT, and PEGASUS. As a
demonstration, we have analyzed 184 documents associated with multiple-criteria
decision analysis published between 1984 and 2023. The EDA emphasized a growing
fascination with decision-making and fuzzy logic methodologies. Next, Network
Analysis further accentuated the significance of central authors and
intra-continental collaboration, identifying Canada and China as crucial
collaboration hubs. Finally, AI Analysis distinguished two primary topics and
chatGPT preeminence in Text Summarization. It also proved to be an
indispensable instrument for interpreting results, as our library enables
researchers to pose inquiries to chatGPT regarding bibliometric outcomes. Even
so, data homogeneity remains a daunting challenge due to database
inconsistencies. PyBibX is the first application integrating cutting-edge AI
capabilities for analyzing scientific publications, enabling researchers to
examine and interpret these outcomes more effectively.
Related papers
- Data Analysis in the Era of Generative AI [56.44807642944589]
This paper explores the potential of AI-powered tools to reshape data analysis, focusing on design considerations and challenges.
We explore how the emergence of large language and multimodal models offers new opportunities to enhance various stages of data analysis workflow.
We then examine human-centered design principles that facilitate intuitive interactions, build user trust, and streamline the AI-assisted analysis workflow across multiple apps.
arXiv Detail & Related papers (2024-09-27T06:31:03Z) - Unleashing the Power of AI. A Systematic Review of Cutting-Edge Techniques in AI-Enhanced Scientometrics, Webometrics, and Bibliometrics [1.2374541748245838]
The study aims to analyze the synergy of Artificial Intelligence (AI) with scientometrics, webometrics, and bibliometrics.
Our aim is to explore the potential of AI in revolutionizing the methods used to measure and analyze scholarly communication.
arXiv Detail & Related papers (2024-02-22T15:10:02Z) - Extensible Multi-Granularity Fusion Network for Aspect-based Sentiment
Analysis [20.378588765134122]
Aspect-based Sentiment Analysis (ABSA) evaluates sentiment expressions within a text to comprehend sentiment information.
Recent research has examined the use of Graph Neural Networks (GNNs) on dependency and constituent trees for syntactic analysis.
This paper presents the Extensible Multi-Granularity Fusion (EMGF) network, which integrates information from dependency and constituent syntactic, attention semantic, and external knowledge graphs.
arXiv Detail & Related papers (2024-02-12T16:52:26Z) - Quantitative Analysis of AI-Generated Texts in Academic Research: A Study of AI Presence in Arxiv Submissions using AI Detection Tool [0.0]
This study will analyze a method that can see purposely manufactured content that academic organizations use to post on Arxiv.
The statistical analysis shows that Originality.ai is very accurate, with a rate of 98%.
arXiv Detail & Related papers (2024-02-09T17:20:48Z) - Document AI: A Comparative Study of Transformer-Based, Graph-Based
Models, and Convolutional Neural Networks For Document Layout Analysis [3.231170156689185]
Document AI aims to automatically analyze documents by leveraging natural language processing and computer vision techniques.
One of the major tasks of Document AI is document layout analysis, which structures document pages by interpreting the content and spatial relationships of layout, image, and text.
arXiv Detail & Related papers (2023-08-29T16:58:03Z) - PyRCA: A Library for Metric-based Root Cause Analysis [66.72542200701807]
PyRCA is an open-source machine learning library of Root Cause Analysis (RCA) for Artificial Intelligence for IT Operations (AIOps)
It provides a holistic framework to uncover the complicated metric causal dependencies and automatically locate root causes of incidents.
arXiv Detail & Related papers (2023-06-20T09:55:10Z) - GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training
Data Exploration [97.68234051078997]
We discuss how Pyserini can be integrated with the Hugging Face ecosystem of open-source AI libraries and artifacts.
We include a Jupyter Notebook-based walk through the core interoperability features, available on GitHub.
We present GAIA Search - a search engine built following previously laid out principles, giving access to four popular large-scale text collections.
arXiv Detail & Related papers (2023-06-02T12:09:59Z) - Investigating Fairness Disparities in Peer Review: A Language Model
Enhanced Approach [77.61131357420201]
We conduct a thorough and rigorous study on fairness disparities in peer review with the help of large language models (LMs)
We collect, assemble, and maintain a comprehensive relational database for the International Conference on Learning Representations (ICLR) conference from 2017 to date.
We postulate and study fairness disparities on multiple protective attributes of interest, including author gender, geography, author, and institutional prestige.
arXiv Detail & Related papers (2022-11-07T16:19:42Z) - DeepShovel: An Online Collaborative Platform for Data Extraction in
Geoscience Literature with AI Assistance [48.55345030503826]
Geoscientists need to read a huge amount of literature to locate, extract, and aggregate relevant results and data.
DeepShovel is a publicly-available AI-assisted data extraction system to support their needs.
A follow-up user evaluation with 14 researchers suggested DeepShovel improved users' efficiency of data extraction for building scientific databases.
arXiv Detail & Related papers (2022-02-21T12:18:08Z) - Knowledge Graph Augmented Network Towards Multiview Representation
Learning for Aspect-based Sentiment Analysis [96.53859361560505]
We propose a knowledge graph augmented network (KGAN) to incorporate external knowledge with explicitly syntactic and contextual information.
KGAN captures the sentiment feature representations from multiple perspectives, i.e., context-, syntax- and knowledge-based.
Experiments on three popular ABSA benchmarks demonstrate the effectiveness and robustness of our KGAN.
arXiv Detail & Related papers (2022-01-13T08:25:53Z) - MAIR: Framework for mining relationships between research articles,
strategies, and regulations in the field of explainable artificial
intelligence [2.280298858971133]
It is essential to understand the dynamics of the impact of regulation on research papers and AI-related policies.
This paper introduces a novel framework for joint analysis of AI-related policy documents and XAI research papers.
arXiv Detail & Related papers (2021-07-29T20:41:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.