Improving Community Detection in Academic Networks by Handling Publication Bias
- URL: http://arxiv.org/abs/2507.20449v1
- Date: Mon, 28 Jul 2025 00:48:33 GMT
- Title: Improving Community Detection in Academic Networks by Handling Publication Bias
- Authors: Md Asaduzzaman Noor, John Sheppard, Jason Clark,
- Abstract summary: We build a topic-based research network using BERTopic with a fine-tuned SciBERT model.<n>A major challenge we address is publication imbalance, where some researchers publish much more than others.<n>We introduce a cloning strategy that clusters a researcher's publications and treats each cluster as a separate node.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Finding potential research collaborators is a challenging task, especially in today's fast-growing and interdisciplinary research landscape. While traditional methods often rely on observable relationships such as co-authorships and citations to construct the research network, in this work, we focus solely on publication content to build a topic-based research network using BERTopic with a fine-tuned SciBERT model that connects and recommends researchers across disciplines based on shared topical interests. A major challenge we address is publication imbalance, where some researchers publish much more than others, often across several topics. Without careful handling, their less frequent interests are hidden under dominant topics, limiting the network's ability to detect their full research scope. To tackle this, we introduce a cloning strategy that clusters a researcher's publications and treats each cluster as a separate node. This allows researchers to be part of multiple communities, improving the detection of interdisciplinary links. Evaluation on the proposed method shows that the cloned network structure leads to more meaningful communities and uncovers a broader set of collaboration opportunities.
Related papers
- Modular versus Hierarchical: A Structural Signature of Topic Popularity in Mathematical Research [0.0]
We study how the popularity of a research topic is associated with the structure of that topic's collaboration network.<n>Our findings suggest that topic selection is an implicit choice between two fundamentally different collaborative environments.
arXiv Detail & Related papers (2025-06-28T16:39:57Z) - Not real or too soft? On the challenges of publishing interdisciplinary software engineering research [4.597329752530121]
Discipline of software engineering combines social and technological dimensions.<n>Interdisciplinary research submitted to software engineering venues may not receive the same level of recognition as more traditional or technical topics.
arXiv Detail & Related papers (2025-01-11T12:18:46Z) - DiscipLink: Unfolding Interdisciplinary Information Seeking Process via Human-AI Co-Exploration [34.23942131024738]
In this paper, we introduce DiscipLink, a novel interactive system that facilitates collaboration between researchers and large language models (LLMs)
Based on users' topics of interest, DiscipLink initiates exploratory questions from the perspectives of possible relevant fields of study.
Our evaluation, comprising a within-subject comparative experiment and an open-ended exploratory study, reveals that DiscipLink can effectively support researchers in breaking down disciplinary boundaries.
arXiv Detail & Related papers (2024-08-01T10:36:00Z) - ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models [56.08917291606421]
ResearchAgent is an AI-based system for ideation and operationalization of novel work.<n>ResearchAgent automatically defines novel problems, proposes methods and designs experiments, while iteratively refining them.<n>We experimentally validate our ResearchAgent on scientific publications across multiple disciplines.
arXiv Detail & Related papers (2024-04-11T13:36:29Z) - SurveyAgent: A Conversational System for Personalized and Efficient Research Survey [50.04283471107001]
This paper introduces SurveyAgent, a novel conversational system designed to provide personalized and efficient research survey assistance to researchers.
SurveyAgent integrates three key modules: Knowledge Management for organizing papers, Recommendation for discovering relevant literature, and Query Answering for engaging with content on a deeper level.
Our evaluation demonstrates SurveyAgent's effectiveness in streamlining research activities, showcasing its capability to facilitate how researchers interact with scientific literature.
arXiv Detail & Related papers (2024-04-09T15:01:51Z) - A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual Learning [58.107474025048866]
Forgetting refers to the loss or deterioration of previously acquired knowledge.
Forgetting is a prevalent phenomenon observed in various other research domains within deep learning.
arXiv Detail & Related papers (2023-07-16T16:27:58Z) - Parsing Objects at a Finer Granularity: A Survey [54.72819146263311]
Fine-grained visual parsing is important in many real-world applications, e.g., agriculture, remote sensing, and space technologies.
Predominant research efforts tackle these fine-grained sub-tasks following different paradigms.
We conduct an in-depth study of the advanced work from a new perspective of learning the part relationship.
arXiv Detail & Related papers (2022-12-28T04:20:10Z) - Frequent Itemset-driven Search for Finding Minimum Node Separators in
Complex Networks [61.2383572324176]
We propose a frequent itemset-driven search approach, which integrates the concept of frequent itemset mining in data mining into the well-known memetic search framework.
It iteratively employs the frequent itemset recombination operator to generate promising offspring solution based on itemsets that frequently occur in high-quality solutions.
In particular, it discovers 29 new upper bounds and matches 18 previous best-known bounds.
arXiv Detail & Related papers (2022-01-18T11:16:40Z) - Bridger: Toward Bursting Scientific Filter Bubbles and Boosting
Innovation via Novel Author Discovery [22.839876884227536]
Bridger is a system for facilitating discovery of scholars and their work.
We construct a faceted representation of authors using information extracted from their papers and inferred personas.
We develop an approach that locates commonalities and contrasts between scientists.
arXiv Detail & Related papers (2021-08-12T11:24:23Z) - A Search Engine for Scientific Publications: a Cybersecurity Case Study [0.7734726150561086]
This work proposes a new search engine for scientific publications which combines both information retrieval and reading comprehension algorithms.
The proposed solution although being applied to the context of cybersecurity exhibited great generalization capabilities and can be easily adapted to perform under other distinct knowledge domains.
arXiv Detail & Related papers (2021-06-30T20:10:04Z) - Weight-Sharing Neural Architecture Search: A Battle to Shrink the
Optimization Gap [90.93522795555724]
Neural architecture search (NAS) has attracted increasing attentions in both academia and industry.
Weight-sharing methods were proposed in which exponentially many architectures share weights in the same super-network.
This paper provides a literature review on NAS, in particular the weight-sharing methods.
arXiv Detail & Related papers (2020-08-04T11:57:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.