A Tale of Two Communities: Exploring Academic References on Stack Overflow
- URL: http://arxiv.org/abs/2403.09856v2
- Date: Thu, 28 Mar 2024 17:19:48 GMT
- Title: A Tale of Two Communities: Exploring Academic References on Stack Overflow
- Authors: Run Huang, Souti Chattopadhyay,
- Abstract summary: We find that Stack Overflow communities with different domains of interest engage with academic literature at varying frequencies and speeds.
The contradicting patterns suggest that some disciplines may have diverged in their interests and development trajectories from the corresponding practitioner community.
- Score: 1.2914230269240388
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Stack Overflow is widely recognized by software practitioners as the go-to resource for addressing technical issues and sharing practical solutions. While not typically seen as a scholarly forum, users on Stack Overflow commonly refer to academic sources in their discussions. Yet, little is known about these referenced academic works and how they intersect the needs and interests of the Stack Overflow community. To bridge this gap, we conducted an exploratory large-scale study on the landscape of academic references in Stack Overflow. Our findings reveal that Stack Overflow communities with different domains of interest engage with academic literature at varying frequencies and speeds. The contradicting patterns suggest that some disciplines may have diverged in their interests and development trajectories from the corresponding practitioner community. Finally, we discuss the potential of Stack Overflow in gauging the real-world relevance of academic research.
Related papers
- A Systematic Mapping Study of Crowd Knowledge Enhanced Software Engineering Research Using Stack Overflow [0.8621608193534838]
30% of all software professionals visit the most popular Q&A site StackOverflow (SO) every day.
To find out the trend, implication, impact, and future research potential utilizing SO data, a systematic mapping study needs to be conducted.
We collected 384 SO-based research articles and categorized them into 10 facets (i.e., themes)
We found that SO contributes to 85% of SE research compared with popular Q&A sites such as Quora, and Reddit.
arXiv Detail & Related papers (2024-08-15T03:40:44Z) - ResearchBot: Bridging the Gap between Academic Research and Practical Programming Communities [1.7825757481227438]
This project introduces ResearchBot, a tool designed to bridge the academia-industry gap.
ResearchBot employs a modular approach, encompassing understanding questions, curating queries to obtain relevant papers and summarizing paper content.
The core objective of ResearchBot is to democratize access to academic knowledge for industry professionals.
arXiv Detail & Related papers (2024-07-02T20:17:13Z) - Software Engineers' Questions and Answers on Stack Exchange [0.0]
We analyze the questions and answers on the Software Engineering Stack Exchange site that encompasses a broader set of areas.
We found that the asked questions are most frequently related to database systems, quality assurance, and agile software development.
The most attractive topics were career and teamwork problems, and the least attractive ones were network programming and software modeling.
arXiv Detail & Related papers (2023-06-20T13:39:49Z) - Covidia: COVID-19 Interdisciplinary Academic Knowledge Graph [99.28342534985146]
Existing literature and knowledge platforms on COVID-19 only focus on collecting papers on biology and medicine.
We propose Covidia, COVID-19 interdisciplinary academic knowledge graph to bridge the gap between knowledge of COVID-19 on different domains.
arXiv Detail & Related papers (2023-04-14T16:45:38Z) - Entity Set Co-Expansion in StackOverflow [49.64523055423687]
Given a few seed entities of a certain type, entity set expansion aims to discover an extensive set of entities that share the same type as the seeds.
We study the entity set co-expansion task in StackOverflow, which extracts Library, OS, Application, and Language entities from StackOverflow question-answer threads.
During the co-expansion process, we use PLMs to derive embeddings of candidate entities for calculating similarities between entities.
arXiv Detail & Related papers (2022-12-05T13:50:35Z) - Tag-Aware Document Representation for Research Paper Recommendation [68.8204255655161]
We propose a hybrid approach that leverages deep semantic representation of research papers based on social tags assigned by users.
The proposed model is effective in recommending research papers even when the rating data is very sparse.
arXiv Detail & Related papers (2022-09-08T09:13:07Z) - Dive into Deep Learning [119.30375933463156]
The book is drafted in Jupyter notebooks, seamlessly integrating exposition figures, math, and interactive examples with self-contained code.
Our goal is to offer a resource that could (i) be freely available for everyone; (ii) offer sufficient technical depth to provide a starting point on the path to becoming an applied machine learning scientist; (iii) include runnable code, showing readers how to solve problems in practice; (iv) allow for rapid updates, both by us and also by the community at large.
arXiv Detail & Related papers (2021-06-21T18:19:46Z) - The Influence of Domain-Based Preprocessing on Subject-Specific
Clustering [55.41644538483948]
The sudden change of moving the majority of teaching online at Universities has caused an increased amount of workload for academics.
One way to deal with this problem is to cluster these questions depending on their topic.
In this paper, we explore the realms of tagging data sets, focusing on identifying code excerpts and providing empirical results.
arXiv Detail & Related papers (2020-11-16T17:47:19Z) - From Asking to Answering: Getting More Involved on Stack Overflow [3.1160359353275346]
This paper investigates issues on Stack Overflow, a popular question and answer community for computer programming.
We document evidence of a "leaky pipeline", specifically that there are many active users on the platform who never post an answer.
We find a user's individual features, such as their tenure, gender, and geographic location, have a significant relationship with their likelihood to post answers.
arXiv Detail & Related papers (2020-10-08T14:41:22Z) - StackOverflow vs Kaggle: A Study of Developer Discussions About Data
Science [0.0]
This paper conducts experiments to study the characteristics of 197836 posts from StackOverflow and Kaggle.
The main findings include that-related topics were most prevalent in StackOverflow.
Across both communities, DS discussion is increasing at a dramatic rate.
ensemble algorithms are the most mentioned ML/DL algorithms in Kaggle but are rarely discussed on StackOverflow.
arXiv Detail & Related papers (2020-06-06T06:51:11Z) - A Study of Knowledge Sharing related to Covid-19 Pandemic in Stack
Overflow [69.5231754305538]
Study of 464 Stack Overflow questions posted mainly in February and March 2020 and leveraging the power of text mining.
Findings reveal that indeed this global crisis sparked off an intense and increasing activity in Stack Overflow with most post topics reflecting a strong interest on the analysis of Covid-19 data.
arXiv Detail & Related papers (2020-04-18T08:19:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.