Who should I Collaborate with? A Comparative Study of Academia and
Industry Research Collaboration in NLP
- URL: http://arxiv.org/abs/2308.04524v1
- Date: Fri, 21 Jul 2023 01:26:29 GMT
- Title: Who should I Collaborate with? A Comparative Study of Academia and
Industry Research Collaboration in NLP
- Authors: Hussain Sadiq Abuwala, Bohan Zhang, Mushi Wang
- Abstract summary: The goal of our research was to investigate the effects of collaboration between academia and industry on Natural Language Processing (NLP)
We created a pipeline to extract affiliations and citations from NLP papers and divided them into three categories: academia, industry, and hybrid (collaborations between academia and industry)
Our empirical analysis found that there is a trend towards an increase in industry and academia-industry collaboration publications and that these types of publications tend to have a higher impact compared to those produced solely within academia.
- Score: 5.929956715430167
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The goal of our research was to investigate the effects of collaboration
between academia and industry on Natural Language Processing (NLP). To do this,
we created a pipeline to extract affiliations and citations from NLP papers and
divided them into three categories: academia, industry, and hybrid
(collaborations between academia and industry). Our empirical analysis found
that there is a trend towards an increase in industry and academia-industry
collaboration publications and that these types of publications tend to have a
higher impact compared to those produced solely within academia.
Related papers
- The Nature of NLP: Analyzing Contributions in NLP Papers [77.31665252336157]
We quantitatively investigate what constitutes NLP research by examining research papers.
Our findings reveal a rising involvement of machine learning in NLP since the early nineties.
In post-2020, there has been a resurgence of focus on language and people.
arXiv Detail & Related papers (2024-09-29T01:29:28Z) - MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows [58.56005277371235]
We introduce MASSW, a comprehensive text dataset on Multi-Aspect Summarization of ScientificAspects.
MASSW includes more than 152,000 peer-reviewed publications from 17 leading computer science conferences spanning the past 50 years.
We demonstrate the utility of MASSW through multiple novel machine-learning tasks that can be benchmarked using this new dataset.
arXiv Detail & Related papers (2024-06-10T15:19:09Z) - The complementary contributions of academia and industry to AI research [0.0]
We characterize the impact and type of AI produced by industry and academia over the last 25 years.
We find that articles published by industry teams tend to get greater attention, with a higher chance of being highly cited and citation-disruptive.
We find that academic-industry collaborations struggle to replicate the novelty of academic teams and tend to look similar to industry teams.
arXiv Detail & Related papers (2024-01-04T03:08:13Z) - Collaboration or Corporate Capture? Quantifying NLP's Reliance on Industry Artifacts and Contributions [2.6746207141044582]
We surveyed 100 papers published at EMNLP 2022 to determine the degree to which researchers rely on industry models.
Our work serves as a scaffold to enable future researchers to more accurately address whether collaboration with industry is still collaboration in the absence of an alternative.
arXiv Detail & Related papers (2023-12-06T21:12:22Z) - Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View [60.80731090755224]
This paper probes the collaboration mechanisms among contemporary NLP systems by practical experiments with theoretical insights.
We fabricate four unique societies' comprised of LLM agents, where each agent is characterized by a specific trait' (easy-going or overconfident) and engages in collaboration with a distinct thinking pattern' (debate or reflection)
Our results further illustrate that LLM agents manifest human-like social behaviors, such as conformity and consensus reaching, mirroring social psychology theories.
arXiv Detail & Related papers (2023-10-03T15:05:52Z) - The Elephant in the Room: Analyzing the Presence of Big Tech in Natural Language Processing Research [28.382353702576314]
We use a corpus with comprehensive metadata of 78,187 NLP publications and 701 resumes of NLP publication authors.
We find that industry presence among NLP authors has been steady before a steep increase over the past five years.
A few companies account for most of the publications and provide funding to academic researchers through grants and internships.
arXiv Detail & Related papers (2023-05-04T12:57:18Z) - Covidia: COVID-19 Interdisciplinary Academic Knowledge Graph [99.28342534985146]
Existing literature and knowledge platforms on COVID-19 only focus on collecting papers on biology and medicine.
We propose Covidia, COVID-19 interdisciplinary academic knowledge graph to bridge the gap between knowledge of COVID-19 on different domains.
arXiv Detail & Related papers (2023-04-14T16:45:38Z) - A Decade of Knowledge Graphs in Natural Language Processing: A Survey [3.3358633215849927]
Knowledge graphs (KGs) have attracted a surge of interest from both academia and industry.
As a representation of semantic relations between entities, KGs have proven to be particularly relevant for natural language processing.
arXiv Detail & Related papers (2022-09-30T21:53:57Z) - Revise and Resubmit: An Intertextual Model of Text-based Collaboration
in Peer Review [52.359007622096684]
Peer review is a key component of the publishing process in most fields of science.
Existing NLP studies focus on the analysis of individual texts.
editorial assistance often requires modeling interactions between pairs of texts.
arXiv Detail & Related papers (2022-04-22T16:39:38Z) - Industry and Academic Research in Computer Vision [5.634825161148484]
This work aims to study the dynamic between research in the industry and academia in computer vision.
The results are demonstrated on a set of top-5 vision conferences that are representative of the field.
arXiv Detail & Related papers (2021-07-10T20:09:52Z) - What's New? Summarizing Contributions in Scientific Literature [85.95906677964815]
We introduce a new task of disentangled paper summarization, which seeks to generate separate summaries for the paper contributions and the context of the work.
We extend the S2ORC corpus of academic articles by adding disentangled "contribution" and "context" reference labels.
We propose a comprehensive automatic evaluation protocol which reports the relevance, novelty, and disentanglement of generated outputs.
arXiv Detail & Related papers (2020-11-06T02:23:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.