Generative AI as a Linguistic Equalizer in Global Science
- URL: http://arxiv.org/abs/2511.11687v1
- Date: Wed, 12 Nov 2025 08:48:15 GMT
- Title: Generative AI as a Linguistic Equalizer in Global Science
- Authors: Dragan Filimonovic, Christian Rutzer, Jeffrey Macher, Rolf Weder,
- Abstract summary: We provide the first large-scale evidence testing whether generative AI (GenAI) acts as a linguistic equalizer in global science.<n>We compare GenAI-assisted and non-assisted publications from authors in non-English-speaking countries.<n>We find significant and growing convergence for GenAI-assisted publications after the release of ChatGPT in late 2022.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: For decades, the dominance of English has created a substantial barrier in global science, disadvantaging non-native speakers. The recent rise of generative AI (GenAI) offers a potential technological response to this long-standing inequity. We provide the first large-scale evidence testing whether GenAI acts as a linguistic equalizer in global science. Drawing on 5.65 million scientific articles published from 2021 to 2024, we compare GenAI-assisted and non-assisted publications from authors in non-English-speaking countries. Using text embeddings derived from a pretrained large language model (SciBERT), we measure each publication's linguistic similarity to a benchmark of scientific writing from U.S.-based authors and track stylistic convergence over time. We find significant and growing convergence for GenAI-assisted publications after the release of ChatGPT in late 2022. The effect is strongest for domestic coauthor teams from countries linguistically distant from English. These findings provide large-scale evidence that GenAI is beginning to reshape global science communication by reducing language barriers in research.
Related papers
- Towards Autonomous Mathematics Research [48.29504087871558]
We introduce Aletheia, a math research agent that iteratively generates, verifies, and revises solutions end-to-end in natural language.<n>Specifically, Aletheia is powered by an advanced version of Gemini Deep Think for challenging reasoning problems.<n>We demonstrate Aletheia from Olympiad problems to PhD-level exercises and most notably, through several distinct milestones in AI-assisted mathematics research.
arXiv Detail & Related papers (2026-02-10T18:50:15Z) - AI-Assisted Writing Is Growing Fastest Among Non-English-Speaking and Less Established Scientists [2.9557942678513007]
We analyze over two million full-text biomedical publications from PubMed Central from 2021 to 2024.<n>We observe a significant post-ChatGPT surge in AI-assisted writing, with adoption growing fastest in contexts where language barriers are most pronounced.<n>Increased AI usage was associated with a modest increase in productivity, narrowing the publication gap between scientists from English-speaking and non-English-speaking countries.
arXiv Detail & Related papers (2025-11-19T21:00:18Z) - Who Gets Seen in the Age of AI? Adoption Patterns of Large Language Models in Scholarly Writing and Citation Outcomes [1.5702201012657682]
Authors in the Global East adopt AI tools more aggressively, yet Western authors gain more per unit of adoption due to pre-existing penalties for "humanlike" writing.<n>Prestigious journals continue to privilege more human-sounding texts, creating tensions between visibility and gatekeeping.
arXiv Detail & Related papers (2025-09-10T06:05:34Z) - Generative AI and the future of scientometrics: current topics and future questions [0.1638581561083717]
The aim of this paper is to review the use of GenAI in scientometrics, and to begin a debate on the broader implications for the field.<n>We provide an introduction on GenAI's generative and probabilistic nature as rooted in distributional linguistics.<n>We relate this to the debate on the extent to which GenAI might be able to mimic human'reasoning'
arXiv Detail & Related papers (2025-07-01T14:22:16Z) - GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. Human [71.42669028683741]
We present a shared task on binary machine generated text detection conducted as a part of the GenAI workshop at COLING 2025.<n>The task consists of two subtasks: Monolingual (English) and Multilingual.<n>We provide a comprehensive overview of the data, a summary of the results, detailed descriptions of the participating systems, and an in-depth analysis of submissions.
arXiv Detail & Related papers (2025-01-19T11:11:55Z) - Rise of Generative Artificial Intelligence in Science [0.49157446832511503]
generative AI has experienced rapid growth and increasing presence in scientific publications.<n>Over the study period, U.S. researchers contributed nearly two-fifths of global GenAI publications.<n>GenAI research groups tend to have slightly smaller team sizes than found in other AI fields.
arXiv Detail & Related papers (2024-12-30T13:55:28Z) - The Influencer Next Door: How Misinformation Creators Use GenAI [1.1650821883155187]
We find that non-experts increasingly use GenAI to remix, repackage, and (re)produce content to meet their personal needs and desires.
We analyze how these understudied emergent uses of GenAI produce new or accelerated misinformation harms.
arXiv Detail & Related papers (2024-05-22T11:40:22Z) - Identifying and Mitigating the Security Risks of Generative AI [179.2384121957896]
This paper reports the findings of a workshop held at Google on the dual-use dilemma posed by GenAI.
GenAI can be used just as well by attackers to generate new attacks and increase the velocity and efficacy of existing attacks.
We discuss short-term and long-term goals for the community on this topic.
arXiv Detail & Related papers (2023-08-28T18:51:09Z) - Artificial intelligence adoption in the physical sciences, natural
sciences, life sciences, social sciences and the arts and humanities: A
bibliometric analysis of research publications from 1960-2021 [73.06361680847708]
In 1960 14% of 333 research fields were related to AI, but this increased to over half of all research fields by 1972, over 80% by 1986 and over 98% in current times.
In 1960 14% of 333 research fields were related to AI (many in computer science), but this increased to over half of all research fields by 1972, over 80% by 1986 and over 98% in current times.
We conclude that the context of the current surge appears different, and that interdisciplinary AI application is likely to be sustained.
arXiv Detail & Related papers (2023-06-15T14:08:07Z) - Cross-Lingual GenQA: A Language-Agnostic Generative Question Answering
Approach for Open-Domain Question Answering [76.99585451345702]
Open-Retrieval Generative Question Answering (GenQA) is proven to deliver high-quality, natural-sounding answers in English.
We present the first generalization of the GenQA approach for the multilingual environment.
arXiv Detail & Related papers (2021-10-14T04:36:29Z) - Systematic Inequalities in Language Technology Performance across the
World's Languages [94.65681336393425]
We introduce a framework for estimating the global utility of language technologies.
Our analyses involve the field at large, but also more in-depth studies on both user-facing technologies and more linguistic NLP tasks.
arXiv Detail & Related papers (2021-10-13T14:03:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.