Algorithmically Curated Lies: How Search Engines Handle Misinformation
about US Biolabs in Ukraine
- URL: http://arxiv.org/abs/2401.13832v1
- Date: Wed, 24 Jan 2024 22:15:38 GMT
- Title: Algorithmically Curated Lies: How Search Engines Handle Misinformation
about US Biolabs in Ukraine
- Authors: Elizaveta Kuznetsova, Mykola Makhortykh, Maryna Sydorova, Aleksandra
Urman, Ilaria Vitulano, Martha Stolze
- Abstract summary: We conduct virtual agent-based algorithm audits of Google, Bing, and Yandex search outputs in June 2022.
We find significant disparities in misinformation exposure based on the language of search, with all search engines presenting a higher number of false stories in Russian.
These observations stress the possibility of AICSs being vulnerable to manipulation, in particular in the case of the unfolding propaganda campaigns.
- Score: 39.58317527488534
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The growing volume of online content prompts the need for adopting
algorithmic systems of information curation. These systems range from web
search engines to recommender systems and are integral for helping users stay
informed about important societal developments. However, unlike journalistic
editing the algorithmic information curation systems (AICSs) are known to be
subject to different forms of malperformance which make them vulnerable to
possible manipulation. The risk of manipulation is particularly prominent in
the case when AICSs have to deal with information about false claims that
underpin propaganda campaigns of authoritarian regimes. Using as a case study
of the Russian disinformation campaign concerning the US biolabs in Ukraine, we
investigate how one of the most commonly used forms of AICSs - i.e. web search
engines - curate misinformation-related content. For this aim, we conduct
virtual agent-based algorithm audits of Google, Bing, and Yandex search outputs
in June 2022. Our findings highlight the troubling performance of search
engines. Even though some search engines, like Google, were less likely to
return misinformation results, across all languages and locations, the three
search engines still mentioned or promoted a considerable share of false
content (33% on Google; 44% on Bing, and 70% on Yandex). We also find
significant disparities in misinformation exposure based on the language of
search, with all search engines presenting a higher number of false stories in
Russian. Location matters as well with users from Germany being more likely to
be exposed to search results promoting false information. These observations
stress the possibility of AICSs being vulnerable to manipulation, in particular
in the case of the unfolding propaganda campaigns, and underline the importance
of monitoring performance of these systems to prevent it.
Related papers
- Auditing Google's Search Algorithm: Measuring News Diversity Across Brazil, the UK, and the US [0.0]
This study examines the influence of Google's search algorithm on news diversity by analyzing search results in Brazil, the UK, and the US.
It explores how Google's system preferentially favors a limited number of news outlets.
Findings indicate a slight leftward bias in search outcomes and a preference for popular, often national outlets.
arXiv Detail & Related papers (2024-10-31T11:49:16Z) - Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions [89.35345649303451]
Generative search engines have the potential to transform how people seek information online.
But generated responses from existing large language models (LLMs)-backed generative search engines may not always be accurate.
Retrieval-augmented generation exacerbates safety concerns, since adversaries may successfully evade the entire system.
arXiv Detail & Related papers (2024-02-25T11:22:19Z) - User Attitudes to Content Moderation in Web Search [49.1574468325115]
We examine the levels of support for different moderation practices applied to potentially misleading and/or potentially offensive content in web search.
We find that the most supported practice is informing users about potentially misleading or offensive content, and the least supported one is the complete removal of search results.
More conservative users and users with lower levels of trust in web search results are more likely to be against content moderation in web search.
arXiv Detail & Related papers (2023-10-05T10:57:15Z) - Novelty in news search: a longitudinal study of the 2020 US elections [62.997667081978825]
We analyze novelty, a measurement of new items that emerge in the top news search results.
We find more new items emerging for election related queries compared to topical or stable queries.
We argue that such imbalances affect the visibility of political candidates in news searches during electoral periods.
arXiv Detail & Related papers (2022-11-09T08:42:37Z) - Searching for Representation: A sociotechnical audit of googling for
members of U.S. Congress [2.4366811507669124]
10% of the top Google search results are likely to mislead California information seekers who use search to identify their congressional representatives.
70% of the misleading results appear in featured snippets above the organic search results.
factors identified include Google's heavy reliance on Wikipedia, the lack of authoritative, machine parsable, high accuracy data about the identity of elected officials based on geographic location, and the search engine's treatment of under-specified queries.
arXiv Detail & Related papers (2021-09-14T23:13:02Z) - The Matter of Chance: Auditing Web Search Results Related to the 2020
U.S. Presidential Primary Elections Across Six Search Engines [68.8204255655161]
We look at the text search results for "us elections", "donald trump", "joe biden" and "bernie sanders" queries on Google, Baidu, Bing, DuckDuckGo, Yahoo, and Yandex.
Our findings indicate substantial differences in the search results between search engines and multiple discrepancies within the results generated for different agents.
arXiv Detail & Related papers (2021-05-03T11:18:19Z) - Detection of fake news on CoViD-19 on Web Search Engines [0.0]
After China reported the first cases of the new coronavirus (SARS-CoV-2), unreliable and not fully accurate information has started spreading faster than the virus itself.
This study aims to detect potential misleading and fake contents by capturing and analysing textual information, which flow through Search Engines.
arXiv Detail & Related papers (2021-03-22T13:07:26Z) - Machine Learning Explanations to Prevent Overtrust in Fake News
Detection [64.46876057393703]
This research investigates the effects of an Explainable AI assistant embedded in news review platforms for combating the propagation of fake news.
We design a news reviewing and sharing interface, create a dataset of news stories, and train four interpretable fake news detection algorithms.
For a deeper understanding of Explainable AI systems, we discuss interactions between user engagement, mental model, trust, and performance measures in the process of explaining.
arXiv Detail & Related papers (2020-07-24T05:42:29Z) - A Deeper Investigation of the Importance of Wikipedia Links to the
Success of Search Engines [7.433327915285967]
We report the results of an investigation into the incidence of Wikipedia links in search engine results pages (SERPs)
We find that Wikipedia links are extremely common in important search contexts, appearing in 67-84% of all SERPs for common and trending queries, but less often for medical queries.
Our findings reinforce the complementary notions that (1) Wikipedia content and research has major impact outside of the Wikipedia domain and (2) powerful technologies like search engines are highly reliant on free content created by volunteers.
arXiv Detail & Related papers (2020-04-21T19:58:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.