Searching for Representation: A sociotechnical audit of googling for
members of U.S. Congress
- URL: http://arxiv.org/abs/2109.07012v1
- Date: Tue, 14 Sep 2021 23:13:02 GMT
- Title: Searching for Representation: A sociotechnical audit of googling for
members of U.S. Congress
- Authors: Emma Lurie and Deirdre K. Mulligan
- Abstract summary: 10% of the top Google search results are likely to mislead California information seekers who use search to identify their congressional representatives.
70% of the misleading results appear in featured snippets above the organic search results.
factors identified include Google's heavy reliance on Wikipedia, the lack of authoritative, machine parsable, high accuracy data about the identity of elected officials based on geographic location, and the search engine's treatment of under-specified queries.
- Score: 2.4366811507669124
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: High-quality online civic infrastructure is increasingly critical for the
success of democratic processes. There is a pervasive reliance on search
engines to find facts and information necessary for political participation and
oversight. We find that approximately 10\% of the top Google search results are
likely to mislead California information seekers who use search to identify
their congressional representatives. 70\% of the misleading results appear in
featured snippets above the organic search results. We use both qualitative and
quantitative methods to understand what aspects of the information ecosystem
lead to this sociotechnical breakdown. Factors identified include Google's
heavy reliance on Wikipedia, the lack of authoritative, machine parsable, high
accuracy data about the identity of elected officials based on geographic
location, and the search engine's treatment of under-specified queries. We
recommend steps that Google can take to meet its stated commitment to providing
high quality civic information, and steps that information providers can take
to improve the legibility and quality of information about congressional
representatives available to search algorithms.
Related papers
- CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval [52.134133938779776]
We present CLARINET, a system that asks informative clarification questions by choosing questions whose answers would maximize certainty in the correct candidate.
Our approach works by augmenting a large language model (LLM) to condition on a retrieval distribution, finetuning end-to-end to generate the question that would have maximized the rank of the true candidate at each turn.
arXiv Detail & Related papers (2024-04-28T18:21:31Z) - Algorithmic Misjudgement in Google Search Results: Evidence from Auditing the US Online Electoral Information Environment [0.0]
We study the extent to which government-maintained web domains are represented in the online electoral information environment.
We find that 71% of all occurrences of state, county, and local government sources were mistargeted.
We argue that ensuring the correct targeting of these sources to their respective constituents is a critical part of Google's role in facilitating access to authoritative and locally-relevant electoral information.
arXiv Detail & Related papers (2024-04-06T17:09:04Z) - Algorithmically Curated Lies: How Search Engines Handle Misinformation
about US Biolabs in Ukraine [39.58317527488534]
We conduct virtual agent-based algorithm audits of Google, Bing, and Yandex search outputs in June 2022.
We find significant disparities in misinformation exposure based on the language of search, with all search engines presenting a higher number of false stories in Russian.
These observations stress the possibility of AICSs being vulnerable to manipulation, in particular in the case of the unfolding propaganda campaigns.
arXiv Detail & Related papers (2024-01-24T22:15:38Z) - Algorithmic amplification of biases on Google Search [0.6167267484484484]
This paper investigates how individuals' preexisting attitudes influence the modern information-seeking process.
Individuals with opposing attitudes on abortion receive different search results.
Google Search engine reinforces preexisting beliefs in search results.
arXiv Detail & Related papers (2024-01-17T08:24:57Z) - User Attitudes to Content Moderation in Web Search [49.1574468325115]
We examine the levels of support for different moderation practices applied to potentially misleading and/or potentially offensive content in web search.
We find that the most supported practice is informing users about potentially misleading or offensive content, and the least supported one is the complete removal of search results.
More conservative users and users with lower levels of trust in web search results are more likely to be against content moderation in web search.
arXiv Detail & Related papers (2023-10-05T10:57:15Z) - Personalization of Web Search During the 2020 US Elections [0.0]
We study the effect of user characteristics and behavior on search results in a politically relevant context.
We set up a population of 150 synthetic internet users who are randomly located across 25 US cities.
These users differ in their browsing preferences and political ideology, and they build up realistic browsing and search histories.
arXiv Detail & Related papers (2022-09-28T11:18:56Z) - Algorithmic Fairness Datasets: the Story so Far [68.45921483094705]
Data-driven algorithms are studied in diverse domains to support critical decisions, directly impacting people's well-being.
A growing community of researchers has been investigating the equity of existing algorithms and proposing novel ones, advancing the understanding of risks and opportunities of automated decision-making for historically disadvantaged populations.
Progress in fair Machine Learning hinges on data, which can be appropriately used only if adequately documented.
Unfortunately, the algorithmic fairness community suffers from a collective data documentation debt caused by a lack of information on specific resources (opacity) and scatteredness of available information (sparsity)
arXiv Detail & Related papers (2022-02-03T17:25:46Z) - Exposing Query Identification for Search Transparency [69.06545074617685]
We explore the feasibility of approximate exposing query identification (EQI) as a retrieval task by reversing the role of queries and documents in two classes of search systems.
We derive an evaluation metric to measure the quality of a ranking of exposing queries, as well as conducting an empirical analysis focusing on various practical aspects of approximate EQI.
arXiv Detail & Related papers (2021-10-14T20:19:27Z) - The Matter of Chance: Auditing Web Search Results Related to the 2020
U.S. Presidential Primary Elections Across Six Search Engines [68.8204255655161]
We look at the text search results for "us elections", "donald trump", "joe biden" and "bernie sanders" queries on Google, Baidu, Bing, DuckDuckGo, Yahoo, and Yandex.
Our findings indicate substantial differences in the search results between search engines and multiple discrepancies within the results generated for different agents.
arXiv Detail & Related papers (2021-05-03T11:18:19Z) - Political audience diversity and news reliability in algorithmic ranking [54.23273310155137]
We propose using the political diversity of a website's audience as a quality signal.
Using news source reliability ratings from domain experts and web browsing data from a diverse sample of 6,890 U.S. citizens, we first show that websites with more extreme and less politically diverse audiences have lower journalistic standards.
arXiv Detail & Related papers (2020-07-16T02:13:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.