Anomaly Detection and Automated Labeling for Voter Registration File
Changes
- URL: http://arxiv.org/abs/2106.15285v1
- Date: Wed, 16 Jun 2021 21:48:31 GMT
- Title: Anomaly Detection and Automated Labeling for Voter Registration File
Changes
- Authors: Sam Royston, Ben Greenberg, Omeed Tavasoli, Courtenay Cotton
- Abstract summary: Voter eligibility in United States elections is determined by a patchwork of state databases containing information about which citizens are eligible to vote.
Monitoring changes to Voter Registration Files (VRFs) is crucial, given that a malicious actor wishing to disrupt the democratic process in the US would be well-advised to manipulate the contents of these files in order to achieve their goals.
We present a set of methods that make use of machine learning to ease the burden on analysts and administrators in protecting voter rolls.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Voter eligibility in United States elections is determined by a patchwork of
state databases containing information about which citizens are eligible to
vote. Administrators at the state and local level are faced with the
exceedingly difficult task of ensuring that each of their jurisdictions is
properly managed, while also monitoring for improper modifications to the
database. Monitoring changes to Voter Registration Files (VRFs) is crucial,
given that a malicious actor wishing to disrupt the democratic process in the
US would be well-advised to manipulate the contents of these files in order to
achieve their goals. In 2020, we saw election officials perform admirably when
faced with administering one of the most contentious elections in US history,
but much work remains to secure and monitor the election systems Americans rely
on. Using data created by comparing snapshots taken of VRFs over time, we
present a set of methods that make use of machine learning to ease the burden
on analysts and administrators in protecting voter rolls. We first evaluate the
effectiveness of multiple unsupervised anomaly detection methods in detecting
VRF modifications by modeling anomalous changes as sparse additive noise. In
this setting we determine that statistical models comparing administrative
districts within a short time span and non-negative matrix factorization are
most effective for surfacing anomalous events for review. These methods were
deployed during 2019-2020 in our organization's monitoring system and were used
in collaboration with the office of the Iowa Secretary of State. Additionally,
we propose a newly deployed model which uses historical and demographic
metadata to label the likely root cause of database modifications. We hope to
use this model to predict which modifications have known causes and therefore
better identify potentially anomalous modifications.
Related papers
- Auditing for Bias in Ad Delivery Using Inferred Demographic Attributes [50.37313459134418]
We study the effects of inference error on auditing for bias in one prominent application: black-box audit of ad delivery using paired ads.
We propose a way to mitigate the inference error when evaluating skew in ad delivery algorithms.
arXiv Detail & Related papers (2024-10-30T18:57:03Z) - Understanding and Mitigating the Impacts of Differentially Private Census Data on State Level Redistricting [4.589972411795548]
Data users were shaken by the adoption of differential privacy in the 2020 DAS.
We consider two redistricting settings in which a data user might be concerned about the impacts of privacy preserving noise.
We observe that an analyst may come to incorrect conclusions if they do not account for noise.
arXiv Detail & Related papers (2024-09-10T18:11:54Z) - Publicly auditable privacy-preserving electoral rolls [0.32498796510544625]
We study the problem of designing publicly auditable yet privacy-preserving electoral rolls.
The audit can detect polling-day ballot stuffing and denials to eligible voters by malicious polling officers.
The entire electoral roll is never revealed, which prevents any large-scale systematic voter targeting and manipulation.
arXiv Detail & Related papers (2024-02-18T13:11:48Z) - Efficient Weighting Schemes for Auditing Instant-Runoff Voting Elections [57.67176250198289]
AWAIRE involves adaptively weighted averages of test statistics, essentially "learning" an effective set of hypotheses to test.
We explore schemes and settings more extensively, to identify and recommend efficient choices for practice.
A limitation of the current AWAIRE implementation is its restriction to a small number of candidates.
arXiv Detail & Related papers (2024-02-18T10:13:01Z) - The Decisive Power of Indecision: Low-Variance Risk-Limiting Audits and Election Contestation via Marginal Mark Recording [51.82772358241505]
Risk-limiting audits (RLAs) are techniques for verifying the outcomes of large elections.
We define new families of audits that improve efficiency and offer advances in statistical power.
New audits are enabled by revisiting the standard notion of a cast-vote record so that it can declare multiple possible mark interpretations.
arXiv Detail & Related papers (2024-02-09T16:23:54Z) - Adaptively Weighted Audits of Instant-Runoff Voting Elections: AWAIRE [61.872917066847855]
Methods for auditing instant-runoff voting (IRV) elections are either not risk-limiting or require cast vote records (CVRs), the voting system's electronic record of the votes on each ballot.
We develop an RLA method that uses adaptively weighted averages of test supermartingales to efficiently audit IRV elections when CVRs are not available.
arXiv Detail & Related papers (2023-07-20T15:55:34Z) - New Algorithms and Applications for Risk-Limiting Audits [4.375873233252245]
Risk-limiting audits (RLAs) are a significant tool in increasing confidence in the accuracy of elections.
This work suggests a new generic method, called Batchcomp", for converting classical (ballot-level) RLAs into ones that operate on batches.
We present an adaptation of ALPHA, an existing RLA method, to a method which applies to censuses.
arXiv Detail & Related papers (2023-05-06T13:34:39Z) - Auditing Ranked Voting Elections with Dirichlet-Tree Models: First Steps [23.14629947453497]
Ranked voting systems are used in many places around the world.
There is no known risk-limiting audit (RLA) method for STV other than a full hand count.
We present a new approach to auditing ranked systems that uses a statistical model, a Dirichlet-tree, that can cope with high-dimensional parameters in a computationally efficient manner.
arXiv Detail & Related papers (2022-06-29T13:06:42Z) - A New Bandit Setting Balancing Information from State Evolution and
Corrupted Context [52.67844649650687]
We propose a new sequential decision-making setting combining key aspects of two established online learning problems with bandit feedback.
The optimal action to play at any given moment is contingent on an underlying changing state which is not directly observable by the agent.
We present an algorithm that uses a referee to dynamically combine the policies of a contextual bandit and a multi-armed bandit.
arXiv Detail & Related papers (2020-11-16T14:35:37Z) - Leveraging Administrative Data for Bias Audits: Assessing Disparate
Coverage with Mobility Data for COVID-19 Policy [61.60099467888073]
We show how linking administrative data can enable auditing mobility data for bias.
We show that older and non-white voters are less likely to be captured by mobility data.
We show that allocating public health resources based on such mobility data could disproportionately harm high-risk elderly and minority groups.
arXiv Detail & Related papers (2020-11-14T02:04:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.