Integrating Reward Maximization and Population Estimation: Sequential
Decision-Making for Internal Revenue Service Audit Selection
- URL: http://arxiv.org/abs/2204.11910v3
- Date: Tue, 24 Jan 2023 13:51:06 GMT
- Title: Integrating Reward Maximization and Population Estimation: Sequential
Decision-Making for Internal Revenue Service Audit Selection
- Authors: Peter Henderson, Ben Chugg, Brandon Anderson, Kristen Altenburger,
Alex Turk, John Guyton, Jacob Goldin, Daniel E. Ho
- Abstract summary: We introduce a new setting, optimize-and-estimate structured bandits.
This setting is inherent to many public and private sector applications.
We demonstrate its importance on real data from the United States Internal Revenue Service.
- Score: 2.2182596728059116
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce a new setting, optimize-and-estimate structured bandits. Here, a
policy must select a batch of arms, each characterized by its own context, that
would allow it to both maximize reward and maintain an accurate (ideally
unbiased) population estimate of the reward. This setting is inherent to many
public and private sector applications and often requires handling delayed
feedback, small data, and distribution shifts. We demonstrate its importance on
real data from the United States Internal Revenue Service (IRS). The IRS
performs yearly audits of the tax base. Two of its most important objectives
are to identify suspected misreporting and to estimate the "tax gap" -- the
global difference between the amount paid and true amount owed. Based on a
unique collaboration with the IRS, we cast these two processes as a unified
optimize-and-estimate structured bandit. We analyze optimize-and-estimate
approaches to the IRS problem and propose a novel mechanism for unbiased
population estimation that achieves rewards comparable to baseline approaches.
This approach has the potential to improve audit efficacy, while maintaining
policy-relevant estimates of the tax gap. This has important social
consequences given that the current tax gap is estimated at nearly half a
trillion dollars. We suggest that this problem setting is fertile ground for
further research and we highlight its interesting challenges. The results of
this and related research are currently being incorporated into the continual
improvement of the IRS audit selection methods.
Related papers
- Optimal Baseline Corrections for Off-Policy Contextual Bandits [61.740094604552475]
We aim to learn decision policies that optimize an unbiased offline estimate of an online reward metric.
We propose a single framework built on their equivalence in learning scenarios.
Our framework enables us to characterize the variance-optimal unbiased estimator and provide a closed-form solution for it.
arXiv Detail & Related papers (2024-05-09T12:52:22Z) - A Taxation Perspective for Fair Re-ranking [61.946428892727795]
We introduce a new fair re-ranking method named Tax-rank, which levies taxes based on the difference in utility between two items.
Our model Tax-rank offers a superior tax policy for fair re-ranking, theoretically demonstrating both continuity and controllability over accuracy loss.
arXiv Detail & Related papers (2024-04-27T08:21:29Z) - Fairness Incentives in Response to Unfair Dynamic Pricing [7.991187769447732]
We design a basic simulated economy, wherein we generate corporate taxation schedules geared to incentivizing firms towards adopting fair pricing behaviours.
To cover a range of possible policy scenarios, we formulate our social planner's learning problem as a multi-armed bandit, a contextual bandit and as a full reinforcement learning (RL) problem.
We find that social welfare improves on that of the fairness-agnostic baseline, and approaches that of the analytically optimal fairness-aware baseline for the multi-armed and contextual bandit settings.
arXiv Detail & Related papers (2024-04-22T23:12:58Z) - The Decisive Power of Indecision: Low-Variance Risk-Limiting Audits and Election Contestation via Marginal Mark Recording [51.82772358241505]
Risk-limiting audits (RLAs) are techniques for verifying the outcomes of large elections.
We define new families of audits that improve efficiency and offer advances in statistical power.
New audits are enabled by revisiting the standard notion of a cast-vote record so that it can declare multiple possible mark interpretations.
arXiv Detail & Related papers (2024-02-09T16:23:54Z) - Entropy Regularization for Population Estimation [3.0175479520609887]
Mean reward estimation tasks have been shown to be essential for public policy settings.
We show that leveraging entropy and KL divergence can yield a better trade-off between reward and estimator variance than existing baselines.
arXiv Detail & Related papers (2022-08-24T19:17:39Z) - Algorithmic Fairness and Vertical Equity: Income Fairness with IRS Tax
Audit Models [73.24381010980606]
This study examines issues of algorithmic fairness in the context of systems that inform tax audit selection by the IRS.
We show how the use of more flexible machine learning methods for selecting audits may affect vertical equity.
Our results have implications for the design of algorithmic tools across the public sector.
arXiv Detail & Related papers (2022-06-20T16:27:06Z) - Confidence-Budget Matching for Sequential Budgeted Learning [69.77435313099366]
We formalize decision-making problems with querying budget.
We consider multi-armed bandits, linear bandits, and reinforcement learning problems.
We show that CBM based algorithms perform well in the presence of adversity.
arXiv Detail & Related papers (2021-02-05T19:56:31Z) - An Evolutionary Game Model for Understanding Fraud in Consumption Taxes [0.0]
This paper presents a computational evolutionary game model to study and understand fraud dynamics in the consumption tax system.
Players are cooperators if they correctly declare their value added tax (VAT), and are defectors otherwise.
Since transactions between companies must be declared by both the buyer and seller, a strategy adopted by one influences the other's payoff.
arXiv Detail & Related papers (2021-01-12T11:53:31Z) - CRACT: Cascaded Regression-Align-Classification for Robust Visual
Tracking [97.84109669027225]
We introduce an improved proposal refinement module, Cascaded Regression-Align- Classification (CRAC)
CRAC yields new state-of-the-art performances on many benchmarks.
In experiments on seven benchmarks including OTB-2015, UAV123, NfS, VOT-2018, TrackingNet, GOT-10k and LaSOT, our CRACT exhibits very promising results in comparison with state-of-the-art competitors.
arXiv Detail & Related papers (2020-11-25T02:18:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.