Can LLMs Identify Tax Abuse?
- URL: http://arxiv.org/abs/2508.20097v1
- Date: Sun, 10 Aug 2025 15:15:45 GMT
- Title: Can LLMs Identify Tax Abuse?
- Authors: Andrew Blair-Stanek, Nils Holzenberger, Benjamin Van Durme,
- Abstract summary: We investigate whether large language models can discover and analyze U.S. tax-minimization strategies.<n>We evaluate the most advanced LLMs on their ability to (1) interpret and verify tax strategies, (2) fill in gaps in partially specified strategies, and (3) generate complete, end-to-end strategies from scratch.
- Score: 53.3007576756411
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We investigate whether large language models can discover and analyze U.S. tax-minimization strategies. This real-world domain challenges even seasoned human experts, and progress can reduce tax revenue lost from well-advised, wealthy taxpayers. We evaluate the most advanced LLMs on their ability to (1) interpret and verify tax strategies, (2) fill in gaps in partially specified strategies, and (3) generate complete, end-to-end strategies from scratch. This domain should be of particular interest to the LLM reasoning community: unlike synthetic challenge problems or scientific reasoning tasks, U.S. tax law involves navigating hundreds of thousands of pages of statutes, case law, and administrative guidance, all updated regularly. Notably, LLM-based reasoning identified an entirely novel tax strategy, highlighting these models' potential to revolutionize tax agencies' fight against tax abuse.
Related papers
- Multi-Source Retrieval and Reasoning for Legal Sentencing Prediction [50.6851250608938]
Legal sentencing prediction (LSP) remains difficult due to its need for fine-grained objective knowledge and flexible subjective reasoning.<n>We propose $MSR2$, a framework that integrates multi-source retrieval and reasoning in LLMs with reinforcement learning.<n>Experiments on two real-world datasets show that $MSR2$ improves both accuracy and interpretability in LSP.
arXiv Detail & Related papers (2026-02-04T15:55:55Z) - Enabling Equitable Access to Trustworthy Financial Reasoning [50.73061215297832]
Tax filing requires complex reasoning, combining application of overlapping rules with numerical calculations.<n>We propose an approach that integrates LLMs with a symbolic solver to calculate tax obligations.<n>We show how combining up-front translation of plain-text rules into formal logic programs, combined with intelligently retrieved exemplars for formal case representations, can dramatically improve performance.
arXiv Detail & Related papers (2025-08-28T17:55:07Z) - Using Large Language Models for Legal Decision-Making in Austrian Value-Added Tax Law: An Experimental Study [0.0]
This paper provides an experimental evaluation of the capability of large language models (LLMs) to assist in legal decision-making within the framework of Austrian and European Union value-added tax (VAT) law.
arXiv Detail & Related papers (2025-07-11T10:19:56Z) - TaxAgent: How Large Language Model Designs Fiscal Policy [22.859190941594296]
This study introduces TaxAgent, a novel integration of large language models (LLMs) with agent-based modeling (ABM) to design adaptive tax policies.<n>In our macroeconomic simulation, heterogeneous H-Agents (households) simulate real-world taxpayer behaviors while the TaxAgent (government) utilizes LLMs to iteratively optimize tax rates, balancing equity and productivity.<n> Benchmarked against Saez Optimal Taxation, U.S. federal income taxes, and free markets, TaxAgent achieves superior equity-efficiency trade-offs.
arXiv Detail & Related papers (2025-06-03T13:06:19Z) - Can AI expose tax loopholes? Towards a new generation of legal policy assistants [7.237068561453082]
We introduce a novel prototype system designed to address the issues of tax loopholes and tax avoidance.<n>Our hybrid solution integrates a natural language interface with a domain-specific language tailored for planning.
arXiv Detail & Related papers (2025-03-21T17:40:06Z) - Taxation Perspectives from Large Language Models: A Case Study on Additional Tax Penalties [5.185522256407782]
We introduce PLAT, a new benchmark designed to assess the ability of LLMs to predict the legitimacy of additional tax penalties.<n>Our experiments with six LLMs reveal that their baseline capabilities are limited, especially when dealing with conflicting issues that demand a comprehensive understanding.
arXiv Detail & Related papers (2025-03-05T12:24:20Z) - RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios [58.90106984375913]
RuleArena is a novel and challenging benchmark designed to evaluate the ability of large language models (LLMs) to follow complex, real-world rules in reasoning.<n> Covering three practical domains -- airline baggage fees, NBA transactions, and tax regulations -- RuleArena assesses LLMs' proficiency in handling intricate natural language instructions.
arXiv Detail & Related papers (2024-12-12T06:08:46Z) - A Taxation Perspective for Fair Re-ranking [61.946428892727795]
We introduce a new fair re-ranking method named Tax-rank, which levies taxes based on the difference in utility between two items.
Our model Tax-rank offers a superior tax policy for fair re-ranking, theoretically demonstrating both continuity and controllability over accuracy loss.
arXiv Detail & Related papers (2024-04-27T08:21:29Z) - A Comprehensive Evaluation of Large Language Models on Legal Judgment
Prediction [60.70089334782383]
Large language models (LLMs) have demonstrated great potential for domain-specific applications.
Recent disputes over GPT-4's law evaluation raise questions concerning their performance in real-world legal tasks.
We design practical baseline solutions based on LLMs and test on the task of legal judgment prediction.
arXiv Detail & Related papers (2023-10-18T07:38:04Z) - A Knowledge Graph for Assessing Aggressive Tax Planning Strategies [1.4315915057750197]
Laws in different states may have unforeseen interaction effects, which can be exploited by allowing multinational companies to minimize taxes.
We present a knowledge graph of multinational companies and their relationships, comprising almost 1.5M business entities.
We show that commonly known tax planning strategies can be formulated as subgraph queries to that graph, which allows for identifying companies using certain strategies.
arXiv Detail & Related papers (2020-08-12T11:19:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.