Unsupervised Machine Learning for Explainable Medicare Fraud Detection
- URL: http://arxiv.org/abs/2211.02927v2
- Date: Wed, 9 Nov 2022 15:18:19 GMT
- Title: Unsupervised Machine Learning for Explainable Medicare Fraud Detection
- Authors: Shubhranshu Shekhar, Jetson Leder-Luis, Leman Akoglu
- Abstract summary: We develop novel machine learning tools to identify providers that overbill Medicare.
Using large-scale Medicare claims data, we identify patterns consistent with fraud or overbilling.
Our proposed approach for Medicare fraud detection is fully unsupervised, not relying on any labeled training data.
- Score: 16.275152941805622
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The US federal government spends more than a trillion dollars per year on
health care, largely provided by private third parties and reimbursed by the
government. A major concern in this system is overbilling, waste and fraud by
providers, who face incentives to misreport on their claims in order to receive
higher payments. In this paper, we develop novel machine learning tools to
identify providers that overbill Medicare, the US federal health insurance
program for elderly adults and the disabled. Using large-scale Medicare claims
data, we identify patterns consistent with fraud or overbilling among inpatient
hospitalizations. Our proposed approach for Medicare fraud detection is fully
unsupervised, not relying on any labeled training data, and is explainable to
end users, providing reasoning and interpretable insights into the potentially
suspicious behavior of the flagged providers. Data from the Department of
Justice on providers facing anti-fraud lawsuits and several case studies
validate our approach and findings both quantitatively and qualitatively.
Related papers
- Retrieval Augmented Thought Process for Private Data Handling in Healthcare [53.89406286212502]
We introduce the Retrieval-Augmented Thought Process (RATP)
RATP formulates the thought generation of Large Language Models (LLMs)
On a private dataset of electronic medical records, RATP achieves 35% additional accuracy compared to in-context retrieval-augmented generation for the question-answering task.
arXiv Detail & Related papers (2024-02-12T17:17:50Z) - Correlating Medi- Claim Service by Deep Learning Neural Networks [0.0]
Medical insurance claims are of organized crimes related to patients, physicians, diagnostic centers, and insurance providers.
The Convolution Neural Network architecture is used to detect fraudulent claims through a correlation study of regression models.
Supervised and unsupervised classifiers are used to detect fraud and non-fraud claims.
arXiv Detail & Related papers (2023-08-08T07:40:21Z) - Transaction Fraud Detection via an Adaptive Graph Neural Network [64.9428588496749]
We propose an Adaptive Sampling and Aggregation-based Graph Neural Network (ASA-GNN) that learns discriminative representations to improve the performance of transaction fraud detection.
A neighbor sampling strategy is performed to filter noisy nodes and supplement information for fraudulent nodes.
Experiments on three real financial datasets demonstrate that the proposed method ASA-GNN outperforms state-of-the-art ones.
arXiv Detail & Related papers (2023-07-11T07:48:39Z) - Operationalizing Counterfactual Metrics: Incentives, Ranking, and
Information Asymmetry [62.53919624802853]
We analyze the incentive misalignments that arise from such average treated outcome metrics.
We show how counterfactual metrics can be modified to behave reasonably in patient-facing ranking systems.
arXiv Detail & Related papers (2023-05-24T00:24:38Z) - On some studies of Fraud Detection Pipeline and related issues from the
scope of Ensemble Learning and Graph-based Learning [0.5820960526832067]
The UK anti-fraud charity Fraud Advisory Panel estimates business costs of fraud at 144 billion.
Building an efficient fraud detection system is challenging due to many difficult problems, e.g.imbalanced data, computing costs, etc.
arXiv Detail & Related papers (2022-05-10T02:13:58Z) - Multiple Inputs Neural Networks for Medicare fraud Detection [0.0]
Medicare fraud costs around 13 billion euros in Europe and between 21 billion and 71 billion US dollars per year in the United States.
This study aims to use artificial neural network based classifiers to predict medicare fraud.
arXiv Detail & Related papers (2022-03-11T10:36:53Z) - FedMed-GAN: Federated Domain Translation on Unsupervised Cross-Modality
Brain Image Synthesis [55.939957482776194]
We propose a new benchmark for federated domain translation on unsupervised brain image synthesis (termed as FedMed-GAN)
FedMed-GAN mitigates the mode collapse without sacrificing the performance of generators.
A comprehensive evaluation is provided for comparing FedMed-GAN and other centralized methods.
arXiv Detail & Related papers (2022-01-22T02:50:29Z) - Unsupervised anomaly detection for discrete sequence healthcare data [1.2667973028134798]
We propose a machine learning model that automates fraud detection in an unsupervised way.
Two deep learning approaches include LSTM neural network for prediction next patient visit and a seq2seq model.
We use real data on sequences of patients' visits data from Allianz company for the validation.
arXiv Detail & Related papers (2020-07-20T13:42:21Z) - Computational Support for Substance Use Disorder Prevention, Detection,
Treatment, and Recovery [62.9980747784214]
Substance Use Disorders involve the misuse of alcohol, opioids, marijuana, and methamphetamine.
1 in 12 U.S. adults have or have had a substance use disorder.
National Institute on Drug Abuse estimates that SUDs cost the U.S. $520 billion annually.
arXiv Detail & Related papers (2020-06-23T18:30:20Z) - DFraud3- Multi-Component Fraud Detection freeof Cold-start [50.779498955162644]
The Cold-start is a significant problem referring to the failure of a detection system to recognize the authenticity of a new user.
In this paper, we model a review system as a Heterogeneous InformationNetwork (HIN) which enables a unique representation to every component.
HIN with graph induction helps to address the camouflage issue (fraudsterswith genuine reviews) which has shown to be more severe when it is coupled with cold-start, i.e., new fraudsters with genuine first reviews.
arXiv Detail & Related papers (2020-06-10T08:20:13Z) - Uncovering Insurance Fraud Conspiracy with Network Learning [34.609076567889694]
We develop a novel data-driven procedure to identify groups of organized fraudsters.
We introduce a device-sharing network among claimants.
We then develop an automated solution for fraud detection based on graph learning algorithms.
arXiv Detail & Related papers (2020-02-27T13:15:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.