SpamDam: Towards Privacy-Preserving and Adversary-Resistant SMS Spam Detection
- URL: http://arxiv.org/abs/2404.09481v1
- Date: Mon, 15 Apr 2024 06:07:10 GMT
- Title: SpamDam: Towards Privacy-Preserving and Adversary-Resistant SMS Spam Detection
- Authors: Yekai Li, Rufan Zhang, Wenxin Rong, Xianghang Mi,
- Abstract summary: SpamDam is a SMS spam detection framework designed to overcome key challenges in detecting and understanding SMS spam.
We have compiled over 76K SMS spam messages from Twitter and Weibo between 2018 and 2023, forming the largest dataset of its kind.
We have rigorously tested the adversarial robustness of SMS spam detection models, introducing the novel reverse backdoor attack.
- Score: 2.0355793807035094
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this study, we introduce SpamDam, a SMS spam detection framework designed to overcome key challenges in detecting and understanding SMS spam, such as the lack of public SMS spam datasets, increasing privacy concerns of collecting SMS data, and the need for adversary-resistant detection models. SpamDam comprises four innovative modules: an SMS spam radar that identifies spam messages from online social networks(OSNs); an SMS spam inspector for statistical analysis; SMS spam detectors(SSDs) that enable both central training and federated learning; and an SSD analyzer that evaluates model resistance against adversaries in realistic scenarios. Leveraging SpamDam, we have compiled over 76K SMS spam messages from Twitter and Weibo between 2018 and 2023, forming the largest dataset of its kind. This dataset has enabled new insights into recent spam campaigns and the training of high-performing binary and multi-label classifiers for spam detection. Furthermore, effectiveness of federated learning has been well demonstrated to enable privacy-preserving SMS spam detection. Additionally, we have rigorously tested the adversarial robustness of SMS spam detection models, introducing the novel reverse backdoor attack, which has shown effectiveness and stealthiness in practical tests.
Related papers
- SpaLLM-Guard: Pairing SMS Spam Detection Using Open-source and Commercial LLMs [1.3198171962008958]
We evaluate the potential of large language models (LLMs), both open-source and commercial, for SMS spam detection.
We compare their performance across zero-shot, few-shot, fine-tuning, and chain-of-thought prompting approaches.
Fine-tuning emerges as the most effective strategy, with Mixtral achieving 98.6% accuracy and a balanced false positive and false negative rate below 2%.
arXiv Detail & Related papers (2025-01-09T06:00:08Z) - Detection and Prevention of Smishing Attacks [0.0]
This work presents a smishing detection model using a content-based analysis approach.
To address the challenge posed by slang, abbreviations, and short forms in text communication, the model normalizes these into standard forms.
Experimental results demonstrate the model effectiveness, achieving classification accuracies of 97.14% for smishing and 96.12% for ham messages.
arXiv Detail & Related papers (2024-12-31T04:07:12Z) - SMS Spam Detection and Classification to Combat Abuse in Telephone Networks Using Natural Language Processing [0.0]
This research addresses the pervasive issue of SMS spam, which poses threats to users' privacy and security.
The study introduces a novel approach utilizing Natural Language Processing (NLP) and machine learning models, particularly BERT (Bidirectional Representations from Transformers) for spam detection and classification.
Evaluation results revealed that the Na"ive Bayes + BERT model achieves the highest accuracy at 97.31% with the fastest execution time of 0.3 seconds on the test dataset.
arXiv Detail & Related papers (2024-06-04T13:44:36Z) - ExplainableDetector: Exploring Transformer-based Language Modeling Approach for SMS Spam Detection with Explainability Analysis [2.849988619791745]
The number of SMS spam has expanded significantly in recent years.
The unstructured format of SMS data creates significant challenges for SMS spam detection.
We employ optimized and fine-tuned transformer-based Large Language Models (LLMs) to solve the problem of spam message detection.
arXiv Detail & Related papers (2024-05-12T11:42:05Z) - Can AI-Generated Text be Reliably Detected? [50.95804851595018]
Large Language Models (LLMs) perform impressively well in various applications.
The potential for misuse of these models in activities such as plagiarism, generating fake news, and spamming has raised concern about their responsible use.
We stress-test the robustness of these AI text detectors in the presence of an attacker.
arXiv Detail & Related papers (2023-03-17T17:53:19Z) - Verifying the Robustness of Automatic Credibility Assessment [50.55687778699995]
We show that meaning-preserving changes in input text can mislead the models.
We also introduce BODEGA: a benchmark for testing both victim models and attack methods on misinformation detection tasks.
Our experimental results show that modern large language models are often more vulnerable to attacks than previous, smaller solutions.
arXiv Detail & Related papers (2023-03-14T16:11:47Z) - Spam Detection Using BERT [0.0]
We build a spam detector using BERT pre-trained model that classifies emails and messages by understanding to their context.
Our spam detector performance was 98.62%, 97.83%, 99.13% and 99.28% respectively.
arXiv Detail & Related papers (2022-06-06T09:09:40Z) - Deep convolutional forest: a dynamic deep ensemble approach for spam
detection in text [219.15486286590016]
This paper introduces a dynamic deep ensemble model for spam detection that adjusts its complexity and extracts features automatically.
As a result, the model achieved high precision, recall, f1-score and accuracy of 98.38%.
arXiv Detail & Related papers (2021-10-10T17:19:37Z) - MOST: A Multi-Oriented Scene Text Detector with Localization Refinement [67.35280008722255]
We propose a new algorithm for scene text detection, which puts forward a set of strategies to significantly improve the quality of text localization.
Specifically, a Text Feature Alignment Module (TFAM) is proposed to dynamically adjust the receptive fields of features.
A Position-Aware Non-Maximum Suppression (PA-NMS) module is devised to exclude unreliable ones.
arXiv Detail & Related papers (2021-04-02T14:34:41Z) - Robust and Verifiable Information Embedding Attacks to Deep Neural
Networks via Error-Correcting Codes [81.85509264573948]
In the era of deep learning, a user often leverages a third-party machine learning tool to train a deep neural network (DNN) classifier.
In an information embedding attack, an attacker is the provider of a malicious third-party machine learning tool.
In this work, we aim to design information embedding attacks that are verifiable and robust against popular post-processing methods.
arXiv Detail & Related papers (2020-10-26T17:42:42Z) - Robust Spammer Detection by Nash Reinforcement Learning [64.80986064630025]
We develop a minimax game where the spammers and spam detectors compete with each other on their practical goals.
We show that an optimization algorithm can reliably find an equilibrial detector that can robustly prevent spammers with any mixed spamming strategies from attaining their practical goal.
arXiv Detail & Related papers (2020-06-10T21:18:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.