Related papers: AIMM: An AI-Driven Multimodal Framework for Detecting Social-Media-Influenced Stock Market Manipulation

URL: http://arxiv.org/abs/2512.16103v1
Date: Thu, 18 Dec 2025 02:42:01 GMT
Title: AIMM: An AI-Driven Multimodal Framework for Detecting Social-Media-Influenced Stock Market Manipulation
Authors: Sandeep Neela
Abstract summary: We present AIMM, an AI-driven framework that fuses Reddit activity, bot and coordination indicators, and market features into a daily AIMM Manipulation Risk Score for each ticker. Due to Reddit API restrictions, we employ synthetic social features matching documented event characteristics. We analyze lead times and show that AIMM flagged GME 22 days before the January 2021 squeeze peak.
Score: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Market manipulation now routinely originates from coordinated social media campaigns, not isolated trades. Retail investors, regulators, and brokerages need tools that connect online narratives and coordination patterns to market behavior. We present AIMM, an AI-driven framework that fuses Reddit activity, bot and coordination indicators, and OHLCV market features into a daily AIMM Manipulation Risk Score for each ticker. The system uses a parquet-native pipeline with a Streamlit dashboard that allows analysts to explore suspicious windows, inspect underlying posts and price action, and log model outputs over time. Due to Reddit API restrictions, we employ calibrated synthetic social features matching documented event characteristics; market data (OHLCV) uses real historical data from Yahoo Finance. This release makes three contributions. First, we build the AIMM Ground Truth dataset (AIMM-GT): 33 labeled ticker-days spanning eight equities, drawing from SEC enforcement actions, community-verified manipulation cases, and matched normal controls. Second, we implement forward-walk evaluation and prospective prediction logging for both retrospective and deployment-style assessment. Third, we analyze lead times and show that AIMM flagged GME 22 days before the January 2021 squeeze peak. The current labeled set is small (33 ticker-days, 3 positive events), but results show preliminary discriminative capability and early warnings for the GME incident. We release the code, dataset schema, and dashboard design to support research on social media-driven market surveillance.

Related papers

PredictionMarketBench: A SWE-bench-Style Framework for Backtesting Trading Agents on Prediction Markets [0.0]
PredictionMarketBench is a SWE-bench-style benchmark for evaluating algorithmic and LLM-based trading agents on prediction markets. PredictionMarketBench standardizes (i) episode construction from raw exchange streams (orderbooks, trades, lifecycle, settlement), (ii) an execution-realistic simulator with maker/taker semantics and fee modeling, and (iii) a tool-based agent interface. We release four Kalshi-based episodes spanning cryptocurrency, weather, and sports. Baseline results show that naive trading agents can underperform due to transaction costs and settlement losses, while fee-aware algorithmic strategies remain competitive in volatile episodes.
arXiv Detail & Related papers (2026-01-28T06:41:12Z)
When Agents Trade: Live Multi-Market Trading Benchmark for LLM Agents [74.55061622246824]
Agent Market Arena (AMA) is the first lifelong, real-time benchmark for evaluating Large Language Model (LLM)-based trading agents. AMA integrates verified trading data, expert-checked news, and diverse agent architectures within a unified trading framework. It evaluates agents across GPT-4o, GPT-4.1, Claude-3.5-haiku, Claude-sonnet-4, and Gemini-2.0-flash.
arXiv Detail & Related papers (2025-10-13T17:54:09Z)
Hide-and-Shill: A Reinforcement Learning Framework for Market Manipulation Detection in Symphony-a Decentralized Multi-Agent System [7.392937244789759]
Decentralized finance (DeFi) has introduced a new era of permissionless financial innovation but also led to unprecedented market manipulation. We propose a Multi-Agent Reinforcement Learning framework for decentralized manipulation detection, modeling the interaction between manipulators and detectors as a dynamic adversarial game. This framework identifies suspicious patterns using delayed token price reactions as financial indicators.
arXiv Detail & Related papers (2025-07-12T07:55:40Z)
FinTSB: A Comprehensive and Practical Benchmark for Financial Time Series Forecasting [58.70072722290475]
Financial time series (FinTS) record the behavior of human-brain-augmented decision-making. FinTSB is a comprehensive and practical benchmark for financial time series forecasting.
arXiv Detail & Related papers (2025-02-26T05:19:16Z)
Labeled Datasets for Research on Information Operations [71.34999856621306]
We present new labeled datasets about 26 campaigns, which contain both IO posts verified by a social media platform and over 13M posts by 303k accounts that discussed similar topics in the same time frames (control data). The datasets will facilitate the study of narratives, network interactions, and engagement strategies employed by coordinated accounts across various campaigns and countries.
arXiv Detail & Related papers (2024-11-15T22:15:01Z)
When AI Meets Finance (StockAgent): Large Language Model-based Stock Trading in Simulated Real-world Environments [55.19252983108372]
We have developed a multi-agent AI system called StockAgent, driven by LLMs. The StockAgent allows users to evaluate the impact of different external factors on investor trading. It avoids the test set leakage issue present in existing trading simulation systems based on AI Agents.
arXiv Detail & Related papers (2024-07-15T06:49:30Z)
ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media [74.93847489218008]
We present a novel task, identifying manipulation of news on social media, which aims to detect manipulation in social media posts and identify manipulated or inserted information. To study this task, we have proposed a data collection schema and curated a dataset called ManiTweet, consisting of 3.6K pairs of tweets and corresponding articles. Our analysis demonstrates that this task is highly challenging, with large language models (LLMs) yielding unsatisfactory performance.
arXiv Detail & Related papers (2023-05-23T16:40:07Z)
Opinion Market Model: Stemming Far-Right Opinion Spread using Positive Interventions [4.635820333232681]
We introduce a two-tier online opinion ecosystem model that considers both inter-opinion interactions and the role of positive interventions. We test OMM on two learning tasks, applying to two real-world datasets to predict attention market shares and uncover latent relationships between online items. OMM outperforms the state-of-the-art predictive models on both datasets and captures latent cooperation-competition relations.
arXiv Detail & Related papers (2022-08-13T10:36:04Z)
Dual-CLVSA: a Novel Deep Learning Approach to Predict Financial Markets with Sentiment Measurements [11.97251638872227]
We propose a novel deep learning approach, named dual-CLVSA, to predict financial market movement with both trading data and the corresponding social sentiment measurements, each through a separate sequence-to-sequence channel. The experiment results show that dual-CLVSA can effectively fuse the two types of data, and verify that sentiment measurements are not only informative for financial market predictions, but they also contain extra profitable features to boost the performance of our predicting system.
arXiv Detail & Related papers (2022-01-27T20:32:46Z)
Trade When Opportunity Comes: Price Movement Forecasting via Locality-Aware Attention and Iterative Refinement Labeling [11.430440350359993]
We propose LARA, a novel price movement forecasting framework with two main components. LA-Attention extracts potentially profitable samples through a masked attention scheme. RA-Labeling refines the noisy labels of potentially profitable samples. LARA significantly outperforms several machine-learning-based methods on the Qlib quantitative investment platform.
arXiv Detail & Related papers (2021-07-26T05:52:42Z)
Taking Over the Stock Market: Adversarial Perturbations Against Algorithmic Traders [47.32228513808444]
We present a realistic scenario in which an attacker influences algorithmic trading systems by using adversarial learning techniques. We show that when added to the input stream, our perturbation can fool the trading algorithms at future unseen data points.
arXiv Detail & Related papers (2020-10-19T06:28:05Z)

This list is automatically generated from the titles and abstracts of the papers on this site.