CSI-Bench: A Large-Scale In-the-Wild Dataset for Multi-task WiFi Sensing
- URL: http://arxiv.org/abs/2505.21866v1
- Date: Wed, 28 May 2025 01:29:29 GMT
- Title: CSI-Bench: A Large-Scale In-the-Wild Dataset for Multi-task WiFi Sensing
- Authors: Guozhen Zhu, Yuqian Hu, Weihang Gao, Wei-Hsiang Wang, Beibei Wang, K. J. Ray Liu,
- Abstract summary: WiFi sensing has emerged as a compelling contactless modality for human activity monitoring by capturing fine-grained variations in Channel State Information (CSI)<n>Existing WiFi sensing systems struggle to generalize in real-world settings due to datasets collected in controlled environments with homogeneous hardware and fragmented, session-based recordings that fail to reflect daily activity.<n>We present CSI-Bench, a large-scale, in-the-wild benchmark dataset collected using commercial WiFi edge devices across 26 diverse indoor environments with 35 real users.
- Score: 13.709208651007167
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: WiFi sensing has emerged as a compelling contactless modality for human activity monitoring by capturing fine-grained variations in Channel State Information (CSI). Its ability to operate continuously and non-intrusively while preserving user privacy makes it particularly suitable for health monitoring. However, existing WiFi sensing systems struggle to generalize in real-world settings, largely due to datasets collected in controlled environments with homogeneous hardware and fragmented, session-based recordings that fail to reflect continuous daily activity. We present CSI-Bench, a large-scale, in-the-wild benchmark dataset collected using commercial WiFi edge devices across 26 diverse indoor environments with 35 real users. Spanning over 461 hours of effective data, CSI-Bench captures realistic signal variability under natural conditions. It includes task-specific datasets for fall detection, breathing monitoring, localization, and motion source recognition, as well as a co-labeled multitask dataset with joint annotations for user identity, activity, and proximity. To support the development of robust and generalizable models, CSI-Bench provides standardized evaluation splits and baseline results for both single-task and multi-task learning. CSI-Bench offers a foundation for scalable, privacy-preserving WiFi sensing systems in health and broader human-centric applications.
Related papers
- PeFAD: A Parameter-Efficient Federated Framework for Time Series Anomaly Detection [51.20479454379662]
We propose a.
Federated Anomaly Detection framework named PeFAD with the increasing privacy concerns.
We conduct extensive evaluations on four real datasets, where PeFAD outperforms existing state-of-the-art baselines by up to 28.74%.
arXiv Detail & Related papers (2024-06-04T13:51:08Z) - Cross-domain Learning Framework for Tracking Users in RIS-aided Multi-band ISAC Systems with Sparse Labeled Data [55.70071704247794]
Integrated sensing and communications (ISAC) is pivotal for 6G communications and is boosted by the rapid development of reconfigurable intelligent surfaces (RISs)
This paper proposes the X2Track framework, where we model the tracking function by a hierarchical architecture, jointly utilizing multi-modal CSI indicators across multiple bands, and optimize it in a cross-domain manner.
Under X2Track, we design an efficient deep learning algorithm to minimize tracking errors, based on transformer neural networks and adversarial learning techniques.
arXiv Detail & Related papers (2024-05-10T08:04:27Z) - Effective Intrusion Detection in Heterogeneous Internet-of-Things Networks via Ensemble Knowledge Distillation-based Federated Learning [52.6706505729803]
We introduce Federated Learning (FL) to collaboratively train a decentralized shared model of Intrusion Detection Systems (IDS)
FLEKD enables a more flexible aggregation method than conventional model fusion techniques.
Experiment results show that the proposed approach outperforms local training and traditional FL in terms of both speed and performance.
arXiv Detail & Related papers (2024-01-22T14:16:37Z) - Data Augmentation Techniques for Cross-Domain WiFi CSI-based Human
Activity Recognition [1.7404865362620803]
WiFi Channel State Information (CSI) enables contactless and visual privacy-preserving sensing in indoor environments.
Poor model generalization, due to varying environmental conditions and sensing hardware, is a well-known problem in this space.
Data augmentation techniques commonly used in image-based learning are applied to WiFi CSI to investigate their effects on model generalization performance.
arXiv Detail & Related papers (2024-01-01T22:27:59Z) - Attention-Enhanced Deep Learning for Device-Free Through-the-Wall
Presence Detection Using Indoor WiFi Systems [9.087163485833054]
We propose a novel system for human presence detection using the channel state information (CSI) of WiFi signals.
Our system named attention-enhanced deep learning for presence detection (ALPD) employs an attention mechanism to automatically select informative subcarriers from the CSI data.
We evaluate the proposed ALPD system by deploying a pair of WiFi access points (APs) for collecting CSI dataset, which is further compared with several benchmarks.
arXiv Detail & Related papers (2023-04-25T19:17:36Z) - Contactless Human Activity Recognition using Deep Learning with Flexible
and Scalable Software Define Radio [1.3106429146573144]
This study investigates the use of Wi-Fi channel state information (CSI) as a novel method of ambient sensing.
These methods avoid additional costly hardware required for vision-based systems, which are privacy-intrusive.
This study presents a Wi-Fi CSI-based HAR system that assesses and contrasts deep learning approaches.
arXiv Detail & Related papers (2023-04-18T10:20:14Z) - Exposing the CSI: A Systematic Investigation of CSI-based Wi-Fi Sensing
Capabilities and Limitations [16.819111460629397]
This work aims to shed light on the impact of Wi-Fi 6 features on the sensing performance and to create a benchmark for future research on Wi-Fi sensing.
We perform an extensive CSI data collection campaign involving 3 individuals, 3 environments, and 12 activities, using Wi-Fi 6 signals.
An anonymized ground truth obtained through video recording accompanies our 80-GB dataset, which contains almost two hours of CSI data from three collectors.
arXiv Detail & Related papers (2023-02-02T10:21:00Z) - WiFi-based Spatiotemporal Human Action Perception [53.41825941088989]
An end-to-end WiFi signal neural network (SNN) is proposed to enable WiFi-only sensing in both line-of-sight and non-line-of-sight scenarios.
Especially, the 3D convolution module is able to explore thetemporal continuity of WiFi signals, and the feature self-attention module can explicitly maintain dominant features.
arXiv Detail & Related papers (2022-06-20T16:03:45Z) - A Wireless-Vision Dataset for Privacy Preserving Human Activity
Recognition [53.41825941088989]
A new WiFi-based and video-based neural network (WiNN) is proposed to improve the robustness of activity recognition.
Our results show that WiVi data set satisfies the primary demand and all three branches in the proposed pipeline keep more than $80%$ of activity recognition accuracy.
arXiv Detail & Related papers (2022-05-24T10:49:11Z) - GraSens: A Gabor Residual Anti-aliasing Sensing Framework for Action
Recognition using WiFi [52.530330427538885]
WiFi-based human action recognition (HAR) has been regarded as a promising solution in applications such as smart living and remote monitoring.
We propose an end-to-end Gabor residual anti-aliasing sensing network (GraSens) to directly recognize the actions using the WiFi signals from the wireless devices in diverse scenarios.
arXiv Detail & Related papers (2022-05-24T10:20:16Z) - Using GAN to Enhance the Accuracy of Indoor Human Activity Recognition [0.9239657838690226]
We present a semi-supervised learning method for activity recognition systems in which long short-term memory (LSTM) is employed to learn features and recognize seven different actions.
Our experimental results confirm that this model can increase classification accuracy by 3.4% and reduce the Log loss by almost 16%.
arXiv Detail & Related papers (2020-04-23T15:22:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.