RF-GPT: Teaching AI to See the Wireless World
- URL: http://arxiv.org/abs/2602.14833v1
- Date: Mon, 16 Feb 2026 15:24:56 GMT
- Title: RF-GPT: Teaching AI to See the Wireless World
- Authors: Hang Zou, Yu Tian, Bohao Wang, Lina Bariah, Samson Lasaulce, Chongwen Huang, Mérouane Debbah,
- Abstract summary: Large language models (LLMs) and multimodal models have become powerful general-purpose reasoning systems.<n>RF-GPT is a radio-frequency language model (RFLM) that utilizes the visual encoders of multimodals to process and understand RF spectrograms.<n>A text-only LLM then converts these captions into RF-grounded instruction-answer pairs, yielding roughly 12,000 RF scenes and 25 million examples.
- Score: 48.294819966466044
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Large language models (LLMs) and multimodal models have become powerful general-purpose reasoning systems. However, radio-frequency (RF) signals, which underpin wireless systems, are still not natively supported by these models. Existing LLM-based approaches for telecom focus mainly on text and structured data, while conventional RF deep-learning models are built separately for specific signal-processing tasks, highlighting a clear gap between RF perception and high-level reasoning. To bridge this gap, we introduce RF-GPT, a radio-frequency language model (RFLM) that utilizes the visual encoders of multimodal LLMs to process and understand RF spectrograms. In this framework, complex in-phase/quadrature (IQ) waveforms are mapped to time-frequency spectrograms and then passed to pretrained visual encoders. The resulting representations are injected as RF tokens into a decoder-only LLM, which generates RF-grounded answers, explanations, and structured outputs. To train RF-GPT, we perform supervised instruction fine-tuning of a pretrained multimodal LLM using a fully synthetic RF corpus. Standards-compliant waveform generators produce wideband scenes for six wireless technologies, from which we derive time-frequency spectrograms, exact configuration metadata, and dense captions. A text-only LLM then converts these captions into RF-grounded instruction-answer pairs, yielding roughly 12,000 RF scenes and 0.625 million instruction examples without any manual labeling. Across benchmarks for wideband modulation classification, overlap analysis, wireless-technology recognition, WLAN user counting, and 5G NR information extraction, RF-GPT achieves strong multi-task performance, whereas general-purpose VLMs with no RF grounding largely fail.
Related papers
- Radiance-Field Reinforced Pretraining: Scaling Localization Models with Unlabeled Wireless Signals [5.688789306551577]
Radio frequency (RF)-based indoor localization offers significant promise for applications such as indoor navigation, augmented reality, and pervasive computing.<n>We introduce Radiance-Field Reinforced Pretraining (RFRP)<n>This novel self-supervised pretraining framework couples a large localization model (LM) with a neural radio-frequency radiance field (RF-NeRF) in an asymmetrical autoencoder architecture.<n>We show that the RFRP-pretrained LM reduces localization error by over 40% compared to non-pretrained models and by 21% compared to those pretrained using supervised learning.
arXiv Detail & Related papers (2025-12-08T08:52:08Z) - Rasterizing Wireless Radiance Field via Deformable 2D Gaussian Splatting [10.200300617390013]
Modeling wireless radiance field (WRF) is fundamental to modern communication systems.<n>We propose SwiftWRF, a deformable 2D splatting framework that synthesizes WRF spectra at arbitrary positions.<n>Experiments on both real-world and synthetic indoor scenes demonstrate that SwiftWRF can reconstruct WRF up to 500x faster than existing state-of-the-art methods.
arXiv Detail & Related papers (2025-06-15T09:36:45Z) - RadioLLM: Introducing Large Language Model into Cognitive Radio via Hybrid Prompt and Token Reprogrammings [15.98684925275276]
Large Language Models (LLMs) offer new potential for advancing Cognitive Radio Technology (CRT)<n>We propose RadioLLM, a novel framework that integrates Hybrid Prompt and Token Reprogramming (HPTR) for combining radio signal features with expert knowledge, and a Frequency-Attuned Fusion (FAF) module for enhanced high-frequency feature modeling.
arXiv Detail & Related papers (2025-01-28T07:38:04Z) - RF Challenge: The Data-Driven Radio Frequency Signal Separation Challenge [66.33067693672696]
We address the critical problem of interference rejection in radio-frequency (RF) signals using a data-driven approach that leverages deep-learning methods.<n>A primary contribution of this paper is the introduction of the RF Challenge, which is a publicly available, diverse RF signal dataset.
arXiv Detail & Related papers (2024-09-13T13:53:41Z) - Beam Prediction based on Large Language Models [51.45077318268427]
We formulate the millimeter wave (mmWave) beam prediction problem as a time series forecasting task.<n>We transform historical observations into text-based representations using a trainable tokenizer.<n>Our method harnesses the power of LLMs to predict future optimal beams.
arXiv Detail & Related papers (2024-08-16T12:40:01Z) - R-SFLLM: Jamming Resilient Framework for Split Federated Learning with Large Language Models [65.04475956174959]
Split federated learning (SFL) is a compute-efficient paradigm in distributed machine learning (ML)<n>A significant challenge in SFL, particularly when deployed over wireless channels, is the susceptibility of transmitted model parameters to adversarial jamming.<n>This paper develops a physical layer framework for resilient SFL with large language models (LLMs) and vision language models (VLMs) over wireless networks.
arXiv Detail & Related papers (2024-07-16T12:21:29Z) - RF-Diffusion: Radio Signal Generation via Time-Frequency Diffusion [15.175370227353406]
We introduce a novel Time-Frequency Diffusion theory to enhance the original diffusion model, enabling it to tap into the information within the time, frequency, and complex-valued domains of RF signals.
RF-Diffusion is a versatile solution to generate diverse, high-quality, and time-series RF data.
We also showcase the versatility of RF-Diffusion in boosting Wi-Fi sensing systems and performing channel estimation in 5G networks.
arXiv Detail & Related papers (2024-04-14T04:56:05Z) - Frequency-Aware Deepfake Detection: Improving Generalizability through
Frequency Space Learning [81.98675881423131]
This research addresses the challenge of developing a universal deepfake detector that can effectively identify unseen deepfake images.
Existing frequency-based paradigms have relied on frequency-level artifacts introduced during the up-sampling in GAN pipelines to detect forgeries.
We introduce a novel frequency-aware approach called FreqNet, centered around frequency domain learning, specifically designed to enhance the generalizability of deepfake detectors.
arXiv Detail & Related papers (2024-03-12T01:28:00Z) - Deep-Learned Compression for Radio-Frequency Signal Classification [0.49109372384514843]
Next-generation cellular concepts rely on the processing of large quantities of radio-frequency (RF) samples.
We propose a deep learned compression model, HQARF, to compress the complex-valued samples of RF signals.
We are assessing the effects of HQARF on the performance of an AI model trained to infer the modulation class of the RF signal.
arXiv Detail & Related papers (2024-03-05T17:42:39Z) - One-shot Generative Distribution Matching for Augmented RF-based UAV Identification [0.0]
This work addresses the challenge of identifying Unmanned Aerial Vehicles (UAV) using radiofrequency (RF) fingerprinting in limited RF environments.
The complexity and variability of RF signals, influenced by environmental interference and hardware imperfections, often render traditional RF-based identification methods ineffective.
One-shot generative methods for augmenting transformed RF signals offer a significant improvement in UAV identification.
arXiv Detail & Related papers (2023-01-20T02:35:43Z) - Three-Way Deep Neural Network for Radio Frequency Map Generation and
Source Localization [67.93423427193055]
Monitoring wireless spectrum over spatial, temporal, and frequency domains will become a critical feature in beyond-5G and 6G communication technologies.
In this paper, we present a Generative Adversarial Network (GAN) machine learning model to interpolate irregularly distributed measurements across the spatial domain.
arXiv Detail & Related papers (2021-11-23T22:25:10Z) - Wavelet-Based Network For High Dynamic Range Imaging [64.66969585951207]
Existing methods, such as optical flow based and end-to-end deep learning based solutions, are error-prone either in detail restoration or ghosting artifacts removal.
In this work, we propose a novel frequency-guided end-to-end deep neural network (FNet) to conduct HDR fusion in the frequency domain, and Wavelet Transform (DWT) is used to decompose inputs into different frequency bands.
The low-frequency signals are used to avoid specific ghosting artifacts, while the high-frequency signals are used for preserving details.
arXiv Detail & Related papers (2021-08-03T12:26:33Z) - Spectro-Temporal RF Identification using Deep Learning [3.8137985834223507]
WRIST is a Wideband, Real-time RF Identification system with Spectro-Temporal detection, framework and system.
Our resulting deep learning model is capable to detect, classify, and precisely locate RF emissions in time and frequency.
WRIST detector achieves 90 mean Average Precision even in extremely congested environment in the wild.
arXiv Detail & Related papers (2021-07-11T19:02:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.