Related papers: ChatIoT: Large Language Model-based Security Assistant for Internet of Things with Retrieval-Augmented Generation

ChatIoT: Large Language Model-based Security Assistant for Internet of Things with Retrieval-Augmented Generation

URL: http://arxiv.org/abs/2502.09896v1
Date: Fri, 14 Feb 2025 04:00:18 GMT
Title: ChatIoT: Large Language Model-based Security Assistant for Internet of Things with Retrieval-Augmented Generation
Authors: Ye Dong, Yan Lin Aung, Sudipta Chattopadhyay, Jianying Zhou,
Abstract summary: ChatIoT is a large language model (LLM)-based IoT security assistant designed to disseminate IoT security and threat intelligence.<n>We develop an end-to-end data processing toolkit to handle heterogeneous datasets.
Score: 6.39666247062118
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Internet of Things (IoT) has gained widespread popularity, revolutionizing industries and daily life. However, it has also emerged as a prime target for attacks. Numerous efforts have been made to improve IoT security, and substantial IoT security and threat information, such as datasets and reports, have been developed. However, existing research often falls short in leveraging these insights to assist or guide users in harnessing IoT security practices in a clear and actionable way. In this paper, we propose ChatIoT, a large language model (LLM)-based IoT security assistant designed to disseminate IoT security and threat intelligence. By leveraging the versatile property of retrieval-augmented generation (RAG), ChatIoT successfully integrates the advanced language understanding and reasoning capabilities of LLM with fast-evolving IoT security information. Moreover, we develop an end-to-end data processing toolkit to handle heterogeneous datasets. This toolkit converts datasets of various formats into retrievable documents and optimizes chunking strategies for efficient retrieval. Additionally, we define a set of common use case specifications to guide the LLM in generating answers aligned with users' specific needs and expertise levels. Finally, we implement a prototype of ChatIoT and conduct extensive experiments with different LLMs, such as LLaMA3, LLaMA3.1, and GPT-4o. Experimental evaluations demonstrate that ChatIoT can generate more reliable, relevant, and technical in-depth answers for most use cases. When evaluating the answers with LLaMA3:70B, ChatIoT improves the above metrics by over 10% on average, particularly in relevance and technicality, compared to using LLMs alone.

Related papers

Leveraging Machine Learning Techniques in Intrusion Detection Systems for Internet of Things [11.185300073739098]
Traditional Intrusion Detection Systems (IDS) often fall short in managing the dynamic and large-scale nature of IoT networks. This paper explores how Machine Learning (ML) and Deep Learning (DL) techniques can significantly enhance IDS performance in IoT environments.
arXiv Detail & Related papers (2025-04-09T18:52:15Z)
Agentic Search Engine for Real-Time IoT Data [1.9275428660922078]
The Internet of Things (IoT) has enabled diverse devices to communicate over the Internet, yet the fragmentation of IoT systems limits seamless data sharing and coordinated management. This paper presents the IoT Agentic Search Engine (IoT-ASE), a real-time search engine tailored for IoT environments.
arXiv Detail & Related papers (2025-03-15T20:46:17Z)
GPIoT: Tailoring Small Language Models for IoT Program Synthesis and Development [15.109121724888382]
GPIoT is a code generation system for IoT applications by fine-tuning locally deployable Small Language Models (SLMs) We propose GPIoT, a code generation system for IoT applications by fine-tuning locally deployable Small Language Models (SLMs) on IoT-specialized datasets.
arXiv Detail & Related papers (2025-03-02T01:55:40Z)
IoT-LLM: Enhancing Real-World IoT Task Reasoning with Large Language Models [15.779982408779945]
Large Language Models (LLMs) have demonstrated remarkable capabilities across textual and visual domains, but often generate outputs that violate physical laws. Inspired by human cognition, we explore augmenting LLMs with enhanced perception abilities using Internet of Things (IoT) sensor data and pertinent knowledge for IoT task reasoning in the physical world. We show that IoT-LLM significantly enhances the performance of IoT tasks reasoning of LLM, achieving an average improvement of 65% across various tasks against previous methods.
arXiv Detail & Related papers (2024-10-03T12:24:18Z)
IoT-LM: Large Multisensory Language Models for the Internet of Things [70.74131118309967]
IoT ecosystem provides rich source of real-world modalities such as motion, thermal, geolocation, imaging, depth, sensors, and audio. Machine learning presents a rich opportunity to automatically process IoT data at scale. We introduce IoT-LM, an open-source large multisensory language model tailored for the IoT ecosystem.
arXiv Detail & Related papers (2024-07-13T08:20:37Z)
OVEL: Large Language Model as Memory Manager for Online Video Entity Linking [57.70595589893391]
We propose a task called Online Video Entity Linking OVEL, aiming to establish connections between mentions in online videos and a knowledge base with high accuracy and timeliness. To effectively handle OVEL task, we leverage a memory block managed by a Large Language Model and retrieve entity candidates from the knowledge base to augment LLM performance on memory management.
arXiv Detail & Related papers (2024-03-03T06:47:51Z)
SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models [107.82336341926134]
SALAD-Bench is a safety benchmark specifically designed for evaluating Large Language Models (LLMs) It transcends conventional benchmarks through its large scale, rich diversity, intricate taxonomy spanning three levels, and versatile functionalities.
arXiv Detail & Related papers (2024-02-07T17:33:54Z)
Unraveling Attacks in Machine Learning-based IoT Ecosystems: A Survey and the Open Libraries Behind Them [9.55194238764852]
The Internet of Things (IoT) has brought forth an era of unprecedented connectivity, with an estimated 80 billion smart devices expected to be in operation by the end of 2025. Machine Learning (ML) serves as a crucial technology, not only for analyzing IoT-generated data but also for diverse applications within the IoT ecosystem. This paper embarks on a comprehensive exploration of the security threats arising from ML's integration into various facets of IoT.
arXiv Detail & Related papers (2024-01-22T06:52:35Z)
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models [56.25156596019168]
This paper introduces the LMRL-Gym benchmark for evaluating multi-turn RL for large language models (LLMs) Our benchmark consists of 8 different language tasks, which require multiple rounds of language interaction and cover a range of tasks in open-ended dialogue and text games.
arXiv Detail & Related papers (2023-11-30T03:59:31Z)
A Survey on Detection of LLMs-Generated Content [97.87912800179531]
The ability to detect LLMs-generated content has become of paramount importance. We aim to provide a detailed overview of existing detection strategies and benchmarks. We also posit the necessity for a multi-faceted approach to defend against various attacks.
arXiv Detail & Related papers (2023-10-24T09:10:26Z)
Harris Hawks Feature Selection in Distributed Machine Learning for Secure IoT Environments [8.690178186919635]
Internet of Things (IoT) applications can collect and transfer sensitive data. It is necessary to develop new methods to detect hacked IoT devices. This paper proposes a Feature Selection (FS) model based on Harris Hawks Optimization (HHO) and Random Weight Network (RWN) to detect IoT botnet attacks.
arXiv Detail & Related papers (2023-02-20T09:38:12Z)
The Internet of Senses: Building on Semantic Communications and Edge Intelligence [67.75406096878321]
The Internet of Senses (IoS) holds the promise of flawless telepresence-style communication for all human receptors' We elaborate on how the emerging semantic communications and Artificial Intelligence (AI)/Machine Learning (ML) paradigms may satisfy the requirements of IoS use cases.
arXiv Detail & Related papers (2022-12-21T03:37:38Z)

This list is automatically generated from the titles and abstracts of the papers in this site.