Penetrative AI: Making LLMs Comprehend the Physical World
- URL: http://arxiv.org/abs/2310.09605v3
- Date: Wed, 12 Jun 2024 13:52:36 GMT
- Title: Penetrative AI: Making LLMs Comprehend the Physical World
- Authors: Huatao Xu, Liying Han, Qirui Yang, Mo Li, Mani Srivastava
- Abstract summary: Large Language Models (LLMs) have demonstrated remarkable capabilities across a range of tasks.
This paper explores how LLMs can be extended to interact with and reason about the physical world through IoT sensors and actuators.
- Score: 3.0266193917041306
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent developments in Large Language Models (LLMs) have demonstrated their remarkable capabilities across a range of tasks. Questions, however, persist about the nature of LLMs and their potential to integrate common-sense human knowledge when performing tasks involving information about the real physical world. This paper delves into these questions by exploring how LLMs can be extended to interact with and reason about the physical world through IoT sensors and actuators, a concept that we term "Penetrative AI". The paper explores such an extension at two levels of LLMs' ability to penetrate into the physical world via the processing of sensory signals. Our preliminary findings indicate that LLMs, with ChatGPT being the representative example in our exploration, have considerable and unique proficiency in employing their embedded world knowledge to interpret IoT sensor data and reason about tasks in the physical realm. Not only does this open up new applications for LLMs beyond traditional text-based tasks, but it also enables new ways of incorporating human knowledge in cyber-physical systems.
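The abstract's core idea, processing sensory signals with an LLM, hinges on serializing raw readings into text the model can reason over. The sketch below is a minimal illustration of that step, not the authors' actual pipeline: the function name, prompt wording, and sampling rate are all illustrative assumptions, and the resulting prompt would be sent to an LLM such as ChatGPT.

```python
def build_sensor_prompt(readings, question, sample_period_s=0.1):
    """Serialize (x, y, z) accelerometer samples into a plain-text LLM prompt.

    readings: list of (x, y, z) tuples in m/s^2, sampled at a fixed rate.
    question: the physical-world task posed to the model.
    """
    header = "You are analyzing smartphone accelerometer data (m/s^2)."
    # One line per sample, timestamped from the assumed sampling period.
    body = "\n".join(
        f"t={i * sample_period_s:.1f}s: x={x:.2f} y={y:.2f} z={z:.2f}"
        for i, (x, y, z) in enumerate(readings)
    )
    return f"{header}\nReadings:\n{body}\nQuestion: {question}"

prompt = build_sensor_prompt(
    [(0.1, 9.8, 0.2), (0.3, 9.7, 0.1)],
    "Is the user walking or stationary?",
)
```

In this framing, the LLM's embedded world knowledge (e.g., that ~9.8 m/s^2 on one axis indicates gravity on a stationary device) does the interpretation that would otherwise require a task-specific signal-processing model.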
Related papers
- IoT-LLM: Enhancing Real-World IoT Task Reasoning with Large Language Models [15.779982408779945]
Large Language Models (LLMs) have demonstrated remarkable capabilities across textual and visual domains, but often generate outputs that violate physical laws.
Inspired by human cognition, we explore augmenting LLMs with enhanced perception abilities using Internet of Things (IoT) sensor data and pertinent knowledge for IoT task reasoning in the physical world.
We show that IoT-LLM significantly enhances the performance of LLMs on IoT task reasoning, achieving an average improvement of 65% across various tasks over previous methods.
arXiv Detail & Related papers (2024-10-03T12:24:18Z) - A Roadmap for Embodied and Social Grounding in LLMs [43.74009805483536]
The fusion of Large Language Models and robotic systems has led to a transformative paradigm in the robotic field.
The grounding of LLMs knowledge into the empirical world has been considered a crucial pathway to exploit the efficiency of LLMs in robotics.
Taking inspiration from humans, this work draws attention to three necessary elements for an agent to grasp and experience the world.
arXiv Detail & Related papers (2024-09-25T13:09:23Z) - A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks [74.52259252807191]
Multimodal Large Language Models (MLLMs) address the complexities of real-world applications far beyond the capabilities of single-modality systems.
This paper systematically surveys the applications of MLLMs in multimodal tasks spanning natural language, vision, and audio.
arXiv Detail & Related papers (2024-08-02T15:14:53Z) - A Reality check of the benefits of LLM in business [1.9181612035055007]
Large language models (LLMs) have achieved remarkable performance in language understanding and generation tasks.
This paper thoroughly examines the usefulness and readiness of LLMs for business processes.
arXiv Detail & Related papers (2024-06-09T02:36:00Z) - Generative AI-in-the-loop: Integrating LLMs and GPTs into the Next Generation Networks [11.509880721677156]
Large language models (LLMs) have recently emerged, demonstrating near-human-level performance in cognitive tasks.
We propose the concept of "generative AI-in-the-loop"
We believe that combining LLMs and ML models allows both to leverage their respective capabilities and achieve better results than either model alone.
arXiv Detail & Related papers (2024-06-06T17:25:07Z) - ChatGPT Alternative Solutions: Large Language Models Survey [0.0]
Large Language Models (LLMs) have ignited a surge in research contributions within this domain.
Recent years have witnessed a dynamic synergy between academia and industry, propelling the field of LLM research to new heights.
This survey furnishes a well-rounded perspective on the current state of generative AI, shedding light on opportunities for further exploration, enhancement, and innovation.
arXiv Detail & Related papers (2024-03-21T15:16:50Z) - Rethinking Interpretability in the Era of Large Language Models [76.1947554386879]
Large language models (LLMs) have demonstrated remarkable capabilities across a wide array of tasks.
The capability to explain in natural language allows LLMs to expand the scale and complexity of patterns that can be explained to a human.
These new capabilities raise new challenges, such as hallucinated explanations and immense computational costs.
arXiv Detail & Related papers (2024-01-30T17:38:54Z) - Insights into Classifying and Mitigating LLMs' Hallucinations [48.04565928175536]
This paper delves into the underlying causes of AI hallucination and elucidates its significance in artificial intelligence.
We explore potential strategies to mitigate hallucinations, aiming to enhance the overall reliability of large language models.
arXiv Detail & Related papers (2023-11-14T12:30:28Z) - ExpeL: LLM Agents Are Experiential Learners [60.54312035818746]
We introduce the Experiential Learning (ExpeL) agent to allow learning from agent experiences without requiring parametric updates.
Our agent autonomously gathers experiences and extracts knowledge using natural language from a collection of training tasks.
At inference, the agent recalls its extracted insights and past experiences to make informed decisions.
arXiv Detail & Related papers (2023-08-20T03:03:34Z) - Brain in a Vat: On Missing Pieces Towards Artificial General Intelligence in Large Language Models [83.63242931107638]
We propose four characteristics of generally intelligent agents.
We argue that active engagement with objects in the real world delivers more robust signals for forming conceptual representations.
We conclude by outlining promising future research directions in the field of artificial general intelligence.
arXiv Detail & Related papers (2023-07-07T13:58:16Z) - Inner Monologue: Embodied Reasoning through Planning with Language Models [81.07216635735571]
Large Language Models (LLMs) can be applied to domains beyond natural language processing.
LLMs planning in embodied environments must consider not only which skills to perform, but also how and when to perform them.
We propose that by leveraging environment feedback, LLMs are able to form an inner monologue that allows them to more richly process and plan in robotic control scenarios.
arXiv Detail & Related papers (2022-07-12T15:20:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.