Datasheets for Machine Learning Sensors: Towards Transparency,
Auditability, and Responsibility for Intelligent Sensing
- URL: http://arxiv.org/abs/2306.08848v3
- Date: Sat, 17 Feb 2024 03:03:33 GMT
- Title: Datasheets for Machine Learning Sensors: Towards Transparency,
Auditability, and Responsibility for Intelligent Sensing
- Authors: Matthew Stewart, Pete Warden, Yasmine Omri, Shvetank Prakash, Joao
Santos, Shawn Hymel, Benjamin Brown, Jim MacArthur, Nat Jeffries, Sachin
Katti, Brian Plancher, Vijay Janapa Reddi
- Abstract summary: Machine learning (ML) sensors are enabling intelligence at the edge by empowering end-users with greater control over their data.
We introduce a standard template for these ML sensors and discuss and evaluate the design and motivation for each section of the dasheet.
To provide a case study of the application of our template, we also designed and developed two examples for ML sensors performing computer vision-based person detection.
- Score: 9.686781507805113
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Machine learning (ML) sensors are enabling intelligence at the edge by
empowering end-users with greater control over their data. ML sensors offer a
new paradigm for sensing that moves the processing and analysis to the device
itself rather than relying on the cloud, bringing benefits like lower latency
and greater data privacy. The rise of these intelligent edge devices, while
revolutionizing areas like the internet of things (IoT) and healthcare, also
throws open critical questions about privacy, security, and the opacity of AI
decision-making. As ML sensors become more pervasive, it requires judicious
governance regarding transparency, accountability, and fairness. To this end,
we introduce a standard datasheet template for these ML sensors and discuss and
evaluate the design and motivation for each section of the datasheet in detail
including: standard dasheet components like the system's hardware
specifications, IoT and AI components like the ML model and dataset attributes,
as well as novel components like end-to-end performance metrics, and expanded
environmental impact metrics. To provide a case study of the application of our
datasheet template, we also designed and developed two examples for ML sensors
performing computer vision-based person detection: one an open-source ML sensor
designed and developed in-house, and a second commercial ML sensor developed by
our industry collaborators. Together, ML sensors and their datasheets provide
greater privacy, security, transparency, explainability, auditability, and
user-friendliness for ML-enabled embedded systems. We conclude by emphasizing
the need for standardization of datasheets across the broader ML community to
ensure the responsible use of sensor data.
Related papers
- SensorBench: Benchmarking LLMs in Coding-Based Sensor Processing [6.8009140511761546]
Large Language Models (LLMs) have promising capabilities in processing sensory data, suggesting their potential as copilots for developing sensing systems.
We construct a comprehensive benchmark, SensorBench, to establish a quantifiable objective.
The results show that while LLMs exhibit considerable proficiency in simpler tasks, they face inherent challenges in processing compositional tasks.
arXiv Detail & Related papers (2024-10-14T17:21:39Z) - SensorLLM: Aligning Large Language Models with Motion Sensors for Human Activity Recognition [9.072495000412943]
We bridge the gap between wearable sensor technology and personalized AI assistants by enabling Large Language Models (LLMs) to understand time-series tasks like human activity recognition (HAR)
We introduce SensorLLM, a two-stage framework to unlock LLMs' potential for sensor data tasks.
We show that SensorLLM evolves into an effective sensor learner, reasoner, and learner, enabling it to generalize across diverse datasets for HAR tasks.
arXiv Detail & Related papers (2024-10-14T15:30:41Z) - LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models [55.903148392998965]
We introduce LOKI, a novel benchmark designed to evaluate the ability of LMMs to detect synthetic data across multiple modalities.
The benchmark includes coarse-grained judgment and multiple-choice questions, as well as fine-grained anomaly selection and explanation tasks.
We evaluate 22 open-source LMMs and 6 closed-source models on LOKI, highlighting their potential as synthetic data detectors and also revealing some limitations in the development of LMM capabilities.
arXiv Detail & Related papers (2024-10-13T05:26:36Z) - A Plug-in Tiny AI Module for Intelligent and Selective Sensor Data
Transmission [10.174575604689391]
We propose a novel sensing module to equip sensing frameworks with intelligent data transmission capabilities.
We integrate a highly efficient machine learning model placed near the sensor.
This model provides prompt feedback for the sensing system to transmit only valuable data while discarding irrelevant information.
arXiv Detail & Related papers (2024-02-03T05:41:39Z) - Federated Learning on Edge Sensing Devices: A Review [0.0]
Federated Learning (FL) is emerging as a solution to privacy, hardware, and connectivity limitations.
We focus on the key FL principles, software frameworks, and testbeds.
We also explore the current sensor technologies, properties of the sensing devices and sensing applications where FL is utilized.
arXiv Detail & Related papers (2023-11-02T12:55:26Z) - Vulnerability of Machine Learning Approaches Applied in IoT-based Smart Grid: A Review [51.31851488650698]
Machine learning (ML) sees an increasing prevalence of being used in the internet-of-things (IoT)-based smart grid.
adversarial distortion injected into the power signal will greatly affect the system's normal control and operation.
It is imperative to conduct vulnerability assessment for MLsgAPPs applied in the context of safety-critical power systems.
arXiv Detail & Related papers (2023-08-30T03:29:26Z) - Low-cost Efficient Wireless Intelligent Sensor (LEWIS) for Engineering,
Research, and Education [72.2614468437919]
The vision of smart cities equipped with sensors informing decisions has not been realized to date.
Civil engineers lack of knowledge in sensor technology.
The electrical components and computer knowledge associated with sensors are still a challenge for civil engineers.
arXiv Detail & Related papers (2023-03-23T21:49:26Z) - Semantic Information Marketing in The Metaverse: A Learning-Based
Contract Theory Framework [68.8725783112254]
We address the problem of designing incentive mechanisms by a virtual service provider (VSP) to hire sensing IoT devices to sell their sensing data.
Due to the limited bandwidth, we propose to use semantic extraction algorithms to reduce the delivered data by the sensing IoT devices.
We propose a novel iterative contract design and use a new variant of multi-agent reinforcement learning (MARL) to solve the modelled multi-dimensional contract problem.
arXiv Detail & Related papers (2023-02-22T15:52:37Z) - Machine Learning Sensors [4.263101392970408]
Machine learning sensors represent a paradigm shift for the future of embedded machine learning applications.
Current instantiations of embedded machine learning (ML) suffer from complex integration, lack of modularity, and privacy and security concerns.
This article proposes a more data-centric paradigm for embedding sensor intelligence on edge devices to combat these challenges.
arXiv Detail & Related papers (2022-06-07T13:22:13Z) - Bayesian Imitation Learning for End-to-End Mobile Manipulation [80.47771322489422]
Augmenting policies with additional sensor inputs, such as RGB + depth cameras, is a straightforward approach to improving robot perception capabilities.
We show that using the Variational Information Bottleneck to regularize convolutional neural networks improves generalization to held-out domains.
We demonstrate that our method is able to help close the sim-to-real gap and successfully fuse RGB and depth modalities.
arXiv Detail & Related papers (2022-02-15T17:38:30Z) - SensiX++: Bringing MLOPs and Multi-tenant Model Serving to Sensory Edge
Devices [69.1412199244903]
We present a multi-tenant runtime for adaptive model execution with integrated MLOps on edge devices, e.g., a camera, a microphone, or IoT sensors.
S SensiX++ operates on two fundamental principles - highly modular componentisation to externalise data operations with clear abstractions and document-centric manifestation for system-wide orchestration.
We report on the overall throughput and quantified benefits of various automation components of SensiX++ and demonstrate its efficacy to significantly reduce operational complexity and lower the effort to deploy, upgrade, reconfigure and serve embedded models on edge devices.
arXiv Detail & Related papers (2021-09-08T22:06:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.