Lightweight Modeling of User Context Combining Physical and Virtual
Sensor Data
- URL: http://arxiv.org/abs/2306.16029v1
- Date: Wed, 28 Jun 2023 08:57:01 GMT
- Title: Lightweight Modeling of User Context Combining Physical and Virtual
Sensor Data
- Authors: Mattia Giovanni Campana, Dimitris Chatzopoulos, Franca Delmastro, Pan
Hui
- Abstract summary: We present a framework to collect datasets containing heterogeneous sensing data from personal mobile devices.
We propose a lightweight approach to model the user context able to efficiently perform the entire reasoning process.
We achieve a 10x speed up and a feature reduction of more than 90% while keeping the accuracy loss less than 3%.
- Score: 15.800978541993706
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The multitude of data generated by sensors available on users' mobile
devices, combined with advances in machine learning techniques, support
context-aware services in recognizing the current situation of a user (i.e.,
physical context) and optimizing the system's personalization features.
However, context-awareness performances mainly depend on the accuracy of the
context inference process, which is strictly tied to the availability of
large-scale and labeled datasets. In this work, we present a framework
developed to collect datasets containing heterogeneous sensing data derived
from personal mobile devices. The framework has been used by 3 voluntary users
for two weeks, generating a dataset with more than 36K samples and 1331
features. We also propose a lightweight approach to model the user context able
to efficiently perform the entire reasoning process on the user mobile device.
To this aim, we used six dimensionality reduction techniques in order to
optimize the context classification. Experimental results on the generated
dataset show that we achieve a 10x speed up and a feature reduction of more
than 90% while keeping the accuracy loss less than 3%.
Related papers
- A CLIP-Powered Framework for Robust and Generalizable Data Selection [51.46695086779598]
Real-world datasets often contain redundant and noisy data, imposing a negative impact on training efficiency and model performance.
Data selection has shown promise in identifying the most representative samples from the entire dataset.
We propose a novel CLIP-powered data selection framework that leverages multimodal information for more robust and generalizable sample selection.
arXiv Detail & Related papers (2024-10-15T03:00:58Z) - ContextLabeler Dataset: physical and virtual sensors data collected from
smartphone usage in-the-wild [7.310043452300736]
This paper describes a data collection campaign and the resulting dataset derived from smartphone sensors.
The collected dataset represents a useful source of real data to both define and evaluate a broad set of novel context-aware solutions.
arXiv Detail & Related papers (2023-07-07T13:28:29Z) - MyDigitalFootprint: an extensive context dataset for pervasive computing
applications at the edge [7.310043452300736]
MyDigitalFootprint is a large-scale dataset comprising smartphone sensor data, physical proximity information, and Online Social Networks interactions.
It spans two months of measurements from 31 volunteer users in their natural environment, allowing for unrestricted behavior.
To demonstrate the dataset's effectiveness, we present three context-aware applications utilizing various machine learning tasks.
arXiv Detail & Related papers (2023-06-28T07:59:47Z) - On-device modeling of user's social context and familiar places from
smartphone-embedded sensor data [7.310043452300736]
This paper proposes an unsupervised and lightweight approach to model the user's social context and locations directly on the mobile device.
For the social context, the approach utilizes data on physical and cyber social interactions among users and their devices.
The effectiveness of the proposed approach is demonstrated through three sets of experiments, employing five real-world datasets.
arXiv Detail & Related papers (2023-06-27T12:53:14Z) - TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual
Environments [84.6017003787244]
This work proposes a synthetic data generation pipeline to address the difficulties and domain-gaps present in simulated datasets.
We show that using annotations and visual cues from existing datasets, we can facilitate automated multi-modal data generation.
arXiv Detail & Related papers (2022-08-16T20:46:08Z) - HyperImpute: Generalized Iterative Imputation with Automatic Model
Selection [77.86861638371926]
We propose a generalized iterative imputation framework for adaptively and automatically configuring column-wise models.
We provide a concrete implementation with out-of-the-box learners, simulators, and interfaces.
arXiv Detail & Related papers (2022-06-15T19:10:35Z) - On-device modeling of user's social context and familiar places from
smartphone-embedded sensor data [7.310043452300736]
We propose a novel, unsupervised and lightweight approach to model the user's social context and her locations.
We exploit data related to both physical and cyber social interactions among users and their devices.
We show the performance of 3 machine learning algorithms to recognize daily-life situations.
arXiv Detail & Related papers (2022-05-18T08:32:26Z) - Information-Theoretic Odometry Learning [83.36195426897768]
We propose a unified information theoretic framework for learning-motivated methods aimed at odometry estimation.
The proposed framework provides an elegant tool for performance evaluation and understanding in information-theoretic language.
arXiv Detail & Related papers (2022-03-11T02:37:35Z) - DAPPER: Label-Free Performance Estimation after Personalization for
Heterogeneous Mobile Sensing [95.18236298557721]
We present DAPPER (Domain AdaPtation Performance EstimatoR) that estimates the adaptation performance in a target domain with unlabeled target data.
Our evaluation with four real-world sensing datasets compared against six baselines shows that DAPPER outperforms the state-of-the-art baseline by 39.8% in estimation accuracy.
arXiv Detail & Related papers (2021-11-22T08:49:33Z) - SensiX: A Platform for Collaborative Machine Learning on the Edge [69.1412199244903]
We present SensiX, a personal edge platform that stays between sensor data and sensing models.
We demonstrate its efficacy in developing motion and audio-based multi-device sensing systems.
Our evaluation shows that SensiX offers a 7-13% increase in overall accuracy and up to 30% increase across different environment dynamics at the expense of 3mW power overhead.
arXiv Detail & Related papers (2020-12-04T23:06:56Z) - Improving the Performance of Fine-Grain Image Classifiers via Generative
Data Augmentation [0.5161531917413706]
We develop Data Augmentation from Proficient Pre-Training of Robust Generative Adrial Networks (DAPPER GAN)
DAPPER GAN is an ML analytics support tool that automatically generates novel views of training images.
We experimentally evaluate this technique on the Stanford Cars dataset, demonstrating improved vehicle make and model classification accuracy.
arXiv Detail & Related papers (2020-08-12T15:29:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.