Dynamic Spatio-Temporal Summarization using Information Based Fusion
- URL: http://arxiv.org/abs/2310.01617v1
- Date: Mon, 2 Oct 2023 20:21:43 GMT
- Title: Dynamic Spatio-Temporal Summarization using Information Based Fusion
- Authors: Humayra Tasnim, Soumya Dutta, Melanie Moses
- Abstract summary: We propose a dynamic-temporal data summarization technique that identifies informative features in key timesteps and fuses less informative ones.
Unlike existing methods, our method retains both raw and summarized timesteps, ensuring a comprehensive view of information changes over time.
We demonstrate the versatility of our technique across diverse datasets, encompassing particle-based flow simulations, security and surveillance applications, and biological cell interactions within the immune system.
- Score: 3.038642416291856
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In the era of burgeoning data generation, managing and storing large-scale
time-varying datasets poses significant challenges. With the rise of
supercomputing capabilities, the volume of data produced has soared,
intensifying storage and I/O overheads. To address this issue, we propose a
dynamic spatio-temporal data summarization technique that identifies
informative features in key timesteps and fuses less informative ones. This
approach minimizes storage requirements while preserving data dynamics. Unlike
existing methods, our method retains both raw and summarized timesteps,
ensuring a comprehensive view of information changes over time. We utilize
information-theoretic measures to guide the fusion process, resulting in a
visual representation that captures essential data patterns. We demonstrate the
versatility of our technique across diverse datasets, encompassing
particle-based flow simulations, security and surveillance applications, and
biological cell interactions within the immune system. Our research
significantly contributes to the realm of data management, introducing enhanced
efficiency and deeper insights across diverse multidisciplinary domains. We
provide a streamlined approach for handling massive datasets that can be
applied to in situ analysis as well as post hoc analysis. This not only
addresses the escalating challenges of data storage and I/O overheads but also
unlocks the potential for informed decision-making. Our method empowers
researchers and experts to explore essential temporal dynamics while minimizing
storage requirements, thereby fostering a more effective and intuitive
understanding of complex data behaviors.
Related papers
- Earth System Data Cubes: Avenues for advancing Earth system research [4.408949931570938]
Earth System Data Cubes ( ESDCs) have emerged as one suitable solution for transforming this flood of data into a simple yet robust format.
ESDCs achieve this by organising data into an analysis-ready format with atemporal grid.
There exist barriers to realising the full potential of data in light of novel cloud-based technologies.
arXiv Detail & Related papers (2024-08-05T09:50:16Z) - Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain Agnostic Framework for Data-Driven Scientific Research [90.91438597133211]
We introduce WarpSci, a framework designed to overcome crucial system bottlenecks in the application of reinforcement learning.
We eliminate the need for data transfer between the CPU and GPU, enabling the concurrent execution of thousands of simulations.
arXiv Detail & Related papers (2024-08-01T21:38:09Z) - A Simple Background Augmentation Method for Object Detection with Diffusion Model [53.32935683257045]
In computer vision, it is well-known that a lack of data diversity will impair model performance.
We propose a simple yet effective data augmentation approach by leveraging advancements in generative models.
Background augmentation, in particular, significantly improves the models' robustness and generalization capabilities.
arXiv Detail & Related papers (2024-08-01T07:40:00Z) - DISCOVER: A Data-driven Interactive System for Comprehensive Observation, Visualization, and ExploRation of Human Behaviour [6.716560115378451]
We introduce a modular, flexible, yet user-friendly software framework specifically developed to streamline computational-driven data exploration for human behavior analysis.
Our primary objective is to democratize access to advanced computational methodologies, thereby enabling researchers across disciplines to engage in detailed behavioral analysis without the need for extensive technical proficiency.
arXiv Detail & Related papers (2024-07-18T11:28:52Z) - MaSS: Multi-attribute Selective Suppression for Utility-preserving Data Transformation from an Information-theoretic Perspective [10.009178591853058]
We propose a formal information-theoretic definition for this utility-preserving privacy protection problem.
We design a data-driven learnable data transformation framework that is capable of suppressing sensitive attributes from target datasets.
Results demonstrate the effectiveness and generalizability of our method under various configurations.
arXiv Detail & Related papers (2024-05-23T18:35:46Z) - CUDC: A Curiosity-Driven Unsupervised Data Collection Method with
Adaptive Temporal Distances for Offline Reinforcement Learning [62.58375643251612]
We propose a Curiosity-driven Unsupervised Data Collection (CUDC) method to expand feature space using adaptive temporal distances for task-agnostic data collection.
With this adaptive reachability mechanism in place, the feature representation can be diversified, and the agent can navigate itself to collect higher-quality data with curiosity.
Empirically, CUDC surpasses existing unsupervised methods in efficiency and learning performance in various downstream offline RL tasks of the DeepMind control suite.
arXiv Detail & Related papers (2023-12-19T14:26:23Z) - Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement
Learning [53.00683059396803]
Mask image model (MIM) has been widely used due to its simplicity and effectiveness in recovering original information from masked images.
We propose a decision-based MIM that utilizes reinforcement learning (RL) to automatically search for optimal image masking ratio and masking strategy.
Our approach has a significant advantage over alternative self-supervised methods on the task of neuron segmentation.
arXiv Detail & Related papers (2023-10-06T10:40:46Z) - Building Flexible, Scalable, and Machine Learning-ready Multimodal
Oncology Datasets [17.774341783844026]
This work proposes Multimodal Integration of Oncology Data System (MINDS)
MINDS is a flexible, scalable, and cost-effective metadata framework for efficiently fusing disparate data from public sources.
By harmonizing multimodal data, MINDS aims to potentially empower researchers with greater analytical ability.
arXiv Detail & Related papers (2023-09-30T15:44:39Z) - iSAGE: An Incremental Version of SAGE for Online Explanation on Data
Streams [8.49072000414555]
iSAGE is a time- and memory-efficient incrementalization of SAGE.
We show that iSAGE adheres to similar theoretical properties as SAGE.
arXiv Detail & Related papers (2023-03-02T11:51:54Z) - A Comprehensive Survey of Dataset Distillation [73.15482472726555]
It has become challenging to handle the unlimited growth of data with limited computing power.
Deep learning technology has developed unprecedentedly in the last decade.
This paper provides a holistic understanding of dataset distillation from multiple aspects.
arXiv Detail & Related papers (2023-01-13T15:11:38Z) - Cluster-level pseudo-labelling for source-free cross-domain facial
expression recognition [94.56304526014875]
We propose the first Source-Free Unsupervised Domain Adaptation (SFUDA) method for Facial Expression Recognition (FER)
Our method exploits self-supervised pretraining to learn good feature representations from the target data.
We validate the effectiveness of our method in four adaptation setups, proving that it consistently outperforms existing SFUDA methods when applied to FER.
arXiv Detail & Related papers (2022-10-11T08:24:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.