Xtreaming: an incremental multidimensional projection technique and its
application to streaming data
- URL: http://arxiv.org/abs/2003.09017v1
- Date: Sun, 8 Mar 2020 04:53:16 GMT
- Title: Xtreaming: an incremental multidimensional projection technique and its
application to streaming data
- Authors: T\'acito T. A. T. Neves, Rafael M. Martins, Danilo B. Coimbra,
Kostiantyn Kucher, Andreas Kerren, Fernando V. Paulovich
- Abstract summary: Xtreaming is a novel incremental projection technique that continuously updates the visual representation to reflect new emerging structures or patterns without visiting the multidimensional data more than once.
Our tests show that Xtreaming is competitive in terms of global distance preservation if compared to other streaming and incremental techniques.
- Score: 58.92615359254597
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Streaming data applications are becoming more common due to the ability of
different information sources to continuously capture or produce data, such as
sensors and social media. Despite recent advances, most visualization
approaches, in particular, multidimensional projection or dimensionality
reduction techniques, cannot be directly applied in such scenarios due to the
transient nature of streaming data. Currently, only a few methods address this
limitation using online or incremental strategies, continuously processing
data, and updating the visualization. Despite their relative success, most of
them impose the need for storing and accessing the data multiple times, not
being appropriate for streaming where data continuously grow. Others do not
impose such requirements but are not capable of updating the position of the
data already projected, potentially resulting in visual artifacts. In this
paper, we present Xtreaming, a novel incremental projection technique that
continuously updates the visual representation to reflect new emerging
structures or patterns without visiting the multidimensional data more than
once. Our tests show that Xtreaming is competitive in terms of global distance
preservation if compared to other streaming and incremental techniques, but it
is orders of magnitude faster. To the best of our knowledge, it is the first
methodology that is capable of evolving a projection to faithfully represent
new emerging structures without the need to store all data, providing reliable
results for efficiently and effectively projecting streaming data.
Related papers
- High-Dimensional Distributed Sparse Classification with Scalable Communication-Efficient Global Updates [50.406127962933915]
We develop solutions to problems which enable us to learn a communication-efficient distributed logistic regression model.
In our experiments we demonstrate a large improvement in accuracy over distributed algorithms with only a few distributed update steps needed.
arXiv Detail & Related papers (2024-07-08T19:34:39Z) - Iterative Forgetting: Online Data Stream Regression Using Database-Inspired Adaptive Granulation [1.6874375111244329]
We present a database-inspired datastream regression model that uses inspiration from R*-trees to create granules from incoming datastreams.
Experiments demonstrate that the ability of this method to discard data produces a significant order-of-magnitude improvement in latency and training time.
arXiv Detail & Related papers (2024-03-14T17:26:00Z) - Segue: Side-information Guided Generative Unlearnable Examples for
Facial Privacy Protection in Real World [64.4289385463226]
We propose Segue: Side-information guided generative unlearnable examples.
To improve transferability, we introduce side information such as true labels and pseudo labels.
It can resist JPEG compression, adversarial training, and some standard data augmentations.
arXiv Detail & Related papers (2023-10-24T06:22:37Z) - Fast Non-Rigid Radiance Fields from Monocularized Data [66.74229489512683]
This paper proposes a new method for full 360deg inward-facing novel view synthesis of non-rigidly deforming scenes.
At the core of our method are 1) An efficient deformation module that decouples the processing of spatial and temporal information for accelerated training and inference; and 2) A static module representing the canonical scene as a fast hash-encoded neural radiance field.
In both cases, our method is significantly faster than previous methods, converging in less than 7 minutes and achieving real-time framerates at 1K resolution, while obtaining a higher visual accuracy for generated novel views.
arXiv Detail & Related papers (2022-12-02T18:51:10Z) - A Feature-space Multimodal Data Augmentation Technique for Text-video
Retrieval [16.548016892117083]
Text-video retrieval methods have received increased attention over the past few years.
Data augmentation techniques were introduced to increase the performance on unseen test examples.
We propose a multimodal data augmentation technique which works in the feature space and creates new videos and captions by mixing semantically similar samples.
arXiv Detail & Related papers (2022-08-03T14:05:20Z) - DC-BENCH: Dataset Condensation Benchmark [79.18718490863908]
This work provides the first large-scale standardized benchmark on dataset condensation.
It consists of a suite of evaluations to comprehensively reflect the generability and effectiveness of condensation methods.
The benchmark library is open-sourced to facilitate future research and application.
arXiv Detail & Related papers (2022-07-20T03:54:05Z) - Metric Based Few-Shot Graph Classification [18.785949422663233]
Few-shot learning allows employing modern deep learning models in scarce data regimes without waiving their effectiveness.
We show that a simple distance metric learning baseline with a state-of-the-art graph embedder allows to obtain competitive results on the task.
We also propose a MixUp-based online data augmentation technique acting in the latent space and show its effectiveness on the task.
arXiv Detail & Related papers (2022-06-08T06:29:46Z) - MemStream: Memory-Based Anomaly Detection in Multi-Aspect Streams with
Concept Drift [20.143379054091536]
We propose MemStream, a streaming multi-aspect anomaly detection framework.
We leverage the power of a denoising autoencoder to learn representations and a memory module to learn the dynamically changing trend in data.
Experimental results show the effectiveness of our approach compared to state-of-the-art streaming baselines.
arXiv Detail & Related papers (2021-06-07T17:54:57Z) - Adversarial Knowledge Transfer from Unlabeled Data [62.97253639100014]
We present a novel Adversarial Knowledge Transfer framework for transferring knowledge from internet-scale unlabeled data to improve the performance of a classifier.
An important novel aspect of our method is that the unlabeled source data can be of different classes from those of the labeled target data, and there is no need to define a separate pretext task.
arXiv Detail & Related papers (2020-08-13T08:04:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.