Sequential Data Generation with Groupwise Diffusion Process
- URL: http://arxiv.org/abs/2310.01400v1
- Date: Mon, 2 Oct 2023 17:58:47 GMT
- Title: Sequential Data Generation with Groupwise Diffusion Process
- Authors: Sangyun Lee, Gayoung Lee, Hyunsu Kim, Junho Kim, Youngjung Uh
- Abstract summary: Groupwise Diffusion Model (GDM) generates data sequentially from one group at one time interval.
As an extension of diffusion models, GDM generalizes certain forms of autoregressive models and cascaded diffusion models.
- Score: 24.664800903343462
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: We present the Groupwise Diffusion Model (GDM), which divides data into
multiple groups and diffuses one group at one time interval in the forward
diffusion process. GDM generates data sequentially from one group at one time
interval, leading to several interesting properties. First, as an extension of
diffusion models, GDM generalizes certain forms of autoregressive models and
cascaded diffusion models. As a unified framework, GDM allows us to investigate
design choices that have been overlooked in previous works, such as
data-grouping strategy and order of generation. Furthermore, since one group of
the initial noise affects only a certain group of the generated data, latent
space now possesses group-wise interpretable meaning. We can further extend GDM
to the frequency domain where the forward process sequentially diffuses each
group of frequency components. Dividing the frequency bands of the data as
groups allows the latent variables to become a hierarchical representation
where individual groups encode data at different levels of abstraction. We
demonstrate several applications of such representation including
disentanglement of semantic attributes, image editing, and generating
variations.
Related papers
- The Research of Group Re-identification from Multiple Cameras [0.4955551943523977]
Group re-identification is very challenging since it is not only interfered by view-point and human pose variations in the traditional re-identification tasks.
This paper introduces a novel approach which leverages the multi-granularity information inside groups to facilitate group re-identification.
arXiv Detail & Related papers (2024-07-19T18:28:13Z) - DGCformer: Deep Graph Clustering Transformer for Multivariate Time Series Forecasting [16.081071155397186]
This paper proposes the Deep Graph Clustering Transformer (DGCformer) for multivariate time series forecasting.
It first groups these relevant variables by a graph convolutional network integrated with an autoencoder, and a former-latter masked self-attention mechanism is then considered.
arXiv Detail & Related papers (2024-05-14T09:01:33Z) - Redefining Event Types and Group Evolution in Temporal Data [0.16385815610837165]
In temporal data, the predominant approach for characterizing group evolution has been through the identification of events"
We think of events as archetypes" characterized by a unique combination of quantitative dimensions that we call facet extremities"
We apply our framework to evolving groups from several face-to-face interaction datasets, showing it enables richer, more reliable characterization of group dynamics.
arXiv Detail & Related papers (2024-03-11T14:39:24Z) - DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control [68.14798033899955]
Large, pretrained latent diffusion models (LDMs) have demonstrated an extraordinary ability to generate creative content.
However, are they usable as large-scale data generators, e.g., to improve tasks in the perception stack, like semantic segmentation?
We investigate this question in the context of autonomous driving, and answer it with a resounding "yes"
arXiv Detail & Related papers (2023-12-05T18:34:12Z) - Unified and Dynamic Graph for Temporal Character Grouping in Long Videos [31.192044026127032]
Video temporal character grouping locates appearing moments of major characters within a video according to their identities.
Recent works have evolved from unsupervised clustering to graph-based supervised clustering.
We present a unified and dynamic graph (UniDG) framework for temporal character grouping.
arXiv Detail & Related papers (2023-08-27T13:22:55Z) - GADformer: A Transparent Transformer Model for Group Anomaly Detection on Trajectories [0.9971221656644376]
Group Anomaly Detection (GAD) identifies unusual pattern in groups where individual members might not be anomalous.
This paper introduces GADformer, a BERT-based model for attention-driven GAD on trajectories in unsupervised and semi-supervised settings.
We also introduce the Block-Attention-anomaly-Score (BAS) to enhance model transparency by scoring attention patterns.
arXiv Detail & Related papers (2023-03-17T08:49:09Z) - Consistency and Diversity induced Human Motion Segmentation [231.36289425663702]
We propose a novel Consistency and Diversity induced human Motion (CDMS) algorithm.
Our model factorizes the source and target data into distinct multi-layer feature spaces.
A multi-mutual learning strategy is carried out to reduce the domain gap between the source and target data.
arXiv Detail & Related papers (2022-02-10T06:23:56Z) - Learning Multi-Attention Context Graph for Group-Based Re-Identification [214.84551361855443]
Learning to re-identify or retrieve a group of people across non-overlapped camera systems has important applications in video surveillance.
In this work, we consider employing context information for identifying groups of people, i.e., group re-id.
We propose a novel unified framework based on graph neural networks to simultaneously address the group-based re-id tasks.
arXiv Detail & Related papers (2021-04-29T09:57:47Z) - Graph Gamma Process Generalized Linear Dynamical Systems [60.467040479276704]
We introduce graph gamma process (GGP) linear dynamical systems to model real multivariate time series.
For temporal pattern discovery, the latent representation under the model is used to decompose the time series into a parsimonious set of multivariate sub-sequences.
We use the generated random graph, whose number of nonzero-degree nodes is finite, to define both the sparsity pattern and dimension of the latent state transition matrix.
arXiv Detail & Related papers (2020-07-25T04:16:34Z) - Ensemble Model with Batch Spectral Regularization and Data Blending for
Cross-Domain Few-Shot Learning with Unlabeled Data [75.94147344921355]
We build a multi-branch ensemble framework by using diverse feature transformation matrices.
We propose a data blending method to exploit the unlabeled data and augment the sparse support set in the target domain.
arXiv Detail & Related papers (2020-06-08T02:27:34Z) - Semantically Multi-modal Image Synthesis [58.87967932525891]
We focus on semantically multi-modal image synthesis (SMIS) task, namely, generating multi-modal images at the semantic level.
We propose a novel Group Decreasing Network (GroupDNet) that leverages group convolutions in the generator and progressively decreases the group numbers of the convolutions in the decoder.
GroupDNet is armed with much more controllability on translating semantic labels to natural images and has plausible high-quality yields for datasets with many classes.
arXiv Detail & Related papers (2020-03-28T04:03:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.