A spatiotemporal style transfer algorithm for dynamic visual stimulus
  generation
- URL: http://arxiv.org/abs/2403.04940v1
- Date: Thu, 7 Mar 2024 23:07:46 GMT
- Title: A spatiotemporal style transfer algorithm for dynamic visual stimulus
  generation
- Authors: Antonino Greco and Markus Siegel
- Abstract summary: We introduce the Spatiotemporal Style Transfer (STST) algorithm, a dynamic visual stimulus generation framework.
It is based on a two-stream deep neural network model that factorizes spatial and temporal features to generate dynamic visual stimuli.
We show that our algorithm enables the generation of model metamers, dynamic stimuli whose layer activations are matched to those of natural videos.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Understanding how visual information is encoded in biological and artificial
systems often requires vision scientists to generate appropriate stimuli to
test specific hypotheses. Although deep neural network models have
revolutionized the field of image generation with methods such as image style
transfer, available methods for video generation are scarce. Here, we introduce
the Spatiotemporal Style Transfer (STST) algorithm, a dynamic visual stimulus
generation framework that allows powerful manipulation and synthesis of video
stimuli for vision research. It is based on a two-stream deep neural network
model that factorizes spatial and temporal features to generate dynamic visual
stimuli whose model layer activations are matched to those of input videos. As
an example, we show that our algorithm enables the generation of model
metamers, dynamic stimuli whose layer activations within our two-stream model
are matched to those of natural videos. We show that these generated stimuli
match the low-level spatiotemporal features of their natural counterparts but
lack their high-level semantic features, making it a powerful paradigm to study
object recognition. Late layer activations in deep vision models exhibited a
lower similarity between natural and metameric stimuli compared to early
layers, confirming the lack of high-level information in the generated stimuli.
Finally, we use our generated stimuli to probe the representational
capabilities of predictive coding deep networks. These results showcase
potential applications of our algorithm as a versatile tool for dynamic
stimulus generation in vision science.
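
The core idea in the abstract, generating a stimulus whose model layer activations are matched to those of an input, can be illustrated with a minimal sketch. This is not the STST implementation: the paper uses a two-stream deep network on videos, whereas the toy below assumes a single random linear-plus-ReLU "layer" and a plain vector in place of a video. All names (layer_activations, match_activations, weights, natural, metamer) are hypothetical and chosen only for illustration.

```python
import numpy as np

def layer_activations(x, weights):
    """Toy stand-in for one model layer: linear map followed by ReLU."""
    return np.maximum(weights @ x, 0.0)

def match_activations(target_x, weights, steps=300, lr=0.05, seed=0):
    """Start from noise and run gradient descent so that the layer
    activations of the optimized input approach those of target_x."""
    rng = np.random.default_rng(seed)
    x = rng.normal(size=target_x.shape)          # noise initialization
    target = layer_activations(target_x, weights)
    losses = []
    for _ in range(steps):
        pre = weights @ x                        # pre-activations
        diff = np.maximum(pre, 0.0) - target     # activation mismatch
        losses.append(0.5 * float(np.sum(diff ** 2)))
        # Gradient of the squared-error loss w.r.t. x; the boolean mask
        # passes gradient only through units whose ReLU is active.
        grad = weights.T @ (diff * (pre > 0))
        x = x - lr * grad
    return x, losses

rng = np.random.default_rng(1)
weights = rng.normal(size=(32, 16)) / np.sqrt(16)  # assumed fixed "model"
natural = rng.normal(size=16)                      # stand-in for a natural input
metamer, losses = match_activations(natural, weights)
print(f"loss before: {losses[0]:.4f}, after: {losses[-1]:.4f}")
```

In this toy setting the optimized vector plays the role of a model metamer: it is built purely from the activation-matching objective, so nothing forces it to equal the original input outside of what the chosen layer encodes.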
 
      
Related papers
- Deep Neural Encoder-Decoder Model to Relate fMRI Brain Activity with Naturalistic Stimuli [2.7149743794003913]
 We propose an end-to-end deep neural encoder-decoder model to encode and decode brain activity in response to naturalistic stimuli.
We employ temporal convolutional layers in our architecture, which effectively allows us to bridge the temporal resolution gap between natural movie stimuli and fMRI.
 arXiv  Detail & Related papers  (2025-07-16T08:08:48Z)
- Langevin Flows for Modeling Neural Latent Dynamics [81.81271685018284]
 We introduce LangevinFlow, a sequential Variational Auto-Encoder where the time evolution of latent variables is governed by the underdamped Langevin equation.
Our approach incorporates physical priors -- such as inertia, damping, a learned potential function, and forces -- to represent both autonomous and non-autonomous processes in neural systems.
Our method outperforms state-of-the-art baselines on synthetic neural populations generated by a Lorenz attractor.
 arXiv  Detail & Related papers  (2025-07-15T17:57:48Z)
- Visualizing and Controlling Cortical Responses Using Voxel-Weighted Activation Maximization [0.0]
 Deep neural networks (DNNs) are trained on visual representations that resemble those in the human visual system.
We show that activation can be applied to DNN-based encoding models.
We generate images optimized for predicted responses in individual voxels.
 arXiv  Detail & Related papers  (2025-06-04T18:48:08Z)
- Time-Dependent VAE for Building Latent Representations from Visual Neural Activity with Complex Dynamics [25.454851828755054]
 TiDeSPL-VAE can effectively analyze complex visual neural activity and model temporal relationships in a natural way.
Results show that our model not only yields the best decoding performance on naturalistic scenes/movies but also extracts explicit neural dynamics.
 arXiv  Detail & Related papers  (2024-08-15T03:27:23Z)
- On the Trade-off Between Efficiency and Precision of Neural Abstraction [62.046646433536104]
 Neural abstractions have been recently introduced as formal approximations of complex, nonlinear dynamical models.
We employ formal inductive synthesis procedures to generate neural abstractions that result in dynamical models with these semantics.
 arXiv  Detail & Related papers  (2023-07-28T13:22:32Z)
- Long-Range Feedback Spiking Network Captures Dynamic and Static Representations of the Visual Cortex under Movie Stimuli [25.454851828755054]
 There is limited insight into how the visual cortex represents natural movie stimuli that contain context-rich information.
This work proposes the long-range feedback spiking network (LoRaFB-SNet), which mimics top-down connections between cortical regions.
We present Time-Series Representational Similarity Analysis (TSRSA) to measure the similarity between model representations and visual cortical representations of mice.
 arXiv  Detail & Related papers  (2023-06-02T08:25:58Z)
- Modelling Human Visual Motion Processing with Trainable Motion Energy
  Sensing and a Self-attention Network [1.9458156037869137]
 We propose an image-computable model of human motion perception by bridging the gap between biological and computer vision models.
This model architecture aims to capture the computations in V1-MT, the core structure for motion perception in the biological visual system.
In silico neurophysiology reveals that our model's unit responses are similar to mammalian neural recordings regarding motion pooling and speed tuning.
 arXiv  Detail & Related papers  (2023-05-16T04:16:07Z)
- Contrastive-Signal-Dependent Plasticity: Self-Supervised Learning in Spiking Neural Circuits [61.94533459151743]
 This work addresses the challenge of designing neurobiologically-motivated schemes for adjusting the synapses of spiking networks.
Our experimental simulations demonstrate a consistent advantage over other biologically-plausible approaches when training recurrent spiking networks.
 arXiv  Detail & Related papers  (2023-03-30T02:40:28Z)
- Adapting Brain-Like Neural Networks for Modeling Cortical Visual
  Prostheses [68.96380145211093]
 Cortical prostheses are devices implanted in the visual cortex that attempt to restore lost vision by electrically stimulating neurons.
Currently, the vision provided by these devices is limited, and accurately predicting the visual percepts resulting from stimulation is an open challenge.
We propose to address this challenge by utilizing 'brain-like' convolutional neural networks (CNNs), which have emerged as promising models of the visual system.
 arXiv  Detail & Related papers  (2022-09-27T17:33:19Z)
- Emergent organization of receptive fields in networks of excitatory and
  inhibitory neurons [3.674863913115431]
 Motivated by a leaky integrate-and-fire model of neural waves, we propose an activation model that is more typical of artificial neural networks.
Experiments with a synthetic model of somatosensory input are used to investigate how the network dynamics may affect plasticity of neuronal maps under changes to the inputs.
 arXiv  Detail & Related papers  (2022-05-26T20:43:14Z)
- Deep Representations for Time-varying Brain Datasets [4.129225533930966]
 This paper builds an efficient graph neural network model that incorporates both region-mapped fMRI sequences and structural connectivities as inputs.
We find good representations of the latent brain dynamics through learning sample-level adaptive adjacency matrices.
These modules can be easily adapted to and are potentially useful for other applications outside the neuroscience domain.
 arXiv  Detail & Related papers  (2022-05-23T21:57:31Z)
- Backprop-Free Reinforcement Learning with Active Neural Generative
  Coding [84.11376568625353]
 We propose a computational framework for learning action-driven generative models without backpropagation of errors (backprop) in dynamic environments.
We develop an intelligent agent that operates even with sparse rewards, drawing inspiration from the cognitive theory of planning as inference.
The robust performance of our agent offers promising evidence that a backprop-free approach for neural inference and learning can drive goal-directed behavior.
 arXiv  Detail & Related papers  (2021-07-10T19:02:27Z)
- Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes [70.76742458931935]
 We introduce a new representation that models the dynamic scene as a time-variant continuous function of appearance, geometry, and 3D scene motion.
Our representation is optimized through a neural network to fit the observed input views.
We show that our representation can be used for complex dynamic scenes, including thin structures, view-dependent effects, and natural degrees of motion.
 arXiv  Detail & Related papers  (2020-11-26T01:23:44Z)
- Continuous Emotion Recognition with Spatiotemporal Convolutional Neural
  Networks [82.54695985117783]
 We investigate the suitability of state-of-the-art deep learning architectures for continuous emotion recognition using long video sequences captured in-the-wild.
We have developed and evaluated convolutional recurrent neural networks combining 2D-CNNs and long short-term memory units, and inflated 3D-CNN models, which are built by inflating the weights of a pre-trained 2D-CNN model during fine-tuning.
 arXiv  Detail & Related papers  (2020-11-18T13:42:05Z)
This list is automatically generated from the titles and abstracts of the papers on this site.
       
     