Decoding natural image stimuli from fMRI data with a surface-based
convolutional network
- URL: http://arxiv.org/abs/2212.02409v1
- Date: Mon, 5 Dec 2022 16:47:19 GMT
- Title: Decoding natural image stimuli from fMRI data with a surface-based
convolutional network
- Authors: Zijin Gu, Keith Jamison, Amy Kuceyeski and Mert Sabuncu
- Abstract summary: We propose a novel approach to decode visual stimuli with high semantic fidelity and rich fine-grained detail.
Our proposed method achieves state-of-the-art semantic fidelity, while yielding good fine-grained similarity with the ground-truth stimulus.
- Score: 1.8352113484137629
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Due to the low signal-to-noise ratio and limited resolution of functional MRI
data, and the high complexity of natural images, reconstructing a visual
stimulus from human brain fMRI measurements is a challenging task. In this
work, we propose a novel approach for this task, which we call Cortex2Image, to
decode visual stimuli with high semantic fidelity and rich fine-grained detail.
In particular, we train a surface-based convolutional network model that maps
from brain response to semantic image features first (Cortex2Semantic). We then
combine this model with a high-quality image generator (Instance-Conditioned
GAN) to train another mapping from brain response to fine-grained image
features using a variational approach (Cortex2Detail). Image reconstructions
obtained by our proposed method achieve state-of-the-art semantic fidelity,
while yielding good fine-grained similarity with the ground-truth stimulus. Our
code is available at: https://github.com/zijin-gu/meshconv-decoding.git.
Related papers
- Realistic Extreme Image Rescaling via Generative Latent Space Learning [51.85790402171696]
We propose a novel framework called Latent Space Based Image Rescaling (LSBIR) for extreme image rescaling tasks.
LSBIR effectively leverages powerful natural image priors learned by a pre-trained text-to-image diffusion model to generate realistic HR images.
In the first stage, a pseudo-invertible encoder-decoder models the bidirectional mapping between the latent features of the HR image and the target-sized LR image.
In the second stage, the reconstructed features from the first stage are refined by a pre-trained diffusion model to generate more faithful and visually pleasing details.
arXiv Detail & Related papers (2024-08-17T09:51:42Z) - Deep Cardiac MRI Reconstruction with ADMM [7.694990352622926]
We present a deep learning (DL)-based method for accelerated cine and multi-contrast reconstruction in the context of cardiac imaging.
Our method optimize in both the image and k-space domains, allowing for high reconstruction fidelity.
arXiv Detail & Related papers (2023-10-10T13:46:11Z) - GM-NeRF: Learning Generalizable Model-based Neural Radiance Fields from
Multi-view Images [79.39247661907397]
We introduce an effective framework Generalizable Model-based Neural Radiance Fields to synthesize free-viewpoint images.
Specifically, we propose a geometry-guided attention mechanism to register the appearance code from multi-view 2D images to a geometry proxy.
arXiv Detail & Related papers (2023-03-24T03:32:02Z) - Natural scene reconstruction from fMRI signals using generative latent
diffusion [1.90365714903665]
We present a two-stage scene reconstruction framework called Brain-Diffuser''
In the first stage, we reconstruct images that capture low-level properties and overall layout using a VDVAE (Very Deep Vari Autoencoder) model.
In the second stage, we use the image-to-image framework of a latent diffusion model conditioned on predicted multimodal (text and visual) features.
arXiv Detail & Related papers (2023-03-09T15:24:26Z) - BrainCLIP: Bridging Brain and Visual-Linguistic Representation Via CLIP
for Generic Natural Visual Stimulus Decoding [51.911473457195555]
BrainCLIP is a task-agnostic fMRI-based brain decoding model.
It bridges the modality gap between brain activity, image, and text.
BrainCLIP can reconstruct visual stimuli with high semantic fidelity.
arXiv Detail & Related papers (2023-02-25T03:28:54Z) - Mind Reader: Reconstructing complex images from brain activities [16.78619734818198]
We focus on reconstructing the complex image stimuli from fMRI (functional magnetic resonance imaging) signals.
Unlike previous works that reconstruct images with single objects or simple shapes, our work aims to reconstruct image stimuli rich in semantics.
We find that incorporating an additional text modality is beneficial for the reconstruction problem compared to directly translating brain signals to images.
arXiv Detail & Related papers (2022-09-30T06:32:46Z) - Facial Image Reconstruction from Functional Magnetic Resonance Imaging
via GAN Inversion with Improved Attribute Consistency [5.705640492618758]
We propose a new framework to reconstruct facial images from fMRI data.
The proposed framework accomplishes two goals: (1) reconstructing clear facial images from fMRI data and (2) maintaining the consistency of semantic characteristics.
arXiv Detail & Related papers (2022-07-03T11:18:35Z) - Spatial-Separated Curve Rendering Network for Efficient and
High-Resolution Image Harmonization [59.19214040221055]
We propose a novel spatial-separated curve rendering network (S$2$CRNet) for efficient and high-resolution image harmonization.
The proposed method reduces more than 90% parameters compared with previous methods.
Our method can work smoothly on higher resolution images in real-time which is more than 10$times$ faster than the existing methods.
arXiv Detail & Related papers (2021-09-13T07:20:16Z) - Adaptive Gradient Balancing for UndersampledMRI Reconstruction and
Image-to-Image Translation [60.663499381212425]
We enhance the image quality by using a Wasserstein Generative Adversarial Network combined with a novel Adaptive Gradient Balancing technique.
In MRI, our method minimizes artifacts, while maintaining a high-quality reconstruction that produces sharper images than other techniques.
arXiv Detail & Related papers (2021-04-05T13:05:22Z) - Deep Parallel MRI Reconstruction Network Without Coil Sensitivities [4.559089047554929]
We propose a novel deep neural network architecture by mapping the robust proximal gradient scheme for fast image reconstruction in parallel MRI (pMRI) with regularization function trained from data.
The proposed network learns to adaptively combine the multi-coil images from incomplete pMRI data into a single image with homogeneous contrast, which is then passed to a nonlinear encoder to efficiently extract sparse features of the image.
arXiv Detail & Related papers (2020-08-04T08:39:36Z) - Neural Sparse Representation for Image Restoration [116.72107034624344]
Inspired by the robustness and efficiency of sparse coding based image restoration models, we investigate the sparsity of neurons in deep networks.
Our method structurally enforces sparsity constraints upon hidden neurons.
Experiments show that sparse representation is crucial in deep neural networks for multiple image restoration tasks.
arXiv Detail & Related papers (2020-06-08T05:15:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.