Investigating the changes in BOLD responses during viewing of images
with varied complexity: An fMRI time-series based analysis on human vision
- URL: http://arxiv.org/abs/2309.15495v1
- Date: Wed, 27 Sep 2023 08:46:09 GMT
- Title: Investigating the changes in BOLD responses during viewing of images
with varied complexity: An fMRI time-series based analysis on human vision
- Authors: Naveen Kanigiri, Manohar Suggula, Debanjali Bhattacharya and Neelam
Sinha
- Abstract summary: This work aims to investigate the neurological variation of human brain responses during viewing of images with varied complexity using fMRI time series (TS) analysis.
Our first study employs classical machine learning and deep learning strategies to classify image complexity-specific fMRI TS.
The obtained result of this analysis has established a baseline in studying how differently human brain functions while looking into images of diverse complexities.
- Score: 2.7036595757881323
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Functional MRI (fMRI) is widely used to examine brain functionality by
detecting alteration in oxygenated blood flow that arises with brain activity.
This work aims to investigate the neurological variation of human brain
responses during viewing of images with varied complexity using fMRI time
series (TS) analysis. Publicly available BOLD5000 dataset is used for this
purpose which contains fMRI scans while viewing 5254 distinct images of diverse
categories, drawn from three standard computer vision datasets: COCO, Imagenet
and SUN. To understand vision, it is important to study how brain functions
while looking at images of diverse complexities. Our first study employs
classical machine learning and deep learning strategies to classify image
complexity-specific fMRI TS, represents instances when images from COCO,
Imagenet and SUN datasets are seen. The implementation of this classification
across visual datasets holds great significance, as it provides valuable
insights into the fluctuations in BOLD signals when perceiving images of
varying complexities. Subsequently, temporal semantic segmentation is also
performed on whole fMRI TS to segment these time instances. The obtained result
of this analysis has established a baseline in studying how differently human
brain functions while looking into images of diverse complexities. Therefore,
accurate identification and distinguishing of variations in BOLD signals from
fMRI TS data serves as a critical initial step in vision studies, providing
insightful explanations for how static images with diverse complexities are
perceived.
Related papers
- Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI Generation [51.28453192441364]
Multimodal brain magnetic resonance (MR) imaging is indispensable in neuroscience and neurology.
Current MR image synthesis approaches are typically trained on independent datasets for specific tasks.
We present TUMSyn, a Text-guided Universal MR image Synthesis model, which can flexibly generate brain MR images.
arXiv Detail & Related papers (2024-09-25T11:14:47Z) - Autoregressive Sequence Modeling for 3D Medical Image Representation [48.706230961589924]
We introduce a pioneering method for learning 3D medical image representations through an autoregressive sequence pre-training framework.
Our approach various 3D medical images based on spatial, contrast, and semantic correlations, treating them as interconnected visual tokens within a token sequence.
arXiv Detail & Related papers (2024-09-13T10:19:10Z) - MindFormer: Semantic Alignment of Multi-Subject fMRI for Brain Decoding [50.55024115943266]
We introduce a novel semantic alignment method of multi-subject fMRI signals using so-called MindFormer.
This model is specifically designed to generate fMRI-conditioned feature vectors that can be used for conditioning Stable Diffusion model for fMRI- to-image generation or large language model (LLM) for fMRI-to-text generation.
Our experimental results demonstrate that MindFormer generates semantically consistent images and text across different subjects.
arXiv Detail & Related papers (2024-05-28T00:36:25Z) - MindTuner: Cross-Subject Visual Decoding with Visual Fingerprint and Semantic Correction [21.531569319105877]
Reconstructing high-quality images in cross-subject tasks is a challenging problem due to profound individual differences between subjects.
MindTuner achieves high-quality and rich-semantic reconstructions using only 1 hour of fMRI training data.
arXiv Detail & Related papers (2024-04-19T05:12:04Z) - Psychometry: An Omnifit Model for Image Reconstruction from Human Brain Activity [60.983327742457995]
Reconstructing the viewed images from human brain activity bridges human and computer vision through the Brain-Computer Interface.
We devise Psychometry, an omnifit model for reconstructing images from functional Magnetic Resonance Imaging (fMRI) obtained from different subjects.
arXiv Detail & Related papers (2024-03-29T07:16:34Z) - Brain-ID: Learning Contrast-agnostic Anatomical Representations for
Brain Imaging [11.06907516321673]
We introduce Brain-ID, an anatomical representation learning model for brain imaging.
With the proposed "mild-to-severe" intrasubject generation, Brain-ID is robust to the subject-specific brain anatomy.
We present new metrics to validate the intra- and inter-subject robustness, and evaluate their performance on four downstream applications.
arXiv Detail & Related papers (2023-11-28T16:16:10Z) - Image complexity based fMRI-BOLD visual network categorization across
visual datasets using topological descriptors and deep-hybrid learning [3.522950356329991]
The aim of this study is to examine how network topology differs in response to distinct visual stimuli from visual datasets.
To achieve this, 0- and 1-dimensional persistence diagrams are computed for each visual network representing COCO, ImageNet, and SUN.
The extracted K-means cluster features are fed to a novel deep-hybrid model that yields accuracy in the range of 90%-95% in classifying these visual networks.
arXiv Detail & Related papers (2023-11-03T14:05:57Z) - Spatial encoding of BOLD fMRI time series for categorizing static images
across visual datasets: A pilot study on human vision [3.038642416291856]
Specific image categorization is performed using fMRI time series (TS) to understand differences in neuronal activities related to vision.
To understand vision, it is important to study how brain functions while looking at different images.
It is seen that parallel CNN model outperforms other network models with an improvement of 7% for multi-class classification.
arXiv Detail & Related papers (2023-09-07T09:31:27Z) - On Sensitivity and Robustness of Normalization Schemes to Input
Distribution Shifts in Automatic MR Image Diagnosis [58.634791552376235]
Deep Learning (DL) models have achieved state-of-the-art performance in diagnosing multiple diseases using reconstructed images as input.
DL models are sensitive to varying artifacts as it leads to changes in the input data distribution between the training and testing phases.
We propose to use other normalization techniques, such as Group Normalization and Layer Normalization, to inject robustness into model performance against varying image artifacts.
arXiv Detail & Related papers (2023-06-23T03:09:03Z) - BrainCLIP: Bridging Brain and Visual-Linguistic Representation Via CLIP
for Generic Natural Visual Stimulus Decoding [51.911473457195555]
BrainCLIP is a task-agnostic fMRI-based brain decoding model.
It bridges the modality gap between brain activity, image, and text.
BrainCLIP can reconstruct visual stimuli with high semantic fidelity.
arXiv Detail & Related papers (2023-02-25T03:28:54Z) - The Neural Correlates of Image Texture in the Human Vision Using
Magnetoencephalography [1.3198689566654107]
textural property of an image is one of the most important features in object recognition task in both human and computer vision applications.
Here, we investigated the neural signatures of four well-known statistical texture features including contrast, homogeneity, energy, and correlation.
Results: hierarchical structure in the processing of these four texture descriptors in the human brain with the order of contrast, homogeneity, energy, and correlation.
arXiv Detail & Related papers (2021-11-16T01:09:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.