A Dual-Module Denoising Approach with Curriculum Learning for Enhancing Multimodal Aspect-Based Sentiment Analysis
- URL: http://arxiv.org/abs/2412.08489v1
- Date: Wed, 11 Dec 2024 15:53:13 GMT
- Title: A Dual-Module Denoising Approach with Curriculum Learning for Enhancing Multimodal Aspect-Based Sentiment Analysis
- Authors: Nguyen Van Doan, Dat Tran Nguyen, Cam-Van Thi Nguyen,
- Abstract summary: Multimodal Aspect-Based Sentiment Analysis (MABSA) combines text and images to perform sentiment analysis.
Existing methodologies address either sentence-image denoising or aspect-image denoising but fail to tackle both types of noise.
We propose DualDe, a novel approach comprising two distinct components: the Hybrid Curriculum Denoising Module (HCD) and the Aspect-Enhance Denoising Module (AED)
- Score: 0.0
- License:
- Abstract: Multimodal Aspect-Based Sentiment Analysis (MABSA) combines text and images to perform sentiment analysis but often struggles with irrelevant or misleading visual information. Existing methodologies typically address either sentence-image denoising or aspect-image denoising but fail to comprehensively tackle both types of noise. To address these limitations, we propose DualDe, a novel approach comprising two distinct components: the Hybrid Curriculum Denoising Module (HCD) and the Aspect-Enhance Denoising Module (AED). The HCD module enhances sentence-image denoising by incorporating a flexible curriculum learning strategy that prioritizes training on clean data. Concurrently, the AED module mitigates aspect-image noise through an aspect-guided attention mechanism that filters out noisy visual regions which unrelated to the specific aspects of interest. Our approach demonstrates effectiveness in addressing both sentence-image and aspect-image noise, as evidenced by experimental evaluations on benchmark datasets.
Related papers
- Positive2Negative: Breaking the Information-Lossy Barrier in Self-Supervised Single Image Denoising [26.67217493971613]
Existing self-supervised image denoising paradigms rely heavily on information-lossy operations.
We propose a novel self-supervised single image denoising paradigm, Positive2Negative, to break the information-lossy barrier.
Our paradigm achieves state-of-the-art performance in self-supervised single image denoising with significant speed improvements.
arXiv Detail & Related papers (2024-12-21T03:25:01Z) - Enhance Vision-Language Alignment with Noise [59.2608298578913]
We investigate whether the frozen model can be fine-tuned by customized noise.
We propose Positive-incentive Noise (PiNI) which can fine-tune CLIP via injecting noise into both visual and text encoders.
arXiv Detail & Related papers (2024-12-14T12:58:15Z) - denoiSplit: a method for joint microscopy image splitting and unsupervised denoising [7.362569187959687]
denoiSplit is a method to tackle the challenge of joint semantic image splitting and unsupervised denoising.
Image splitting involves dissecting an image into its distinguishable semantic structures.
We show that the current state-of-the-art method for this task struggles in the presence of image noise.
arXiv Detail & Related papers (2024-03-18T15:03:56Z) - M2DF: Multi-grained Multi-curriculum Denoising Framework for Multimodal
Aspect-based Sentiment Analysis [32.9772577419091]
Multimodal Aspect-based Sentiment Analysis (MABSA) is a fine-grained Sentiment Analysis task.
We propose a Multi-grained Multi-curriculum Denoising Framework (M2DF) which can achieve denoising by adjusting the order of training data.
Our framework consistently outperforms state-of-the-art work on three sub-tasks of MABSA.
arXiv Detail & Related papers (2023-10-23T06:22:39Z) - Stimulating Diffusion Model for Image Denoising via Adaptive Embedding and Ensembling [56.506240377714754]
We present a novel strategy called the Diffusion Model for Image Denoising (DMID)
Our strategy includes an adaptive embedding method that embeds the noisy image into a pre-trained unconditional diffusion model.
Our DMID strategy achieves state-of-the-art performance on both distortion-based and perception-based metrics.
arXiv Detail & Related papers (2023-07-08T14:59:41Z) - Deep Semantic Statistics Matching (D2SM) Denoising Network [70.01091467628068]
We introduce the Deep Semantic Statistics Matching (D2SM) Denoising Network.
It exploits semantic features of pretrained classification networks, then it implicitly matches the probabilistic distribution of clear images at the semantic feature space.
By learning to preserve the semantic distribution of denoised images, we empirically find our method significantly improves the denoising capabilities of networks.
arXiv Detail & Related papers (2022-07-19T14:35:42Z) - Deformed2Self: Self-Supervised Denoising for Dynamic Medical Imaging [0.0]
We propose Deformed2Self, an end-to-end self-supervised deep learning framework for dynamic imaging denoising.
It combines single-image and multi-image denoising to improve image quality and use a spatial transformer network to model motion between different slices.
arXiv Detail & Related papers (2021-06-23T05:50:19Z) - Synergy Between Semantic Segmentation and Image Denoising via Alternate
Boosting [102.19116213923614]
We propose a boosting network to perform denoising and segmentation alternately.
We observe that not only denoising helps combat the drop of segmentation accuracy due to noise, but also pixel-wise semantic information boosts the capability of denoising.
Experimental results show that the denoised image quality is improved substantially and the segmentation accuracy is improved to close to that of clean images.
arXiv Detail & Related papers (2021-02-24T06:48:45Z) - ISCL: Interdependent Self-Cooperative Learning for Unpaired Image
Denoising [3.796436257221662]
We propose a novel image denoising scheme, Interdependent Self-Cooperative Learning (ISCL)
ISCL combines cyclic adversarial learning with self-supervised residual learning.
We demonstrate that ISCL is superior to conventional and current state-of-the-art deep learning-based image denoising methods.
arXiv Detail & Related papers (2021-02-19T10:54:25Z) - Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person
Re-Identification [208.1227090864602]
Visible-infrared person re-identification (VI-ReID) is a challenging cross-modality pedestrian retrieval problem.
Existing VI-ReID methods tend to learn global representations, which have limited discriminability and weak robustness to noisy images.
We propose a novel dynamic dual-attentive aggregation (DDAG) learning method by mining both intra-modality part-level and cross-modality graph-level contextual cues for VI-ReID.
arXiv Detail & Related papers (2020-07-18T03:08:13Z) - Fully Unsupervised Diversity Denoising with Convolutional Variational
Autoencoders [81.30960319178725]
We propose DivNoising, a denoising approach based on fully convolutional variational autoencoders (VAEs)
First we introduce a principled way of formulating the unsupervised denoising problem within the VAE framework by explicitly incorporating imaging noise models into the decoder.
We show that such a noise model can either be measured, bootstrapped from noisy data, or co-learned during training.
arXiv Detail & Related papers (2020-06-10T21:28:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.