Fugu-MT paper translation (abstract): ViCLSR: A Supervised Contrastive Learning Framework with Natural Language Inference for Natural Language Understanding Tasks

Paper overview: ViCLSR: A Supervised Contrastive Learning Framework with Natural Language Inference for Natural Language Understanding Tasks

arxiv url: http://arxiv.org/abs/2603.21084v1
Date: Sun, 22 Mar 2026 06:46:26 GMT
Status: translation complete
Last updated in system: 2026-03-24 19:11:39.230011
Title: ViCLSR: A Supervised Contrastive Learning Framework with Natural Language Inference for Natural Language Understanding Tasks
Title (reference translation): ViCLSR - A Supervised Contrastive Learning Framework Using Natural Language Inference for Natural Language Understanding Tasks
Authors: Tin Van Huynh, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen
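Abstract summary: ViCLSR (Vietnamese Contrastive Learning for Sentence Representations) is a supervised contrastive learning framework designed to optimize sentence embeddings for Vietnamese. In experiments, ViCLSR outperformed PhoBERT, a strong monolingual pre-trained model, on five benchmark NLU datasets.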
Reference score (independently computed attention score): 9.232020878700967
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: High-quality text representations are crucial for natural language understanding (NLU), but low-resource languages like Vietnamese face challenges due to limited annotated data. While pre-trained models like PhoBERT and CafeBERT perform well, their effectiveness is constrained by data scarcity. Contrastive learning (CL) has recently emerged as a promising approach for improving sentence representations, enabling models to effectively distinguish between semantically similar and dissimilar sentences. We propose ViCLSR (Vietnamese Contrastive Learning for Sentence Representations), a novel supervised contrastive learning framework specifically designed to optimize sentence embeddings for Vietnamese, leveraging existing natural language inference (NLI) datasets. Additionally, we propose a process to adapt existing Vietnamese datasets for supervised learning, ensuring compatibility with CL methods. Our experiments demonstrate that ViCLSR significantly outperforms the powerful monolingual pre-trained model PhoBERT on five benchmark NLU datasets such as ViNLI (+6.97% F1), ViWikiFC (+4.97% F1), ViFactCheck (+9.02% F1), UIT-ViCTSD (+5.36% F1), and ViMMRC2.0 (+4.33% Accuracy). ViCLSR shows that supervised contrastive learning can effectively address resource limitations in Vietnamese NLU tasks and improve sentence representation learning for low-resource languages. Furthermore, we conduct an in-depth analysis of the experimental results to uncover the factors contributing to the superior performance of contrastive learning models. ViCLSR is released for research purposes in advancing natural language processing tasks.
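Abstract (reference translation): High-quality text representations are essential for natural language understanding (NLU), but low-resource languages such as Vietnamese face difficulties because annotated data is limited. Pre-trained models such as PhoBERT and CafeBERT perform well, but their effectiveness is constrained by data scarcity. Contrastive learning (CL) has recently emerged as a promising approach to improving sentence representations, enabling models to distinguish effectively between semantically similar and dissimilar sentences. ViCLSR (Vietnamese Contrastive Learning for Sentence Representations) is a supervised contrastive learning framework designed to optimize sentence embeddings for Vietnamese by leveraging existing natural language inference (NLI) datasets. The authors also propose a process for adapting existing Vietnamese datasets for supervised learning, ensuring compatibility with CL methods. In experiments, ViCLSR outperformed PhoBERT, a strong monolingual pre-trained model, on five benchmark NLU datasets: ViNLI (+6.97% F1), ViWikiFC (+4.97% F1), ViFactCheck (+9.02% F1), UIT-ViCTSD (+5.36% F1), and ViMMRC2.0 (+4.33% Accuracy). ViCLSR shows that supervised contrastive learning can effectively address resource limitations in Vietnamese NLU tasks and improve sentence representation learning for low-resource languages. The authors further conduct a detailed analysis of the experimental results to identify the factors behind the superior performance of contrastive learning models. ViCLSR has been released for research purposes to advance natural language processing tasks.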
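The abstract does not spell out the training objective, so the following is only an illustrative sketch of how supervised contrastive learning on NLI data is commonly set up, not the authors' implementation. It assumes a SimCSE-style arrangement in which each premise is the anchor, its entailed hypothesis is the positive, and its contradicting hypothesis is a hard negative. The function names `build_triplets` and `supervised_contrastive_loss`, the label strings, and the temperature value are all hypothetical.

```python
import numpy as np


def build_triplets(nli_pairs):
    """Group (premise, hypothesis, label) rows into (anchor, positive, hard_negative).

    Assumption: labels are the strings "entailment" and "contradiction";
    premises lacking either label are dropped.
    """
    by_premise = {}
    for premise, hypothesis, label in nli_pairs:
        by_premise.setdefault(premise, {})[label] = hypothesis
    return [
        (premise, hyps["entailment"], hyps["contradiction"])
        for premise, hyps in by_premise.items()
        if "entailment" in hyps and "contradiction" in hyps
    ]


def supervised_contrastive_loss(anchor, positive, negative, temperature=0.05):
    """InfoNCE-style loss over embeddings of shape (batch, dim).

    Each anchor is pulled toward its own positive and pushed away from
    every other positive and every hard negative in the batch.
    """
    def normalize(x):
        return x / np.linalg.norm(x, axis=1, keepdims=True)

    a, p, n = normalize(anchor), normalize(positive), normalize(negative)
    # Cosine similarity of each anchor to all positives and all hard negatives.
    logits = np.concatenate([a @ p.T, a @ n.T], axis=1) / temperature
    # Subtract the row maximum for a numerically stable log-softmax.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    idx = np.arange(len(a))
    # The correct "class" for anchor i is positive i (column i).
    return float(-log_prob[idx, idx].mean())
```

In practice the embeddings would come from an encoder such as PhoBERT and the loss would be computed in a framework with automatic differentiation; plain NumPy is used here only to keep the arithmetic visible.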

