Fugu-MT 論文翻訳(概要): JTCSE: Joint Tensor-Modulus Constraints and Cross-Attention for Unsupervised Contrastive Learning of Sentence Embeddings

論文の概要: JTCSE: Joint Tensor-Modulus Constraints and Cross-Attention for Unsupervised Contrastive Learning of Sentence Embeddings

arxiv url: http://arxiv.org/abs/2505.02366v1
Date: Mon, 05 May 2025 05:09:21 GMT
ステータス: 翻訳完了
システム内更新日: 2025-05-06 18:49:35.564528
Title: JTCSE: Joint Tensor-Modulus Constraints and Cross-Attention for Unsupervised Contrastive Learning of Sentence Embeddings
Title（参考訳）: JTCSE: テキスト埋め込みの教師なしコントラスト学習における関節腱・関節拘束と交差注意
Authors: Tianyu Zong, Hongzhu Yi, Bingkang Shi, Yuanxiang Wang, Jungang Xu,
Abstract要約: 我々は,新しい textbfJoint textbfTensor representation modulus constraint と textbfCross-attention unsupervised contrastive learning textbfSentence textbfEmbedding representation framework JTCSE を提案する。
参考スコア（独自算出の注目度）: 5.152575977825381
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Unsupervised contrastive learning has become a hot research topic in natural language processing. Existing works usually aim at constraining the orientation distribution of the representations of positive and negative samples in the high-dimensional semantic space in contrastive learning, but the semantic representation tensor possesses both modulus and orientation features, and the existing works ignore the modulus feature of the representations and cause insufficient contrastive learning. % Therefore, we firstly propose a training objective that aims at modulus constraints on the semantic representation tensor, to strengthen the alignment between the positive samples in contrastive learning. Therefore, we first propose a training objective that is designed to impose modulus constraints on the semantic representation tensor, to strengthen the alignment between positive samples in contrastive learning. Then, the BERT-like model suffers from the phenomenon of sinking attention, leading to a lack of attention to CLS tokens that aggregate semantic information. In response, we propose a cross-attention structure among the twin-tower ensemble models to enhance the model's attention to CLS token and optimize the quality of CLS Pooling. Combining the above two motivations, we propose a new \textbf{J}oint \textbf{T}ensor representation modulus constraint and \textbf{C}ross-attention unsupervised contrastive learning \textbf{S}entence \textbf{E}mbedding representation framework JTCSE, which we evaluate in seven semantic text similarity computation tasks, and the experimental results show that JTCSE's twin-tower ensemble model and single-tower distillation model outperform the other baselines and become the current SOTA. In addition, we have conducted an extensive zero-shot downstream task evaluation, which shows that JTCSE outperforms other baselines overall on more than 130 tasks.
Abstract（参考訳）: 教師なしのコントラスト学習は、自然言語処理においてホットな研究トピックとなっている。既存の研究は通常、高次元意味空間における正および負のサンプルの表現の向き分布を制約することを目的としているが、意味表現テンソルはモジュラー特徴と向き特徴の両方を持ち、既存の研究は表現のモジュラー特徴を無視し、対照的な学習を引き起こす。そこで,本研究では,まず,意味表現テンソルの剛性制約を目的とした学習目標を提案し,比較学習における正のサンプル間のアライメントを強化する。そこで我々はまず, 意味表現テンソルに変調制約を課し, 対照的な学習における正のサンプル間のアライメントを強化するための訓練目標を提案する。そして、BERTのようなモデルは注意を落としてしまう現象に悩まされ、意味情報を集約するCLSトークンに注意が払われなくなる。そこで本研究では,CLSトークンに対するモデルの注目度を高め,CLSプールの品質を最適化するために,ツイン・トウ・アンサンブルモデル間のクロスアテンション構造を提案する。以上の2つのモチベーションを組み合わせることで、新しい \textbf{J}oint \textbf{T}ensor representation modulus constraint と \textbf{C}ross-attention unsupervised contrastive learning \textbf{S}entence \textbf{E}mbedding representation framework JTCSE が提案され、7つの意味的テキスト類似性計算タスクで評価され、JTCSE のツイン・トウ・アンサンブルモデルとシングル・トウワー蒸留モデルが他方のベースラインを上回り、現在の SOTA となることを示す。さらに,130以上のタスクにおいて,JTCSEが他のベースラインを上回っていることを示す,広範囲なゼロショットダウンストリームタスク評価を実施している。

関連論文リスト

TNCSE: Tensor's Norm Constraints for Unsupervised Contrastive Learning of Sentence Embeddings [4.62170384991303]
本稿では,新しい文埋め込み表現フレームワーク TNCSE を提案する。我々は,7つの意味的テキスト類似性タスクを評価し,TNCSEと派生モデルが現在最先端のアプローチであることを示す。
論文参考訳（メタデータ） (2025-03-17T02:14:42Z)
Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation [56.87049651707208]
セマンティックはインコンテクストタスクへと発展し、一般化的セグメンテーションモデルを評価する上で重要な要素となった。我々の最初の焦点は、クエリイメージとサポートイメージの相互作用を容易にする方法を理解することであり、その結果、自己注意フレームワーク内のKV融合法が提案される。そこで我々はDiffewSというシンプルで効果的なフレームワークを構築し,従来の潜在拡散モデルの生成フレームワークを最大限に保持する。
論文参考訳（メタデータ） (2024-10-03T10:33:49Z)
ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation [24.743048965822297]
本稿では,ItTakesTwo (IT2) と呼ばれる半教師付きLiDARセマンティックセマンティックセマンティクスフレームワークを提案する。 IT2は、ピアLiDAR表現からの一貫性のある予測を保証するために設計されており、一貫性学習における摂動効率を改善する。その結果,本手法は従来のSOTA法よりも顕著に改善されていることがわかった。
論文参考訳（メタデータ） (2024-07-09T18:26:53Z)
A Probabilistic Model Behind Self-Supervised Learning [53.64989127914936]
自己教師付き学習(SSL)では、アノテートラベルなしで補助的なタスクを通じて表現が学習される。自己教師型学習のための生成潜在変数モデルを提案する。対照的な方法を含む識別的SSLのいくつかのファミリーは、表現に匹敵する分布を誘導することを示した。
論文参考訳（メタデータ） (2024-02-02T13:31:17Z)
Co-guiding for Multi-intent Spoken Language Understanding [53.30511968323911]
本稿では,2つのタスク間の相互指導を実現するための2段階のフレームワークを実装した,コガイドネットと呼ばれる新しいモデルを提案する。第1段階では,単一タスクによる教師付きコントラスト学習を提案し,第2段階ではコガイドによる教師付きコントラスト学習を提案する。マルチインテリジェントSLU実験の結果,我々のモデルは既存のモデルよりも大きなマージンで優れていることがわかった。
論文参考訳（メタデータ） (2023-11-22T08:06:22Z)
Identical and Fraternal Twins: Fine-Grained Semantic Contrastive Learning of Sentence Representations [6.265789210037749]
コントラスト学習フレームワークのIdentical Twins と Fraternal Twins を導入する。また,提案したツインズ・ロスの有効性を証明するために,概念実証実験と対照的な目的を組み合わせる。
論文参考訳（メタデータ） (2023-07-20T15:02:42Z)
Language as a Latent Sequence: deep latent variable models for semi-supervised paraphrase generation [47.33223015862104]
本稿では,観測されたテキストから遅延シーケンス推論を行うVSARという新しい教師なしモデルを提案する。また、テキストペアからの情報を活用するために、提案したVSARモデルと統合するために設計されたDDLと呼ばれる新しい教師付きモデルを導入する。実験により, このモデルを組み合わせることで, 完全データに基づく最先端の教師付きベースラインに対して, 競争性能が向上することが示唆された。
論文参考訳（メタデータ） (2023-01-05T19:35:30Z)
DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning [37.48292304239107]
本稿では, DUET という変換器を用いたエンドツーエンドZSL手法を提案する。画像からセマンティック属性を分離するモデルの能力を調べるために,モーダルなセマンティックグラウンドネットワークを開発した。 DUETは、しばしば最先端のパフォーマンスを達成することができ、そのコンポーネントは有効であり、予測は解釈可能である。
論文参考訳（メタデータ） (2022-07-04T11:12:12Z)
Semantics-Depth-Symbiosis: Deeply Coupled Semi-Supervised Learning of Semantics and Depth [83.94528876742096]
我々は,意味的セグメンテーションと深さ推定という2つの密なタスクのMTL問題に取り組み,クロスチャネル注意モジュール(CCAM)と呼ばれる新しいアテンションモジュールを提案する。次に,AffineMixと呼ばれる予測深度を用いた意味分節タスクのための新しいデータ拡張と,ColorAugと呼ばれる予測セマンティクスを用いた単純な深度増分を定式化する。最後に,提案手法の性能向上をCityscapesデータセットで検証し,深度と意味に基づく半教師付きジョイントモデルにおける最先端結果の実現を支援する。
論文参考訳（メタデータ） (2022-06-21T17:40:55Z)
SDCUP: Schema Dependency-Enhanced Curriculum Pre-Training for Table Semantic Parsing [19.779493883522072]
本稿では,テーブル事前学習のための学習表現に所望の帰納バイアスを課すために,2つの新しい事前学習目標を設計する。本稿では,雑音の影響を緩和し,事前学習データから容易にハードな方法で効果的に学習する,スキーマ対応のカリキュラム学習手法を提案する。
論文参考訳（メタデータ） (2021-11-18T02:51:04Z)
Dense Contrastive Visual-Linguistic Pretraining [53.61233531733243]
画像とテキストを共同で表現するマルチモーダル表現学習手法が提案されている。これらの手法は,大規模マルチモーダル事前学習から高レベルな意味情報を取得することにより,優れた性能を実現する。そこで本稿では,非バイアスのDense Contrastive Visual-Linguistic Pretrainingを提案する。
論文参考訳（メタデータ） (2021-09-24T07:20:13Z)
Orthogonal Ensemble Networks for Biomedical Image Segmentation [10.011414604407681]
モデル多様性を明示する新しいフレームワークであるOrthogonal Ensemble Networks (OEN)を紹介する。提案手法を2つの課題脳病変セグメンテーションタスクでベンチマークする。実験結果から,本手法はより頑健でよく校正されたアンサンブルモデルを生成することが示された。
論文参考訳（メタデータ） (2021-05-22T23:44:55Z)
Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning [94.35586521144117]
コントラスト学習を微調整に適用することでさらにメリットが得られるか検討する。本研究では,コントラスト正規化調律(core-tuning)を提案する。
論文参考訳（メタデータ） (2021-02-12T16:31:24Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。