Fugu-MT 論文翻訳(概要): A Comparison of Methods for Treatment Assignment with an Application to Playlist Generation

論文の概要: A Comparison of Methods for Treatment Assignment with an Application to Playlist Generation

arxiv url: http://arxiv.org/abs/2004.11532v5
Date: Sat, 30 Apr 2022 23:16:10 GMT
ステータス: 翻訳完了
システム内更新日: 2022-12-10 03:07:34.528640
Title: A Comparison of Methods for Treatment Assignment with an Application to Playlist Generation
Title（参考訳）: プレイリスト生成における治療課題の方法と応用の比較
Authors: Carlos Fern\'andez-Lor\'ia, Foster Provost, Jesse Anderton, Benjamin Carterette, Praveen Chandar
Abstract要約: 文献で提案される様々な手法をアルゴリズムの3つの一般的なクラス(またはメタナー)に分類する。結果や因果効果の予測を最適化することは、治療課題の最適化と同じではないことを分析的および実証的に示す。これは、大規模な実世界のアプリケーションにおける3つの異なるメタラーナーの最初の比較である。
参考スコア（独自算出の注目度）: 13.804332504576301
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: This study presents a systematic comparison of methods for individual treatment assignment, a general problem that arises in many applications and has received significant attention from economists, computer scientists, and social scientists. We group the various methods proposed in the literature into three general classes of algorithms (or metalearners): learning models to predict outcomes (the O-learner), learning models to predict causal effects (the E-learner), and learning models to predict optimal treatment assignments (the A-learner). We compare the metalearners in terms of (1) their level of generality and (2) the objective function they use to learn models from data; we then discuss the implications that these characteristics have for modeling and decision making. Notably, we demonstrate analytically and empirically that optimizing for the prediction of outcomes or causal effects is not the same as optimizing for treatment assignments, suggesting that in general the A-learner should lead to better treatment assignments than the other metalearners. We demonstrate the practical implications of our findings in the context of choosing, for each user, the best algorithm for playlist generation in order to optimize engagement. This is the first comparison of the three different metalearners on a real-world application at scale (based on more than half a billion individual treatment assignments). In addition to supporting our analytical findings, the results show how large A/B tests can provide substantial value for learning treatment assignment policies, rather than simply choosing the variant that performs best on average.
Abstract（参考訳）: 本研究は,多くの応用において発生し,経済学者,計算機科学者,社会科学者から注目されている,個々の治療課題の体系的比較を示す。論文で提案されている様々な手法を3つの一般的なアルゴリズム(メタルイヤー)に分類した: 結果を予測する学習モデル(oリーナー)、因果効果を予測する学習モデル(eリーナー)、最適な治療課題を予測するための学習モデル(aリーナー)。我々は,(1)一般性のレベルと(2)データからモデルを学ぶために使用する目的関数を比較し,これらの特徴がモデリングや意思決定に持つ意味について考察する。特に, 結果や因果効果の予測を最適化することは, 治療課題の最適化と同等ではなく, 一般にAラーナーは, 他のメタナーよりも優れた治療課題に導かれることが示唆された。本研究は,各ユーザに対して,エンゲージメントを最適化するためにプレイリスト生成に最適なアルゴリズムを選択するという文脈で,本研究の実用的意義を示す。これは、実世界のアプリケーション(50億以上の個別の処理課題に基づく)における3つの異なるメタラーナーの最初の比較である。分析結果の裏付けに加えて,a/bテストの規模は,平均的にベストな亜種を単に選択するのではなく,治療割当方針の学習にどの程度の価値があるかを示した。

関連論文リスト

Evolved SampleWeights for Bias Mitigation: Effectiveness Depends on Optimization Objectives [0.36569643583149225]
実世界のデータに基づいてトレーニングされた機械学習モデルは、必然的に偏見のある予測を生み出す可能性がある。重み付け(reweighting)は、モデルトレーニングで使用される各データポイントに重みを割り当てることで、モデル予測におけるそのようなバイアスを軽減する方法である。進化したサンプル重みは、代替重み付け法よりも公正度と予測性能のトレードオフが良いモデルを生成することができることを示す。
論文参考訳（メタデータ） (2025-11-25T22:50:59Z)
What Do Learning Dynamics Reveal About Generalization in LLM Reasoning? [83.83230167222852]
モデルの一般化動作は,事前記憶列車の精度と呼ばれるトレーニング指標によって効果的に特徴づけられることがわかった。モデルの学習行動と一般化を結びつけることで、トレーニング戦略に目標とする改善を導くことができる。
論文参考訳（メタデータ） (2024-11-12T09:52:40Z)
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data [102.16105233826917]
好みラベルからの学習は、微調整された大きな言語モデルにおいて重要な役割を果たす。好みの微調整には、教師付き学習、オンライン強化学習(RL)、コントラスト学習など、いくつかの異なるアプローチがある。
論文参考訳（メタデータ） (2024-04-22T17:20:18Z)
HyperImpute: Generalized Iterative Imputation with Automatic Model Selection [77.86861638371926]
カラムワイズモデルを適応的かつ自動的に構成するための一般化反復計算フレームワークを提案する。既製の学習者,シミュレータ,インターフェースを備えた具体的な実装を提供する。
論文参考訳（メタデータ） (2022-06-15T19:10:35Z)
An Empirical Investigation of Commonsense Self-Supervision with Knowledge Graphs [67.23285413610243]
大規模知識グラフから抽出した情報に基づく自己監督は、言語モデルの一般化を改善することが示されている。本研究では,言語モデルに適用可能な合成データを生成するための知識サンプリング戦略とサイズの影響について検討する。
論文参考訳（メタデータ） (2022-05-21T19:49:04Z)
Nonparametric Estimation of Heterogeneous Treatment Effects: From Theory to Learning Algorithms [91.3755431537592]
プラグイン推定と擬似出力回帰に依存する4つの幅広いメタ学習戦略を解析する。この理論的推論を用いて、アルゴリズム設計の原則を導出し、分析を実践に翻訳する方法について強調する。
論文参考訳（メタデータ） (2021-01-26T17:11:40Z)
Machine learning with incomplete datasets using multi-objective optimization models [1.933681537640272]
分類モデルが学習されている間、欠落した値を扱うオンラインアプローチを提案する。命令とモデル選択のための2つの目的関数を持つ多目的最適化モデルを開発する。 NSGA IIに基づく進化的アルゴリズムを用いて最適解を求める。
論文参考訳（メタデータ） (2020-12-04T03:44:33Z)
View selection in multi-view stacking: Choosing the meta-learner [0.2812395851874055]
マルチビュー・スタックング(Multi-view stacking)は、異なるビューからの情報を組み合わせて同じオブジェクト群を記述するフレームワークである。このフレームワークでは、各ビューに対してベースラーナーアルゴリズムを個別にトレーニングし、その予測をメタラーナーアルゴリズムで組み合わせる。
論文参考訳（メタデータ） (2020-10-30T13:45:14Z)
Double Robust Representation Learning for Counterfactual Prediction [68.78210173955001]
そこで本稿では, 対実予測のための2次ロバスト表現を学習するための, スケーラブルな新しい手法を提案する。我々は、個々の治療効果と平均的な治療効果の両方に対して、堅牢で効率的な対実的予測を行う。このアルゴリズムは,実世界の最先端技術と合成データとの競合性能を示す。
論文参考訳（メタデータ） (2020-10-15T16:39:26Z)
Does imputation matter? Benchmark for predictive models [5.802346990263708]
本稿では,予測モデルに対するデータ計算アルゴリズムの実証的効果を体系的に評価する。主な貢献は,(1)実生活の分類タスクに基づく経験的ベンチマークのための一般的な手法の推薦である。
論文参考訳（メタデータ） (2020-07-06T15:47:36Z)
Bayesian Meta-Prior Learning Using Empirical Bayes [3.666114237131823]
本稿では,情報的事前の欠如とパラメータ学習率の制御能力に対処する階層的経験ベイズ手法を提案する。本手法は,データ自体から経験的メタプライヤを学習し,その学習率を1次および2次の特徴の分離に利用する。スパースデータの最適化は、しばしば課題となるため、私たちの発見は有望です。
論文参考訳（メタデータ） (2020-02-04T05:08:17Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。