Fugu-MT 論文翻訳(概要): Two-Stage Classifier for Campaign Negativity Detection using Axis Embeddings: A Case Study on Tweets of Political Users during 2021 Presidential Election in Iran

論文の概要: Two-Stage Classifier for Campaign Negativity Detection using Axis Embeddings: A Case Study on Tweets of Political Users during 2021 Presidential Election in Iran

arxiv url: http://arxiv.org/abs/2311.00143v1
Date: Tue, 31 Oct 2023 20:31:41 GMT
ステータス: 翻訳完了
システム内更新日: 2023-11-02 15:46:34.846805
Title: Two-Stage Classifier for Campaign Negativity Detection using Axis Embeddings: A Case Study on Tweets of Political Users during 2021 Presidential Election in Iran
Title（参考訳）: 軸埋め込みを用いた選挙ネガティビティ検出のための2段階分類法:2021年イラン大統領選挙における政治ユーザのつぶやきを事例として
Authors: Fatemeh Rajabi and Ali Mohades
Abstract要約: 世界中の選挙において、候補者は失敗や時間的プレッシャーのため、ネガティビティへのキャンペーンを転換する可能性がある。本稿では,2つの機械学習モデルの強みを組み合わせた2段階分類器によるキャンペーン負性検出のハイブリッドモデルを提案する。我々の最良のモデル(RF-RF)はマクロF1スコアの79%、重み付きF1スコアの82%を達成できた。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In elections around the world, the candidates may turn their campaigns toward negativity due to the prospect of failure and time pressure. In the digital age, social media platforms such as Twitter are rich sources of political discourse. Therefore, despite the large amount of data that is published on Twitter, the automatic system for campaign negativity detection can play an essential role in understanding the strategy of candidates and parties in their campaigns. In this paper, we propose a hybrid model for detecting campaign negativity consisting of a two-stage classifier that combines the strengths of two machine learning models. Here, we have collected Persian tweets from 50 political users, including candidates and government officials. Then we annotated 5,100 of them that were published during the year before the 2021 presidential election in Iran. In the proposed model, first, the required datasets of two classifiers based on the cosine similarity of tweet embeddings with axis embeddings (which are the average of embedding in positive and negative classes of tweets) from the training set (85\%) are made, and then these datasets are considered the training set of the two classifiers in the hybrid model. Finally, our best model (RF-RF) was able to achieve 79\% for the macro F1 score and 82\% for the weighted F1 score. By running the best model on the rest of the tweets of 50 political users that were published one year before the election and with the help of statistical models, we find that the publication of a tweet by a candidate has nothing to do with the negativity of that tweet, and the presence of the names of political persons and political organizations in the tweet is directly related to its negativity.
Abstract（参考訳）: 世界中の選挙において、候補者は失敗や時間的プレッシャーのため、ネガティビティへのキャンペーンを転換する可能性がある。デジタル時代には、Twitterのようなソーシャルメディアプラットフォームは政治的議論の豊富な源泉となっている。したがって、Twitter上で大量のデータが公開されているにもかかわらず、キャンペーン否定検出の自動システムは、候補者や参加者のキャンペーン戦略を理解する上で重要な役割を果たす可能性がある。本論文では,2つの機械学習モデルの強みを組み合わせた2段階の分類器からなるキャンペーンネガティビティを検出するハイブリッドモデルを提案する。ここでは、候補者や政府高官を含む50人の政治ユーザーからペルシア人のツイートを収集した。そして2021年のイラン大統領選挙の前年に発行された5100冊を注釈した。提案モデルでは,まず,訓練セット(85\%)から,軸埋め込み(ツイートの正のクラスと負のクラスへの埋め込みの平均値)を用いたツイート埋め込みのコサイン類似性に基づく2つの分類器の必要なデータセットを作成し,それらのデータセットをハイブリッドモデルにおける2つの分類器のトレーニングセットと見なす。最後に,最良モデル(rf-rf)はマクロf1スコアで79\%,重み付けf1スコアで82\%を達成した。選挙の1年前に公表された50人の政治ユーザーのツイートの残りの最良のモデルを実行し、統計モデルの助けを借りて、候補者によるツイートの公開は、そのツイートの否定性とは無関係であり、そのツイートにおける政治家や政治組織の名前の存在は、その否定性に直接関係していることがわかった。

関連論文リスト

Large-Scale, Longitudinal Study of Large Language Models During the 2024 US Election Season [43.092041950140164]
2024年アメリカ合衆国大統領選挙は、大きな言語モデル(LLM)が普及して以来、アメリカ合衆国で最初の主要大会である。この瞬間は、LLMが情報エコシステムをどう形成し、政治談話に影響を与えるかという緊急の疑問を提起する。我々は,2024年7月から11月にかけて,ほぼ毎日のケイデンスに関する12,000以上の質問を構造化された調査を用いて,12種類のモデルについて大規模な縦断調査を行った。
論文参考訳（メタデータ） (2025-09-22T22:04:19Z)
On the Use of Proxies in Political Ad Targeting [49.61009579554272]
我々は、主要な政治広告主がプロキシ属性をターゲットとして緩和を回避したことを示す。本研究は政治広告の規制に関する議論に重要な意味を持つ。
論文参考訳（メタデータ） (2024-10-18T17:15:13Z)
Representation Bias in Political Sample Simulations with Large Language Models [54.48283690603358]
本研究は,大規模言語モデルを用いた政治サンプルのシミュレーションにおけるバイアスの同定と定量化を目的とする。 GPT-3.5-Turboモデルを用いて、米国選挙研究、ドイツ縦割り選挙研究、ズオビアオデータセット、中国家族パネル研究のデータを活用する。
論文参考訳（メタデータ） (2024-07-16T05:52:26Z)
Context-Based Tweet Engagement Prediction [0.0]
この論文は、ツイートのエンゲージメントの可能性を予測するために、コンテキスト単独がいかにうまく使われるかを調査する。私たちはTU WienのLittle Big Data ClusterにSparkエンジンを使用して、スケーラブルなデータ前処理、機能エンジニアリング、機能選択、マシンラーニングパイプラインを作成しました。また, 予測アルゴリズム, トレーニングデータセットサイズ, トレーニングデータセットサンプリング手法, 特徴選択などの因子が, 結果に有意な影響を及ぼすことがわかった。
論文参考訳（メタデータ） (2023-09-28T08:36:57Z)
Electoral Agitation Data Set: The Use Case of the Polish Election [3.671887117122512]
ポーランド語における選挙の扇動を検出するための最初の公開データセットを提示する。これには、法的に条件付けされた4つのカテゴリにタグ付けされた6,112人の人手によるツイートが含まれている。新たに作成されたデータセットは、HerBERTと呼ばれるポーランド語モデルの微調整に使用された。
論文参考訳（メタデータ） (2023-07-13T18:14:43Z)
Computational Assessment of Hyperpartisanship in News Titles [55.92100606666497]
われわれはまず、超党派ニュースタイトル検出のための新しいデータセットを開発するために、人間の誘導する機械学習フレームワークを採用する。全体的に右派メディアは比例的に超党派的なタイトルを使う傾向にある。我々は、外国問題、政治システム、ニュースタイトルにおける過党主義を示唆する社会問題を含む3つの主要なトピックを識別する。
論文参考訳（メタデータ） (2023-01-16T05:56:58Z)
Design and analysis of tweet-based election models for the 2021 Mexican legislative election [55.41644538483948]
選挙日前の6ヶ月の間に、1500万件の選挙関連ツイートのデータセットを使用します。地理的属性を持つデータを用いたモデルが従来のポーリング法よりも精度と精度で選挙結果を決定することがわかった。
論文参考訳（メタデータ） (2023-01-02T12:40:05Z)
Twitter-COMMs: Detecting Climate, COVID, and Military Multimodal Misinformation [83.2079454464572]
本稿では,DARPAセマンティック・フォレスティクス(SemaFor)プログラムにおける画像テキスト不整合検出へのアプローチについて述べる。 Twitter-COMMsは大規模マルチモーダルデータセットで、884万のツイートが気候変動、新型コロナウイルス、軍用車両のトピックに関連する。我々は、最先端のCLIPモデルに基づいて、自動生成されたランダムとハードのネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガネガ
論文参考訳（メタデータ） (2021-12-16T03:37:20Z)
Shifting Polarization and Twitter News Influencers between two U.S. Presidential Elections [92.33485580547801]
我々は2016年米大統領選挙と2020年米大統領選挙の間の分極の変化を分析した。トップインフルエンサーのほとんどが、両選挙の間にメディア組織に所属していた。 2020年のトップインフルエンサーの75%は2016年は存在しなかった。
論文参考訳（メタデータ） (2021-11-03T20:08:54Z)
Prediction of Political Leanings of Chinese Speaking Twitter Users [0.0]
まず、有名な政治人物とその関連ユーザーのツイートをスクラップしてデータを収集する。第二に、中国共産党への承認を示すグループと、そうでないグループである。 Twitter上のツイートからユーザの政治的スタンスを理解するために,高精度な分類モデルを生成する。
論文参考訳（メタデータ） (2021-10-12T03:18:10Z)
Political Advertising Dataset: the use case of the Polish 2020 Presidential Elections [4.560033258611709]
ポーランド語における特定のテキストチャンクや政治広告のカテゴリを検出するための、最初の公開データセットを提示する。 9つのカテゴリーにタグ付けされた1,705件の人称注釈付きツイートが含まれており、これはポーランドの選挙法の下でのキャンペーンである。
論文参考訳（メタデータ） (2020-06-17T23:58:01Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。