Fugu-MT Paper Translation (Abstract): Can You Fool AI by Doing a 180? – A Case Study on Authorship Analysis of Texts by Arata Osada

arxiv url: http://arxiv.org/abs/2207.09085v1
Date: Tue, 19 Jul 2022 05:43:49 GMT
Status: Translation complete
Last updated in system: 2022-07-20 12:56:22.903717
Title: Can You Fool AI by Doing a 180? – A Case Study on Authorship Analysis of Texts by Arata Osada
Authors: Jagna Nieuwazny, Karol Nowakowski, Michal Ptaszynski, Fumito Masui
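Abstract summary: This paper attempts to answer a twofold question covering ethics and authorship analysis. First, we were interested in whether an author identification system can correctly attribute works to an author who has undergone a major psychological transition over the years. Second, from the point of view of the evolution of an author's ethical values, we examined what it would mean if an authorship attribution system has difficulty detecting single authorship.
Reference score (attention score computed independently by this site): 2.6954666679827137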
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: This paper is our attempt at answering a twofold question covering the areas of ethics and authorship analysis. Firstly, since the methods used for performing authorship analysis imply that an author can be recognized by the content he or she creates, we were interested in finding out whether it would be possible for an author identification system to correctly attribute works to authors if in the course of years they have undergone a major psychological transition. Secondly, and from the point of view of the evolution of an author's ethical values, we checked what it would mean if the authorship attribution system encounters difficulties in detecting single authorship. We set out to answer those questions by performing a binary authorship analysis task using a text classifier based on a pre-trained transformer model and a baseline method relying on conventional similarity metrics. For the test set, we chose works of Arata Osada, a Japanese educator and specialist in the history of education, with half of them being books written before World War II and the other half in the 1950s, in between which he underwent a transformation in terms of political opinions. As a result, we were able to confirm that for texts authored by Arata Osada over a time span of more than 10 years, the classification accuracy drops by a large margin and is substantially lower than for texts by other non-fiction writers, while confidence scores of the predictions remain at a similar level to those for a shorter time span. This indicates that the classifier was in many instances tricked into deciding that texts written over a time span of multiple years were actually written by two different people. This in turn leads us to believe that such a change can affect authorship analysis, and that historical events have a great impact on a person's ethical outlook as expressed in their writings.
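The abstract mentions a baseline that relies on conventional similarity metrics but does not say which ones. As a rough illustration only, the sketch below frames the binary "same author or not" decision as cosine similarity over character trigram counts. The choice of trigrams, the cosine metric, the function names, and the 0.5 threshold are all assumptions made for this example and are not taken from the paper.

```python
import math
from collections import Counter


def char_ngrams(text: str, n: int = 3) -> Counter:
    """Count overlapping character n-grams (trigrams by default) in a text."""
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse n-gram count vectors."""
    # Counter returns 0 for missing keys, so only shared n-grams contribute.
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a text shorter than n characters has no n-grams
    return dot / (norm_a * norm_b)


def same_author(text_a: str, text_b: str, threshold: float = 0.5) -> bool:
    """Binary decision: True if similarity reaches the hypothetical threshold."""
    similarity = cosine_similarity(char_ngrams(text_a), char_ngrams(text_b))
    return similarity >= threshold
```

In practice the threshold would have to be tuned on held-out text pairs with known authorship. Character n-grams are used here because they need no tokenizer, which is convenient for Japanese text.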

Related papers

ScholarPeer: A Context-Aware Multi-Agent Framework for Automated Peer Review [48.60540055009675]
ScholarPeer is a search-enabled multi-agent framework designed to emulate the cognitive processes of a senior researcher. We evaluate ScholarPeer on DeepReview-13K, and the results show that ScholarPeer achieves significant win rates against state-of-the-art approaches in side-by-side evaluations.
Reference translation of paper (metadata) (2026-01-30T06:54:55Z)
Can professional translators identify machine-generated text? [0.0]
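This study examines whether professional translators can reliably identify short stories generated in Italian by artificial intelligence (AI) without prior specialized training. Six translators took part in the experiment and evaluated three anonymized short stories. Low burstiness and narrative contradictions emerged as the most reliable indicators of synthetic authorship.
Reference translation of paper (metadata) (2026-01-22T10:25:52Z)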
Author-in-the-Loop Response Generation and Evaluation: Integrating Author Expertise and Intent in Responses to Peer Review [53.99984738447279]
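Recent work frames this task as automatic text generation that draws on author expertise and intent. We introduce REspGen, which integrates explicit author input, multi-attribute control, and evaluation-guided refinement. To support this formulation, we construct Re$3$Align, the first large-scale dataset of aligned review-response-revision triplets.
Reference translation of paper (metadata) (2026-01-19T14:07:10Z)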
The Reader is the Metric: How Textual Features and Reader Profiles Explain Conflicting Evaluations of AI Creative Writing [1.3654846342364306]
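Using five public datasets (1,471 stories; 101 annotators, including critics, students, and lay readers), we extract 17 reference-free textual features. We model individual readers' preferences and derive feature-importance vectors that reflect their textual priorities. This study quantitatively explains how measurements of literary quality depend on the alignment between textual features and reader preferences.
Reference translation of paper (metadata) (2025-06-03T18:50:22Z)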
Stylomech: Unveiling Authorship via Computational Stylometry in English and Romanized Sinhala [0.0]
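Authorship attribution in both English and Romanized Sinhala has become a major requirement in recent decades. This study contributes significantly to the field of computational linguistics. By extending the scope of authorship attribution to a wider range of linguistic contexts, it helps foster trust and accountability in digital communication.
Reference translation of paper (metadata) (2025-01-16T14:26:48Z)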
A Bayesian Approach to Harnessing the Power of LLMs in Authorship Attribution [57.309390098903]
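Authorship attribution aims to identify the origin or author of a document. Large language models (LLMs), with their deep reasoning capabilities and ability to maintain long-range textual associations, offer a promising alternative. Results on the IMDb and blog datasets show 85% accuracy in one-shot authorship classification across ten authors.
Reference translation of paper (metadata) (2024-10-29T04:14:23Z)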
Forging the Forger: An Attempt to Improve Authorship Verification via Data Augmentation [52.72682366640554]
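Authorship Verification (AV) is a text classification task that infers whether a text was written by one specific author or by someone else. Many AV systems have been shown to be vulnerable to adversarial attacks, in which a malicious author actively tries to fool the classifier by either concealing their own writing style or imitating the style of another author.
Reference translation of paper (metadata) (2024-03-17T16:36:26Z)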
A Literature Review of Literature Reviews in Pattern Analysis and Machine Intelligence [58.6354685593418]
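This paper proposes article-level, field-normalized, large-language-model-based bibliometric indicators for evaluating reviews. Newly emerging AI-generated literature reviews are also evaluated. The study offers insights into the current challenges of literature reviews and envisions future directions for their development.
Reference translation of paper (metadata) (2024-02-20T11:28:50Z)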
A Ship of Theseus: Curious Cases of Paraphrasing in LLM-Generated Texts [11.430810978707173]
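Our study raises an intriguing question: does a text retain its original authorship after it has been paraphrased many times? Using a computational approach, we find that the performance drop in text classification models is closely tied to how far each paraphrasing step deviates from the original author's style.
Reference translation of paper (metadata) (2023-11-14T18:40:42Z)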
Verifying the Robustness of Automatic Credibility Assessment [50.55687778699995]
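We show that meaning-preserving changes to the input text can mislead models. We also introduce BODEGA, a benchmark for testing both victim models and attack methods on misinformation detection tasks. Our experimental results show that modern large language models are often more vulnerable to attacks than earlier, smaller solutions.
Reference translation of paper (metadata) (2023-03-14T16:11:47Z)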
Same or Different? Diff-Vectors for Authorship Analysis [78.83284164605473]
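In classical authorship analysis, a feature vector represents a document, the value of a feature represents (an increasing function of) the relative frequency of that feature in the document, and the class label represents the author of the document. Our experiments address same-author verification, authorship verification, and closed-set authorship attribution. Diff-vectors (DVs) are naturally suited to the first problem, but we also provide two new methods for solving the second and third.
Reference translation of paper (metadata) (2023-01-24T08:48:12Z)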
PART: Pre-trained Authorship Representation Transformer [64.78260098263489]
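Authors writing documents imprint identifying information within their texts, such as vocabulary, register, punctuation, misspellings, and emoji usage. Previous works used hand-crafted features or classification tasks to train their authorship models, leading to poor performance on out-of-domain authors. We propose a contrastively trained model that learns authorship embeddings instead of semantics.
Reference translation of paper (metadata) (2022-09-30T11:08:39Z)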
TraSE: Towards Tackling Authorial Style from a Cognitive Science Perspective [4.123763595394021]
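Authorship attribution experiments with over 27,000 authors and 1.4 million samples in a cross-domain scenario resulted in 90% attribution accuracy. A qualitative analysis of TraSE using physical human characteristics, such as age, is performed to validate the claim that it captures cognitive traits.
Reference translation of paper (metadata) (2022-06-21T19:55:07Z)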
LG4AV: Combining Language Models and Graph Neural Networks for Author Verification [0.11421942894219898]
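We present LG4AV, which combines language models and graph neural networks for authorship verification. By directly feeding the available texts into a pre-trained transformer architecture, our model does not need any hand-crafted stylometric features. The model can also benefit from relations between authors that are meaningful with respect to the verification process.
Reference translation of paper (metadata) (2021-09-03T12:45:28Z)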
A computational model implementing subjectivity with the 'Room Theory'. The case of detecting Emotion from Text [68.8204255655161]
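This work introduces a new method that accounts for subjectivity and general context dependency in text analysis. Using similarity measures between words, we can extract the relative relevance of the elements in the benchmark. The method is applicable to all cases in which subjective evaluation is relevant to understanding the relative value or meaning of a text.
Reference translation of paper (metadata) (2020-05-12T21:26:04Z)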
Automatic Identification of Types of Alterations in Historical Manuscripts [0.0]
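We propose a machine-learning-based approach for classifying alterations in documents. In particular, we present a new probabilistic model that classifies content-related alterations. On unlabeled data, applying alterLDA yields interesting new insights into the alteration behavior of authors, editors, and other manuscript contributors.
Reference translation of paper (metadata) (2020-03-20T08:05:27Z)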

The related-papers list is generated automatically from the titles and abstracts of the papers on this site.

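This page shows information for the specified paper.
The operator of this site does not guarantee the quality of this site (including all information and translations) and accepts no responsibility for any consequences arising from the use of this site (including all information and translations).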