Fugu-MT 論文翻訳(概要): Using the Full-text Content of Academic Articles to Identify and Evaluate Algorithm Entities in the Domain of Natural Language Processing

論文の概要: Using the Full-text Content of Academic Articles to Identify and Evaluate Algorithm Entities in the Domain of Natural Language Processing

arxiv url: http://arxiv.org/abs/2010.10817v1
Date: Wed, 21 Oct 2020 08:24:18 GMT
ステータス: 翻訳完了
システム内更新日: 2022-10-04 23:14:24.604924
Title: Using the Full-text Content of Academic Articles to Identify and Evaluate Algorithm Entities in the Domain of Natural Language Processing
Title（参考訳）: 学術論文のフルテキストコンテンツを用いた自然言語処理領域におけるアルゴリズムエンティティの同定と評価
Authors: Yuzhuo Wang, Chengzhi Zhang
Abstract要約: 本稿では、自然言語処理(NLP)の分野を例として取り上げ、この分野の学術論文からアルゴリズムを同定する。論文内容を手動で注釈付けしてアルゴリズムの辞書を構築し、辞書にアルゴリズムを含む文を辞書ベースのマッチングにより抽出する。アルゴリズムに言及する記事の数は、そのアルゴリズムの影響を分析する指標として使用される。
参考スコア（独自算出の注目度）: 7.163189900803623
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In the era of big data, the advancement, improvement, and application of algorithms in academic research have played an important role in promoting the development of different disciplines. Academic papers in various disciplines, especially computer science, contain a large number of algorithms. Identifying the algorithms from the full-text content of papers can determine popular or classical algorithms in a specific field and help scholars gain a comprehensive understanding of the algorithms and even the field. To this end, this article takes the field of natural language processing (NLP) as an example and identifies algorithms from academic papers in the field. A dictionary of algorithms is constructed by manually annotating the contents of papers, and sentences containing algorithms in the dictionary are extracted through dictionary-based matching. The number of articles mentioning an algorithm is used as an indicator to analyze the influence of that algorithm. Our results reveal the algorithm with the highest influence in NLP papers and show that classification algorithms represent the largest proportion among the high-impact algorithms. In addition, the evolution of the influence of algorithms reflects the changes in research tasks and topics in the field, and the changes in the influence of different algorithms show different trends. As a preliminary exploration, this paper conducts an analysis of the impact of algorithms mentioned in the academic text, and the results can be used as training data for the automatic extraction of large-scale algorithms in the future. The methodology in this paper is domain-independent and can be applied to other domains.
Abstract（参考訳）: ビッグデータの時代、学術研究におけるアルゴリズムの進歩、改善、応用は、異なる分野の発展を促進する上で重要な役割を果たしてきた。様々な分野、特にコンピュータ科学の学術論文には、多くのアルゴリズムが含まれている。論文の全文コンテンツからアルゴリズムを識別することで、特定の分野におけるポピュラーなアルゴリズムや古典的なアルゴリズムを決定でき、研究者がアルゴリズムや分野の包括的な理解を得るのに役立つ。本稿では,自然言語処理(NLP)の分野を例として取り上げ,その分野の学術論文からアルゴリズムを同定する。論文内容を手動で注釈付けしてアルゴリズムの辞書を構築し、辞書にアルゴリズムを含む文を辞書ベースのマッチングにより抽出する。アルゴリズムに言及する記事の数は、そのアルゴリズムの影響を分析する指標として使用される。以上の結果から,nlp論文に最も影響の大きいアルゴリズムが示され,分類アルゴリズムがハイインパクトアルゴリズムの中で最も高い割合を表わすことが示された。さらに、アルゴリズムの影響の進化は、分野における研究課題やトピックの変化を反映しており、異なるアルゴリズムの影響の変化は異なる傾向を示している。予備的な調査として,本論文では,学術論文で言及されているアルゴリズムの影響を解析し,将来,大規模アルゴリズムの自動抽出のためのトレーニングデータとして利用することができる。本稿ではドメインに依存しない方法論を他のドメインに適用できる。

関連論文リスト

Evolutionary Algorithms Approach For Search Based On Semantic Document Similarity [0.0]
我々は,様々なテキスト表現技術を用いて,クラスタリング,レコメンデーション,質問応答システムを開発した。テキストの意味的類似性を捉えるために,ユニバーサル・センテンス・ベクター (USE) が用いられていることを示す。また, 遺伝的アルゴリズム (GA) と微分進化 (DE) のアルゴリズムを用いて, 関連するトップN文書の検索と検索を行う。
論文参考訳（メタデータ） (2025-02-20T18:56:52Z)
From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models [63.188607839223046]
この調査は、推論中に計算をスケールするメリットに焦点を当てている。我々はトークンレベルの生成アルゴリズム、メタジェネレーションアルゴリズム、効率的な生成という3つの領域を統一的な数学的定式化の下で探索する。
論文参考訳（メタデータ） (2024-06-24T17:45:59Z)
A Gold Standard Dataset for the Reviewer Assignment Problem [117.59690218507565]
類似度スコア(Similarity score)とは、論文のレビューにおいて、レビュアーの専門知識を数値で見積もるものである。私たちのデータセットは、58人の研究者による477の自己申告された専門知識スコアで構成されています。 2つの論文をレビュアーに関連付けるタスクは、簡単なケースでは12%～30%、ハードケースでは36%～43%である。
論文参考訳（メタデータ） (2023-03-23T16:15:03Z)
An Analysis of the Effects of Decoding Algorithms on Fairness in Open-Ended Language Generation [77.44921096644698]
本稿では,復号化アルゴリズムがLMフェアネスに与える影響を体系的に分析する。公平さ、多様性、品質のトレードオフを分析します。
論文参考訳（メタデータ） (2022-10-07T21:33:34Z)
The CLRS Algorithmic Reasoning Benchmark [28.789225199559834]
アルゴリズムの学習表現は機械学習の新たな領域であり、ニューラルネットワークから古典的なアルゴリズムで概念をブリッジしようとしている。本稿では,従来のアルゴリズムを包括するCLRS Algorithmic Reasoning Benchmarkを提案する。我々のベンチマークは、ソート、探索、動的プログラミング、グラフアルゴリズム、文字列アルゴリズム、幾何アルゴリズムなど、様々なアルゴリズムの推論手順にまたがっている。
論文参考訳（メタデータ） (2022-05-31T09:56:44Z)
An Approach for Automatic Construction of an Algorithmic Knowledge Graph from Textual Resources [3.723553383515688]
本稿では,非構造化データからアルゴリズム問題の知識グラフを自動的に作成する手法を提案する。アルゴリズムKGは、アルゴリズムメタデータに追加のコンテキストと説明可能性を与える。
論文参考訳（メタデータ） (2022-05-13T18:59:23Z)
Deep Algorithm Unrolling for Biomedical Imaging [99.73317152134028]
本章では,アルゴリズムのアンロールによるバイオメディカル応用とブレークスルーについて概説する。我々はアルゴリズムのアンローリングの起源を辿り、反復アルゴリズムをディープネットワークにアンローリングする方法に関する包括的なチュートリアルを提供する。オープンな課題を議論し、今後の研究方向性を提案することで、この章を締めくくります。
論文参考訳（メタデータ） (2021-08-15T01:06:26Z)
Identifying Co-Adaptation of Algorithmic and Implementational Innovations in Deep Reinforcement Learning: A Taxonomy and Case Study of Inference-based Algorithms [15.338931971492288]
我々は、アルゴリズムの革新と実装決定を分離するために、一連の推論に基づくアクター批判アルゴリズムに焦点を当てる。実装の詳細がアルゴリズムの選択に一致すると、パフォーマンスが大幅に低下します。結果は、どの実装の詳細がアルゴリズムと共適応され、共進化しているかを示す。
論文参考訳（メタデータ） (2021-03-31T17:55:20Z)
Critical Analysis: Bat Algorithm based Investigation and Application on Several Domains [1.1802674324027231]
このアルゴリズムのアイデアはコウモリのエコーロケーション能力から取られた。バットアルゴリズムは、背景、特徴、制限の観点から詳細に与えられる。
論文参考訳（メタデータ） (2021-01-18T19:25:12Z)
A Novel Word Sense Disambiguation Approach Using WordNet Knowledge Graph [0.0]
本稿では,SCSMM (Sequential Contextual Likeity Matrix multiplication) という知識に基づく単語感覚解読アルゴリズムを提案する。 SCSMMアルゴリズムは、セマンティックな類似性、知識、文書コンテキストを組み合わせて、それぞれローカルコンテキストのメリットを利用する。提案されたアルゴリズムは、金の標準データセットの名詞を曖昧にするときに他のアルゴリズムよりも優れていた。
論文参考訳（メタデータ） (2021-01-08T06:47:32Z)
Accelerating Text Mining Using Domain-Specific Stop Word Lists [57.76576681191192]
本稿では,超平面的アプローチと呼ばれるドメイン固有語の自動抽出手法を提案する。ハイパープレーンベースのアプローチは、無関係な特徴を排除することによって、テキストの寸法を著しく削減することができる。その結果,超平面型アプローチはコーパスの寸法を90%削減し,相互情報より優れることがわかった。
論文参考訳（メタデータ） (2020-11-18T17:42:32Z)
A Survey of Embedding Space Alignment Methods for Language and Knowledge Graphs [77.34726150561087]
単語,文,知識グラフの埋め込みアルゴリズムに関する現在の研究状況について調査する。本稿では、関連するアライメント手法の分類と、この研究分野で使用されるベンチマークデータセットについて論じる。
論文参考訳（メタデータ） (2020-10-26T16:08:13Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。