Fugu-MT 論文翻訳(概要): How many preprints have actually been printed and why: a case study of computer science preprints on arXiv

論文の概要: How many preprints have actually been printed and why: a case study of computer science preprints on arXiv

arxiv url: http://arxiv.org/abs/2308.01899v1
Date: Thu, 3 Aug 2023 17:56:16 GMT
ステータス: 翻訳完了
システム内更新日: 2023-08-04 13:10:48.017157
Title: How many preprints have actually been printed and why: a case study of computer science preprints on arXiv
Title（参考訳）: arXivのコンピューターサイエンス・プレプリントのケーススタディ
Authors: Jialiang Lin, Yao Yu, Yu Zhou, Zhiyang Zhou, Xiaodong Shi
Abstract要約: 我々は、最終的にピアレビューされた会場で、どれだけのプレプリントが印刷されたかを定量化します。刊行された写本のうち、いくつかは異なるタイトルで出版され、arXivの事前版も更新されていない。コンピュータ科学の分野では、プレプリントは適切なリビジョン、複数の著者、詳細な抽象化と導入、広範囲で権威のある参照、利用可能なソースコードを特徴としている。
参考スコア（独自算出の注目度）: 9.783989953810725
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Preprints play an increasingly critical role in academic communities. There are many reasons driving researchers to post their manuscripts to preprint servers before formal submission to journals or conferences, but the use of preprints has also sparked considerable controversy, especially surrounding the claim of priority. In this paper, a case study of computer science preprints submitted to arXiv from 2008 to 2017 is conducted to quantify how many preprints have eventually been printed in peer-reviewed venues. Among those published manuscripts, some are published under different titles and without an update to their preprints on arXiv. In the case of these manuscripts, the traditional fuzzy matching method is incapable of mapping the preprint to the final published version. In view of this issue, we introduce a semantics-based mapping method with the employment of Bidirectional Encoder Representations from Transformers (BERT). With this new mapping method and a plurality of data sources, we find that 66% of all sampled preprints are published under unchanged titles and 11% are published under different titles and with other modifications. A further analysis was then performed to investigate why these preprints but not others were accepted for publication. Our comparison reveals that in the field of computer science, published preprints feature adequate revisions, multiple authorship, detailed abstract and introduction, extensive and authoritative references and available source code.
Abstract（参考訳）: プレプリントは学術界でますます重要な役割を担っている。学術誌や会議に公式提出する前に、研究者が原稿をプレプリントサーバーに投稿するよう促す理由はたくさんあるが、プレプリントの使用は、特に優先権の主張に関して、かなりの論争を巻き起こしている。本稿では,2008年から2017年にかけてarxivに提出されたコンピュータ科学用プリプリントの事例研究を行い,ピアレビューされた会場で最終的に印刷されたプレプリントの数を定量化する。これらの写本のうち、いくつかは異なるタイトルで出版され、arxivのプレプリントに更新されていない。これらの写本の場合、従来のファジィマッチング法では、プレプリントを最終版にマッピングできない。本稿では,変換器からの双方向エンコーダ表現(BERT)を用いたセマンティックスに基づくマッピング手法を提案する。この新たなマッピング手法と複数のデータソースにより,全サンプルプレプリントの66%が変更のないタイトルで公開され,11%が異なるタイトルで公開され,他の変更が加えられていることがわかった。その後、これらのプレプリントがなぜ出版に受け入れられなかったのかを調べるためにさらなる分析が行われた。コンピュータ科学の分野では、プレプリントは適切な改訂、複数著者の紹介、詳細な抽象化と紹介、広範囲かつ権威のある参照、利用可能なソースコードが特徴である。

関連論文リスト

Paper2Web: Let's Make Your Paper Alive! [51.75896846964824]
学術Webページ生成を評価するためのベンチマークデータセットとフレームワークであるPaper2Webを紹介する。 PWAgentは、科学論文をインタラクティブでマルチメディアに富んだ学術ホームページに変換する自律パイプラインである。
論文参考訳（メタデータ） (2025-10-17T17:35:58Z)
SeedPrints: Fingerprints Can Even Tell Which Seed Your Large Language Model Was Trained From [65.75182441010327]
我々は,LDMフィンガープリントのより強く,より本質的な概念であるSeedPrintsを提案する。トレーニングされていないモデルでは,パラメータのみに依存した再現可能なトークン選択バイアスが示される。 LLaMAスタイルとQwenスタイルのモデルの実験では、SeedPrintsはシードレベルの識別性を実現し、バイオメトリック指紋に似た生来からライフサイクルの識別認証を提供する。
論文参考訳（メタデータ） (2025-09-30T15:34:08Z)
Toward Reproducibility of Digital Twin Research: Exemplified with the PiCar-X [49.44419860570116]
デジタル双生児は、モノのインターネットと産業の4.0でますます重要になっている。 dtsの概念には統一された定義がなく、検証の課題に直面している。本稿では,様々なdt概念を再現可能な実験室で実証する。
論文参考訳（メタデータ） (2024-08-25T15:34:00Z)
Mapping the Increasing Use of LLMs in Scientific Papers [99.67983375899719]
2020年1月から2024年2月にかけて、arXiv、bioRxiv、Natureのポートフォリオジャーナルで950,965の論文をまとめて、体系的で大規模な分析を行った。計算機科学の論文では, LLMの使用が着実に増加し, 最大, 最速の成長が観察された。
論文参考訳（メタデータ） (2024-04-01T17:45:15Z)
CausalCite: A Causal Formulation of Paper Citations [80.82622421055734]
CausalCiteは紙の意義を測定するための新しい方法だ。これは、従来のマッチングフレームワークを高次元のテキスト埋め込みに適応させる、新しい因果推論手法であるTextMatchに基づいている。科学専門家が報告した紙衝撃と高い相関性など,各種基準におけるCausalCiteの有効性を実証する。
論文参考訳（メタデータ） (2023-11-05T23:09:39Z)
Estimating the Causal Effect of Early ArXiving on Paper Acceptance [56.538813945721685]
我々は,論文の審査期間(初期arXiving)前にarXivingが会議の受理に与える影響を推定する。以上の結果から,早期のarXivingは,論文の受容に少なからぬ影響を及ぼす可能性が示唆された。
論文参考訳（メタデータ） (2023-06-24T07:45:38Z)
Contrastive Attention Networks for Attribution of Early Modern Print [23.344655278038392]
本研究では,1500年～1800年(1500年～1800年)の英語印刷において,未知のプリンタを識別する機械学習技術を開発した。具体的には、匿名で印刷された書籍において、一意に破損した文字タイプインプリントと、既知のプリンタと連携することに焦点を当てる。
論文参考訳（メタデータ） (2023-06-12T19:57:11Z)
Cracking Double-Blind Review: Authorship Attribution with Deep Learning [43.483063713471935]
本稿では、匿名の原稿を著者に属性付けるトランスフォーマーベースのニューラルネットワークアーキテクチャを提案する。我々は、arXivで公開されているすべての研究論文を200万冊以上の原稿に活用する。本手法は, 論文の最大73%を正解する, 前代未聞の著者帰属精度を実現する。
論文参考訳（メタデータ） (2022-11-14T15:50:24Z)
Scientometric engineering: Exploring citation dynamics via arXiv eprints [0.0]
本稿では,arXiv上の150万以上の電子プリントの引用データについて検討する。典型的成長パターンと可溶化パターンは, 分野によって異なることが判明した。我々は, 励起成長と可溶化の観測量的, 時間的特性に整合したモデルを導出した。
論文参考訳（メタデータ） (2021-06-09T12:38:44Z)
Enhancing Scientific Papers Summarization with Citation Graph [78.65955304229863]
引用グラフを用いて科学論文の要約作業を再定義します。我々は,141kの研究論文を異なる領域に格納した,新しい科学論文要約データセットセマンティックスタディネットワーク(ssn)を構築した。我々のモデルは、事前訓練されたモデルと比較して競争性能を達成することができる。
論文参考訳（メタデータ） (2021-04-07T11:13:35Z)
Is preprint the future of science? A thirty year journey of online preprint services [7.063908865620109]
Preprintは、正式な査読の前に公開された科学論文のバージョンである。 1991年にarXivが発売されて以来、印刷物は紙のコピーとは対照的にインターネット上に流通してきた。
論文参考訳（メタデータ） (2021-02-17T23:08:01Z)
Preprints as accelerator of scholarly communication: An empirical analysis in Mathematics [9.899221738408581]
出版の遅れと影響の2つの影響を測定する。プレプリント版のある記事は、ソーシャルメディアで言及されることが多く、Altmetricの注意の遅れが短い。
論文参考訳（メタデータ） (2020-11-24T07:32:35Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。