Fugu-MT 論文翻訳(概要): Why Authors and Maintainers Link (or Don't Link) Their PyPI Libraries to Code Repositories and Donation Platforms

論文の概要: Why Authors and Maintainers Link (or Don't Link) Their PyPI Libraries to Code Repositories and Donation Platforms

arxiv url: http://arxiv.org/abs/2601.15139v1
Date: Wed, 21 Jan 2026 16:13:57 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-22 21:27:50.451815
Title: Why Authors and Maintainers Link (or Don't Link) Their PyPI Libraries to Code Repositories and Donation Platforms
Title（参考訳）: 著者と管理者がPyPIライブラリをコードリポジトリと寄付プラットフォームにリンクする(リンクしない)理由
Authors: Alexandros Tsakpinis, Nicolas Raube, Alexander Pretschner,
Abstract要約: Python Package Index(PyPI)上のライブラリのメタデータは、オープンソースライブラリの透明性、信頼性、持続性をサポートする上で重要な役割を果たす。本稿は,5万PyPIの著者とメンテナに送付された2つの対象調査を組み合わせた大規模実証研究である。我々は,大規模言語モデル(LLM)に基づくトピックモデリングを用いて1,400以上の応答を分析し,リポジトリと寄付プラットフォームのリンクに関連する重要なモチベーションと障壁を明らかにする。
参考スコア（独自算出の注目度）: 83.16077040470975
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Metadata of libraries on the Python Package Index (PyPI)-including links to source code repositories and donation platforms-plays a critical role in supporting the transparency, trust, and sustainability of open-source libraries. Yet, many packages lack such metadata, and little is known about the underlying reasons. This paper presents a large-scale empirical study combining two targeted surveys sent to 50,000 PyPI authors and maintainers. We analyze more than 1,400 responses using large language model (LLM)-based topic modeling to uncover key motivations and barriers related to linking repositories and donation platforms. While repository URLs are often linked to foster collaboration, increase transparency, and enable issue tracking, some maintainers omit them due to oversight, laziness, or the perceived irrelevance to their project. Donation platform links are reported to support open source work or receive financial contributions, but are hindered by skepticism, technical friction, and organizational constraints. Cross-cutting challenges-such as outdated links, lack of awareness, and unclear guidance-affect both types of metadata. We further assess the robustness of our topic modeling pipeline across 30 runs (84% lexical and 89% semantic similarity) and validate topic quality with 23 expert raters (Randolph's kappa = 0.55). The study contributes empirical insights into PyPI's metadata practices and provides recommendations for improving them, while also demonstrating the effectiveness of our topic modeling approach for analyzing short-text survey responses.
Abstract（参考訳）: Python Package Index(PyPI)上のライブラリのメタデータ - ソースコードリポジトリや寄付プラットフォームへのリンクを含む - は、オープンソースライブラリの透明性、信頼性、持続性をサポートする上で重要な役割を果たす。しかし、多くのパッケージにはそのようなメタデータが欠けており、その根底にある理由についてはほとんど分かっていない。本稿は,5万 PyPI の著者とメンテナに送付された2つの対象調査を組み合わせた大規模実証研究である。我々は,大規模言語モデル(LLM)に基づくトピックモデリングを用いて1,400以上の応答を分析し,リポジトリと寄付プラットフォームのリンクに関連する重要なモチベーションと障壁を明らかにする。リポジトリURLは、しばしばコラボレーションの促進、透明性の向上、イシュートラッキングの有効化にリンクされているが、一部のメンテナは、監視、怠慢、プロジェクトに対する認識できないため、それらを省略している。寄付プラットフォームリンクは、オープンソースの作業を支援したり、金銭的な貢献を受けるように報告されているが、懐疑主義、技術的な摩擦、組織的な制約によって妨げられている。横断的課題 - 古いリンク、認識の欠如、ガイダンスの欠如などは、両方のメタデータに影響を与えます。さらに、30ランにわたるトピックモデリングパイプラインの堅牢性(84%の語彙と89%のセマンティックな類似性)を評価し、23のエキスパートトレーナー(Randolph's kappa = 0.55)でトピック品質を検証する。本研究は,PyPIのメタデータの実践に関する実証的な知見を提供し,その改善を推奨するとともに,短文アンケート応答の分析におけるトピックモデリング手法の有効性を実証する。

論文の概要: Why Authors and Maintainers Link (or Don't Link) Their PyPI Libraries to Code Repositories and Donation Platforms

関連論文リスト