Fugu-MT 論文翻訳(概要): Large Language Models for Education and Research: An Empirical and User Survey-based Analysis

論文の概要: Large Language Models for Education and Research: An Empirical and User Survey-based Analysis

arxiv url: http://arxiv.org/abs/2512.08057v1
Date: Mon, 08 Dec 2025 21:35:28 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-10 22:28:07.730509
Title: Large Language Models for Education and Research: An Empirical and User Survey-based Analysis
Title（参考訳）: 教育・研究のための大規模言語モデル:実証的およびユーザ調査に基づく分析
Authors: Md Mostafizer Rahman, Ariful Islam Shiplu, Md Faizul Ibne Amin, Yutaka Watanobe, Lu Peng,
Abstract要約: ChatGPTとDeepSeekは数学、科学、医学、文学、プログラミングにおいて強力な能力を示している。テキスト生成、プログラミング、特殊問題解決において、これらのLLMのパフォーマンスをベンチマークした。どちらのモデルも医学的に正確な診断出力を提供し、複雑な数学的問題を効果的に解決する。
参考スコア（独自算出の注目度）: 0.7077668691277661
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Pretrained Large Language Models (LLMs) have achieved remarkable success across diverse domains, with education and research emerging as particularly impactful areas. Among current state-of-the-art LLMs, ChatGPT and DeepSeek exhibit strong capabilities in mathematics, science, medicine, literature, and programming. In this study, we present a comprehensive evaluation of these two LLMs through background technology analysis, empirical experiments, and a real-world user survey. The evaluation explores trade-offs among model accuracy, computational efficiency, and user experience in educational and research affairs. We benchmarked these LLMs performance in text generation, programming, and specialized problem-solving. Experimental results show that ChatGPT excels in general language understanding and text generation, while DeepSeek demonstrates superior performance in programming tasks due to its efficiency- focused design. Moreover, both models deliver medically accurate diagnostic outputs and effectively solve complex mathematical problems. Complementing these quantitative findings, a survey of students, educators, and researchers highlights the practical benefits and limitations of these models, offering deeper insights into their role in advancing education and research.
Abstract（参考訳）: 事前訓練された大規模言語モデル(LLM)は、様々な領域で顕著な成功を収めており、特に影響力のある分野として教育と研究が出現している。現在最先端のLLMの中で、ChatGPTとDeepSeekは数学、科学、医学、文学、プログラミングに強力な能力を示している。本研究では,背景技術分析,実証実験,実世界のユーザ調査を通じて,これらの2つのLCMの総合的な評価を行う。本評価では, モデル精度, 計算効率, ユーザエクスペリエンスのトレードオフについて検討する。テキスト生成、プログラミング、特殊問題解決において、これらのLLMのパフォーマンスをベンチマークした。実験の結果,ChatGPTは汎用言語理解とテキスト生成に優れており,DeepSeekは効率を重視した設計により,プログラミングタスクにおける優れた性能を示している。さらに、どちらのモデルも医学的に正確な診断出力を提供し、複雑な数学的問題を効果的に解決する。これらの定量的発見を補完し、学生、教育者、研究者による調査は、これらのモデルの実用的メリットと限界を強調し、教育と研究の進展における彼らの役割について深い洞察を提供する。

論文の概要: Large Language Models for Education and Research: An Empirical and User Survey-based Analysis

関連論文リスト