Fugu-MT 論文翻訳(概要): Evaluating the Use of LLMs for Automated DOM-Level Resolution of Web Performance Issues

論文の概要: Evaluating the Use of LLMs for Automated DOM-Level Resolution of Web Performance Issues

arxiv url: http://arxiv.org/abs/2601.05502v1
Date: Fri, 09 Jan 2026 03:21:49 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-12 17:41:49.827184
Title: Evaluating the Use of LLMs for Automated DOM-Level Resolution of Web Performance Issues
Title（参考訳）: Webパフォーマンス問題のDOMレベル自動解決におけるLLMの利用評価
Authors: Gideon Peters, SayedHassan Khatoonabadi, Emad Shihab,
Abstract要約: 本研究は文書オブジェクトモデル(DOM)の変更に焦点を当てる。大規模言語モデル(LLM)の最近の進歩は、この複雑なタスクを自動化するための有望な道を提供する。本研究は、Webパフォーマンス自動解決のための9つの最先端LCMの有効性を評価する。
参考スコア（独自算出の注目度）: 3.9189409002585567
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Users demand fast, seamless webpage experiences, yet developers often struggle to meet these expectations within tight constraints. Performance optimization, while critical, is a time-consuming and often manual process. One of the most complex tasks in this domain is modifying the Document Object Model (DOM), which is why this study focuses on it. Recent advances in Large Language Models (LLMs) offer a promising avenue to automate this complex task, potentially transforming how developers address web performance issues. This study evaluates the effectiveness of nine state-of-the-art LLMs for automated web performance issue resolution. For this purpose, we first extracted the DOM trees of 15 popular webpages (e.g., Facebook), and then we used Lighthouse to retrieve their performance audit reports. Subsequently, we passed the extracted DOM trees and corresponding audits to each model for resolution. Our study considers 7 unique audit categories, revealing that LLMs universally excel at SEO & Accessibility issues. However, their efficacy in performance-critical DOM manipulations is mixed. While high-performing models like GPT-4.1 delivered significant reductions in areas like Initial Load, Interactivity, and Network Optimization (e.g., 46.52% to 48.68% audit incidence reductions), others, such as GPT-4o-mini, notably underperformed, consistently. A further analysis of these modifications showed a predominant additive strategy and frequent positional changes, alongside regressions particularly impacting Visual Stability.
Abstract（参考訳）: ユーザは高速でシームレスなWebページエクスペリエンスを要求するが、開発者は厳しい制約の中でこれらの期待を満たすのに苦労することが多い。パフォーマンスの最適化は、重要ではあるが、時間を要する、しばしば手動のプロセスである。この領域でもっとも複雑なタスクの1つは、ドキュメントオブジェクトモデル(DOM)を変更することである。最近のLLM(Large Language Models)の進歩は、開発者がWebパフォーマンス問題に対処する方法を変える可能性のある、この複雑なタスクを自動化するための有望な道を提供する。本研究は、Webパフォーマンス自動解決のための9つの最先端LCMの有効性を評価する。この目的のために、まず15の人気のあるWebページ(例えばFacebook)のDOMツリーを抽出し、それからLighthouseを使ってパフォーマンス監査レポートを検索しました。その後、抽出したDOMツリーとそれに対応する監査を各モデルに渡して解決した。本研究は,SEOとアクセシビリティの問題においてLLMが普遍的に優れていることを明らかにする,7つのユニークな監査カテゴリーについて考察した。しかし、パフォーマンスクリティカルなDOM操作におけるそれらの有効性は混在している。 GPT-4.1のような高性能モデルは初期負荷、インターアクティビティ、ネットワーク最適化(例:46.52%から48.68%の監査インシデント削減)などの領域で大幅な性能低下をもたらしたが、GPT-4o-mini(特に性能の低下)は一貫して性能の低下を招いた。これらの修正のさらなる分析は、特に視覚安定に影響を及ぼす回帰とともに、主に付加的な戦略と頻繁な位置変化を示した。

論文の概要: Evaluating the Use of LLMs for Automated DOM-Level Resolution of Web Performance Issues

関連論文リスト