E-GEO: A Testbed for Generative Engine Optimization in E-Commerce
- URL: http://arxiv.org/abs/2511.20867v1
- Date: Tue, 25 Nov 2025 21:28:40 GMT
- Title: E-GEO: A Testbed for Generative Engine Optimization in E-Commerce
- Authors: Puneet S. Bagga, Vivek F. Farias, Tamar Korkotashvili, Tianyi Peng, Yuhang Wu,
- Abstract summary: generative engine optimization improves content visibility and relevance for generative engines.<n>Current GEO practices are ad hoc, and their impacts remain poorly understood.<n>E-GEO is the first benchmark built specifically for e-commerce GEO.
- Score: 9.66101096555058
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the rise of large language models (LLMs), generative engines are becoming powerful alternatives to traditional search, reshaping retrieval tasks. In e-commerce, for instance, conversational shopping agents now guide consumers to relevant products. This shift has created the need for generative engine optimization (GEO)--improving content visibility and relevance for generative engines. Yet despite its growing importance, current GEO practices are ad hoc, and their impacts remain poorly understood, especially in e-commerce. We address this gap by introducing E-GEO, the first benchmark built specifically for e-commerce GEO. E-GEO contains over 7,000 realistic, multi-sentence consumer product queries paired with relevant listings, capturing rich intent, constraints, preferences, and shopping contexts that existing datasets largely miss. Using this benchmark, we conduct the first large-scale empirical study of e-commerce GEO, evaluating 15 common rewriting heuristics and comparing their empirical performance. To move beyond heuristics, we further formulate GEO as a tractable optimization problem and develop a lightweight iterative prompt-optimization algorithm that can significantly outperform these baselines. Surprisingly, the optimized prompts reveal a stable, domain-agnostic pattern--suggesting the existence of a "universally effective" GEO strategy. Our data and code are publicly available at https://github.com/psbagga17/E-GEO.
Related papers
- GEO-Bench-2: From Performance to Capability, Rethinking Evaluation in Geospatial AI [52.13138825802668]
GeoFMs are transforming Earth Observation, but evaluation lacks standardized protocols.<n> GEO-Bench-2 addresses this with a comprehensive framework spanning classification, segmentation, regression, object detection, and instance segmentation.<n>Code, data, and leaderboard for GEO-Bench-2 are publicly released under a permissive license.
arXiv Detail & Related papers (2025-11-19T17:45:02Z) - What Generative Search Engines Like and How to Optimize Web Content Cooperatively [29.736392688493808]
Generative Engines, such as Google AI overview and ChatGPT, provide significantly enhanced user experiences.<n>Their rapid adoption also drives the needs of Generative Engine Optimization (GEO), as content providers are eager to gain more traction from them.<n>We introduce AutoGEO, a framework to automatically learn generative engine preferences when using retrieved contents for response generation.
arXiv Detail & Related papers (2025-10-13T14:10:26Z) - GeoGPT-RAG Technical Report [48.23789135946953]
GeoGPT is an open large language model system built to advance research in the geosciences.<n>RAG augments model outputs with relevant information retrieved from an external knowledge source.<n>RAG uses RAG to draw from the GeoGPT Library, a specialized corpus curated for geoscientific content.
arXiv Detail & Related papers (2025-08-18T08:29:22Z) - Role-Augmented Intent-Driven Generative Search Engine Optimization [9.876307656819039]
We propose a Role-Augmented Intent-Driven Generative Search Engine Optimization (G-SEO) method.<n>Our method models search intent through reflective refinement across diverse informational roles, enabling targeted content enhancement.<n> Experimental results demonstrate that search intent serves as an effective signal for guiding content optimization.
arXiv Detail & Related papers (2025-08-15T02:08:55Z) - Beyond SEO: A Transformer-Based Approach for Reinventing Web Content Optimisation [0.0]
We present a domain-specific fine-tuning approach for Generative Engine Optimization (GEO)<n>Our method fine-tunes a BART-base transformer on synthetically generated training data comprising 1,905 cleaned travel website content pairs.<n> optimized content demonstrates substantial visibility gains in generative search responses with 15.63% improvement in absolute word count and 30.96% improvement in position-adjusted word count metrics.
arXiv Detail & Related papers (2025-07-03T20:52:10Z) - GEO: Generative Engine Optimization [50.45232692363787]
We formalize the unified framework of generative engines (GEs)
GEs use large language models (LLMs) to gather and summarize information to answer user queries.
Generative Engines typically satisfy queries by synthesizing information from multiple sources and summarizing them.
We introduce Generative Engine Optimization (GEO), the first novel paradigm to aid content creators in improving their content visibility in generative engine responses.
arXiv Detail & Related papers (2023-11-16T10:06:09Z) - GNN-GMVO: Graph Neural Networks for Optimizing Gross Merchandise Value
in Similar Item Recommendation [12.25382490978895]
Similar item recommendation is a critical task in the e-Commerce industry, which helps customers explore similar and relevant alternatives.
Despite the traditional machine learning models, Graph Neural Networks (GNNs) can understand complex relations like similarity between products.
We propose a new GNN architecture called GNN-GMVO (Graph Neural Network - Gross Merchandise Value) to address these issues.
arXiv Detail & Related papers (2023-10-26T18:43:16Z) - Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product
Retrieval [152.3504607706575]
This research aims to conduct weakly-supervised multi-modal instance-level product retrieval for fine-grained product categories.
We first contribute the Product1M datasets, and define two real practical instance-level retrieval tasks.
We exploit to train a more effective cross-modal model which is adaptively capable of incorporating key concept information from the multi-modal data.
arXiv Detail & Related papers (2022-06-17T15:40:45Z) - PreSizE: Predicting Size in E-Commerce using Transformers [76.33790223551074]
PreSizE is a novel deep learning framework which utilizes Transformers for accurate size prediction.
We demonstrate that PreSizE is capable of achieving superior prediction performance compared to previous state-of-the-art baselines.
As a proof of concept, we demonstrate that size predictions made by PreSizE can be effectively integrated into an existing production recommender system.
arXiv Detail & Related papers (2021-05-04T15:23:59Z) - GEO: Enhancing Combinatorial Optimization with Classical and Quantum
Generative Models [62.997667081978825]
We introduce a new framework that leverages machine learning models known as generative models to solve optimization problems.
We focus on a quantum-inspired version of GEO relying on tensor-network Born machines.
We show its superior performance when the goal is to find the best minimum given a fixed budget for the number of function calls.
arXiv Detail & Related papers (2021-01-15T18:18:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.