Evaluating Multi-Turn Bargain Skills in LLM-Based Seller Agent
- URL: http://arxiv.org/abs/2509.06341v1
- Date: Mon, 08 Sep 2025 05:12:03 GMT
- Title: Evaluating Multi-Turn Bargain Skills in LLM-Based Seller Agent
- Authors: Issue Yishu Wang, Kakam Chong, Xiaofeng Wang, Xu Yan, DeXin Kong, Chen Ju, Ming Chen, Shuai Xiao, Shuguang Han, jufeng chen,
- Abstract summary: We introduce a multi-turn evaluation framework for measuring the bargaining ability of seller agents in e-commerce dialogues.<n>Our contributions are: (1) a large-scale e-commerce bargaining benchmark spanning 622 categories, 9,892 products, and 3,014 tasks; (2) a turn-level evaluation framework grounded in Theory of Mind with annotated buyer intents, moving beyond outcome-only metrics; and (3) an automated pipeline that extracts reliable intent from massive dialogue data.
- Score: 20.0134260493017
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In online second-hand marketplaces, multi-turn bargaining is a crucial part of seller-buyer interactions. Large Language Models (LLMs) can act as seller agents, negotiating with buyers on behalf of sellers under given business constraints. A critical ability for such agents is to track and accurately interpret cumulative buyer intents across long negotiations, which directly impacts bargaining effectiveness. We introduce a multi-turn evaluation framework for measuring the bargaining ability of seller agents in e-commerce dialogues. The framework tests whether an agent can extract and track buyer intents. Our contributions are: (1) a large-scale e-commerce bargaining benchmark spanning 622 categories, 9,892 products, and 3,014 tasks; (2) a turn-level evaluation framework grounded in Theory of Mind (ToM) with annotated buyer intents, moving beyond outcome-only metrics; and (3) an automated pipeline that extracts reliable intent from massive dialogue data.
Related papers
- AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions [49.49718899185783]
Large language model (LLM)-based agents are increasingly expected to negotiate, coordinate, and transact autonomously.<n>We introduce AgenticPay, a benchmark and simulation framework for multi-agent buyer-seller negotiation driven by natural language.
arXiv Detail & Related papers (2026-02-05T18:50:36Z) - MADIAVE: Multi-Agent Debate for Implicit Attribute Value Extraction [52.89860691282002]
Implicit Attribute Value Extraction (AVE) is essential for accurately representing products in e-commerce.<n>Despite advances in multimodal large language models (MLLMs), implicit AVE remains challenging due to the complexity of multidimensional data.<n>We introduce textscmodelname, a multi-agent debate framework that employs multiple MLLM agents to iteratively refine inferences.
arXiv Detail & Related papers (2025-10-07T06:27:42Z) - FishBargain: An LLM-Empowered Bargaining Agent for Online Fleamarket Platform Sellers [26.343587134457415]
We propose an LLM-empowered bargaining agent designed for online fleamarket platform sellers, named as FishBargain.<n>FishBargain understands the chat context and product information, chooses both action and language skill considering possible adversary actions and generates utterances.<n>Both qualitative and quantitative experiments demonstrate that FishBargain can effectively help sellers make more deals.
arXiv Detail & Related papers (2025-01-22T06:12:25Z) - Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method [17.388837360641276]
This paper describes the Bargaining task as an asymmetric incomplete information game.
It allows us to quantitatively assess an agent's performance in the Bargain task.
We propose a novel approach called OG-Narrator that integrates a deterministic Offer Generator to control the price range of Buyer's offers.
arXiv Detail & Related papers (2024-02-24T13:36:58Z) - Assistive Large Language Model Agents for Socially-Aware Negotiation Dialogues [47.977032883078664]
We develop assistive agents based on Large Language Models (LLMs) that aid interlocutors in business negotiations.<n>A third LLM acts as a remediator agent to rewrite utterances violating norms for improving negotiation outcomes.<n>We provide rich empirical evidence to demonstrate its effectiveness in negotiations across three different negotiation topics.
arXiv Detail & Related papers (2024-01-29T09:07:40Z) - Salespeople vs SalesBot: Exploring the Role of Educational Value in
Conversational Recommender Systems [78.84530426424838]
Existing conversational recommender systems often overlook users' lack of background knowledge, focusing solely on gathering preferences.
We introduce SalesOps, a framework that facilitates the simulation and evaluation of such systems.
We build SalesBot and ShopperBot, a pair of LLM-powered agents that can simulate either side of the framework.
arXiv Detail & Related papers (2023-10-26T19:44:06Z) - Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive Negotiation [52.930183136111864]
We propose using scorable negotiation to evaluate Large Language Models (LLMs)
To reach an agreement, agents must have strong arithmetic, inference, exploration, and planning capabilities.
We provide procedures to create new games and increase games' difficulty to have an evolving benchmark.
arXiv Detail & Related papers (2023-09-29T13:33:06Z) - Improving Language Model Negotiation with Self-Play and In-Context
Learning from AI Feedback [97.54519989641388]
We study whether multiple large language models (LLMs) can autonomously improve each other in a negotiation game by playing, reflecting, and criticizing.
Only a subset of the language models we consider can self-play and improve the deal price from AI feedback.
arXiv Detail & Related papers (2023-05-17T11:55:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.