What should I wear to a party in a Greek taverna? Evaluation for Conversational Agents in the Fashion Domain
- URL: http://arxiv.org/abs/2408.08907v1
- Date: Tue, 13 Aug 2024 11:11:27 GMT
- Title: What should I wear to a party in a Greek taverna? Evaluation for Conversational Agents in the Fashion Domain
- Authors: Antonis Maronikolakis, Ana Peleteiro Ramallo, Weiwei Cheng, Thomas Kober,
- Abstract summary: Large language models (LLMs) are poised to revolutionize the domain of online fashion retail.
We create a multilingual evaluation dataset of 4k conversations between customers and a fashion assistant in a large e-commerce fashion platform.
- Score: 1.6549387146138885
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Large language models (LLMs) are poised to revolutionize the domain of online fashion retail, enhancing customer experience and discovery of fashion online. LLM-powered conversational agents introduce a new way of discovery by directly interacting with customers, enabling them to express in their own ways, refine their needs, obtain fashion and shopping advice that is relevant to their taste and intent. For many tasks in e-commerce, such as finding a specific product, conversational agents need to convert their interactions with a customer to a specific call to different backend systems, e.g., a search system to showcase a relevant set of products. Therefore, evaluating the capabilities of LLMs to perform those tasks related to calling other services is vital. However, those evaluations are generally complex, due to the lack of relevant and high quality datasets, and do not align seamlessly with business needs, amongst others. To this end, we created a multilingual evaluation dataset of 4k conversations between customers and a fashion assistant in a large e-commerce fashion platform to measure the capabilities of LLMs to serve as an assistant between customers and a backend engine. We evaluate a range of models, showcasing how our dataset scales to business needs and facilitates iterative development of tools.
Related papers
- Personalized Multimodal Large Language Models: A Survey [127.9521218125761]
Multimodal Large Language Models (MLLMs) have become increasingly important due to their state-of-the-art performance and ability to integrate multiple data modalities.
This paper presents a comprehensive survey on personalized multimodal large language models, focusing on their architecture, training methods, and applications.
arXiv Detail & Related papers (2024-12-03T03:59:03Z) - Learning to Ask: Conversational Product Search via Representation Learning [34.689904564411506]
We propose a new conversational product search model, ConvPS, to assist users in locating desirable items.
The model is first trained to jointly learn the semantic representations of user, query, item, and conversation.
Our proposed ConvPS model can naturally integrate the representation learning of the user, query, item, and conversation into a unified generative framework.
arXiv Detail & Related papers (2024-11-18T14:05:43Z) - ConvKGYarn: Spinning Configurable and Scalable Conversational Knowledge Graph QA datasets with Large Language Models [47.27645876623092]
We present ConvKGYarn, a scalable method for generating up-to-date and conversational KGQA datasets.
We showcase its utility by testing LLMs on diverse conversations - exploring model behavior on conversational KGQA sets with different configurations grounded in the same KG fact set.
arXiv Detail & Related papers (2024-08-12T06:48:43Z) - Facilitating Multi-Role and Multi-Behavior Collaboration of Large Language Models for Online Job Seeking and Recruiting [51.54907796704785]
Existing methods rely on modeling the latent semantics of resumes and job descriptions and learning a matching function between them.
Inspired by the powerful role-playing capabilities of Large Language Models (LLMs), we propose to introduce a mock interview process between LLM-played interviewers and candidates.
We propose MockLLM, a novel applicable framework that divides the person-job matching process into two modules: mock interview generation and two-sided evaluation in handshake protocol.
arXiv Detail & Related papers (2024-05-28T12:23:16Z) - Leveraging Large Language Models for Enhanced Product Descriptions in
eCommerce [6.318353155416729]
This paper introduces a novel methodology for automating product description generation using the LLAMA 2.0 7B language model.
We train the model on a dataset of authentic product descriptions from Walmart, one of the largest eCommerce platforms.
Our findings reveal that the system is not only scalable but also significantly reduces the human workload involved in creating product descriptions.
arXiv Detail & Related papers (2023-10-24T00:55:14Z) - Recommender AI Agent: Integrating Large Language Models for Interactive
Recommendations [53.76682562935373]
We introduce an efficient framework called textbfInteRecAgent, which employs LLMs as the brain and recommender models as tools.
InteRecAgent achieves satisfying performance as a conversational recommender system, outperforming general-purpose LLMs.
arXiv Detail & Related papers (2023-08-31T07:36:44Z) - A Meta-learning based Stacked Regression Approach for Customer Lifetime
Value Prediction [3.6002910014361857]
Customer Lifetime Value (CLV) is the total monetary value of transactions/purchases made by a customer with the business over an intended period of time.
CLV finds application in a number of distinct business domains such as Banking, Insurance, Online-entertainment, Gaming, and E-Commerce.
We propose a system which is able to qualify both as effective, and comprehensive yet simple and interpretable.
arXiv Detail & Related papers (2023-08-07T14:22:02Z) - Intent Recognition in Conversational Recommender Systems [0.0]
We introduce a pipeline to contextualize the input utterances in conversations.
We then take the next step towards leveraging reverse feature engineering to link the contextualized input and learning model to support intent recognition.
arXiv Detail & Related papers (2022-12-06T11:02:42Z) - Personalized Fashion Recommendation from Personal Social Media Data: An
Item-to-Set Metric Learning Approach [71.63618051547144]
We study the problem of personalized fashion recommendation from social media data.
We present an item-to-set metric learning framework that learns to compute the similarity between a set of historical fashion items of a user to a new fashion item.
To validate the effectiveness of our approach, we collect a real-world social media dataset.
arXiv Detail & Related papers (2020-05-25T23:24:24Z) - Cross-Lingual Low-Resource Set-to-Description Retrieval for Global
E-Commerce [83.72476966339103]
Cross-lingual information retrieval is a new task in cross-border e-commerce.
We propose a novel cross-lingual matching network (CLMN) with the enhancement of context-dependent cross-lingual mapping.
Experimental results indicate that our proposed CLMN yields impressive results on the challenging task.
arXiv Detail & Related papers (2020-05-17T08:10:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.