Dataset-Agnostic Recommender Systems
- URL: http://arxiv.org/abs/2501.07294v1
- Date: Mon, 13 Jan 2025 13:01:00 GMT
- Title: Dataset-Agnostic Recommender Systems
- Authors: Tri Kurniawan Wijaya, Edoardo D'Amico, Xinyang Shao,
- Abstract summary: DAReS aims to enable a single system to autonomously adapt to various datasets without the need for fine-tuning.<n>DareS offers a more efficient and scalable solution for building recommender systems across diverse application domains.
- Score: 0.47109219881156855
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: [This is a position paper and does not contain any empirical or theoretical results] Recommender systems have become a cornerstone of personalized user experiences, yet their development typically involves significant manual intervention, including dataset-specific feature engineering, hyperparameter tuning, and configuration. To this end, we introduce a novel paradigm: Dataset-Agnostic Recommender Systems (DAReS) that aims to enable a single codebase to autonomously adapt to various datasets without the need for fine-tuning, for a given recommender system task. Central to this approach is the Dataset Description Language (DsDL), a structured format that provides metadata about the dataset's features and labels, and allow the system to understand dataset's characteristics, allowing it to autonomously manage processes like feature selection, missing values imputation, noise removal, and hyperparameter optimization. By reducing the need for domain-specific expertise and manual adjustments, DAReS offers a more efficient and scalable solution for building recommender systems across diverse application domains. It addresses critical challenges in the field, such as reusability, reproducibility, and accessibility for non-expert users or entry-level researchers.
Related papers
- Universal Reusability in Recommender Systems: The Case for Dataset- and Task-Independent Frameworks [0.7124736158080939]
We propose a framework aimed at maximizing the reusability of recommender systems while minimizing barriers to entry.<n>This paper introduces the concept of DTIRS and establishes a roadmap for transitioning from Level-1 automation to Level-2 automation.<n>We discuss key challenges, including the trade-offs between generalization and specialization, computational overhead, and scalability.
arXiv Detail & Related papers (2025-06-03T21:00:34Z) - Multi-agents based User Values Mining for Recommendation [52.26100802380767]
We propose a zero-shot multi-LLM collaborative framework for effective and accurate user value extraction.<n>We apply text summarization techniques to condense item content while preserving essential meaning.<n>To mitigate hallucinations, we introduce two specialized agent roles: evaluators and supervisors.
arXiv Detail & Related papers (2025-05-02T04:01:31Z) - Data Requirement Goal Modeling for Machine Learning Systems [0.8854624631197942]
This work proposes an approach to guide non-experts in identifying data requirements for Machine Learning systems.
We first develop the Data Requirement Goal Model (DRGM) by surveying the white literature.
We then validate the approach through two illustrative examples based on real-world projects.
arXiv Detail & Related papers (2025-04-10T11:30:25Z) - Modeling Churn in Recommender Systems with Aggregated Preferences [6.261444979025644]
We propose a model that addresses the dual challenges of leveraging aggregated user information and mitigating churn risk.
Our model assumes that the RS operates with a probabilistic prior over user types and aggregated satisfaction levels for various content types.
arXiv Detail & Related papers (2025-02-09T13:12:11Z) - Laser: Parameter-Efficient LLM Bi-Tuning for Sequential Recommendation with Collaborative Information [76.62949982303532]
We propose a parameter-efficient Large Language Model Bi-Tuning framework for sequential recommendation with collaborative information (Laser)
In our Laser, the prefix is utilized to incorporate user-item collaborative information and adapt the LLM to the recommendation task, while the suffix converts the output embeddings of the LLM from the language space to the recommendation space for the follow-up item recommendation.
M-Former is a lightweight MoE-based querying transformer that uses a set of query experts to integrate diverse user-specific collaborative information encoded by frozen ID-based sequential recommender systems.
arXiv Detail & Related papers (2024-09-03T04:55:03Z) - Personalized Multi-task Training for Recommender System [80.23030752707916]
PMTRec is the first personalized multi-task learning algorithm to obtain comprehensive user/item embeddings from various information sources.
Our contributions open new avenues for advancing personalized multi-task training in recommender systems.
arXiv Detail & Related papers (2024-07-31T06:27:06Z) - SKADA-Bench: Benchmarking Unsupervised Domain Adaptation Methods with Realistic Validation [55.87169702896249]
Unsupervised Domain Adaptation (DA) consists of adapting a model trained on a labeled source domain to perform well on an unlabeled target domain with some data distribution shift.
We propose a framework to evaluate DA methods and present a fair evaluation of existing shallow algorithms, including reweighting, mapping, and subspace alignment.
Our benchmark highlights the importance of realistic validation and provides practical guidance for real-life applications.
arXiv Detail & Related papers (2024-07-16T12:52:29Z) - Dataset Regeneration for Sequential Recommendation [69.93516846106701]
We propose a data-centric paradigm for developing an ideal training dataset using a model-agnostic dataset regeneration framework called DR4SR.
To demonstrate the effectiveness of the data-centric paradigm, we integrate our framework with various model-centric methods and observe significant performance improvements across four widely adopted datasets.
arXiv Detail & Related papers (2024-05-28T03:45:34Z) - ERASE: Benchmarking Feature Selection Methods for Deep Recommender Systems [40.838320650137625]
This paper presents ERASE, a comprehensive bEnchmaRk for feAture SElection for Deep Recommender Systems (DRS)
ERASE comprises a thorough evaluation of eleven feature selection methods, covering both traditional and deep learning approaches.
Our code is available online for ease of reproduction.
arXiv Detail & Related papers (2024-03-19T11:49:35Z) - DNS-Rec: Data-aware Neural Architecture Search for Recommender Systems [79.76519917171261]
This paper addresses the computational overhead and resource inefficiency prevalent in Sequential Recommender Systems (SRSs)<n>We introduce an innovative approach combining pruning methods with advanced model designs.<n>Our principal contribution is the development of a Data-aware Neural Architecture Search for Recommender System (DNS-Rec)
arXiv Detail & Related papers (2024-02-01T07:22:52Z) - Utilizing deep learning for automated tuning of database management
systems [0.12891210250935148]
OtterTune identifies influential knobs, analyze previously unseen workloads, and provide recommendations for knob settings.
The effectiveness of this approach is demonstrated through the evaluation of a new tool called OtterTune on three different database management systems (DBMSs)
arXiv Detail & Related papers (2023-06-25T21:50:14Z) - AUGUST: an Automatic Generation Understudy for Synthesizing
Conversational Recommendation Datasets [56.052803235932686]
We propose a novel automatic dataset synthesis approach that can generate both large-scale and high-quality recommendation dialogues.
In doing so, we exploit: (i) rich personalized user profiles from traditional recommendation datasets, (ii) rich external knowledge from knowledge graphs, and (iii) the conversation ability contained in human-to-human conversational recommendation datasets.
arXiv Detail & Related papers (2023-06-16T05:27:14Z) - AutoField: Automating Feature Selection in Deep Recommender Systems [36.70138179483737]
Feature selection is a critical process in developing deep learning-based recommender systems.
We propose an AutoML framework that can adaptively select the essential feature fields in an automatic manner.
arXiv Detail & Related papers (2022-04-19T18:06:02Z) - i-Razor: A Differentiable Neural Input Razor for Feature Selection and
Dimension Search in DNN-Based Recommender Systems [8.992480061695138]
Noisy features and inappropriate embedding dimension assignments can deteriorate the performance of recommender systems.
We propose a differentiable neural input razor (i-Razor) that enables joint optimization of feature selection and dimension search.
arXiv Detail & Related papers (2022-04-01T08:30:06Z) - Unsupervised Model Personalization while Preserving Privacy and
Scalability: An Open Problem [55.21502268698577]
This work investigates the task of unsupervised model personalization, adapted to continually evolving, unlabeled local user images.
We provide a novel Dual User-Adaptation framework (DUA) to explore the problem.
This framework flexibly disentangles user-adaptation into model personalization on the server and local data regularization on the user device.
arXiv Detail & Related papers (2020-03-30T09:35:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.