A Scalable Framework for Automatic Playlist Continuation on Music
Streaming Services
- URL: http://arxiv.org/abs/2304.09061v1
- Date: Wed, 12 Apr 2023 08:46:04 GMT
- Title: A Scalable Framework for Automatic Playlist Continuation on Music
Streaming Services
- Authors: Walid Bendada and Guillaume Salha-Galvan and Thomas Bouab\c{c}a and
Tristan Cazenave
- Abstract summary: We introduce a general framework to build scalable yet effective Automatic Playlist Continuation models.
We demonstrate the relevance of this framework through in-depth experimental validation on Spotify's Million Playlist dataset.
We report results from a large-scale online A/B test on this service, emphasizing the practical impact of our approach.
- Score: 5.215058915075775
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Music streaming services often aim to recommend songs for users to extend the
playlists they have created on these services. However, extending playlists
while preserving their musical characteristics and matching user preferences
remains a challenging task, commonly referred to as Automatic Playlist
Continuation (APC). Besides, while these services often need to select the best
songs to recommend in real-time and among large catalogs with millions of
candidates, recent research on APC mainly focused on models with few
scalability guarantees and evaluated on relatively small datasets. In this
paper, we introduce a general framework to build scalable yet effective APC
models for large-scale applications. Based on a represent-then-aggregate
strategy, it ensures scalability by design while remaining flexible enough to
incorporate a wide range of representation learning and sequence modeling
techniques, e.g., based on Transformers. We demonstrate the relevance of this
framework through in-depth experimental validation on Spotify's Million
Playlist Dataset (MPD), the largest public dataset for APC. We also describe
how, in 2022, we successfully leveraged this framework to improve APC in
production on Deezer. We report results from a large-scale online A/B test on
this service, emphasizing the practical impact of our approach in such a
real-world application.
Related papers
- Audio Processing using Pattern Recognition for Music Genre Classification [0.0]
This project explores the application of machine learning techniques for music genre classification using the GTZAN dataset.
Motivated by the growing demand for personalized music recommendations, we focused on classifying five genres-Blues, Classical, Jazz, Hip Hop, and Country.
The ANN model demonstrated the best performance, achieving a validation accuracy of 92.44%.
arXiv Detail & Related papers (2024-10-19T05:44:05Z) - Transformers Meet ACT-R: Repeat-Aware and Sequential Listening Session Recommendation [12.295794664393368]
We introduce PISA, a session-level sequential recommender system for music streaming services.
PISA employs a Transformer architecture learning embedding representations of listening sessions and users.
We demonstrate the empirical relevance of PISA using both publicly available listening data from Last.fm and proprietary data from Deezer.
arXiv Detail & Related papers (2024-08-29T14:44:12Z) - VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling [71.01050359126141]
We propose VidMuse, a framework for generating music aligned with video inputs.
VidMuse produces high-fidelity music that is both acoustically and semantically aligned with the video.
arXiv Detail & Related papers (2024-06-06T17:58:11Z) - MuPT: A Generative Symbolic Music Pretrained Transformer [56.09299510129221]
We explore the application of Large Language Models (LLMs) to the pre-training of music.
To address the challenges associated with misaligned measures from different tracks during generation, we propose a Synchronized Multi-Track ABC Notation (SMT-ABC Notation)
Our contributions include a series of models capable of handling up to 8192 tokens, covering 90% of the symbolic music data in our training set.
arXiv Detail & Related papers (2024-04-09T15:35:52Z) - MusicAgent: An AI Agent for Music Understanding and Generation with
Large Language Models [54.55063772090821]
MusicAgent integrates numerous music-related tools and an autonomous workflow to address user requirements.
The primary goal of this system is to free users from the intricacies of AI-music tools, enabling them to concentrate on the creative aspect.
arXiv Detail & Related papers (2023-10-18T13:31:10Z) - Related Rhythms: Recommendation System To Discover Music You May Like [2.7152798636894193]
In this paper, a distributed Machine Learning pipeline is delineated, which is capable of taking a subset of songs as input and producing a new subset of songs identified as being similar to the inputted subset.
The publicly accessible Million Songs dataset (MSD) enables researchers to develop and explore reasonably efficient systems for audio track analysis and recommendations.
The objective of the proposed application is to leverage an ML system trained to optimally recommend songs that a user might like.
arXiv Detail & Related papers (2023-09-24T04:18:40Z) - MARBLE: Music Audio Representation Benchmark for Universal Evaluation [79.25065218663458]
We introduce the Music Audio Representation Benchmark for universaL Evaluation, termed MARBLE.
It aims to provide a benchmark for various Music Information Retrieval (MIR) tasks by defining a comprehensive taxonomy with four hierarchy levels, including acoustic, performance, score, and high-level description.
We then establish a unified protocol based on 14 tasks on 8 public-available datasets, providing a fair and standard assessment of representations of all open-sourced pre-trained models developed on music recordings as baselines.
arXiv Detail & Related papers (2023-06-18T12:56:46Z) - Simple and Controllable Music Generation [94.61958781346176]
MusicGen is a single Language Model (LM) that operates over several streams of compressed discrete music representation, i.e., tokens.
Unlike prior work, MusicGen is comprised of a single-stage transformer LM together with efficient token interleaving patterns.
arXiv Detail & Related papers (2023-06-08T15:31:05Z) - MATT: A Multiple-instance Attention Mechanism for Long-tail Music Genre
Classification [1.8275108630751844]
Imbalanced music genre classification is a crucial task in the Music Information Retrieval (MIR) field.
Most of the existing models are designed for class-balanced music datasets.
We propose a novel mechanism named Multi-instance Attention (MATT) to boost the performance for identifying tail classes.
arXiv Detail & Related papers (2022-09-09T03:52:44Z) - Lets Play Music: Audio-driven Performance Video Generation [58.77609661515749]
We propose a new task named Audio-driven Per-formance Video Generation (APVG)
APVG aims to synthesize the video of a person playing a certain instrument guided by a given music audio clip.
arXiv Detail & Related papers (2020-11-05T03:13:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.