PlayMyData: a curated dataset of multi-platform video games
- URL: http://arxiv.org/abs/2401.08561v2
- Date: Thu, 18 Jan 2024 18:52:57 GMT
- Title: PlayMyData: a curated dataset of multi-platform video games
- Authors: Andrea D'Angelo, Claudio Di Sipio, Cristiano Politowski, and Riccardo
Rubei
- Abstract summary: PlayMyData is a curated dataset composed of 99,864 multi-platform games gathered by IGDB website.
By exploiting a dedicated API, we collect relevant metadata for each game, e.g., description, genre, rating, gameplay video URLs, and screenshots.
PlayMyData can be used to foster cross-domain investigations built on top of the provided multimedia data.
- Score: 2.7498981662768536
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Being predominant in digital entertainment for decades, video games have been
recognized as valuable software artifacts by the software engineering (SE)
community just recently. Such an acknowledgment has unveiled several research
opportunities, spanning from empirical studies to the application of AI
techniques for classification tasks. In this respect, several curated game
datasets have been disclosed for research purposes even though the collected
data are insufficient to support the application of advanced models or to
enable interdisciplinary studies. Moreover, the majority of those are limited
to PC games, thus excluding notorious gaming platforms, e.g., PlayStation,
Xbox, and Nintendo. In this paper, we propose PlayMyData, a curated dataset
composed of 99,864 multi-platform games gathered by IGDB website. By exploiting
a dedicated API, we collect relevant metadata for each game, e.g., description,
genre, rating, gameplay video URLs, and screenshots. Furthermore, we enrich
PlayMyData with the timing needed to complete each game by mining the HLTB
website. To the best of our knowledge, this is the most comprehensive dataset
in the domain that can be used to support different automated tasks in SE. More
importantly, PlayMyData can be used to foster cross-domain investigations built
on top of the provided multimedia data.
Related papers
- Serious Games in Digital Gaming: A Comprehensive Review of Applications,
Game Engines and Advancements [55.2480439325792]
In recent years, serious games have become increasingly popular due to their ability to simultaneously educate and entertain users.
In this review, we provide a comprehensive overview of the different types of digital games and expand on the serious games genre.
We present the most widely used game engines used in the game development industry and extend the Unity game machine advantages.
arXiv Detail & Related papers (2023-11-03T09:17:09Z) - An Innovative Tool for Uploading/Scraping Large Image Datasets on Social
Networks [9.27070946719462]
We propose an automated approach by means of a digital tool that we created on purpose.
The tool is capable of automatically uploading an entire image dataset to the desired digital platform and then downloading all the uploaded pictures.
arXiv Detail & Related papers (2023-11-01T23:27:37Z) - BridgeData V2: A Dataset for Robot Learning at Scale [73.86688388408021]
BridgeData V2 is a large and diverse dataset of robotic manipulation behaviors.
It contains 60,096 trajectories collected across 24 environments on a publicly available low-cost robot.
arXiv Detail & Related papers (2023-08-24T17:41:20Z) - The ProfessionAl Go annotation datasEt (PAGE) [3.1723119892509573]
We present the ProfessionsEt dataset, containing 98,525 games played by 2,007 professional players and spans over 70 years.
The dataset includes rich AI analysis results for each move. Moreover, PAGE provides detailed metadata for every player and game after manual cleaning and labeling.
arXiv Detail & Related papers (2022-11-03T02:41:41Z) - SC2EGSet: StarCraft II Esport Replay and Game-state Dataset [0.0]
This work aims to open esports to a broader scientific community by supplying raw and pre-processed files from StarCraft II esports tournaments.
We have publicly available game-engine generated "replays" of tournament matches and performed data extraction using a low-level application programming interface (API) library.
Our dataset contains replays from major and premiere StarCraft II tournaments since 2016.
arXiv Detail & Related papers (2022-07-07T16:52:53Z) - PGD: A Large-scale Professional Go Dataset for Data-driven Analytics [3.747666374070152]
This paper creates the Professional Go dataset, containing 98,043 games played by 2,148 professional players from 1950 to 2021.
The dataset includes analysis results for each move in the match evaluated by advanced AlphaZero-based AI.
With the help of complete meta-information and constructed in-game features, our results prediction system achieves an accuracy of 75.30%.
arXiv Detail & Related papers (2022-04-30T12:53:04Z) - SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in
Soccer Videos [62.686484228479095]
We propose a novel dataset for multiple object tracking composed of 200 sequences of 30s each.
The dataset is fully annotated with bounding boxes and tracklet IDs.
Our analysis shows that multiple player, referee and ball tracking in soccer videos is far from being solved.
arXiv Detail & Related papers (2022-04-14T12:22:12Z) - DataLab: A Platform for Data Analysis and Intervention [96.75253335629534]
DataLab is a unified data-oriented platform that allows users to interactively analyze the characteristics of data.
toolname has features for dataset recommendation and global vision analysis.
So far, DataLab covers 1,715 datasets and 3,583 of its transformed version.
arXiv Detail & Related papers (2022-02-25T18:32:19Z) - Meta Arcade: A Configurable Environment Suite for Meta-Learning [4.962850553353402]
We present Meta Arcade, a tool to easily define and configure custom 2D arcade games that share common visuals, state spaces, action spaces, game components, and scoring mechanisms.
We include a suite of 24 predefined games that collectively illustrate the possibilities of this framework and discuss how these games can be configured for research applications.
arXiv Detail & Related papers (2021-12-01T15:54:38Z) - Playing for 3D Human Recovery [88.91567909861442]
In this work, we obtain massive human sequences by playing the video game with automatically annotated 3D ground truths.
Specifically, we contribute GTA-Human, a large-scale 3D human dataset generated with the GTA-V game engine.
A simple frame-based baseline trained on GTA-Human outperforms more sophisticated methods by a large margin.
arXiv Detail & Related papers (2021-10-14T17:49:42Z) - Design and Implementation of TAG: A Tabletop Games Framework [59.60094442546867]
This document describes the design and implementation of the Tabletop Games framework (TAG)
TAG is a Java-based benchmark for developing modern board games for AI research.
arXiv Detail & Related papers (2020-09-25T07:27:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.