1st Place Solution to the 1st SkatingVerse Challenge
- URL: http://arxiv.org/abs/2404.14032v1
- Date: Mon, 22 Apr 2024 09:50:05 GMT
- Title: 1st Place Solution to the 1st SkatingVerse Challenge
- Authors: Tao Sun, Yuanzi Fu, Kaicheng Yang, Jian Wu, Ziyong Feng,
- Abstract summary: This paper presents the winning solution for the 1stVerse Skating Challenge.
We leverage the DINO framework to extract the Region of Interest (ROI) and perform precise cropping of the raw video footage.
By ensembling the prediction results based on logits, our solution attains an impressive leaderboard score of 95.73%.
- Score: 12.17968838503053
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper presents the winning solution for the 1st SkatingVerse Challenge. We propose a method that involves several steps. To begin, we leverage the DINO framework to extract the Region of Interest (ROI) and perform precise cropping of the raw video footage. Subsequently, we employ three distinct models, namely Unmasked Teacher, UniformerV2, and InfoGCN, to capture different aspects of the data. By ensembling the prediction results based on logits, our solution attains an impressive leaderboard score of 95.73%.
Related papers
- AIM 2024 Sparse Neural Rendering Challenge: Methods and Results [64.19942455360068]
This paper reviews the challenge on Sparse Neural Rendering that was part of the Advances in Image Manipulation (AIM) workshop, held in conjunction with ECCV 2024.
The challenge aims at producing novel camera view synthesis of diverse scenes from sparse image observations.
Participants are asked to optimise objective fidelity to the ground-truth images as measured via the Peak Signal-to-Noise Ratio (PSNR) metric.
arXiv Detail & Related papers (2024-09-23T14:17:40Z) - First Place Solution to the Multiple-choice Video QA Track of The Second Perception Test Challenge [4.075139470537149]
We present our first-place solution to the Multiple-choice Video Question Answering track of The Second Perception Test Challenge.
This competition posed a complex video understanding task, requiring models to accurately comprehend and answer questions about video content.
arXiv Detail & Related papers (2024-09-20T14:31:13Z) - Second Place Solution of WSDM2023 Toloka Visual Question Answering Challenge [9.915564470970049]
We present our solution for the WSDM2023 Toloka Visual Question Answering Challenge.
Inspired by the application of multimodal pre-trained models, we designed a three-stage solution.
Our team achieved a score of 76.342 on the final leaderboard, ranking second.
arXiv Detail & Related papers (2024-07-05T04:56:05Z) - The SkatingVerse Workshop & Challenge: Methods and Results [137.81522563074287]
The SkatingVerse Workshop & Challenge aims to encourage research in developing novel and accurate methods for human action understanding.
The dataset used for the SkatingVerse Challenge has been publicly released.
Around 10 participating teams from the globe competed in the SkatingVerse Challenge.
arXiv Detail & Related papers (2024-05-27T14:12:07Z) - The First Pathloss Radio Map Prediction Challenge [59.11388233415274]
We have launched the ICASSP 2023 First Pathloss Radio Map Prediction Challenge.
In this short overview paper, we briefly describe the pathloss prediction problem, the provided datasets, the challenge task and the challenge evaluation methodology.
arXiv Detail & Related papers (2023-10-11T17:00:03Z) - Low-Resolution Action Recognition for Tiny Actions Challenge [52.4358152877632]
Tiny Actions Challenge focuses on understanding human activities in real-world surveillance.
There are two main difficulties for activity recognition in this scenario.
We propose a comprehensive recognition solution in this paper.
arXiv Detail & Related papers (2022-09-28T00:49:13Z) - A Technical Report for ICCV 2021 VIPriors Re-identification Challenge [5.940699390639281]
This paper introduces our solution for the re-identification track in VIPriors Challenge 2021.
It shows use state-of-the-art data processing strategies, model designs, and post-processing ensemble methods.
The final score of our team (ALONG) is 96.5154% mAP, ranking first in the leaderboard.
arXiv Detail & Related papers (2021-09-30T14:29:31Z) - NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Dataset
and Study [95.36629866768999]
This paper introduces a novel dataset for video enhancement and studies the state-of-the-art methods of the NTIRE 2021 challenge.
The challenge is the first NTIRE challenge in this direction, with three competitions, hundreds of participants and tens of proposed solutions.
We find that the NTIRE 2021 challenge advances the state-of-the-art of quality enhancement on compressed video.
arXiv Detail & Related papers (2021-04-21T22:18:33Z) - Single Image Super-Resolution [0.0]
This study presents a chronological overview of the single image super-resolution problem.
We first define the problem thoroughly and mention some of the serious challenges.
Then the problem formulation and the performance metrics are defined.
arXiv Detail & Related papers (2021-01-08T00:10:03Z) - Top-1 Solution of Multi-Moments in Time Challenge 2019 [56.15819266653481]
We conduct several experiments with popular Image-Based action recognition methods TRN, TSN, and TSM.
A novel temporal interlacing network is proposed towards fast and accurate recognition.
We ensemble all the above models and achieve 67.22% on the validation set and 60.77% on the test set, which ranks 1st on the final leaderboard.
arXiv Detail & Related papers (2020-03-12T15:11:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.