The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020)
- URL: http://arxiv.org/abs/2008.00744v1
- Date: Mon, 3 Aug 2020 09:55:26 GMT
- Title: The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020)
- Authors: Samuel Albanie, Yang Liu, Arsha Nagrani, Antoine Miech, Ernesto Coto,
Ivan Laptev, Rahul Sukthankar, Bernard Ghanem, Andrew Zisserman, Valentin
Gabeur, Chen Sun, Karteek Alahari, Cordelia Schmid, Shizhe Chen, Yida Zhao,
Qin Jin, Kaixu Cui, Hui Liu, Chen Wang, Yudong Jiang, Xiaoshuai Hao
- Abstract summary: We present a new video understanding pentathlon challenge, an open competition held in conjunction with the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020.
The objective of the challenge was to explore and evaluate new methods for text-to-video retrieval.
- Score: 186.7816349401443
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a new video understanding pentathlon challenge, an open
competition held in conjunction with the IEEE Conference on Computer Vision and
Pattern Recognition (CVPR) 2020. The objective of the challenge was to explore
and evaluate new methods for text-to-video retrieval-the task of searching for
content within a corpus of videos using natural language queries. This report
summarizes the results of the first edition of the challenge together with the
findings of the participants.
Related papers
- Perception Test 2023: A Summary of the First Challenge And Outcome [67.0525378209708]
The First Perception Test challenge was held as a half-day workshop alongside the IEEE/CVF International Conference on Computer Vision (ICCV) 2023.
The goal was to benchmarking state-of-the-art video models on the recently proposed Perception Test benchmark.
We summarise in this report the task descriptions, metrics, baselines, and results.
arXiv Detail & Related papers (2023-12-20T15:12:27Z) - VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge [95.6159736804855]
The VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22) was held in conjunction with INTERSPEECH 2022.
The goal of this challenge was to evaluate how well state-of-the-art speaker recognition systems can diarise and recognise speakers from speech obtained "in the wild"
arXiv Detail & Related papers (2023-02-20T19:27:14Z) - ViSeRet: A simple yet effective approach to moment retrieval via
fine-grained video segmentation [6.544437737391409]
This paper presents the 1st place solution to the video retrieval track of the ICCV VALUE Challenge 2021.
We present a simple yet effective approach to jointly tackle two video-text retrieval tasks.
We create an ensemble model that achieves the new state-of-the-art performance on all four datasets.
arXiv Detail & Related papers (2021-10-11T10:39:13Z) - NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Dataset
and Study [95.36629866768999]
This paper introduces a novel dataset for video enhancement and studies the state-of-the-art methods of the NTIRE 2021 challenge.
The challenge is the first NTIRE challenge in this direction, with three competitions, hundreds of participants and tens of proposed solutions.
We find that the NTIRE 2021 challenge advances the state-of-the-art of quality enhancement on compressed video.
arXiv Detail & Related papers (2021-04-21T22:18:33Z) - Bridging Vision and Language from the Video-to-Text Perspective: A
Comprehensive Review [1.0520692160489133]
This review categorizes and describes the state-of-the-art techniques for the video-to-text problem.
It covers the main video-to-text methods and the ways to evaluate their performance.
State-of-the-art techniques are still a long way from achieving human-like performance in generating or retrieving video descriptions.
arXiv Detail & Related papers (2021-03-27T02:12:28Z) - VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge [99.82500204110015]
We held the second installment of the VoxCeleb Speaker Recognition Challenge in conjunction with Interspeech 2020.
The goal of this challenge was to assess how well current speaker recognition technology is able to diarise and recognize speakers in unconstrained or in the wild' data.
This paper outlines the challenge, and describes the baselines, methods used, and results.
arXiv Detail & Related papers (2020-12-12T17:20:57Z) - Pre-training for Video Captioning Challenge 2020 Summary [118.45411524199164]
The Pre-training for Video Captioning Challenge 2020 Summary.
Results and challenge participants' technical reports.
Video Captioning Challenge 2020.
arXiv Detail & Related papers (2020-07-27T15:31:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.