Multimodal and multiview distillation for real-time player detection on
a football field
- URL: http://arxiv.org/abs/2004.07544v1
- Date: Thu, 16 Apr 2020 09:16:20 GMT
- Title: Multimodal and multiview distillation for real-time player detection on
a football field
- Authors: Anthony Cioppa, Adrien Deli\`ege, Noor Ul Huda, Rikke Gade, Marc Van
Droogenbroeck, Thomas B. Moeslund
- Abstract summary: We develop a system that detects players from a unique cheap and wide-angle fisheye camera assisted by a single narrow-angle thermal camera.
We show that our solution is effective in detecting players on the whole field filmed by the fisheye camera.
- Score: 31.355119048749618
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Monitoring the occupancy of public sports facilities is essential to assess
their use and to motivate their construction in new places. In the case of a
football field, the area to cover is large, thus several regular cameras should
be used, which makes the setup expensive and complex. As an alternative, we
developed a system that detects players from a unique cheap and wide-angle
fisheye camera assisted by a single narrow-angle thermal camera. In this work,
we train a network in a knowledge distillation approach in which the student
and the teacher have different modalities and a different view of the same
scene. In particular, we design a custom data augmentation combined with a
motion detection algorithm to handle the training in the region of the fisheye
camera not covered by the thermal one. We show that our solution is effective
in detecting players on the whole field filmed by the fisheye camera. We
evaluate it quantitatively and qualitatively in the case of an online
distillation, where the student detects players in real time while being
continuously adapted to the latest video conditions.
Related papers
- Design and Implementation of A Soccer Ball Detection System with
Multiple Cameras [15.399112952297335]
This paper designed and implemented football detection system under multiple cameras for the detection and capture of targets in real-time matches.
The main work mainly consists of three parts, football detector, single camera detection, and multi-cameras detection.
By testing the system, it shows that the system can accurately detect and capture the moving targets in 3D.
arXiv Detail & Related papers (2023-01-31T22:04:53Z) - Graph-Based Multi-Camera Soccer Player Tracker [1.6244541005112743]
The paper presents a multi-camera tracking method intended for tracking soccer players in long shot video recordings from multiple calibrated cameras installed around the playing field.
The large distance to the camera makes it difficult to visually distinguish individual players, which adversely affects the performance of traditional solutions.
Our method focuses on individual player dynamics and interactions between neighborhood players to improve tracking performance.
arXiv Detail & Related papers (2022-11-03T20:01:48Z) - Drone Detection and Tracking in Real-Time by Fusion of Different Sensing
Modalities [66.4525391417921]
We design and evaluate a multi-sensor drone detection system.
Our solution integrates a fish-eye camera as well to monitor a wider part of the sky and steer the other cameras towards objects of interest.
The thermal camera is shown to be a feasible solution as good as the video camera, even if the camera employed here has a lower resolution.
arXiv Detail & Related papers (2022-07-05T10:00:58Z) - SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in
Soccer Videos [62.686484228479095]
We propose a novel dataset for multiple object tracking composed of 200 sequences of 30s each.
The dataset is fully annotated with bounding boxes and tracklet IDs.
Our analysis shows that multiple player, referee and ball tracking in soccer videos is far from being solved.
arXiv Detail & Related papers (2022-04-14T12:22:12Z) - SurroundDepth: Entangling Surrounding Views for Self-Supervised
Multi-Camera Depth Estimation [101.55622133406446]
We propose a SurroundDepth method to incorporate the information from multiple surrounding views to predict depth maps across cameras.
Specifically, we employ a joint network to process all the surrounding views and propose a cross-view transformer to effectively fuse the information from multiple views.
In experiments, our method achieves the state-of-the-art performance on the challenging multi-camera depth estimation datasets.
arXiv Detail & Related papers (2022-04-07T17:58:47Z) - Smart Director: An Event-Driven Directing System for Live Broadcasting [110.30675947733167]
Smart Director aims at mimicking the typical human-in-the-loop broadcasting process to automatically create near-professional broadcasting programs in real-time.
Our system is the first end-to-end automated directing system for multi-camera sports broadcasting.
arXiv Detail & Related papers (2022-01-11T16:14:41Z) - SVDistNet: Self-Supervised Near-Field Distance Estimation on Surround
View Fisheye Cameras [30.480562747903186]
A 360deg perception of scene geometry is essential for automated driving, notably for parking and urban driving scenarios.
We present novel camera-geometry adaptive multi-scale convolutions which utilize the camera parameters as a conditional input.
We evaluate our approach on the Fisheye WoodScape surround-view dataset, significantly improving over previous approaches.
arXiv Detail & Related papers (2021-04-09T15:20:20Z) - Unsupervised Depth and Ego-motion Estimation for Monocular Thermal Video
using Multi-spectral Consistency Loss [76.77673212431152]
We propose an unsupervised learning method for the all-day depth and ego-motion estimation.
The proposed method exploits multi-spectral consistency loss to gives complementary supervision for the networks.
Networks trained with the proposed method robustly estimate the depth and pose from monocular thermal video under low-light and even zero-light conditions.
arXiv Detail & Related papers (2021-03-01T05:29:04Z) - OmniDet: Surround View Cameras based Multi-task Visual Perception
Network for Autonomous Driving [10.3540046389057]
This work presents a multi-task visual perception network on unrectified fisheye images.
It consists of six primary tasks necessary for an autonomous driving system.
We demonstrate that the jointly trained model performs better than the respective single task versions.
arXiv Detail & Related papers (2021-02-15T10:46:24Z) - Real-Time Camera Pose Estimation for Sports Fields [43.50671336931042]
We propose a novel framework that combines accurate localization and robust identification of specific keypoints in the image.
Our algorithm exploits both the field lines and the players' image locations, assuming their ground plane positions to be given.
We will demonstrate its effectiveness on challenging soccer, basketball, and volleyball benchmark datasets.
arXiv Detail & Related papers (2020-03-31T11:27:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.