Large-Scale Person Detection and Localization using Overhead Fisheye
Cameras
- URL: http://arxiv.org/abs/2307.08252v1
- Date: Mon, 17 Jul 2023 05:36:01 GMT
- Title: Large-Scale Person Detection and Localization using Overhead Fisheye
Cameras
- Authors: Lu Yang, Liulei Li, Xueshi Xin, Yifan Sun, Qing Song, Wenguan Wang
- Abstract summary: We present the first large-scale overhead fisheye dataset for person detection and localization.
We build a fisheye person detection network, which exploits the fisheye distortions by a rotation-equivariant training strategy.
Our whole fisheye positioning solution is able to locate all persons in FOV with an accuracy of 0.5 m, within 0.1 s.
- Score: 40.004888590123954
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Location determination finds wide applications in daily life. Instead of
existing efforts devoted to localizing tourist photos captured by perspective
cameras, in this article, we focus on devising person positioning solutions
using overhead fisheye cameras. Such solutions are advantageous in large field
of view (FOV), low cost, anti-occlusion, and unaggressive work mode (without
the necessity of cameras carried by persons). However, related studies are
quite scarce, due to the paucity of data. To stimulate research in this
exciting area, we present LOAF, the first large-scale overhead fisheye dataset
for person detection and localization. LOAF is built with many essential
features, e.g., i) the data cover abundant diversities in scenes, human pose,
density, and location; ii) it contains currently the largest number of
annotated pedestrian, i.e., 457K bounding boxes with groundtruth location
information; iii) the body-boxes are labeled as radius-aligned so as to fully
address the positioning challenge. To approach localization, we build a fisheye
person detection network, which exploits the fisheye distortions by a
rotation-equivariant training strategy and predict radius-aligned human boxes
end-to-end. Then, the actual locations of the detected persons are calculated
by a numerical solution on the fisheye model and camera altitude data.
Extensive experiments on LOAF validate the superiority of our fisheye detector
w.r.t. previous methods, and show that our whole fisheye positioning solution
is able to locate all persons in FOV with an accuracy of 0.5 m, within 0.1 s.
Related papers
- RoFIR: Robust Fisheye Image Rectification Framework Impervious to Optical Center Deviation [88.54817424560056]
We propose a distortion vector map (DVM) that measures the degree and direction of local distortion.
By learning the DVM, the model can independently identify local distortions at each pixel without relying on global distortion patterns.
In the pre-training stage, it predicts the distortion vector map and perceives the local distortion features of each pixel.
In the fine-tuning stage, it predicts a pixel-wise flow map for deviated fisheye image rectification.
arXiv Detail & Related papers (2024-06-27T06:38:56Z) - FisheyeDetNet: 360° Surround view Fisheye Camera based Object Detection System for Autonomous Driving [4.972459365804512]
Object detection is a mature problem in autonomous driving with pedestrian detection being one of the first deployed algorithms.
Standard bounding box representation fails in fisheye cameras due to heavy radial distortion, particularly in the periphery.
We design rotated bounding boxes, ellipse, generic polygon as polar arc/angle representations and define an instance segmentation mIOU metric to analyze these representations.
The proposed model FisheyeDetNet with polygon outperforms others and achieves a mAP score of 49.5 % on Valeo fisheye surround-view dataset for automated driving applications.
arXiv Detail & Related papers (2024-04-20T18:50:57Z) - SDGE: Stereo Guided Depth Estimation for 360$^\circ$ Camera Sets [65.64958606221069]
Multi-camera systems are often used in autonomous driving to achieve a 360$circ$ perception.
These 360$circ$ camera sets often have limited or low-quality overlap regions, making multi-view stereo methods infeasible for the entire image.
We propose the Stereo Guided Depth Estimation (SGDE) method, which enhances depth estimation of the full image by explicitly utilizing multi-view stereo results on the overlap.
arXiv Detail & Related papers (2024-02-19T02:41:37Z) - FoVA-Depth: Field-of-View Agnostic Depth Estimation for Cross-Dataset
Generalization [57.98448472585241]
We propose a method to train a stereo depth estimation model on the widely available pinhole data.
We show strong generalization ability of our approach on both indoor and outdoor datasets.
arXiv Detail & Related papers (2024-01-24T20:07:59Z) - Egocentric Whole-Body Motion Capture with FisheyeViT and Diffusion-Based
Motion Refinement [65.08165593201437]
We explore egocentric whole-body motion capture using a single fisheye camera, which simultaneously estimates human body and hand motion.
This task presents significant challenges due to the lack of high-quality datasets, fisheye camera distortion, and human body self-occlusion.
We propose a novel approach that leverages FisheyeViT to extract fisheye image features, which are converted into pixel-aligned 3D heatmap representations for 3D human body pose prediction.
arXiv Detail & Related papers (2023-11-28T07:13:47Z) - Estimating Distances Between People using a Single Overhead Fisheye
Camera with Application to Social-Distancing Oversight [8.337649176647645]
We propose two approaches for monitoring distances between people indoors.
One method leverages a geometric model of the fisheye lens, while the other uses a neural network to predict the 3D-world distance.
The algorithms achieve 1-2 ft distance error and over 95% accuracy in detecting social-distance violations.
arXiv Detail & Related papers (2023-03-21T00:50:14Z) - ARPD: Anchor-free Rotation-aware People Detection using Topview Fisheye
Camera [3.0868856870169625]
We propose ARPD, a single-stage anchor-free fully convolutional network to detect arbitrarily rotated people in fish-eye images.
Our method competes favorably with state-of-the-art algorithms while running significantly faster.
arXiv Detail & Related papers (2022-01-25T05:49:50Z) - Estimating Egocentric 3D Human Pose in Global Space [70.7272154474722]
We present a new method for egocentric global 3D body pose estimation using a single-mounted fisheye camera.
Our approach outperforms state-of-the-art methods both quantitatively and qualitatively.
arXiv Detail & Related papers (2021-04-27T20:01:57Z) - Generalized Object Detection on Fisheye Cameras for Autonomous Driving:
Dataset, Representations and Baseline [5.1450366450434295]
We explore better representations like oriented bounding box, ellipse, and generic polygon for object detection in fisheye images.
We design a novel curved bounding box model that has optimal properties for fisheye distortion models.
It is the first detailed study on object detection on fisheye cameras for autonomous driving scenarios.
arXiv Detail & Related papers (2020-12-03T18:00:16Z) - RAPiD: Rotation-Aware People Detection in Overhead Fisheye Images [13.290341167863495]
We develop an end-to-end-aware people detection method, named RAPiD, that detects people using arbitrarily-oriented bounding boxes.
Our fully-convolutional neural network directly regresses the angle of each bounding box using a periodic loss rotation function.
We show that our simple, yet effective method outperforms state-of-the-art results on three fisheye-image datasets.
arXiv Detail & Related papers (2020-05-23T23:47:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.