RAPiD: Rotation-Aware People Detection in Overhead Fisheye Images
- URL: http://arxiv.org/abs/2005.11623v1
- Date: Sat, 23 May 2020 23:47:18 GMT
- Title: RAPiD: Rotation-Aware People Detection in Overhead Fisheye Images
- Authors: Zhihao Duan, M. Ozan Tezcan, Hayato Nakamura, Prakash Ishwar, Janusz
Konrad
- Abstract summary: We develop an end-to-end-aware people detection method, named RAPiD, that detects people using arbitrarily-oriented bounding boxes.
Our fully-convolutional neural network directly regresses the angle of each bounding box using a periodic loss rotation function.
We show that our simple, yet effective method outperforms state-of-the-art results on three fisheye-image datasets.
- Score: 13.290341167863495
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent methods for people detection in overhead, fisheye images either use
radially-aligned bounding boxes to represent people, assuming people always
appear along image radius or require significant pre-/post-processing which
radically increases computational complexity. In this work, we develop an
end-to-end rotation-aware people detection method, named RAPiD, that detects
people using arbitrarily-oriented bounding boxes. Our fully-convolutional
neural network directly regresses the angle of each bounding box using a
periodic loss function, which accounts for angle periodicities. We have also
created a new dataset with spatio-temporal annotations of rotated bounding
boxes, for people detection as well as other vision tasks in overhead fisheye
videos. We show that our simple, yet effective method outperforms
state-of-the-art results on three fisheye-image datasets. Code and dataset are
available at http://vip.bu.edu/rapid .
Related papers
- FisheyeDetNet: 360° Surround view Fisheye Camera based Object Detection System for Autonomous Driving [4.972459365804512]
Object detection is a mature problem in autonomous driving with pedestrian detection being one of the first deployed algorithms.
Standard bounding box representation fails in fisheye cameras due to heavy radial distortion, particularly in the periphery.
We design rotated bounding boxes, ellipse, generic polygon as polar arc/angle representations and define an instance segmentation mIOU metric to analyze these representations.
The proposed model FisheyeDetNet with polygon outperforms others and achieves a mAP score of 49.5 % on Valeo fisheye surround-view dataset for automated driving applications.
arXiv Detail & Related papers (2024-04-20T18:50:57Z) - NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth
Supervision for Indoor Multi-View 3D Detection [72.0098999512727]
NeRF-Det has achieved impressive performance in indoor multi-view 3D detection by utilizing NeRF to enhance representation learning.
We present three corresponding solutions, including semantic enhancement, perspective-aware sampling, and ordinal depth supervision.
The resulting algorithm, NeRF-Det++, has exhibited appealing performance in the ScanNetV2 and AR KITScenes datasets.
arXiv Detail & Related papers (2024-02-22T11:48:06Z) - Large-Scale Person Detection and Localization using Overhead Fisheye
Cameras [40.004888590123954]
We present the first large-scale overhead fisheye dataset for person detection and localization.
We build a fisheye person detection network, which exploits the fisheye distortions by a rotation-equivariant training strategy.
Our whole fisheye positioning solution is able to locate all persons in FOV with an accuracy of 0.5 m, within 0.1 s.
arXiv Detail & Related papers (2023-07-17T05:36:01Z) - Deep Dynamic Scene Deblurring from Optical Flow [53.625999196063574]
Deblurring can provide visually more pleasant pictures and make photography more convenient.
It is difficult to model the non-uniform blur mathematically.
We develop a convolutional neural network (CNN) to restore the sharp images from the deblurred features.
arXiv Detail & Related papers (2023-01-18T06:37:21Z) - FuNNscope: Visual microscope for interactively exploring the loss
landscape of fully connected neural networks [77.34726150561087]
We show how to explore high-dimensional landscape characteristics of neural networks.
We generalize observations on small neural networks to more complex systems.
An interactive dashboard opens up a number of possible application networks.
arXiv Detail & Related papers (2022-04-09T16:41:53Z) - ARPD: Anchor-free Rotation-aware People Detection using Topview Fisheye
Camera [3.0868856870169625]
We propose ARPD, a single-stage anchor-free fully convolutional network to detect arbitrarily rotated people in fish-eye images.
Our method competes favorably with state-of-the-art algorithms while running significantly faster.
arXiv Detail & Related papers (2022-01-25T05:49:50Z) - DARDet: A Dense Anchor-free Rotated Object Detector in Aerial Images [11.45718985586972]
We propose a dense anchor-free rotated object detector (DARDet) for rotated object detection in aerial images.
Our DARDet directly predicts five parameters of rotated boxes at each foreground pixel of feature maps.
Our method achieves state-of-the-art performance on three commonly used aerial objects datasets.
arXiv Detail & Related papers (2021-10-03T15:28:14Z) - Data Augmentation for Object Detection via Differentiable Neural
Rendering [71.00447761415388]
It is challenging to train a robust object detector when annotated data is scarce.
Existing approaches to tackle this problem include semi-supervised learning that interpolates labeled data from unlabeled data.
We introduce an offline data augmentation method for object detection, which semantically interpolates the training data with novel views.
arXiv Detail & Related papers (2021-03-04T06:31:06Z) - Dense Label Encoding for Boundary Discontinuity Free Rotation Detection [69.75559390700887]
This paper explores a relatively less-studied methodology based on classification.
We propose new techniques to push its frontier in two aspects.
Experiments and visual analysis on large-scale public datasets for aerial images show the effectiveness of our approach.
arXiv Detail & Related papers (2020-11-19T05:42:02Z) - Towards Dense People Detection with Deep Learning and Depth images [9.376814409561726]
This paper proposes a DNN-based system that detects multiple people from a single depth image.
Our neural network processes a depth image and outputs a likelihood map in image coordinates.
We show this strategy to be effective, producing networks that generalize to work with scenes different from those used during training.
arXiv Detail & Related papers (2020-07-14T16:43:02Z) - DOPS: Learning to Detect 3D Objects and Predict their 3D Shapes [54.239416488865565]
We propose a fast single-stage 3D object detection method for LIDAR data.
The core novelty of our method is a fast, single-pass architecture that both detects objects in 3D and estimates their shapes.
We find that our proposed method achieves state-of-the-art results by 5% on object detection in ScanNet scenes, and it gets top results by 3.4% in the Open dataset.
arXiv Detail & Related papers (2020-04-02T17:48:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.