PP-YOLOv2: A Practical Object Detector
- URL: http://arxiv.org/abs/2104.10419v1
- Date: Wed, 21 Apr 2021 08:55:37 GMT
- Title: PP-YOLOv2: A Practical Object Detector
- Authors: Xin Huang, Xinxin Wang, Wenyu Lv, Xiaying Bai, Xiang Long, Kaipeng
Deng, Qingqing Dang, Shumin Han, Qiwen Liu, Xiaoguang Hu, Dianhai Yu, Yanjun
Ma, Osamu Yoshie
- Abstract summary: We evaluate a collection of existing refinements to improve the performance of PP-YOLO.
By combining multiple effective refinements, we boost PP-YOLO's performance from 45.9% mAP to 49.5% mAP.
In terms of speed, PP-YOLOv2 runs in 68.9FPS at 640x640 input size.
- Score: 13.262416549127664
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Being effective and efficient is essential to an object detector for
practical use. To meet these two concerns, we comprehensively evaluate a
collection of existing refinements to improve the performance of PP-YOLO while
almost keep the infer time unchanged. This paper will analyze a collection of
refinements and empirically evaluate their impact on the final model
performance through incremental ablation study. Things we tried that didn't
work will also be discussed. By combining multiple effective refinements, we
boost PP-YOLO's performance from 45.9% mAP to 49.5% mAP on COCO2017 test-dev.
Since a significant margin of performance has been made, we present PP-YOLOv2.
In terms of speed, PP-YOLOv2 runs in 68.9FPS at 640x640 input size. Paddle
inference engine with TensorRT, FP16-precision, and batch size = 1 further
improves PP-YOLOv2's infer speed, which achieves 106.5 FPS. Such a performance
surpasses existing object detectors with roughly the same amount of parameters
(i.e., YOLOv4-CSP, YOLOv5l). Besides, PP-YOLOv2 with ResNet101 achieves 50.3%
mAP on COCO2017 test-dev. Source code is at
https://github.com/PaddlePaddle/PaddleDetection.
Related papers
- YOLOv10: Real-Time End-to-End Object Detection [68.28699631793967]
YOLOs have emerged as the predominant paradigm in the field of real-time object detection.
The reliance on the non-maximum suppression (NMS) for post-processing hampers the end-to-end deployment of YOLOs.
We introduce the holistic efficiency-accuracy driven model design strategy for YOLOs.
arXiv Detail & Related papers (2024-05-23T11:44:29Z) - YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time
Object Detection [80.11152626362109]
We provide an efficient and performant object detector, termed YOLO-MS.
We train our YOLO-MS on the MS COCO dataset from scratch without relying on any other large-scale datasets.
Our work can also be used as a plug-and-play module for other YOLO models.
arXiv Detail & Related papers (2023-08-10T10:12:27Z) - EdgeYOLO: An Edge-Real-Time Object Detector [69.41688769991482]
This paper proposes an efficient, low-complexity and anchor-free object detector based on the state-of-the-art YOLO framework.
We develop an enhanced data augmentation method to effectively suppress overfitting during training, and design a hybrid random loss function to improve the detection accuracy of small objects.
Our baseline model can reach the accuracy of 50.6% AP50:95 and 69.8% AP50 in MS 2017 dataset, 26.4% AP50:95 and 44.8% AP50 in VisDrone 2019-DET dataset, and it meets real-time requirements (FPS>=30) on edge-computing device Nvidia
arXiv Detail & Related papers (2023-02-15T06:05:14Z) - PP-YOLOE-R: An Efficient Anchor-Free Rotated Object Detector [14.263912554269435]
PP-YOLOE-R is an anchor-free rotated object detector based on PP-YOLOE.
PP-YOLOE-R-l and PP-YOLOE-R-x achieve 78.14 and 78.28 mAP on DOTA 1.0 dataset with single-scale training and testing.
PP-YOLOE-R-x surpasses all anchor-free methods and demonstrates competitive performance to state-of-the-art anchor-based two-stage models.
arXiv Detail & Related papers (2022-11-04T11:38:30Z) - PP-YOLOE: An evolved version of YOLO [4.9022682894446685]
We present PP-YOLOE, an industrial state-of-the-art object detector with high performance and friendly deployment.
We optimize on the basis of the previous PP-YOLOv2, using anchor-free paradigm, more powerful backbone and neck equipped with CSPRepResStage, ET-head and dynamic label assignment algorithm TAL.
As a result, PP-YOLOE-l achieves 51.4 mAP on COCO test-dev and 78.1 FPS on Tesla V100, yielding a remarkable improvement of (+1.9 AP, +13.35% speed up) and (+1.3 AP, +24.96%
arXiv Detail & Related papers (2022-03-30T12:31:39Z) - Rethinking Keypoint Representations: Modeling Keypoints and Poses as
Objects for Multi-Person Human Pose Estimation [79.78017059539526]
We propose a new heatmap-free keypoint estimation method in which individual keypoints and sets of spatially related keypoints (i.e., poses) are modeled as objects within a dense single-stage anchor-based detection framework.
In experiments, we observe that KAPAO is significantly faster and more accurate than previous methods, which suffer greatly from heatmap post-processing.
Our large model, KAPAO-L, achieves an AP of 70.6 on the Microsoft COCO Keypoints validation set without test-time augmentation.
arXiv Detail & Related papers (2021-11-16T15:36:44Z) - PP-PicoDet: A Better Real-Time Object Detector on Mobile Devices [13.62426382827205]
PP-PicoDet family of real-time object detectors achieves superior performance on object detection for mobile devices.
Models achieve better trade-offs between accuracy and latency compared to other popular models.
arXiv Detail & Related papers (2021-11-01T12:53:17Z) - PP-YOLO: An Effective and Efficient Implementation of Object Detector [44.189808709103865]
This paper implements an object detector with relatively balanced effectiveness and efficiency.
Considering that YOLOv3 has been widely used in practice, we develop a new object detector based on YOLOv3.
Since all experiments in this paper are conducted based on PaddlePaddle, we call it PP-YOLO.
arXiv Detail & Related papers (2020-07-23T16:06:16Z) - RepPoints V2: Verification Meets Regression for Object Detection [65.120827759348]
We introduce verification tasks into the localization prediction of RepPoints.
RepPoints v2 provides consistent improvements of about 2.0 mAP over the original RepPoints.
We show that the proposed approach can more generally elevate other object detection frameworks as well as applications such as instance segmentation.
arXiv Detail & Related papers (2020-07-16T17:57:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.