PP-YOLO: An Effective and Efficient Implementation of Object Detector
- URL: http://arxiv.org/abs/2007.12099v3
- Date: Mon, 3 Aug 2020 03:53:24 GMT
- Title: PP-YOLO: An Effective and Efficient Implementation of Object Detector
- Authors: Xiang Long, Kaipeng Deng, Guanzhong Wang, Yang Zhang, Qingqing Dang,
Yuan Gao, Hui Shen, Jianguo Ren, Shumin Han, Errui Ding, Shilei Wen
- Abstract summary: This paper implements an object detector with relatively balanced effectiveness and efficiency.
Considering that YOLOv3 has been widely used in practice, we develop a new object detector based on YOLOv3.
Since all experiments in this paper are conducted based on PaddlePaddle, we call it PP-YOLO.
- Score: 44.189808709103865
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object detection is one of the most important areas in computer vision, which
plays a key role in various practical scenarios. Due to limitation of hardware,
it is often necessary to sacrifice accuracy to ensure the infer speed of the
detector in practice. Therefore, the balance between effectiveness and
efficiency of object detector must be considered. The goal of this paper is to
implement an object detector with relatively balanced effectiveness and
efficiency that can be directly applied in actual application scenarios, rather
than propose a novel detection model. Considering that YOLOv3 has been widely
used in practice, we develop a new object detector based on YOLOv3. We mainly
try to combine various existing tricks that almost not increase the number of
model parameters and FLOPs, to achieve the goal of improving the accuracy of
detector as much as possible while ensuring that the speed is almost unchanged.
Since all experiments in this paper are conducted based on PaddlePaddle, we
call it PP-YOLO. By combining multiple tricks, PP-YOLO can achieve a better
balance between effectiveness (45.2% mAP) and efficiency (72.9 FPS), surpassing
the existing state-of-the-art detectors such as EfficientDet and YOLOv4.Source
code is at https://github.com/PaddlePaddle/PaddleDetection.
Related papers
- YOLOv10: Real-Time End-to-End Object Detection [68.28699631793967]
YOLOs have emerged as the predominant paradigm in the field of real-time object detection.
The reliance on the non-maximum suppression (NMS) for post-processing hampers the end-to-end deployment of YOLOs.
We introduce the holistic efficiency-accuracy driven model design strategy for YOLOs.
arXiv Detail & Related papers (2024-05-23T11:44:29Z) - MODIPHY: Multimodal Obscured Detection for IoT using PHantom Convolution-Enabled Faster YOLO [10.183459286746196]
We introduce YOLO Phantom, one of the smallest YOLO models ever conceived.
YOLO Phantom achieves comparable accuracy to the latest YOLOv8n model while simultaneously reducing both parameters and model size.
Its real-world efficacy is demonstrated on an IoT platform with advanced low-light and RGB cameras, seamlessly connecting to an AWS-based notification endpoint.
arXiv Detail & Related papers (2024-02-12T18:56:53Z) - YOLO-World: Real-Time Open-Vocabulary Object Detection [87.08732047660058]
We introduce YOLO-World, an innovative approach that enhances YOLO with open-vocabulary detection capabilities.
Our method excels in detecting a wide range of objects in a zero-shot manner with high efficiency.
YOLO-World achieves 35.4 AP with 52.0 FPS on V100, which outperforms many state-of-the-art methods in terms of both accuracy and speed.
arXiv Detail & Related papers (2024-01-30T18:59:38Z) - YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time
Object Detection [80.11152626362109]
We provide an efficient and performant object detector, termed YOLO-MS.
We train our YOLO-MS on the MS COCO dataset from scratch without relying on any other large-scale datasets.
Our work can also be used as a plug-and-play module for other YOLO models.
arXiv Detail & Related papers (2023-08-10T10:12:27Z) - YOLOv6: A Single-Stage Object Detection Framework for Industrial
Applications [16.047499394184985]
YOLOv6-N hits 35.9% AP on the COCO dataset at a throughput of 1234 FPS on an NVIDIA Tesla T4 GPU.
YOLOv6-S strikes 43.5% AP at 495 FPS, outperforming other mainstream detectors at the same scale.
YOLOv6-M/L achieves better accuracy performance (i.e., 49.5%/52.3%) than other detectors with a similar inference speed.
arXiv Detail & Related papers (2022-09-07T07:47:58Z) - StreamYOLO: Real-time Object Detection for Streaming Perception [84.2559631820007]
We endow the models with the capacity of predicting the future, significantly improving the results for streaming perception.
We consider multiple velocities driving scene and propose Velocity-awared streaming AP (VsAP) to jointly evaluate the accuracy.
Our simple method achieves the state-of-the-art performance on Argoverse-HD dataset and improves the sAP and VsAP by 4.7% and 8.2% respectively.
arXiv Detail & Related papers (2022-07-21T12:03:02Z) - A lightweight and accurate YOLO-like network for small target detection
in Aerial Imagery [94.78943497436492]
We present YOLO-S, a simple, fast and efficient network for small target detection.
YOLO-S exploits a small feature extractor based on Darknet20, as well as skip connection, via both bypass and concatenation.
YOLO-S has an 87% decrease of parameter size and almost one half FLOPs of YOLOv3, making practical the deployment for low-power industrial applications.
arXiv Detail & Related papers (2022-04-05T16:29:49Z) - YOLO-Z: Improving small object detection in YOLOv5 for autonomous
vehicles [5.765622319599904]
This study explores how the popular YOLOv5 object detector can be modified to improve its performance in detecting smaller objects.
We propose a series of models at different scales, which we name YOLO-Z', and which display an improvement of up to 6.9% in mAP when detecting smaller objects at 50% IOU.
Our objective is to inform future research on the potential of adjusting a popular detector such as YOLOv5 to address specific tasks.
arXiv Detail & Related papers (2021-12-22T11:03:43Z) - PP-YOLOv2: A Practical Object Detector [13.262416549127664]
We evaluate a collection of existing refinements to improve the performance of PP-YOLO.
By combining multiple effective refinements, we boost PP-YOLO's performance from 45.9% mAP to 49.5% mAP.
In terms of speed, PP-YOLOv2 runs in 68.9FPS at 640x640 input size.
arXiv Detail & Related papers (2021-04-21T08:55:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.