AutoPose: Searching Multi-Scale Branch Aggregation for Pose Estimation
- URL: http://arxiv.org/abs/2008.07018v1
- Date: Sun, 16 Aug 2020 22:27:43 GMT
- Title: AutoPose: Searching Multi-Scale Branch Aggregation for Pose Estimation
- Authors: Xinyu Gong, Wuyang Chen, Yifan Jiang, Ye Yuan, Xianming Liu, Qian
Zhang, Yuan Li, Zhangyang Wang
- Abstract summary: We present AutoPose, a novel neural architecture search(NAS) framework.
It is capable of automatically discovering multiple parallel branches of cross-scale connections towards accurate and high-resolution 2D human pose estimation.
- Score: 96.29533512606078
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present AutoPose, a novel neural architecture search(NAS) framework that
is capable of automatically discovering multiple parallel branches of
cross-scale connections towards accurate and high-resolution 2D human pose
estimation. Recently, high-performance hand-crafted convolutional networks for
pose estimation show growing demands on multi-scale fusion and high-resolution
representations. However, current NAS works exhibit limited flexibility on
scale searching, they dominantly adopt simplified search spaces of
single-branch architectures. Such simplification limits the fusion of
information at different scales and fails to maintain high-resolution
representations. The presentedAutoPose framework is able to search for
multi-branch scales and network depth, in addition to the cell-level
microstructure. Motivated by the search space, a novel bi-level optimization
method is presented, where the network-level architecture is searched via
reinforcement learning, and the cell-level search is conducted by the
gradient-based method. Within 2.5 GPU days, AutoPose is able to find very
competitive architectures on the MS COCO dataset, that are also transferable to
the MPII dataset. Our code is available at
https://github.com/VITA-Group/AutoPose.
Related papers
- SimQ-NAS: Simultaneous Quantization Policy and Neural Architecture
Search [6.121126813817338]
Recent one-shot Neural Architecture Search algorithms rely on training a hardware-agnostic super-network tailored to a specific task and then extracting efficient sub-networks for different hardware platforms.
We show that by using multi-objective search algorithms paired with lightly trained predictors, we can efficiently search for both the sub-network architecture and the corresponding quantization policy.
arXiv Detail & Related papers (2023-12-19T22:08:49Z) - HKNAS: Classification of Hyperspectral Imagery Based on Hyper Kernel
Neural Architecture Search [104.45426861115972]
We propose to directly generate structural parameters by utilizing the specifically designed hyper kernels.
We obtain three kinds of networks to separately conduct pixel-level or image-level classifications with 1-D or 3-D convolutions.
A series of experiments on six public datasets demonstrate that the proposed methods achieve state-of-the-art results.
arXiv Detail & Related papers (2023-04-23T17:27:40Z) - Multi-Objective Evolutionary for Object Detection Mobile Architectures
Search [21.14296703753317]
We propose a mobile object detection backbone network architecture search algorithm based on non-dominated sorting for NAS scenarios.
The proposed approach can search the backbone networks with different depths, widths, or expansion sizes via a technique of weight mapping.
Under similar computational complexity, the accuracy of the backbone network architecture we search for is 2.0% mAP higher than MobileDet.
arXiv Detail & Related papers (2022-11-05T00:28:49Z) - Adaptive Linear Span Network for Object Skeleton Detection [56.78705071830965]
We propose adaptive linear span network (AdaLSN) to automatically configure and integrate scale-aware features for object skeleton detection.
AdaLSN substantiates its versatility by achieving significantly higher accuracy and latency trade-off.
It also demonstrates general applicability to image-to-mask tasks such as edge detection and road extraction.
arXiv Detail & Related papers (2020-11-08T12:51:14Z) - Auto-Panoptic: Cooperative Multi-Component Architecture Search for
Panoptic Segmentation [144.50154657257605]
We propose an efficient framework to simultaneously search for all main components including backbone, segmentation branches, and feature fusion module.
Our searched architecture, namely Auto-Panoptic, achieves the new state-of-the-art on the challenging COCO and ADE20K benchmarks.
arXiv Detail & Related papers (2020-10-30T08:34:35Z) - Real-Time Semantic Segmentation via Auto Depth, Downsampling Joint
Decision and Feature Aggregation [54.28963233377946]
We propose a joint search framework, called AutoRTNet, to automate the design of segmentation strategies.
Specifically, we propose hyper-cells to jointly decide the network depth and downsampling strategy, and an aggregation cell to achieve automatic multi-scale feature aggregation.
arXiv Detail & Related papers (2020-03-31T14:02:25Z) - DCNAS: Densely Connected Neural Architecture Search for Semantic Image
Segmentation [44.46852065566759]
We propose a Densely Connected NAS (DCNAS) framework, which directly searches the optimal network structures for the multi-scale representations of visual information.
Specifically, by connecting cells with each other using learnable weights, we introduce a densely connected search space to cover an abundance of mainstream network designs.
We demonstrate that the architecture obtained from our DCNAS algorithm achieves state-of-the-art performances on public semantic image segmentation benchmarks.
arXiv Detail & Related papers (2020-03-26T13:21:33Z) - NAS-Count: Counting-by-Density with Neural Architecture Search [74.92941571724525]
We automate the design of counting models with Neural Architecture Search (NAS)
We introduce an end-to-end searched encoder-decoder architecture, Automatic Multi-Scale Network (AMSNet)
arXiv Detail & Related papers (2020-02-29T09:18:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.