Efficient Cloud-edge Collaborative Inference for Object
Re-identification
- URL: http://arxiv.org/abs/2401.02041v1
- Date: Thu, 4 Jan 2024 02:56:50 GMT
- Title: Efficient Cloud-edge Collaborative Inference for Object
Re-identification
- Authors: Chuanming Wang, Yuxin Yang, Mengshi Qi, Huadong Ma
- Abstract summary: We pioneer a cloud-edge collaborative inference framework for ReID systems.
We propose a distribution-aware correlation modeling network (DaCM) to make the desired image return to the cloud server.
DaCM embeds the spatial-temporal correlations implicitly included in the timestamps into a graph structure, and it can be applied in the cloud to regulate the size of the upload window.
- Score: 27.952445808987036
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Current object re-identification (ReID) systems follow the centralized
processing paradigm, i.e., all computations are conducted in the cloud server
and edge devices are only used to capture and send images. As the volume of
video grows rapidly, this paradigm has become impractical because computational
resources are finite. In such a scenario, the ReID system
should be converted to fit in the cloud-edge collaborative processing paradigm,
which is crucial to boost the scalability and practicality of ReID systems.
However, current relevant work lacks research on this issue, making it
challenging for ReID methods to be adapted effectively. Therefore, we pioneer a
cloud-edge collaborative inference framework for ReID systems and particularly
propose a distribution-aware correlation modeling network (DaCM) to make the
desired image return to the cloud server as soon as possible via learning to
model the spatial-temporal correlations among instances. DaCM embeds the
spatial-temporal correlations implicitly included in the timestamps into a
graph structure, and it can be applied in the cloud to regulate the size of the
upload window and on the edge device to adjust the sequence of images,
respectively. Traditional ReID methods can be combined with DaCM seamlessly,
enabling their application within our proposed edge-cloud collaborative
framework. Extensive experiments demonstrate that our method clearly reduces
transmission overhead and significantly improves performance. We will release
our code and model.
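
The abstract describes two roles for the correlation model: ordering images on the edge device and limiting the upload window from the cloud. The following is a minimal sketch of that idea only, not the authors' DaCM. It swaps the learned graph for a hand-written Gaussian transit-time score, and every name in it (`Candidate`, `TRANSIT`, `correlation_score`, `order_for_upload`, the camera ids and the statistics) is a hypothetical placeholder chosen for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class Candidate:
    image_id: str
    camera: str
    timestamp: float  # capture time in seconds


# Hypothetical transit statistics (mean, std in seconds) between camera pairs.
# In the paper this role is played by a learned model; here it is hard-coded.
TRANSIT = {
    ("cam_a", "cam_b"): (60.0, 15.0),
    ("cam_a", "cam_c"): (180.0, 40.0),
}


def correlation_score(query_camera, query_time, cand):
    """Score how plausible it is that `cand` shows the queried object,
    using only the time gap between the two sightings."""
    stats = TRANSIT.get((query_camera, cand.camera))
    if stats is None:
        return 0.0  # no known path between the two cameras
    mean, std = stats
    gap = cand.timestamp - query_time
    return math.exp(-0.5 * ((gap - mean) / std) ** 2)


def order_for_upload(query_camera, query_time, candidates, window):
    """Edge side: rank candidates by score (highest first) and keep only
    the first `window` images, where `window` is set by the cloud."""
    ranked = sorted(
        candidates,
        key=lambda c: correlation_score(query_camera, query_time, c),
        reverse=True,
    )
    return ranked[:window]


candidates = [
    Candidate("img_1", "cam_b", 10.0),
    Candidate("img_2", "cam_b", 65.0),
    Candidate("img_3", "cam_c", 170.0),
    Candidate("img_4", "cam_b", 300.0),
]
uploaded = order_for_upload("cam_a", 0.0, candidates, window=2)
print([c.image_id for c in uploaded])
```

In this toy setup, shrinking `window` reduces how many images leave the edge device, while the ordering decides which ones go first. How the actual method learns the correlations and sets the window size is not specified in the abstract.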
Related papers
- Task-Oriented Real-time Visual Inference for IoVT Systems: A Co-design Framework of Neural Networks and Edge Deployment [61.20689382879937]
Task-oriented edge computing addresses this by shifting data analysis to the edge.
Existing methods struggle to balance high model performance with low resource consumption.
We propose a novel co-design framework to optimize neural network architecture.
arXiv Detail & Related papers (2024-10-29T19:02:54Z) - Representation Similarity: A Better Guidance of DNN Layer Sharing for Edge Computing without Training [3.792729116385123]
We propose a new model merging scheme by sharing representations at the edge, guided by representation similarity S.
We show that S correlates more strongly with the merged model's accuracy than other metrics, with Pearson correlation coefficient |r| > 0.94.
arXiv Detail & Related papers (2024-10-15T03:35:54Z) - GERA: Geometric Embedding for Efficient Point Registration Analysis [20.690695788384517]
We propose a novel point cloud registration network that leverages a pure geometric architecture, constructing geometric information offline.
Our method is the first to replace 3D coordinate inputs with offline-constructed geometric encoding, improving generalization and stability.
arXiv Detail & Related papers (2024-10-01T11:19:56Z) - Binarized Diffusion Model for Image Super-Resolution [61.963833405167875]
Binarization, an ultra-compression algorithm, offers the potential for effectively accelerating advanced diffusion models (DMs).
Existing binarization methods result in significant performance degradation.
We introduce a novel binarized diffusion model, BI-DiffSR, for image SR.
arXiv Detail & Related papers (2024-06-09T10:30:25Z) - Collaborative Feedback Discriminative Propagation for Video Super-Resolution [66.61201445650323]
The success of video super-resolution (VSR) methods stems mainly from exploring spatial and temporal information.
Inaccurate alignment usually leads to aligned features with significant artifacts.
Propagation modules only propagate the same timestep features forward or backward.
arXiv Detail & Related papers (2024-04-06T22:08:20Z) - A Bayesian Framework of Deep Reinforcement Learning for Joint O-RAN/MEC
Orchestration [12.914011030970814]
Multi-access Edge Computing (MEC) can be implemented together with Open Radio Access Network (O-RAN) over commodity platforms to offer low-cost deployment.
In this paper, a joint O-RAN/MEC orchestration using a Bayesian deep reinforcement learning (RL)-based framework is proposed.
arXiv Detail & Related papers (2023-12-26T18:04:49Z) - Decoupled Greedy Learning of CNNs for Synchronous and Asynchronous
Distributed Learning [3.7722254371820987]
We consider a simple alternative based on minimal feedback, which we call Decoupled Greedy Learning (DGL).
It is based on a classic greedy relaxation of the joint training objective, recently shown to be effective in the context of Convolutional Neural Networks (CNNs) on large-scale image classification.
We show theoretically and empirically that this approach converges and compare it to the sequential solvers.
arXiv Detail & Related papers (2021-06-11T13:55:17Z) - Tailored Learning-Based Scheduling for Kubernetes-Oriented Edge-Cloud
System [54.588242387136376]
We introduce KaiS, a learning-based scheduling framework for edge-cloud systems.
First, we design a coordinated multi-agent actor-critic algorithm to cater to decentralized request dispatch.
Second, for diverse system scales and structures, we use graph neural networks to embed system state information.
Third, we adopt a two-time-scale scheduling mechanism to harmonize request dispatch and service orchestration.
arXiv Detail & Related papers (2021-01-17T03:45:25Z) - Identity-Aware Attribute Recognition via Real-Time Distributed Inference
in Mobile Edge Clouds [53.07042574352251]
We design novel models for pedestrian attribute recognition with re-ID in an MEC-enabled camera monitoring system.
We propose a novel inference framework with a set of distributed modules, by jointly considering the attribute recognition and person re-ID.
We then devise a learning-based algorithm for the distributions of the modules of the proposed distributed inference framework.
arXiv Detail & Related papers (2020-08-12T12:03:27Z) - Joint Self-Attention and Scale-Aggregation for Self-Calibrated Deraining
Network [13.628218953897946]
In this paper, we propose an effective algorithm, called JDNet, to solve the single image deraining problem.
By carefully designing the Scale-Aggregation and Self-Attention modules with Self-Calibrated convolution, the proposed model achieves better deraining results.
arXiv Detail & Related papers (2020-08-06T17:04:34Z) - Deep Adaptive Inference Networks for Single Image Super-Resolution [72.7304455761067]
Single image super-resolution (SISR) has witnessed tremendous progress in recent years owing to the deployment of deep convolutional neural networks (CNNs).
In this paper, we take a step forward to address this issue by leveraging the adaptive inference networks for deep SISR (AdaDSR).
Our AdaDSR involves an SISR model as backbone and a lightweight adapter module which takes image features and resource constraint as input and predicts a map of local network depth.
arXiv Detail & Related papers (2020-04-08T10:08:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.