Decentralized Autofocusing System with Hierarchical Agents
- URL: http://arxiv.org/abs/2108.12842v1
- Date: Sun, 29 Aug 2021 13:45:15 GMT
- Title: Decentralized Autofocusing System with Hierarchical Agents
- Authors: Anna Anikina, Oleg Y. Rogov and Dmitry V. Dylov
- Abstract summary: We propose a hierarchical multi-agent deep reinforcement learning approach for intelligently controlling the camera and the lens focusing settings.
The algorithm relies on the latent representation of the camera's stream and, thus, it is the first method to allow a completely no-reference tuning of the camera.
- Score: 2.7716102039510564
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: State-of-the-art object detection models are frequently trained offline using
available datasets, such as ImageNet: large and overly diverse data that are
unbalanced and hard to cluster semantically. This kind of training drops the
object detection performance should the change in illumination, in the
environmental conditions (e.g., rain), or in the lens positioning (out-of-focus
blur) occur. We propose a decentralized hierarchical multi-agent deep
reinforcement learning approach for intelligently controlling the camera and
the lens focusing settings, leading to significant improvement to the capacity
of the popular detection models (YOLO, Fast R-CNN, and Retina are considered).
The algorithm relies on the latent representation of the camera's stream and,
thus, it is the first method to allow a completely no-reference tuning of the
camera, where the system trains itself to auto-focus itself.
Related papers
- VICAN: Very Efficient Calibration Algorithm for Large Camera Networks [49.17165360280794]
We introduce a novel methodology that extends Pose Graph Optimization techniques.
We consider the bipartite graph encompassing cameras, object poses evolving dynamically, and camera-object relative transformations at each time step.
Our framework retains compatibility with traditional PGO solvers, but its efficacy benefits from a custom-tailored optimization scheme.
arXiv Detail & Related papers (2024-03-25T17:47:03Z) - Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - Aberration-Aware Depth-from-Focus [20.956132508261664]
We investigate the domain gap caused by off-axis aberrations that will affect the decision of the best-focused frame in a focal stack.
We then explore bridging this domain gap through aberration-aware training (AAT)
Our approach involves a lightweight network that models lens aberrations at different positions and focus distances, which is then integrated into the conventional network training pipeline.
arXiv Detail & Related papers (2023-03-08T15:21:33Z) - A Novel Hierarchical-Classification-Block Based Convolutional Neural
Network for Source Camera Model Identification [0.5482532589225552]
Different camera brands left behind different intrinsic processing noises which can be used to identify the camera brand.
One prominent solution is to utilize a hierarchical classification system rather than the traditional single-classifier approach.
We propose a classifier-block-level hierarchical system instead of a network-level one for source camera model classification.
arXiv Detail & Related papers (2022-12-08T09:28:51Z) - Camera Alignment and Weighted Contrastive Learning for Domain Adaptation
in Video Person ReID [17.90248359024435]
Systems for person re-identification (ReID) can achieve a high accuracy when trained on large fully-labeled image datasets.
The domain shift associated with diverse operational capture conditions (e.g., camera viewpoints and lighting) may translate to a significant decline in performance.
This paper focuses on unsupervised domain adaptation (UDA) for video-based ReID.
arXiv Detail & Related papers (2022-11-07T15:32:56Z) - Centralized Feature Pyramid for Object Detection [53.501796194901964]
Visual feature pyramid has shown its superiority in both effectiveness and efficiency in a wide range of applications.
In this paper, we propose a OLO Feature Pyramid for object detection, which is based on a globally explicit centralized feature regulation.
arXiv Detail & Related papers (2022-10-05T08:32:54Z) - Exploring Resolution and Degradation Clues as Self-supervised Signal for
Low Quality Object Detection [77.3530907443279]
We propose a novel self-supervised framework to detect objects in degraded low resolution images.
Our methods has achieved superior performance compared with existing methods when facing variant degradation situations.
arXiv Detail & Related papers (2022-08-05T09:36:13Z) - A Quality Index Metric and Method for Online Self-Assessment of
Autonomous Vehicles Sensory Perception [164.93739293097605]
We propose a novel evaluation metric, named as the detection quality index (DQI), which assesses the performance of camera-based object detection algorithms.
We have developed a superpixel-based attention network (SPA-NET) that utilizes raw image pixels and superpixels as input to predict the proposed DQI evaluation metric.
arXiv Detail & Related papers (2022-03-04T22:16:50Z) - Learning to Estimate Kernel Scale and Orientation of Defocus Blur with
Asymmetric Coded Aperture [19.472377706422474]
Consistent in-focus input imagery is an essential precondition for machine vision systems to perceive the dynamic environment.
A defocus blur severely degrades the performance of vision systems.
We propose a deep-learning-based framework estimating the kernel scale and orientation of the defocus blur to adjust lens focus rapidly.
arXiv Detail & Related papers (2021-03-10T03:12:15Z) - ClusterVO: Clustering Moving Instances and Estimating Visual Odometry
for Self and Surroundings [54.33327082243022]
ClusterVO is a stereo Visual Odometry which simultaneously clusters and estimates the motion of both ego and surrounding rigid clusters/objects.
Unlike previous solutions relying on batch input or imposing priors on scene structure or dynamic object models, ClusterVO is online, general and thus can be used in various scenarios including indoor scene understanding and autonomous driving.
arXiv Detail & Related papers (2020-03-29T09:06:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.