Camera View Adjustment Prediction for Improving Image Composition
- URL: http://arxiv.org/abs/2104.07608v1
- Date: Thu, 15 Apr 2021 17:18:31 GMT
- Title: Camera View Adjustment Prediction for Improving Image Composition
- Authors: Yu-Chuan Su, Raviteja Vemulapalli, Ben Weiss, Chun-Te Chu, Philip
Andrew Mansfield, Lior Shapira, Colvin Pitts
- Abstract summary: We propose a deep learning-based approach that provides suggestions to the photographer on how to adjust the camera view before capturing.
By optimizing the composition before a photo is captured, our system helps photographers to capture better photos.
- Score: 14.541539156817045
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Image composition plays an important role in the quality of a photo. However,
not every camera user possesses the knowledge and expertise required for
capturing well-composed photos. While post-capture cropping can improve the
composition sometimes, it does not work in many common scenarios in which the
photographer needs to adjust the camera view to capture the best shot. To
address this issue, we propose a deep learning-based approach that provides
suggestions to the photographer on how to adjust the camera view before
capturing. By optimizing the composition before a photo is captured, our system
helps photographers to capture better photos. As there is no publicly-available
dataset for this task, we create a view adjustment dataset by repurposing
existing image cropping datasets. Furthermore, we propose a two-stage
semi-supervised approach that utilizes both labeled and unlabeled images for
training a view adjustment model. Experiment results show that the proposed
semi-supervised approach outperforms the corresponding supervised alternatives,
and our user study results show that the suggested view adjustment improves
image composition 79% of the time.
Related papers
- ProCrop: Learning Aesthetic Image Cropping from Professional Compositions [57.949730056500634]
ProCrop is a retrieval-based method that leverages professional photography to guide cropping decisions.<n>We present a large-scale dataset of 242K weakly-annotated images, generated by out-painting professional images.<n>This composition-aware dataset generation offers diverse high-quality crop proposals guided by aesthetic principles.
arXiv Detail & Related papers (2025-05-28T15:38:44Z) - Towards Smart Point-and-Shoot Photography [16.192062592740154]
We present a first of its kind smart point and shoot (SPAS) system to help users to take good photos.<n>Our SPAS proposes to help users to compose a good shot of a scene by automatically guiding the users to adjust the camera pose live on the scene.<n>We will present extensive results to demonstrate the performances of our SPAS system using publicly available image composition datasets.
arXiv Detail & Related papers (2025-05-06T15:40:14Z) - INRetouch: Context Aware Implicit Neural Representation for Photography Retouching [54.17599183365242]
We propose a novel retouch transfer approach that learns from professional edits through before-after image pairs.
We develop a context-aware Implicit Neural Representation that learns to apply edits adaptively based on image content and context.
Our method extracts implicit transformations from reference edits and adaptively applies them to new images.
arXiv Detail & Related papers (2024-12-05T03:31:48Z) - Exposure Bracketing is All You Need for Unifying Image Restoration and Enhancement Tasks [50.822601495422916]
We propose to utilize exposure bracketing photography to unify image restoration and enhancement tasks.
Due to the difficulty in collecting real-world pairs, we suggest a solution that first pre-trains the model with synthetic paired data.
In particular, a temporally modulated recurrent network (TMRNet) and self-supervised adaptation method are proposed.
arXiv Detail & Related papers (2024-01-01T14:14:35Z) - Beyond Image Borders: Learning Feature Extrapolation for Unbounded Image
Composition [80.14697389188143]
We present a joint framework for both recommendation of camera view and image composition (i.e., UNIC)
Specifically, our framework takes the current camera preview frame as input and provides a recommendation for view adjustment.
Our method converges and results in both a camera view and a bounding box showing the image composition recommendation.
arXiv Detail & Related papers (2023-09-21T13:10:28Z) - Photoswap: Personalized Subject Swapping in Images [56.2650908740358]
Photoswap learns the visual concept of the subject from reference images and swaps it into the target image using pre-trained diffusion models.
Photoswap significantly outperforms baseline methods in human ratings across subject swapping, background preservation, and overall quality.
arXiv Detail & Related papers (2023-05-29T17:56:13Z) - Enhanced Stable View Synthesis [86.69338893753886]
We introduce an approach to enhance the novel view synthesis from images taken from a freely moving camera.
The introduced approach focuses on outdoor scenes where recovering accurate geometric scaffold and camera pose is challenging.
arXiv Detail & Related papers (2023-03-30T01:53:14Z) - Controllable Image Enhancement [66.18525728881711]
We present a semiautomatic image enhancement algorithm that can generate high-quality images with multiple styles by controlling a few parameters.
An encoder-decoder framework encodes the retouching skills into latent codes and decodes them into the parameters of image signal processing functions.
arXiv Detail & Related papers (2022-06-16T23:54:53Z) - PhotoApp: Photorealistic Appearance Editing of Head Portraits [97.23638022484153]
We present an approach for high-quality intuitive editing of the camera viewpoint and scene illumination in a portrait image.
Most editing approaches rely on supervised learning using training data captured with setups such as light and camera stages.
We design a supervised problem which learns in the latent space of StyleGAN.
This combines the best of supervised learning and generative adversarial modeling.
arXiv Detail & Related papers (2021-03-13T08:59:49Z) - NICER: Aesthetic Image Enhancement with Humans in the Loop [0.7756211500979312]
This work proposes a neural network based approach to no-reference image enhancement in a fully-, semi-automatic or fully manual process.
We show that NICER can improve image aesthetics without user interaction and that allowing user interaction leads to diverse enhancement outcomes.
arXiv Detail & Related papers (2020-12-03T09:14:10Z) - IMU-Assisted Learning of Single-View Rolling Shutter Correction [16.242924916178282]
Rolling shutter distortion is highly undesirable for photography and computer vision algorithms.
We propose a deep neural network to predict depth and row-wise pose from a single image for rolling shutter correction.
arXiv Detail & Related papers (2020-11-05T21:33:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.