ProCrop: Learning Aesthetic Image Cropping from Professional Compositions
- URL: http://arxiv.org/abs/2505.22490v1
- Date: Wed, 28 May 2025 15:38:44 GMT
- Title: ProCrop: Learning Aesthetic Image Cropping from Professional Compositions
- Authors: Ke Zhang, Tianyu Ding, Jiachen Jiang, Tianyi Chen, Ilya Zharkov, Vishal M. Patel, Luming Liang,
- Abstract summary: ProCrop is a retrieval-based method that leverages professional photography to guide cropping decisions.<n>We present a large-scale dataset of 242K weakly-annotated images, generated by out-painting professional images.<n>This composition-aware dataset generation offers diverse high-quality crop proposals guided by aesthetic principles.
- Score: 57.949730056500634
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Image cropping is crucial for enhancing the visual appeal and narrative impact of photographs, yet existing rule-based and data-driven approaches often lack diversity or require annotated training data. We introduce ProCrop, a retrieval-based method that leverages professional photography to guide cropping decisions. By fusing features from professional photographs with those of the query image, ProCrop learns from professional compositions, significantly boosting performance. Additionally, we present a large-scale dataset of 242K weakly-annotated images, generated by out-painting professional images and iteratively refining diverse crop proposals. This composition-aware dataset generation offers diverse high-quality crop proposals guided by aesthetic principles and becomes the largest publicly available dataset for image cropping. Extensive experiments show that ProCrop significantly outperforms existing methods in both supervised and weakly-supervised settings. Notably, when trained on the new dataset, our ProCrop surpasses previous weakly-supervised methods and even matches fully supervised approaches. Both the code and dataset will be made publicly available to advance research in image aesthetics and composition analysis.
Related papers
- Photography Perspective Composition: Towards Aesthetic Perspective Recommendation [8.915832522709529]
Traditional photography composition approaches are dominated by 2D cropping-based methods.<n>Professional photographers often employ perspective adjustment as a form of 3D recomposition.<n>We propose photography perspective composition (PPC), extending beyond traditional cropping-based methods.
arXiv Detail & Related papers (2025-05-27T03:04:48Z) - INRetouch: Context Aware Implicit Neural Representation for Photography Retouching [54.17599183365242]
We propose a novel retouch transfer approach that learns from professional edits through before-after image pairs.<n>We develop a context-aware Implicit Neural Representation that learns to apply edits adaptively based on image content and context.<n>Our method extracts implicit transformations from reference edits and adaptively applies them to new images.
arXiv Detail & Related papers (2024-12-05T03:31:48Z) - Cropper: Vision-Language Model for Image Cropping through In-Context Learning [54.34593968212081]
We propose an effective approach to leverage large vision-language models (VLMs) for image cropping.<n>First, we propose an efficient prompt retrieval mechanism for image cropping to automate the selection of in-context examples.<n>Second, we introduce an iterative refinement strategy to iteratively enhance the predicted crops.<n>The proposed framework, we refer to as Cropper, is applicable to a wide range of cropping tasks, including free-form cropping, subject-aware cropping, and aspect ratio-aware cropping.
arXiv Detail & Related papers (2024-08-14T20:03:03Z) - Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model [80.61157097223058]
A prevalent strategy to bolster image classification performance is through augmenting the training set with synthetic images generated by T2I models.
In this study, we scrutinize the shortcomings of both current generative and conventional data augmentation techniques.
We introduce an innovative inter-class data augmentation method known as Diff-Mix, which enriches the dataset by performing image translations between classes.
arXiv Detail & Related papers (2024-03-28T17:23:45Z) - Learning Subject-Aware Cropping by Outpainting Professional Photos [69.0772948657867]
We propose a weakly-supervised approach to learn what makes a high-quality subject-aware crop from professional stock images.
Our insight is to combine a library of stock images with a modern, pre-trained text-to-image diffusion model.
We are able to automatically generate a large dataset of cropped-uncropped training pairs to train a cropping model.
arXiv Detail & Related papers (2023-12-19T11:57:54Z) - Identifying Professional Photographers Through Image Quality and
Aesthetics in Flickr [0.0]
This study reveals the lack of suitable data sets in photo and video sharing platforms.
We created one of the largest labelled data sets in Flickr with the multimodal data which has been open sourced.
We examined the relationship between the aesthetics and technical quality of a picture and the social activity of that picture.
arXiv Detail & Related papers (2023-07-04T14:55:37Z) - Procedural Image Programs for Representation Learning [62.557911005179946]
We propose training with a large dataset of twenty-one thousand programs, each one generating a diverse set of synthetic images.
These programs are short code snippets, which are easy to modify and fast to execute using.
The proposed dataset can be used for both supervised and unsupervised representation learning, and reduces the gap between pre-training with real and procedurally generated images by 38%.
arXiv Detail & Related papers (2022-11-29T17:34:22Z) - Camera View Adjustment Prediction for Improving Image Composition [14.541539156817045]
We propose a deep learning-based approach that provides suggestions to the photographer on how to adjust the camera view before capturing.
By optimizing the composition before a photo is captured, our system helps photographers to capture better photos.
arXiv Detail & Related papers (2021-04-15T17:18:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.