Efficient Multi-Crop Saliency Partitioning for Automatic Image Cropping
- URL: http://arxiv.org/abs/2506.22814v1
- Date: Sat, 28 Jun 2025 08:32:53 GMT
- Title: Efficient Multi-Crop Saliency Partitioning for Automatic Image Cropping
- Authors: Andrew Hamara, Andrew C. Freeman,
- Abstract summary: We extend the Fixed Aspect Ratio Cropping algorithm to efficiently extract multiple non-overlapping crops in linear time.<n>Our approach dynamically adjusts attention thresholds and removes selected crops from consideration without recomputing the entire saliency map.
- Score: 0.6906005491572401
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Automatic image cropping aims to extract the most visually salient regions while preserving essential composition elements. Traditional saliency-aware cropping methods optimize a single bounding box, making them ineffective for applications requiring multiple disjoint crops. In this work, we extend the Fixed Aspect Ratio Cropping algorithm to efficiently extract multiple non-overlapping crops in linear time. Our approach dynamically adjusts attention thresholds and removes selected crops from consideration without recomputing the entire saliency map. We discuss qualitative results and introduce the potential for future datasets and benchmarks.
Related papers
- Foreground-Aware Dataset Distillation via Dynamic Patch Selection [56.565143366562495]
We propose a foreground-aware dataset distillation method that enhances patch selection in a content-adaptive manner.<n>Experiments on multiple benchmarks show that the proposed method consistently improves distillation performance over existing approaches.
arXiv Detail & Related papers (2026-01-06T05:44:02Z) - CropNeRF: A Neural Radiance Field-Based Framework for Crop Counting [9.600466490978665]
We introduce a novel crop counting framework designed for exact enumeration via 3D instance segmentation.<n>Our approach utilizes 2D images captured from multiple viewpoints and associates independent instance masks for neural radiance field (NeRF) view synthesis.<n>We validate our framework on three agricultural datasets consisting of cotton bolls, apples, and pears, and demonstrate consistent counting performance.
arXiv Detail & Related papers (2026-01-01T04:51:02Z) - Carousel: A High-Resolution Dataset for Multi-Target Automatic Image Cropping [0.16311150636417257]
We introduce a dataset of 277 relevant images and human labels.<n>We evaluate the efficacy of several single-crop models with an image partitioning algorithm as a pre-processing step.
arXiv Detail & Related papers (2025-11-06T18:59:52Z) - Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection [54.21851618853518]
We present a concise yet effective approach called Patch Generation-to-Selection to enhance CLIP's training efficiency.<n>Our approach, CLIP-PGS, sets new state-of-the-art results in zero-shot classification and retrieval tasks.
arXiv Detail & Related papers (2025-03-21T12:10:38Z) - Self-Supervised Multi-Scale Network for Blind Image Deblurring via Alternating Optimization [12.082424048578753]
We present a self-supervised multi-scale blind image deblurring method to jointly estimate the latent image and the blur kernel.
Thanks to the collaborative estimation across multiple scales, our method avoids the computationally intensive coarse-to-fine propagation and additional image deblurring processes.
arXiv Detail & Related papers (2024-09-02T07:08:17Z) - Cropper: Vision-Language Model for Image Cropping through In-Context Learning [54.34593968212081]
We propose an effective approach to leverage large vision-language models (VLMs) for image cropping.<n>First, we propose an efficient prompt retrieval mechanism for image cropping to automate the selection of in-context examples.<n>Second, we introduce an iterative refinement strategy to iteratively enhance the predicted crops.<n>The proposed framework, we refer to as Cropper, is applicable to a wide range of cropping tasks, including free-form cropping, subject-aware cropping, and aspect ratio-aware cropping.
arXiv Detail & Related papers (2024-08-14T20:03:03Z) - FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior [50.0535198082903]
We offer a novel approach to image composition, which integrates multiple input images into a single, coherent image.
We showcase the potential of utilizing the powerful generative prior inherent in large-scale pre-trained diffusion models to accomplish generic image composition.
arXiv Detail & Related papers (2024-07-06T03:35:43Z) - Real-time object detection and robotic manipulation for agriculture
using a YOLO-based learning approach [8.482182765640022]
This study presents a new framework that combines two separate architectures of convolutional neural networks (CNNs)
Crop images in a simulated environment are subjected to random rotations, cropping, brightness, and contrast adjustments to create augmented images for dataset generation.
The proposed method subsequently utilise the acquired image data via a visual geometry group model in order to reveal the grasping positions for the robotic manipulation.
arXiv Detail & Related papers (2024-01-28T22:30:50Z) - An Experience-based Direct Generation approach to Automatic Image
Cropping [0.0]
We propose a novel method to crop images directly without explicitly modeling image aesthetics.
Our model is trained on a large dataset of images cropped by experienced editors.
We show that our strategy is competitive with or performs better than existing methods in two related tasks.
arXiv Detail & Related papers (2022-12-30T06:25:27Z) - CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping [97.05377757299672]
We present a simple method, CropMix, for producing a rich input distribution from the original dataset distribution.
CropMix can be seamlessly applied to virtually any training recipe and neural network architecture performing classification tasks.
We show that CropMix is of benefit to both contrastive learning and masked image modeling towards more powerful representations.
arXiv Detail & Related papers (2022-05-31T16:57:28Z) - Repurposing Existing Deep Networks for Caption and Aesthetic-Guided
Image Cropping [33.46066328197085]
We propose a novel optimization framework that crops a given image based on user description and aesthetics.
Our framework can produce crops that are well-aligned to intended user descriptions and aesthetically pleasing.
arXiv Detail & Related papers (2022-01-07T00:23:40Z) - Deep Photo Cropper and Enhancer [65.11910918427296]
We propose a new type of image enhancement problem: to crop an image which is embedded within a photo.
We split our proposed approach into two deep networks: deep photo cropper and deep image enhancer.
In the photo cropper network, we employ a spatial transformer to extract the embedded image.
In the photo enhancer, we employ super-resolution to increase the number of pixels in the embedded image.
arXiv Detail & Related papers (2020-08-03T03:50:20Z) - Image Fine-grained Inpainting [89.17316318927621]
We present a one-stage model that utilizes dense combinations of dilated convolutions to obtain larger and more effective receptive fields.
To better train this efficient generator, except for frequently-used VGG feature matching loss, we design a novel self-guided regression loss.
We also employ a discriminator with local and global branches to ensure local-global contents consistency.
arXiv Detail & Related papers (2020-02-07T03:45:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.