Recursive Segmentation Living Image: An eXplainable AI (XAI) Approach
for Computing Structural Beauty of Images or the Livingness of Space
- URL: http://arxiv.org/abs/2310.10149v2
- Date: Tue, 7 Nov 2023 15:40:25 GMT
- Title: Recursive Segmentation Living Image: An eXplainable AI (XAI) Approach
for Computing Structural Beauty of Images or the Livingness of Space
- Authors: Yao Qianxiang and Bin Jiang
- Abstract summary: This study introduces the concept of "structural beauty" as an objective computational approach for evaluating the aesthetic appeal of images.
The application of our method to the Scenic or Not dataset, a repository of subjective scenic ratings, demonstrates a high degree of consistency with subjective ratings in the 0-6 score range.
Our method not only provides computational results but also offers transparency and interpretability, positioning it as a novel avenue in the realm of Explainable AI (XAI)
- Score: 4.959120401369489
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This study introduces the concept of "structural beauty" as an objective
computational approach for evaluating the aesthetic appeal of images. Through
the utilization of the Segment anything model (SAM), we propose a method that
leverages recursive segmentation to extract finer-grained substructures.
Additionally, by reconstructing the hierarchical structure, we obtain a more
accurate representation of substructure quantity and hierarchy. This approach
reproduces and extends our previous research, allowing for the simultaneous
assessment of Livingness in full-color images without the need for grayscale
conversion or separate computations for foreground and background Livingness.
Furthermore, the application of our method to the Scenic or Not dataset, a
repository of subjective scenic ratings, demonstrates a high degree of
consistency with subjective ratings in the 0-6 score range. This underscores
that structural beauty is not solely a subjective perception, but a
quantifiable attribute accessible through objective computation. Through our
case studies, we have arrived at three significant conclusions. 1) our method
demonstrates the capability to accurately segment meaningful objects, including
trees, buildings, and windows, as well as abstract substructures within
paintings. 2) we observed that the clarity of an image impacts our
computational results; clearer images tend to yield higher Livingness scores.
However, for equally blurry images, Livingness does not exhibit a significant
reduction, aligning with human visual perception. 3) our approach fundamentally
differs from methods employing Convolutional Neural Networks (CNNs) for
predicting image scores. Our method not only provides computational results but
also offers transparency and interpretability, positioning it as a novel avenue
in the realm of Explainable AI (XAI).
Related papers
- Patch-Based Deep Unsupervised Image Segmentation using Graph Cuts [0.0]
We propose a patch-based unsupervised image segmentation strategy that bridges advances in unsupervised feature extraction with the algorithmic help of classical graph-based methods.
We show that a simple convolutional neural network, trained to classify image patches, naturally leads to a state-of-the-art fully-convolutional unsupervised pixel-level segmenter.
arXiv Detail & Related papers (2023-11-01T19:59:25Z) - SimNP: Learning Self-Similarity Priors Between Neural Points [52.4201466988562]
SimNP is a method to learn category-level self-similarities.
We show that SimNP is able to outperform previous methods in reconstructing symmetric unseen object regions.
arXiv Detail & Related papers (2023-09-07T16:02:40Z) - Unsupervised Part Discovery from Contrastive Reconstruction [90.88501867321573]
The goal of self-supervised visual representation learning is to learn strong, transferable image representations.
We propose an unsupervised approach to object part discovery and segmentation.
Our method yields semantic parts consistent across fine-grained but visually distinct categories.
arXiv Detail & Related papers (2021-11-11T17:59:42Z) - SALYPATH: A Deep-Based Architecture for visual attention prediction [5.068678962285629]
Visual attention is useful for many computer vision applications such as image compression, recognition, and captioning.
We propose an end-to-end deep-based method, so-called SALYPATH, that efficiently predicts the scanpath of an image through features of a saliency model.
The idea is predict the scanpath by exploiting the capacity of a deep-based model to predict the saliency.
arXiv Detail & Related papers (2021-06-29T08:53:51Z) - Learned Spatial Representations for Few-shot Talking-Head Synthesis [68.3787368024951]
We propose a novel approach for few-shot talking-head synthesis.
We show that this disentangled representation leads to a significant improvement over previous methods.
arXiv Detail & Related papers (2021-04-29T17:59:42Z) - ShaRF: Shape-conditioned Radiance Fields from a Single View [54.39347002226309]
We present a method for estimating neural scenes representations of objects given only a single image.
The core of our method is the estimation of a geometric scaffold for the object.
We demonstrate in several experiments the effectiveness of our approach in both synthetic and real images.
arXiv Detail & Related papers (2021-02-17T16:40:28Z) - Shelf-Supervised Mesh Prediction in the Wild [54.01373263260449]
We propose a learning-based approach to infer 3D shape and pose of object from a single image.
We first infer a volumetric representation in a canonical frame, along with the camera pose.
The coarse volumetric prediction is then converted to a mesh-based representation, which is further refined in the predicted camera frame.
arXiv Detail & Related papers (2021-02-11T18:57:10Z) - Deep Convolutional Neural Network for Identifying Seam-Carving Forgery [10.324492319976798]
We propose a convolutional neural network (CNN)-based approach to classifying seam-carving-based image for reduction and expansion.
Our work exhibits state-of-the-art performance in terms of three-class classification (original, seam inserted, and seam removed)
arXiv Detail & Related papers (2020-07-05T17:20:51Z) - Mining Cross-Image Semantics for Weakly Supervised Semantic Segmentation [128.03739769844736]
Two neural co-attentions are incorporated into the classifier to capture cross-image semantic similarities and differences.
In addition to boosting object pattern learning, the co-attention can leverage context from other related images to improve localization map inference.
Our algorithm sets new state-of-the-arts on all these settings, demonstrating well its efficacy and generalizability.
arXiv Detail & Related papers (2020-07-03T21:53:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.