FreeCOS: Self-Supervised Learning from Fractals and Unlabeled Images for
Curvilinear Object Segmentation
- URL: http://arxiv.org/abs/2307.07245v1
- Date: Fri, 14 Jul 2023 09:38:08 GMT
- Title: FreeCOS: Self-Supervised Learning from Fractals and Unlabeled Images for
Curvilinear Object Segmentation
- Authors: Tianyi Shi, Xiaohuan Ding, Liang Zhang, Xin Yang
- Abstract summary: This paper proposes a self-supervised curvilinear object segmentation method that learns robust and distinctive features from fractals and unlabeled images.
The key contributions include a novel Fractal-FDA synthesis (FFS) module and a geometric information alignment (GIA) approach.
GIA reduces the intensity differences between the synthetic and unlabeled images by comparing the intensity order of a given pixel to the values of its nearby neighbors.
- Score: 7.078356641689271
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Curvilinear object segmentation is critical for many applications. However,
manually annotating curvilinear objects is very time-consuming and error-prone,
yielding insufficiently available annotated datasets for existing supervised
methods and domain adaptation methods. This paper proposes a self-supervised
curvilinear object segmentation method that learns robust and distinctive
features from fractals and unlabeled images (FreeCOS). The key contributions
include a novel Fractal-FDA synthesis (FFS) module and a geometric information
alignment (GIA) approach. FFS generates curvilinear structures based on the
parametric Fractal L-system and integrates the generated structures into
unlabeled images to obtain synthetic training images via Fourier Domain
Adaptation. GIA reduces the intensity differences between the synthetic and
unlabeled images by comparing the intensity order of a given pixel to the
values of its nearby neighbors. Such image alignment can explicitly remove the
dependency on absolute intensity values and enhance the inherent geometric
characteristics which are common in both synthetic and real images. In
addition, GIA aligns features of synthetic and real images via the prediction
space adaptation loss (PSAL) and the curvilinear mask contrastive loss (CMCL).
Extensive experimental results on four public datasets, i.e., XCAD, DRIVE,
STARE and CrackTree demonstrate that our method outperforms the
state-of-the-art unsupervised methods, self-supervised methods and traditional
methods by a large margin. The source code of this work is available at
https://github.com/TY-Shi/FreeCOS.
Related papers
- The Texture-Shape Dilemma: Boundary-Safe Synthetic Generation for 3D Medical Transformers [22.195374577712133]
ViTs have revolutionized medical image analysis, yet their data-hungry nature clashes with the scarcity and privacy constraints of clinical archives.<n>Formula-Driven Supervised Learning has emerged as a promising solution to this bottleneck, synthesizing infinite annotated samples from mathematical formulas without utilizing real patient data.<n>Existing F paradigms rely on simple geometric with homogeneous intensities, creating a substantial gap by neglecting tissue textures and noise patterns inherent in modalities like CT and MRI.<n>In this paper, we identify a critical conflict boundary aliasing: when high-frequency synthetic textures are naively added, they corrupt the image gradient signals necessary for learning structural boundaries, causing
arXiv Detail & Related papers (2026-03-01T08:27:45Z) - Low performing pixel correction in computed tomography with unrolled network and synthetic data training [0.16777183511743465]
Low performance pixels (LPP) in Computed Tomography (CT) detectors would lead to ring and streak artifacts in reconstructed images.<n>We propose an unrolled dual-domain method based on synthetic data to correct LPP artifacts.
arXiv Detail & Related papers (2026-01-28T19:46:30Z) - How Learnable Grids Recover Fine Detail in Low Dimensions: A Neural Tangent Kernel Analysis of Multigrid Parametric Encodings [106.3726679697804]
We compare the two most common techniques for mitigating this spectral bias: Fourier feature encodings (FFE) and multigrid parametric encodings (MPE)
MPEs are seen as the standard for low dimensional mappings, but MPEs often outperform them and learn representations with higher resolution and finer detail.
We prove that MPEs improve a network's performance through the structure of their grid and not their learnable embedding.
arXiv Detail & Related papers (2025-04-18T02:18:08Z) - Sequence Analysis Using the Bezier Curve [3.9052860539161918]
We introduce a novel approach to transform sequences into images using the B'ezier curve concept for element mapping.
Mapping the elements onto a curve enhances the sequence information representation in the respective images.
arXiv Detail & Related papers (2025-03-18T15:40:46Z) - Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets [1.0104586293349587]
Curvilinear object segmentation plays a crucial role across various applications, yet datasets in this domain often suffer from small scale.
This paper introduces a novel approach for expanding curvilinear object segmentation datasets.
Our method enriches synthetic data informativeness by generating curvilinear objects through their multiple textual features.
arXiv Detail & Related papers (2024-07-11T06:25:26Z) - Image-GS: Content-Adaptive Image Representation via 2D Gaussians [52.598772767324036]
We introduce Image-GS, a content-adaptive image representation based on 2D Gaussians radiance.<n>It supports hardware-friendly rapid access for real-time usage, requiring only 0.3K MACs to decode a pixel.<n>We demonstrate its versatility with several applications, including texture compression, semantics-aware compression, and joint image compression and restoration.
arXiv Detail & Related papers (2024-07-02T00:45:21Z) - Multi-Scale Texture Loss for CT denoising with GANs [0.9349653765341301]
Generative Adversarial Networks (GANs) have proved as a powerful framework for denoising applications in medical imaging.
This work presents a loss function that leverages the intrinsic multi-scale nature of the Gray-Level-Co-occurrence Matrix (GLCM)
Our approach also introduces a self-attention layer that dynamically aggregates the multi-scale texture information extracted from the images.
arXiv Detail & Related papers (2024-03-25T11:28:52Z) - Distance Weighted Trans Network for Image Completion [52.318730994423106]
We propose a new architecture that relies on Distance-based Weighted Transformer (DWT) to better understand the relationships between an image's components.
CNNs are used to augment the local texture information of coarse priors.
DWT blocks are used to recover certain coarse textures and coherent visual structures.
arXiv Detail & Related papers (2023-10-11T12:46:11Z) - Padding-free Convolution based on Preservation of Differential
Characteristics of Kernels [1.3597551064547502]
We present a non-padding-based method for size-keeping convolution based on the preservation of differential characteristics of kernels.
The main idea is to make convolution over an incomplete sliding window "collapse" to a linear differential operator evaluated locally at its central pixel.
arXiv Detail & Related papers (2023-09-12T16:36:12Z) - GM-NeRF: Learning Generalizable Model-based Neural Radiance Fields from
Multi-view Images [79.39247661907397]
We introduce an effective framework Generalizable Model-based Neural Radiance Fields to synthesize free-viewpoint images.
Specifically, we propose a geometry-guided attention mechanism to register the appearance code from multi-view 2D images to a geometry proxy.
arXiv Detail & Related papers (2023-03-24T03:32:02Z) - RecRecNet: Rectangling Rectified Wide-Angle Images by Thin-Plate Spline
Model and DoF-based Curriculum Learning [62.86400614141706]
We propose a new learning model, i.e., Rectangling Rectification Network (RecRecNet)
Our model can flexibly warp the source structure to the target domain and achieves an end-to-end unsupervised deformation.
Experiments show the superiority of our solution over the compared methods on both quantitative and qualitative evaluations.
arXiv Detail & Related papers (2023-01-04T15:12:57Z) - Multi-View Object Pose Refinement With Differentiable Renderer [22.040014384283378]
This paper introduces a novel multi-view 6 DoF object pose refinement approach focusing on improving methods trained on synthetic data.
It is based on the DPOD detector, which produces dense 2D-3D correspondences between the model vertices and the image pixels in each frame.
We report excellent performance in comparison to the state-of-the-art methods trained on the synthetic and real data.
arXiv Detail & Related papers (2022-07-06T17:02:22Z) - Self-supervised Geometric Perception [96.89966337518854]
Self-supervised geometric perception is a framework to learn a feature descriptor for correspondence matching without any ground-truth geometric model labels.
We show that SGP achieves state-of-the-art performance that is on-par or superior to the supervised oracles trained using ground-truth labels.
arXiv Detail & Related papers (2021-03-04T15:34:43Z) - ProAlignNet : Unsupervised Learning for Progressively Aligning Noisy
Contours [12.791313859673187]
"ProAlignNet" accounts for large scale misalignments and complex transformations between the contour shapes.
It learns by training with a novel loss function which is derived an upperbound of a proximity-sensitive and local shape-dependent similarity metric.
In two real-world applications, the proposed models consistently perform superior to state-of-the-art methods.
arXiv Detail & Related papers (2020-05-23T14:56:14Z) - Pathological Retinal Region Segmentation From OCT Images Using Geometric
Relation Based Augmentation [84.7571086566595]
We propose improvements over previous GAN-based medical image synthesis methods by jointly encoding the intrinsic relationship of geometry and shape.
The proposed method outperforms state-of-the-art segmentation methods on the public RETOUCH dataset having images captured from different acquisition procedures.
arXiv Detail & Related papers (2020-03-31T11:50:43Z) - Residual-Sparse Fuzzy $C$-Means Clustering Incorporating Morphological
Reconstruction and Wavelet frames [146.63177174491082]
Fuzzy $C$-Means (FCM) algorithm incorporates a morphological reconstruction operation and a tight wavelet frame transform.
We present an improved FCM algorithm by imposing an $ell_0$ regularization term on the residual between the feature set and its ideal value.
Experimental results reported for synthetic, medical, and color images show that the proposed algorithm is effective and efficient, and outperforms other algorithms.
arXiv Detail & Related papers (2020-02-14T10:00:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.