PixelHop++: A Small Successive-Subspace-Learning-Based (SSL-based) Model
for Image Classification
- URL: http://arxiv.org/abs/2002.03141v1
- Date: Sat, 8 Feb 2020 11:08:54 GMT
- Title: PixelHop++: A Small Successive-Subspace-Learning-Based (SSL-based) Model
for Image Classification
- Authors: Yueru Chen, Mozhdeh Rouhsedaghat, Suya You, Raghuveer Rao and C.-C.
Jay Kuo
- Abstract summary: We propose an improved PixelHop method and call it PixelHop++.
In PixelHop++, one can control the learning model size of fine-granularity,offering a flexible tradeoff between the model size and the classification performance.
- Score: 30.49387075658641
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The successive subspace learning (SSL) principle was developed and used to
design an interpretable learning model, known as the PixelHop method,for image
classification in our prior work. Here, we propose an improved PixelHop method
and call it PixelHop++. First, to make the PixelHop model size smaller, we
decouple a joint spatial-spectral input tensor to multiple spatial tensors (one
for each spectral component) under the spatial-spectral separability assumption
and perform the Saab transform in a channel-wise manner, called the
channel-wise (c/w) Saab transform.Second, by performing this operation from one
hop to another successively, we construct a channel-decomposed feature tree
whose leaf nodes contain features of one dimension (1D). Third, these 1D
features are ranked according to their cross-entropy values, which allows us to
select a subset of discriminant features for image classification. In
PixelHop++, one can control the learning model size of
fine-granularity,offering a flexible tradeoff between the model size and the
classification performance. We demonstrate the flexibility of PixelHop++ on
MNIST, Fashion MNIST, and CIFAR-10 three datasets.
Related papers
- Center-guided Classifier for Semantic Segmentation of Remote Sensing Images [2.803715177543843]
CenterSeg is a novel classifier for semantic segmentation of remote sensing images.
It solves problems with multiple prototypes, direct supervision under Grassmann manifold, and interpretability strategy.
Besides the superior performance, CenterSeg has the advantages of simplicity, lightweight, compatibility, and interpretability.
arXiv Detail & Related papers (2025-03-21T09:21:37Z) - Bayesian Active Learning for Semantic Segmentation [9.617769135242973]
We introduce a Bayesian active learning framework based on sparse pixel-level annotation.
BalEnt captures the information between the models' predicted marginalized probability distribution and the pixel labels.
We train our proposed active learning framework for Cityscapes, Camvid, ADE20K and VOC2012 benchmark datasets.
arXiv Detail & Related papers (2024-08-03T07:26:10Z) - FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models [56.71672127740099]
We focus on the task of image segmentation, which is traditionally solved by training models on closed-vocabulary datasets.
We leverage different and relatively small-sized, open-source foundation models for zero-shot open-vocabulary segmentation.
Our approach (dubbed FreeSeg-Diff), which does not rely on any training, outperforms many training-based approaches on both Pascal VOC and COCO datasets.
arXiv Detail & Related papers (2024-03-29T10:38:25Z) - Differentiable Registration of Images and LiDAR Point Clouds with
VoxelPoint-to-Pixel Matching [58.10418136917358]
Cross-modality registration between 2D images from cameras and 3D point clouds from LiDARs is a crucial task in computer vision and robotic training.
Previous methods estimate 2D-3D correspondences by matching point and pixel patterns learned by neural networks.
We learn a structured cross-modality matching solver to represent 3D features via a different latent pixel space.
arXiv Detail & Related papers (2023-12-07T05:46:10Z) - Sketch-Guided Text-to-Image Diffusion Models [57.12095262189362]
We introduce a universal approach to guide a pretrained text-to-image diffusion model.
Our method does not require to train a dedicated model or a specialized encoder for the task.
We take a particular focus on the sketch-to-image translation task, revealing a robust and expressive way to generate images.
arXiv Detail & Related papers (2022-11-24T18:45:32Z) - Distilling Ensemble of Explanations for Weakly-Supervised Pre-Training
of Image Segmentation Models [54.49581189337848]
We propose a method to enable the end-to-end pre-training for image segmentation models based on classification datasets.
The proposed method leverages a weighted segmentation learning procedure to pre-train the segmentation network en masse.
Experiment results show that, with ImageNet accompanied by PSSL as the source dataset, the proposed end-to-end pre-training strategy successfully boosts the performance of various segmentation models.
arXiv Detail & Related papers (2022-07-04T13:02:32Z) - Rethinking Semantic Segmentation: A Prototype View [126.59244185849838]
We present a nonparametric semantic segmentation model based on non-learnable prototypes.
Our framework yields compelling results over several datasets.
We expect this work will provoke a rethink of the current de facto semantic segmentation model design.
arXiv Detail & Related papers (2022-03-28T21:15:32Z) - Multi-level Second-order Few-shot Learning [111.0648869396828]
We propose a Multi-level Second-order (MlSo) few-shot learning network for supervised or unsupervised few-shot image classification and few-shot action recognition.
We leverage so-called power-normalized second-order base learner streams combined with features that express multiple levels of visual abstraction.
We demonstrate respectable results on standard datasets such as Omniglot, mini-ImageNet, tiered-ImageNet, Open MIC, fine-grained datasets such as CUB Birds, Stanford Dogs and Cars, and action recognition datasets such as HMDB51, UCF101, and mini-MIT.
arXiv Detail & Related papers (2022-01-15T19:49:00Z) - E-PixelHop: An Enhanced PixelHop Method for Object Classification [29.062926577155885]
We propose an enhanced solution for object classification, called E-PixelHop, in this work.
E-PixelHop consists of the following steps.
We conduct pixel-level classification at each hop with various receptive fields.
Forth, pixel-level decisions from each hop and from each color subspace are fused together for image-level decision.
arXiv Detail & Related papers (2021-07-07T01:22:12Z) - AnomalyHop: An SSL-based Image Anomaly Localization Method [47.56319291471639]
AnomalyHop is an image anomaly localization method based on the successive subspace learning (SSL) framework.
AnomalyHop is mathematically transparent, easy to train, and fast in its inference speed.
Its area under the ROC curve (ROC-AUC) performance on the MVTec AD dataset is 95.9%, which is among the best of several benchmarking methods.
arXiv Detail & Related papers (2021-05-08T23:17:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.