Rethinking Learnable Tree Filter for Generic Feature Transform
- URL: http://arxiv.org/abs/2012.03482v1
- Date: Mon, 7 Dec 2020 07:16:47 GMT
- Title: Rethinking Learnable Tree Filter for Generic Feature Transform
- Authors: Lin Song, Yanwei Li, Zhengkai Jiang, Zeming Li, Xiangyu Zhang, Hongbin
Sun, Jian Sun, Nanning Zheng
- Abstract summary: Learnable Tree Filter presents a remarkable approach to model structure-preserving relations for semantic segmentation.
To relax the geometric constraint, we give the analysis by reformulating it as a Markov Random Field and introduce a learnable unary term.
For semantic segmentation, we achieve leading performance (82.1% mIoU) on the Cityscapes benchmark without bells-and-whistles.
- Score: 71.77463476808585
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The Learnable Tree Filter presents a remarkable approach to model
structure-preserving relations for semantic segmentation. Nevertheless, the
intrinsic geometric constraint forces it to focus on the regions with close
spatial distance, hindering the effective long-range interactions. To relax the
geometric constraint, we give the analysis by reformulating it as a Markov
Random Field and introduce a learnable unary term. Besides, we propose a
learnable spanning tree algorithm to replace the original non-differentiable
one, which further improves the flexibility and robustness. With the above
improvements, our method can better capture long-range dependencies and
preserve structural details with linear complexity, which is extended to
several vision tasks for more generic feature transform. Extensive experiments
on object detection/instance segmentation demonstrate the consistent
improvements over the original version. For semantic segmentation, we achieve
leading performance (82.1% mIoU) on the Cityscapes benchmark without
bells-and-whistles. Code is available at
https://github.com/StevenGrove/LearnableTreeFilterV2.
Related papers
- GeoFormer: A Multi-Polygon Segmentation Transformer [10.097953939411868]
In remote sensing there exists a common need for learning scale invariant shapes of objects like buildings.
We introduce the GeoFormer, a novel architecture which presents a remedy to the said challenges, learning to generate multipolygons end-to-end.
By modeling keypoints as spatially dependent tokens in an auto-regressive manner, the GeoFormer outperforms existing works in delineating building objects from satellite imagery.
arXiv Detail & Related papers (2024-11-25T17:54:44Z) - GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning [51.677086019209554]
We propose a Generalized Structural Sparse to capture powerful relationships across modalities for pair-wise similarity learning.
The distance metric delicately encapsulates two formats of diagonal and block-diagonal terms.
Experiments on cross-modal and two extra uni-modal retrieval tasks have validated its superiority and flexibility.
arXiv Detail & Related papers (2024-10-20T03:45:50Z) - Terminating Differentiable Tree Experts [77.2443883991608]
We propose a neuro-symbolic Differentiable Tree Machine that learns tree operations using a combination of transformers and Representation Products.
We first remove a series of different transformer layers that are used in every step by introducing a mixture of experts.
We additionally propose a new termination algorithm to provide the model the power to choose how many steps to make automatically.
arXiv Detail & Related papers (2024-07-02T08:45:38Z) - GrootVL: Tree Topology is All You Need in State Space Model [66.36757400689281]
GrootVL is a versatile multimodal framework that can be applied to both visual and textual tasks.
Our method significantly outperforms existing structured state space models on image classification, object detection and segmentation.
By fine-tuning large language models, our approach achieves consistent improvements in multiple textual tasks at minor training cost.
arXiv Detail & Related papers (2024-06-04T15:09:29Z) - Hierarchical clustering with dot products recovers hidden tree structure [53.68551192799585]
In this paper we offer a new perspective on the well established agglomerative clustering algorithm, focusing on recovery of hierarchical structure.
We recommend a simple variant of the standard algorithm, in which clusters are merged by maximum average dot product and not, for example, by minimum distance or within-cluster variance.
We demonstrate that the tree output by this algorithm provides a bona fide estimate of generative hierarchical structure in data, under a generic probabilistic graphical model.
arXiv Detail & Related papers (2023-05-24T11:05:12Z) - Convex Polytope Trees [57.56078843831244]
convex polytope trees (CPT) are proposed to expand the family of decision trees by an interpretable generalization of their decision boundary.
We develop a greedy method to efficiently construct CPT and scalable end-to-end training algorithms for the tree parameters when the tree structure is given.
arXiv Detail & Related papers (2020-10-21T19:38:57Z) - From Trees to Continuous Embeddings and Back: Hyperbolic Hierarchical
Clustering [33.000371053304676]
We present the first continuous relaxation of Dasgupta's discrete optimization problem with provable quality guarantees.
We show that even approximate solutions found with gradient descent have superior quality than agglomerative clusterings.
We also highlight the flexibility of HypHC using end-to-end training in a downstream classification task.
arXiv Detail & Related papers (2020-10-01T13:43:19Z) - A Flexible Pipeline for the Optimization of CSG Trees [3.622365857213782]
CSG trees are an intuitive, yet powerful technique for the representation of geometry using a combination of Boolean set-operations and geometric primitives.
We present a systematic comparison of newly developed and existing tree optimization methods and propose a flexible processing pipeline with a focus on tree editability.
arXiv Detail & Related papers (2020-08-09T06:45:10Z) - FREEtree: A Tree-based Approach for High Dimensional Longitudinal Data
With Correlated Features [2.00191482700544]
FREEtree is a tree-based method for high dimensional longitudinal data with correlated features.
It exploits the network structure of the features by first clustering them using weighted correlation network analysis.
It then conducts a screening step within each cluster of features and a selection step among the surviving features.
arXiv Detail & Related papers (2020-06-17T07:28:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.