Abstract Flow for Temporal Semantic Segmentation on the Permutohedral
  Lattice
        - URL: http://arxiv.org/abs/2203.15469v1
- Date: Tue, 29 Mar 2022 12:14:31 GMT
- Title: Abstract Flow for Temporal Semantic Segmentation on the Permutohedral
  Lattice
- Authors: Peer Sch\"utt, Radu Alexandru Rosu and Sven Behnke
- Abstract summary: We extend a backbone LatticeNet to process temporal point cloud data.
We propose a new module called Abstract Flow which allows the network to match parts of the scene with similar abstract features.
We obtain state-of-the-art results on the Semantic KITTI dataset that contains LiDAR scans from real urban environments.
- Score: 27.37701107719647
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   Semantic segmentation is a core ability required by autonomous agents, as
being able to distinguish which parts of the scene belong to which object class
is crucial for navigation and interaction with the environment. Approaches
which use only one time-step of data cannot distinguish between moving objects
nor can they benefit from temporal integration. In this work, we extend a
backbone LatticeNet to process temporal point cloud data. Additionally, we take
inspiration from optical flow methods and propose a new module called Abstract
Flow which allows the network to match parts of the scene with similar abstract
features and gather the information temporally. We obtain state-of-the-art
results on the SemanticKITTI dataset that contains LiDAR scans from real urban
environments. We share the PyTorch implementation of TemporalLatticeNet at
https://github.com/AIS-Bonn/temporal_latticenet .
 
      
        Related papers
        - 4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic   Segmentation [21.300636683882338]
 We propose a new method to generate cluster labels that reflect the complete spatial structure and temporal information of objects.
We achieve state-of-the-art results on the multi-scan semantic and moving object segmentation on Semantic KITTI and nuScenes datasets.
 arXiv  Detail & Related papers  (2025-01-06T11:23:13Z)
- Learning Spatial-Semantic Features for Robust Video Object Segmentation [108.045326229865]
 We propose a robust video object segmentation framework that learns spatial-semantic features and discriminative object queries.
The proposed method achieves state-of-the-art performance on benchmark data sets, including the DAVIS 2017 test (textbf87.8%), YoutubeVOS 2019 (textbf88.1%), MOSE val (textbf74.0%), and LVOS test (textbf73.0%)
 arXiv  Detail & Related papers  (2024-07-10T15:36:00Z)
- STARFlow: Spatial Temporal Feature Re-embedding with Attentive Learning   for Real-world Scene Flow [5.476991379461233]
 We propose global attentive flow embedding to match all-to-all point pairs in both Euclidean space.
We leverage novel domain adaptive losses to bridge the gap of motion inference from synthetic to real-world.
Our approach achieves state-of-the-art performance across various datasets, with particularly outstanding results on real-world LiDAR-scanned datasets.
 arXiv  Detail & Related papers  (2024-03-11T04:56:10Z)
- Refining Segmentation On-the-Fly: An Interactive Framework for Point
  Cloud Semantic Segmentation [9.832150567595718]
 We present the first interactive framework for point cloud semantic segmentation, named InterPCSeg.
We develop an interaction simulation scheme tailored for the interactive point cloud semantic segmentation task.
We evaluate our framework on the S3DIS and ScanNet datasets with off-the-shelf segmentation networks.
 arXiv  Detail & Related papers  (2024-03-11T03:24:58Z)
- Semantics Meets Temporal Correspondence: Self-supervised Object-centric   Learning in Videos [63.94040814459116]
 Self-supervised methods have shown remarkable progress in learning high-level semantics and low-level temporal correspondence.
We propose a novel semantic-aware masked slot attention on top of the fused semantic features and correspondence maps.
We adopt semantic- and instance-level temporal consistency as self-supervision to encourage temporally coherent object-centric representations.
 arXiv  Detail & Related papers  (2023-08-19T09:12:13Z)
- Event-Free Moving Object Segmentation from Moving Ego Vehicle [88.33470650615162]
 Moving object segmentation (MOS) in dynamic scenes is an important, challenging, but under-explored research topic for autonomous driving.
Most segmentation methods leverage motion cues obtained from optical flow maps.
We propose to exploit event cameras for better video understanding, which provide rich motion cues without relying on optical flow.
 arXiv  Detail & Related papers  (2023-04-28T23:43:10Z)
- DS-Net: Dynamic Spatiotemporal Network for Video Salient Object
  Detection [78.04869214450963]
 We propose a novel dynamic temporal-temporal network (DSNet) for more effective fusion of temporal and spatial information.
We show that the proposed method achieves superior performance than state-of-the-art algorithms.
 arXiv  Detail & Related papers  (2020-12-09T06:42:30Z)
- IAUnet: Global Context-Aware Feature Learning for Person
  Re-Identification [106.50534744965955]
 IAU block enables the feature to incorporate the globally spatial, temporal, and channel context.
It is lightweight, end-to-end trainable, and can be easily plugged into existing CNNs to form IAUnet.
Experiments show that IAUnet performs favorably against state-of-the-art on both image and video reID tasks.
 arXiv  Detail & Related papers  (2020-09-02T13:07:10Z)
- ASAP-Net: Attention and Structure Aware Point Cloud Sequence
  Segmentation [49.15948235059343]
 We further improve point-temporal cloud feature with a flexible module called ASAP.
Our ASAP module contains an attentive temporal embedding layer to fuse the relatively informative local features across frames in a recurrent fashion.
We show the generalization ability of the proposed ASAP module with different computation backbone networks for point cloud sequence segmentation.
 arXiv  Detail & Related papers  (2020-08-12T07:37:16Z)
- Fast Video Object Segmentation With Temporal Aggregation Network and
  Dynamic Template Matching [67.02962970820505]
 We introduce "tracking-by-detection" into Video Object (VOS)
We propose a new temporal aggregation network and a novel dynamic time-evolving template matching mechanism to achieve significantly improved performance.
We achieve new state-of-the-art performance on the DAVIS benchmark without complicated bells and whistles in both speed and accuracy, with a speed of 0.14 second per frame and J&F measure of 75.9% respectively.
 arXiv  Detail & Related papers  (2020-07-11T05:44:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.