Multi-Task Classification of Sewer Pipe Defects and Properties using a
Cross-Task Graph Neural Network Decoder
- URL: http://arxiv.org/abs/2111.07846v1
- Date: Mon, 15 Nov 2021 15:36:50 GMT
- Title: Multi-Task Classification of Sewer Pipe Defects and Properties using a
Cross-Task Graph Neural Network Decoder
- Authors: Joakim Bruslund Haurum, Meysam Madadi, Sergio Escalera, Thomas B.
Moeslund
- Abstract summary: We present a novel decoder-focused multi-task classification architecture Cross-Task Graph Neural Network (CT-GNN)
CT-GNN refines the disjointed per-task predictions using cross-task information.
We achieve state-of-the-art performance on all four classification tasks in the Sewer-ML dataset.
- Score: 56.673599764041384
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The sewerage infrastructure is one of the most important and expensive
infrastructures in modern society. In order to efficiently manage the sewerage
infrastructure, automated sewer inspection has to be utilized. However, while
sewer defect classification has been investigated for decades, little attention
has been given to classifying sewer pipe properties such as water level, pipe
material, and pipe shape, which are needed to evaluate the level of sewer pipe
deterioration.
In this work we classify sewer pipe defects and properties concurrently and
present a novel decoder-focused multi-task classification architecture
Cross-Task Graph Neural Network (CT-GNN), which refines the disjointed per-task
predictions using cross-task information. The CT-GNN architecture extends the
traditional disjointed task-heads decoder, by utilizing a cross-task graph and
unique class node embeddings. The cross-task graph can either be determined a
priori based on the conditional probability between the task classes or
determined dynamically using self-attention. CT-GNN can be added to any
backbone and trained end-to-end at a small increase in the parameter count. We
achieve state-of-the-art performance on all four classification tasks in the
Sewer-ML dataset, improving defect classification and water level
classification by 5.3 and 8.0 percentage points, respectively. We also
outperform the single task methods as well as other multi-task classification
approaches while introducing 50 times fewer parameters than previous
model-focused approaches. The code and models are available at the project page
http://vap.aau.dk/ctgnn
Related papers
- Multi-label Sewer Pipe Defect Recognition with Mask Attention Feature Enhancement and Label Correlation Learning [5.9184143707401775]
Multi-label pipe defect recognition is proposed based on mask attention guided feature enhancement and label correlation learning.
The proposed method can achieve current approximate state-of-the-art classification performance using just 1/16 of the Sewer-ML training dataset.
arXiv Detail & Related papers (2024-08-01T11:51:50Z) - Dynamic Perceiver for Efficient Visual Recognition [87.08210214417309]
We propose Dynamic Perceiver (Dyn-Perceiver) to decouple the feature extraction procedure and the early classification task.
A feature branch serves to extract image features, while a classification branch processes a latent code assigned for classification tasks.
Early exits are placed exclusively within the classification branch, thus eliminating the need for linear separability in low-level features.
arXiv Detail & Related papers (2023-06-20T03:00:22Z) - Performance-aware Approximation of Global Channel Pruning for Multitask
CNNs [13.356477450355547]
Global channel pruning (GCP) aims to remove a subset of channels (filters) across different layers from a deep model without hurting the performance.
We propose a Performance-Aware Global Channel Pruning (PAGCP) framework.
Experiments on several multitask datasets show that the proposed PAGCP can reduce the FLOPs and parameters by over 60% with minor performance drop.
arXiv Detail & Related papers (2023-03-21T15:15:21Z) - FlowNAS: Neural Architecture Search for Optical Flow Estimation [65.44079917247369]
We propose a neural architecture search method named FlowNAS to automatically find the better encoder architecture for flow estimation task.
Experimental results show that the discovered architecture with the weights inherited from the super-network achieves 4.67% F1-all error on KITTI.
arXiv Detail & Related papers (2022-07-04T09:05:25Z) - DOTIN: Dropping Task-Irrelevant Nodes for GNNs [119.17997089267124]
Recent graph learning approaches have introduced the pooling strategy to reduce the size of graphs for learning.
We design a new approach called DOTIN (underlineDrunderlineopping underlineTask-underlineIrrelevant underlineNodes) to reduce the size of graphs.
Our method speeds up GAT by about 50% on graph-level tasks including graph classification and graph edit distance.
arXiv Detail & Related papers (2022-04-28T12:00:39Z) - Hierarchical Prototype Networks for Continual Graph Representation
Learning [90.78466005753505]
We present Hierarchical Prototype Networks (HPNs) which extract different levels of abstract knowledge in the form of prototypes to represent the continuously expanded graphs.
We show that HPNs not only outperform state-of-the-art baseline techniques but also consume relatively less memory.
arXiv Detail & Related papers (2021-11-30T14:15:14Z) - Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and
Benchmark [29.728476976320913]
We present a novel and publicly available multi-label classification dataset for image-based sewer defect classification called Sewer-ML.
The dataset consists of 1.3 million images annotated by professional sewer inspectors from three different utility companies over nine years.
We also present a benchmark algorithm and a novel metric for assessing performance.
arXiv Detail & Related papers (2021-03-19T16:32:37Z) - Triplet-Watershed for Hyperspectral Image Classification [7.285139308970045]
We propose a novel approach to train deep learning networks to obtain representations suitable for the watershed classifier.
We show that exploiting such characteristics allows the Triplet-Watershed to achieve state-of-art results.
Results are validated on Indianpines (IP), University of Pavia (UP), and Kennedy Space Center (KSC) datasets.
arXiv Detail & Related papers (2021-03-17T01:06:49Z) - MTP: Multi-Task Pruning for Efficient Semantic Segmentation Networks [32.84644563020912]
We present a multi-task channel pruning approach for semantic segmentation networks.
The importance of each convolution filter wrt the channel of an arbitrary layer will be simultaneously determined by the classification and segmentation tasks.
Experimental results on several benchmarks illustrate the superiority of the proposed algorithm over the state-of-the-art pruning methods.
arXiv Detail & Related papers (2020-07-16T15:03:01Z) - Pre-Trained Models for Heterogeneous Information Networks [57.78194356302626]
We propose a self-supervised pre-training and fine-tuning framework, PF-HIN, to capture the features of a heterogeneous information network.
PF-HIN consistently and significantly outperforms state-of-the-art alternatives on each of these tasks, on four datasets.
arXiv Detail & Related papers (2020-07-07T03:36:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.