dacl10k: Benchmark for Semantic Bridge Damage Segmentation
- URL: http://arxiv.org/abs/2309.00460v1
- Date: Fri, 1 Sep 2023 13:46:24 GMT
- Title: dacl10k: Benchmark for Semantic Bridge Damage Segmentation
- Authors: Johannes Flotzinger, Philipp J. R\"osch, Thomas Braml
- Abstract summary: "dacl10k" is an exceptionally diverse RCD dataset for semantic segmentation comprising 9,920 images deriving from real-world bridge inspections.
"dacl10k" distinguishes 12 damage classes as well as 6 bridge components that play a key role in the building assessment and recommending actions.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Reliably identifying reinforced concrete defects (RCDs)plays a crucial role
in assessing the structural integrity, traffic safety, and long-term durability
of concrete bridges, which represent the most common bridge type worldwide.
Nevertheless, available datasets for the recognition of RCDs are small in terms
of size and class variety, which questions their usability in real-world
scenarios and their role as a benchmark. Our contribution to this problem is
"dacl10k", an exceptionally diverse RCD dataset for multi-label semantic
segmentation comprising 9,920 images deriving from real-world bridge
inspections. dacl10k distinguishes 12 damage classes as well as 6 bridge
components that play a key role in the building assessment and recommending
actions, such as restoration works, traffic load limitations or bridge
closures. In addition, we examine baseline models for dacl10k which are
subsequently evaluated. The best model achieves a mean intersection-over-union
of 0.42 on the test set. dacl10k, along with our baselines, will be openly
accessible to researchers and practitioners, representing the currently biggest
dataset regarding number of images and class diversity for semantic
segmentation in the bridge inspection domain.
Related papers
- Law of the Weakest Link: Cross Capabilities of Large Language Models [102.91861246827797]
We show that Large Language Models (LLMs) exhibit the "Law of the Weakest Link," where cross-capability performance is significantly constrained by the weakest component.
These results highlight the under-performance of LLMs in cross-capability tasks.
arXiv Detail & Related papers (2024-09-30T05:12:01Z) - Hybrid-Segmentor: A Hybrid Approach to Automated Fine-Grained Crack Segmentation in Civil Infrastructure [52.2025114590481]
We introduce Hybrid-Segmentor, an encoder-decoder based approach that is capable of extracting both fine-grained local and global crack features.
This allows the model to improve its generalization capabilities in distinguish various type of shapes, surfaces and sizes of cracks.
The proposed model outperforms existing benchmark models across 5 quantitative metrics (accuracy 0.971, precision 0.804, recall 0.744, F1-score 0.770, and IoU score 0.630), achieving state-of-the-art status.
arXiv Detail & Related papers (2024-09-04T16:47:16Z) - SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection [79.23689506129733]
We establish a new benchmark dataset and an open-source method for large-scale SAR object detection.
Our dataset, SARDet-100K, is a result of intense surveying, collecting, and standardizing 10 existing SAR detection datasets.
To the best of our knowledge, SARDet-100K is the first COCO-level large-scale multi-class SAR object detection dataset ever created.
arXiv Detail & Related papers (2024-03-11T09:20:40Z) - dacl1k: Real-World Bridge Damage Dataset Putting Open-Source Data to the
Test [0.6827423171182154]
"dacl1k" is a multi-label RCD dataset for multi-label classification based on building inspections including 1,474 images.
We trained the models on different combinations of open-source data (meta datasets) which were subsequently evaluated both extrinsically and intrinsically.
The performance analysis on dacl1k shows practical usability of the meta data, where the best model shows an Exact Match Ratio of 32%.
arXiv Detail & Related papers (2023-09-07T15:05:35Z) - Cascade-DETR: Delving into High-Quality Universal Object Detection [99.62131881419143]
We introduce Cascade-DETR for high-quality universal object detection.
We propose the Cascade Attention layer, which explicitly integrates object-centric information into the detection decoder.
Lastly, we introduce a universal object detection benchmark, UDB10, that contains 10 datasets from diverse domains.
arXiv Detail & Related papers (2023-07-20T17:11:20Z) - A Multitask Deep Learning Model for Parsing Bridge Elements and
Segmenting Defect in Bridge Inspection Images [1.476043573732074]
The vast network of bridges in the United States raises a high requirement for its maintenance and rehabilitation.
The massive cost of visual inspection to assess the conditions of the bridges turns out to be a burden to some extent.
This paper develops a multitask deep neural network that fully utilizes such interdependence between bridge elements and defects.
arXiv Detail & Related papers (2022-09-06T02:48:15Z) - A semi-supervised self-training method to develop assistive intelligence
for segmenting multiclass bridge elements from inspection videos [6.75013674088437]
This paper develops an assistive intelligence model for segmenting multiclass bridge elements from inspection videos.
A Mask Region-based Convolutional Neural Network (Mask R-CNN) pre-trained on a large public dataset was transferred to the new task.
A semi-supervised self-training (S$3$T) method was developed to engage experienced inspectors in refining the network.
arXiv Detail & Related papers (2021-09-10T19:22:45Z) - HierMUD: Hierarchical Multi-task Unsupervised Domain Adaptation between
Bridges for Drive-by Damage Diagnosis [9.261126434781744]
We introduce a new framework that transfers the model learned from one bridge to diagnose damage in another bridge without any labels from the target bridge.
Our framework trains a hierarchical neural network model in an adversarial way to extract task-shared and task-specific features.
We evaluate our framework on experimental data collected from 2 bridges and 3 vehicles.
arXiv Detail & Related papers (2021-07-23T19:39:32Z) - Robust Object Detection via Instance-Level Temporal Cycle Confusion [89.1027433760578]
We study the effectiveness of auxiliary self-supervised tasks to improve the out-of-distribution generalization of object detectors.
Inspired by the principle of maximum entropy, we introduce a novel self-supervised task, instance-level temporal cycle confusion (CycConf)
For each object, the task is to find the most different object proposals in the adjacent frame in a video and then cycle back to itself for self-supervision.
arXiv Detail & Related papers (2021-04-16T21:35:08Z) - Knowledge transfer between bridges for drive-by monitoring using
adversarial and multi-task learning [6.462702225377603]
Monitoring bridge health using vibrations of drive-by vehicles has various benefits, such as low cost and no need for direct installation or on-site maintenance of equipment on the bridge.
Many such approaches require labeled data from every bridge, which is expensive and time-consuming, if not impossible, to obtain.
We introduce a transfer learning framework using domain-adversarial training and multi-task learning to detect, localize and quantify damage.
arXiv Detail & Related papers (2020-06-05T19:18:45Z) - DeeperForensics-1.0: A Large-Scale Dataset for Real-World Face Forgery
Detection [93.24684159708114]
DeeperForensics-1.0 is the largest face forgery detection dataset by far, with 60,000 videos constituted by a total of 17.6 million frames.
The quality of generated videos outperforms those in existing datasets, validated by user studies.
The benchmark features a hidden test set, which contains manipulated videos achieving high deceptive scores in human evaluations.
arXiv Detail & Related papers (2020-01-09T14:37:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.