A Study on Evaluation Standard for Automatic Crack Detection Regard the
Random Fractal
- URL: http://arxiv.org/abs/2007.12082v1
- Date: Thu, 23 Jul 2020 15:46:29 GMT
- Title: A Study on Evaluation Standard for Automatic Crack Detection Regard the
Random Fractal
- Authors: Hongyu Li, Jihe Wang, Yu Zhang, Zirui Wang, and Tiejun Wang
- Abstract summary: We find that automatic crack detectors based on deep learning are obviously underestimated by the widely used mean Average Precision (mAP) standard.
As a solution, a fractal-available evaluation standard named CovEval is proposed to correct the underestimation in crack detection.
In experiments using several common frameworks for object detection, models get much higher scores in crack detection according to CovEval.
- Score: 15.811209242988257
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A reasonable evaluation standard underlies construction of effective deep
learning models. However, we find in experiments that the automatic crack
detectors based on deep learning are obviously underestimated by the widely
used mean Average Precision (mAP) standard. This paper presents a study on the
evaluation standard. It is clarified that the random fractal of crack disables
the mAP standard, because the strict box matching in mAP calculation is
unreasonable for the fractal feature. As a solution, a fractal-available
evaluation standard named CovEval is proposed to correct the underestimation in
crack detection. In CovEval, a different matching process based on the idea of
covering box matching is adopted for this issue. In detail, Cover Area rate
(CAr) is designed as a covering overlap, and a multi-match strategy is employed
to release the one-to-one matching restriction in mAP. Extended Recall (XR),
Extended Precision (XP) and Extended F-score (Fext) are defined for scoring the
crack detectors. In experiments using several common frameworks for object
detection, models get much higher scores in crack detection according to
CovEval, which matches better with the visual performance. Moreover, based on
faster R-CNN framework, we present a case study to optimize a crack detector
based on CovEval standard. Recall (XR) of our best model achieves an
industrial-level at 95.8, which implies that with reasonable standard for
evaluation, the methods for object detection are with great potential for
automatic industrial inspection.
Related papers
- AdvQDet: Detecting Query-Based Adversarial Attacks with Adversarial Contrastive Prompt Tuning [93.77763753231338]
Adversarial Contrastive Prompt Tuning (ACPT) is proposed to fine-tune the CLIP image encoder to extract similar embeddings for any two intermediate adversarial queries.
We show that ACPT can detect 7 state-of-the-art query-based attacks with $>99%$ detection rate within 5 shots.
We also show that ACPT is robust to 3 types of adaptive attacks.
arXiv Detail & Related papers (2024-08-04T09:53:50Z) - Rank-DETR for High Quality Object Detection [52.82810762221516]
A highly performant object detector requires accurate ranking for the bounding box predictions.
In this work, we introduce a simple and highly performant DETR-based object detector by proposing a series of rank-oriented designs.
arXiv Detail & Related papers (2023-10-13T04:48:32Z) - Parametric Classification for Generalized Category Discovery: A Baseline
Study [70.73212959385387]
Generalized Category Discovery (GCD) aims to discover novel categories in unlabelled datasets using knowledge learned from labelled samples.
We investigate the failure of parametric classifiers, verify the effectiveness of previous design choices when high-quality supervision is available, and identify unreliable pseudo-labels as a key problem.
We propose a simple yet effective parametric classification method that benefits from entropy regularisation, achieves state-of-the-art performance on multiple GCD benchmarks and shows strong robustness to unknown class numbers.
arXiv Detail & Related papers (2022-11-21T18:47:11Z) - CFARnet: deep learning for target detection with constant false alarm
rate [2.2940141855172036]
We introduce a framework of CFAR constrained detectors.
Practically, we develop a deep learning framework for fitting neural networks that approximate it.
Experiments of target detection in different setting demonstrate that the proposed CFARnet allows a flexible tradeoff between CFAR and accuracy.
arXiv Detail & Related papers (2022-08-04T05:54:36Z) - Oversampling Divide-and-conquer for Response-skewed Kernel Ridge
Regression [20.00435452480056]
We develop a novel response-adaptive partition strategy to overcome the limitation of the divide-and-conquer method.
We show the proposed estimate has a smaller mean squared error (AMSE) than that of the classical dacKRR estimate under mild conditions.
arXiv Detail & Related papers (2021-07-13T04:01:04Z) - Risk Minimization from Adaptively Collected Data: Guarantees for
Supervised and Policy Learning [57.88785630755165]
Empirical risk minimization (ERM) is the workhorse of machine learning, but its model-agnostic guarantees can fail when we use adaptively collected data.
We study a generic importance sampling weighted ERM algorithm for using adaptively collected data to minimize the average of a loss function over a hypothesis class.
For policy learning, we provide rate-optimal regret guarantees that close an open gap in the existing literature whenever exploration decays to zero.
arXiv Detail & Related papers (2021-06-03T09:50:13Z) - On Provable Backdoor Defense in Collaborative Learning [35.22450536986004]
Malicious users can upload data to prevent the model's convergence or inject hidden backdoors.
Backdoor attacks are especially difficult to detect since the model behaves normally on standard test data but gives wrong outputs when triggered by certain backdoor keys.
We propose a novel framework that generalizes existing subset aggregation methods.
arXiv Detail & Related papers (2021-01-19T14:39:32Z) - FADER: Fast Adversarial Example Rejection [19.305796826768425]
Recent defenses have been shown to improve adversarial robustness by detecting anomalous deviations from legitimate training samples at different layer representations.
We introduce FADER, a novel technique for speeding up detection-based methods.
Our experiments outline up to 73x prototypes reduction compared to analyzed detectors for MNIST dataset and up to 50x for CIFAR10 respectively.
arXiv Detail & Related papers (2020-10-18T22:00:11Z) - Lower bounds in multiple testing: A framework based on derandomized
proxies [107.69746750639584]
This paper introduces an analysis strategy based on derandomization, illustrated by applications to various concrete models.
We provide numerical simulations of some of these lower bounds, and show a close relation to the actual performance of the Benjamini-Hochberg (BH) algorithm.
arXiv Detail & Related papers (2020-05-07T19:59:51Z) - Pre-training Is (Almost) All You Need: An Application to Commonsense
Reasoning [61.32992639292889]
Fine-tuning of pre-trained transformer models has become the standard approach for solving common NLP tasks.
We introduce a new scoring method that casts a plausibility ranking task in a full-text format.
We show that our method provides a much more stable training phase across random restarts.
arXiv Detail & Related papers (2020-04-29T10:54:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.