Zero-Shot Anomaly Detection in Battery Thermal Images Using Visual Question Answering with Prior Knowledge
- URL: http://arxiv.org/abs/2505.16674v1
- Date: Thu, 22 May 2025 13:39:52 GMT
- Title: Zero-Shot Anomaly Detection in Battery Thermal Images Using Visual Question Answering with Prior Knowledge
- Authors: Marcella Astrid, Abdelrahman Shabayek, Djamila Aouada,
- Abstract summary: Anomaly detection in battery thermal images helps identify failures early.<n>Traditional deep learning methods require extensive labeled data.<n>We explore zero-shot anomaly detection using Visual Question Answering (VQA) models.
- Score: 11.377891847991718
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Batteries are essential for various applications, including electric vehicles and renewable energy storage, making safety and efficiency critical concerns. Anomaly detection in battery thermal images helps identify failures early, but traditional deep learning methods require extensive labeled data, which is difficult to obtain, especially for anomalies due to safety risks and high data collection costs. To overcome this, we explore zero-shot anomaly detection using Visual Question Answering (VQA) models, which leverage pretrained knowledge and textbased prompts to generalize across vision tasks. By incorporating prior knowledge of normal battery thermal behavior, we design prompts to detect anomalies without battery-specific training data. We evaluate three VQA models (ChatGPT-4o, LLaVa-13b, and BLIP-2) analyzing their robustness to prompt variations, repeated trials, and qualitative outputs. Despite the lack of finetuning on battery data, our approach demonstrates competitive performance compared to state-of-the-art models that are trained with the battery data. Our findings highlight the potential of VQA-based zero-shot learning for battery anomaly detection and suggest future directions for improving its effectiveness.
Related papers
- BatteryBERT for Realistic Battery Fault Detection Using Point-Masked Signal Modeling [1.7397173676239939]
We propose a novel framework that adapts BERT-style pretraining for battery fault detection.<n>We extend the standard BERT architecture with a customized time-series-to-token representation module and a point-level Masked Signal Modeling (point-MSM) pretraining task.<n>This approach enables self-supervised learning on sequential current, voltage and other charge-discharge cycle data.
arXiv Detail & Related papers (2025-05-31T06:06:08Z) - Deep Learning Methods for Detecting Thermal Runaway Events in Battery Production Lines [0.5530212768657544]
We investigate the use of deep learning for detecting thermal runaway in the battery production line of VDL Nedcar, a Dutch automobile manufacturer.<n>We collect data from the production line to represent both baseline (non thermal runaway) and thermal runaway conditions.<n>The data consisted of both optical and thermal images which were then preprocessed and fused before serving as input to our models.
arXiv Detail & Related papers (2025-04-11T15:35:50Z) - Survival Analysis with Machine Learning for Predicting Li-ion Battery Remaining Useful Life [4.6033243214188415]
We propose a hybrid survival analysis framework integrating survival data reconstruction, survival model learning, and survival probability estimation.<n>Our approach transforms battery voltage time series into time-to-failure data using path signatures.<n> Experiments conducted on the Toyota battery and NASA battery datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2025-03-17T02:49:34Z) - What Really Matters for Learning-based LiDAR-Camera Calibration [50.2608502974106]
This paper revisits the development of learning-based LiDAR-Camera calibration.<n>We identify the critical limitations of regression-based methods with the widely used data generation pipeline.<n>We also investigate how the input data format and preprocessing operations impact network performance.
arXiv Detail & Related papers (2025-01-28T14:12:32Z) - Understanding and Improving Training-Free AI-Generated Image Detections with Vision Foundation Models [68.90917438865078]
Deepfake techniques for facial synthesis and editing pose serious risks for generative models.<n>In this paper, we investigate how detection performance varies across model backbones, types, and datasets.<n>We introduce Contrastive Blur, which enhances performance on facial images, and MINDER, which addresses noise type bias, balancing performance across domains.
arXiv Detail & Related papers (2024-11-28T13:04:45Z) - Test-Time Adaptation for Combating Missing Modalities in Egocentric Videos [92.38662956154256]
Real-world applications often face challenges with incomplete modalities due to privacy concerns, efficiency needs, or hardware issues.<n>We propose a novel approach to address this issue at test time without requiring retraining.<n>MiDl represents the first self-supervised, online solution for handling missing modalities exclusively at test time.
arXiv Detail & Related papers (2024-04-23T16:01:33Z) - Cycle Life Prediction for Lithium-ion Batteries: Machine Learning and More [0.0]
Batteries are dynamic systems with complicated nonlinear aging.
This tutorial begins with an overview of first-principles, machine learning, and hybrid battery models.
We highlight the challenges of machine learning models, motivating the incorporation of physics in hybrid modeling approaches.
arXiv Detail & Related papers (2024-04-05T12:05:20Z) - Zero-shot Retrieval: Augmenting Pre-trained Models with Search Engines [83.65380507372483]
Large pre-trained models can dramatically reduce the amount of task-specific data required to solve a problem, but they often fail to capture domain-specific nuances out of the box.
This paper shows how to leverage recent advances in NLP and multi-modal learning to augment a pre-trained model with search engine retrieval.
arXiv Detail & Related papers (2023-11-29T05:33:28Z) - Enhanced Gaussian Process Dynamical Models with Knowledge Transfer for
Long-term Battery Degradation Forecasting [0.9208007322096533]
Predicting the end-of-life or remaining useful life of batteries in electric vehicles is a critical and challenging problem.
A number of algorithms have incorporated features that are available from data collected by battery management systems.
We develop a highly-accurate method that can overcome this limitation.
arXiv Detail & Related papers (2022-12-03T12:59:51Z) - Ranking-Based Physics-Informed Line Failure Detection in Power Grids [66.0797334582536]
Real-time and accurate detecting of potential line failures is the first step to mitigating the extreme weather impact and activating emergency controls.
Power balance equations nonlinearity, increased uncertainty in generation during extreme events, and lack of grid observability compromise the efficiency of traditional data-driven failure detection methods.
This paper proposes a Physics-InformEd Line failure Detector (FIELD) that leverages grid topology information to reduce sample and time complexities and improve localization accuracy.
arXiv Detail & Related papers (2022-08-31T18:19:25Z) - Data-driven Thermal Anomaly Detection for Batteries using Unsupervised
Shape Clustering [4.805591270997103]
We propose a data-driven method to detect battery thermal anomaly based on comparing shape-similarity between thermal measurements.
Unlike model-based or other data-driven methods, the proposed method is robust to data loss and requires minimal reference data for different pack configurations.
arXiv Detail & Related papers (2021-03-16T01:29:41Z) - Anomaly Detection in Video via Self-Supervised and Multi-Task Learning [113.81927544121625]
Anomaly detection in video is a challenging computer vision problem.
In this paper, we approach anomalous event detection in video through self-supervised and multi-task learning at the object level.
arXiv Detail & Related papers (2020-11-15T10:21:28Z) - Universal Battery Performance and Degradation Model for Electric
Aircraft [52.77024349608834]
Design, analysis, and operation of electric vertical takeoff and landing aircraft (eVTOLs) requires fast and accurate prediction of Li-ion battery performance.
We generate a battery performance and thermal behavior dataset specific to eVTOL duty cycles.
We use this dataset to develop a battery performance and degradation model (Cellfit) which employs physics-informed machine learning.
arXiv Detail & Related papers (2020-07-06T16:10:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.