Related papers: Uint: Building Uint Detection Dataset

Uint: Building Uint Detection Dataset

URL: http://arxiv.org/abs/2508.03139v1
Date: Tue, 05 Aug 2025 06:36:41 GMT
Title: Uint: Building Uint Detection Dataset
Authors: Haozhou Zhai, Yanzhe Gao, Tianjiang Hu,
Abstract summary: Fire scene datasets are crucial for training robust computer vision models.<n>There is a significant shortage of annotated data specifically targeting building units.<n>We introduce an annotated dataset of building units captured by drones, which incorporates multiple enhancement techniques.
Score: 1.2166468091046596
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Fire scene datasets are crucial for training robust computer vision models, particularly in tasks such as fire early warning and emergency rescue operations. However, among the currently available fire-related data, there is a significant shortage of annotated data specifically targeting building units.To tackle this issue, we introduce an annotated dataset of building units captured by drones, which incorporates multiple enhancement techniques. We construct backgrounds using real multi-story scenes, combine motion blur and brightness adjustment to enhance the authenticity of the captured images, simulate drone shooting conditions under various circumstances, and employ large models to generate fire effects at different locations.The synthetic dataset generated by this method encompasses a wide range of building scenarios, with a total of 1,978 images. This dataset can effectively improve the generalization ability of fire unit detection, providing multi-scenario and scalable data while reducing the risks and costs associated with collecting real fire data. The dataset is available at https://github.com/boilermakerr/FireUnitData.

Related papers

AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis [57.249817395828174]
We propose a scalable framework combining pseudo-synthetic renderings from 3D city-wide meshes with real, ground-level crowd-sourced images.<n>The pseudo-synthetic data simulates a wide range of aerial viewpoints, while the real, crowd-sourced images help improve visual fidelity for ground-level images.<n>Using this hybrid dataset, we fine-tune several state-of-the-art algorithms and achieve significant improvements on real-world, zero-shot aerial-ground tasks.
arXiv Detail & Related papers (2025-04-17T17:57:05Z)
Combined Physics and Event Camera Simulator for Slip Detection [11.309936820480111]
This paper presents a simulation pipeline for generating slip data using the described camera-gripper configuration in a robot arm.<n>It provides the ability to alter the setup at any time, simplify the process of repetition and the generation of arbitrarily large data sets.
arXiv Detail & Related papers (2025-03-05T14:50:21Z)
FLAME Diffuser: Wildfire Image Synthesis using Mask Guided Diffusion [4.038140001938416]
We present a training-free, diffusion-based framework designed to generate realistic wildfire images with paired ground truth. Our framework uses augmented masks, sampled from real wildfire data, and applies Perlin noise to guide the generation of realistic flames. We evaluate the generated images using normalized Frechet Inception Distance, CLIP Score, and a custom CLIP Confidence metric.
arXiv Detail & Related papers (2024-03-06T04:59:38Z)
WIT-UAS: A Wildland-fire Infrared Thermal Dataset to Detect Crew Assets From Aerial Views [0.8741284539870512]
We present the Wildland-fire Infrared Thermal (WIT-UAS) dataset for long-wave infrared sensing of crew and vehicle assets amidst prescribed wildland fire environments. WIT-UAS-ROS consists of full ROS bag files containing sensor and robot data of UAS flight over the fire, and WIT-UAS-Image contains hand-labeled long-wave infrared (LWIR) images extracted from WIT-UAS-ROS. Our dataset is the first to focus on asset detection in a wildland fire environment.
arXiv Detail & Related papers (2023-12-14T17:29:26Z)
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models [61.906934570771256]
We present a generic dataset generation model that can produce diverse synthetic images and perception annotations. Our method builds upon the pre-trained diffusion model and extends text-guided image synthesis to perception data generation. We show that the rich latent code of the diffusion model can be effectively decoded as accurate perception annotations using a decoder module.
arXiv Detail & Related papers (2023-08-11T14:38:11Z)
TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual Environments [84.6017003787244]
This work proposes a synthetic data generation pipeline to address the difficulties and domain-gaps present in simulated datasets. We show that using annotations and visual cues from existing datasets, we can facilitate automated multi-modal data generation.
arXiv Detail & Related papers (2022-08-16T20:46:08Z)
Next Day Wildfire Spread: A Machine Learning Data Set to Predict Wildfire Spreading from Remote-Sensing Data [5.814925201882753]
Next Day Wildfire Spread' is a curated data set of historical wildfires aggregating nearly a decade of remote-sensing data across the United States. We implement a convolutional autoencoder that takes advantage of the spatial information of this data to predict wildfire spread. This data set can be used as a benchmark for developing wildfire propagation models based on remote sensing data for a lead time of one day.
arXiv Detail & Related papers (2021-12-04T23:28:44Z)
Data Augmentation for Object Detection via Differentiable Neural Rendering [71.00447761415388]
It is challenging to train a robust object detector when annotated data is scarce. Existing approaches to tackle this problem include semi-supervised learning that interpolates labeled data from unlabeled data. We introduce an offline data augmentation method for object detection, which semantically interpolates the training data with novel views.
arXiv Detail & Related papers (2021-03-04T06:31:06Z)
Artificial Dummies for Urban Dataset Augmentation [0.0]
Existing datasets for training pedestrian detectors in images suffer from limited appearance and pose variation. This paper describes an augmentation method for controlled synthesis of urban scenes containing people. We demonstrate that the data generated by our DummyNet improve performance of several existing person detectors across various datasets.
arXiv Detail & Related papers (2020-12-15T13:17:25Z)
The Cube++ Illumination Estimation Dataset [50.58610459038332]
A new illumination estimation dataset is proposed in this paper. It consists of 4890 images with known illumination colors as well as with additional semantic data. The dataset can be used for training and testing of methods that perform single or two-illuminant estimation.
arXiv Detail & Related papers (2020-11-19T18:50:08Z)
Uncertainty Aware Wildfire Management [6.997483623023005]
Recent wildfires in the United States have resulted in loss of life and billions of dollars. There are limited resources to be deployed over a massive area and the spread of the fire is challenging to predict. This paper proposes a decision-theoretic approach to combat wildfires.
arXiv Detail & Related papers (2020-10-15T17:47:31Z)
Scene relighting with illumination estimation in the latent space on an encoder-decoder scheme [68.8204255655161]
In this report we present methods that we tried to achieve that goal. Our models are trained on a rendered dataset of artificial locations with varied scene content, light source location and color temperature. With this dataset, we used a network with illumination estimation component aiming to infer and replace light conditions in the latent space representation of the concerned scenes.
arXiv Detail & Related papers (2020-06-03T15:25:11Z)

This list is automatically generated from the titles and abstracts of the papers in this site.