Related papers: DF2023: The Digital Forensics 2023 Dataset for Image Forgery Detection

DF2023: The Digital Forensics 2023 Dataset for Image Forgery Detection

URL: http://arxiv.org/abs/2503.22417v1
Date: Fri, 28 Mar 2025 13:31:19 GMT
Title: DF2023: The Digital Forensics 2023 Dataset for Image Forgery Detection
Authors: David Fischinger, Martin Boyer,
Abstract summary: The deliberate manipulation of public opinion, especially through altered images, poses a significant danger to society.<n>To fight this issue on a technical level we support the research community by releasing the Digital Forensics 2023 (DF2023) training and validation dataset.<n>This dataset enables an objective comparison of network architectures and can significantly reduce the time and effort of researchers preparing datasets.
Score: 0.4143603294943439
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The deliberate manipulation of public opinion, especially through altered images, which are frequently disseminated through online social networks, poses a significant danger to society. To fight this issue on a technical level we support the research community by releasing the Digital Forensics 2023 (DF2023) training and validation dataset, comprising one million images from four major forgery categories: splicing, copy-move, enhancement and removal. This dataset enables an objective comparison of network architectures and can significantly reduce the time and effort of researchers preparing datasets.

Related papers

Is JPEG AI going to change image forensics? [50.92778618091496]
We investigate the counter-forensic effects of the new JPEG AI standard based on neural image compression. Our results demonstrate a reduction in the performance of leading forensic detectors when analyzing content processed through JPEG AI.
arXiv Detail & Related papers (2024-12-04T12:07:20Z)
Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors [62.63467652611788]
We introduce SEMI-TRUTHS, featuring 27,600 real images, 223,400 masks, and 1,472,700 AI-augmented images. Each augmented image is accompanied by metadata for standardized and targeted evaluation of detector robustness. Our findings suggest that state-of-the-art detectors exhibit varying sensitivities to the types and degrees of perturbations, data distributions, and augmentation methods used.
arXiv Detail & Related papers (2024-11-12T01:17:27Z)
Blind Data Adaptation to tackle Covariate Shift in Operational Steganalysis [9.565324766070407]
Image Steganography allows individuals to hide illegal information in digital images without arousing suspicions. It is crucial to develop effective steganalysis methods enabling to detect manipulated images for clandestine communications. We develop TADA, a novel methodology enabling to emulate sources aligned with specific targets in steganalysis.
arXiv Detail & Related papers (2024-05-27T08:55:22Z)
Getting it Right: Improving Spatial Consistency in Text-to-Image Models [103.52640413616436]
One of the key shortcomings in current text-to-image (T2I) models is their inability to consistently generate images which faithfully follow the spatial relationships specified in the text prompt. We create SPRIGHT, the first spatially focused, large-scale dataset, by re-captioning 6 million images from 4 widely used vision datasets. We find that training on images containing a larger number of objects leads to substantial improvements in spatial consistency, including state-of-the-art results on T2I-CompBench with a spatial score of 0.2133, by fine-tuning on 500 images.
arXiv Detail & Related papers (2024-04-01T15:55:25Z)
An Innovative Tool for Uploading/Scraping Large Image Datasets on Social Networks [9.27070946719462]
We propose an automated approach by means of a digital tool that we created on purpose. The tool is capable of automatically uploading an entire image dataset to the desired digital platform and then downloading all the uploaded pictures.
arXiv Detail & Related papers (2023-11-01T23:27:37Z)
Wild Face Anti-Spoofing Challenge 2023: Benchmark and Results [73.98594459933008]
Face anti-spoofing (FAS) is an essential mechanism for safeguarding the integrity of automated face recognition systems. This limitation can be attributed to the scarcity and lack of diversity in publicly available FAS datasets. We introduce the Wild Face Anti-Spoofing dataset, a large-scale, diverse FAS dataset collected in unconstrained settings.
arXiv Detail & Related papers (2023-04-12T10:29:42Z)
Urban feature analysis from aerial remote sensing imagery using self-supervised and semi-supervised computer vision [8.124947412639704]
Analysis of overhead imagery using computer vision is a problem that has received considerable attention in academic literature. These problems are addressed here through the development of a more generic framework, incorporating advances in representation learning. The successful low-level detection of urban infrastructure evolution over a 10-year period from 60 million unlabeled images, exemplifies the substantial potential of our approach to advance quantitative urban research.
arXiv Detail & Related papers (2022-08-17T03:41:56Z)
Benchmarking Scientific Image Forgery Detectors [18.225190509954874]
This paper presents an extendable open-source library that reproduces the most common image forgery operations reported by the research integrity community. We create a large scientific forgery image benchmark (39,423 images) with an enriched ground-truth. In addition, concerned about the high number of retracted papers due to image duplication, this work evaluates the state-of-the-art copy-move detection methods in the proposed dataset.
arXiv Detail & Related papers (2021-05-26T22:58:20Z)
DeeperForensics Challenge 2020 on Real-World Face Forgery Detection: Methods and Results [144.5252578415748]
This paper reports methods and results in the DeeperForensics Challenge 2020 on real-world face forgery detection. The challenge employs the DeeperForensics-1.0 dataset, with 60,000 videos constituted by a total of 17.6 million frames. A total of 115 participants registered for the competition, and 25 teams made valid submissions.
arXiv Detail & Related papers (2021-02-18T16:48:57Z)
Improving Object Detection with Selective Self-supervised Self-training [62.792445237541145]
We study how to leverage Web images to augment human-curated object detection datasets. We retrieve Web images by image-to-image search, which incurs less domain shift from the curated data than other search methods. We propose a novel learning method motivated by two parallel lines of work that explore unlabeled data for image classification.
arXiv Detail & Related papers (2020-07-17T18:05:01Z)
Syn2Real: Forgery Classification via Unsupervised Domain Adaptation [1.8229783460536682]
We propose to create a synthetic forged dataset using deep semantic image inpainting and copy-move forgery algorithm. We use unsupervised domain adaptation networks to detect copy-move forgery in new domains by mapping the feature space from our synthetically generated dataset.
arXiv Detail & Related papers (2020-02-03T15:02:58Z)

This list is automatically generated from the titles and abstracts of the papers in this site.