Learning from scarce information: using synthetic data to classify Roman
fine ware pottery
- URL: http://arxiv.org/abs/2107.01401v1
- Date: Sat, 3 Jul 2021 10:30:46 GMT
- Title: Learning from scarce information: using synthetic data to classify Roman
fine ware pottery
- Authors: Santos J. N\'u\~nez Jare\~no, Dani\"el P. van Helden, Evgeny M.
Mirkes, Ivan Y. Tyukin, Penelope M. Allison
- Abstract summary: We propose to use a transfer learning approach whereby the model is first trained on a synthetic dataset replicating features of the original objects.
Taking the replicated features from published profile drawings of pottery forms allowed the integration of expert knowledge into the process.
After this first initial training the model was fine-tuned with data from photographs of real vessels.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this article we consider a version of the challenging problem of learning
from datasets whose size is too limited to allow generalisation beyond the
training set. To address the challenge we propose to use a transfer learning
approach whereby the model is first trained on a synthetic dataset replicating
features of the original objects. In this study the objects were smartphone
photographs of near-complete Roman terra sigillata pottery vessels from the
collection of the Museum of London. Taking the replicated features from
published profile drawings of pottery forms allowed the integration of expert
knowledge into the process through our synthetic data generator. After this
first initial training the model was fine-tuned with data from photographs of
real vessels. We show, through exhaustive experiments across several popular
deep learning architectures, different test priors, and considering the impact
of the photograph viewpoint and excessive damage to the vessels, that the
proposed hybrid approach enables the creation of classifiers with appropriate
generalisation performance. This performance is significantly better than that
of classifiers trained exclusively on the original data which shows the promise
of the approach to alleviate the fundamental issue of learning from small
datasets.
Related papers
- Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets [4.696575161583618]
This study focuses on camera-based traffic sign recognition applications for advanced driver assistance systems and autonomous driving.
The proposed augmentation pipeline of synthetic datasets includes novel augmentation processes such as structured shadows and gaussian specular highlights.
Experiments showed that a synthetic image-based approach outperforms in most cases real image-based training when applied to cross-domain test datasets.
arXiv Detail & Related papers (2024-10-30T07:11:41Z) - Zero-Shot Object-Centric Representation Learning [72.43369950684057]
We study current object-centric methods through the lens of zero-shot generalization.
We introduce a benchmark comprising eight different synthetic and real-world datasets.
We find that training on diverse real-world images improves transferability to unseen scenarios.
arXiv Detail & Related papers (2024-08-17T10:37:07Z) - Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities [88.398085358514]
Contrastive Deepfake Embeddings (CoDE) is a novel embedding space specifically designed for deepfake detection.
CoDE is trained via contrastive learning by additionally enforcing global-local similarities.
arXiv Detail & Related papers (2024-07-29T18:00:10Z) - Reconstructing Training Data From Real World Models Trained with Transfer Learning [29.028185455223785]
We present a novel approach enabling data reconstruction in realistic settings for models trained on high-resolution images.
Our method adapts the reconstruction scheme of arXiv:2206.07758 to real-world scenarios.
We introduce a novel clustering-based method to identify good reconstructions from thousands of candidates.
arXiv Detail & Related papers (2024-07-22T17:59:10Z) - Reinforcing Pre-trained Models Using Counterfactual Images [54.26310919385808]
This paper proposes a novel framework to reinforce classification models using language-guided generated counterfactual images.
We identify model weaknesses by testing the model using the counterfactual image dataset.
We employ the counterfactual images as an augmented dataset to fine-tune and reinforce the classification model.
arXiv Detail & Related papers (2024-06-19T08:07:14Z) - Fantastic Gains and Where to Find Them: On the Existence and Prospect of
General Knowledge Transfer between Any Pretrained Model [74.62272538148245]
We show that for arbitrary pairings of pretrained models, one model extracts significant data context unavailable in the other.
We investigate if it is possible to transfer such "complementary" knowledge from one model to another without performance degradation.
arXiv Detail & Related papers (2023-10-26T17:59:46Z) - UMat: Uncertainty-Aware Single Image High Resolution Material Capture [2.416160525187799]
We propose a learning-based method to recover normals, specularity, and roughness from a single diffuse image of a material.
Our method is the first one to deal with the problem of modeling uncertainty in material digitization.
arXiv Detail & Related papers (2023-05-25T17:59:04Z) - Generalizing Dataset Distillation via Deep Generative Prior [75.9031209877651]
We propose to distill an entire dataset's knowledge into a few synthetic images.
The idea is to synthesize a small number of synthetic data points that, when given to a learning algorithm as training data, result in a model approximating one trained on the original data.
We present a new optimization algorithm that distills a large number of images into a few intermediate feature vectors in the generative model's latent space.
arXiv Detail & Related papers (2023-05-02T17:59:31Z) - TexPose: Neural Texture Learning for Self-Supervised 6D Object Pose
Estimation [55.94900327396771]
We introduce neural texture learning for 6D object pose estimation from synthetic data.
We learn to predict realistic texture of objects from real image collections.
We learn pose estimation from pixel-perfect synthetic data.
arXiv Detail & Related papers (2022-12-25T13:36:32Z) - TartanVO: A Generalizable Learning-based VO [9.12375405509935]
We present the first learning-based visual odometry (VO) model, which generalizes to multiple datasets and real-world scenarios.
We achieve this by leveraging the SLAM dataset TartanAir, which provides a large amount of diverse synthetic data in challenging environments.
arXiv Detail & Related papers (2020-10-31T20:49:33Z) - Improving Learning Effectiveness For Object Detection and Classification
in Cluttered Backgrounds [6.729108277517129]
This paper develops a framework that permits to autonomously generate a training dataset in heterogeneous cluttered backgrounds.
It is clear that the learning effectiveness of the proposed framework should be improved in complex and heterogeneous environments.
The performance of the proposed framework is investigated through empirical tests and compared with that of the model trained with the COCO dataset.
arXiv Detail & Related papers (2020-02-27T22:28:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.