Deep Aramaic: Towards a Synthetic Data Paradigm Enabling Machine
Learning in Epigraphy
- URL: http://arxiv.org/abs/2310.07310v1
- Date: Wed, 11 Oct 2023 08:47:29 GMT
- Title: Deep Aramaic: Towards a Synthetic Data Paradigm Enabling Machine
Learning in Epigraphy
- Authors: Andrei C. Aioanei, Regine Hunziker-Rodewald, Konstantin Klein, Dominik
L. Michels
- Abstract summary: Our research pioneers an innovative methodology for generating synthetic training data tailored to Old Aramaic letters.
Our pipeline synthesizes photo-realistic Aramaic letter inscription, incorporating textural features, lighting, damage, and augmentations.
This comprehensive corpus provides a robust volume of data for training a residual neural network (ResNet) to classify highly degraded Aramaic letters.
- Score: 6.281814525187968
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Epigraphy increasingly turns to modern artificial intelligence (AI)
technologies such as machine learning (ML) for extracting insights from ancient
inscriptions. However, scarce labeled data for training ML algorithms severely
limits current techniques, especially for ancient scripts like Old Aramaic. Our
research pioneers an innovative methodology for generating synthetic training
data tailored to Old Aramaic letters. Our pipeline synthesizes photo-realistic
Aramaic letter datasets, incorporating textural features, lighting, damage, and
augmentations to mimic real-world inscription diversity. Despite minimal real
examples, we engineer a dataset of 250,000 training and 25,000 validation
images covering the 22 letter classes in the Aramaic alphabet. This
comprehensive corpus provides a robust volume of data for training a residual
neural network (ResNet) to classify highly degraded Aramaic letters. The ResNet
model demonstrates high accuracy in classifying real images from the 8th
century BCE Hadad statue inscription. Additional experiments validate
performance on varying materials and styles, proving effective generalization.
Our results validate the model's capabilities in handling diverse real-world
scenarios, proving the viability of our synthetic data approach and avoiding
the dependence on scarce training data that has constrained epigraphic
analysis. Our innovative framework elevates interpretation accuracy on damaged
inscriptions, thus enhancing knowledge extraction from these historical
resources.
Related papers
- Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing [71.29488677105127]
Existing scene text recognition (STR) methods struggle to recognize challenging texts, especially for artistic and severely distorted characters.
We propose a contrastive learning-based STR framework by leveraging synthetic and real unlabeled data without any human cost.
Our method achieves SOTA performance (94.7% and 70.9% average accuracy on common benchmarks and Union14M-Benchmark.
arXiv Detail & Related papers (2024-11-23T15:24:47Z) - Improving Text Generation on Images with Synthetic Captions [2.1175632266708733]
latent diffusion models such as SDXL and SD 1.5 have shown significant capability in generating realistic images.
We propose a low-cost approach by leveraging SDXL without any time-consuming training on large-scale datasets.
Our results demonstrate how our small scale fine-tuning approach can improve the accuracy of text generation in different scenarios.
arXiv Detail & Related papers (2024-06-01T17:27:34Z) - StyloAI: Distinguishing AI-Generated Content with Stylometric Analysis [0.0]
This study proposes StyloAI, a data-driven model that uses 31 stylometric features to identify AI-generated texts.
StyloAI achieves accuracy rates of 81% and 98% on the test set of the AuTextification dataset and the Education dataset, respectively.
arXiv Detail & Related papers (2024-05-16T14:28:01Z) - AI-Generated Images as Data Source: The Dawn of Synthetic Era [61.879821573066216]
generative AI has unlocked the potential to create synthetic images that closely resemble real-world photographs.
This paper explores the innovative concept of harnessing these AI-generated images as new data sources.
In contrast to real data, AI-generated data exhibit remarkable advantages, including unmatched abundance and scalability.
arXiv Detail & Related papers (2023-10-03T06:55:19Z) - Image Captions are Natural Prompts for Text-to-Image Models [70.30915140413383]
We analyze the relationship between the training effect of synthetic data and the synthetic data distribution induced by prompts.
We propose a simple yet effective method that prompts text-to-image generative models to synthesize more informative and diverse training data.
Our method significantly improves the performance of models trained on synthetic training data.
arXiv Detail & Related papers (2023-07-17T14:38:11Z) - Harnessing the Power of Text-image Contrastive Models for Automatic
Detection of Online Misinformation [50.46219766161111]
We develop a self-learning model to explore the constrastive learning in the domain of misinformation identification.
Our model shows the superior performance of non-matched image-text pair detection when the training data is insufficient.
arXiv Detail & Related papers (2023-04-19T02:53:59Z) - AGTGAN: Unpaired Image Translation for Photographic Ancient Character
Generation [27.77329906930072]
We propose an unsupervised generative adversarial network called AGTGAN.
By explicit global and local glyph shape style modeling, our method can generate characters with diverse glyphs and realistic textures.
With our generated images, experiments on the largest photographic oracle bone character dataset show that our method can achieve a significant increase in classification accuracy, up to 16.34%.
arXiv Detail & Related papers (2023-03-13T11:18:41Z) - Is synthetic data from generative models ready for image recognition? [69.42645602062024]
We study whether and how synthetic images generated from state-of-the-art text-to-image generation models can be used for image recognition tasks.
We showcase the powerfulness and shortcomings of synthetic data from existing generative models, and propose strategies for better applying synthetic data for recognition tasks.
arXiv Detail & Related papers (2022-10-14T06:54:24Z) - Lexically Aware Semi-Supervised Learning for OCR Post-Correction [90.54336622024299]
Much of the existing linguistic data in many languages of the world is locked away in non-digitized books and documents.
Previous work has demonstrated the utility of neural post-correction methods on recognition of less-well-resourced languages.
We present a semi-supervised learning method that makes it possible to utilize raw images to improve performance.
arXiv Detail & Related papers (2021-11-04T04:39:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.