ICFHR 2020 Competition on Image Retrieval for Historical Handwritten
Fragments
- URL: http://arxiv.org/abs/2010.10197v1
- Date: Tue, 20 Oct 2020 11:12:35 GMT
- Title: ICFHR 2020 Competition on Image Retrieval for Historical Handwritten
Fragments
- Authors: Mathias Seuret, Anguelos Nicolaou, Dominique Stutzmann, Andreas Maier,
Vincent Christlein
- Abstract summary: This competition succeeds upon a line of competitions for writer and style analysis of historical document images.
Although the most teams submitted methods based on convolutional neural networks, the winning entry achieves an mAP below 40%.
- Score: 11.154300222718879
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This competition succeeds upon a line of competitions for writer and style
analysis of historical document images. In particular, we investigate the
performance of large-scale retrieval of historical document fragments in terms
of style and writer identification. The analysis of historic fragments is a
difficult challenge commonly solved by trained humanists. In comparison to
previous competitions, we make the results more meaningful by addressing the
issue of sample granularity and moving from writer to page fragment retrieval.
The two approaches, style and author identification, provide information on
what kind of information each method makes better use of and indirectly
contribute to the interpretability of the participating method. Therefore, we
created a large dataset consisting of more than 120 000 fragments. Although the
most teams submitted methods based on convolutional neural networks, the
winning entry achieves an mAP below 40%.
Related papers
- Temporal Image Caption Retrieval Competition -- Description and Results [0.9999629695552195]
This paper addresses the multimodal challenge of Text-Image retrieval and introduces a novel task that extends the modalities to include temporal data.
The Temporal Image Caption Retrieval Competition (TICRC) presented in this paper is based on the Chronicling America and Challenging America projects, which offer access to an extensive collection of digitized historic American newspapers spanning 274 years.
arXiv Detail & Related papers (2024-10-08T19:45:53Z) - PHD: Pixel-Based Language Modeling of Historical Documents [55.75201940642297]
We propose a novel method for generating synthetic scans to resemble real historical documents.
We pre-train our model, PHD, on a combination of synthetic scans and real historical newspapers from the 1700-1900 period.
We successfully apply our model to a historical QA task, highlighting its usefulness in this domain.
arXiv Detail & Related papers (2023-10-22T08:45:48Z) - ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich
Document Images [198.35937007558078]
The competition opened on 30th December, 2022 and closed on 24th March, 2023.
There are 35 participants and 91 valid submissions received for Track 1, and 15 participants and 26 valid submissions received for Track 2.
According to the performance of the submissions, we believe there is still a large gap on the expected information extraction performance for complex and zero-shot scenarios.
arXiv Detail & Related papers (2023-06-05T22:20:52Z) - DAPR: A Benchmark on Document-Aware Passage Retrieval [57.45793782107218]
We propose and name this task emphDocument-Aware Passage Retrieval (DAPR)
While analyzing the errors of the State-of-The-Art (SoTA) passage retrievers, we find the major errors (53.5%) are due to missing document context.
Our created benchmark enables future research on developing and comparing retrieval systems for the new task.
arXiv Detail & Related papers (2023-05-23T10:39:57Z) - Fine-Grained Distillation for Long Document Retrieval [86.39802110609062]
Long document retrieval aims to fetch query-relevant documents from a large-scale collection.
Knowledge distillation has become de facto to improve a retriever by mimicking a heterogeneous yet powerful cross-encoder.
We propose a new learning framework, fine-grained distillation (FGD), for long-document retrievers.
arXiv Detail & Related papers (2022-12-20T17:00:36Z) - Writer Retrieval and Writer Identification in Greek Papyri [4.44566870214758]
Writer identification refers to the classification of known writers while writer retrieval seeks to find the writer by means of image similarity in a dataset of images.
While automatic writer identification/retrieval methods already provide promising results for many historical document types, papyri data is very challenging due to the fiber structures and severe artifacts.
We investigate several methods and show that a good binarization is key to an improved writer identification in papyri writings.
arXiv Detail & Related papers (2022-12-15T08:42:25Z) - Placing (Historical) Facts on a Timeline: A Classification cum Coref
Resolution Approach [4.809236881780707]
A timeline provides one of the most effective ways to visualize the important historical facts that occurred over a period of time.
We introduce a two staged system for event timeline generation from multiple (historical) text documents.
Our results can be extremely helpful for historians, in advancing research in history and in understanding the socio-political landscape of a country.
arXiv Detail & Related papers (2022-06-28T15:36:44Z) - Where Does the Performance Improvement Come From? - A Reproducibility
Concern about Image-Text Retrieval [85.03655458677295]
Image-text retrieval has gradually become a major research direction in the field of information retrieval.
We first examine the related concerns and why the focus is on image-text retrieval tasks.
We analyze various aspects of the reproduction of pretrained and nonpretrained retrieval models.
arXiv Detail & Related papers (2022-03-08T05:01:43Z) - ICDAR 2021 Competition on Components Segmentation Task of Document
Photos [63.289361617237944]
Three challenge tasks were proposed entailing different segmentation assignments to be performed on a provided dataset.
The collected data are from several types of Brazilian ID documents, whose personal information was conveniently replaced.
Different Deep Learning models were applied by the entrants with diverse strategies to achieve the best results in each of the tasks.
arXiv Detail & Related papers (2021-06-16T00:49:58Z) - Whole page recognition of historical handwriting [1.2183405753834562]
We investigate an end-to-end inference approach without text localization which takes a handwritten page and transcribes its full text.
No explicit character, word or line segmentation is involved in inference which is why we call this approach "segmentation free"
We conclude that a whole page inference approach without text localization and segmentation is competitive.
arXiv Detail & Related papers (2020-09-22T15:46:33Z) - Combining Visual and Textual Features for Semantic Segmentation of
Historical Newspapers [2.5899040911480187]
We introduce a multimodal approach for the semantic segmentation of historical newspapers.
Based on experiments on diachronic Swiss and Luxembourgish newspapers, we investigate the predictive power of visual and textual features.
Results show consistent improvement of multimodal models in comparison to a strong visual baseline.
arXiv Detail & Related papers (2020-02-14T17:56:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.