ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction
- URL: http://arxiv.org/abs/2103.10213v1
- Date: Thu, 18 Mar 2021 12:33:41 GMT
- Title: ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction
- Authors: Zheng Huang, Kai Chen, Jianhua He, Xiang Bai, Dimosthenis Karatzas,
Shjian Lu, and C.V. Jawahar
- Abstract summary: In recognition of the technical challenges, importance and huge commercial potentials of SROIE, we organized the ICDAR 2019 competition on SROIE.
A new dataset with 1000 whole scanned receipt images and annotations is created for the competition.
In this report we will presents the motivation, competition datasets, task definition, evaluation protocol, submission statistics, performance of submitted methods and results analysis.
- Score: 70.71240097723745
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Scanned receipts OCR and key information extraction (SROIE) represent the
processeses of recognizing text from scanned receipts and extracting key texts
from them and save the extracted tests to structured documents. SROIE plays
critical roles for many document analysis applications and holds great
commercial potentials, but very little research works and advances have been
published in this area. In recognition of the technical challenges, importance
and huge commercial potentials of SROIE, we organized the ICDAR 2019
competition on SROIE. In this competition, we set up three tasks, namely,
Scanned Receipt Text Localisation (Task 1), Scanned Receipt OCR (Task 2) and
Key Information Extraction from Scanned Receipts (Task 3). A new dataset with
1000 whole scanned receipt images and annotations is created for the
competition. In this report we will presents the motivation, competition
datasets, task definition, evaluation protocol, submission statistics,
performance of submitted methods and results analysis.
Related papers
- ResearchArena: Benchmarking LLMs' Ability to Collect and Organize Information as Research Agents [21.17856299966841]
Large language models (LLMs) have exhibited remarkable performance across various tasks in natural language processing.
We develop ResearchArena, a benchmark that measures LLM agents' ability to conduct academic surveys.
arXiv Detail & Related papers (2024-06-13T03:26:30Z) - CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset [12.828786692835369]
This paper introduces the Comprehensive Post-OCR Parsing and Receipt Understanding dataset (CORU)
CORU consists of over 20,000 annotated receipts from diverse retail settings, including supermarkets and clothing stores.
We establish the baseline performance for a range of models on CORU to evaluate the effectiveness of traditional methods.
arXiv Detail & Related papers (2024-06-06T20:38:15Z) - RealKIE: Five Novel Datasets for Enterprise Key Information Extraction [0.0]
RealKIE is a benchmark of five challenging datasets aimed at advancing key information extraction methods.
The datasets include a diverse range of documents including SEC S1 Filings, US Non-disclosure Agreements, UK Charity Reports, FCC Invoices, and Resource Contracts.
arXiv Detail & Related papers (2024-03-29T10:31:32Z) - Advancements and Challenges in Arabic Optical Character Recognition: A
Comprehensive Survey [0.6629765271909505]
This paper seeks to offer an exhaustive review of contemporary applications, methodologies, and challenges associated with Arabic Optical Character Recognition (OCR)
A thorough analysis is conducted on prevailing techniques utilized throughout the OCR process, with a dedicated effort to discern the most efficacious approaches that demonstrate enhanced outcomes.
In addition to presenting cutting-edge techniques and methods, this paper critically identifies research gaps within the realm of Arabic OCR.
arXiv Detail & Related papers (2023-12-19T03:01:31Z) - ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich
Document Images [198.35937007558078]
The competition opened on 30th December, 2022 and closed on 24th March, 2023.
There are 35 participants and 91 valid submissions received for Track 1, and 15 participants and 26 valid submissions received for Track 2.
According to the performance of the submissions, we believe there is still a large gap on the expected information extraction performance for complex and zero-shot scenarios.
arXiv Detail & Related papers (2023-06-05T22:20:52Z) - Layout-Aware Information Extraction for Document-Grounded Dialogue:
Dataset, Method and Demonstration [75.47708732473586]
We propose a layout-aware document-level Information Extraction dataset, LIE, to facilitate the study of extracting both structural and semantic knowledge from visually rich documents.
LIE contains 62k annotations of three extraction tasks from 4,061 pages in product and official documents.
Empirical results show that layout is critical for VRD-based extraction, and system demonstration also verifies that the extracted knowledge can help locate the answers that users care about.
arXiv Detail & Related papers (2022-07-14T07:59:45Z) - One-shot Key Information Extraction from Document with Deep Partial
Graph Matching [60.48651298832829]
Key Information Extraction (KIE) from documents improves efficiency, productivity, and security in many industrial scenarios.
Existing supervised learning methods for the KIE task need to feed a large number of labeled samples and learn separate models for different types of documents.
We propose a deep end-to-end trainable network for one-shot KIE using partial graph matching.
arXiv Detail & Related papers (2021-09-26T07:45:53Z) - Zero-Shot Information Extraction as a Unified Text-to-Triple Translation [56.01830747416606]
We cast a suite of information extraction tasks into a text-to-triple translation framework.
We formalize the task as a translation between task-specific input text and output triples.
We study the zero-shot performance of this framework on open information extraction.
arXiv Detail & Related papers (2021-09-23T06:54:19Z) - ICDAR 2021 Competition on Components Segmentation Task of Document
Photos [63.289361617237944]
Three challenge tasks were proposed entailing different segmentation assignments to be performed on a provided dataset.
The collected data are from several types of Brazilian ID documents, whose personal information was conveniently replaced.
Different Deep Learning models were applied by the entrants with diverse strategies to achieve the best results in each of the tasks.
arXiv Detail & Related papers (2021-06-16T00:49:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.