ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich
Document Images
- URL: http://arxiv.org/abs/2306.03287v1
- Date: Mon, 5 Jun 2023 22:20:52 GMT
- Title: ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich
Document Images
- Authors: Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen,
Mingyu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun
Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang
Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang,
Dimosthenis Karatzas, Xing Sun, Jingdong Wang, and Xiang Bai
- Abstract summary: The competition opened on 30th December, 2022 and closed on 24th March, 2023.
There are 35 participants and 91 valid submissions received for Track 1, and 15 participants and 26 valid submissions received for Track 2.
According to the performance of the submissions, we believe there is still a large gap on the expected information extraction performance for complex and zero-shot scenarios.
- Score: 198.35937007558078
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Structured text extraction is one of the most valuable and challenging
application directions in the field of Document AI. However, the scenarios of
past benchmarks are limited, and the corresponding evaluation protocols usually
focus on the submodules of the structured text extraction scheme. In order to
eliminate these problems, we organized the ICDAR 2023 competition on Structured
text extraction from Visually-Rich Document images (SVRD). We set up two tracks
for SVRD including Track 1: HUST-CELL and Track 2: Baidu-FEST, where HUST-CELL
aims to evaluate the end-to-end performance of Complex Entity Linking and
Labeling, and Baidu-FEST focuses on evaluating the performance and
generalization of Zero-shot / Few-shot Structured Text extraction from an
end-to-end perspective. Compared to the current document benchmarks, our two
tracks of competition benchmark enriches the scenarios greatly and contains
more than 50 types of visually-rich document images (mainly from the actual
enterprise applications). The competition opened on 30th December, 2022 and
closed on 24th March, 2023. There are 35 participants and 91 valid submissions
received for Track 1, and 15 participants and 26 valid submissions received for
Track 2. In this report we will presents the motivation, competition datasets,
task definition, evaluation protocol, and submission summaries. According to
the performance of the submissions, we believe there is still a large gap on
the expected information extraction performance for complex and zero-shot
scenarios. It is hoped that this competition will attract many researchers in
the field of CV and NLP, and bring some new thoughts to the field of Document
AI.
Related papers
- ICDAR 2023 Competition on Robust Layout Segmentation in Corporate
Documents [3.6700088931938835]
ICDAR has a long tradition in hosting competitions to benchmark the state-of-the-art.
To raise the bar over previous competitions, we engineered a hard competition dataset and proposed the recent DocLayNet dataset for training.
We recognize interesting combinations of recent computer vision models, data augmentation strategies and ensemble methods to achieve remarkable accuracy in the task we posed.
arXiv Detail & Related papers (2023-05-24T09:56:47Z) - ICDAR 2023 Competition on Hierarchical Text Detection and Recognition [60.68100769639923]
The competition is aimed to promote research into deep learning models and systems that can jointly perform text detection and recognition.
We present details of the proposed competition organization, including tasks, datasets, evaluations, and schedule.
During the competition period (from January 2nd 2023 to April 1st 2023), at least 50 submissions from more than 20 teams were made in the 2 proposed tasks.
arXiv Detail & Related papers (2023-05-16T18:56:12Z) - ICDAR 2023 Competition on Reading the Seal Title [58.866588777012744]
To promote research in this area, we organized ICDAR 2023 competition on reading the seal title (ReST)
We constructed a dataset of 10,000 real seal data, covering the most common classes of seals, and labeled all seal title texts with text and text contents.
The competition attracted 53 participants from academia and industry including 28 submissions for Task 1 and 25 submissions for Task 2, which demonstrated significant interest in this challenging task.
arXiv Detail & Related papers (2023-04-24T10:01:41Z) - ICDAR 2023 Video Text Reading Competition for Dense and Small Text [61.138557702185274]
We establish a video text reading benchmark, DSText, which focuses on dense and small text reading challenges in the video.
Compared with the previous datasets, the proposed dataset mainly include three new challenges.
The proposed DSText includes 100 video clips from 12 open scenarios, supporting two tasks (i.e., video text tracking (Task 1) and end-to-end video text spotting (Task 2)
arXiv Detail & Related papers (2023-04-10T04:20:34Z) - Out-of-Vocabulary Challenge Report [15.827931962904115]
The Out-Of-Vocabulary 2022 (OOV) challenge introduces the recognition of unseen scene text instances at training time.
The competition compiles a collection of public scene text datasets comprising of 326,385 images with 4,864,405 scene text instances.
A thorough analysis of results from baselines and different participants is presented.
arXiv Detail & Related papers (2022-09-14T15:25:54Z) - ICDAR 2021 Competition on Components Segmentation Task of Document
Photos [63.289361617237944]
Three challenge tasks were proposed entailing different segmentation assignments to be performed on a provided dataset.
The collected data are from several types of Brazilian ID documents, whose personal information was conveniently replaced.
Different Deep Learning models were applied by the entrants with diverse strategies to achieve the best results in each of the tasks.
arXiv Detail & Related papers (2021-06-16T00:49:58Z) - ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction [70.71240097723745]
In recognition of the technical challenges, importance and huge commercial potentials of SROIE, we organized the ICDAR 2019 competition on SROIE.
A new dataset with 1000 whole scanned receipt images and annotations is created for the competition.
In this report we will presents the motivation, competition datasets, task definition, evaluation protocol, submission statistics, performance of submitted methods and results analysis.
arXiv Detail & Related papers (2021-03-18T12:33:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.