Related papers: EEVEE: An Easy Annotation Tool for Natural Language Processing

EEVEE: An Easy Annotation Tool for Natural Language Processing

URL: http://arxiv.org/abs/2402.02864v1
Date: Mon, 5 Feb 2024 10:24:40 GMT
Title: EEVEE: An Easy Annotation Tool for Natural Language Processing
Authors: Axel Sorensen, Siyao Peng, Barbara Plank, Rob van der Goot
Abstract summary: We propose EEVEE, an annotation tool focused on simplicity, efficiency, and ease of use. It can run directly in the browser (no setup required) and uses tab-separated files (as opposed to character offsets or task-specific formats) for annotation.
Score: 32.111061774093
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Annotation tools are the starting point for creating Natural Language Processing (NLP) datasets. There is a wide variety of tools available; setting up these tools is however a hindrance. We propose EEVEE, an annotation tool focused on simplicity, efficiency, and ease of use. It can run directly in the browser (no setup required) and uses tab-separated files (as opposed to character offsets or task-specific formats) for annotation. It allows for annotation of multiple tasks on a single dataset and supports four task-types: sequence labeling, span labeling, text classification and seq2seq.

Related papers

ToolGrad: Efficient Tool-use Dataset Generation with Textual "Gradients" [53.7887350405379]
Prior work synthesizes tool-use LLM datasets by first generating a user query, followed by complex tool-use annotations like DFS.<n>We introduce ToolGrad, an agentic framework that inverts this paradigm. ToolGrad first constructs valid tool-use chains through an iterative process guided by textual "gradients"<n>This "answer-first" approach led to ToolGrad-5k, a dataset generated with more complex tool use, lower cost, and 100% pass rate.
arXiv Detail & Related papers (2025-08-06T05:04:00Z)
Antarlekhaka: A Comprehensive Tool for Multi-task Natural Language Annotation [0.0]
Antarlekhaka is a tool for manual annotation of a comprehensive set of tasks relevant to Natural Language Processing. The tool is Unicode-compatible, language-agnostic, Web-deployable and supports distributed annotation by multiple simultaneous annotators. It has been used for two real-life annotation tasks on two different languages, namely, Sanskrit and Bengali.
arXiv Detail & Related papers (2023-10-11T19:09:07Z)
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models [90.96816639172464]
Large language models (LLMs) are taught to use new tools by providing a few demonstrations of the tool's usage. We advocate the use of tool documentation, descriptions for the individual tool usage, over demonstrations.
arXiv Detail & Related papers (2023-08-01T17:21:38Z)
POTATO: The Portable Text Annotation Tool [8.924906491840119]
We present POTATO, a free, fully open-sourced annotation system. It supports labeling many types of text and multimodal data. It offers easy-to-configure features to maximize the productivity of both deployers and annotators.
arXiv Detail & Related papers (2022-12-16T17:57:41Z)
PartAL: Efficient Partial Active Learning in Multi-Task Visual Settings [57.08386016411536]
We show that it is more effective to select not only the images to be annotated but also a subset of tasks for which to provide annotations at each Active Learning (AL) We demonstrate the effectiveness of our approach on several popular multi-task datasets.
arXiv Detail & Related papers (2022-11-21T15:08:35Z)
Binding Language Models in Symbolic Languages [146.3027328556881]
Binder is a training-free neural-symbolic framework that maps the task input to a program. In the parsing stage, Codex is able to identify the part of the task input that cannot be answerable by the original programming language. In the execution stage, Codex can perform versatile functionalities given proper prompts in the API calls.
arXiv Detail & Related papers (2022-10-06T12:55:17Z)
SciAnnotate: A Tool for Integrating Weak Labeling Sources for Sequence Labeling [55.71459234749639]
SciAnnotate is a web-based tool for text annotation called SciAnnotate, which stands for scientific annotation tool. Our tool provides users with multiple user-friendly interfaces for creating weak labels. In this study, we take multi-source weak label denoising as an example, we utilized a Bertifying Conditional Hidden Markov Model to denoise the weak label generated by our tool.
arXiv Detail & Related papers (2022-08-07T19:18:13Z)
Annotationsaurus: A Searchable Directory of Annotation Tools [0.0]
We create a comprehensive directory of annotation tools that currently includes 93 tools. We implement simple scripts and a Web application that filters the tools based on chosen criteria. We present two use cases using the directory and propose ideas for its maintenance.
arXiv Detail & Related papers (2020-10-13T09:22:48Z)
DART: A Lightweight Quality-Suggestive Data-to-Text Annotation Tool [15.268017930901332]
The Data AnnotatoR Tool (DART) is an interactive application that reduces human efforts in annotating large quantities of structured data. By using a sequence-to-sequence model, our system iteratively analyzes the annotated labels in order to better sample unlabeled data. In a simulation experiment performed on annotating large quantities of structured data, DART has been shown to reduce the total number of annotations needed with active learning and automatically suggesting relevant labels.
arXiv Detail & Related papers (2020-10-08T17:36:34Z)
HUMAN: Hierarchical Universal Modular ANnotator [14.671297336775387]
We introduce a novel web-based annotation tool that addresses the above problems by a) covering a variety of annotation tasks on both textual and image data, and b) the usage of an internal deterministic state machine. Humane comes with an easy-to-use graphical user interface that simplifies the annotation task and management.
arXiv Detail & Related papers (2020-10-02T16:20:30Z)
TextBrewer: An Open-Source Knowledge Distillation Toolkit for Natural Language Processing [64.87699383581885]
We introduce TextBrewer, an open-source knowledge distillation toolkit for natural language processing. It supports various kinds of supervised learning tasks, such as text classification, reading comprehension, sequence labeling. As a case study, we use TextBrewer to distill BERT on several typical NLP tasks.
arXiv Detail & Related papers (2020-02-28T09:44:07Z)

This list is automatically generated from the titles and abstracts of the papers in this site.