GUIDO: A Hybrid Approach to Guideline Discovery & Ordering from Natural
Language Texts
- URL: http://arxiv.org/abs/2307.09959v1
- Date: Wed, 19 Jul 2023 13:01:03 GMT
- Title: GUIDO: A Hybrid Approach to Guideline Discovery & Ordering from Natural
Language Texts
- Authors: Nils Freyer, Dustin Thewes, Matthias Meinecke
- Abstract summary: GUIDO is a hybrid approach to the process model extraction task.
The presented approach achieves significantly better results than a pure rule-based approach.
Still, in comparison to purely machine-learning-based approaches, the annotation costs stay low.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Extracting workflow nets from textual descriptions can be used to simplify
guidelines or formalize textual descriptions of formal processes like business
processes and algorithms. The task of manually extracting processes, however,
requires domain expertise and effort. While automatic process model extraction
is desirable, annotating texts with formalized process models is expensive.
Therefore, there are only a few machine-learning-based extraction approaches.
Rule-based approaches, in turn, require domain specificity to work well and can
rarely distinguish relevant and irrelevant information in textual descriptions.
In this paper, we present GUIDO, a hybrid approach to the process model
extraction task that first, classifies sentences regarding their relevance to
the process model, using a BERT-based sentence classifier, and second, extracts
a process model from the sentences classified as relevant, using dependency
parsing. The presented approach achieves significantly better results than a
pure rule-based approach. GUIDO achieves an average behavioral similarity score
of $0.93$. Still, in comparison to purely machine-learning-based approaches,
the annotation costs stay low.
Related papers
- Learning from Litigation: Graphs and LLMs for Retrieval and Reasoning in eDiscovery [6.037276428689637]
This paper introduces DISCOvery Graph (DISCOG), a hybrid approach that combines the strengths of two worlds: a graph-based method for accurate document relevance prediction.
Our approach drastically reduces document review costs by 99.9% compared to manual processes and by 95% compared to LLM-based classification methods.
arXiv Detail & Related papers (2024-05-29T15:08:55Z) - From Dialogue to Diagram: Task and Relationship Extraction from Natural
Language for Accelerated Business Process Prototyping [0.0]
This paper introduces a contemporary solution, where central to our approach, is the use of dependency parsing and Named Entity Recognition (NER)
We utilize Subject-Verb-Object (SVO) constructs for identifying action relationships and integrate semantic analysis tools, including WordNet, for enriched contextual understanding.
The system adeptly handles data transformation and visualization, converting verbose extracted information into BPMN (Business Process Model and Notation) diagrams.
arXiv Detail & Related papers (2023-12-16T12:35:28Z) - Generative Context-aware Fine-tuning of Self-supervised Speech Models [54.389711404209415]
We study the use of generative large language models (LLM) generated context information.
We propose an approach to distill the generated information during fine-tuning of self-supervised speech models.
We evaluate the proposed approach using the SLUE and Libri-light benchmarks for several downstream tasks: automatic speech recognition, named entity recognition, and sentiment analysis.
arXiv Detail & Related papers (2023-12-15T15:46:02Z) - Automated Few-shot Classification with Instruction-Finetuned Language
Models [76.69064714392165]
We show that AuT-Few outperforms state-of-the-art few-shot learning methods.
We also show that AuT-Few is the best ranking method across datasets on the RAFT few-shot benchmark.
arXiv Detail & Related papers (2023-05-21T21:50:27Z) - Supporting Vision-Language Model Inference with Confounder-pruning Knowledge Prompt [71.77504700496004]
Vision-language models are pre-trained by aligning image-text pairs in a common space to deal with open-set visual concepts.
To boost the transferability of the pre-trained models, recent works adopt fixed or learnable prompts.
However, how and what prompts can improve inference performance remains unclear.
arXiv Detail & Related papers (2022-05-23T07:51:15Z) - FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality
Assessment [93.09267863425492]
We argue that understanding both high-level semantics and internal temporal structures of actions in competitive sports videos is the key to making predictions accurate and interpretable.
We construct a new fine-grained dataset, called FineDiving, developed on diverse diving events with detailed annotations on action procedures.
arXiv Detail & Related papers (2022-04-07T17:59:32Z) - Value Retrieval with Arbitrary Queries for Form-like Documents [50.5532781148902]
We propose value retrieval with arbitrary queries for form-like documents.
Our method predicts target value for an arbitrary query based on the understanding of layout and semantics of a form.
We propose a simple document language modeling (simpleDLM) strategy to improve document understanding on large-scale model pre-training.
arXiv Detail & Related papers (2021-12-15T01:12:02Z) - Augmenting Modelers with Semantic Autocompletion of Processes [5.279475826661643]
Business process modelers need expertise and knowledge of the domain that may not always be available to them.
We present a method for process autocompletion at design time, that is based on the semantic similarity of sub-processes.
arXiv Detail & Related papers (2021-05-24T16:23:07Z) - Process Discovery for Structured Program Synthesis [70.29027202357385]
A core task in process mining is process discovery which aims to learn an accurate process model from event log data.
In this paper, we propose to use (block-) structured programs directly as target process models.
We develop a novel bottom-up agglomerative approach to the discovery of such structured program process models.
arXiv Detail & Related papers (2020-08-13T10:33:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.