Modeling Structural Similarities between Documents for Coherence
Assessment with Graph Convolutional Networks
- URL: http://arxiv.org/abs/2306.06472v1
- Date: Sat, 10 Jun 2023 16:08:47 GMT
- Title: Modeling Structural Similarities between Documents for Coherence
Assessment with Graph Convolutional Networks
- Authors: Wei Liu, Xiyan Fu, Michael Strube
- Abstract summary: Coherence is an important aspect of text quality, and various approaches have been applied to coherence modeling.
We investigate a GCN-based coherence model that is capable of capturing structural similarities between documents.
We evaluate our method on two tasks, assessing discourse coherence and automated essay scoring.
- Score: 17.853960157501742
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Coherence is an important aspect of text quality, and various approaches have
been applied to coherence modeling. However, existing methods solely focus on a
single document's coherence patterns, ignoring the underlying correlation
between documents. We investigate a GCN-based coherence model that is capable
of capturing structural similarities between documents. Our model first creates
a graph structure for each document, from where we mine different subgraph
patterns. We then construct a heterogeneous graph for the training corpus,
connecting documents based on their shared subgraphs. Finally, a GCN is applied
to the heterogeneous graph to model the connectivity relationships. We evaluate
our method on two tasks, assessing discourse coherence and automated essay
scoring. Results show that our GCN-based model outperforms all baselines,
achieving a new state-of-the-art on both tasks.
Related papers
- Scalable Weibull Graph Attention Autoencoder for Modeling Document Networks [50.42343781348247]
We develop a graph Poisson factor analysis (GPFA) which provides analytic conditional posteriors to improve the inference accuracy.
We also extend GPFA to a multi-stochastic-layer version named graph Poisson gamma belief network (GPGBN) to capture the hierarchical document relationships at multiple semantic levels.
Our models can extract high-quality hierarchical latent document representations and achieve promising performance on various graph analytic tasks.
arXiv Detail & Related papers (2024-10-13T02:22:14Z) - Hypergraph based Understanding for Document Semantic Entity Recognition [65.84258776834524]
We build a novel hypergraph attention document semantic entity recognition framework, HGA, which uses hypergraph attention to focus on entity boundaries and entity categories at the same time.
Our results on FUNSD, CORD, XFUNDIE show that our method can effectively improve the performance of semantic entity recognition tasks.
arXiv Detail & Related papers (2024-07-09T14:35:49Z) - Sparse Graph Representations for Procedural Instructional Documents [7.205864119886871]
We propose two approaches to model document similarity by representing document pairs as a directed and sparse JCIG.
We show that our sparse directed graph model architecture achieves comparable results to the baseline on not containing sequential information.
arXiv Detail & Related papers (2024-02-06T12:34:15Z) - Enhancing Visually-Rich Document Understanding via Layout Structure
Modeling [91.07963806829237]
We propose GraphLM, a novel document understanding model that injects layout knowledge into the model.
We evaluate our model on various benchmarks, including FUNSD, XFUND and CORD, and achieve state-of-the-art results.
arXiv Detail & Related papers (2023-08-15T13:53:52Z) - Document-Level Relation Extraction with Sentences Importance Estimation
and Focusing [52.069206266557266]
Document-level relation extraction (DocRE) aims to determine the relation between two entities from a document of multiple sentences.
We propose a Sentence Estimation and Focusing (SIEF) framework for DocRE, where we design a sentence importance score and a sentence focusing loss.
Experimental results on two domains show that our SIEF not only improves overall performance, but also makes DocRE models more robust.
arXiv Detail & Related papers (2022-04-27T03:20:07Z) - Joint Graph Learning and Matching for Semantic Feature Correspondence [69.71998282148762]
We propose a joint emphgraph learning and matching network, named GLAM, to explore reliable graph structures for boosting graph matching.
The proposed method is evaluated on three popular visual matching benchmarks (Pascal VOC, Willow Object and SPair-71k)
It outperforms previous state-of-the-art graph matching methods by significant margins on all benchmarks.
arXiv Detail & Related papers (2021-09-01T08:24:02Z) - Coarse-to-Fine Entity Representations for Document-level Relation
Extraction [28.39444850200523]
Document-level Relation Extraction (RE) requires extracting relations expressed within and across sentences.
Recent works show that graph-based methods, usually constructing a document-level graph that captures document-aware interactions, can obtain useful entity representations.
We propose the textbfCoarse-to-textbfFine textbfEntity textbfRepresentation model (textbfCFER) that adopts a coarse-to-fine strategy.
arXiv Detail & Related papers (2020-12-04T10:18:59Z) - Neural Topic Modeling by Incorporating Document Relationship Graph [18.692100955163713]
Graph Topic Model (GTM) is a GNN based neural topic model that represents a corpus as a document relationship graph.
Documents and words in the corpus become nodes in the graph and are connected based on document-word co-occurrences.
arXiv Detail & Related papers (2020-09-29T12:45:55Z) - Double Graph Based Reasoning for Document-level Relation Extraction [29.19714611415326]
Document-level relation extraction aims to extract relations among entities within a document.
We propose Graph Aggregation-and-Inference Network (GAIN) featuring double graphs.
Experiments on the public dataset, DocRED, show GAIN achieves a significant performance improvement (2.85 on F1) over the previous state-of-the-art.
arXiv Detail & Related papers (2020-09-29T03:41:01Z) - Leveraging Graph to Improve Abstractive Multi-Document Summarization [50.62418656177642]
We develop a neural abstractive multi-document summarization (MDS) model which can leverage well-known graph representations of documents.
Our model utilizes graphs to encode documents in order to capture cross-document relations, which is crucial to summarizing long documents.
Our model can also take advantage of graphs to guide the summary generation process, which is beneficial for generating coherent and concise summaries.
arXiv Detail & Related papers (2020-05-20T13:39:47Z) - Reasoning with Latent Structure Refinement for Document-Level Relation
Extraction [20.308845516900426]
We propose a novel model that empowers the relational reasoning across sentences by automatically inducing the latent document-level graph.
Specifically, our model achieves an F1 score of 59.05 on a large-scale document-level dataset (DocRED)
arXiv Detail & Related papers (2020-05-13T13:36:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.