EduRABSA: An Education Review Dataset for Aspect-based Sentiment Analysis Tasks
- URL: http://arxiv.org/abs/2508.17008v1
- Date: Sat, 23 Aug 2025 12:38:40 GMT
- Title: EduRABSA: An Education Review Dataset for Aspect-based Sentiment Analysis Tasks
- Authors: Yan Cathy Hua, Paul Denny, Jörg Wicker, Katerina Taskova,
- Abstract summary: We present EduRABSA (Education Review ABSA), the first public, ABSA education review dataset.<n>It covers three review subject types (course, teaching staff, university) in the English language.<n>We also share ASQE-DPT (Data Processing Tool), an offline, lightweight, installation-free manual data annotation tool.
- Score: 2.622434937753741
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Every year, most educational institutions seek and receive an enormous volume of text feedback from students on courses, teaching, and overall experience. Yet, turning this raw feedback into useful insights is far from straightforward. It has been a long-standing challenge to adopt automatic opinion mining solutions for such education review text data due to the content complexity and low-granularity reporting requirements. Aspect-based Sentiment Analysis (ABSA) offers a promising solution with its rich, sub-sentence-level opinion mining capabilities. However, existing ABSA research and resources are very heavily focused on the commercial domain. In education, they are scarce and hard to develop due to limited public datasets and strict data protection. A high-quality, annotated dataset is urgently needed to advance research in this under-resourced area. In this work, we present EduRABSA (Education Review ABSA), the first public, annotated ABSA education review dataset that covers three review subject types (course, teaching staff, university) in the English language and all main ABSA tasks, including the under-explored implicit aspect and implicit opinion extraction. We also share ASQE-DPT (Data Processing Tool), an offline, lightweight, installation-free manual data annotation tool that generates labelled datasets for comprehensive ABSA tasks from a single-task annotation. Together, these resources contribute to the ABSA community and education domain by removing the dataset barrier, supporting research transparency and reproducibility, and enabling the creation and sharing of further resources. The dataset, annotation tool, and scripts and statistics for dataset processing and sampling are available at https://github.com/yhua219/edurabsa_dataset_and_annotation_tool.
Related papers
- Data-Efficient Adaptation and a Novel Evaluation Method for Aspect-based Sentiment Analysis [2.3949433621799283]
ABSA research needs are unmet in high-demand yet low-resource areas such as education and healthcare.<n>Traditional evaluation methods based on exact matches are overly rigid for ABSA tasks.<n>We propose a novel evaluation method, Flexible Text Similarity Matching and Optimal Bipartite Pairing.<n>We present the first ABSA study of small decoder-only generative language models.
arXiv Detail & Related papers (2025-11-04T22:11:10Z) - M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis [23.523947343171926]
We present M-ABSA, a comprehensive dataset spanning 7 domains and 21 languages.<n>Our primary focus is on triplet extraction, which involves identifying aspect terms, aspect categories, and sentiment polarities.<n>Our empirical findings highlight that the dataset enables diverse evaluation tasks, such as multilingual and multi-domain transfer learning.
arXiv Detail & Related papers (2025-02-17T14:16:01Z) - ROAST: Review-level Opinion Aspect Sentiment Target Joint Detection for ABSA [50.90538760832107]
This research presents a novel task, Review-Level Opinion Aspect Sentiment Target (ROAST)
ROAST seeks to close the gap between sentence-level and text-level ABSA by identifying every ABSA constituent at the review level.
We extend the available datasets to enable ROAST, addressing the drawbacks noted in previous research.
arXiv Detail & Related papers (2024-05-30T17:29:15Z) - A Systematic Review of Aspect-based Sentiment Analysis: Domains, Methods, and Trends [2.781593421115434]
Aspect-based sentiment analysis (ABSA) is a fine-grained type of sentiment analysis that identifies aspects and their associated opinions from a text.
With the surge of digital opinionated text data, ABSA gained increasing popularity for its ability to mine more detailed and targeted insights.
This paper presents a systematic literature review of ABSA studies with a focus on trends and high-level relationships among these fundamental components.
arXiv Detail & Related papers (2023-11-16T06:01:47Z) - MEMD-ABSA: A Multi-Element Multi-Domain Dataset for Aspect-Based
Sentiment Analysis [23.959356414518957]
We propose a large-scale Multi-Element Multi-Domain dataset (MEMD) that covers the four elements across five domains.
We evaluate generative and non-generative baselines on multiple ABSA subtasks under the open domain setting.
arXiv Detail & Related papers (2023-06-29T14:03:49Z) - MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation
of Videos [106.06278332186106]
Multimodal summarization with multimodal output (MSMO) has emerged as a promising research direction.
Numerous limitations exist within existing public MSMO datasets.
We have meticulously curated the textbfMMSum dataset.
arXiv Detail & Related papers (2023-06-07T07:43:11Z) - Towards Complex Document Understanding By Discrete Reasoning [77.91722463958743]
Document Visual Question Answering (VQA) aims to understand visually-rich documents to answer questions in natural language.
We introduce a new Document VQA dataset, named TAT-DQA, which consists of 3,067 document pages and 16,558 question-answer pairs.
We develop a novel model named MHST that takes into account the information in multi-modalities, including text, layout and visual image, to intelligently address different types of questions.
arXiv Detail & Related papers (2022-07-25T01:43:19Z) - Survey of Aspect-based Sentiment Analysis Datasets [55.61047894397937]
Aspect-based sentiment analysis (ABSA) is a natural language processing problem that requires analyzing user-generated reviews.
Numerous yet scattered corpora for ABSA make it difficult for researchers to identify corpora best suited for a specific ABSA subtask quickly.
This study aims to present a database of corpora that can be used to train and assess autonomous ABSA systems.
arXiv Detail & Related papers (2022-04-11T16:23:36Z) - Understanding Pre-trained BERT for Aspect-based Sentiment Analysis [71.40586258509394]
This paper analyzes the pre-trained hidden representations learned from reviews on BERT for tasks in aspect-based sentiment analysis (ABSA)
It is not clear how the general proxy task of (masked) language model trained on unlabeled corpus without annotations of aspects or opinions can provide important features for downstream tasks in ABSA.
arXiv Detail & Related papers (2020-10-31T02:21:43Z) - Aspect-Based Sentiment Analysis in Education Domain [0.0]
We present a comprehensive review of the existing work in ABSA with a focus in the education domain.
ABSA has found itself useful in a wide range of domains.
Being able to understand and find out what students like and don't like most about a course, professor, or teaching methodology can be of great importance for the respective institutions.
arXiv Detail & Related papers (2020-10-03T21:51:47Z) - ORB: An Open Reading Benchmark for Comprehensive Evaluation of Machine
Reading Comprehension [53.037401638264235]
We present an evaluation server, ORB, that reports performance on seven diverse reading comprehension datasets.
The evaluation server places no restrictions on how models are trained, so it is a suitable test bed for exploring training paradigms and representation learning.
arXiv Detail & Related papers (2019-12-29T07:27:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.