Cognitive Representation Learning of Self-Media Online Article Quality
- URL: http://arxiv.org/abs/2008.05658v1
- Date: Thu, 13 Aug 2020 02:59:52 GMT
- Title: Cognitive Representation Learning of Self-Media Online Article Quality
- Authors: Yiru Wang, Shen Huang, Gongfu Li, Qiang Deng, Dongliang Liao, Pengda
Si, Yujiu Yang, Jin Xu
- Abstract summary: Self-media online articles are mainly created by users, which have the appearance characteristics of different text levels and multi-modal hybrid editing.
We establish a joint model CoQAN in combination with the layout organization, writing characteristics and text semantics.
We have also constructed a large scale real-world assessment dataset.
- Score: 24.084727302752377
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The automatic quality assessment of self-media online articles is an urgent
and new issue, which is of great value to the online recommendation and search.
Different from traditional and well-formed articles, self-media online articles
are mainly created by users, which have the appearance characteristics of
different text levels and multi-modal hybrid editing, along with the potential
characteristics of diverse content, different styles, large semantic spans and
good interactive experience requirements. To solve these challenges, we
establish a joint model CoQAN in combination with the layout organization,
writing characteristics and text semantics, designing different representation
learning subnetworks, especially for the feature learning process and
interactive reading habits on mobile terminals. It is more consistent with the
cognitive style of expressing an expert's evaluation of articles. We have also
constructed a large scale real-world assessment dataset. Extensive experimental
results show that the proposed framework significantly outperforms
state-of-the-art methods, and effectively learns and integrates different
factors of the online article quality assessment.
Related papers
- Automated Genre-Aware Article Scoring and Feedback Using Large Language Models [8.10826723408637]
This paper focuses on the development of an advanced intelligent article scoring system.
It assesses the overall quality of written work and offers detailed feature-based scoring tailored to various article genres.
arXiv Detail & Related papers (2024-10-18T04:13:51Z) - Multi-Facet Counterfactual Learning for Content Quality Evaluation [48.73583736357489]
We propose a framework for efficiently constructing evaluators that perceive multiple facets of content quality evaluation.
We leverage a joint training strategy based on contrastive learning and supervised learning to enable the evaluator to distinguish between different quality facets.
arXiv Detail & Related papers (2024-10-10T08:04:10Z) - Language Guided Domain Generalized Medical Image Segmentation [68.93124785575739]
Single source domain generalization holds promise for more reliable and consistent image segmentation across real-world clinical settings.
We propose an approach that explicitly leverages textual information by incorporating a contrastive learning mechanism guided by the text encoder features.
Our approach achieves favorable performance against existing methods in literature.
arXiv Detail & Related papers (2024-04-01T17:48:15Z) - Multi-Dimensional Evaluation of Text Summarization with In-Context
Learning [79.02280189976562]
In this paper, we study the efficacy of large language models as multi-dimensional evaluators using in-context learning.
Our experiments show that in-context learning-based evaluators are competitive with learned evaluation frameworks for the task of text summarization.
We then analyze the effects of factors such as the selection and number of in-context examples on performance.
arXiv Detail & Related papers (2023-06-01T23:27:49Z) - OCRBench: On the Hidden Mystery of OCR in Large Multimodal Models [122.27878464009181]
We conducted a comprehensive evaluation of Large Multimodal Models, such as GPT4V and Gemini, in various text-related visual tasks.
OCRBench contains 29 datasets, making it the most comprehensive OCR evaluation benchmark available.
arXiv Detail & Related papers (2023-05-13T11:28:37Z) - Large Language Models are Diverse Role-Players for Summarization
Evaluation [82.31575622685902]
A document summary's quality can be assessed by human annotators on various criteria, both objective ones like grammar and correctness, and subjective ones like informativeness, succinctness, and appeal.
Most of the automatic evaluation methods like BLUE/ROUGE may be not able to adequately capture the above dimensions.
We propose a new evaluation framework based on LLMs, which provides a comprehensive evaluation framework by comparing generated text and reference text from both objective and subjective aspects.
arXiv Detail & Related papers (2023-03-27T10:40:59Z) - Investigating Fairness Disparities in Peer Review: A Language Model
Enhanced Approach [77.61131357420201]
We conduct a thorough and rigorous study on fairness disparities in peer review with the help of large language models (LMs)
We collect, assemble, and maintain a comprehensive relational database for the International Conference on Learning Representations (ICLR) conference from 2017 to date.
We postulate and study fairness disparities on multiple protective attributes of interest, including author gender, geography, author, and institutional prestige.
arXiv Detail & Related papers (2022-11-07T16:19:42Z) - Interpretable Fake News Detection with Topic and Deep Variational Models [2.15242029196761]
We focus on fake news detection using interpretable features and methods.
We have developed a deep probabilistic model that integrates a dense representation of textual news.
Our model achieves comparable performance to state-of-the-art competing models.
arXiv Detail & Related papers (2022-09-04T05:31:00Z) - KGEA: A Knowledge Graph Enhanced Article Quality Identification Dataset [4.811084336809668]
We propose a knowledge graph enhanced article quality identification dataset (KGEA) based on Baidu Encyclopedia.
We quantified the articles through 7 dimensions and use co-occurrence of the entities between the articles and the Baidu encyclopedia to construct the knowledge graph for every article.
We also compared some text classification baselines and found that external knowledge can guide the articles to a more competitive classification with the graph neural networks.
arXiv Detail & Related papers (2022-06-15T14:15:41Z) - Optical character recognition quality affects perceived usefulness of
historical newspaper clippings [0.6299766708197884]
Thirty-two users made searches to an article collection of Finnish newspaper Uusi Suometar 1869-1918.
Our article search database had two versions of each article with different quality optical character recognition.
arXiv Detail & Related papers (2022-06-01T10:07:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.