MisinfoTeleGraph: Network-driven Misinformation Detection for German Telegram Messages
- URL: http://arxiv.org/abs/2506.22529v1
- Date: Fri, 27 Jun 2025 12:32:19 GMT
- Title: MisinfoTeleGraph: Network-driven Misinformation Detection for German Telegram Messages
- Authors: Lu Kalkbrenner, Veronika Solopova, Steffen Zeiler, Robert Nickel, Dorothea Kolossa,
- Abstract summary: Misinfo-TeleGraph is the first German-language Telegram-based graph dataset for misinformation detection.<n>It includes over 5 million messages from public channels, enriched with metadata, channel relationships, and both weak and strong labels.<n>We evaluate both text-only models and graph neural networks (GNNs) that incorporate message forwarding as a network structure.
- Score: 5.533030792414604
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Connectivity and message propagation are central, yet often underutilized, sources of information in misinformation detection -- especially on poorly moderated platforms such as Telegram, which has become a critical channel for misinformation dissemination, namely in the German electoral context. In this paper, we introduce Misinfo-TeleGraph, the first German-language Telegram-based graph dataset for misinformation detection. It includes over 5 million messages from public channels, enriched with metadata, channel relationships, and both weak and strong labels. These labels are derived via semantic similarity to fact-checks and news articles using M3-embeddings, as well as manual annotation. To establish reproducible baselines, we evaluate both text-only models and graph neural networks (GNNs) that incorporate message forwarding as a network structure. Our results show that GraphSAGE with LSTM aggregation significantly outperforms text-only baselines in terms of Matthews Correlation Coefficient (MCC) and F1-score. We further evaluate the impact of subscribers, view counts, and automatically versus human-created labels on performance, and highlight both the potential and challenges of weak supervision in this domain. This work provides a reproducible benchmark and open dataset for future research on misinformation detection in German-language Telegram networks and other low-moderation social platforms.
Related papers
- pytopicgram: A library for data extraction and topic modeling from Telegram channels [0.0]
pytopicgram is a Python library that helps researchers collect, organize, and analyze these Telegram messages.<n>pytopicgram allows users to understand how content spreads and how audiences interact on Telegram.
arXiv Detail & Related papers (2025-02-07T12:41:47Z) - Beyond Message Passing: Neural Graph Pattern Machine [50.78679002846741]
We introduce the Neural Graph Pattern Machine (GPM), a novel framework that bypasses message passing by learning directly from graph substructures.<n>GPM efficiently extracts, encodes, and prioritizes task-relevant graph patterns, offering greater expressivity and improved ability to capture long-range dependencies.
arXiv Detail & Related papers (2025-01-30T20:37:47Z) - MGM: Global Understanding of Audience Overlap Graphs for Predicting the Factuality and the Bias of News Media [38.09491667371551]
We study the classification problem of profiling news media from the lens of political bias and factuality.<n>Traditional profiling methods, such as Pre-trained Language Models (PLMs) and Graph Neural Networks (GNNs) have shown promising results.<n>We propose MediaGraphMind (MGM), an effective solution within a variational Expectation-Maximization (EM) framework.
arXiv Detail & Related papers (2024-12-12T18:37:32Z) - Bridging Local Details and Global Context in Text-Attributed Graphs [62.522550655068336]
GraphBridge is a framework that bridges local and global perspectives by leveraging contextual textual information.
Our method achieves state-of-theart performance, while our graph-aware token reduction module significantly enhances efficiency and solves scalability issues.
arXiv Detail & Related papers (2024-06-18T13:35:25Z) - Enhancing Fake News Detection in Social Media via Label Propagation on Cross-modal Tweet Graph [19.409935976725446]
We present a novel method for detecting fake news in social media.
Our method densifies the graph's connectivity to capture denser interaction better.
We use three publicly available fake news datasets, Twitter, PHEME, and Weibo, for evaluation.
arXiv Detail & Related papers (2024-06-14T09:55:54Z) - LOSS-GAT: Label Propagation and One-Class Semi-Supervised Graph
Attention Network for Fake News Detection [2.6396287656676725]
Loss-GAT is a semi-supervised and one-class approach for fake news detection.
We employ a two-step label propagation algorithm to categorize news into two groups: interest (fake) and non-interest (real)
arXiv Detail & Related papers (2024-02-13T12:02:37Z) - ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media [74.93847489218008]
We present a novel task, identifying manipulation of news on social media, which aims to detect manipulation in social media posts and identify manipulated or inserted information.<n>To study this task, we have proposed a data collection schema and curated a dataset called ManiTweet, consisting of 3.6K pairs of tweets and corresponding articles.<n>Our analysis demonstrates that this task is highly challenging, with large language models (LLMs) yielding unsatisfactory performance.
arXiv Detail & Related papers (2023-05-23T16:40:07Z) - TGDataset: Collecting and Exploring the Largest Telegram Channels Dataset [57.2282378772772]
This paper presents the TGDataset, a new dataset that includes 120,979 Telegram channels and over 400 million messages.<n>We analyze the languages spoken within our dataset and the topic covered by English channels.<n>In addition to the raw dataset, we released the scripts we used to analyze the dataset and the list of channels belonging to the network of a new conspiracy theory called Sabmyk.
arXiv Detail & Related papers (2023-03-09T15:42:38Z) - Evidential Temporal-aware Graph-based Social Event Detection via
Dempster-Shafer Theory [76.4580340399321]
We propose ETGNN, a novel Evidential Temporal-aware Graph Neural Network.
We construct view-specific graphs whose nodes are the texts and edges are determined by several types of shared elements respectively.
Considering the view-specific uncertainty, the representations of all views are converted into mass functions through evidential deep learning (EDL) neural networks.
arXiv Detail & Related papers (2022-05-24T16:22:40Z) - Detecting Offensive Language on Social Networks: An End-to-end Detection
Method based on Graph Attention Networks [7.723697303436006]
We propose an end-to-end method based on community structure and text features for offensive language detection (CT-OLD)
We add user opinion to the community structure for representing user features. The user opinion is represented by user historical behavior information, which outperforms that represented by text information.
arXiv Detail & Related papers (2022-03-04T03:57:18Z) - Jointly Cross- and Self-Modal Graph Attention Network for Query-Based
Moment Localization [77.21951145754065]
We propose a novel Cross- and Self-Modal Graph Attention Network (CSMGAN) that recasts this task as a process of iterative messages passing over a joint graph.
Our CSMGAN is able to effectively capture high-order interactions between two modalities, thus enabling a further precise localization.
arXiv Detail & Related papers (2020-08-04T08:25:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.