A Browser-based Open Source Assistant for Multimodal Content Verification
- URL: http://arxiv.org/abs/2603.02842v1
- Date: Tue, 03 Mar 2026 10:39:32 GMT
- Title: A Browser-based Open Source Assistant for Multimodal Content Verification
- Authors: Rosanna Milner, Michael Foster, Olesya Razuvayevskaya, Ian Roberts, Valentin Porcellini, Denis Teyssou, Kalina Bontcheva,
- Abstract summary: Disinformation and false content produced by generative AI pose a significant challenge for journalists and fact-checkers.<n>There is an abundance of NLP models for detecting credibility signals such as persuasion techniques, subjectivity, or machine-generated text.<n>This paper demonstrates the VERIFICATION ASSISTANT, a browser-based tool designed to bridge this gap.
- Score: 5.0488681454219675
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Disinformation and false content produced by generative AI pose a significant challenge for journalists and fact-checkers who must rapidly verify digital media information. While there is an abundance of NLP models for detecting credibility signals such as persuasion techniques, subjectivity, or machine-generated text, such methods often remain inaccessible to non-expert users and are not integrated into their daily workflows as a unified framework. This paper demonstrates the VERIFICATION ASSISTANT, a browser-based tool designed to bridge this gap. The VERIFICATION ASSISTANT, a core component of the widely adopted VERIFICATION PLUGIN (140,000+ users), allows users to submit URLs or media files to a unified interface. It automatically extracts content and routes it to a suite of backend NLP classifiers, delivering actionable credibility signals, estimating AI-generated content, and providing other verification guidance in a clear, easy-to-digest format. This paper showcases the tool architecture, its integration of multiple NLP services, and its real-world application to detecting disinformation.
Related papers
- Leveraging LLM Parametric Knowledge for Fact Checking without Retrieval [60.25608870901428]
Trustworthiness is a core research challenge for agentic AI systems built on Large Language Models (LLMs)<n>We propose the task of fact-checking without retrieval, focusing on the verification of arbitrary natural language claims, independent of their source robustness.
arXiv Detail & Related papers (2026-03-05T18:42:51Z) - IDP Accelerator: Agentic Document Intelligence from Extraction to Compliance Validation [3.539467892338473]
We present IDP (Intelligent Document Processing) Accelerator, a framework enabling agentic AI for end-to-end document intelligence.<n>The interactive demonstration enables users to upload document packets, visualize classification results, and explore extracted data.
arXiv Detail & Related papers (2026-02-26T20:20:38Z) - CheckIfExist: Detecting Citation Hallucinations in the Era of AI-Generated Content [0.0]
"CheckIfExist" is an open-source tool designed to provide immediate verification of references against scholarly databases.<n>The proposed tool fills this gap by employing string similarity algorithms to compute multi-dimensional match confidence scores.<n>The system supports both single-reference verification and batch processing of Bib entries through a unified interface.
arXiv Detail & Related papers (2026-01-27T20:26:24Z) - Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding [61.36285696607487]
Document understanding is critical for applications from financial analysis to scientific discovery.<n>Current approaches, whether OCR-based pipelines feeding Large Language Models (LLMs) or native Multimodal LLMs (MLLMs) face key limitations.<n>Retrieval-Augmented Generation (RAG) helps ground models in external data, but documents' multimodal nature, combining text, tables, charts, and layout, demands a more advanced paradigm: Multimodal RAG.
arXiv Detail & Related papers (2025-10-17T02:33:16Z) - Context-Aware Visual Prompting: Automating Geospatial Web Dashboards with Large Language Models and Agent Self-Validation for Decision Support [1.506501956463029]
Development of web-based dashboards for risk analysis and decision making often challenged by difficulty in big, multidimensional data.<n>We introduce a generative AI framework that automates the creation of interactive geospatial dashboards from user-defined inputs.
arXiv Detail & Related papers (2025-10-10T10:58:15Z) - Towards Unified Multimodal Misinformation Detection in Social Media: A Benchmark Dataset and Baseline [56.790045049514326]
Two major forms of deception dominate: human-crafted misinformation and AI-generated content.<n>We propose Unified Multimodal Fake Content Detection (UMFDet), a framework designed to handle both forms of deception.<n>UMFDet achieves robust and consistent performance across both misinformation types, outperforming specialized baselines.
arXiv Detail & Related papers (2025-09-30T09:26:32Z) - Semi-automated Fact-checking in Portuguese: Corpora Enrichment using Retrieval with Claim extraction [0.0]
This dissertation develops, applying, and analyzing a methodology to enrich Portuguese news corpora with external evidence.<n>The approach simulates a user's verification process, employing Large Language Models (LLMs) to extract the main claim from texts.<n>A data validation and preprocessing framework, including near-duplicate detection, is introduced to enhance the quality of the base corpora.
arXiv Detail & Related papers (2025-07-19T23:46:40Z) - FACTS&EVIDENCE: An Interactive Tool for Transparent Fine-Grained Factual Verification of Machine-Generated Text [39.804889153032526]
Facts&Evidence is an interactive tool for user-driven verification of complex text.<n>It visualizes the credibility of individual claims along with an explanation of model decisions and attribution to multiple, diverse evidence sources.
arXiv Detail & Related papers (2025-03-19T00:14:55Z) - Unified Pretraining Framework for Document Understanding [52.224359498792836]
We present UDoc, a new unified pretraining framework for document understanding.
UDoc is designed to support most document understanding tasks, extending the Transformer to take multimodal embeddings as input.
An important feature of UDoc is that it learns a generic representation by making use of three self-supervised losses.
arXiv Detail & Related papers (2022-04-22T21:47:04Z) - MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and
Understanding [70.16678926775475]
MMOCR is an open-source toolbox for text detection and recognition.
It implements 14 state-of-the-art algorithms, which is more than all the existing open-source OCR projects we are aware of to date.
arXiv Detail & Related papers (2021-08-14T14:10:23Z) - Rethinking Text Line Recognition Models [57.47147190119394]
We consider two decoder families (Connectionist Temporal Classification and Transformer) and three encoder modules (Bidirectional LSTMs, Self-Attention, and GRCLs)
We compare their accuracy and performance on widely used public datasets of scene and handwritten text.
Unlike the more common Transformer-based models, this architecture can handle inputs of arbitrary length.
arXiv Detail & Related papers (2021-04-15T21:43:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.