Related papers: Toward Accessible and Safe Live Streaming Using Distributed Content Filtering with MoQ

Toward Accessible and Safe Live Streaming Using Distributed Content Filtering with MoQ

URL: http://arxiv.org/abs/2505.08990v1
Date: Tue, 13 May 2025 22:00:22 GMT
Title: Toward Accessible and Safe Live Streaming Using Distributed Content Filtering with MoQ
Authors: Andrew C. Freeman,
Abstract summary: Live video streaming is increasingly popular on social media platforms.<n>Live streaming imposes restrictions on latency for both analysis and distribution.<n>We present extensions to the in-progress Media Over QUIC Transport protocol that enable real-time content moderation.
Score: 0.8158530638728501
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Live video streaming is increasingly popular on social media platforms. With the growth of live streaming comes an increased need for robust content moderation to remove dangerous, illegal, or otherwise objectionable content. Whereas video on demand distribution enables offline content analysis, live streaming imposes restrictions on latency for both analysis and distribution. In this paper, we present extensions to the in-progress Media Over QUIC Transport protocol that enable real-time content moderation in one-to-many video live streams. Importantly, our solution removes only the video segments that contain objectionable content, allowing playback resumption as soon as the stream conforms to content policies again. Content analysis tasks may be transparently distributed to arbitrary client devices. We implement and evaluate our system in the context of light strobe removal for photosensitive viewers, finding that streaming clients experience an increased latency of only one group-of-pictures duration.

Related papers

StreamSense: Streaming Social Task Detection with Selective Vision-Language Model Routing [56.32296785595906]
StreamSense is a streaming detector that couples a lightweight streaming encoder with selective routing to a Vision-Language Model expert.<n>We evaluate StreamSense on multiple social streaming detection tasks (e.g., sentiment classification and hate content moderation)<n>Our results indicate that selective escalation and deferral are effective primitives for understanding streaming social tasks.
arXiv Detail & Related papers (2026-01-30T09:19:22Z)
Foresight Prediction Enhanced Live-Streaming Recommendation [18.07489662404993]
Live-streaming, due to the dynamics of content and time, poses higher requirements for the recommendation algorithm of the platform.<n>We perform semantic quantization on live-streaming segments to obtain Semantic ids (Sid), encode the historical Sid sequence to capture the author's characteristics, and model Sid evolution trend to enable foresight prediction of future content.
arXiv Detail & Related papers (2025-12-07T07:25:38Z)
Streaming Drag-Oriented Interactive Video Manipulation: Drag Anything, Anytime! [88.12304235156591]
We propose textbfstReaming drag-oriEnted interactiVe vidEo manipuLation (REVEL), a new task that enables users to modify generated videos emphanytime on emphanything via fine-grained, interactive drag.<n>Our method can be seamlessly integrated into existing autoregressive video diffusion models.
arXiv Detail & Related papers (2025-10-03T22:38:35Z)
TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos [47.91239059703758]
TimeChat-Online is a novel online VideoLLM that revolutionizes real-time video interaction.<n>Our Differential Token Drop (DTD) module addresses the challenge of visual redundancy in streaming videos.<n>Our experiments demonstrate that DTD achieves an 82.8% reduction in video tokens while maintaining 98% performance on StreamingBench.
arXiv Detail & Related papers (2025-04-24T07:59:46Z)
Delayed takedown of illegal content on social media makes moderation ineffective [4.4134057281132195]
This study models the relationship between the timeliness of illegal content removal and its prevalence, reach, and exposure on social media.<n>By simulating illegal content diffusion using empirical data from the DSA Transparency Database, we demonstrate that rapid takedown (within hours) significantly reduces illegal content prevalence and exposure.<n>While these findings support tight takedown deadlines for content removal, such deadlines cannot address the delay in identifying the illegal content and can adversely affect the quality of content moderation.
arXiv Detail & Related papers (2025-02-12T23:16:39Z)
SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis [52.050036778325094]
We introduce SALOVA: Segment-Augmented Video Assistant, a novel video-LLM framework designed to enhance the comprehension of lengthy video content.<n>We present a high-quality collection of 87.8K long videos, each densely captioned at the segment level to enable models to capture scene continuity and maintain rich context.<n>Our framework mitigates the limitations of current video-LMMs by allowing for precise identification and retrieval of relevant video segments in response to queries.
arXiv Detail & Related papers (2024-11-25T08:04:47Z)
Implementing an Optimized and Secured Multimedia Streaming Protocol in a Participatory Sensing Scenario [0.0]
Crowdsensing can distribute information about shared video contents among multiple users in network. Crowdsensing introduces several security constraints that must be taken into account to ensure confidentiality, integrity, and availability of the data. In this article, we will discuss the use of a symmetric AES-CTR encryption based protocol for securing data streaming over a crowd-sensed network.
arXiv Detail & Related papers (2024-11-14T07:35:53Z)
Interpolating Video-LLMs: Toward Longer-sequence LMMs in a Training-free Manner [53.671484175063995]
Video-LLMs are pre-trained to process short videos, limiting their broader application for understanding longer video content. We introduce an alternative video token rearrangement technique that circumvents limitations imposed by the fixed video encoder and alignment projector.
arXiv Detail & Related papers (2024-09-19T17:59:55Z)
Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams [78.72965584414368]
We present Flash-VStream, a video-language model that simulates the memory mechanism of human. Compared to existing models, Flash-VStream achieves significant reductions in latency inference and VRAM consumption. We propose VStream-QA, a novel question answering benchmark specifically designed for online video streaming understanding.
arXiv Detail & Related papers (2024-06-12T11:07:55Z)
Content Moderation on Social Media in the EU: Insights From the DSA Transparency Database [0.0]
Digital Services Act (DSA) requires large social media platforms in the EU to provide clear and specific information whenever they restrict access to certain content. Statements of Reasons (SoRs) are collected in the DSA Transparency Database to ensure transparency and scrutiny of content moderation decisions. We empirically analyze 156 million SoRs within an observation period of two months to provide an early look at content moderation decisions of social media platforms in the EU.
arXiv Detail & Related papers (2023-12-07T16:56:19Z)
VADER: Video Alignment Differencing and Retrieval [70.88247176534426]
VADER matches and aligns partial video fragments to candidate videos using a robust visual descriptor and scalable search over chunked video content. A space-time comparator module identifies regions of manipulation between content, invariant to any changes due to any residual temporal misalignments or artifacts arising from non-editorial changes of the content.
arXiv Detail & Related papers (2023-03-23T11:50:44Z)
LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos [82.48910259277984]
Livestream tutorial videos are usually hours long, recorded, and uploaded to the Internet directly after the live sessions, making it hard for other people to catch up quickly. An outline will be a beneficial solution, which requires the video to be temporally segmented according to topics. We propose LiveSeg, an unsupervised Livestream video temporal solution, which takes advantage of multimodal features from different domains.
arXiv Detail & Related papers (2022-10-12T00:08:17Z)
Modeling Live Video Streaming: Real-Time Classification, QoE Inference, and Field Evaluation [1.4353812560047186]
ReCLive is a machine learning method for live video detection and QoE measurement based on network-level behavioral characteristics. We analyze about 23,000 video streams from Twitch and YouTube, and identify key features in their traffic profile that differentiate live and on-demand streaming. Our solution provides ISPs with fine-grained visibility into live video streams, enabling them to measure and improve user experience.
arXiv Detail & Related papers (2021-12-05T17:53:06Z)

This list is automatically generated from the titles and abstracts of the papers in this site.