Fugu-MT Paper Translation (Abstract): StreamMeCo: Long-Term Agent Memory Compression for Efficient Streaming Video Understanding

Paper overview: StreamMeCo: Long-Term Agent Memory Compression for Efficient Streaming Video Understanding

arxiv url: http://arxiv.org/abs/2604.09000v1
Date: Fri, 10 Apr 2026 06:11:34 GMT
Status: Translation complete
Last updated in system: 2026-04-13 17:57:53.712911
Title: StreamMeCo: Long-Term Agent Memory Compression for Efficient Streaming Video Understanding
Authors: Junxi Wang, Te Sun, Jiayi Zhu, Junxian Li, Haowen Xu, Zichen Wen, Xuming Hu, Zhiyu Li, Linfeng Zhang
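Abstract summary: Vision agent memory has shown remarkable effectiveness in streaming video understanding. This paper proposes StreamMeCo, an efficient Stream Agent Memory Compression framework. Under 70% memory graph compression, StreamMeCo achieves a 1.87× speedup in memory retrieval while improving average accuracy by 1.0%.
Reference score (independently computed attention score): 43.20225248425961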
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Vision agent memory has shown remarkable effectiveness in streaming video understanding. However, storing such memory for videos incurs substantial memory overhead, leading to high costs in both storage and computation. To address this issue, we propose StreamMeCo, an efficient Stream Agent Memory Compression framework. Specifically, based on the connectivity of the memory graph, StreamMeCo introduces edge-free minmax sampling for the isolated nodes and edge-aware weight pruning for the connected nodes, evicting redundant memory nodes while maintaining accuracy. In addition, we introduce a time-decay memory retrieval mechanism to further eliminate the performance degradation caused by memory compression. Extensive experiments on three challenging benchmark datasets (M3-Bench-robot, M3-Bench-web and Video-MME-Long) demonstrate that under 70% memory graph compression, StreamMeCo achieves a 1.87× speedup in memory retrieval while delivering an average accuracy improvement of 1.0%. Our code is available at https://github.com/Celina-love-sweet/StreamMeCo.
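The abstract names three components (sampling for isolated nodes, weight-based pruning for connected nodes, and time-decay retrieval) without giving their details. The Python sketch below is only a rough illustration of how such a pipeline could be wired together. Everything in it is an assumption made for illustration and not the authors' implementation: the function names, the use of cosine distance, the greedy farthest-point rule standing in for "minmax sampling", ranking connected nodes by total incident edge weight, the exponential decay that down-weights older nodes, and the rate `lam`. The repository linked in the abstract is the reference for the actual method.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def split_by_connectivity(nodes, edges):
    """Split node ids into (isolated, connected); edges maps (u, v) -> weight."""
    touched = {n for pair in edges for n in pair}
    isolated = [n for n in nodes if n not in touched]
    connected = [n for n in nodes if n in touched]
    return isolated, connected

def maxmin_sample(ids, emb, k):
    """Assumed reading of 'minmax sampling': greedy farthest-point selection.

    Repeatedly keeps the node whose nearest already-kept node is farthest
    away (in cosine distance), so the kept set stays diverse.
    """
    if k <= 0 or not ids:
        return []
    kept = [ids[0]]
    while len(kept) < min(k, len(ids)):
        rest = [n for n in ids if n not in kept]
        best = max(
            rest,
            key=lambda n: min(1 - cosine(emb[n], emb[m]) for m in kept),
        )
        kept.append(best)
    return kept

def prune_by_edge_weight(ids, edges, k):
    """Assumed criterion: keep the k nodes with the largest total edge weight."""
    strength = {n: 0.0 for n in ids}
    for (u, v), w in edges.items():
        if u in strength:
            strength[u] += w
        if v in strength:
            strength[v] += w
    return sorted(ids, key=lambda n: strength[n], reverse=True)[:k]

def compress(nodes, emb, edges, keep_ratio=0.3):
    """Keep roughly keep_ratio of each group (0.3 corresponds to 70% compression)."""
    isolated, connected = split_by_connectivity(nodes, edges)
    keep_iso = maxmin_sample(isolated, emb, math.ceil(len(isolated) * keep_ratio))
    keep_con = prune_by_edge_weight(
        connected, edges, math.ceil(len(connected) * keep_ratio)
    )
    return set(keep_iso) | set(keep_con)

def decayed_score(query, node_emb, node_time, now, lam=0.01):
    """Similarity discounted exponentially by node age (assumed decay form)."""
    return cosine(query, node_emb) * math.exp(-lam * (now - node_time))
```

With a dictionary `emb` of node embeddings and a dictionary `edges` of weighted pairs, `compress(nodes, emb, edges)` returns the set of node ids to keep, and `decayed_score` can then rank the surviving nodes against a query at retrieval time.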


Related papers