Related papers: Sub-millisecond Video Synchronization of Multiple Android Smartphones

Sub-millisecond Video Synchronization of Multiple Android Smartphones

URL: http://arxiv.org/abs/2107.00987v1
Date: Fri, 2 Jul 2021 11:56:33 GMT
Title: Sub-millisecond Video Synchronization of Multiple Android Smartphones
Authors: Azat Akhmetyanov, Anastasiia Kornilova, Marsel Faizullin, David Pozo, Gonzalo Ferrer
Abstract summary: This paper addresses the problem of building an affordable easy-to-setup synchronized multi-view camera system. We propose a solution for this problem - a publicly-available Android application for synchronized video recording on multiple smartphones with sub-millisecond accuracy.
Score: 2.283665431721732
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: This paper addresses the problem of building an affordable easy-to-setup synchronized multi-view camera system, which is in demand for many Computer Vision and Robotics applications in high-dynamic environments. In our work, we propose a solution for this problem - a publicly-available Android application for synchronized video recording on multiple smartphones with sub-millisecond accuracy. We present a generalized mathematical model of timestamping for Android smartphones and prove its applicability on 47 different physical devices. Also, we estimate the time drift parameter for those smartphones, which is less than 1.2 millisecond per minute for most of the considered devices, that makes smartphones' camera system a worthy analog for professional multi-view systems. Finally, we demonstrate Android-app performance on the camera system built from Android smartphones quantitatively, showing less than 300 microseconds synchronization error, and qualitatively - on panorama stitching task.

Related papers

MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices [42.00270347221752]
We propose MobileI2V, a 270M lightweight diffusion model for real-time image-to-video generation on mobile devices.<n>We design a time-step distillation strategy that compresses the I2V sampling steps from more than 20 to only two without significant quality loss.<n>MobileI2V enables, for the first time, fast 720p image-to-video generation on mobile devices, with quality comparable to existing models.
arXiv Detail & Related papers (2025-11-26T15:09:02Z)
RocSync: Millisecond-Accurate Temporal Synchronization for Heterogeneous Camera Systems [38.099313678683224]
We present a low-cost, general-purpose synchronization method that achieves millisecond-level temporal alignment across diverse camera systems.<n>The proposed solution employs a custom-built itLED Clock that encodes time through red and infrared, allowing visual decoding of the exposure window.<n>We validate the system in large-scale surgical recordings involving over 25 heterogeneous cameras spanning both IR and RGB modalities.
arXiv Detail & Related papers (2025-11-18T22:13:06Z)
Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model [60.171601995737646]
Mobile-VideoGPT is an efficient multimodal framework for video understanding. It consists of lightweight dual visual encoders, efficient projectors, and a small language model (SLM) Our results show that Mobile-VideoGPT-0.5B can generate up to 46 tokens per second.
arXiv Detail & Related papers (2025-03-27T17:59:58Z)
MobileMEF: Fast and Efficient Method for Multi-Exposure Fusion [0.6261722394141346]
We propose a new method for multi-exposure fusion based on an encoder-decoder deep learning architecture. Our model is capable of processing 4K resolution images in less than 2 seconds on mid-range smartphones.
arXiv Detail & Related papers (2024-08-15T05:03:14Z)
Replay: Multi-modal Multi-view Acted Videos for Casual Holography [76.49914880351167]
Replay is a collection of multi-view, multi-modal videos of humans interacting socially. Overall, the dataset contains over 4000 minutes of footage and over 7 million timestamped high-resolution frames. The Replay dataset has many potential applications, such as novel-view synthesis, 3D reconstruction, novel-view acoustic synthesis, human body and face analysis, and training generative models.
arXiv Detail & Related papers (2023-07-22T12:24:07Z)
Deep learning-based stereo camera multi-video synchronization [5.305803516459996]
A software-based synchronization method would reduce the cost, weight and size of the entire system. This study paves the way to a production ready software-based video synchronization system.
arXiv Detail & Related papers (2023-03-22T21:14:36Z)
Real-Time Under-Display Cameras Image Restoration and HDR on Mobile Devices [81.61356052916855]
The images captured by under-display cameras (UDCs) are degraded by the screen in front of them. Deep learning methods for image restoration can significantly reduce the degradation of captured images. We propose a lightweight model for blind UDC Image Restoration and HDR, and we also provide a benchmark comparing the performance and runtime of different methods on smartphones.
arXiv Detail & Related papers (2022-11-25T11:46:57Z)
MicroISP: Processing 32MP Photos on Mobile Devices with Deep Learning [114.66037224769005]
We present a novel MicroISP model designed specifically for edge devices. The proposed solution is capable of processing up to 32MP photos on recent smartphones using the standard mobile ML libraries. The architecture of the model is flexible, allowing to adjust its complexity to devices of different computational power.
arXiv Detail & Related papers (2022-11-08T17:40:50Z)
Temporal and Contextual Transformer for Multi-Camera Editing of TV Shows [83.54243912535667]
We first collect a novel benchmark on this setting with four diverse scenarios including concerts, sports games, gala shows, and contests. It contains 88-hour raw videos that contribute to the 14-hour edited videos. We propose a new approach temporal and contextual transformer that utilizes clues from historical shots and other views to make shot transition decisions.
arXiv Detail & Related papers (2022-10-17T04:11:23Z)
Face Deblurring using Dual Camera Fusion on Mobile Phones [23.494813096697815]
Motion blur of fast-moving subjects is a longstanding problem in photography. We develop a novel face deblurring system based on the dual camera fusion technique for mobile phones. Our algorithm runs efficiently on Google Pixel 6, which takes 463 ms overhead per shot.
arXiv Detail & Related papers (2022-07-23T22:50:46Z)
SmartPortraits: Depth Powered Handheld Smartphone Dataset of Human Portraits for State Estimation, Reconstruction and Synthesis [1.981491298222699]
We present a dataset of 1000 video sequences of human portraits recorded in real and uncontrolled conditions. The collected dataset contains 200 people captured in different poses and locations. The main purpose is to bridge the gap between raw measurements obtained from a smartphone and downstream applications.
arXiv Detail & Related papers (2022-04-21T15:47:38Z)
Synchronized Smartphone Video Recording System of Depth and RGB Image Frames with Sub-millisecond Precision [2.1286051580524523]
We propose a recording system with high time synchronization (sync) precision. It consists of heterogeneous sensors such as smartphone, depth camera, IMU, etc.
arXiv Detail & Related papers (2021-11-05T15:16:54Z)
Fast and Accurate Quantized Camera Scene Detection on Smartphones, Mobile AI 2021 Challenge: Report [65.91472671013302]
We introduce the first Mobile AI challenge, where the target is to develop quantized deep learning-based camera scene classification solutions. The proposed solutions are fully compatible with all major mobile AI accelerators and can demonstrate more than 100-200 FPS on the majority of recent smartphone platforms.
arXiv Detail & Related papers (2021-05-17T13:55:38Z)
Single-Frame based Deep View Synchronization for Unsynchronized Multi-Camera Surveillance [56.964614522968226]
Multi-camera surveillance has been an active research topic for understanding and modeling scenes. It is usually assumed that the cameras are all temporally synchronized when designing models for these multi-camera based tasks. Our view synchronization models are applied to different DNNs-based multi-camera vision tasks under the unsynchronized setting.
arXiv Detail & Related papers (2020-07-08T04:39:38Z)

This list is automatically generated from the titles and abstracts of the papers in this site.