MCFormer: A Multi-Cost-Volume Network and Comprehensive Benchmark for Particle Image Velocimetry
- URL: http://arxiv.org/abs/2507.04750v2
- Date: Thu, 10 Jul 2025 02:40:29 GMT
- Title: MCFormer: A Multi-Cost-Volume Network and Comprehensive Benchmark for Particle Image Velocimetry
- Authors: Zicheng Lin, Xiaoqiang Li, Yichao Wang, Chuang Zhu,
- Abstract summary: Particle Image Velocimetry (PIV) is fundamental to fluid dynamics, yet deep learning applications face significant hurdles.<n>A critical gap exists: the lack of comprehensive evaluation of how diverse optical flow models perform specifically on PIV data.<n>This work provides both a foundational benchmark resource and a state-of-the-art method tailored for PIV challenges.
- Score: 8.170526185155747
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Particle Image Velocimetry (PIV) is fundamental to fluid dynamics, yet deep learning applications face significant hurdles. A critical gap exists: the lack of comprehensive evaluation of how diverse optical flow models perform specifically on PIV data, largely due to limitations in available datasets and the absence of a standardized benchmark. This prevents fair comparison and hinders progress. To address this, our primary contribution is a novel, large-scale synthetic PIV benchmark dataset generated from diverse CFD simulations (JHTDB and Blasius). It features unprecedented variety in particle densities, flow velocities, and continuous motion, enabling, for the first time, a standardized and rigorous evaluation of various optical flow and PIV algorithms. Complementing this, we propose Multi Cost Volume PIV (MCFormer), a new deep network architecture leveraging multi-frame temporal information and multiple cost volumes, specifically designed for PIV's sparse nature. Our comprehensive benchmark evaluation, the first of its kind, reveals significant performance variations among adapted optical flow models and demonstrates that MCFormer significantly outperforms existing methods, achieving the lowest overall normalized endpoint error (NEPE). This work provides both a foundational benchmark resource essential for future PIV research and a state-of-the-art method tailored for PIV challenges. We make our benchmark dataset and code publicly available to foster future research in this area.
Related papers
- FindRec: Stein-Guided Entropic Flow for Multi-Modal Sequential Recommendation [50.438552588818]
We propose textbfFindRec (textbfFlexible unified textbfinformation textbfdisentanglement for multi-modal sequential textbfRecommendation)<n>A Stein kernel-based Integrated Information Coordination Module (IICM) theoretically guarantees distribution consistency between multimodal features and ID streams.<n>A cross-modal expert routing mechanism that adaptively filters and combines multimodal features based on their contextual relevance.
arXiv Detail & Related papers (2025-07-07T04:09:45Z) - FD-Bench: A Modular and Fair Benchmark for Data-driven Fluid Simulation [33.45842684810109]
FD-Bench is the first fair, modular, comprehensive and reproducible benchmark for data-driven fluid simulation.<n>It provides fair comparisons across spatial, temporal, and loss function modules.<n>It lays a foundation for robust evaluation of future data-driven fluid models.
arXiv Detail & Related papers (2025-05-25T23:24:18Z) - Spike Imaging Velocimetry: Dense Motion Estimation of Fluids Using Spike Cameras [56.30609439239127]
This study investigates the tremendous potential of spike cameras (a type of ultra-high-speed, high-dynamic-range camera) in Particle Image Velocimetry (PIV)<n>We propose a deep learning framework, Spike Imaging Velocimetry (SIV), designed specifically for highly turbulent and intricate flow fields.<n>We present a spike-based PIV dataset, Particle Scenes with Spike and Displacement (PSSD), which provides labeled data for three challenging fluid dynamics scenarios.
arXiv Detail & Related papers (2025-04-26T09:14:45Z) - FedMLLM: Federated Fine-tuning MLLM on Multimodal Heterogeneity Data [56.08867996209236]
Fine-tuning Multimodal Large Language Models (MLLMs) with Federated Learning (FL) allows for expanding the training data scope by including private data sources.<n>We introduce a benchmark to evaluate the performance of federated fine-tuning of MLLMs across various multimodal heterogeneous scenarios.<n>We develop a general FedMLLM framework that integrates classic FL methods alongside two modality-agnostic strategies.
arXiv Detail & Related papers (2024-11-22T04:09:23Z) - Diffusion posterior sampling for simulation-based inference in tall data settings [53.17563688225137]
Simulation-based inference ( SBI) is capable of approximating the posterior distribution that relates input parameters to a given observation.
In this work, we consider a tall data extension in which multiple observations are available to better infer the parameters of the model.
We compare our method to recently proposed competing approaches on various numerical experiments and demonstrate its superiority in terms of numerical stability and computational cost.
arXiv Detail & Related papers (2024-04-11T09:23:36Z) - Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey and Benchmark [97.8968058408759]
Pre-trained vision models (PVMs) have demonstrated remarkable adaptability across a wide range of downstream vision tasks.<n>As these models scale to billions or even trillions of parameters, conventional full fine-tuning has become increasingly impractical due to its high computational and storage demands.<n> parameter-efficient fine-tuning (PEFT) has emerged as a promising alternative, aiming to achieve performance comparable to full fine-tuning while making minimal adjustments to the model parameters.
arXiv Detail & Related papers (2024-02-03T19:12:20Z) - Unsupervised Cross-Domain Soft Sensor Modelling via Deep
Physics-Inspired Particle Flow Bayes [3.2307729081989334]
We propose a deep Particle Flow Bayes framework for cross-domain soft sensor modeling.
In particular, a sequential Bayes objective is first formulated to perform the maximum likelihood estimation.
We validate the framework on a complex industrial multiphase flow process system.
arXiv Detail & Related papers (2023-06-08T03:43:32Z) - Neural Particle Image Velocimetry [4.416484585765027]
We introduce a convolutional neural network adapted to the problem, namely Volumetric Correspondence Network (VCN)
The network is thoroughly trained and tested on a dataset containing both synthetic and real flow data.
Our analysis indicates that the proposed approach provides improved efficiency also keeping accuracy on par with other state-of-the-art methods in the field.
arXiv Detail & Related papers (2021-01-28T12:03:39Z) - Unsupervised Learning of Particle Image Velocimetry [6.69579674554491]
Deep learning has inspired new approaches to tackle the Particle Image Velocimetry problem.
It is difficult to collect reliable ground truth data in large-scale, real-world scenarios.
We present here what we believe to be the first work which takes an unsupervised learning based approach to tackle PIV problems.
arXiv Detail & Related papers (2020-07-28T21:08:37Z) - Recent Developments Combining Ensemble Smoother and Deep Generative
Networks for Facies History Matching [58.720142291102135]
This research project focuses on the use of autoencoders networks to construct a continuous parameterization for facies models.
We benchmark seven different formulations, including VAE, generative adversarial network (GAN), Wasserstein GAN, variational auto-encoding GAN, principal component analysis (PCA) with cycle GAN, PCA with transfer style network and VAE with style loss.
arXiv Detail & Related papers (2020-05-08T21:32:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.