A Principled Design of Image Representation: Towards Forensic Tasks
- URL: http://arxiv.org/abs/2203.00913v1
- Date: Wed, 2 Mar 2022 07:46:52 GMT
- Title: A Principled Design of Image Representation: Towards Forensic Tasks
- Authors: Shuren Qi, Yushu Zhang, Chao Wang, Jiantao Zhou, Xiaochun Cao
- Abstract summary: We investigate the forensic-oriented image representation as a distinct problem, from the perspectives of theory, implementation, and application.
At the theoretical level, we propose a new representation framework for forensics, called Dense Invariant Representation (DIR), which is characterized by stable description with mathematical guarantees.
We demonstrate the above arguments on the dense-domain pattern detection and matching experiments, providing comparison results with state-of-the-art descriptors.
- Score: 75.40968680537544
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Image forensics is a rising topic as the trustworthy multimedia content is
critical for modern society. Like other vision-related applications, forensic
analysis relies heavily on the proper image representation. Despite the
importance, current theoretical understanding for such representation remains
limited, with varying degrees of neglect for its key role. For this gap, we
attempt to investigate the forensic-oriented image representation as a distinct
problem, from the perspectives of theory, implementation, and application. Our
work starts from the abstraction of basic principles that the representation
for forensics should satisfy, especially revealing the criticality of
robustness, interpretability, and coverage. At the theoretical level, we
propose a new representation framework for forensics, called Dense Invariant
Representation (DIR), which is characterized by stable description with
mathematical guarantees. At the implementation level, the discrete calculation
problems of DIR are discussed, and the corresponding accurate and fast
solutions are designed with generic nature and constant complexity. We
demonstrate the above arguments on the dense-domain pattern detection and
matching experiments, providing comparison results with state-of-the-art
descriptors. Also, at the application level, the proposed DIR is initially
explored in passive and active forensics, namely copy-move forgery detection
and perceptual hashing, exhibiting the benefits in fulfilling the requirements
of such forensic tasks.
Related papers
- Hierarchical Invariance for Robust and Interpretable Vision Tasks at Larger Scales [54.78115855552886]
We show how to construct over-complete invariants with a Convolutional Neural Networks (CNN)-like hierarchical architecture.
With the over-completeness, discriminative features w.r.t. the task can be adaptively formed in a Neural Architecture Search (NAS)-like manner.
For robust and interpretable vision tasks at larger scales, hierarchical invariant representation can be considered as an effective alternative to traditional CNN and invariants.
arXiv Detail & Related papers (2024-02-23T16:50:07Z) - Robust Saliency-Aware Distillation for Few-shot Fine-grained Visual
Recognition [57.08108545219043]
Recognizing novel sub-categories with scarce samples is an essential and challenging research topic in computer vision.
Existing literature addresses this challenge by employing local-based representation approaches.
This article proposes a novel model, Robust Saliency-aware Distillation (RSaD), for few-shot fine-grained visual recognition.
arXiv Detail & Related papers (2023-05-12T00:13:17Z) - Synergies between Disentanglement and Sparsity: Generalization and
Identifiability in Multi-Task Learning [79.83792914684985]
We prove a new identifiability result that provides conditions under which maximally sparse base-predictors yield disentangled representations.
Motivated by this theoretical result, we propose a practical approach to learn disentangled representations based on a sparsity-promoting bi-level optimization problem.
arXiv Detail & Related papers (2022-11-26T21:02:09Z) - Shrinking the Semantic Gap: Spatial Pooling of Local Moment Invariants
for Copy-Move Forgery Detection [7.460203098159187]
Copy-move forgery is a manipulation of copying and pasting specific patches from and to an image, with potentially illegal or unethical uses.
Recent advances in the forensic methods for copy-move forgery have shown increasing success in detection accuracy and robustness.
For images with high self-similarity or strong signal corruption, the existing algorithms often exhibit inefficient processes and unreliable results.
arXiv Detail & Related papers (2022-07-19T09:11:43Z) - An Empirical Investigation of Representation Learning for Imitation [76.48784376425911]
Recent work in vision, reinforcement learning, and NLP has shown that auxiliary representation learning objectives can reduce the need for large amounts of expensive, task-specific data.
We propose a modular framework for constructing representation learning algorithms, then use our framework to evaluate the utility of representation learning for imitation.
arXiv Detail & Related papers (2022-05-16T11:23:42Z) - A Survey of Orthogonal Moments for Image Representation: Theory,
Implementation, and Evaluation [70.0671278823937]
Moment-based image representation has been reported to be effective in satisfying the core conditions of semantic description.
This paper presents a comprehensive survey of the orthogonal moments for image representation, covering recent advances in fast/accurate calculation, robustness/invariance optimization, and definition extension.
The presented theory analysis, software implementation, and evaluation results can support the community, particularly in developing novel techniques and promoting real-world applications.
arXiv Detail & Related papers (2021-03-27T03:41:08Z) - Discriminative Residual Analysis for Image Set Classification with
Posture and Age Variations [27.751472312581228]
Discriminant Residual Analysis (DRA) is proposed to improve the classification performance.
DRA attempts to obtain a powerful projection which casts the residual representations into a discriminant subspace.
Two regularization approaches are used to deal with the probable small sample size problem.
arXiv Detail & Related papers (2020-08-23T08:53:06Z) - Anatomy of Catastrophic Forgetting: Hidden Representations and Task
Semantics [24.57617154267565]
We investigate how forgetting affects representations in neural network models.
We find that deeper layers are disproportionately the source of forgetting.
We also introduce a novel CIFAR-100 based task approximating realistic input distribution shift.
arXiv Detail & Related papers (2020-07-14T23:31:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.