Computer Vision for Supporting Image Search
- URL: http://arxiv.org/abs/2111.08772v1
- Date: Tue, 16 Nov 2021 20:50:32 GMT
- Title: Computer Vision for Supporting Image Search
- Authors: Alan F. Smeaton
- Abstract summary: We leverage the benefits of huge amounts of data available for training, we have enormous computer processing available and we have seen the evolution of machine learning as a suite of techniques to process data and deliver accurate vision-based systems.
We use this in autonomous vehicle navigation or in security applications, searching CCTV for example, and in medical image analysis for healthcare diagnostics.
One application which is not widespread is image or video search directly by users. In this paper we present the need for such image finding or re-finding by examining human memory and when it fails, thus motivating the need for a different approach to image search which is
- Score: 2.18624447693809
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Computer vision and multimedia information processing have made extreme
progress within the last decade and many tasks can be done with a level of
accuracy as if done by humans, or better. This is because we leverage the
benefits of huge amounts of data available for training, we have enormous
computer processing available and we have seen the evolution of machine
learning as a suite of techniques to process data and deliver accurate
vision-based systems. What kind of applications do we use this processing for ?
We use this in autonomous vehicle navigation or in security applications,
searching CCTV for example, and in medical image analysis for healthcare
diagnostics. One application which is not widespread is image or video search
directly by users. In this paper we present the need for such image finding or
re-finding by examining human memory and when it fails, thus motivating the
need for a different approach to image search which is outlined, along with the
requirements of computer vision to support it.
Related papers
- Medical Image Retrieval Using Pretrained Embeddings [0.6827423171182154]
We show that medical image retrieval is feasible using pretrained networks without any additional training or fine-tuning steps.
Using pretrained embeddings, we achieved a recall of 1 for various tasks at modality, body region, and organ level.
arXiv Detail & Related papers (2023-11-22T17:42:33Z) - Tuning computer vision models with task rewards [88.45787930908102]
Misalignment between model predictions and intended usage can be detrimental for the deployment of computer vision models.
In natural language processing, this is often addressed using reinforcement learning techniques that align models with a task reward.
We adopt this approach and show its surprising effectiveness across multiple computer vision tasks, such as object detection, panoptic segmentation, colorization and image captioning.
arXiv Detail & Related papers (2023-02-16T11:49:48Z) - A Trustworthy Framework for Medical Image Analysis with Deep Learning [71.48204494889505]
TRUDLMIA is a trustworthy deep learning framework for medical image analysis.
It is anticipated that the framework will support researchers and clinicians in advancing the use of deep learning for dealing with public health crises including COVID-19.
arXiv Detail & Related papers (2022-12-06T05:30:22Z) - Computer Vision on X-ray Data in Industrial Production and Security
Applications: A survey [89.45221564651145]
This survey reviews the recent research on using computer vision and machine learning for X-ray analysis in industrial production and security applications.
It covers the applications, techniques, evaluation metrics, datasets, and performance comparison of those techniques on publicly available datasets.
arXiv Detail & Related papers (2022-11-10T13:37:36Z) - Multi-Modal Masked Autoencoders for Medical Vision-and-Language
Pre-Training [62.215025958347105]
We propose a self-supervised learning paradigm with multi-modal masked autoencoders.
We learn cross-modal domain knowledge by reconstructing missing pixels and tokens from randomly masked images and texts.
arXiv Detail & Related papers (2022-09-15T07:26:43Z) - CapillaryX: A Software Design Pattern for Analyzing Medical Images in
Real-time using Deep Learning [0.688204255655161]
This paper provides a computing architecture that locally and in parallel can analyze medical images in real-time.
We focus on a specific medical-industrial case study, namely the quantifying of blood vessels in microcirculation images.
Our results show that our system is approximately 78% faster than its serial system counterpart and 12% faster than a master-slave parallel system architecture.
arXiv Detail & Related papers (2022-04-13T18:47:04Z) - Automatic Image Content Extraction: Operationalizing Machine Learning in
Humanistic Photographic Studies of Large Visual Archives [81.88384269259706]
We introduce Automatic Image Content Extraction framework for machine learning-based search and analysis of large image archives.
The proposed framework can be applied in several domains in humanities and social sciences.
arXiv Detail & Related papers (2022-04-05T12:19:24Z) - Vision Transformers in Medical Computer Vision -- A Contemplative
Retrospection [0.9677949377607575]
Vision Transformers are evolved as one of the most contemporary and dominant architectures that are being used in the field of computer vision.
We surveyed the application of Vision transformers in different areas of medical computer vision such as image-based disease classification, anatomical structure segmentation, registration, region-based lesion Detection, captioning, report generation.
We also put some light on available data sets, adopted methodology, their performance measures, challenges and their solutions in form of discussion.
arXiv Detail & Related papers (2022-03-29T06:32:43Z) - A survey of image labelling for computer vision applications [0.0]
Recent rise of deep learning algorithms for recognising image content has led to the emergence of ad-hoc labelling tools.
We perform a structured literature review to compile the underlying concepts and features of image labelling software.
arXiv Detail & Related papers (2021-04-18T16:01:55Z) - Factors of Influence for Transfer Learning across Diverse Appearance
Domains and Task Types [50.1843146606122]
A simple form of transfer learning is common in current state-of-the-art computer vision models.
Previous systematic studies of transfer learning have been limited and the circumstances in which it is expected to work are not fully understood.
In this paper we carry out an extensive experimental exploration of transfer learning across vastly different image domains.
arXiv Detail & Related papers (2021-03-24T16:24:20Z) - A Review on Near Duplicate Detection of Images using Computer Vision
Techniques [0.0]
The presence of near-duplicates affects the performance of the search engines critically.
The main application of computer vision is image understanding.
There is no proper survey in literature related to near duplicate detection of images.
arXiv Detail & Related papers (2020-09-07T16:41:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.