Automatic Image Content Extraction: Operationalizing Machine Learning in
Humanistic Photographic Studies of Large Visual Archives
- URL: http://arxiv.org/abs/2204.02149v1
- Date: Tue, 5 Apr 2022 12:19:24 GMT
- Title: Automatic Image Content Extraction: Operationalizing Machine Learning in
Humanistic Photographic Studies of Large Visual Archives
- Authors: Anssi M\"annist\"o, Mert Seker, Alexandros Iosifidis, Jenni Raitoharju
- Abstract summary: We introduce Automatic Image Content Extraction framework for machine learning-based search and analysis of large image archives.
The proposed framework can be applied in several domains in humanities and social sciences.
- Score: 81.88384269259706
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Applying machine learning tools to digitized image archives has a potential
to revolutionize quantitative research of visual studies in humanities and
social sciences. The ability to process a hundredfold greater number of photos
than has been traditionally possible and to analyze them with an extensive set
of variables will contribute to deeper insight into the material. Overall,
these changes will help to shift the workflow from simple manual tasks to more
demanding stages.
In this paper, we introduce Automatic Image Content Extraction (AICE)
framework for machine learning-based search and analysis of large image
archives. We developed the framework in a multidisciplinary research project as
framework for future photographic studies by reformulating and expanding the
traditional visual content analysis methodologies to be compatible with the
current and emerging state-of-the-art machine learning tools and to cover the
novel machine learning opportunities for automatic content analysis. The
proposed framework can be applied in several domains in humanities and social
sciences, and it can be adjusted and scaled into various research settings. We
also provide information on the current state of different machine learning
techniques and show that there are already various publicly available methods
that are suitable to a wide-scale of visual content analysis tasks.
Related papers
- GalleryGPT: Analyzing Paintings with Large Multimodal Models [64.98398357569765]
Artwork analysis is important and fundamental skill for art appreciation, which could enrich personal aesthetic sensibility and facilitate the critical thinking ability.
Previous works for automatically analyzing artworks mainly focus on classification, retrieval, and other simple tasks, which is far from the goal of AI.
We introduce a superior large multimodal model for painting analysis composing, dubbed GalleryGPT, which is slightly modified and fine-tuned based on LLaVA architecture.
arXiv Detail & Related papers (2024-08-01T11:52:56Z) - An Image-based Typology for Visualization [23.716718517642878]
We present and discuss the results of a qualitative analysis of visual representations from images.
We derive a typology of 10 visualization types of defined groups.
We provide a dataset of 6,833 tagged images and an online tool that can be used to explore and analyze the large set of labeled images.
arXiv Detail & Related papers (2024-03-07T04:33:42Z) - Document AI: A Comparative Study of Transformer-Based, Graph-Based
Models, and Convolutional Neural Networks For Document Layout Analysis [3.231170156689185]
Document AI aims to automatically analyze documents by leveraging natural language processing and computer vision techniques.
One of the major tasks of Document AI is document layout analysis, which structures document pages by interpreting the content and spatial relationships of layout, image, and text.
arXiv Detail & Related papers (2023-08-29T16:58:03Z) - Domain Generalization for Mammographic Image Analysis with Contrastive
Learning [62.25104935889111]
The training of an efficacious deep learning model requires large data with diverse styles and qualities.
A novel contrastive learning is developed to equip the deep learning models with better style generalization capability.
The proposed method has been evaluated extensively and rigorously with mammograms from various vendor style domains and several public datasets.
arXiv Detail & Related papers (2023-04-20T11:40:21Z) - Computer Vision on X-ray Data in Industrial Production and Security
Applications: A survey [89.45221564651145]
This survey reviews the recent research on using computer vision and machine learning for X-ray analysis in industrial production and security applications.
It covers the applications, techniques, evaluation metrics, datasets, and performance comparison of those techniques on publicly available datasets.
arXiv Detail & Related papers (2022-11-10T13:37:36Z) - Leveraging Computer Vision Application in Visual Arts: A Case Study on
the Use of Residual Neural Network to Classify and Analyze Baroque Paintings [0.0]
In this case study, we focus on the classification of a selected painting 'Portrait of the Painter Charles Bruni' by Johann Kupetzky.
We show that the features extracted during residual network training can be useful for image retrieval within search systems in online art collections.
arXiv Detail & Related papers (2022-10-27T10:15:36Z) - A survey of image labelling for computer vision applications [0.0]
Recent rise of deep learning algorithms for recognising image content has led to the emergence of ad-hoc labelling tools.
We perform a structured literature review to compile the underlying concepts and features of image labelling software.
arXiv Detail & Related papers (2021-04-18T16:01:55Z) - Factors of Influence for Transfer Learning across Diverse Appearance
Domains and Task Types [50.1843146606122]
A simple form of transfer learning is common in current state-of-the-art computer vision models.
Previous systematic studies of transfer learning have been limited and the circumstances in which it is expected to work are not fully understood.
In this paper we carry out an extensive experimental exploration of transfer learning across vastly different image domains.
arXiv Detail & Related papers (2021-03-24T16:24:20Z) - EXACT: A collaboration toolset for algorithm-aided annotation of images
with annotation version control [7.6457287740201]
EXACT enables the collaborative interdisciplinary analysis of images online and offline.
Software can be adapted to diverse applications such as counting mitotic figures with a screening mode.
It has already been successfully applied to a broad range of annotation tasks.
arXiv Detail & Related papers (2020-04-30T06:07:21Z) - Image Segmentation Using Deep Learning: A Survey [58.37211170954998]
Image segmentation is a key topic in image processing and computer vision.
There has been a substantial amount of works aimed at developing image segmentation approaches using deep learning models.
arXiv Detail & Related papers (2020-01-15T21:37:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.