Related papers: Evaluating image matching methods for book cover identification

Evaluating image matching methods for book cover identification

URL: http://arxiv.org/abs/2001.05200v1
Date: Wed, 15 Jan 2020 09:52:38 GMT
Title: Evaluating image matching methods for book cover identification
Authors: Rabie Hachemi, Ikram Achar, Biasi Wiga, Mahfoud Sidi Ali Mebarek
Abstract summary: We explore different feature detectors and matching methods for book cover identification. This will allow libraries to develop interactive services based on cover book picture. Tests have been performed by taking into account different transformations of each book cover image.
Score: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Humans are capable of identifying a book only by looking at its cover, but how can computers do the same? In this paper, we explore different feature detectors and matching methods for book cover identification, and compare their performances in terms of both speed and accuracy. This will allow, for example, libraries to develop interactive services based on cover book picture. Only one single image of a cover book needs to be available through a database. Tests have been performed by taking into account different transformations of each book cover image. Encouraging results have been achieved.

Related papers

Image-text matching for large-scale book collections [10.444851303425589]
We address the problem of detecting and mapping all books in a collection of images to entries in a given book catalogue. We combine a state-of-the-art segmentation method (SAM) to detect book spines and extract book information using a commercial OCR. To evaluate our approach, we publish a new dataset of annotated bookshelf images that covers the whole book collection of a public library in Spain.
arXiv Detail & Related papers (2024-07-29T09:05:04Z)
Interleaving GANs with knowledge graphs to support design creativity for book covers [77.34726150561087]
We apply Generative Adversarial Networks (GANs) to the book covers domain. We interleave GANs with knowledge graphs to alter the input title to obtain multiple possible options for any given title. Finally, we use the discriminator obtained during the training phase to select the best images generated with new titles.
arXiv Detail & Related papers (2023-08-03T08:56:56Z)
Book Cover Synthesis from the Summary [0.0]
We explore ways to produce a book cover using artificial intelligence based on the fact that there exists a relationship between the summary of the book and its cover. We construct a dataset of English books that contains a large number of samples of summaries of existing books and their cover images. We apply different text-to-image synthesis techniques to generate book covers from the summary and exhibit the results in this paper.
arXiv Detail & Related papers (2022-11-03T20:43:40Z)
The Curious Layperson: Fine-Grained Image Recognition without Expert Labels [90.88501867321573]
We consider a new problem: fine-grained image recognition without expert annotations. We learn a model to describe the visual appearance of objects using non-expert image descriptions. We then train a fine-grained textual similarity model that matches image descriptions with documents on a sentence-level basis.
arXiv Detail & Related papers (2021-11-05T17:58:37Z)
Image Collation: Matching illustrations in manuscripts [76.21388548732284]
We introduce the task of illustration collation and a large annotated public dataset to evaluate solutions. We analyze state of the art similarity measures for this task and show that they succeed in simple cases but struggle for large manuscripts. We show clear evidence that significant performance boosts can be expected by exploiting cycle-consistent correspondences.
arXiv Detail & Related papers (2021-08-18T12:12:14Z)
Font Style that Fits an Image -- Font Generation Based on Image Context [7.646713951724013]
We propose a method of generating a book title image based on its context within a book cover. We propose an end-to-end neural network that inputs the book cover, a target location mask, and a desired book title and outputs stylized text suitable for the cover. We demonstrate that the proposed method can effectively produce desirable and appropriate book cover text through quantitative and qualitative results.
arXiv Detail & Related papers (2021-05-19T01:53:04Z)
Intrinsic Image Captioning Evaluation [53.51379676690971]
We propose a learning based metrics for image captioning, which we call Intrinsic Image Captioning Evaluation(I2CE) Experiment results show that our proposed method can keep robust performance and give more flexible scores to candidate captions when encountered with semantic similar expression or less aligned semantics.
arXiv Detail & Related papers (2020-12-14T08:36:05Z)
Deep multi-modal networks for book genre classification based on its cover [0.0]
We propose a multi-modal deep learning framework to solve the cover-based book classification problem. Our method adds an extra modality by extracting texts automatically from the book covers. Results show that the multi-modal framework significantly outperforms the current state-of-the-art image-based models.
arXiv Detail & Related papers (2020-11-15T23:27:43Z)
Compare and Reweight: Distinctive Image Captioning Using Similar Images Sets [52.3731631461383]
We aim to improve the distinctiveness of image captions through training with sets of similar images. Our metric shows that the human annotations of each image are not equivalent based on distinctiveness.
arXiv Detail & Related papers (2020-07-14T07:40:39Z)
A Novel Attention-based Aggregation Function to Combine Vision and Language [55.7633883960205]
We propose a novel fully-attentive reduction method for vision and language. Specifically, our approach computes a set of scores for each element of each modality employing a novel variant of cross-attention. We test our approach on image-text matching and visual question answering, building fair comparisons with other reduction choices.
arXiv Detail & Related papers (2020-04-27T18:09:46Z)

This list is automatically generated from the titles and abstracts of the papers in this site.