Neural networks for semantic segmentation of historical city maps:
Cross-cultural performance and the impact of figurative diversity
- URL: http://arxiv.org/abs/2101.12478v1
- Date: Fri, 29 Jan 2021 09:08:12 GMT
- Title: Neural networks for semantic segmentation of historical city maps:
Cross-cultural performance and the impact of figurative diversity
- Authors: R\'emi Petitpierre (Ecole polytechnique f\'ed\'erale de Lausanne,
EPFL, Switzerland)
- Abstract summary: We present a new semantic segmentation model for historical city maps based on convolutional neural networks.
We show that these networks are able to semantically segment map data of a very large figurative diversity with efficiency.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In this work, we present a new semantic segmentation model for historical
city maps that surpasses the state of the art in terms of flexibility and
performance. Research in automatic map processing is largely focused on
homogeneous corpora or even individual maps, leading to inflexible algorithms.
Recently, convolutional neural networks have opened new perspectives for the
development of more generic tools. Based on two new maps corpora, the first one
centered on Paris and the second one gathering cities from all over the world,
we propose a method for operationalizing the figuration based on traditional
computer vision algorithms that allows large-scale quantitative analysis. In a
second step, we propose a semantic segmentation model based on neural networks
and implement several improvements. Finally, we analyze the impact of map
figuration on segmentation performance and evaluate future ways to improve the
representational flexibility of neural networks. To conclude, we show that
these networks are able to semantically segment map data of a very large
figurative diversity with efficiency.
Related papers
- Ensembling convolutional neural networks for human skin segmentation [2.8391355909797644]
We propose to train a convolutional network using the datasets focused on different features to create an ensemble whose individual outcomes are effectively combined.
We expect that this study will help in developing new ensemble-based techniques that will improve the performance of semantic segmentation systems.
arXiv Detail & Related papers (2024-07-27T17:55:28Z) - End-to-end Neuron Instance Segmentation based on Weakly Supervised
Efficient UNet and Morphological Post-processing [0.0]
We present an end-to-end weakly-supervised framework to automatically detect and segment NeuN stained neuronal cells on histological images.
We integrate the state-of-the-art network, EfficientNet, into our U-Net-like architecture.
arXiv Detail & Related papers (2022-02-17T14:35:45Z) - A Unified Architecture of Semantic Segmentation and Hierarchical
Generative Adversarial Networks for Expression Manipulation [52.911307452212256]
We develop a unified architecture of semantic segmentation and hierarchical GANs.
A unique advantage of our framework is that on forward pass the semantic segmentation network conditions the generative model.
We evaluate our method on two challenging facial expression translation benchmarks, AffectNet and RaFD, and a semantic segmentation benchmark, CelebAMask-HQ.
arXiv Detail & Related papers (2021-12-08T22:06:31Z) - X-Distill: Improving Self-Supervised Monocular Depth via Cross-Task
Distillation [69.9604394044652]
We propose a novel method to improve the self-supervised training of monocular depth via cross-task knowledge distillation.
During training, we utilize a pretrained semantic segmentation teacher network and transfer its semantic knowledge to the depth network.
We extensively evaluate the efficacy of our proposed approach on the KITTI benchmark and compare it with the latest state of the art.
arXiv Detail & Related papers (2021-10-24T19:47:14Z) - Classification of Urban Morphology with Deep Learning: Application on
Urban Vitality [0.0]
We propose a deep learning-based technique to automatically classify road networks into four classes on a visual basis.
Nine cities around the world are selected as the study areas and their road networks are acquired from OpenStreetMap.
Latent subgroups among the cities are uncovered through a clustering on the percentage of each road network category.
An advanced tree-based regression model is for the first time designated to establish the relationship between morphological indices and vitality indicators.
arXiv Detail & Related papers (2021-05-07T08:53:31Z) - Interpretable Semantic Photo Geolocalization [4.286838964398275]
We present two contributions in order to improve the interpretability of a geolocalization model.
We propose a novel, semantic partitioning method which intuitively leads to an improved understanding of the predictions.
We also introduce a novel metric to assess the importance of semantic visual concepts for a certain prediction.
arXiv Detail & Related papers (2021-04-30T13:28:18Z) - Joint Learning of Neural Transfer and Architecture Adaptation for Image
Recognition [77.95361323613147]
Current state-of-the-art visual recognition systems rely on pretraining a neural network on a large-scale dataset and finetuning the network weights on a smaller dataset.
In this work, we prove that dynamically adapting network architectures tailored for each domain task along with weight finetuning benefits in both efficiency and effectiveness.
Our method can be easily generalized to an unsupervised paradigm by replacing supernet training with self-supervised learning in the source domain tasks and performing linear evaluation in the downstream tasks.
arXiv Detail & Related papers (2021-03-31T08:15:17Z) - Variational Structured Attention Networks for Deep Visual Representation
Learning [49.80498066480928]
We propose a unified deep framework to jointly learn both spatial attention maps and channel attention in a principled manner.
Specifically, we integrate the estimation and the interaction of the attentions within a probabilistic representation learning framework.
We implement the inference rules within the neural network, thus allowing for end-to-end learning of the probabilistic and the CNN front-end parameters.
arXiv Detail & Related papers (2021-03-05T07:37:24Z) - SOSD-Net: Joint Semantic Object Segmentation and Depth Estimation from
Monocular images [94.36401543589523]
We introduce the concept of semantic objectness to exploit the geometric relationship of these two tasks.
We then propose a Semantic Object and Depth Estimation Network (SOSD-Net) based on the objectness assumption.
To the best of our knowledge, SOSD-Net is the first network that exploits the geometry constraint for simultaneous monocular depth estimation and semantic segmentation.
arXiv Detail & Related papers (2021-01-19T02:41:03Z) - Towards Deeper Graph Neural Networks [63.46470695525957]
Graph convolutions perform neighborhood aggregation and represent one of the most important graph operations.
Several recent studies attribute this performance deterioration to the over-smoothing issue.
We propose Deep Adaptive Graph Neural Network (DAGNN) to adaptively incorporate information from large receptive fields.
arXiv Detail & Related papers (2020-07-18T01:11:14Z) - Semantic Segmentation With Multi Scale Spatial Attention For Self
Driving Cars [2.7317088388886384]
We present a novel neural network using multi scale feature fusion at various scales for accurate and efficient semantic image segmentation.
We used ResNet based feature extractor, dilated convolutional layers in downsampling part, atrous convolutional layers in the upsampling part and used concat operation to merge them.
A new attention module is proposed to encode more contextual information and enhance the receptive field of the network.
arXiv Detail & Related papers (2020-06-30T20:19:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.