Discovering Geo-dependent Stories by Combining Density-based Clustering
and Thread-based Aggregation techniques
- URL: http://arxiv.org/abs/2312.11076v1
- Date: Mon, 18 Dec 2023 10:17:12 GMT
- Title: Discovering Geo-dependent Stories by Combining Density-based Clustering
and Thread-based Aggregation techniques
- Authors: H\'ector Cerezo-Costas, Ana Fern\'andez Vilas, Manuela
Mart\'in-Vicente, Rebeca P. D\'iaz-Redondo
- Abstract summary: This paper introduces a global analysis of the geo-tagged posts in social media.
It supports (i) the detection of unexpected behavior in the city and (ii) the analysis of the posts to infer what is happening.
We have applied our methodology to a dataset obtained from Instagram activity in New York City for seven months.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Citizens are actively interacting with their surroundings, especially through
social media. Not only do shared posts give important information about what is
happening (from the users' perspective), but also the metadata linked to these
posts offer relevant data, such as the GPS-location in Location-based Social
Networks (LBSNs). In this paper we introduce a global analysis of the
geo-tagged posts in social media which supports (i) the detection of unexpected
behavior in the city and (ii) the analysis of the posts to infer what is
happening. The former is obtained by applying density-based clustering
techniques, whereas the latter is consequence of applying natural language
processing. We have applied our methodology to a dataset obtained from
Instagram activity in New York City for seven months obtaining promising
results. The developed algorithms require very low resources, being able to
analyze millions of data-points in commodity hardware in less than one hour
without applying complex parallelization techniques. Furthermore, the solution
can be easily adapted to other geo-tagged data sources without extra effort.
Related papers
- GeoMix: Towards Geometry-Aware Data Augmentation [76.09914619612812]
Mixup has shown considerable success in mitigating the challenges posed by limited labeled data in image classification.
We propose Geometric Mixup (GeoMix), a simple and interpretable Mixup approach leveraging in-place graph editing.
arXiv Detail & Related papers (2024-07-15T12:58:04Z) - Where on Earth Do Users Say They Are?: Geo-Entity Linking for Noisy Multilingual User Input [2.516307239032451]
We present a method which represents real-world locations as averaged embeddings from labeled user-input location names.
We show that our approach improves geo-entity linking on a global and multilingual social media dataset.
arXiv Detail & Related papers (2024-04-29T15:18:33Z) - GeoLLM: Extracting Geospatial Knowledge from Large Language Models [49.20315582673223]
We present GeoLLM, a novel method that can effectively extract geospatial knowledge from large language models.
We demonstrate the utility of our approach across multiple tasks of central interest to the international community, including the measurement of population density and economic livelihoods.
Our experiments reveal that LLMs are remarkably sample-efficient, rich in geospatial information, and robust across the globe.
arXiv Detail & Related papers (2023-10-10T00:03:23Z) - GeoCLIP: Clip-Inspired Alignment between Locations and Images for
Effective Worldwide Geo-localization [61.10806364001535]
Worldwide Geo-localization aims to pinpoint the precise location of images taken anywhere on Earth.
Existing approaches divide the globe into discrete geographic cells, transforming the problem into a classification task.
We propose GeoCLIP, a novel CLIP-inspired Image-to-GPS retrieval approach that enforces alignment between the image and its corresponding GPS locations.
arXiv Detail & Related papers (2023-09-27T20:54:56Z) - Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese
Geographic Re-Ranking [61.60169764507917]
Chinese geographic re-ranking task aims to find the most relevant addresses among retrieved candidates.
We propose an innovative framework, namely Geo-Encoder, to more effectively integrate Chinese geographical semantics into re-ranking pipelines.
arXiv Detail & Related papers (2023-09-04T13:44:50Z) - MGeo: Multi-Modal Geographic Pre-Training Method [49.78466122982627]
We propose a novel query-POI matching method Multi-modal Geographic language model (MGeo)
MGeo represents GC as a new modality and is able to fully extract multi-modal correlations for accurate query-POI matching.
Our proposed multi-modal pre-training method can significantly improve the query-POI matching capability of generic PTMs.
arXiv Detail & Related papers (2023-01-11T03:05:12Z) - Geo-Information Harvesting from Social Media Data [22.061969480185482]
Massive imagery and text messages from open platforms and social media form a temporally quasi-seamless, spatially multi-perspective stream.
We address key aspects in the field, including data availability, analysis-ready data preparation and data management.
We present the first extensive discussion of ethical considerations of social media data in the context of geo-information harvesting and geographic applications.
arXiv Detail & Related papers (2022-11-01T15:47:18Z) - Semantic Segmentation of Vegetation in Remote Sensing Imagery Using Deep
Learning [77.34726150561087]
We propose an approach for creating a multi-modal and large-temporal dataset comprised of publicly available Remote Sensing data.
We use Convolutional Neural Networks (CNN) models that are capable of separating different classes of vegetation.
arXiv Detail & Related papers (2022-09-28T18:51:59Z) - FedLoc: Federated Learning Framework for Data-Driven Cooperative
Localization and Location Data Processing [12.518673970373422]
Data-driven learning model-based cooperative localization and location data processing are considered.
We first review state-of-the-art algorithms in the context of federated learning.
We demonstrate various practical use cases that are summarized from a mixture of standard, newly published, and unpublished works.
arXiv Detail & Related papers (2020-03-08T01:51:56Z) - Geosocial Location Classification: Associating Type to Places Based on
Geotagged Social-Media Posts [22.313111311130662]
Associating type to locations can be used to enrich maps and can serve a plethora of geospatial applications.
We study the problem of Geosocial Location Classification, where the type of a site, e.g., a building, is discovered based on social-media posts.
arXiv Detail & Related papers (2020-02-05T16:09:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.