Semantically-Enriched Search Engine for Geoportals: A Case Study with
ArcGIS Online
- URL: http://arxiv.org/abs/2003.06561v1
- Date: Sat, 14 Mar 2020 06:16:30 GMT
- Authors: Gengchen Mai, Krzysztof Janowicz, Sathya Prasad, Meilin Shi, Ling Cai,
Rui Zhu, Blake Regalia, Ni Lao
- Abstract summary: We propose a semantically-enriched search engine for geoportals using Lucene-based techniques.
A benchmark dataset is constructed to evaluate the proposed framework.
Our evaluation results show that the proposed semantic query expansion framework is very effective in capturing a user's search intention.
- Score: 7.005838154484841
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Many geoportals such as ArcGIS Online are established with the goal of
improving geospatial data reusability and achieving intelligent knowledge
discovery. However, according to previous research, most of the existing
geoportals adopt Lucene-based techniques to achieve their core search
functionality, which has a limited ability to capture the user's search
intentions. To better understand a user's search intention, query expansion can
be used to enrich the user's query by adding semantically similar terms. In the
context of geoportals and geographic information retrieval, we advocate the
idea of semantically enriching a user's query from both geospatial and thematic
perspectives. In the geospatial aspect, we propose to enrich a query by using
both place partonomy and distance decay. In terms of the thematic aspect,
concept expansion and embedding-based document similarity are used to infer the
implicit information hidden in a user's query. This semantic query expansion
framework is implemented as a semantically-enriched search
engine using ArcGIS Online as a case study. A benchmark dataset is constructed
to evaluate the proposed framework. Our evaluation results show that the
proposed semantic query expansion framework is very effective in capturing a
user's search intention and significantly outperforms a well-established
baseline, Lucene's practical scoring function, by more than 3.0 in
DCG@K (K=3,5,10).
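For readers unfamiliar with the metric reported in the abstract, the following is a minimal sketch of how DCG@K can be computed and compared between two rankings. It assumes the common linear-gain form, DCG@K = sum over ranks i = 1..K of rel_i / log2(i + 1); the abstract does not say which DCG variant the authors used. The function name `dcg_at_k` and the relevance labels are hypothetical and serve only as illustration; they are not taken from the paper's benchmark.

```python
import math
from typing import Sequence


def dcg_at_k(relevances: Sequence[float], k: int) -> float:
    """Discounted cumulative gain over the top-k results.

    Assumes linear gain: each graded relevance label at rank i
    (1-based) is divided by log2(i + 1).
    """
    return sum(
        rel / math.log2(rank + 1)
        for rank, rel in enumerate(relevances[:k], start=1)
    )


# Hypothetical graded relevance labels (0 = irrelevant, 3 = highly relevant)
# for the results returned by two rankers on the same query.
baseline_ranking = [1, 0, 2, 0, 3]
expanded_ranking = [3, 2, 1, 0, 0]

for k in (3, 5, 10):
    diff = dcg_at_k(expanded_ranking, k) - dcg_at_k(baseline_ranking, k)
    print(f"DCG@{k} difference (expanded - baseline): {diff:.3f}")
```

A positive difference means the expanded ranking places relevant items nearer the top. Averaging such differences over all benchmark queries gives the kind of DCG@K improvement figure quoted in the abstract.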
Related papers
- Geo-FuB: A Method for Constructing an Operator-Function Knowledge Base for Geospatial Code Generation Tasks Using Large Language Models [0.5242869847419834]
This study introduces a framework to construct such a knowledge base, leveraging geospatial script semantics.
An example knowledge base, Geo-FuB, built from 154,075 Google Earth Engine scripts, is available on GitHub.
arXiv Detail & Related papers (2024-10-28T12:50:27Z)
- Knowledge-Aware Query Expansion with Large Language Models for Textual and Relational Retrieval [49.42043077545341]
We propose a knowledge-aware query expansion framework, augmenting LLMs with structured document relations from a knowledge graph (KG).
We leverage document texts as rich KG node representations and use document-based relation filtering for our Knowledge-Aware Retrieval (KAR).
arXiv Detail & Related papers (2024-10-17T17:03:23Z)
- Improving Retrieval in Sponsored Search by Leveraging Query Context Signals [6.152499434499752]
We propose an approach to enhance query understanding by augmenting queries with rich contextual signals.
We use web search titles and snippets to ground queries in real-world information and utilize GPT-4 to generate query rewrites and explanations.
Our context-aware approach substantially outperforms context-free models.
arXiv Detail & Related papers (2024-07-19T14:28:53Z)
- Improving Retrieval in Theme-specific Applications using a Corpus Topical Taxonomy [52.426623750562335]
We introduce the ToTER (Topical taxonomy Enhanced Retrieval) framework.
ToTER identifies the central topics of queries and documents with the guidance of the taxonomy, and exploits their topical relatedness to supplement missing contexts.
As a plug-and-play framework, ToTER can be flexibly employed to enhance various PLM-based retrievers.
arXiv Detail & Related papers (2024-03-07T02:34:54Z)
- DiscoverPath: A Knowledge Refinement and Retrieval System for Interdisciplinarity on Biomedical Research [96.10765714077208]
Traditional keyword-based search engines fall short in assisting users who may not be familiar with specific terminologies.
We present a knowledge graph-based paper search engine for biomedical research to enhance the user experience.
The system, dubbed DiscoverPath, employs Named Entity Recognition (NER) and part-of-speech (POS) tagging to extract terminologies and relationships from article abstracts to create a KG.
arXiv Detail & Related papers (2023-09-04T20:52:33Z)
- Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese Geographic Re-Ranking [61.60169764507917]
The Chinese geographic re-ranking task aims to find the most relevant addresses among retrieved candidates.
We propose an innovative framework, namely Geo-Encoder, to more effectively integrate Chinese geographical semantics into re-ranking pipelines.
arXiv Detail & Related papers (2023-09-04T13:44:50Z)
- Improving Content Retrievability in Search with Controllable Query Generation [5.450798147045502]
Machine-learned search engines have a high retrievability bias, where the majority of the queries return the same entities.
We propose CtrlQGen, a method that generates queries for a chosen underlying intent, narrow or broad.
Our results on datasets from the domains of music, podcasts, and books reveal that we can significantly decrease the retrievability bias of a dense retrieval model.
arXiv Detail & Related papers (2023-03-21T07:46:57Z)
- MGeo: Multi-Modal Geographic Pre-Training Method [49.78466122982627]
We propose a novel query-POI matching method, the Multi-modal Geographic language model (MGeo).
MGeo represents geographic context (GC) as a new modality and is able to fully extract multi-modal correlations for accurate query-POI matching.
Our proposed multi-modal pre-training method can significantly improve the query-POI matching capability of generic PTMs.
arXiv Detail & Related papers (2023-01-11T03:05:12Z)
- Graph Enhanced BERT for Query Understanding [55.90334539898102]
Query understanding plays a key role in exploring users' search intents and helping users locate their most desired information.
In recent years, pre-trained language models (PLMs) have advanced various natural language processing tasks.
We propose a novel graph-enhanced pre-training framework, GE-BERT, which can leverage both query content and the query graph.
arXiv Detail & Related papers (2022-04-03T16:50:30Z)