Design Challenges for a Multi-Perspective Search Engine
- URL: http://arxiv.org/abs/2112.08357v1
- Date: Wed, 15 Dec 2021 18:59:57 GMT
- Title: Design Challenges for a Multi-Perspective Search Engine
- Authors: Sihao Chen and Siyi Liu and Xander Uyttendaele and Yi Zhang and
William Bruno and Dan Roth
- Abstract summary: We study a new perspective-oriented document retrieval paradigm.
We discuss and assess the inherent natural language understanding challenges in order to achieve the goal.
We use the prototype system to conduct a user survey in order to assess the utility of our paradigm.
- Score: 44.48345943046946
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Many users turn to document retrieval systems (e.g. search engines) to seek
answers to controversial questions. Answering such user queries usually require
identifying responses within web documents, and aggregating the responses based
on their different perspectives.
Classical document retrieval systems fall short at delivering a set of direct
and diverse responses to the users. Naturally, identifying such responses
within a document is a natural language understanding task. In this paper, we
examine the challenges of synthesizing such language understanding objectives
with document retrieval, and study a new perspective-oriented document
retrieval paradigm. We discuss and assess the inherent natural language
understanding challenges in order to achieve the goal. Following the design
challenges and principles, we demonstrate and evaluate a practical prototype
pipeline system. We use the prototype system to conduct a user survey in order
to assess the utility of our paradigm, as well as understanding the user
information needs for controversial queries.
Related papers
- Can Users Detect Biases or Factual Errors in Generated Responses in Conversational Information-Seeking? [13.790574266700006]
We investigate the limitations of response generation in conversational information-seeking systems.
The study addresses the problem of query answerability and the challenge of response incompleteness.
Our analysis reveals that it is easier for users to detect response incompleteness than query answerability.
arXiv Detail & Related papers (2024-10-28T20:55:00Z) - Follow-Up Questions Improve Documents Generated by Large Language Models [0.0]
This study investigates the impact of Large Language Models (LLMs) generating follow-up questions in response to user requests for short (1-page) text documents.
Users interacted with a novel web-based AI system designed to ask follow-up questions.
arXiv Detail & Related papers (2024-06-27T07:16:46Z) - An Interactive Query Generation Assistant using LLM-based Prompt
Modification and User Feedback [9.461978375200102]
The proposed interface is a novel search interface which supports automatic and interactive query generation over a mono-linguial or multi-lingual document collection.
The interface enables the users to refine the queries generated by different LLMs, to provide feedback on the retrieved documents or passages, and is able to incorporate the users' feedback as prompts to generate more effective queries.
arXiv Detail & Related papers (2023-11-19T04:42:24Z) - Social Commonsense-Guided Search Query Generation for Open-Domain
Knowledge-Powered Conversations [66.16863141262506]
We present a novel approach that focuses on generating internet search queries guided by social commonsense.
Our proposed framework addresses passive user interactions by integrating topic tracking, commonsense response generation and instruction-driven query generation.
arXiv Detail & Related papers (2023-10-22T16:14:56Z) - A Question Answering Framework for Decontextualizing User-facing
Snippets from Scientific Documents [47.39561727838956]
We use language models to rewrite snippets from scientific documents to be read on their own.
We propose a framework that decomposes the task into three stages: question generation, question answering, and rewriting.
arXiv Detail & Related papers (2023-05-24T06:23:02Z) - Evaluating Mixed-initiative Conversational Search Systems via User
Simulation [9.066817876491053]
We propose a conversational User Simulator, called USi, for automatic evaluation of such search systems.
We show that responses generated by USi are both inline with the underlying information need and comparable to human-generated answers.
arXiv Detail & Related papers (2022-04-17T16:27:33Z) - Text Summarization with Latent Queries [60.468323530248945]
We introduce LaQSum, the first unified text summarization system that learns Latent Queries from documents for abstractive summarization with any existing query forms.
Under a deep generative framework, our system jointly optimize a latent query model and a conditional language model, allowing users to plug-and-play queries of any type at test time.
Our system robustly outperforms strong comparison systems across summarization benchmarks with different query types, document settings, and target domains.
arXiv Detail & Related papers (2021-05-31T21:14:58Z) - On the Social and Technical Challenges of Web Search Autosuggestion
Moderation [118.47867428272878]
Autosuggestions are typically generated by machine learning (ML) systems trained on a corpus of search logs and document representations.
While current search engines have become increasingly proficient at suppressing such problematic suggestions, there are still persistent issues that remain.
We discuss several dimensions of problematic suggestions, difficult issues along the pipeline, and why our discussion applies to the increasing number of applications beyond web search.
arXiv Detail & Related papers (2020-07-09T19:22:00Z) - Open-Retrieval Conversational Question Answering [62.11228261293487]
We introduce an open-retrieval conversational question answering (ORConvQA) setting, where we learn to retrieve evidence from a large collection before extracting answers.
We build an end-to-end system for ORConvQA, featuring a retriever, a reranker, and a reader that are all based on Transformers.
arXiv Detail & Related papers (2020-05-22T19:39:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.