Fugu-MT 論文翻訳(概要): End-to-End Multihop Retrieval for Compositional Question Answering over Long Documents

論文の概要: End-to-End Multihop Retrieval for Compositional Question Answering over Long Documents

arxiv url: http://arxiv.org/abs/2106.00200v1
Date: Tue, 1 Jun 2021 03:13:35 GMT
ステータス: 翻訳完了
システム内更新日: 2021-06-02 14:40:34.810177
Title: End-to-End Multihop Retrieval for Compositional Question Answering over Long Documents
Title（参考訳）: 長文合成質問応答のためのエンドツーエンドマルチホップ検索
Authors: Haitian Sun, William W. Cohen, Ruslan Salakhutdinov
Abstract要約: 本稿では,長い文書の合成問題に答えるマルチホップ検索手法であるDocHopperを提案する。各ステップでDocHopperは文書から段落や文を検索し、検索した結果とクエリを混合し、次のステップでクエリを更新する。文書構造を活用すれば、長い文書の質問応答や検索性能を大幅に改善できることを示す。
参考スコア（独自算出の注目度）: 93.55268936974971
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Answering complex questions from long documents requires aggregating multiple pieces of evidence and then predicting the answers. In this paper, we propose a multi-hop retrieval method, DocHopper, to answer compositional questions over long documents. At each step, DocHopper retrieves a paragraph or sentence embedding from the document, mixes the retrieved result with the query, and updates the query for the next step. In contrast to many other retrieval-based methods (e.g., RAG or REALM) the query is not augmented with a token sequence: instead, it is augmented by "numerically" combining it with another neural representation. This means that model is end-to-end differentiable. We demonstrate that utilizing document structure in this was can largely improve question-answering and retrieval performance on long documents. We experimented with DocHopper on three different QA tasks that require reading long documents to answer compositional questions: discourse entailment reasoning, factual QA with table and text, and information seeking QA from academic papers. DocHopper outperforms all baseline models and achieves state-of-the-art results on all datasets. Additionally, DocHopper is efficient at inference time, being 3~10 times faster than the baselines.
Abstract（参考訳）: 長い文書から複雑な質問に答えるには、複数の証拠をまとめて答えを予測する必要がある。本稿では,長い文書に対して合成質問に答えるマルチホップ検索手法であるdochopperを提案する。各ステップでDocHopperは文書から段落や文を検索し、検索した結果とクエリを混合し、次のステップでクエリを更新する。他の多くの検索ベースメソッド(ragやrealmなど)とは対照的に、クエリはトークンシーケンスでは拡張されない。これはモデルがエンドツーエンドで微分可能であることを意味する。文書構造を活用すれば、長い文書の質問応答や検索性能を大幅に改善できることを示す。我々はDocHopperを3つの異なるQAタスクで実験し、長い文書を読むことで構成的疑問に答える:談話内容推論、テーブルとテキストによる事実的QA、学術論文からのQAを求める情報。 DocHopperはすべてのベースラインモデルを上回っ、すべてのデータセットで最先端の結果を達成する。さらに、DocHopperは推論時に効率的で、ベースラインの3～10倍高速である。

論文の概要: End-to-End Multihop Retrieval for Compositional Question Answering over Long Documents

関連論文リスト