Generating Correct Answers for Progressive Matrices Intelligence Tests
- URL: http://arxiv.org/abs/2011.00496v1
- Date: Sun, 1 Nov 2020 13:21:07 GMT
- Title: Generating Correct Answers for Progressive Matrices Intelligence Tests
- Authors: Niv Pekar, Yaniv Benny, Lior Wolf
- Abstract summary: Raven's Progressive Matrices are multiple-choice intelligence tests, where one tries to complete the missing location in a $3times 3$ grid of abstract images.
Previous attempts to address this test have focused solely on selecting the right answer out of the multiple choices.
In this work, we focus, instead, on generating a correct answer given the grid, without seeing the choices, which is a harder task, by definition.
- Score: 88.78821060331582
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Raven's Progressive Matrices are multiple-choice intelligence tests, where
one tries to complete the missing location in a $3\times 3$ grid of abstract
images. Previous attempts to address this test have focused solely on selecting
the right answer out of the multiple choices. In this work, we focus, instead,
on generating a correct answer given the grid, without seeing the choices,
which is a harder task, by definition. The proposed neural model combines
multiple advances in generative models, including employing multiple pathways
through the same network, using the reparameterization trick along two pathways
to make their encoding compatible, a dynamic application of variational losses,
and a complex perceptual loss that is coupled with a selective backpropagation
procedure. Our algorithm is able not only to generate a set of plausible
answers, but also to be competitive to the state of the art methods in
multiple-choice tests.
Related papers
- OFER: Occluded Face Expression Reconstruction [16.06622406877353]
We introduce OFER, a novel approach for single image 3D face reconstruction that can generate plausible, diverse, and expressive 3D faces.
We propose a novel ranking mechanism that sorts the outputs of the shape diffusion network based on the predicted shape accuracy scores to select the best match.
arXiv Detail & Related papers (2024-10-29T00:21:26Z) - Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering [27.601353412882258]
Multi-choice Machine Reading (MMRC) aims to select the correct answer from a set of options based on a given passage and question.
In this paper, we reconstruct multi-choice to single-choice by training a binary classification to distinguish whether a certain answer is correct.
Our proposed method gets rid of the multi-choice framework and can leverage resources of other tasks.
arXiv Detail & Related papers (2024-04-27T16:02:55Z) - Feature Selection as Deep Sequential Generative Learning [50.00973409680637]
We develop a deep variational transformer model over a joint of sequential reconstruction, variational, and performance evaluator losses.
Our model can distill feature selection knowledge and learn a continuous embedding space to map feature selection decision sequences into embedding vectors associated with utility scores.
arXiv Detail & Related papers (2024-03-06T16:31:56Z) - Learning Abstract Visual Reasoning via Task Decomposition: A Case Study
in Raven Progressive Matrices [0.24475591916185496]
In Raven Progressive Matrices, the task is to choose one of the available answers given a context.
In this study, we propose a deep learning architecture based on the transformer blueprint.
The multidimensional predictions obtained in this way are then directly juxtaposed to choose the answer.
arXiv Detail & Related papers (2023-08-12T11:02:21Z) - Discovering Non-monotonic Autoregressive Orderings with Variational
Inference [67.27561153666211]
We develop an unsupervised parallelizable learner that discovers high-quality generation orders purely from training data.
We implement the encoder as a Transformer with non-causal attention that outputs permutations in one forward pass.
Empirical results in language modeling tasks demonstrate that our method is context-aware and discovers orderings that are competitive with or even better than fixed orders.
arXiv Detail & Related papers (2021-10-27T16:08:09Z) - Context-guided Triple Matching for Multiple Choice Question Answering [13.197150032345895]
Multiple choice question answering (MCQA) refers to identifying a suitable answer from multiple candidates, by estimating the matching score among the triple of the passage, question and answer.
Existing methods decouple the process into several pair-wise or dual matching steps, that limited the ability of assessing cases with multiple evidence sentences.
This paper introduces a novel Context-guided Triple Matching algorithm, which is achieved by integrating a Triple Matching (TM) module and a Contrastive Regularization (CR)
arXiv Detail & Related papers (2021-09-27T12:30:39Z) - Determinantal Beam Search [75.84501052642361]
Beam search is a go-to strategy for decoding neural sequence models.
In use-cases that call for multiple solutions, a diverse or representative set is often desired.
By posing iterations in beam search as a series of subdeterminant problems, we can turn the algorithm into a diverse subset selection process.
arXiv Detail & Related papers (2021-06-14T13:01:46Z) - Recurrent Multi-view Alignment Network for Unsupervised Surface
Registration [79.72086524370819]
Learning non-rigid registration in an end-to-end manner is challenging due to the inherent high degrees of freedom and the lack of labeled training data.
We propose to represent the non-rigid transformation with a point-wise combination of several rigid transformations.
We also introduce a differentiable loss function that measures the 3D shape similarity on the projected multi-view 2D depth images.
arXiv Detail & Related papers (2020-11-24T14:22:42Z) - Composing Answer from Multi-spans for Reading Comprehension [77.32873012668783]
We present a novel method to generate answers for non-extraction machine reading comprehension (MRC) tasks.
The proposed method has a better performance on accurately generating long answers, and substantially outperforms two competitive typical one-span and Seq2Seq baseline decoders.
arXiv Detail & Related papers (2020-09-14T01:44:42Z) - DiverseNet: When One Right Answer is not Enough [35.764028730120096]
We introduce a simple method for training a neural network, which enables diverse structured predictions to be made for each test-time query.
Our method results in quantitative improvements across three challenging tasks: 2D image completion, 3D volume estimation, and flow prediction.
arXiv Detail & Related papers (2020-08-24T18:12:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.