Towards Learning Geometric Eigen-Lengths Crucial for Fitting Tasks
- URL: http://arxiv.org/abs/2312.15610v1
- Date: Mon, 25 Dec 2023 04:41:52 GMT
- Title: Towards Learning Geometric Eigen-Lengths Crucial for Fitting Tasks
- Authors: Yijia Weng, Kaichun Mo, Ruoxi Shi, Yanchao Yang, Leonidas J. Guibas
- Abstract summary: Low-dimensional yet crucial geometric eigen-lengths often determine the success of some geometric tasks.
Humans have materialized such crucial geometric eigen-lengths in common sense.
It remains obscure and underexplored if learning systems can be equipped with similar capabilities.
- Score: 62.89746245940464
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Some extremely low-dimensional yet crucial geometric eigen-lengths often
determine the success of some geometric tasks. For example, the height of an
object is important to measure to check if it can fit between the shelves of a
cabinet, while the width of a couch is crucial when trying to move it through a
doorway. Humans have materialized such crucial geometric eigen-lengths in
common sense since they are very useful in serving as succinct yet effective,
highly interpretable, and universal object representations. However, it remains
obscure and underexplored if learning systems can be equipped with similar
capabilities of automatically discovering such key geometric quantities from
doing tasks. In this work, we therefore for the first time formulate and
propose a novel learning problem on this question and set up a benchmark suite
including tasks, data, and evaluation metrics for studying the problem. We
focus on a family of common fitting tasks as the testbed for the proposed
learning problem. We explore potential solutions and demonstrate the
feasibility of learning eigen-lengths from simply observing successful and
failed fitting trials. We also attempt geometric grounding for more accurate
eigen-length measurement and study the reusability of the learned eigen-lengths
across multiple tasks. Our work marks the first exploratory step toward
learning crucial geometric eigen-lengths and we hope it can inspire future
research in tackling this important yet underexplored problem.
Related papers
- ShapeGrasp: Zero-Shot Task-Oriented Grasping with Large Language Models through Geometric Decomposition [8.654140442734354]
Task-oriented grasping of unfamiliar objects is a necessary skill for robots in dynamic in-home environments.
We present a novel zero-shot task-oriented grasping method leveraging a geometric decomposition of the target object into simple convex shapes.
Our approach employs minimal essential information - the object's name and the intended task - to facilitate zero-shot task-oriented grasping.
arXiv Detail & Related papers (2024-03-26T19:26:53Z) - G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model [124.68242155098189]
Large language models (LLMs) have shown remarkable proficiency in human-level reasoning and generation capabilities.
G-LLaVA demonstrates exceptional performance in solving geometric problems, significantly outperforming GPT-4-V on the MathVista benchmark with only 7B parameters.
arXiv Detail & Related papers (2023-12-18T17:36:20Z) - Multitask Learning with No Regret: from Improved Confidence Bounds to
Active Learning [79.07658065326592]
Quantifying uncertainty in the estimated tasks is of pivotal importance for many downstream applications, such as online or active learning.
We provide novel multitask confidence intervals in the challenging setting when neither the similarity between tasks nor the tasks' features are available to the learner.
We propose a novel online learning algorithm that achieves such improved regret without knowing this parameter in advance.
arXiv Detail & Related papers (2023-08-03T13:08:09Z) - Neural Bregman Divergences for Distance Learning [60.375385370556145]
We propose a new approach to learning arbitrary Bregman divergences in a differentiable manner via input convex neural networks.
We show that our method more faithfully learns divergences over a set of both new and previously studied tasks.
Our tests further extend to known asymmetric, but non-Bregman tasks, where our method still performs competitively despite misspecification.
arXiv Detail & Related papers (2022-06-09T20:53:15Z) - AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated
Objects via Few-shot Interactions [13.802675708793014]
Perceiving and interacting with 3D articulated objects, such as cabinets, doors, and faucets, pose particular challenges for future home-assistant robots.
We propose a novel framework, named AdaAfford, that learns to perform very few test-time interactions for quickly adapting the affordance priors to more accurate instance-specific posteriors.
arXiv Detail & Related papers (2021-12-01T03:00:05Z) - GeoQA: A Geometric Question Answering Benchmark Towards Multimodal
Numerical Reasoning [172.36214872466707]
We focus on solving geometric problems, which requires a comprehensive understanding of textual descriptions, visual diagrams, and theorem knowledge.
We propose a Geometric Question Answering dataset GeoQA, containing 5,010 geometric problems with corresponding annotated programs.
arXiv Detail & Related papers (2021-05-30T12:34:17Z) - Geometric Deep Learning: Grids, Groups, Graphs, Geodesics, and Gauges [50.22269760171131]
The last decade has witnessed an experimental revolution in data science and machine learning, epitomised by deep learning methods.
This text is concerned with exposing pre-defined regularities through unified geometric principles.
It provides a common mathematical framework to study the most successful neural network architectures, such as CNNs, RNNs, GNNs, and Transformers.
arXiv Detail & Related papers (2021-04-27T21:09:51Z) - SpatialSim: Recognizing Spatial Configurations of Objects with Graph
Neural Networks [31.695447265278126]
We show how a machine can learn and compare classes of geometric spatial configurations that are invariant to the point of view of an external observer.
We propose SpatialSim (Spatial Similarity), a novel geometrical reasoning benchmark, and argue that progress on this benchmark would pave the way towards a general solution.
Secondly, we study how inductive relational biases exhibited by fully-connected message-passing Graph Neural Networks (MPGNNs) are useful to solve those tasks.
arXiv Detail & Related papers (2020-04-09T14:13:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.