Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph
Analysis
- URL: http://arxiv.org/abs/2103.05558v1
- Date: Tue, 9 Mar 2021 17:09:46 GMT
- Title: Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph
Analysis
- Authors: Chaoyi Zhang, Jianhui Yu, Yang Song, Weidong Cai
- Abstract summary: We propose a 3D point-based scene graph generation framework to bridge perception and reasoning.
Within the reasoning stage, an EDGE-oriented Graph Convolutional Network is created to exploit multi-dimensional edge features.
Our experimental results show promising edge-oriented reasoning effects on scene graph generation studies.
- Score: 21.920148546359016
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Scene understanding is a critical problem in computer vision. In this paper,
we propose a 3D point-based scene graph generation ($\mathbf{SGG_{point}}$)
framework to effectively bridge perception and reasoning to achieve scene
understanding via three sequential stages, namely scene graph construction,
reasoning, and inference. Within the reasoning stage, an EDGE-oriented Graph
Convolutional Network ($\texttt{EdgeGCN}$) is created to exploit
multi-dimensional edge features for explicit relationship modeling, together
with the exploration of two associated twinning interaction mechanisms between
nodes and edges for the independent evolution of scene graph representations.
Overall, our integrated $\mathbf{SGG_{point}}$ framework is established to seek
and infer scene structures of interest from both real-world and synthetic 3D
point-based scenes. Our experimental results show promising edge-oriented
reasoning effects on scene graph generation studies. We also demonstrate our
method advantage on several traditional graph representation learning benchmark
datasets, including the node-wise classification on citation networks and
whole-graph recognition problems for molecular analysis.
Related papers
- Multiview Scene Graph [7.460438046915524]
A proper scene representation is central to the pursuit of spatial intelligence.
We propose to build Multiview Scene Graphs (MSG) from unposed images.
MSG represents a scene topologically with interconnected place and object nodes.
arXiv Detail & Related papers (2024-10-15T02:04:05Z) - On Support Relations Inference and Scene Hierarchy Graph Construction from Point Cloud in Clustered Environments [3.4535508414601344]
In 3D scenes, rich spatial geometric and topological information are often ignored by RGB-based approaches for scene understanding.
In this study, we develop a bottom-up approach for scene understanding that infers support relations between objects from a point cloud.
arXiv Detail & Related papers (2024-04-22T02:42:32Z) - Graph Transformer GANs with Graph Masked Modeling for Architectural
Layout Generation [153.92387500677023]
We present a novel graph Transformer generative adversarial network (GTGAN) to learn effective graph node relations.
The proposed graph Transformer encoder combines graph convolutions and self-attentions in a Transformer to model both local and global interactions.
We also propose a novel self-guided pre-training method for graph representation learning.
arXiv Detail & Related papers (2024-01-15T14:36:38Z) - Semantic Scene Graph Generation Based on an Edge Dual Scene Graph and
Message Passing Neural Network [3.9280441311534653]
Scene graph generation (SGG) captures the relationships between objects in an image and creates a structured graph-based representation.
Existing SGG methods have a limited ability to accurately predict detailed relationships.
A new approach to the modeling multiobject relationships, called edge dual scene graph generation (EdgeSGG), is proposed herein.
arXiv Detail & Related papers (2023-11-02T12:36:52Z) - Incremental 3D Semantic Scene Graph Prediction from RGB Sequences [86.77318031029404]
We propose a real-time framework that incrementally builds a consistent 3D semantic scene graph of a scene given an RGB image sequence.
Our method consists of a novel incremental entity estimation pipeline and a scene graph prediction network.
The proposed network estimates 3D semantic scene graphs with iterative message passing using multi-view and geometric features extracted from the scene entities.
arXiv Detail & Related papers (2023-05-04T11:32:16Z) - Hierarchical Graph Networks for 3D Human Pose Estimation [50.600944798627786]
Recent 2D-to-3D human pose estimation works tend to utilize the graph structure formed by the topology of the human skeleton.
We argue that this skeletal topology is too sparse to reflect the body structure and suffer from serious 2D-to-3D ambiguity problem.
We propose a novel graph convolution network architecture, Hierarchical Graph Networks, to overcome these weaknesses.
arXiv Detail & Related papers (2021-11-23T15:09:03Z) - Dense Graph Convolutional Neural Networks on 3D Meshes for 3D Object
Segmentation and Classification [0.0]
We present new designs of graph convolutional neural networks (GCNs) on 3D meshes for 3D object classification and segmentation.
We use the faces of the mesh as basic processing units and represent a 3D mesh as a graph where each node corresponds to a face.
arXiv Detail & Related papers (2021-06-30T02:17:16Z) - Joint Deep Multi-Graph Matching and 3D Geometry Learning from
Inhomogeneous 2D Image Collections [57.60094385551773]
We propose a trainable framework for learning a deformable 3D geometry model from inhomogeneous image collections.
We in addition obtain the underlying 3D geometry of the objects depicted in the 2D images.
arXiv Detail & Related papers (2021-03-31T17:25:36Z) - SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D
Sequences [76.28527350263012]
We propose a method to incrementally build up semantic scene graphs from a 3D environment given a sequence of RGB-D frames.
We aggregate PointNet features from primitive scene components by means of a graph neural network.
Our approach outperforms 3D scene graph prediction methods by a large margin and its accuracy is on par with other 3D semantic and panoptic segmentation methods while running at 35 Hz.
arXiv Detail & Related papers (2021-03-27T13:00:36Z) - Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions [94.17683799712397]
We focus on scene graphs, a data structure that organizes the entities of a scene in a graph.
We propose a learned method that regresses a scene graph from the point cloud of a scene.
We show the application of our method in a domain-agnostic retrieval task, where graphs serve as an intermediate representation for 3D-3D and 2D-3D matching.
arXiv Detail & Related papers (2020-04-08T12:25:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.