Deep Dependency Networks for Multi-Label Classification
- URL: http://arxiv.org/abs/2302.00633v1
- Date: Wed, 1 Feb 2023 17:52:40 GMT
- Title: Deep Dependency Networks for Multi-Label Classification
- Authors: Shivvrat Arya, Yu Xiang and Vibhav Gogate
- Abstract summary: We show that the performance of previous approaches that combine Markov Random Fields with neural networks can be modestly improved.
We propose a new modeling framework called deep dependency networks, which augments a dependency network to the output layer of a neural network.
Despite its simplicity, jointly learning this new architecture yields significant improvements in performance.
- Score: 24.24496964886951
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We propose a simple approach which combines the strengths of probabilistic
graphical models and deep learning architectures for solving the multi-label
classification task, focusing specifically on image and video data. First, we
show that the performance of previous approaches that combine Markov Random
Fields with neural networks can be modestly improved by leveraging more
powerful methods such as iterative join graph propagation, integer linear
programming, and $\ell_1$ regularization-based structure learning. Then we
propose a new modeling framework called deep dependency networks, which
augments a dependency network, a model that is easy to train and learns more
accurate dependencies but is limited to Gibbs sampling for inference, to the
output layer of a neural network. We show that despite its simplicity, jointly
learning this new architecture yields significant improvements in performance
over the baseline neural network. In particular, our experimental evaluation on
three video activity classification datasets (Charades, Textually Annotated
Cooking Scenes (TACoS), and Wetlab) and three multi-label image classification
datasets (MS-COCO, PASCAL VOC, and NUS-WIDE) shows that deep dependency networks
are almost always superior to pure neural architectures that do not use
dependency networks.
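
To make the architecture concrete, here is a minimal NumPy sketch of the idea in the abstract. Everything below is our illustration, not the authors' code: the "neural network" is a random linear map, and the dependency-network layer, its logistic parameterization, and all hyperparameters are assumptions. It shows the two ingredients the abstract names: per-label conditionals P(y_i | y_{-i}, nn(x)) and Gibbs sampling, the inference scheme dependency networks natively support.

```python
import numpy as np

rng = np.random.default_rng(0)


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


class DeepDependencyNetworkSketch:
    """Toy deep dependency network: a stand-in neural net scores each
    label, and a dependency-network layer models each label conditioned
    on all other labels plus its neural-net score."""

    def __init__(self, n_features, n_labels):
        self.n_labels = n_labels
        # Stand-in for a trained deep feature extractor / classifier.
        self.nn_weights = rng.normal(size=(n_features, n_labels)) * 0.1
        # Dependency-network parameters (hypothetical parameterization):
        # per-label weight on the NN score, pairwise label couplings, bias.
        self.self_w = np.ones(n_labels)
        self.pair_w = rng.normal(size=(n_labels, n_labels)) * 0.05
        np.fill_diagonal(self.pair_w, 0.0)  # no self-coupling
        self.bias = np.zeros(n_labels)

    def conditional(self, i, y, scores):
        """Logistic model for P(y_i = 1 | y_{-i}, nn(x))."""
        return sigmoid(self.self_w[i] * scores[i]
                       + self.pair_w[i] @ y + self.bias[i])

    def gibbs_predict(self, x, n_sweeps=200, burn_in=50):
        """Estimate marginals P(y_i = 1 | x) by Gibbs sampling over the
        per-label conditionals."""
        scores = x @ self.nn_weights
        y = (sigmoid(scores) > 0.5).astype(float)  # init from NN output
        counts = np.zeros(self.n_labels)
        for sweep in range(n_sweeps):
            for i in range(self.n_labels):
                y[i] = float(rng.random() < self.conditional(i, y, scores))
            if sweep >= burn_in:
                counts += y
        return counts / (n_sweeps - burn_in)


# Usage: marginal label probabilities for one random input.
model = DeepDependencyNetworkSketch(n_features=16, n_labels=5)
print(np.round(model.gibbs_predict(rng.normal(size=16)), 3))
```

In the paper's setting, the feature extractor is a deep image or video network and the two parts are trained jointly; the sketch only fixes random parameters to show the data flow at inference time.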
Related papers
- Towards Scalable and Versatile Weight Space Learning [51.78426981947659]
This paper introduces the SANE approach to weight-space learning.
Our method extends the idea of hyper-representations towards sequential processing of subsets of neural network weights.
arXiv Detail & Related papers (2024-06-14T13:12:07Z)
- Deep Dependency Networks and Advanced Inference Schemes for Multi-Label Classification [7.643377057724898]
We present a unified framework called deep dependency networks (DDNs)
DDNs combine dependency networks and deep learning architectures for multi-label classification.
A drawback of DDNs compared to Markov networks is their lack of advanced inference schemes.
arXiv Detail & Related papers (2024-04-17T18:04:37Z)
- Provable Guarantees for Nonlinear Feature Learning in Three-Layer Neural Networks [49.808194368781095]
We show that three-layer neural networks have provably richer feature learning capabilities than two-layer networks.
This work makes progress towards understanding the provable benefit of three-layer neural networks over two-layer networks in the feature learning regime.
arXiv Detail & Related papers (2023-05-11T17:19:30Z)
- Two-level Graph Network for Few-Shot Class-Incremental Learning [7.815043173207539]
Few-shot class-incremental learning (FSCIL) aims to design machine learning algorithms that can continually learn new concepts from a few data points.
Existing FSCIL methods ignore the semantic relationships between the sample level and the class level.
In this paper, we design a two-level graph network for FSCIL named Sample-level and Class-level Graph Neural Network (SCGN).
arXiv Detail & Related papers (2023-03-24T08:58:08Z)
- Deep Image Clustering with Contrastive Learning and Multi-scale Graph Convolutional Networks [58.868899595936476]
This paper presents a new deep clustering approach termed image clustering with contrastive learning and multi-scale graph convolutional networks (IcicleGCN)
Experiments on multiple image datasets demonstrate the superior clustering performance of IcicleGCN over the state-of-the-art.
arXiv Detail & Related papers (2022-07-14T19:16:56Z)
- SIRe-Networks: Skip Connections over Interlaced Multi-Task Learning and Residual Connections for Structure Preserving Object Classification [28.02302915971059]
In this paper, we introduce an interlaced multi-task learning strategy, termed SIRe, to reduce vanishing gradients in the object classification task.
The presented methodology directly improves a convolutional neural network (CNN) by enforcing the input image structure preservation through auto-encoders.
To validate the presented methodology, a simple CNN and various implementations of famous networks are extended via the SIRe strategy and extensively tested on the CIFAR100 dataset.
arXiv Detail & Related papers (2021-10-06T13:54:49Z)
- Improving Graph Neural Networks with Simple Architecture Design [7.057970273958933]
We introduce several key design strategies for graph neural networks.
We present a simple and shallow model, Feature Selection Graph Neural Network (FSGNN)
We show that the proposed model outperforms other state-of-the-art GNN models and achieves up to a 64% improvement in accuracy on node classification tasks.
arXiv Detail & Related papers (2021-05-17T06:46:01Z)
- Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks [50.684661759340145]
Firefly neural architecture descent is a general framework for progressively and dynamically growing neural networks.
We show that firefly descent can flexibly grow networks both wider and deeper, and can be applied to learn accurate but resource-efficient neural architectures.
In particular, it learns networks that are smaller in size but have higher average accuracy than those learned by the state-of-the-art methods.
arXiv Detail & Related papers (2021-02-17T04:47:18Z)
- Spatio-Temporal Inception Graph Convolutional Networks for Skeleton-Based Action Recognition [126.51241919472356]
We design a simple and highly modularized graph convolutional network architecture for skeleton-based action recognition.
Our network is constructed by repeating a building block that aggregates multi-granularity information from both the spatial and temporal paths.
arXiv Detail & Related papers (2020-11-26T14:43:04Z)
- Graph-Based Neural Network Models with Multiple Self-Supervised Auxiliary Tasks [79.28094304325116]
Graph Convolutional Networks are among the most promising approaches for capturing relationships among structured data points.
We propose three novel self-supervised auxiliary tasks to train graph-based neural network models in a multi-task fashion.
arXiv Detail & Related papers (2020-11-14T11:09:51Z)