Unsupervised Graph-based Topic Modeling from Video Transcriptions
- URL: http://arxiv.org/abs/2105.01466v1
- Date: Tue, 4 May 2021 12:48:17 GMT
- Title: Unsupervised Graph-based Topic Modeling from Video Transcriptions
- Authors: Lukas Stappen, Gerhard Hagerer, Bj\"orn W. Schuller, Georg Groh
- Abstract summary: We develop a topic extractor on video transcriptions using neural word embeddings and a graph-based clustering method.
Experimental results on the real-life multimodal data set MuSe-CaR demonstrate that our approach extracts coherent and meaningful topics.
- Score: 5.210353244951637
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: To unfold the tremendous amount of audiovisual data uploaded daily to social
media platforms, effective topic modelling techniques are needed. Existing work
tends to apply variants of topic models on text data sets. In this paper, we
aim at developing a topic extractor on video transcriptions. The model improves
coherence by exploiting neural word embeddings through a graph-based clustering
method. Unlike typical topic models, this approach works without knowing the
true number of topics. Experimental results on the real-life multimodal data
set MuSe-CaR demonstrates that our approach extracts coherent and meaningful
topics, outperforming baseline methods. Furthermore, we successfully
demonstrate the generalisability of our approach on a pure text review data
set.
Related papers
- An Iterative Approach to Topic Modelling [0.0]
We propose to use an iterative process to perform topic modelling that gives rise to a sense of completeness of the resulting topics when the process is complete.
We demonstrate how the modelling process can be applied iteratively to arrive at a set of topics that could not be further improved upon using one of the three selected measures for clustering comparison.
arXiv Detail & Related papers (2024-07-25T09:26:07Z) - Reinforcing Pre-trained Models Using Counterfactual Images [54.26310919385808]
This paper proposes a novel framework to reinforce classification models using language-guided generated counterfactual images.
We identify model weaknesses by testing the model using the counterfactual image dataset.
We employ the counterfactual images as an augmented dataset to fine-tune and reinforce the classification model.
arXiv Detail & Related papers (2024-06-19T08:07:14Z) - GINopic: Topic Modeling with Graph Isomorphism Network [0.8962460460173959]
We introduce GINopic, a topic modeling framework based on graph isomorphism networks to capture the correlation between words.
We demonstrate the effectiveness of GINopic compared to existing topic models and highlight its potential for advancing topic modeling.
arXiv Detail & Related papers (2024-04-02T17:18:48Z) - Let the Pretrained Language Models "Imagine" for Short Texts Topic
Modeling [29.87929724277381]
In short texts, co-occurrence information is minimal, which results in feature sparsity in document representation.
Existing topic models (probabilistic or neural) mostly fail to mine patterns from them to generate coherent topics.
We extend short text into longer sequences using existing pre-trained language models (PLMs)
arXiv Detail & Related papers (2023-10-24T00:23:30Z) - StableLLaVA: Enhanced Visual Instruction Tuning with Synthesized
Image-Dialogue Data [129.92449761766025]
We propose a novel data collection methodology that synchronously synthesizes images and dialogues for visual instruction tuning.
This approach harnesses the power of generative models, marrying the abilities of ChatGPT and text-to-image generative models.
Our research includes comprehensive experiments conducted on various datasets.
arXiv Detail & Related papers (2023-08-20T12:43:52Z) - Topic-Selective Graph Network for Topic-Focused Summarization [0.0]
We propose a topic-arc recognition objective and topic-selective graph network.
First, the topic-arc recognition objective is used to model training, which endows the capability to discriminate topics for the model.
The topic-selective graph network can conduct topic-guided cross-interaction on sentences based on the results of topic-arc recognition.
arXiv Detail & Related papers (2023-02-25T15:56:06Z) - Improving the Inference of Topic Models via Infinite Latent State
Replications [18.632435007093594]
One of the most popular inference approaches to topic models is perhaps collapsed Gibbs sampling (CGS)
We propose to leverage state augmentation technique by maximizing the number of topic samples to infinity.
We then develop a new inference approach, called infinite latent state replication (ILR), to generate robust soft topic assignment for each given document-word pair.
arXiv Detail & Related papers (2023-01-25T17:07:25Z) - Generating More Pertinent Captions by Leveraging Semantics and Style on
Multi-Source Datasets [56.018551958004814]
This paper addresses the task of generating fluent descriptions by training on a non-uniform combination of data sources.
Large-scale datasets with noisy image-text pairs provide a sub-optimal source of supervision.
We propose to leverage and separate semantics and descriptive style through the incorporation of a style token and keywords extracted through a retrieval component.
arXiv Detail & Related papers (2021-11-24T19:00:05Z) - ConvoSumm: Conversation Summarization Benchmark and Improved Abstractive
Summarization with Argument Mining [61.82562838486632]
We crowdsource four new datasets on diverse online conversation forms of news comments, discussion forums, community question answering forums, and email threads.
We benchmark state-of-the-art models on our datasets and analyze characteristics associated with the data.
arXiv Detail & Related papers (2021-06-01T22:17:13Z) - Improving Neural Topic Models using Knowledge Distillation [84.66983329587073]
We use knowledge distillation to combine the best attributes of probabilistic topic models and pretrained transformers.
Our modular method can be straightforwardly applied with any neural topic model to improve topic quality.
arXiv Detail & Related papers (2020-10-05T22:49:16Z) - Topic Adaptation and Prototype Encoding for Few-Shot Visual Storytelling [81.33107307509718]
We propose a topic adaptive storyteller to model the ability of inter-topic generalization.
We also propose a prototype encoding structure to model the ability of intra-topic derivation.
Experimental results show that topic adaptation and prototype encoding structure mutually bring benefit to the few-shot model.
arXiv Detail & Related papers (2020-08-11T03:55:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.