Towards Lexical Analysis of Dog Vocalizations via Online Videos
- URL: http://arxiv.org/abs/2309.13086v1
- Date: Thu, 21 Sep 2023 23:53:14 GMT
- Title: Towards Lexical Analysis of Dog Vocalizations via Online Videos
- Authors: Yufei Wang, Chunhao Zhang, Jieyi Huang, Mengyue Wu, Kenny Zhu
- Abstract summary: This study presents a data-driven investigation into the semantics of dog vocalizations via correlating different sound types with consistent semantics.
We first present a new dataset of Shiba Inu sounds, along with contextual information such as location and activity, collected from YouTube.
Based on the analysis of conditioned probability between dog vocalizations and corresponding location and activity, we discover supporting evidence for previous research on the semantic meaning of various dog sounds.
- Score: 19.422796780268605
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deciphering the semantics of animal language has been a grand challenge. This
study presents a data-driven investigation into the semantics of dog
vocalizations via correlating different sound types with consistent semantics.
We first present a new dataset of Shiba Inu sounds, along with contextual
information such as location and activity, collected from YouTube with a
well-constructed pipeline. The framework is also applicable to other animal
species. Based on the analysis of conditioned probability between dog
vocalizations and corresponding location and activity, we discover supporting
evidence for previous heuristic research on the semantic meaning of various dog
sounds. For instance, growls can signify interactions. Furthermore, our study
yields new insights that existing word types can be subdivided into
finer-grained subtypes and minimal semantic unit for Shiba Inu is word-related.
For example, whimper can be subdivided into two types, attention-seeking and
discomfort.
Related papers
- Feature Representations for Automatic Meerkat Vocalization Classification [15.642602544201308]
This paper investigates feature representations for automatic meerkat vocalization analysis.
Call type classification studies conducted on two data sets reveal that feature extraction methods developed for human speech processing can be effectively employed for automatic meerkat call analysis.
arXiv Detail & Related papers (2024-08-27T10:51:51Z) - Towards a Universal Method for Meaningful Signal Detection [0.0]
It is known that human speech and certain animal vocalizations can convey meaningful content because we can decipher the content that a given utterance does convey.
This paper explores an alternative approach to determining whether a signal is meaningful, one that analyzes only the signal itself and is independent of what the conveyed meaning might be.
arXiv Detail & Related papers (2024-07-28T15:45:08Z) - Towards Dog Bark Decoding: Leveraging Human Speech Processing for Automated Bark Classification [23.974783158267428]
We explore the use of self-supervised speech representation models pre-trained on human speech to address dog bark classification tasks.
We show that using speech embedding representations significantly improves over simpler classification baselines.
We also find that models pre-trained on large human speech acoustics can provide additional performance boosts on several tasks.
arXiv Detail & Related papers (2024-04-29T14:41:59Z) - Phonetic and Lexical Discovery of a Canine Language using HuBERT [40.578021131708155]
This paper explores potential communication patterns within dog vocalizations and transcends traditional linguistic analysis barriers.
We present a self-supervised approach with HuBERT, enabling the accurate classification of phoneme labels.
We develop a web-based dog vocalization labeling system to highlight phoneme n-grams, present in the vocabulary, in the dog audio uploaded by users.
arXiv Detail & Related papers (2024-02-25T04:35:45Z) - Does My Dog ''Speak'' Like Me? The Acoustic Correlation between Pet Dogs
and Their Human Owners [19.422796780268605]
This paper presents a preliminary investigation into the possible correlation between domestic dog vocal expressions and their human host's language environment.
We first present a new dataset of Shiba Inu dog vocals from YouTube, which provides 7500 clean sound clips.
With a classification task and prominent factor analysis, we discover significant acoustic differences in the dog vocals from the two language environments.
arXiv Detail & Related papers (2023-09-21T23:49:21Z) - Do Orcas Have Semantic Language? Machine Learning to Predict Orca
Behaviors Using Partially Labeled Vocalization Data [50.02992288349178]
We study whether machine learning can predict behavior from vocalizations.
We work with recent recordings of McMurdo Sound orcas.
With careful combination of recent machine learning techniques, we achieve 96.4% classification accuracy.
arXiv Detail & Related papers (2023-01-28T06:04:22Z) - CLAMP: Prompt-based Contrastive Learning for Connecting Language and
Animal Pose [70.59906971581192]
We introduce a novel prompt-based Contrastive learning scheme for connecting Language and AniMal Pose effectively.
The CLAMP attempts to bridge the gap by adapting the text prompts to the animal keypoints during network training.
Experimental results show that our method achieves state-of-the-art performance under the supervised, few-shot, and zero-shot settings.
arXiv Detail & Related papers (2022-06-23T14:51:42Z) - Perception Point: Identifying Critical Learning Periods in Speech for
Bilingual Networks [58.24134321728942]
We compare and identify cognitive aspects on deep neural-based visual lip-reading models.
We observe a strong correlation between these theories in cognitive psychology and our unique modeling.
arXiv Detail & Related papers (2021-10-13T05:30:50Z) - Cetacean Translation Initiative: a roadmap to deciphering the
communication of sperm whales [97.41394631426678]
Recent research showed the promise of machine learning tools for analyzing acoustic communication in nonhuman species.
We outline the key elements required for the collection and processing of massive bioacoustic data of sperm whales.
The technological capabilities developed are likely to yield cross-applications and advancements in broader communities investigating non-human communication and animal behavioral research.
arXiv Detail & Related papers (2021-04-17T18:39:22Z) - SPLAT: Speech-Language Joint Pre-Training for Spoken Language
Understanding [61.02342238771685]
Spoken language understanding requires a model to analyze input acoustic signal to understand its linguistic content and make predictions.
Various pre-training methods have been proposed to learn rich representations from large-scale unannotated speech and text.
We propose a novel semi-supervised learning framework, SPLAT, to jointly pre-train the speech and language modules.
arXiv Detail & Related papers (2020-10-05T19:29:49Z) - Where New Words Are Born: Distributional Semantic Analysis of Neologisms
and Their Semantic Neighborhoods [51.34667808471513]
We investigate the importance of two factors, semantic sparsity and frequency growth rates of semantic neighbors, formalized in the distributional semantics paradigm.
We show that both factors are predictive word emergence although we find more support for the latter hypothesis.
arXiv Detail & Related papers (2020-01-21T19:09:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.