Definition-independent Formalization of Soundscapes: Towards a Formal
Methodology
- URL: http://arxiv.org/abs/2310.13404v1
- Date: Fri, 20 Oct 2023 10:22:15 GMT
- Title: Definition-independent Formalization of Soundscapes: Towards a Formal
Methodology
- Authors: Mikel D. Jedrusiak, Thomas Harweg, Timo Haselhoff, Bryce T. Lawrence,
Susanne Moebus, Frank Weichert
- Abstract summary: Soundscapes have been studied by researchers from various disciplines, each with different perspectives, goals, approaches, and terminologies.
We present a potential formalization that is independent of the underlying soundscape definition.
We show a practical application of our presented formalization.
- Score: 0.873811641236639
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Soundscapes have been studied by researchers from various disciplines, each
with different perspectives, goals, approaches, and terminologies. Accordingly,
depending on the field, the concept of a soundscape's components changes,
consequently changing the basic definition. This results in complicating
interdisciplinary communication and comparison of results. Especially when
soundscape-unrelated research areas are involved. For this reason, we present a
potential formalization that is independent of the underlying soundscape
definition, with the goal of being able to capture the heterogeneous structure
of the data as well as the different ideologies in one model. In an exemplary
analysis of frequency correlation matrices for land use type detection as an
alternative to features like MFCCs, we show a practical application of our
presented formalization.
Related papers
- SSDOnt: an Ontology for representing Single-Subject Design Studies [0.05524804393257919]
Single-Subject Design is used in several areas such as education and biomedicine.
No suited formal vocabulary exists for annotating the detailed configuration and the results of this type of research studies.
We present SSDOnt, a specific purpose ontology for describing and annotating single-subject design studies.
arXiv Detail & Related papers (2024-01-26T15:11:31Z) - Grounded and Well-rounded: A Methodological Approach to the Study of
Cross-modal and Cross-lingual Grounding [3.435010087800495]
Grounding has been argued to be a crucial component towards the development of more complete and truly semantically competent artificial intelligence systems.
We study what the effects are - if any - of providing models with richer input sources than text-only.
Experiments using this framework reveal qualitative differences in model behavior between cross-modally grounded, cross-lingually grounded, and ungrounded models.
arXiv Detail & Related papers (2023-10-18T13:05:50Z) - A Category-theoretical Meta-analysis of Definitions of Disentanglement [97.34033555407403]
Disentangling the factors of variation in data is a fundamental concept in machine learning.
This paper presents a meta-analysis of existing definitions of disentanglement.
arXiv Detail & Related papers (2023-05-11T15:24:20Z) - Variational Cross-Graph Reasoning and Adaptive Structured Semantics
Learning for Compositional Temporal Grounding [143.5927158318524]
Temporal grounding is the task of locating a specific segment from an untrimmed video according to a query sentence.
We introduce a new Compositional Temporal Grounding task and construct two new dataset splits.
We argue that the inherent structured semantics inside the videos and language is the crucial factor to achieve compositional generalization.
arXiv Detail & Related papers (2023-01-22T08:02:23Z) - Analyzing the Representational Geometry of Acoustic Word Embeddings [22.677210029168588]
Acoustic word embeddings (AWEs) are vector representations such that different acoustic exemplars of the same word are projected nearby.
This paper takes a closer analytical look at AWEs learned from English speech and study how the choice of the learning objective and the architecture shapes their representational profile.
arXiv Detail & Related papers (2023-01-08T10:22:50Z) - Mix and Reason: Reasoning over Semantic Topology with Data Mixing for
Domain Generalization [48.90173060487124]
Domain generalization (DG) enables a learning machine from multiple seen source domains to an unseen target one.
mire consists of two key components, namely, Category-aware Data Mixing (CDM) and Adaptive Semantic Topology Refinement (ASTR)
experiments on multiple DG benchmarks validate the effectiveness and robustness of the proposed mire.
arXiv Detail & Related papers (2022-10-14T06:52:34Z) - Compositional Temporal Grounding with Structured Variational Cross-Graph
Correspondence Learning [92.07643510310766]
Temporal grounding in videos aims to localize one target video segment that semantically corresponds to a given query sentence.
We introduce a new Compositional Temporal Grounding task and construct two new dataset splits.
We empirically find that they fail to generalize to queries with novel combinations of seen words.
We propose a variational cross-graph reasoning framework that explicitly decomposes video and language into multiple structured hierarchies.
arXiv Detail & Related papers (2022-03-24T12:55:23Z) - Do Acoustic Word Embeddings Capture Phonological Similarity? An
Empirical Study [12.210797811981173]
In this paper, we ask: does the distance in the acoustic embedding space correlate with phonological dissimilarity?
We train AWE models in controlled settings for two languages (German and Czech) and evaluate the embeddings on two tasks: word discrimination and phonological similarity.
Our experiments show that (1) the distance in the embedding space in the best cases only moderately correlates with phonological distance, and (2) improving the performance on the word discrimination task does not necessarily yield models that better reflect word phonological similarity.
arXiv Detail & Related papers (2021-06-16T10:47:56Z) - Interpretable Multi-dataset Evaluation for Named Entity Recognition [110.64368106131062]
We present a general methodology for interpretable evaluation for the named entity recognition (NER) task.
The proposed evaluation method enables us to interpret the differences in models and datasets, as well as the interplay between them.
By making our analysis tool available, we make it easy for future researchers to run similar analyses and drive progress in this area.
arXiv Detail & Related papers (2020-11-13T10:53:27Z) - A Matter of Framing: The Impact of Linguistic Formalism on Probing
Results [69.36678873492373]
Deep pre-trained contextualized encoders like BERT (Delvin et al.) demonstrate remarkable performance on a range of downstream tasks.
Recent research in probing investigates the linguistic knowledge implicitly learned by these models during pre-training.
Can the choice of formalism affect probing results?
We find linguistically meaningful differences in the encoding of semantic role- and proto-role information by BERT depending on the formalism.
arXiv Detail & Related papers (2020-04-30T17:45:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.