Deep segmental phonetic posterior-grams based discovery of
non-categories in L2 English speech
- URL: http://arxiv.org/abs/2002.00205v1
- Date: Sat, 1 Feb 2020 13:21:33 GMT
- Title: Deep segmental phonetic posterior-grams based discovery of
non-categories in L2 English speech
- Authors: Xu Li, Xixin Wu, Xunying Liu, Helen Meng
- Abstract summary: Second language (L2) speech is often labeled with the native, phone categories.
In many cases, it is difficult to decide on a categorical phone that an L2 segment to.
- Score: 65.04834943405673
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Second language (L2) speech is often labeled with the native, phone
categories. However, in many cases, it is difficult to decide on a categorical
phone that an L2 segment belongs to. These segments are regarded as
non-categories. Most existing approaches for Mispronunciation Detection and
Diagnosis (MDD) are only concerned with categorical errors, i.e. a phone
category is inserted, deleted or substituted by another. However,
non-categorical errors are not considered. To model these non-categorical
errors, this work aims at exploring non-categorical patterns to extend the
categorical phone set. We apply a phonetic segment classifier to generate
segmental phonetic posterior-grams (SPPGs) to represent phone segment-level
information. And then we explore the non-categories by looking for the SPPGs
with more than one peak. Compared with the baseline system, this approach
explores more non-categorical patterns, and also perceptual experimental
results show that the explored non-categories are more accurate with increased
confusion degree by 7.3% and 7.5% under two different measures. Finally, we
preliminarily analyze the reason behind those non-categories.
Related papers
- Category-Extensible Out-of-Distribution Detection via Hierarchical Context Descriptions [35.20091752343433]
This work introduces two hierarchical contexts, namely perceptual context and spurious context, to carefully describe the precise category boundary.
The two contexts hierarchically construct the precise description for a certain category, which is first roughly classifying a sample to the predicted category.
The precise descriptions for those categories within the vision-language framework present a novel application: CATegory-EXtensible OOD detection (CATEX)
arXiv Detail & Related papers (2024-07-23T12:53:38Z) - Unsupervised Universal Image Segmentation [59.0383635597103]
We propose an Unsupervised Universal model (U2Seg) adept at performing various image segmentation tasks.
U2Seg generates pseudo semantic labels for these segmentation tasks via leveraging self-supervised models.
We then self-train the model on these pseudo semantic labels, yielding substantial performance gains.
arXiv Detail & Related papers (2023-12-28T18:59:04Z) - Generalized Category Discovery with Decoupled Prototypical Network [27.1635162759448]
We present a novel model called Decoupled Prototypical Network (DPN)
By formulating a bipartite matching problem for category prototypes, DPN can achieve different training targets effectively.
DPN can learn more discriminative features for both known and novel categories through our proposed Semantic-aware Prototypical Learning (SPL)
arXiv Detail & Related papers (2022-11-28T08:05:45Z) - Segmenting Known Objects and Unseen Unknowns without Prior Knowledge [86.46204148650328]
holistic segmentation aims to identify and separate objects of unseen, unknown categories into instances without any prior knowledge about them.
We tackle this new problem with U3HS, which finds unknowns as highly uncertain regions and clusters their corresponding instance-aware embeddings into individual objects.
Experiments on public data from MS, Cityscapes, and Lost&Found demonstrate the effectiveness of U3HS.
arXiv Detail & Related papers (2022-09-12T16:59:36Z) - Classifying with Uncertain Data Envelopment Analysis [0.0]
We propose a new classification scheme premised on the reality of imperfect data.
Our model uses uncertain data envelopment analysis to define a classification's proximity to equitable efficiency.
We conclude by classifying the thirty stocks in the Dow Jones Industrial average into performant tiers and by classifying prostate treatments into clinically effectual categories.
arXiv Detail & Related papers (2022-09-02T13:41:19Z) - SegmentMeIfYouCan: A Benchmark for Anomaly Segmentation [111.61261419566908]
Deep neural networks (DNNs) are usually trained on a closed set of semantic classes.
They are ill-equipped to handle previously-unseen objects.
detecting and localizing such objects is crucial for safety-critical applications such as perception for automated driving.
arXiv Detail & Related papers (2021-04-30T07:58:19Z) - DeepCAT: Deep Category Representation for Query Understanding in
E-commerce Search [15.041444067591007]
We propose a deep learning model, DeepCAT, which learns joint word-category representations to enhance the query understanding process.
Our results show that DeepCAT reaches a 10% improvement on em minority classes and a 7.1% improvement on em tail queries over a state-of-the-art label embedding model.
arXiv Detail & Related papers (2021-04-23T18:04:44Z) - DropLoss for Long-Tail Instance Segmentation [56.162929199998075]
We develop DropLoss, a novel adaptive loss to compensate for the imbalance between rare and frequent categories.
We show state-of-the-art mAP across rare, common, and frequent categories on the LVIS dataset.
arXiv Detail & Related papers (2021-04-13T17:59:22Z) - Adaptive Class Suppression Loss for Long-Tail Object Detection [49.7273558444966]
We devise a novel Adaptive Class Suppression Loss (ACSL) to improve the detection performance of tail categories.
Our ACSL achieves 5.18% and 5.2% improvements with ResNet50-FPN, and sets a new state of the art.
arXiv Detail & Related papers (2021-04-02T05:12:31Z) - Generating CCG Categories [20.154553201329712]
We propose to generate categories rather than classify them.
We show that with this finer view on categories, annotations of different categories could be shared and interactions with sentence contexts could be enhanced.
The proposed category generator is able to achieve state-of-the-art tagging (95.5% accuracy) and parsing (89.8% labeled F1) performances on the standard CCGBank.
arXiv Detail & Related papers (2021-03-15T05:01:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.