A Tale of Two Lexica Testing Computational Hypotheses with Deep
Convolutional Neural Networks
- URL: http://arxiv.org/abs/2104.06271v1
- Date: Tue, 13 Apr 2021 15:03:14 GMT
- Title: A Tale of Two Lexica Testing Computational Hypotheses with Deep
Convolutional Neural Networks
- Authors: Enes Avcu, Olivia Newman, David Gow
- Abstract summary: We investigate the existence of two parallel wordform stores: the dorsal and ventral processing streams.
We created two deep convolutional neural networks (CNNs) to test the hypothesis.
Our results are consistent with the hypothesis that the divergent processing demands of the ventral and dorsal processing streams impose computational pressures for the development of multiple lexica.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Gow's (2012) dual lexicon model suggests that the primary purpose of words is
to mediate the mappings between acoustic-phonetic input and other forms of
linguistic representation. Motivated by evidence from functional imaging,
aphasia, and behavioral results, the model argues for the existence of two
parallel wordform stores: the dorsal and ventral processing streams. In this
paper, we tested the hypothesis that the complex, but systematic mapping
between sound and articulation in the dorsal stream poses different
computational pressures on feature sets than the more arbitrary mapping between
sound and meaning. To test this hypothesis, we created two deep convolutional
neural networks (CNNs). While the dorsal network was trained to identify
individual spoken words, the ventral network was trained to map them onto
semantic classes. We then extracted patterns of network activation from the
penultimate level of each network and tested how well features generated by the
network supported generalization to linguistic categorization associated with
the dorsal versus ventral processing streams. Our preliminary results showed
both models successfully learned their tasks. Secondary generalization testing
showed the ventral CNN outperformed the dorsal CNN on a semantic task:
concreteness classification, while the dorsal CNN outperformed the ventral CNN
on articulation tasks: classification by onset phoneme class and syllable
length. These results are consistent with the hypothesis that the divergent
processing demands of the ventral and dorsal processing streams impose
computational pressures for the development of multiple lexica.
Related papers
- Deception Detection from Linguistic and Physiological Data Streams Using Bimodal Convolutional Neural Networks [19.639533220155965]
This paper explores the application of convolutional neural networks for the purpose of multimodal deception detection.
We use a dataset built by interviewing 104 subjects about two topics, with one truthful and one falsified response from each subject about each topic.
arXiv Detail & Related papers (2023-11-18T02:44:33Z) - Prune and distill: similar reformatting of image information along rat
visual cortex and deep neural networks [61.60177890353585]
Deep convolutional neural networks (CNNs) have been shown to provide excellent models for its functional analogue in the brain, the ventral stream in visual cortex.
Here we consider some prominent statistical patterns that are known to exist in the internal representations of either CNNs or the visual cortex.
We show that CNNs and visual cortex share a similarly tight relationship between dimensionality expansion/reduction of object representations and reformatting of image information.
arXiv Detail & Related papers (2022-05-27T08:06:40Z) - Data-driven emergence of convolutional structure in neural networks [83.4920717252233]
We show how fully-connected neural networks solving a discrimination task can learn a convolutional structure directly from their inputs.
By carefully designing data models, we show that the emergence of this pattern is triggered by the non-Gaussian, higher-order local structure of the inputs.
arXiv Detail & Related papers (2022-02-01T17:11:13Z) - Dynamic Inference with Neural Interpreters [72.90231306252007]
We present Neural Interpreters, an architecture that factorizes inference in a self-attention network as a system of modules.
inputs to the model are routed through a sequence of functions in a way that is end-to-end learned.
We show that Neural Interpreters perform on par with the vision transformer using fewer parameters, while being transferrable to a new task in a sample efficient manner.
arXiv Detail & Related papers (2021-10-12T23:22:45Z) - Controlled Caption Generation for Images Through Adversarial Attacks [85.66266989600572]
We study adversarial examples for vision and language models, which typically adopt a Convolutional Neural Network (i.e., CNN) for image feature extraction and a Recurrent Neural Network (RNN) for caption generation.
In particular, we investigate attacks on the visual encoder's hidden layer that is fed to the subsequent recurrent network.
We propose a GAN-based algorithm for crafting adversarial examples for neural image captioning that mimics the internal representation of the CNN.
arXiv Detail & Related papers (2021-07-07T07:22:41Z) - BreakingBED -- Breaking Binary and Efficient Deep Neural Networks by
Adversarial Attacks [65.2021953284622]
We study robustness of CNNs against white-box and black-box adversarial attacks.
Results are shown for distilled CNNs, agent-based state-of-the-art pruned models, and binarized neural networks.
arXiv Detail & Related papers (2021-03-14T20:43:19Z) - Effect of Word Embedding Models on Hate and Offensive Speech Detection [1.7403133838762446]
We investigate the impact of both word embedding models and neural network architectures on the predictive accuracy.
We first train several word embedding models on a large-scale unlabelled Arabic text corpus.
For each detection task, we train several neural network classifiers using the pre-trained word embedding models.
This task yields a large number of various learned models, which allows conducting an exhaustive comparison.
arXiv Detail & Related papers (2020-11-23T02:43:45Z) - On the Effects of Using word2vec Representations in Neural Networks for
Dialogue Act Recognition [0.6767885381740952]
We propose a new deep neural network that explores recurrent models to capture word sequences within sentences.
We validate this model on three languages: English, French and Czech.
arXiv Detail & Related papers (2020-10-22T07:21:17Z) - Identity-Based Patterns in Deep Convolutional Networks: Generative
Adversarial Phonology and Reduplication [0.0]
We use the ciwGAN architecture Beguvs in which learning of meaningful representations in speech emerges from a requirement that the CNNs generate informative data.
We propose a technique to wug-test CNNs trained on speech and, based on four generative tests, argue that the network learns to represent an identity-based pattern in its latent space.
arXiv Detail & Related papers (2020-09-13T23:12:49Z) - Combine Convolution with Recurrent Networks for Text Classification [12.92202472766078]
We propose a novel method to keep the strengths of the two networks to a great extent.
In the proposed model, a convolutional neural network is applied to learn a 2D weight matrix where each row reflects the importance of each word from different aspects.
We use a bi-directional RNN to process each word and employ a neural tensor layer that fuses forward and backward hidden states to get word representations.
arXiv Detail & Related papers (2020-06-29T03:36:04Z) - Mechanisms for Handling Nested Dependencies in Neural-Network Language
Models and Humans [75.15855405318855]
We studied whether a modern artificial neural network trained with "deep learning" methods mimics a central aspect of human sentence processing.
Although the network was solely trained to predict the next word in a large corpus, analysis showed the emergence of specialized units that successfully handled local and long-distance syntactic agreement.
We tested the model's predictions in a behavioral experiment where humans detected violations in number agreement in sentences with systematic variations in the singular/plural status of multiple nouns.
arXiv Detail & Related papers (2020-06-19T12:00:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.