A Fast Attention Network for Joint Intent Detection and Slot Filling on
Edge Devices
- URL: http://arxiv.org/abs/2205.07646v1
- Date: Mon, 16 May 2022 13:06:51 GMT
- Title: A Fast Attention Network for Joint Intent Detection and Slot Filling on
Edge Devices
- Authors: Liang Huang, Senjie Liang, Feiyang Ye, Nan Gao
- Abstract summary: We propose a Fast Attention Network (FAN) for joint intent detection and slot filling tasks, guaranteeing both accuracy and latency.
FAN can be implemented on different encoders and delivers more accurate models at every speed level.
Our experiments on the Jetson Nano platform show that FAN inferences fifteen utterances per second with a small accuracy drop, showing its effectiveness and efficiency on edge devices.
- Score: 5.982036462074887
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Intent detection and slot filling are two main tasks in natural language
understanding and play an essential role in task-oriented dialogue systems. The
joint learning of both tasks can improve inference accuracy and is popular in
recent works. However, most joint models ignore the inference latency and
cannot meet the need to deploy dialogue systems at the edge. In this paper, we
propose a Fast Attention Network (FAN) for joint intent detection and slot
filling tasks, guaranteeing both accuracy and latency. Specifically, we
introduce a clean and parameter-refined attention module to enhance the
information exchange between intent and slot, improving semantic accuracy by
more than 2%. FAN can be implemented on different encoders and delivers more
accurate models at every speed level. Our experiments on the Jetson Nano
platform show that FAN inferences fifteen utterances per second with a small
accuracy drop, showing its effectiveness and efficiency on edge devices.
Related papers
- A Point-Based Approach to Efficient LiDAR Multi-Task Perception [49.91741677556553]
PAttFormer is an efficient multi-task architecture for joint semantic segmentation and object detection in point clouds.
Unlike other LiDAR-based multi-task architectures, our proposed PAttFormer does not require separate feature encoders for task-specific point cloud representations.
Our evaluations show substantial gains from multi-task learning, improving LiDAR semantic segmentation by +1.7% in mIou and 3D object detection by +1.7% in mAP.
arXiv Detail & Related papers (2024-04-19T11:24:34Z) - Joint Multiple Intent Detection and Slot Filling with Supervised
Contrastive Learning and Self-Distillation [4.123763595394021]
Multiple intent detection and slot filling are fundamental and crucial tasks in spoken language understanding.
Joint models that can detect intents and extract slots simultaneously are preferred.
We present a method for multiple intent detection and slot filling by addressing these challenges.
arXiv Detail & Related papers (2023-08-28T15:36:33Z) - Task Conditioned BERT for Joint Intent Detection and Slot-filling [4.564638655448509]
We investigate the hypothesis that solving these challenges as one unified model will allow the transfer of parameter support data across the different tasks.
The proposed principled model is based on a Transformer encoder, trained on multiple tasks, and leveraged by a rich input that conditions the model on the target inferences.
arXiv Detail & Related papers (2023-08-11T14:47:27Z) - Multi-view Point Cloud Registration based on Evolutionary Multitasking
with Bi-Channel Knowledge Sharing Mechanism [29.232021965321408]
This paper models the registration problem as multi-task optimization.
It proposes a novel bi-channel knowledge sharing mechanism for effective and efficient problem solving.
arXiv Detail & Related papers (2022-05-06T03:26:16Z) - Decoupled and Memory-Reinforced Networks: Towards Effective Feature
Learning for One-Step Person Search [65.51181219410763]
One-step methods have been developed to handle pedestrian detection and identification sub-tasks using a single network.
There are two major challenges in the current one-step approaches.
We propose a decoupled and memory-reinforced network (DMRNet) to overcome these problems.
arXiv Detail & Related papers (2021-02-22T06:19:45Z) - AFD-Net: Adaptive Fully-Dual Network for Few-Shot Object Detection [8.39479809973967]
Few-shot object detection (FSOD) aims at learning a detector that can fast adapt to previously unseen objects with scarce examples.
Existing methods solve this problem by performing subtasks of classification and localization utilizing a shared component.
We present that a general few-shot detector should consider the explicit decomposition of two subtasks, as well as leveraging information from both of them to enhance feature representations.
arXiv Detail & Related papers (2020-11-30T10:21:32Z) - Discriminative Nearest Neighbor Few-Shot Intent Detection by
Transferring Natural Language Inference [150.07326223077405]
Few-shot learning is attracting much attention to mitigate data scarcity.
We present a discriminative nearest neighbor classification with deep self-attention.
We propose to boost the discriminative ability by transferring a natural language inference (NLI) model.
arXiv Detail & Related papers (2020-10-25T00:39:32Z) - A Co-Interactive Transformer for Joint Slot Filling and Intent Detection [61.109486326954205]
Intent detection and slot filling are two main tasks for building a spoken language understanding (SLU) system.
Previous studies either model the two tasks separately or only consider the single information flow from intent to slot.
We propose a Co-Interactive Transformer to consider the cross-impact between the two tasks simultaneously.
arXiv Detail & Related papers (2020-10-08T10:16:52Z) - FairMOT: On the Fairness of Detection and Re-Identification in Multiple
Object Tracking [92.48078680697311]
Multi-object tracking (MOT) is an important problem in computer vision.
We present a simple yet effective approach termed as FairMOT based on the anchor-free object detection architecture CenterNet.
The approach achieves high accuracy for both detection and tracking.
arXiv Detail & Related papers (2020-04-04T08:18:00Z) - Depthwise Non-local Module for Fast Salient Object Detection Using a
Single Thread [136.2224792151324]
We propose a new deep learning algorithm for fast salient object detection.
The proposed algorithm achieves competitive accuracy and high inference efficiency simultaneously with a single CPU thread.
arXiv Detail & Related papers (2020-01-22T15:23:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.