Time to Transfer: Predicting and Evaluating Machine-Human Chatting
Handoff
- URL: http://arxiv.org/abs/2012.07610v1
- Date: Mon, 14 Dec 2020 15:02:08 GMT
- Title: Time to Transfer: Predicting and Evaluating Machine-Human Chatting
Handoff
- Authors: Jiawei Liu, Zhe Gao, Yangyang Kang, Zhuoren Jiang, Guoxiu He,
Changlong Sun, Xiaozhong Liu, Wei Lu
- Abstract summary: We introduce the Machine-Human Chatting Handoff (MHCH), which enables human-algorithm collaboration.
To detect the normal/transferable utterances, we propose a Difficulty-Assisted Matching Inference (DAMI) network.
A matching inference mechanism is introduced to capture the contextual matching features.
- Score: 36.62707486132739
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Is chatbot able to completely replace the human agent? The short answer could
be - "it depends...". For some challenging cases, e.g., dialogue's topical
spectrum spreads beyond the training corpus coverage, the chatbot may
malfunction and return unsatisfied utterances. This problem can be addressed by
introducing the Machine-Human Chatting Handoff (MHCH), which enables
human-algorithm collaboration. To detect the normal/transferable utterances, we
propose a Difficulty-Assisted Matching Inference (DAMI) network, utilizing
difficulty-assisted encoding to enhance the representations of utterances.
Moreover, a matching inference mechanism is introduced to capture the
contextual matching features. A new evaluation metric, Golden Transfer within
Tolerance (GT-T), is proposed to assess the performance by considering the
tolerance property of the MHCH. To provide insights into the task and validate
the proposed model, we collect two new datasets. Extensive experimental results
are presented and contrasted against a series of baseline models to demonstrate
the efficacy of our model on MHCH.
Related papers
- UAHOI: Uncertainty-aware Robust Interaction Learning for HOI Detection [18.25576487115016]
This paper focuses on Human-Object Interaction (HOI) detection.
It addresses the challenge of identifying and understanding the interactions between humans and objects within a given image or video frame.
We propose a novel approach textscUAHOI, Uncertainty-aware Robust Human-Object Interaction Learning.
arXiv Detail & Related papers (2024-08-14T10:06:39Z) - Are AI-Generated Text Detectors Robust to Adversarial Perturbations? [9.001160538237372]
Current detectors for AI-generated text (AIGT) lack robustness against adversarial perturbations.
This paper investigates the robustness of existing AIGT detection methods and introduces a novel detector, the Siamese Calibrated Reconstruction Network (SCRN)
The SCRN employs a reconstruction network to add and remove noise from text, extracting a semantic representation that is robust to local perturbations.
arXiv Detail & Related papers (2024-06-03T10:21:48Z) - Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model [75.66013048128302]
In this work, we investigate the potential of employing the QE model as the reward model to predict human preferences for feedback training.
We first identify the overoptimization problem during QE-based feedback training, manifested as an increase in reward while translation quality declines.
To address the problem, we adopt a simple yet effective method that uses rules to detect the incorrect translations and assigns a penalty term to the reward scores of them.
arXiv Detail & Related papers (2024-01-23T16:07:43Z) - PSDiff: Diffusion Model for Person Search with Iterative and
Collaborative Refinement [59.6260680005195]
We present a novel Person Search framework based on the Diffusion model, PSDiff.
PSDiff formulates the person search as a dual denoising process from noisy boxes and ReID embeddings to ground truths.
Following the new paradigm, we further design a new Collaborative Denoising Layer (CDL) to optimize detection and ReID sub-tasks in an iterative and collaborative way.
arXiv Detail & Related papers (2023-09-20T08:16:39Z) - Towards General Visual-Linguistic Face Forgery Detection [95.73987327101143]
Deepfakes are realistic face manipulations that can pose serious threats to security, privacy, and trust.
Existing methods mostly treat this task as binary classification, which uses digital labels or mask signals to train the detection model.
We propose a novel paradigm named Visual-Linguistic Face Forgery Detection(VLFFD), which uses fine-grained sentence-level prompts as the annotation.
arXiv Detail & Related papers (2023-07-31T10:22:33Z) - DCTM: Dilated Convolutional Transformer Model for Multimodal Engagement
Estimation in Conversation [11.185293979235547]
We introduce a convolutional Transformer for modeling and estimating human engagement.
Our proposed system surpasses the baseline models, exhibiting a noteworthy $7$% improvement on test set.
We employ different modality fusion mechanism and show that for this type of data, a simpled method with self-attention fusion gains the best performance.
arXiv Detail & Related papers (2023-07-31T06:02:35Z) - An Explainable Geometric-Weighted Graph Attention Network for
Identifying Functional Networks Associated with Gait Impairment [0.0]
One of the hallmark symptoms of Parkinson's Disease (PD) is the progressive loss of postural reflexes.
Identifying disruptions in brain function associated with gait impairment could be crucial in better understanding PD motor progression.
We present an explainable, geometric, weighted-graph attention neural network (xGW-GAT) to identify functional networks predictive of the progression of gait difficulties.
arXiv Detail & Related papers (2023-07-24T19:57:21Z) - MISMATCH: Fine-grained Evaluation of Machine-generated Text with
Mismatch Error Types [68.76742370525234]
We propose a new evaluation scheme to model human judgments in 7 NLP tasks, based on the fine-grained mismatches between a pair of texts.
Inspired by the recent efforts in several NLP tasks for fine-grained evaluation, we introduce a set of 13 mismatch error types.
We show that the mismatch errors between the sentence pairs on the held-out datasets from 7 NLP tasks align well with the human evaluation.
arXiv Detail & Related papers (2023-06-18T01:38:53Z) - A Role-Selected Sharing Network for Joint Machine-Human Chatting Handoff
and Service Satisfaction Analysis [35.937850808046456]
We propose a novel model, Role-Selected Sharing Network ( RSSN), which integrates dialogue satisfaction estimation and handoff prediction in one multi-task learning framework.
Unlike prior efforts in dialog mining, by utilizing local user satisfaction as a bridge, global satisfaction detector and handoff predictor can effectively exchange critical information.
arXiv Detail & Related papers (2021-09-17T08:39:45Z) - Exploiting Unsupervised Data for Emotion Recognition in Conversations [76.01690906995286]
Emotion Recognition in Conversations (ERC) aims to predict the emotional state of speakers in conversations.
The available supervised data for the ERC task is limited.
We propose a novel approach to leverage unsupervised conversation data.
arXiv Detail & Related papers (2020-10-02T13:28:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.