Related papers: VoCopilot: Voice-Activated Tracking of Everyday Interactions

URL: http://arxiv.org/abs/2312.10265v1
Date: Fri, 15 Dec 2023 23:46:52 GMT
Title: VoCopilot: Voice-Activated Tracking of Everyday Interactions
Authors: Sheen An Goh, Manoj Gulati, Ambuj Varshney
Abstract summary: This paper presents our efforts to design a new vocal tracking system we call VoCopilot. VoCopilot is an end-to-end system centered around energy-efficient acoustic hardware and firmware combined with advanced machine learning models.
Score: 1.0435741631709405
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Voice plays an important role in our lives by facilitating communication, conveying emotions, and indicating health. Therefore, tracking vocal interactions can provide valuable insight into many aspects of our lives. This paper presents our ongoing efforts to design a new vocal tracking system we call VoCopilot. VoCopilot is an end-to-end system centered around energy-efficient acoustic hardware and firmware combined with advanced machine learning models. As a result, VoCopilot is able to continuously track conversations, record them, transcribe them, and then extract useful insights from them. By utilizing large language models, VoCopilot ensures the user can extract useful insights from recorded interactions without having to learn complex machine learning techniques. In order to protect the privacy of end users, VoCopilot uses a novel wake-up mechanism that only records conversations of end users. Additionally, the rest of the pipeline can be run on a commodity computer (Mac Mini M2). In this work, we show the effectiveness of VoCopilot in a real-world environment for two use cases.

Related papers

Do It For Me vs. Do It With Me: Investigating User Perceptions of Different Paradigms of Automation in Copilots for Feature-Rich Software [9.881955481813465]
Large Language Model (LLM)-based in-application assistants, or copilots, can automate software tasks. We investigated two automation paradigms by designing and implementing a fully automated copilot and a semi-automated copilot. GuidedCopilot automates trivial steps while offering step-by-step visual guidance.
arXiv Detail & Related papers (2025-04-22T03:11:10Z)
HACTS: a Human-As-Copilot Teleoperation System for Robot Learning [47.9126187195398]
We introduce HACTS (Human-As-Copilot Teleoperation System), a novel system that establishes bilateral, real-time joint synchronization between a robot arm and teleoperation hardware. This simple yet effective feedback mechanism, akin to a steering wheel in autonomous vehicles, enables the human copilot to intervene seamlessly while collecting action-correction data for future learning.
arXiv Detail & Related papers (2025-03-31T13:28:13Z)
Healthcare Copilot: Eliciting the Power of General LLMs for Medical Consultation [96.22329536480976]
We introduce the construction of a Healthcare Copilot designed for medical consultation. The proposed Healthcare Copilot comprises three main components: 1) the Dialogue component, responsible for effective and safe patient interactions; 2) the Memory component, storing both current conversation data and historical patient information; and 3) the Processing component, summarizing the entire dialogue and generating reports. To evaluate the proposed Healthcare Copilot, we implement an auto-evaluation scheme using ChatGPT for two roles: as a virtual patient engaging in dialogue with the copilot, and as an evaluator to assess the quality of the dialogue.
arXiv Detail & Related papers (2024-02-20T22:26:35Z)
Demystifying Practices, Challenges and Expected Features of Using GitHub Copilot [3.655281304961642]
We conducted an empirical study by collecting and analyzing the data from Stack Overflow (SO) and GitHub Discussions. We identified the programming languages, technologies used with Copilot, functions implemented, benefits, limitations, and challenges when using Copilot. Our results suggest that using Copilot is like a double-edged sword, which requires developers to carefully consider various aspects when deciding whether or not to use it.
arXiv Detail & Related papers (2023-09-11T16:39:37Z)
Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning [82.91837418721182]
Adaptive interfaces can help users perform sequential decision-making tasks. Recent advances in human-in-the-loop machine learning enable such systems to improve by interacting with users. We propose a reinforcement learning algorithm to train an interface to map raw command signals to actions.
arXiv Detail & Related papers (2023-09-07T16:52:27Z)
Multi-model fusion for Aerial Vision and Dialog Navigation based on human attention aids [69.98258892165767]
We present an aerial navigation task for the 2023 ICCV Conversation History. We propose an effective method of fusion training of Human Attention Aided Transformer model (HAA-Transformer) and Human Attention Aided LSTM (HAA-LSTM) models.
arXiv Detail & Related papers (2023-08-27T10:32:52Z)
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers [0.797970449705065]
We propose a novel virtual simulation-pilot engine for speeding up air traffic controller (ATCo) training. The engine receives spoken communications from ATCo trainees, and it performs automatic speech recognition and understanding. To the best of our knowledge, this is the first work fully based on open-source ATC resources and AI tools.
arXiv Detail & Related papers (2023-04-16T17:45:21Z)
Towards Cooperative Flight Control Using Visual-Attention [61.99121057062421]
We propose a vision-based air-guardian system to enable parallel autonomy between a pilot and a control system. Our attention-based air-guardian system can balance the trade-off between its level of involvement in the flight and the pilot's expertise and attention.
arXiv Detail & Related papers (2022-12-21T15:31:47Z)
Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator [0.5480546613836199]
This paper describes a simple yet efficient repetition-based modular system for speeding up air-traffic controller (ATCo) training. For example, a human pilot is still required in EURO's ESCAPE lite simulator (see https://www.eurocontrol.int/simulator/escape) during ATCo training. This need can be met by an automatic system that acts as a pilot.
arXiv Detail & Related papers (2022-12-14T11:34:59Z)
Reading Between the Lines: Modeling User Behavior and Costs in AI-Assisted Programming [28.254978977288868]
We studied GitHub Copilot, a code-recommendation system used by millions of programmers daily. We developed CUPS, a taxonomy of common programmer activities when interacting with Copilot. Our insights reveal how programmers interact with Copilot and motivate new interface designs and metrics.
arXiv Detail & Related papers (2022-10-25T20:01:15Z)
Play it by Ear: Learning Skills amidst Occlusion through Audio-Visual Imitation Learning [62.83590925557013]
We learn a set of challenging partially-observed manipulation tasks from visual and audio inputs. Our proposed system learns these tasks by combining offline imitation learning from tele-operated demonstrations and online finetuning. In a set of simulated tasks, we find that our system benefits from using audio, and that by using online interventions we are able to improve the success rate of offline imitation learning by 20%.
arXiv Detail & Related papers (2022-05-30T04:52:58Z)
Stop Bugging Me! Evading Modern-Day Wiretapping Using Adversarial Perturbations [47.32228513808444]
Mass surveillance systems for voice over IP (VoIP) conversations pose a great risk to privacy. We present an adversarial-learning-based framework for privacy protection for VoIP conversations.
arXiv Detail & Related papers (2020-10-24T06:56:35Z)

This list is automatically generated from the titles and abstracts of the papers on this site.