ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and
Socially-engaged Conversational Agents
- URL: http://arxiv.org/abs/2005.01777v1
- Date: Mon, 4 May 2020 18:27:58 GMT
- Title: ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and
Socially-engaged Conversational Agents
- Authors: Chia-Yu Li, Daniel Ortega, Dirk Väth, Florian Lux, Lindsey
Vanderlyn, Maximilian Schmidt, Michael Neumann, Moritz Völkel, Pavel
Denisov, Sabrina Jenne, Zorica Kacarevic and Ngoc Thang Vu
- Abstract summary: ADVISER is an open-source, multi-domain dialog system toolkit.
It enables the development of multi-modal (incorporating speech, text and vision) conversational agents.
The final Python-based implementation of our toolkit is flexible, easy to use, and easy to extend.
- Score: 27.222054181839095
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present ADVISER - an open-source, multi-domain dialog system toolkit that
enables the development of multi-modal (incorporating speech, text and vision),
socially-engaged (e.g. emotion recognition, engagement level prediction and
backchanneling) conversational agents. The final Python-based implementation of
our toolkit is flexible, easy to use, and easy to extend not only for
technically experienced users, such as machine learning researchers, but also
for less technically experienced users, such as linguists or cognitive
scientists, thereby providing a flexible platform for collaborative research.
Link to open-source code: https://github.com/DigitalPhonetics/adviser
Related papers
- OpenOmni: A Collaborative Open Source Tool for Building Future-Ready Multimodal Conversational Agents [11.928422245125985]
OpenOmni is an open-source, end-to-end pipeline benchmarking tool.
It integrates technologies such as Speech-to-Text, Emotion Detection, Retrieval Augmented Generation, and Large Language Models.
It supports local and cloud deployment, ensuring data privacy and supporting latency and accuracy benchmarking.
arXiv Detail & Related papers (2024-08-06T09:02:53Z)
- OpenHands: An Open Platform for AI Software Developers as Generalist Agents [109.8507367518992]
We introduce OpenHands, a platform for the development of AI agents that interact with the world in similar ways to a human developer.
We describe how the platform allows for the implementation of new agents, safe interaction with sandboxed environments for code execution, and incorporation of evaluation benchmarks.
arXiv Detail & Related papers (2024-07-23T17:50:43Z)
- OpenAgents: An Open Platform for Language Agents in the Wild [71.16800991568677]
We present OpenAgents, an open platform for using and hosting language agents in the wild of everyday life.
We elucidate the challenges and opportunities, aspiring to set a foundation for future research and development of real-world language agents.
arXiv Detail & Related papers (2023-10-16T17:54:53Z)
- Conversational Health Agents: A Personalized LLM-Powered Agent Framework [1.4597673707346281]
Conversational Health Agents (CHAs) are interactive systems that provide healthcare services, such as assistance and diagnosis.
We propose openCHA, an open-source framework to empower conversational agents to generate a personalized response for users' healthcare queries.
openCHA includes an orchestrator to plan and execute actions for gathering information from external sources.
arXiv Detail & Related papers (2023-10-03T18:54:10Z)
- Agents: An Open-source Framework for Autonomous Language Agents [98.91085725608917]
We consider language agents as a promising direction towards artificial general intelligence.
We release Agents, an open-source library with the goal of opening up these advances to a wider non-specialist audience.
arXiv Detail & Related papers (2023-09-14T17:18:25Z)
- ChatDev: Communicative Agents for Software Development [84.90400377131962]
ChatDev is a chat-powered software development framework in which specialized agents are guided in what to communicate.
These agents actively contribute to the design, coding, and testing phases through unified language-based communication.
arXiv Detail & Related papers (2023-07-16T02:11:34Z)
- UKP-SQUARE: An Online Platform for Question Answering Research [50.35348764297317]
We present UKP-SQUARE, an online QA platform for researchers.
It allows users to query and analyze a large collection of modern Skills via a user-friendly web interface and integrated tests.
arXiv Detail & Related papers (2022-03-25T15:00:24Z)
- Deep Learning Tools for Audacity: Helping Researchers Expand the Artist's Toolkit [8.942168855247548]
We present a software framework that integrates neural networks into the popular open-source audio editing software, Audacity.
We showcase some example use cases for both end-users and neural network developers.
arXiv Detail & Related papers (2021-10-25T23:56:38Z)
- SpeechBrain: A General-Purpose Speech Toolkit [73.0404642815335]
SpeechBrain is an open-source and all-in-one speech toolkit.
It is designed to facilitate the research and development of neural speech processing technologies.
It achieves competitive or state-of-the-art performance in a wide range of speech benchmarks.
arXiv Detail & Related papers (2021-06-08T18:22:56Z)