Kani: A Lightweight and Highly Hackable Framework for Building Language
Model Applications
- URL: http://arxiv.org/abs/2309.05542v1
- Date: Mon, 11 Sep 2023 15:27:59 GMT
- Title: Kani: A Lightweight and Highly Hackable Framework for Building Language
Model Applications
- Authors: Andrew Zhu, Liam Dugan, Alyssa Hwang, Chris Callison-Burch
- Abstract summary: Kani is a lightweight, flexible, and model-agnostic open-source framework for building language model applications.
Kani helps developers implement a variety of complex features by supporting the core building blocks of chat interaction: model interfacing, chat management, and robust function calling.
- Score: 40.800025261168265
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Language model applications are becoming increasingly popular and complex,
often including features like tool usage and retrieval augmentation. However,
existing frameworks for such applications are often opinionated, deciding for
developers how their prompts ought to be formatted and imposing limitations on
customizability and reproducibility. To solve this we present Kani: a
lightweight, flexible, and model-agnostic open-source framework for building
language model applications. Kani helps developers implement a variety of
complex features by supporting the core building blocks of chat interaction:
model interfacing, chat management, and robust function calling. All Kani core
functions are easily overridable and well documented to empower developers to
customize functionality for their own needs. Kani thus serves as a useful tool
for researchers, hobbyists, and industry professionals alike to accelerate
their development while retaining interoperability and fine-grained control.
Related papers
- CMULAB: An Open-Source Framework for Training and Deployment of Natural Language Processing Models [59.91221728187576]
This paper introduces the CMU Linguistic Linguistic Backend, an open-source framework that simplifies model deployment and continuous human-in-the-loop fine-tuning of NLP models.
CMULAB enables users to leverage the power of multilingual models to quickly adapt and extend existing tools for speech recognition, OCR, translation, and syntactic analysis to new languages.
arXiv Detail & Related papers (2024-04-03T02:21:46Z) - Arcee's MergeKit: A Toolkit for Merging Large Language Models [0.6374098147778188]
MergeKit is a framework to efficiently merge models on any hardware.
To date, thousands of models have been merged by the open-source community.
arXiv Detail & Related papers (2024-03-20T02:38:01Z) - AgentScope: A Flexible yet Robust Multi-Agent Platform [66.64116117163755]
AgentScope is a developer-centric multi-agent platform with message exchange as its core communication mechanism.
The abundant syntactic tools, built-in agents and service functions, user-friendly interfaces for application demonstration and utility monitor, zero-code programming workstation, and automatic prompt tuning mechanism significantly lower the barriers to both development and deployment.
arXiv Detail & Related papers (2024-02-21T04:11:28Z) - Lemur: Harmonizing Natural Language and Code for Language Agents [107.44942854856802]
We introduce Lemur and Lemur-Chat, open-source language models optimized for both natural language and coding capabilities.
Our models achieve state-of-the-art averaged performance across diverse text and coding benchmarks.
The harmonization between natural and programming languages enables Lemur-Chat to significantly narrow the gap with proprietary models on agent abilities.
arXiv Detail & Related papers (2023-10-10T17:57:45Z) - ModelScope-Agent: Building Your Customizable Agent System with
Open-source Large Language Models [74.64651681052628]
We introduce ModelScope-Agent, a customizable agent framework for real-world applications based on open-source LLMs as controllers.
It provides a user-friendly system library, with customizable engine design to support model training on multiple open-source LLMs.
A comprehensive framework has been proposed spanning over tool-use data collection, tool retrieval, tool registration, memory control, customized model training, and evaluation.
arXiv Detail & Related papers (2023-09-02T16:50:30Z) - Gentopia: A Collaborative Platform for Tool-Augmented LLMs [21.09079715807735]
We present gentopia, an Augmented Language Models (ALMs) framework enabling flexible customization of agents through simple configurations.
We also establish gentpool, a public platform enabling the registration and sharing of user-customized agents.
gentbench, an integral component of gentpool, is designed to thoroughly evaluate user-customized agents across diverse aspects such as safety, robustness, efficiency, etc.
arXiv Detail & Related papers (2023-08-08T04:12:29Z) - i-Code Studio: A Configurable and Composable Framework for Integrative
AI [93.74891865028867]
We propose the i-Code Studio, a flexible and composable framework for Integrative AI.
The i-Code Studio orchestrates multiple pre-trained models in a fine-tuning-free fashion to conduct complex multimodal tasks.
The i-Code Studio achieves impressive results on a variety of zero-shot multimodal tasks, such as video-to-text retrieval, speech-to-speech translation, and visual question answering.
arXiv Detail & Related papers (2023-05-23T06:45:55Z) - Flowstorm: Open-Source Platform with Hybrid Dialogue Architecture [0.9236074230806579]
Flowstorm is an open-source project suitable for creating, running, and analyzing conversational applications.
Thanks to the fast and fully automated build process, the dialogues created within the platform can be executed in seconds.
We propose a novel dialogue architecture that uses a combination of tree structures with generative models.
arXiv Detail & Related papers (2022-12-19T11:27:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.