AquiLLM: a RAG Tool for Capturing Tacit Knowledge in Research Groups
- URL: http://arxiv.org/abs/2508.05648v1
- Date: Fri, 25 Jul 2025 20:47:01 GMT
- Title: AquiLLM: a RAG Tool for Capturing Tacit Knowledge in Research Groups
- Authors: Chandler Campbell, Bernie Boscoe, Tuan Do,
- Abstract summary: Research groups face persistent challenges in capturing, storing, and retrieving knowledge that is distributed across team members.<n>AquiLLM is a lightweight, modular RAG system designed to meet the needs of research groups.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Research groups face persistent challenges in capturing, storing, and retrieving knowledge that is distributed across team members. Although structured data intended for analysis and publication is often well managed, much of a group's collective knowledge remains informal, fragmented, or undocumented--often passed down orally through meetings, mentoring, and day-to-day collaboration. This includes private resources such as emails, meeting notes, training materials, and ad hoc documentation. Together, these reflect the group's tacit knowledge--the informal, experience-based expertise that underlies much of their work. Accessing this knowledge can be difficult, requiring significant time and insider understanding. Retrieval-augmented generation (RAG) systems offer promising solutions by enabling users to query and generate responses grounded in relevant source material. However, most current RAG-LLM systems are oriented toward public documents and overlook the privacy concerns of internal research materials. We introduce AquiLLM (pronounced ah-quill-em), a lightweight, modular RAG system designed to meet the needs of research groups. AquiLLM supports varied document types and configurable privacy settings, enabling more effective access to both formal and informal knowledge within scholarly groups.
Related papers
- Structured Knowledge Representation through Contextual Pages for Retrieval-Augmented Generation [53.768256130061765]
PAGER is a page-driven autonomous knowledge representation framework for RAG.<n>It iteratively retrieves and refines relevant documents to populate each slot, ultimately constructing a coherent page.<n>Experiments show that PAGER consistently outperforms all RAG baselines.
arXiv Detail & Related papers (2026-01-14T11:44:31Z) - Marcel: A Lightweight and Open-Source Conversational Agent for University Student Support [2.1763238176533037]
The system aims to provide fast and personalized responses, while reducing workload of university staff.<n>We employ retrieval-augmented generation to ground answers in university resources and to provide users with verifiable, contextually relevant information.
arXiv Detail & Related papers (2025-07-18T14:09:45Z) - SpiritRAG: A Q&A System for Religion and Spirituality in the United Nations Archive [4.575515160275914]
We present SpiritRAG, an interactive Question Answering (Q&A) system based on Retrieval-Augmented Generation (RAG)<n>Built using 7,500 United Nations (UN) resolution documents related to R/S in the domains of health and education, SpiritRAG allows researchers and policymakers to conduct complex, context-sensitive database searches.<n>A pilot test and evaluation with domain experts on 100 manually composed questions demonstrates the practical value and usefulness of SpiritRAG.
arXiv Detail & Related papers (2025-07-06T13:54:54Z) - Machine Assistant with Reliable Knowledge: Enhancing Student Learning via RAG-based Retrieval [0.0]
Machine Assistant with Reliable Knowledge (MARK) is a retrieval-augmented question-answering system designed to support student learning.<n>System is built on a retrieval-augmented generation (RAG) framework, which integrates a curated knowledge base to ensure factual consistency.
arXiv Detail & Related papers (2025-06-28T22:17:27Z) - Prompting is not Enough: Exploring Knowledge Integration and Controllable Generation [89.65955788873532]
Open-domain question answering (OpenQA) represents a cornerstone in natural language processing (NLP)<n>We propose a novel framework named GenKI, which aims to improve the OpenQA performance by exploring Knowledge Integration and controllable Generation.
arXiv Detail & Related papers (2025-05-26T08:18:33Z) - KnowShiftQA: How Robust are RAG Systems when Textbook Knowledge Shifts in K-12 Education? [41.49674849980441]
Retrieval-Augmented Generation (RAG) systems show remarkable potential as question answering tools in the K-12 Education domain.<n> discrepancies between these textbooks and the parametric knowledge inherent in Large Language Models (LLMs) can undermine the effectiveness of RAG systems.<n>KnowShiftQA simulates these discrepancies by applying deliberate hypothetical knowledge updates to both answers and source documents.
arXiv Detail & Related papers (2024-12-12T06:38:40Z) - StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization [94.31508613367296]
Retrieval-augmented generation (RAG) is a key means to effectively enhance large language models (LLMs)
We propose StructRAG, which can identify the optimal structure type for the task at hand, reconstruct original documents into this structured format, and infer answers based on the resulting structure.
Experiments show that StructRAG achieves state-of-the-art performance, particularly excelling in challenging scenarios.
arXiv Detail & Related papers (2024-10-11T13:52:44Z) - A Survey of Generative Information Retrieval [25.1249210843116]
Generative Retrieval (GR) is an emerging paradigm in information retrieval that leverages generative models to map queries to relevant document identifiers (DocIDs) without the need for traditional query processing or document reranking.
This survey provides a comprehensive overview of GR, highlighting key developments, indexing and retrieval strategies, and challenges.
arXiv Detail & Related papers (2024-06-03T10:59:33Z) - A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language Models [71.25225058845324]
Large Language Models (LLMs) have demonstrated revolutionary abilities in language understanding and generation.
Retrieval-Augmented Generation (RAG) can offer reliable and up-to-date external knowledge.
RA-LLMs have emerged to harness external and authoritative knowledge bases, rather than relying on the model's internal knowledge.
arXiv Detail & Related papers (2024-05-10T02:48:45Z) - REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering [115.72130322143275]
REAR is a RElevance-Aware Retrieval-augmented approach for open-domain question answering (QA)
We develop a novel architecture for LLM-based RAG systems, by incorporating a specially designed assessment module.
Experiments on four open-domain QA tasks show that REAR significantly outperforms previous a number of competitive RAG approaches.
arXiv Detail & Related papers (2024-02-27T13:22:51Z) - DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain
Question Answering over Knowledge Base and Text [73.68051228972024]
Large Language Models (LLMs) have exhibited impressive generation capabilities, but they suffer from hallucinations when relying on their internal knowledge.
Retrieval-augmented LLMs have emerged as a potential solution to ground LLMs in external knowledge.
arXiv Detail & Related papers (2023-10-31T04:37:57Z) - Synergistic Interplay between Search and Large Language Models for
Information Retrieval [141.18083677333848]
InteR allows RMs to expand knowledge in queries using LLM-generated knowledge collections.
InteR achieves overall superior zero-shot retrieval performance compared to state-of-the-art methods.
arXiv Detail & Related papers (2023-05-12T11:58:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.