VayuChat: An LLM-Powered Conversational Interface for Air Quality Data Analytics
- URL: http://arxiv.org/abs/2511.01046v1
- Date: Sun, 02 Nov 2025 18:45:17 GMT
- Title: VayuChat: An LLM-Powered Conversational Interface for Air Quality Data Analytics
- Authors: Vedant Acharya, Abhay Pisharodi, Rishabh Mondal, Mohammad Rafiuddin, Nipun Batra,
- Abstract summary: VayuChat is a conversational system that answers natural language questions on air quality, meteorology, and policy programs.<n>Our live demonstration will show how users can perform complex environmental analytics through simple conversations.
- Score: 1.3048920509133808
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Air pollution causes about 1.6 million premature deaths each year in India, yet decision makers struggle to turn dispersed data into decisions. Existing tools require expertise and provide static dashboards, leaving key policy questions unresolved. We present VayuChat, a conversational system that answers natural language questions on air quality, meteorology, and policy programs, and responds with both executable Python code and interactive visualizations. VayuChat integrates data from Central Pollution Control Board (CPCB) monitoring stations, state-level demographics, and National Clean Air Programme (NCAP) funding records into a unified interface powered by large language models. Our live demonstration will show how users can perform complex environmental analytics through simple conversations, making data science accessible to policymakers, researchers, and citizens. The platform is publicly deployed at https://huggingface.co/spaces/SustainabilityLabIITGN/ VayuChat. For further information check out video uploaded on https://www.youtube.com/watch?v=d6rklL05cs4.
Related papers
- ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems [57.806797579986075]
We introduce an open-source, user-friendly toolkit to build unified web interfaces for various cascaded and E2E spoken dialogue systems.<n>Using the evaluation metrics, we compare various cascaded and E2E spoken dialogue systems with a human-human conversation dataset as a proxy.<n>Our analysis demonstrates that the toolkit allows researchers to effortlessly compare and contrast different technologies.
arXiv Detail & Related papers (2025-03-11T15:24:02Z) - Use of Air Quality Sensor Network Data for Real-time Pollution-Aware POI Suggestion [10.782779065468558]
This demo paper introduces AirSense-R, a privacy-preserving mobile application that delivers real-time, pollution-aware recommendations forPOIs.<n>By merging live air quality data from AirSENCE sensor networks with user preferences, the system enables health-conscious decision-making.
arXiv Detail & Related papers (2025-02-13T10:36:17Z) - VayuBuddy: an LLM-Powered Chatbot to Democratize Air Quality Insights [2.2754055137802074]
VayuBuddy is a Large Language Model (LLM)-powered chatbots for air quality sensor data analysis.
VyuBuddy receives the questions in natural language, analyses the structured sensory data with a LLM-generated Python code and provides answers in natural language.
VyuBuddy can also generate visual analysis such as line-plots, map plot, bar charts and many others from the sensory data.
arXiv Detail & Related papers (2024-11-16T08:02:35Z) - AutoConv: Automatically Generating Information-seeking Conversations
with Large Language Models [74.10293412011455]
We propose AutoConv for synthetic conversation generation.
Specifically, we formulate the conversation generation problem as a language modeling task.
We finetune an LLM with a few human conversations to capture the characteristics of the information-seeking process.
arXiv Detail & Related papers (2023-08-12T08:52:40Z) - Demonstrations of the Potential of AI-based Political Issue Polling [0.0]
We develop a prompt engineering methodology for eliciting human-like survey responses from ChatGPT.
We execute large scale experiments, querying for thousands of simulated responses at a cost far lower than human surveys.
We find ChatGPT is effective at anticipating both the mean level and distribution of public opinion on a variety of policy issues.
But it is less successful at anticipating demographic-level differences.
arXiv Detail & Related papers (2023-07-10T12:17:15Z) - Enhancing Chat Language Models by Scaling High-quality Instructional
Conversations [91.98516412612739]
We first provide a systematically designed, diverse, informative, large-scale dataset of instructional conversations, UltraChat.
Our objective is to capture the breadth of interactions that a human might have with an AI assistant.
We fine-tune a LLaMA model to create a powerful conversational model, UltraLLaMA.
arXiv Detail & Related papers (2023-05-23T16:49:14Z) - GenSpectrum Chat: Data Exploration in Public Health Using Large Language
Models [2.9823962001574187]
The COVID-19 pandemic highlighted the importance of making epidemiological data easily accessible and explorable.
We developed the "GenSpectrum Chat" which uses GPT-4 as the underlying large language model (LLM) to explore SARS-CoV-2 genomic sequencing data.
arXiv Detail & Related papers (2023-05-23T08:43:43Z) - VideoChat: Chat-Centric Video Understanding [80.63932941216129]
We develop an end-to-end chat-centric video understanding system, coined as VideoChat.
It integrates video foundation models and large language models via a learnable neural interface.
Preliminary qualitative experiments demonstrate the potential of our system across a broad spectrum of video applications.
arXiv Detail & Related papers (2023-05-10T17:59:04Z) - Training Conversational Agents with Generative Conversational Networks [74.9941330874663]
We use Generative Conversational Networks to automatically generate data and train social conversational agents.
We evaluate our approach on TopicalChat with automatic metrics and human evaluators, showing that with 10% of seed data it performs close to the baseline that uses 100% of the data.
arXiv Detail & Related papers (2021-10-15T21:46:39Z) - Conversations with Search Engines: SERP-based Conversational Response
Generation [77.1381159789032]
We create a suitable dataset, the Search as a Conversation (SaaC) dataset, for the development of pipelines for conversations with search engines.
We also develop a state-of-the-art pipeline for conversations with search engines, the Conversations with Search Engines (CaSE) using this dataset.
CaSE enhances the state-of-the-art by introducing a supporting token identification module and aprior-aware pointer generator.
arXiv Detail & Related papers (2020-04-29T13:07:53Z) - A Scalable Chatbot Platform Leveraging Online Community Posts: A
Proof-of-Concept Study [4.623392924486831]
We verify the feasibility of constructing a data-driven chatbots with processed online community posts by using them as pseudo-conversational data.
We argue that chatbots for various purposes can be built extensively through the pipeline exploiting the common structure of community posts.
arXiv Detail & Related papers (2020-01-10T01:45:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.