Using Data Analytics to Derive Business Intelligence: A Case Study
- URL: http://arxiv.org/abs/2305.19021v1
- Date: Tue, 30 May 2023 13:21:12 GMT
- Title: Using Data Analytics to Derive Business Intelligence: A Case Study
- Authors: Ugochukwu Orji, Ezugwu Obianuju, Modesta Ezema, Chikodili Ugwuishiwu,
Elochukwu Ukwandu, Uchechukwu Agomuo
- Abstract summary: Big data analytics is already at the forefront of innovations to help make meaningful business decisions.
Business intelligence and analytics has become a huge trend in todays IT world.
This paper aims to demonstrate the data analytical process of deriving business intelligence via the historical data of a fictional bike share company.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The data revolution experienced in recent times has thrown up new challenges
and opportunities for businesses of all sizes in diverse industries. Big data
analytics is already at the forefront of innovations to help make meaningful
business decisions from the abundance of raw data available today. Business
intelligence and analytics has become a huge trend in todays IT world as
companies of all sizes are looking to improve their business processes and
scale up using data driven solutions. This paper aims to demonstrate the data
analytical process of deriving business intelligence via the historical data of
a fictional bike share company seeking to find innovative ways to convert their
casual riders to annual paying registered members. The dataset used is freely
available as Chicago Divvy Bicycle Sharing Data on Kaggle. The authors used the
RTidyverse library in RStudio to analyse the data and followed the six data
analysis steps of ask, prepare, process, analyse, share, and act to recommend
some actionable approaches the company could adopt to convert casual riders to
paying annual members. The findings from this research serve as a valuable case
example, of a real world deployment of BIA technologies in the industry, and a
demonstration of the data analysis cycle for data practitioners, researchers,
and other potential users.
Related papers
- Data Analysis in the Era of Generative AI [56.44807642944589]
This paper explores the potential of AI-powered tools to reshape data analysis, focusing on design considerations and challenges.
We explore how the emergence of large language and multimodal models offers new opportunities to enhance various stages of data analysis workflow.
We then examine human-centered design principles that facilitate intuitive interactions, build user trust, and streamline the AI-assisted analysis workflow across multiple apps.
arXiv Detail & Related papers (2024-09-27T06:31:03Z) - Data Issues in Industrial AI System: A Meta-Review and Research Strategy [10.540603300770885]
Artificial intelligence (AI) is assuming an increasingly pivotal role within industrial systems.
Despite the recent trend within various industries to adopt AI, the actual adoption of AI is not as developed as perceived.
How to address these data issues stands as a significant concern confronting both industry and academia.
arXiv Detail & Related papers (2024-06-22T08:36:59Z) - Data-Centric AI in the Age of Large Language Models [51.20451986068925]
This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs)
We make the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs.
We identify four specific scenarios centered around data, covering data-centric benchmarks and data curation, data attribution, knowledge transfer, and inference contextualization.
arXiv Detail & Related papers (2024-06-20T16:34:07Z) - Benchmarking Data Science Agents [11.582116078653968]
Large Language Models (LLMs) have emerged as promising aids as data science agents, assisting humans in data analysis and processing.
Yet their practical efficacy remains constrained by the varied demands of real-world applications and complicated analytical process.
We introduce DSEval -- a novel evaluation paradigm, as well as a series of innovative benchmarks tailored for assessing the performance of these agents.
arXiv Detail & Related papers (2024-02-27T03:03:06Z) - A Survey on Data Selection for Language Models [148.300726396877]
Data selection methods aim to determine which data points to include in a training dataset.
Deep learning is mostly driven by empirical evidence and experimentation on large-scale data is expensive.
Few organizations have the resources for extensive data selection research.
arXiv Detail & Related papers (2024-02-26T18:54:35Z) - Collaborative business intelligence virtual assistant [1.9953434933575993]
This study focuses on the applications of data mining within distributed virtual teams through the interaction of users and a CBI Virtual Assistant.
The proposed virtual assistant for CBI endeavors to enhance data exploration accessibility for a wider range of users and streamline the time and effort required for data analysis.
arXiv Detail & Related papers (2023-12-20T05:34:12Z) - Data Acquisition: A New Frontier in Data-centric AI [65.90972015426274]
We first present an investigation of current data marketplaces, revealing lack of platforms offering detailed information about datasets.
We then introduce the DAM challenge, a benchmark to model the interaction between the data providers and acquirers.
Our evaluation of the submitted strategies underlines the need for effective data acquisition strategies in Machine Learning.
arXiv Detail & Related papers (2023-11-22T22:15:17Z) - PADME-SoSci: A Platform for Analytics and Distributed Machine Learning
for the Social Sciences [4.294774517325059]
PADME is a distributed analytics tool that federates model implementation and training.
It enables the analysis of data across locations while still allowing the model to be trained as if all data were in a single location.
arXiv Detail & Related papers (2023-03-27T15:32:35Z) - OSOUM Framework for Trading Data Research [79.0383470835073]
We supply, to the best of our knowledge, the first open source simulation platform, Open SOUrce Market Simulator (OSOUM) to analyze trading markets and specifically data markets.
We describe and implement a specific data market model, consisting of two types of agents: sellers who own various datasets available for acquisition, and buyers searching for relevant and beneficial datasets for purchase.
Although commercial frameworks, intended for handling data markets, already exist, we provide a free and extensive end-to-end research tool for simulating possible behavior for both buyers and sellers participating in (data) markets.
arXiv Detail & Related papers (2021-02-18T09:20:26Z) - Explainable Patterns: Going from Findings to Insights to Support Data
Analytics Democratization [60.18814584837969]
We present Explainable Patterns (ExPatt), a new framework to support lay users in exploring and creating data storytellings.
ExPatt automatically generates plausible explanations for observed or selected findings using an external (textual) source of information.
arXiv Detail & Related papers (2021-01-19T16:13:44Z) - Towards an Integrated Platform for Big Data Analysis [4.5257812998381315]
This paper presents the vision of an integrated plat-form for big data analysis that combines all these aspects.
Main benefits of this approach are an enhanced scalability of the whole platform, a better parameterization of algorithms, and an improved usability during the end-to-end data analysis process.
arXiv Detail & Related papers (2020-04-27T03:15:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.