A Data Literacy Competence Model for Higher Education and Research
- URL: http://arxiv.org/abs/2504.15690v1
- Date: Tue, 22 Apr 2025 08:14:23 GMT
- Title: A Data Literacy Competence Model for Higher Education and Research
- Authors: Martina M. Echtenbruck, Simone Fühles-Ubach, Boris Naujoks, Elisabeth Kaliva,
- Abstract summary: Data Literacy Initiative (DaLI) at TH K"oln develops competence model for promoting data literacy in higher education.<n>Based on interdisciplinary collaboration and empirical research, the DaLI model defines seven overarching competence areas.<n>Intended for use across disciplines, the model supports the strategic integration of data literacy into university programs.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In an increasingly data-driven world, the ability to understand, interpret, and use data - data literacy - is emerging as a critical competence across all academic disciplines. The Data Literacy Initiative (DaLI) at TH K\"oln addresses this need by developing a comprehensive competence model for promoting data literacy in higher education. Based on interdisciplinary collaboration and empirical research, the DaLI model defines seven overarching competence areas: "Establish Data Culture", "Provide Data", "Manage Data", "Analyze Data", "Evaluate Data", "Interpret Data", and "Publish Data". Each area is further detailed by specific competence dimensions and progression levels, providing a structured framework for curriculum design, teaching, and assessment. Intended for use across disciplines, the model supports the strategic integration of data literacy into university programs. By providing a common language and orientation for educators and institutions, the DaLI model contributes to the broader goal of preparing students to navigate and shape a data-informed society.
Related papers
- Deep Learning within Tabular Data: Foundations, Challenges, Advances and Future Directions [4.795774784702568]
Tabular data remains one of the most prevalent data types across a wide range of real-world applications.<n>Yet effective representation learning for this domain poses unique challenges due to its irregular patterns, heterogeneous feature distributions, and complex inter-column dependencies.
arXiv Detail & Related papers (2025-01-07T05:23:36Z) - LLMs for Generalizable Language-Conditioned Policy Learning under Minimal Data Requirements [50.544186914115045]
This paper presents TEDUO, a novel training pipeline for offline language-conditioned policy learning.<n>TEDUO operates on easy-to-obtain, unlabeled datasets and is suited for the so-called in-the-wild evaluation, wherein the agent encounters previously unseen goals and states.
arXiv Detail & Related papers (2024-12-09T18:43:56Z) - Effective Data Stewardship in Higher Education: Skills, Competences, and the Emerging Role of Open Data Stewards [0.0]
The significance of open data in higher education stems from the changing tendencies towards open science.
The paper proposes a structured training framework and comprehensive curriculum for data stewardship.
arXiv Detail & Related papers (2024-10-27T07:37:36Z) - Data-Centric AI in the Age of Large Language Models [51.20451986068925]
This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs)
We make the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs.
We identify four specific scenarios centered around data, covering data-centric benchmarks and data curation, data attribution, knowledge transfer, and inference contextualization.
arXiv Detail & Related papers (2024-06-20T16:34:07Z) - A Review of Data Mining in Personalized Education: Current Trends and
Future Prospects [30.033926908231297]
This paper focuses on four scenarios: educational recommendation, cognitive diagnosis, knowledge tracing, and learning analysis.
The integration of AI in educational platforms provides insights into academic performance, learning preferences, and behaviors, optimizing the personal learning process.
arXiv Detail & Related papers (2024-02-27T06:09:48Z) - A Survey on Data Selection for Language Models [148.300726396877]
Data selection methods aim to determine which data points to include in a training dataset.
Deep learning is mostly driven by empirical evidence and experimentation on large-scale data is expensive.
Few organizations have the resources for extensive data selection research.
arXiv Detail & Related papers (2024-02-26T18:54:35Z) - On Responsible Machine Learning Datasets with Fairness, Privacy, and Regulatory Norms [56.119374302685934]
There have been severe concerns over the trustworthiness of AI technologies.
Machine and deep learning algorithms depend heavily on the data used during their development.
We propose a framework to evaluate the datasets through a responsible rubric.
arXiv Detail & Related papers (2023-10-24T14:01:53Z) - Big Data and Analytics Implementation in Tertiary Institutions to
Predict Students Performance in Nigeria [0.0]
The term Big Data has been coined to refer to the gargantuan bulk of data that cannot be dealt with by traditional data-handling techniques.
This paper explores the attributes of big data that are relevant to educational institutions.
It investigates the factors influencing the adoption of big data and analytics in learning institutions.
arXiv Detail & Related papers (2022-07-29T13:52:24Z) - DataPerf: Benchmarks for Data-Centric AI Development [81.03754002516862]
DataPerf is a community-led benchmark suite for evaluating ML datasets and data-centric algorithms.
We provide an open, online platform with multiple rounds of challenges to support this iterative development.
The benchmarks, online evaluation platform, and baseline implementations are open source.
arXiv Detail & Related papers (2022-07-20T17:47:54Z) - Data Governance in the Age of Large-Scale Data-Driven Language
Technology [79.92626780294258]
This work proposes an approach to global language data governance that attempts to organize data management amongst stakeholders, values, and rights.
The framework we present is a multi-party international governance structure focused on language data, and incorporating technical and organizational tools needed to support its work.
arXiv Detail & Related papers (2022-05-04T00:44:35Z) - CateCom: a practical data-centric approach to categorization of
computational models [77.34726150561087]
We present an effort aimed at organizing the landscape of physics-based and data-driven computational models.
We apply object-oriented design concepts and outline the foundations of an open-source collaborative framework.
arXiv Detail & Related papers (2021-09-28T02:59:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.