NFDI4DS Shared Tasks for Scholarly Document Processing
- URL: http://arxiv.org/abs/2509.22141v1
- Date: Fri, 26 Sep 2025 10:01:24 GMT
- Title: NFDI4DS Shared Tasks for Scholarly Document Processing
- Authors: Raia Abu Ahmad, Rana Abdulla, Tilahun Abedissa Taffa, Soeren Auer, Hamed Babaei Giglou, Ekaterina Borisova, Zongxiong Chen, Stefan Dietze, Jennifer DSouza, Mayra Elwes, Genet-Asefa Gesese, Shufan Jiang, Ekaterina Kutafina, Philipp Mayr, Georg Rehm, Sameer Sadruddin, Sonja Schimmler, Daniel Schneider, Kanishka Silva, Sharmila Upadhyaya, Ricardo Usbeck,
- Abstract summary: This paper presents an updated overview of twelve shared tasks developed and hosted under the German National Research Data Infrastructure for Data Science and Artificial Intelligence (NFDI4DS) consortium.<n>The tasks foster methodological innovations and contribute open-access datasets, models, and tools for the broader research community.
- Score: 11.143617103011842
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Shared tasks are powerful tools for advancing research through community-based standardised evaluation. As such, they play a key role in promoting findable, accessible, interoperable, and reusable (FAIR), as well as transparent and reproducible research practices. This paper presents an updated overview of twelve shared tasks developed and hosted under the German National Research Data Infrastructure for Data Science and Artificial Intelligence (NFDI4DS) consortium, covering a diverse set of challenges in scholarly document processing. Hosted at leading venues, the tasks foster methodological innovations and contribute open-access datasets, models, and tools for the broader research community, which are integrated into the consortium's research data infrastructure.
Related papers
- Understanding Usage and Engagement in AI-Powered Scientific Research Tools: The Asta Interaction Dataset [47.98539809308384]
We analyze the Asta Interaction dataset, a large-scale resource comprising over 200,000 user queries and interaction logs.<n>We characterize query patterns, engagement behaviors, and how usage evolves with experience.<n>We release the anonymized dataset and analysis with a new query taxonomy to inform future designs of real-world AI research assistants.
arXiv Detail & Related papers (2026-02-26T18:40:28Z) - ResearchArcade: Graph Interface for Academic Tasks [25.4720945876883]
We introduce ResearchArcade, a graph-based interface that connects multiple academic data sources.<n>ResearchArcade unifies diverse academic task definitions and supports various models with distinct input requirements.<n>Our experiments across six academic tasks demonstrate that combining cross-source and multi-modal information enables a broader range of tasks.
arXiv Detail & Related papers (2025-11-27T02:42:21Z) - Deep Research: A Systematic Survey [118.82795024422722]
Deep Research (DR) aims to combine the reasoning capabilities of large language models with external tools, such as search engines.<n>This survey presents a comprehensive and systematic overview of deep research systems.
arXiv Detail & Related papers (2025-11-24T15:28:28Z) - WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents [72.28593628378991]
WebResearcher is an iterative deep-research paradigm that reformulates deep research as a Markov Decision Process.<n>WebResearcher achieves state-of-the-art performance, even surpassing frontier proprietary systems.
arXiv Detail & Related papers (2025-09-16T17:57:17Z) - Research Knowledge Graphs in NFDI4DataScience: Key Activities, Achievements, and Future Directions [4.258678191793365]
NFDI4DataScience is developing and providing Research Knowledge Graphs (RKGs)<n>RKGs aim to capture and connect complex datasets, models, software, and scientific publications.
arXiv Detail & Related papers (2025-08-04T11:11:51Z) - From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents [96.65646344634524]
Large Language Models (LLMs), endowed with reasoning and agentic capabilities, are ushering in a new paradigm termed Agentic Deep Research.<n>We trace the evolution from static web search to interactive, agent-based systems that plan, explore, and learn.<n>We demonstrate that Agentic Deep Research not only significantly outperforms existing approaches, but is also poised to become the dominant paradigm for future information seeking.
arXiv Detail & Related papers (2025-06-23T17:27:19Z) - DiscoveryBench: Towards Data-Driven Discovery with Large Language Models [50.36636396660163]
We present DiscoveryBench, the first comprehensive benchmark that formalizes the multi-step process of data-driven discovery.
Our benchmark contains 264 tasks collected across 6 diverse domains, such as sociology and engineering.
Our benchmark, thus, illustrates the challenges in autonomous data-driven discovery and serves as a valuable resource for the community to make progress.
arXiv Detail & Related papers (2024-07-01T18:58:22Z) - Research information in the light of artificial intelligence: quality and data ecologies [0.0]
This paper presents multi- and interdisciplinary approaches for finding the appropriate AI technologies for research information.
Professional research information management (RIM) is becoming increasingly important as an expressly data-driven tool for researchers.
arXiv Detail & Related papers (2024-05-06T16:07:56Z) - Assessing Scientific Contributions in Data Sharing Spaces [64.16762375635842]
This paper introduces the SCIENCE-index, a blockchain-based metric measuring a researcher's scientific contributions.
To incentivize researchers to share their data, the SCIENCE-index is augmented to include a data-sharing parameter.
Our model is evaluated by comparing the distribution of its output for geographically diverse researchers to that of the h-index.
arXiv Detail & Related papers (2023-03-18T19:17:47Z) - Industry-Academia Research Collaboration in Software Engineering: The
Certus Model [13.021014899410684]
Building scalable and effective research collaborations in software engineering is known to be challenging.
This paper aims to understand what are the elements of a successful industry-academia collaboration that enable the culture of participative knowledge creation.
arXiv Detail & Related papers (2022-04-23T10:16:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.