Designing Multi-Step Action Models for Enterprise AI Adoption
- URL: http://arxiv.org/abs/2403.14645v1
- Date: Wed, 21 Feb 2024 18:37:13 GMT
- Title: Designing Multi-Step Action Models for Enterprise AI Adoption
- Authors: Shreyash Mishra, Shrey Shah, Rex Pereira
- Abstract summary: This paper introduces the Multi-Step Action Model (MSAM), a closed-source AI model designed by Empsing to address challenges hindering AI adoption in enterprises.
It evaluates MSAM's performance via rigorous testing methodologies and envisions its potential impact on advancing AI adoption within organizations.
- Score: 1.3741740819088444
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper introduces the Multi-Step Action Model (MSAM), a closed-source AI model designed by Empsing to address challenges hindering AI adoption in enterprises. Through a holistic examination, this paper explores MSAM's foundational principles, design architecture, and future trajectory. It evaluates MSAM's performance via rigorous testing methodologies and envisions its potential impact on advancing AI adoption within organizations.
Related papers
- Assessing AI Adoption and Digitalization in SMEs: A Framework for Implementation [0.0]
There is a significant gap between SMEs and large corporations in their use of AI.
This study identifies critical drivers and obstacles to achieving intelligent transformation.
It proposes a framework model to address key challenges and provide actionable guidelines.
arXiv Detail & Related papers (2025-01-14T15:10:25Z) - On the Modeling Capabilities of Large Language Models for Sequential Decision Making [52.128546842746246]
Large pretrained models are showing increasingly better performance in reasoning and planning tasks.
We evaluate their ability to produce decision-making policies, either directly, by generating actions, or indirectly, by first modeling rewards.
In environments with unfamiliar dynamics, we explore how fine-tuning LLMs with synthetic data can significantly improve their reward modeling capabilities.
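As a rough illustration of the direct-versus-indirect distinction summarized above (the paper provides no code; the `query_llm` helper and the prompts below are hypothetical stand-ins for any chat-completion client):

```python
# Hypothetical sketch of the two evaluation modes described above.
# `query_llm` stands in for an arbitrary LLM client; it is not a real library call.

def query_llm(prompt: str) -> str:
    raise NotImplementedError("plug in your own LLM client here")

def direct_policy(observation: str, actions: list[str]) -> str:
    """Direct mode: the LLM picks the next action itself."""
    prompt = (
        f"Observation: {observation}\n"
        f"Available actions: {actions}\n"
        "Reply with the single best action."
    )
    return query_llm(prompt).strip()

def reward_model_score(observation: str, action: str) -> float:
    """Indirect mode: the LLM scores (observation, action) pairs,
    and a separate RL learner is trained against these scores."""
    prompt = (
        f"Observation: {observation}\nCandidate action: {action}\n"
        "Rate this action from 0 (bad) to 10 (good). Reply with a number only."
    )
    return float(query_llm(prompt).strip())
```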
arXiv Detail & Related papers (2024-10-08T03:12:57Z) - Data Analysis in the Era of Generative AI [56.44807642944589]
This paper explores the potential of AI-powered tools to reshape data analysis, focusing on design considerations and challenges.
We explore how the emergence of large language and multimodal models offers new opportunities to enhance various stages of data analysis workflow.
We then examine human-centered design principles that facilitate intuitive interactions, build user trust, and streamline the AI-assisted analysis workflow across multiple apps.
arXiv Detail & Related papers (2024-09-27T06:31:03Z) - EARBench: Towards Evaluating Physical Risk Awareness for Task Planning of Foundation Model-based Embodied AI Agents [53.717918131568936]
Embodied artificial intelligence (EAI) integrates advanced AI models into physical entities for real-world interaction.
Foundation models as the "brain" of EAI agents for high-level task planning have shown promising results.
However, the deployment of these agents in physical environments presents significant safety challenges.
This study introduces EARBench, a novel framework for automated physical risk assessment in EAI scenarios.
arXiv Detail & Related papers (2024-08-08T13:19:37Z) - MFE-ETP: A Comprehensive Evaluation Benchmark for Multi-modal Foundation Models on Embodied Task Planning [50.45558735526665]
We provide an in-depth and comprehensive evaluation of the performance of MFMs on embodied task planning.
We propose a new benchmark, named MFE-ETP, characterized by its complex and variable task scenarios.
Using the benchmark and evaluation platform, we evaluated several state-of-the-art MFMs and found that they significantly lag behind human-level performance.
arXiv Detail & Related papers (2024-07-06T11:07:18Z) - Science based AI model certification for new operational environments with application in traffic state estimation [1.2186759689780324]
The expanding role of Artificial Intelligence (AI) in diverse engineering domains highlights the challenges associated with deploying AI models in new operational environments.
This paper proposes a science-based certification methodology to assess the viability of employing pre-trained data-driven models in new operational environments.
arXiv Detail & Related papers (2024-05-13T16:28:00Z) - Science based AI model certification for untrained operational environments with application in traffic state estimation [1.2186759689780324]
The expanding role of Artificial Intelligence (AI) in diverse engineering domains highlights the challenges associated with deploying AI models in new operational environments.
This paper proposes a science-based certification methodology to assess the viability of employing pre-trained data-driven models in untrained operational environments.
arXiv Detail & Related papers (2024-03-21T03:01:25Z) - An HCAI Methodological Framework (HCAI-MF): Putting It Into Action to Enable Human-Centered AI [8.094008212925598]
Human-centered artificial intelligence (HCAI) is a design philosophy that prioritizes humans in the design, development, deployment, and use of AI systems.
Despite its growing prominence in literature, the lack of methodological guidance for its implementation poses challenges to HCAI practice.
This paper proposes a comprehensive HCAI methodological framework (HCAI-MF) comprising five key components.
arXiv Detail & Related papers (2023-11-27T17:40:49Z) - Let's reward step by step: Step-Level reward model as the Navigators for Reasoning [64.27898739929734]
Process-Supervised Reward Model (PRM) furnishes LLMs with step-by-step feedback during the training phase.
We propose a greedy search algorithm that employs the step-level feedback from PRM to optimize the reasoning pathways explored by LLMs.
To explore the versatility of our approach, we develop a novel method to automatically generate a step-level reward dataset for coding tasks and observe similarly improved performance on code generation tasks.
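A minimal sketch of how step-level feedback can drive a greedy search over reasoning paths; `propose_steps` and `prm_score` are hypothetical placeholders for an LLM step generator and a Process-Supervised Reward Model, not an API from the paper:

```python
# Greedy search over reasoning steps guided by a step-level reward model (sketch).
from typing import Callable, List

def greedy_prm_search(
    question: str,
    propose_steps: Callable[[str, List[str]], List[str]],  # proposes k candidate next steps
    prm_score: Callable[[str, List[str], str], float],     # scores one candidate step
    max_steps: int = 8,
) -> List[str]:
    path: List[str] = []
    for _ in range(max_steps):
        candidates = propose_steps(question, path)
        if not candidates:
            break
        # Keep only the candidate the PRM rates highest; discard the rest.
        best = max(candidates, key=lambda step: prm_score(question, path, step))
        path.append(best)
        if best.lower().startswith("final answer"):
            break
    return path
```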
arXiv Detail & Related papers (2023-10-16T05:21:50Z) - On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective [54.38373782121503]
A Foundation Decision Model (FDM) can be developed by formulating diverse decision-making tasks as sequence decoding tasks.
We present a case study demonstrating our FDM implementation, DigitalBrain (DB1) with 1.3 billion parameters, achieving human-level performance in 870 tasks.
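To make the "decision-making as sequence decoding" idea concrete, here is a toy sketch of flattening trajectories from different tasks into a single token stream for a decoder-only model; the tokenization scheme is invented for illustration and is not DB1's actual format:

```python
# Toy illustration of casting decision-making as sequence decoding (not DB1's real format).
# Each trajectory becomes a flat token sequence: <task> obs act reward obs act reward ... <eos>

def encode_trajectory(task_id: str, trajectory: list[tuple[str, str, float]]) -> list[str]:
    tokens = [f"<task:{task_id}>"]
    for obs, action, reward in trajectory:
        tokens += [f"<obs:{obs}>", f"<act:{action}>", f"<rew:{reward:.1f}>"]
    tokens.append("<eos>")
    return tokens

# Trajectories from very different tasks share one vocabulary and one decoder,
# which is what lets a single foundation decision model cover many tasks.
example = encode_trajectory("cartpole", [("upright", "push_left", 1.0), ("tilting", "push_right", 1.0)])
print(example)
```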
arXiv Detail & Related papers (2022-12-24T06:16:45Z) - Developing and Operating Artificial Intelligence Models in Trustworthy Autonomous Systems [8.27310353898034]
This work-in-progress paper aims to close the gap between the development and operation of AI-based AS.
We propose a novel, holistic DevOps approach to put it into practice.
arXiv Detail & Related papers (2020-03-11T17:52:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of the information provided and is not responsible for any consequences of its use.