Centrala begrepp
This paper introduces a domain-specific language (DSL) and a modeling framework to enable the definition and enactment of machine learning engineering processes within organizations.
Sammanfattning
The paper presents a framework for modeling machine learning (ML) engineering processes, built around a domain-specific language (DSL). The key highlights are:
The framework is designed to address the challenges of developing ML-based systems, which require multidisciplinary teams with diverse skill sets. Process models can help standardize task orchestration, provide a common language, and foster collaboration.
The DSL combines standard process modeling concepts with AI-specific primitives, based on an analysis of scientific and industry literature on ML engineering practices. It covers aspects such as business understanding, data preparation, AI modeling, and operations.
The DSL defines constructs for modeling AI-specific artifacts (e.g., datasets, models), roles (e.g., data scientists, model operators), and activities (e.g., feature engineering, model deployment).
The framework includes a modeling editor based on the DSL, a BPMN converter to integrate with standard workflow platforms, and an HTML documentation generator to communicate the process information.
The applicability of the framework is demonstrated through a case study modeling the Microsoft Team Data Science Process (TDSP) using the provided tools.
The proposed framework aims to enable organizations to define and enact their own ML engineering processes in a structured and standardized manner.
Statistik
"The development of Machine Learning (ML) based systems is complex and requires multidisciplinary teams with diverse skill sets."
"Process models can alleviate these challenges by standardizing task orchestration, providing a common language to facilitate communication, and nurturing a collaborative environment."
"Current process modeling languages are not suitable for describing the development of such systems."
Citat
"This modeling framework is built around a domain-specific language (DSL) that combines standard process modeling concepts with AI-specific process primitives."
"A DSL provides a shared language in a particular problem space that fosters communication and collaboration between all stakeholders."
"The framework includes the identification of roles and their functions, along with the assignment of responsibilities towards the different activities of the new system."