The content discusses the need for efficiently serving deep learning (DL) models on relational data across diverse commercial and scientific domains. It highlights three pivotal architectural paradigms: the state-of-the-art DL-centric architecture, the potential UDF-centric architecture, and the potential relation-centric architecture.
The DL-centric architecture offloads DL computations to dedicated DL frameworks, leading to significant cross-system overheads. The UDF-centric architecture encapsulates tensor computations into User Defined Functions (UDFs) within the relational database management system (RDBMS), but lacks flexibility in optimizing the operations within the UDF. The relation-centric architecture represents large-scale tensor computations through relational operators, facilitating co-optimization with relational processing, but may incur higher latency for small model inferences.
The content argues for a novel RDBMS design that seamlessly integrates these three architectural paradigms and various middle grounds in between. The key components of this envisioned system include:
The proposed system aims to enhance productivity, deliver superior performance for a broad class of inference applications, and effectively avoid cross-system overheads.
Na inny język
z treści źródłowej
arxiv.org
Głębsze pytania