Authors
Maximilian Schule, Harald Lang, Maximilian Springer, Alfons Kemper, Thomas Neumann, Stephan Gunnemann
Publication date
2021/7/6
Book
Proceedings of the 33rd International Conference on Scientific and Statistical Database Management
Pages
25-36
Description
In machine learning, continuously retraining a model guarantees accurate predictions based on the latest data as training input. But to retrieve the latest data from a database, time-consuming extraction is necessary as database systems have rarely been used for operations such as matrix algebra and gradient descent.
In this work, we demonstrate that SQL with recursive tables makes it possible to express a complete machine learning pipeline out of data preprocessing, model training and its validation. To facilitate the specification of loss functions, we extend the code-generating database system Umbra by an operator for automatic differentiation for use within recursive tables: With the loss function expressed in SQL as a lambda function, Umbra generates machine code for each partial derivative. We further use automatic differentiation for a dedicated gradient descent operator, which generates LLVM code to …
Total citations
2021202220232024110204
Scholar articles
M Schule, H Lang, M Springer, A Kemper, T Neumann… - Proceedings of the 33rd International Conference on …, 2021