Data Management in Machine Learning Systems

Author:   Matthias Boehm ,  Arun Kumar ,  Jun Yang ,  H. V. Jagadish
Publisher:   Morgan & Claypool Publishers
ISBN:  

9781681734989


Pages:   173
Publication Date:   28 February 2019
Format:   Hardback
Availability:   Manufactured on demand   Availability explained
We will order this item for you from a manufactured on demand supplier.

Our Price $113.00 Quantity:  
Add to Cart

Share |

Data Management in Machine Learning Systems


Add your own review!

Overview

Large-scale data analytics using machine learning (ML) underpins many modern data-driven applications. ML systems provide means of specifying and executing these ML workloads in an efficient and scalable manner. Data management is at the heart of many ML systems due to data-driven application characteristics, data-centric workload characteristics, and system architectures inspired by classical data management techniques. In this book, we follow this data-centric view of ML systems and aim to provide a comprehensive overview of data management in ML systems for the end-to-end data science or ML lifecycle. We review multiple interconnected lines of work: (1) ML support in database (DB) systems, (2) DB-inspired ML systems, and (3) ML lifecycle systems. Covered topics include: in-database analytics via query generation and user-defined functions, factorized and statistical-relational learning; optimizing compilers for ML workloads; execution strategies and hardware accelerators; data access methods such as compression, partitioning and indexing; resource elasticity and cloud markets; as well as systems for data preparation for ML, model selection, model management, model debugging, and model serving. Given the rapidly evolving field, we strive for a balance between an up-to-date survey of ML systems, an overview of the underlying concepts and techniques, as well as pointers to open research questions. Hence, this book might serve as a starting point for both systems researchers and developers.

Full Product Details

Author:   Matthias Boehm ,  Arun Kumar ,  Jun Yang ,  H. V. Jagadish
Publisher:   Morgan & Claypool Publishers
Imprint:   Morgan & Claypool Publishers
Dimensions:   Width: 19.10cm , Height: 1.10cm , Length: 23.50cm
Weight:   0.333kg
ISBN:  

9781681734989


ISBN 10:   1681734982
Pages:   173
Publication Date:   28 February 2019
Audience:   General/trade ,  General
Format:   Hardback
Publisher's Status:   Active
Availability:   Manufactured on demand   Availability explained
We will order this item for you from a manufactured on demand supplier.

Table of Contents

Preface Acknowledgments Introduction ML Through Database Queries and UDFs Multi-Table ML and Deep Systems Integration Rewrites and Optimization Execution Strategies Data Access Methods Resource Heterogeneity and Elasticity Systems for ML Lifecycle Tasks Conclusions Bibliography Authors' Biographies

Reviews

Author Information

Matthias Boehm is a professor at Graz University of Technology, Austria, where he holds a BMVIT-endowed chair for data management. Prior to joining TU Graz in 2018, he was a research staff member at IBM Research - Almaden, CA, USA, with a focus on compilation and runtime techniques for declarative, large-scale machine learning. He received his Ph.D. from Dresden University of Technology, Germany in 2011 with a dissertation on cost-based optimization of integration flows. His previous research also includes systems support for time series forecasting as well as in-memory indexing and query processing. Matthias is a recipient of the 2016 VLDB Best Paper Award, and a 2016 SIGMOD Research Highlight Award.

Tab Content 6

Author Website:  

Customer Reviews

Recent Reviews

No review item found!

Add your own review!

Countries Available

All regions
Latest Reading Guide

wl

Shopping Cart
Your cart is empty
Shopping cart
Mailing List