Large-Scale Data Analytics with Python and Spark: A Hands-on Guide to Implementing Machine Learning Solutions

Author:   Isaac Triguero (University of Nottingham) ,  Mikel Galar (Public University of Navarre)
Publisher:   Cambridge University Press
ISBN:  

9781009318259


Pages:   422
Publication Date:   23 November 2023
Format:   Paperback
Availability:   In stock   Availability explained
We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately.

Our Price $77.60 Quantity:  
Add to Cart

Share |

Large-Scale Data Analytics with Python and Spark: A Hands-on Guide to Implementing Machine Learning Solutions


Add your own review!

Overview

Based on the authors' extensive teaching experience, this hands-on graduate-level textbook teaches how to carry out large-scale data analytics and design machine learning solutions for big data. With a focus on fundamentals, this extensively class-tested textbook walks students through key principles and paradigms for working with large-scale data, frameworks for large-scale data analytics (Hadoop, Spark), and explains how to implement machine learning to exploit big data. It is unique in covering the principles that aspiring data scientists need to know, without detail that can overwhelm. Real-world examples, hands-on coding exercises and labs combine with exceptionally clear explanations to maximize student engagement. Well-defined learning objectives, exercises with online solutions for instructors, lecture slides, and an accompanying suite of lab exercises of increasing difficulty in Jupyter Notebooks offer a coherent and convenient teaching package. An ideal teaching resource for courses on large-scale data analytics with machine learning in computer/data science departments.

Full Product Details

Author:   Isaac Triguero (University of Nottingham) ,  Mikel Galar (Public University of Navarre)
Publisher:   Cambridge University Press
Imprint:   Cambridge University Press
Dimensions:   Width: 17.00cm , Height: 2.00cm , Length: 24.50cm
Weight:   0.780kg
ISBN:  

9781009318259


ISBN 10:   100931825
Pages:   422
Publication Date:   23 November 2023
Audience:   General/trade ,  General
Format:   Paperback
Publisher's Status:   Active
Availability:   In stock   Availability explained
We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately.

Table of Contents

Part I. Understanding and Dealing with Big Data: 1. Introduction; 2. MapReduce; Part II. Big Data Frameworks: 3. Hadoop; 4. Spark; 5. Spark SQL and DataFrames; Part III. Machine Learning for Big Data: 6. Machine Learning with Spark; 7. Machine Learning for Big Data; 8. Implementing Classical Methods: k-means and Linear Regression; 9. Advanced Examples: Semi-supervised, Ensembles, Deep Learning Model Deployment.

Reviews

'With the growing ubiquity of large and complex datasets, MapReduce and Spark's dataflow programming models have become mission-critical skills for data scientists, data engineers, and ML engineers. Triguero and Galar leverage their extensive teaching experience on this topic to deliver this tour de force deep dive into both the technical concepts and programming knowhow needed for such modern large-scale data analytics. They interleave intuitive exposition of the concepts and examples from data engineering and classical ML pipelines with well-thought-out hands-on code and outputs. This book not only shows how all this knowledge is useful in practice today but also sets up the reader to be able to successfully 'generalize' to future workloads.' Arun Kumar, University of California, San Diego


‘With the growing ubiquity of large and complex datasets, MapReduce and Spark's dataflow programming models have become mission-critical skills for data scientists, data engineers, and ML engineers. Triguero and Galar leverage their extensive teaching experience on this topic to deliver this tour de force deep dive into both the technical concepts and programming knowhow needed for such modern large-scale data analytics. They interleave intuitive exposition of the concepts and examples from data engineering and classical ML pipelines with well-thought-out hands-on code and outputs. This book not only shows how all this knowledge is useful in practice today but also sets up the reader to be able to successfully ‘generalize’ to future workloads.’ Arun Kumar, University of California, San Diego


Author Information

Isaac Triguero is Distinguished Senior Researcher at the Department of Computer Science and Artificial Intelligence, University of Granada, and Associate Professor of Data Science at the School of Computer Science of the University of Nottingham. He won the 2019 School of Computer Science – University of Nottingham Award for Teaching. Mikel Galar is Associate Professor of Computer Science and Artificial Intelligence at the Department of Statistics, Computer Science and Mathematics, Public University of Navarre. He is a co-founder of Neuraptic AI and won the 2020 Excellence in Teaching Award of the Public University of Navarre.

Tab Content 6

Author Website:  

Customer Reviews

Recent Reviews

No review item found!

Add your own review!

Countries Available

All regions
Latest Reading Guide

wl

Shopping Cart
Your cart is empty
Shopping cart
Mailing List