Scaling Machine Learning with Spark

Distributed ML with MLlib, TensorFlow, and PyTorch

Häftad, Engelska, 2023

Av Adi Polak

999 kr

Beställningsvara. Skickas inom 5-8 vardagar
Fri frakt för medlemmar vid köp för minst 249 kr.

Learn how to build end-to-end scalable machine learning solutions with Apache Spark. With this practical guide, author Adi Polak introduces data and ML practitioners to creative solutions that supersede today's traditional methods. You'll learn a more holistic approach that takes you beyond specific requirements and organizational goals--allowing data and ML practitioners to collaborate and understand each other better.Scaling Machine Learning with Spark examines several technologies for building end-to-end distributed ML workflows based on the Apache Spark ecosystem with Spark MLlib, MLflow, TensorFlow, and PyTorch. If you're a data scientist who works with machine learning, this book shows you when and why to use each technology.You will:Explore machine learning, including distributed computing concepts and terminologyManage the ML lifecycle with MLflowIngest data and perform basic preprocessing with SparkExplore feature engineering, and use Spark to extract featuresTrain a model with MLlib and build a pipeline to reproduce itBuild a data system to combine the power of Spark with deep learningGet a step-by-step example of working with distributed TensorFlowUse PyTorch to scale machine learning and its internal architecture

Produktinformation

  • Utgivningsdatum2023-03-21
  • Mått178 x 233 x 16 mm
  • Vikt528 g
  • FormatHäftad
  • SpråkEngelska
  • Antal sidor400
  • FörlagO'Reilly Media
  • ISBN9781098106829

Tillhör följande kategorier