Book description
Build predictive models using large data volumes and deploy them to production using cutting-edge techniques
Key Features
- Build highly accurate state-of-the-art machine learning models against large-scale data
- Deploy models for batch, real-time, and streaming data in a wide variety of target production systems
- Explore all the new features of the H2O AI Cloud end-to-end machine learning platform
Book Description
H2O is an open source, fast, and scalable machine learning framework that allows you to build models using big data and then easily productionalize them in diverse enterprise environments.
Machine Learning at Scale with H2O begins with an overview of the challenges faced in building machine learning models on large enterprise systems, and then addresses how H2O helps you to overcome them. You'll start by exploring H2O's in-memory distributed architecture and find out how it enables you to build highly accurate and explainable models on massive datasets using your favorite ML algorithms, language, and IDE. You'll also get to grips with the seamless integration of H2O model building and deployment with Spark using H2O Sparkling Water. You'll then learn how to easily deploy models with H2O MOJO. Next, the book shows you how H2O Enterprise Steam handles admin configurations and user management, and then helps you to identify different stakeholder perspectives that a data scientist must understand in order to succeed in an enterprise setting. Finally, you'll be introduced to the H2O AI Cloud platform and explore the entire machine learning life cycle using multiple advanced AI capabilities.
By the end of this book, you'll be able to build and deploy advanced, state-of-the-art machine learning models for your business needs.
What you will learn
- Build and deploy machine learning models using H2O
- Explore advanced model-building techniques
- Integrate Spark and H2O code using H2O Sparkling Water
- Launch self-service model building environments
- Deploy H2O models in a variety of target systems and scoring contexts
- Expand your machine learning capabilities on the H2O AI Cloud
Who this book is for
This book is for data scientists and machine learning engineers who want to gain hands-on machine learning experience by building and deploying state-of-the-art models with advanced techniques using H2O technology. An understanding of the data science process and experience in Python programming is recommended. This book will also benefit students by helping them understand how machine learning works in real-world enterprise scenarios.
Table of contents
- Machine Learning at Scale with H2O
- Acknowledgments
- Contributors
- About the authors
- About the reviewers
- Preface
- Section 1 – Introduction to the H2O Machine Learning Platform for Data at Scale
- Chapter 1: Opportunities and Challenges
- Chapter 2: Platform Components and Key Concepts
-
Chapter 3: Fundamental Workflow – Data to Deployable Model
- Technical requirements
- Use case and data overview
- The fundamental workflow
-
Variation points – alternatives and extensions to the fundamental workflow
- Launching an H2O cluster using the Enterprise Steam API versus the UI (step 1)
- Launching an H2O-3 versus Sparkling Water cluster (step 1)
- Implementing Enterprise Steam or not (steps 1–2)
- Using a personal access token to log in to Enterprise Steam (step 2)
- Building the model (step 3)
- Evaluating and explaining the model (step 4)
- Exporting the model's scoring artifact (step 5)
- Shutting down the cluster (step 6)
- Summary
- Section 2 – Building State-of-the-Art Models on Large Data Volumes Using H2O
- Chapter 4: H2O Model Building at Scale – Capability Articulation
-
Chapter 5: Advanced Model Building – Part I
- Technical requirements
- Splitting data for validation or cross-validation and testing
- Algorithm considerations
- Model optimization with grid search
- H2O AutoML
- Feature engineering options
- Leveraging H2O Flow to enhance your IDE workflow
- Putting it all together – algorithms, feature engineering, grid search, and AutoML
- Summary
- Chapter 6: Advanced Model Building – Part II
- Chapter 7: Understanding ML Models
- Chapter 8: Putting It All Together
- Section 3 – Deploying Your Models to Production Environments
- Chapter 9: Production Scoring and the H2O MOJO
-
Chapter 10: H2O Model Deployment Patterns
- Technical requirements
- Surveying a sample of MOJO deployment patterns
- Exploring examples of MOJO scoring with H2O software
- Exploring examples of MOJO scoring with third-party software
- Exploring examples of MOJO scoring with your target-system software
- Exploring examples of accelerators based on H2O Driverless AI integrations
- Summary
- Section 4 – Enterprise Stakeholder Perspectives
- Chapter 11: The Administrator and Operations Views
- Chapter 12: The Enterprise Architect and Security Views
- Section 5 – Broadening the View – Data to AI Applications with the H2O AI Cloud Platform
-
Chapter 13: Introducing H2O AI Cloud
- Technical requirements
- An H2O AI Cloud overview
-
H2O AI Cloud component breakdown
- DistributedML (H2O-3 and H2O Sparkling Water)
- H2O AutoML (H2O Driverless AI)
- DeepLearningML (H2O Hydrogen Torch)
- DocumentML (H2O Document AI)
- A self-provisioning service (H2O Enterprise Steam)
- Feature Store (H2O AI Feature Store)
- MLOps (H2O MLOps)
- Low-code SDK for AI applications (H2O Wave)
- App Store (H2O AI App Store)
- H2O AI Cloud architecture
- Summary
-
Chapter 14: H2O at Scale in a Larger Platform Context
- Technical requirements
- A quick recap of H2O AI Cloud
- Exploring a baseline reference solution for H2O at scale
-
Exploring new possibilities for H2O at scale
- Leveraging H2O Driverless AI for prototyping and feature discovery
- Integrating H2O MLOps for model monitoring, management, and governance
- Leveraging H2O AI Feature Store for feature operationalization and reuse
- Consuming predictions in a business context from a Wave AI app
- Integrating an automated retraining pipeline in a Wave AI app
- A Reference H2O Wave app as an enterprise AI integration fabric
- Summary
- Appendix : Alternative Methods to Launch H2O Clusters
- Other Books You May Enjoy
Product information
- Title: Machine Learning at Scale with H2O
- Author(s):
- Release date: July 2022
- Publisher(s): Packt Publishing
- ISBN: 9781800566019
You might also like
book
Practical Machine Learning with H2O
Machine learning has finally come of age. With H2O software, you can perform machine learning and …
book
Practical Automated Machine Learning Using H2O.ai
Accelerate the adoption of machine learning by automating away the complex parts of the ML pipeline …
book
Automated Machine Learning
Get to grips with automated machine learning and adopt a hands-on approach to AutoML implementation and …
book
Agile Machine Learning with DataRobot
Leverage DataRobot's enterprise AI platform and automated decision intelligence to extract business value from data Key …