Data Engineering on Azure

by Vlad Riscutia

Released August 2021

Publisher(s): Manning Publications

ISBN: 9781617298929

Start your free trial

Book description

Build a data platform to the industry-leading standards set by Microsoft’s own infrastructure.

In Data Engineering on Azure you will learn how to:

Pick the right Azure services for different data scenarios
Manage data inventory
Implement production quality data modeling, analytics, and machine learning workloads
Handle data governance
Using DevOps to increase reliability
Ingesting, storing, and distributing data
Apply best practices for compliance and access control

Data Engineering on Azure reveals the data management patterns and techniques that support Microsoft’s own massive data infrastructure. Author Vlad Riscutia, a data engineer at Microsoft, teaches you to bring an engineering rigor to your data platform and ensure that your data prototypes function just as well under the pressures of production. You'll implement common data modeling patterns, stand up cloud-native data platforms on Azure, and get to grips with DevOps for both analytics and machine learning.

About the Technology
Build secure, stable data platforms that can scale to loads of any size. When a project moves from the lab into production, you need confidence that it can stand up to real-world challenges. This book teaches you to design and implement cloud-based data infrastructure that you can easily monitor, scale, and modify.

About the Book
In Data Engineering on Azure you’ll learn the skills you need to build and maintain big data platforms in massive enterprises. This invaluable guide includes clear, practical guidance for setting up infrastructure, orchestration, workloads, and governance. As you go, you’ll set up efficient machine learning pipelines, and then master time-saving automation and DevOps solutions. The Azure-based examples are easy to reproduce on other cloud platforms.

What's Inside

Data inventory and data governance
Assure data quality, compliance, and distribution
Build automated pipelines to increase reliability
Ingest, store, and distribute data

About the Reader
For data engineers familiar with cloud computing and DevOps.

About the Author
Vlad Riscutia is a software architect at Microsoft.

Quotes
A definitive and complete guide on data engineering, with clear and easy-to-reproduce examples.
- Kelum Prabath Senanayake, Echoworx

An all-in-one Azure book, covering all a solutions architect or engineer needs to think about.
- Albert Nogués, Danone

A meaningful journey through the Azure ecosystem. You’ll be building pipelines and joining components quickly!
- Todd Cook, Appen

A gateway into the world of Azure for machine learning and DevOps engineers.
- Krzysztof Kamyczek, Luxoft