Book description
Companies working to become data driven often view data scientists as heroes, but that overlooks the vital role that data engineers play in the process. While data scientists focus on finding new insights from datasets, data engineers deal with preparation—obtaining, cleaning, and creating enhanced versions of the data an organization needs. In this report, Andy Oram examines how the role of data engineer has quickly evolved.
DBAs, software engineers, developers, and students will explore the responsibilities of modern data engineers and the skills and tools necessary to do the job. You’ll learn how to deal with software engineering concepts such as rapid and continuous development, automation and orchestration, modularity, and traceability. Decision makers considering a move to the cloud will also benefit from the in-depth discussion this report provides.
This report covers:
- Major tasks of data engineers today
- The different levels of structure in data and ways to maximize its value
- Capabilities of third-party cloud options
- Tools for ingestion, transfer, and enrichment
- Using containers and VMs to run the tools
- Software engineering development
- Automation and orchestration of data engineering
Product information
- Title: The Evolving Role of the Data Engineer
- Author(s):
- Release date: April 2020
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781492052500
You might also like
article
The Human Factor in AI-Based Decision-Making
Individuals’ unique decision-making styles inform the choices they make when working with AI-based inputs. The authors …
article
Why So Many Data Science Projects Fail to Deliver
Many companies are unable to consistently gain business value from their investments in big data, artificial …
article
Three Ways to Sell Value in B2B Markets
As customers face pressure to reduce costs while maintaining profitability, value-based selling (VBS) has become critical …
audiobook
The Year in Tech, 2025
<B>A year of HBR's essential thinking on tech—all in one place.</B><br/><br/><br/><br/>Generative AI, biometrics, spatial computing, electric …