Book description
There’s a lot of information about big data technologies, but splicing these technologies into an end-to-end enterprise data platform is a daunting task not widely covered. With this practical book, you’ll learn how to build big data infrastructure both on-premises and in the cloud and successfully architect a modern data platform.
Ideal for enterprise architects, IT managers, application architects, and data engineers, this book shows you how to overcome the many challenges that emerge during Hadoop projects. You’ll explore the vast landscape of tools available in the Hadoop and big data realm in a thorough technical primer before diving into:
- Infrastructure: Look at all component layers in a modern data platform, from the server to the data center, to establish a solid foundation for data in your enterprise
- Platform: Understand aspects of deployment, operation, security, high availability, and disaster recovery, along with everything you need to know to integrate your platform with the rest of your enterprise IT
- Taking Hadoop to the cloud: Learn the important architectural aspects of running a big data platform in the cloud while maintaining enterprise security and high availability
Publisher resources
Table of contents
- Foreword
- Preface
- 1. Big Data Technology Primer
- I. Infrastructure
- 2. Clusters
- 3. Compute and Storage
- 4. Networking
- 5. Organizational Challenges
- 6. Datacenter Considerations
- II. Platform
- 7. Provisioning Clusters
- 8. Platform Validation
- 9. Security
- 10. Integration with Identity Management Providers
- 11. Accessing and Interacting with Clusters
- 12. High Availability
- 13. Backup and Disaster Recovery
- III. Taking Hadoop to the Cloud
- 14. Basics of Virtualization for Hadoop
- 15. Solutions for Private Clouds
- 16. Solutions in the Public Cloud
- 17. Automated Provisioning
- 18. Security in the Cloud
- A. Backup Onboarding Checklist
- Index
Product information
- Title: Architecting Modern Data Platforms
- Author(s):
- Release date: December 2018
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781491969229
You might also like
book
Deciphering Data Architectures
Data fabric, data lakehouse, and data mesh have recently appeared as viable alternatives to the modern …
book
Architecting Data and Machine Learning Platforms
All cloud architects need to know how to build data platforms that enable businesses to make …
book
Foundations of Scalable Systems
In many systems, scalability becomes the primary driver as the user base grows. Attractive features and …
book
Quick Start Guide to Large Language Models: Strategies and Best Practices for Using ChatGPT and Other LLMs
The advancement of Large Language Models (LLMs) has revolutionized the field of Natural Language Processing in …