Book description
As enterprise-scale data science sharpens its focus on data-driven decision making and machine learning, new tools have emerged to help facilitate these processes. This practical ebook shows data scientists and enterprise developers how the notebook interface, Apache Spark, and other collaboration tools are particularly well suited to bridge the communication gap between their teams.
Through a series of real-world examples, author Jerome Nilmeier demonstrates how to generate a model that enables data scientists and developers to share ideas and project code. You’ll learn how data scientists can approach real-world business problems with Spark and how developers can then implement the solution in a production environment.
- Dive deep into data science technologies, including Spark, TensorFlow, and the Jupyter Notebook
- Learn how Spark and Python notebooks enable data scientists and developers to work together
- Explore how the notebook environment works with Spark SQL for structured data
- Use notebooks and Spark as a launchpad to pursue supervised, unsupervised, and deep learning data models
- Learn additional Spark functionality, including graph analysis and streaming
- Explore the use of analytics in the production environment, particularly when creating data pipelines and deploying code
Table of contents
- Foreword
- Preface
- 1. Sharing Information Across Disciplines in the Enterprise
- 2. Setting Up Your Notebook Environment
- 3. Data Science Technologies
- 4. Introduction to Machine Learning
- 5. Classic Machine Learning Examples and Applications
- 6. Advanced Machine Learning Examples and Applications
Product information
- Title: Data Science and Engineering at Enterprise Scale
- Author(s):
- Release date: April 2019
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781492039334
You might also like
book
The Applied Data Science Workshop - Second Edition
Designed with beginners in mind, this workshop helps you make the most of Python libraries and …
book
The Evolving Role of the Data Engineer
Companies working to become data driven often view data scientists as heroes, but that overlooks the …
video
Strata Data Conference 2019 - London, United Kingdom
The Strata Data Conference, the world's largest gathering of the data community, came to London April …
book
Julia Quick Syntax Reference: A Pocket Guide for Data Science Programming
This quick Julia programming language guide is a condensed code and syntax reference to the Julia …