Book description
Data science happens in code. The ability to write reproducible, robust, scaleable code is key to a data science project's success—and is absolutely essential for those working with production code. This practical book bridges the gap between data science and software engineering, and clearly explains how to apply the best practices from software engineering to data science.
Examples are provided in Python, drawn from popular packages such as NumPy and pandas. If you want to write better data science code, this guide covers the essential topics that are often missing from introductory data science or coding classes, including how to:
- Understand data structures and object-oriented programming
- Clearly and skillfully document your code
- Package and share your code
- Integrate data science code with a larger code base
- Learn how to write APIs
- Create secure code
- Apply best practices to common tasks such as testing, error handling, and logging
- Work more effectively with software engineers
- Write more efficient, maintainable, and robust code in Python
- Put your data science projects into production
- And more
Publisher resources
Table of contents
- Preface
- 1. What Is Good Code?
- 2. Analyzing Code Performance
- 3. Using Data Structures Effectively
- 4. Object-Oriented Programming and Functional Programming
- 5. Errors, Logging, and Debugging
- 6. Code Formatting, Linting, and Type Checking
- 7. Testing Your Code
- 8. Design and Refactoring
- 9. Documentation
- 10. Sharing Your Code: Version Control, Dependencies, and Packaging
- 11. APIs
- 12. Automation and Deployment
- 13. Security
- 14. Working in Software
- 15. Next Steps
- Index
- About the Author
Product information
- Title: Software Engineering for Data Scientists
- Author(s):
- Release date: April 2024
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781098136208
You might also like
book
Data Science: The Hard Parts
This practical guide provides a collection of techniques and best practices that are generally overlooked in …
book
Practical Statistics for Data Scientists, 2nd Edition
Statistical methods are a key part of data science, yet few data scientists have formal statistical …
book
Data Science from Scratch, 2nd Edition
To really learn data science, you should not only master the tools—data science libraries, frameworks, modules, …
book
AI and Machine Learning for Coders
If you're looking to make a career move from programmer to AI specialist, this is the …