Practical Data Cleaning with Python
Katharine Jarmul will show you how to use Python libraries to speed up the data wrangling process and automate data cleaning, how to handle messy data, and how to write data unit tests that monitor data validity.
Insights, practical guidance, and announcements from O'Reilly
Katharine Jarmul will show you how to use Python libraries to speed up the data wrangling process and automate data cleaning, how to handle messy data, and how to write data unit tests that monitor data validity.
Learn about key microservices principles like Last Responsible Moment, risk sliders to assess benefit versus risk, heatlhchecks, and metrics.
How Scala will help you grow as a Java developer.
A field guide to the Apache Hadoop projects, subprojects, and related technologies.
A look at the Accessible Rich Internet Applications specification (ARIA), which enables dynamic applications to work with a variety of assistive technologies.
How much do you need to know?
Jeff Gothelf, author of Lean UX, shares his points of view on how pragmatism anchored in your organization's unique reality trumps the purity of process doctrine.
The scatterplot is a common type of visualization that represents two sets of corresponding values on two different axes.
Data science teams need people with the skills and curiosity to ask the big questions.
This recipe covers the basics of setting up a matplotlib plot, and how to create simple line plots.
This post is an introduction to a popular dimensionality reduction algorithm: t-distributed stochastic neighbor embedding (t-SNE).
In the next decade, Year Zero will be how big data reaches everyone and will fundamentally change how we live.