Big Data Tools and Pipelines

Ideas and resources related to data tools.

A/B Testing: a checklist

Lisa Qian lays out the process for a successful A/B test, from defining a goal and hypothesis to knowing when to end the test. Done right, an A/B test is the most rigorous form of data gathering, but it can't be run on guesswork or gut instinct.
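As a rough illustration of replacing gut instinct with a test statistic, here is a minimal sketch of a two-proportion z-test written in plain Python. It is not taken from Qian's checklist; the function name, the conversion counts, and the 0.05 threshold are all assumptions made up for the example.

```python
import math


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Two-sided z-test for a difference in conversion rates.

    conv_a, conv_b: number of conversions in variants A and B (hypothetical inputs)
    n_a, n_b: number of users exposed to each variant
    Returns (z, p_value).
    """
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    # Pooled rate under the null hypothesis that both variants convert equally.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if se == 0:
        raise ValueError("standard error is zero; rates are all 0 or all 1")
    z = (p_b - p_a) / se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


if __name__ == "__main__":
    # Made-up counts, for illustration only.
    z, p = two_proportion_z_test(conv_a=200, n_a=1000, conv_b=250, n_b=1000)
    alpha = 0.05  # assumed significance threshold
    print(f"z = {z:.3f}, p = {p:.4f}, significant at {alpha}: {p < alpha}")
```

One common way to decide when a test ends is to fix the sample size before it starts and evaluate the result only once that size is reached, rather than checking repeatedly and stopping at the first significant reading.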

Understanding YARN’s architecture and daemons

In the new O’Reilly video training "Introduction to Hadoop YARN," David Yahalom explains everything you need to know about using this new data-processing platform to extend Hadoop’s potential. In this segment, he walks through YARN’s architecture and daemons.

Analyzing Data with Python

In this webcast led by Sarah Guido, you'll get a bird's-eye view of some of the best tools for data analysis and how you can apply them to your workflow.