Structured streaming comes to Apache Spark 2.0
The O’Reilly Data Show Podcast: Michael Armbrust on enabling users to perform streaming analytics, without having to reason about streaming.
Our take on the ideas, information, and tools that make data work.
The O’Reilly Data Show Podcast: Danny Bickson on recommenders, data science, and applications of machine learning.
The O’Reilly Data Show Podcast: Ira Cohen on developing machine learning tools for a broad range of real-time applications.
The O’Reilly Data Show Podcast: Mikio Braun on practical data science, deep neural networks, machine learning, and AI.
Watch full keynotes from Strata + Hadoop World in San Jose 2016, covering data science, data tools, enterprise adoption, and more.
A data-driven analysis of companies using Hadoop, Spark, data science, and machine learning.
The O’Reilly Data Show Podcast: Duncan Ross on the evolution of analytics, data mining, and data philanthropy.
The O’Reilly Data Show Podcast: M.C. Srivas on streaming, enterprise-grade systems, the Internet of Things, and data for social good.
The O’Reilly Data Show Podcast: Fang Yu on data science in security, unsupervised learning, and Apache Spark.
The O’Reilly Data Show Podcast: Joe Hellerstein on data wrangling, distributed systems, and metadata services.
One investor’s retrospective of data analytics over the last 25 years.
The O’Reilly Data Show Podcast: Eric Colson on algorithms, human computation, and building data science teams.
The what, where, when, and how of unbounded data processing.
The O’Reilly Data Show Podcast: Vasant Dhar on the race to build “big data machines” in financial investing.
Roadmaps should be driven by business goals, not technology.
The O’Reilly Data Show Podcast: A fireside chat with Ben Horowitz, plus Reynold Xin on the rise of Apache Spark in China.
Using induction to test your hypotheses.
The O’Reilly Data Show Podcast: Evan Chan on the early days of Spark+Cassandra, FiloDB, and cloud computing.
The Programmer's Oath is missing one essential element: the customer.
The O’Reilly Data Show Podcast: Emil Eifrem on popular applications of graph technologies, cloud computing, and company culture.
True insights require a measurable action plan and domain knowledge.
How an algorithm is to a data scientist what a compound microscope is to a biologist.
The O’Reilly Data Show Podcast: The Hadoop ecosystem, the recent surge in interest in all things real time, and developments in hardware.
How Lawrence Berkeley National Lab’s supercomputing center is tackling 10 data analytics problems across the sciences.