Big Data Books

Big Data Experts

Ken Pepple Ken Pepple is a recognized consultant, author and speaker in the technology industry.

Ben Jones Ben Jones is the Sr. Tableau Public product manager at Tableau Software in Seattle and a data blogger at DataRemixed. He is the author of "Communicating Data with Tableau". Ben holds degrees in Mechanical Engineering (BSME, UCLA 2000) and Business (MBA, California Lutheran University, 2011) and previously…

Oliver Gierke Oliver Gierke is an engineer at SpringSource, a division of VMware, and project lead of the Spring Data JPA, MongoDB and core modules.

Paco Nathan Paco Nathan is the Chief Scientist and Vice President of Research and Development for Symbiot.

Big Data Answers

O'Reilly Answers: Clever Hacks. Creative Ideas. Innovative Solutions.

Big Data News & Commentary

Four short links: 27 August 2014


August 27, 2014

Discourse turns 1.0 — community/forum software that doesn’t suck. Programmable Matter (IEEE Spectrum) — recap of where research is going in this area. Liquibase — source control for your database. Apache 2.0 licensed. A Few Useful Things to Know About …

Four short links: 20 August 2014


August 20, 2014

Machine Learning for Plant Properties — startup building database of plant genomics, properties, research, etc. for mining. The more familiar you are with your data and its meaning, the better your machine learning will be at suggesting fruitful lines of …

Four short links: 13 August 2014


August 13, 2014

Viv — another step in the cognition race. Wolfram Alpha was first out of the gate, but Watson, Viv, and others are hot on the heels of being able to parse complex requests, then seek and use information to fulfil them. Universal …

Four short links: 7 August 2014


August 7, 2014

Material Design in the Google I/O App (Medium) — steps through design thinking as they put Google’s new design metaphor in place. I’ve been chewing on material design. It brings an internal consistency and logic to the Android world that …

Four short links: 6 August 2014


August 6, 2014

Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing (PDF) — paper by Googlers on the database holding G’s ad data. Trillions of rows, petabytes of data, point queries with 99th percentile latency in the hundreds of milliseconds and overall query throughput …

Four short links: 5 August 2014


August 5, 2014

Discussion Graph Tool (Microsoft Research) — simplifies social media analysis by making it easy to extract high-level features and co-occurrence relationships from raw data. Superlinear Productivity in Collective Group Actions (PLoS ONE) — study of open source projects shows small …

Four short links: 1 August 2014


August 1, 2014

Miso — Dataset, a JavaScript client-side data management and transformation library, Storyboard, a state and flow-control management library & d3.chart, a framework for creating reusable charts with d3.js. Open source designed to expedite the creation of high-quality interactive storytelling and …

Why local state is a fundamental primitive in stream processing


July 31, 2014

One of the concepts that has proven the hardest to explain to people when I talk about Samza is the idea of fault-tolerant local state for stream processing. I think people are so used to the idea of keeping all …
