The next 10 years of Apache Hadoop
Doug Cutting, Tom White, and Ben Lorica explore Hadoop's role over the coming decade.
Our take on the ideas, information, and tools that make data work.
This report dives into the IoT industry through a series of illuminating talks and case studies presented at the 2015 Strata + Hadoop World conferences in San Jose, New York, and Singapore.
Comparing AWS, GCP, and Azure for large-scale analytics.
Practical privacy, data security, and consumer protection dos and don’ts to help you avoid becoming a legal target.
Tricia Wang explores the application of “thick data” gathered through qualitative methods.
Megan Price explains why machine-learning methods can be crucial to understanding and addressing patterns of violence.
Stefanie Posavec discusses the insights she gained spending a year on her intensive Dear Data project.
Jordan Tigani shares what big data means for Google, and he announces several new BigQuery features.
Cat Drew explores how the UK’s Policy Lab and GDS data teams are bringing an approach to policymaking that combines data, digital, and design.
David Selby shares some of the data challenges he's faced and explains why he's particularly enthusiastic about the latest technological developments in the field.
Broad inter-domain awareness about which problems can be solved using deep learning techniques plays a key role in data analytics development.
Martin Willcox explains why data management, data integration, and multigenre analytics are essential for driving business value from IoT initiatives.
Joe Hellerstein explains why we now take a relativistic view of data, where the meaning of data depends on the context in which it is used.
Mona Vernon outlines a framework for thinking about data shareability and data monetization.
Cloudera's Mike Olson and CERN's Manuel Martin Marquez discuss how CERN is using Hadoop to help drive operational efficiency for the Large Hadron Collider.
Learn how federated analytics is helping cancer research teams analyze large genomics and patient data sets, while preserving patient data privacy and intellectual property.
The future of construction will be inspired by social insects.
Watch keynotes from Strata + Hadoop World in London.
Bolke de Bruin and Hylke Hendriksen explain how ING, by treating a user’s click path as a process being followed, applied process mining and adapted it to Spark Streaming. This resulted in near real-time fraud detection and analysis.
Jun Rao explains the threats that Kafka Security mitigates, the changes that were made to Kafka to enable security, and the steps required to secure an existing Kafka cluster.
Learn how to use Python with the Hadoop Distributed File System, MapReduce, the Apache Pig platform and Pig Latin script, and the Apache Spark cluster-computing framework.
How teaching others can help society and advance your career.
As the Internet of Things grows ever larger, data analysis and decision-making will have to localize—shifting from the cloud to the edge.
In this report, author Cornelia Lévy-Bencheton examines the disruptive megatrends taking hold at every level and juncture of the financial ecosystem.