The security data lake
Leveraging big data technologies to build a common data repository for security
Our take on the ideas, information, and tools that make data work.
Cleaning and combining fields can turn messy data into actionable insight.
How storytelling can enhance the effectiveness of your visualizations.
One example of how using a data API can lead to better visualizations.
A field guide to the Apache Hadoop projects, subprojects, and related technologies.
The goal is to offer a single platform where users can get the best distributed algorithms for any data processing task.
How to decide which framework is best for your particular use case.
Now that technology has made its way into the playroom, there are a lot of important questions we should be asking.
With Myriad, analytics can be performed on the same hardware that runs your production services.
Changing your frame of reference when starting with SQL on Hadoop.
Understanding information cascades, viral content, and significant relationships.
The best of European and American data privacy initiatives can come together for the betterment of all.
Using fast, scalable relational databases to build event-oriented applications.
Learn how to manipulate data, and construct and evaluate models in Azure ML, using a complete data science example.
The O'Reilly Data Show Podcast: Carlos Guestrin on the early days of GraphLab and the evolution of GraphLab Create.
For maximum business value, big data applications have to involve multiple Hadoop ecosystem components.
Drawing inspiration from recent advances in data preparation.
From data privacy to real-world problem solving, O’Reilly’s data editors highlight the best of the best talks from 2014.
Women in data and technology are no longer outliers or anomalies; they are entering the mainstream and excelling where technical skills, advanced education, and no small amount of personal tenacity and brilliance are the minimum requirements.
Our annual wrap-up of important developments in the big data field.
In this episode of the O'Reilly Data Show Podcast, Jay Kreps talks about data integration, event data, and the Internet of Things.
How sensors, fast networks, AI, and distributed computing are affecting the data landscape.
Rajiv Maheswaran talks about the tools and techniques required to analyze new kinds of sports data.
Deciding what data to collect is hard when consequences are unpredictable.