By Jim Blomo | June 18, 2013
Starting and growing a data science team doesn't have to be a risky proposition. By balancing long term strategy and technology goals with immediate business demands, your data science team can quickly become productive and enjoy sustained growth.
By Sheridan Hitchens | June 11, 2013
Join us for an exclusive presentation by Sheriden Hitchens recorded live from Strata + Hadoop World 2012.
By Jonathan Bruner | June 05, 2013
In this Strata Online event, we'll look at some of the ways the rise of the always-on world is feeding the hungry engines of Big Data.
By Scott Murray | March 20, 2013
This webcast talk presented by Scott Murray author of Interactive Data Visualization for the Web, will introduce ideas from conceptual art, connecting them to the daily challenges faced by data visualizers working with code.
By Winston Chang | March 06, 2013
In this webcast presented by Winston Chang, author of R graphics Cookbook, you'll learn the basics of how to create data graphics using R and the popular ggplot2 package.
By Jeremy Howard | March 05, 2013
In this webcast talk Jeremy Howard, Kaggle's president and chief scientist, will explain exactly what occurred, why it was front-page newsworthy for the New York Times, how it will impact business, and what you need to know to make these new algorithms...
By Maksim Tsvetovat | March 05, 2013
In this webcast talk Maksim Tsvetovat author of Social Network Analysis for Startups will introduce a number of ways to address these issues and present an open-source Python-based toolkit for detecting and visualizing communities in Twitter networks...
By Wes McKinney | February 20, 2013
This live webcast is presented by Wes McKinney author of Python for Data Analysis and will be a somewhat advanced, technical talk connecting computer science concepts like data structure design and algorithms with the details of building intuitive, high...
By Alistair Croll | February 15, 2013
In this free online conference, we'll be showcasing some of the hot topics and thought-provoking speakers who will be joining us for the event.
By Bitsy Bentley | February 14, 2013
Businesses have access to more data than ever before, but the question of how the data can be leveraged to drive action is at times a daunting task, especially for larger organizations.
By Scott Murray | February 13, 2013
Join us for a hands-on webcast presented by Scott Murray author of Interactive Data Visualization for the Web, as he guides you through the framework of three avenues of engagement: aesthetic, narrative, and interactive.
http://oreillynet.com/images/people/weblogs/cj_date.jpgC.J. Date
By C.J. Date | January 30, 2013
In this webcast presentation, the overall message is: Views in general are just as updatable as base tables are! Attend this webcast and see why this isn't as extravagant a claim as it might seem.
By Alistair Croll | January 22, 2013
From public policy to elections, from healthcare to the battlefield, our lives rely on the analysis of abundant, connected data. But if data is infrastructure, then that infrastructure's vulnerable. Enemies can confound, confuse, distort, and mislead...
By Micheline Casey | January 15, 2013
In this webcast, Micheline Casey provides an overview of data governance and data management principles that should be applied to big data projects.
By David Boyle | January 08, 2013
In this exclusive webcast, David Boyle will look at how EMI changed itself, and the music industry, by moving from gut instinct and opinions to a data-informed business.
By Khaled El Emam | October 31, 2012
In this webcast presentation we will first provide an overview of how data can be re-identified, with reference to a number of recent real world examples. This will be followed by a description of how to de-identify health data in a defensible way according...
By Allen B. Downey | October 26, 2012
Join Allen Downey, author of Think Stats: Probability and Statistics for Programmers for an introduction to Bayesian statistics using Python. Bayesian statistical methods are becoming more common and more important, but there are not many resources to...
http://oreillynet.com/images/people/weblogs/benjamin_yoskovitz.jpgBenjamin Yoskovitz
By Benjamin Yoskovitz | October 25, 2012
The Lean movement has revolutionized how we create products and companies today. It focuses on customer development and tackling the risky parts first. At the core of this is iteration—a cycle of learning and adapting that's driven by data. Lean...
By James Pustejovsky, Amber Stubbs | October 16, 2012
Text-based data mining and information extraction systems that make use of machine learning techniques require annotated datasets for training the algorithms. In this webcast we will discuss the steps involved in creating your own training corpus for...
By J. Tod Fetherling | October 12, 2012
J. Tod Fetherling presents this 90 minute white board session walking the user through every aspect of the healthcare system from wellness to death.
By Wes McKinney | October 10, 2012
In this hands-on webcast presented by Wes McKinney, author of Python for Data Analysis , he will showcase a number of examples and you will receive an introduction to some of the most important tools in the Python language for data preparation, data ...
By Julie Steele | October 05, 2012
In this free online conference we will discuss how Microsoft Research has developed a new version of the Linear Mixed Model algorithm that is not only computationally inexpensive, but also is better at finding the true signals that account statistically...
By Alistair Croll | October 03, 2012
In this free online conference, we preview some of the hot topics, provocative speakers, and game-changing innovations that are fueling the growth of a data-driven society.
By John Myles White, Drew Conway | September 18, 2012
We'll introduce programmers to two of the most common tools in the machine learning toolkit: linear regression and logistic regression.
By Lars George | August 15, 2012
In this webcast we will look at popular reference architectures used by companies across several business verticals, discuss their pros and cons, and their applicability to different use-cases, and conclude with best-practice advise on hardware selection...
By Kaitlin Thaney, Alistair Croll, Jacomo Corbo, Simon Williams, Neal Lathia, John Graham-Cumming | July 24, 2012
In this Strata Online Conference, we'll look at data and movement across a variety of sports and industries.
By Edd Dumbill | June 20, 2012
In this Strata Online Event, we'll look at the way data science is shaping elections, from visualizations to game theory, from understanding issues to targeting voters.
http://oreillynet.com/images/people/weblogs/steve_francia.jpgSteve Francia
By Steve Francia | May 18, 2012
In this webcast presentation by Steve Francia, author of MongoDB and PHP, you will learn how to build elegant database applications with MongoDB and PHP.
By Alistair Croll | May 16, 2012
Join us for our seventh Strata online conference, as we look at Data That Matters.
By Tim O'Reilly, David Campbell | May 14, 2012
Tim O'Reilly, founder and CEO of O'Reilly Media, talks with Microsoft Technical Fellow Dave Campbell about new tools for data.
http://oreillynet.com/images/people/weblogs/alan_gates.jpgAlan Gates
By Alan Gates | May 10, 2012
In this webcast, we will cover how Pig can take advantage of changes in Hadoop 0.23.
By Gregory Brail, Daniel Jacobson, Dan Woods | March 22, 2012
In this webcast presentation join Dan Jacobson , Greg Brail, and Dan Woods as they discuss how business leaders can use APIs to transform as a strategy to transform business through private and public APIs.
By Jared Rosoff | February 17, 2012
In this webcast we'll provide a number of data modeling rules of thumb, and discuss the tradeoffs of various data modeling strategies.
By Kord Davis | February 16, 2012
The material will address the intersection of ethics and Big Data; what it is and what it isn't. Specifically, how to approach and generate dialog about an abstract subject with direct, real-world implications.
By Joe Kissell | February 03, 2012
In this webcast, veteran Mac author Joe Kissell explains what iCloud can do for you, how to deal with configuration puzzles and compatibility issues, and how best to manage the transition from MobileMe.
By John Zablocki | January 27, 2012
In this webcast John Zablocki, Developer Advocate at Couchbase, will introduce the .NET client library for Couchbase Server.
By Alistair Croll | December 07, 2011
In this online event, we'll look at how Big Data stacks and analytical approaches are gradually finding their way into organizations, as well as the roadblocks that can thwart efforts to become more data-driven.
By Maksim Tsvetovat | December 06, 2011
A follow-on to Analyzing Social Networks on Twitter, this webcast will concentrate on the social component of Twitter data rather then the questions of data gathering and decomposition.
By Lars George | November 04, 2011
This session explains the concepts behind coprocessors and uses examples to show how they can be used to implement data side extensions to the application code.
By Jim Adler, danah boyd, Terence Craig, Natalie Fonseca, Heather West | October 28, 2011
Join the panelists as they consider the evolution from private to public: how are our worlds colliding in the digital age?
http://oreillynet.com/images/people/weblogs/lars_george.jpgLars George
By Lars George | October 14, 2011
This session discusses the basic underlying concepts of the storage layer in HBase and how an application should be combined with the appropriate schema to achieve the best possible performance.
http://oreillynet.com/images/people/weblogs/allen_downey.jpgAllen B. Downey
By Allen B. Downey | October 04, 2011
People working with real data are often confused about hypothesis testing and paralyzed by the number of tests and their requirements. In this webcast, Allen B. Downey, author of Think Stats, presents a framework for using simple simulations to estimate...
By Terence Craig, Mary Ludloff | September 14, 2011
In this webcast, Terence Craig and Mary Ludloff, authors of Privacy and Big Data, ask and answer this question: What level of privacy do you really have in the age of big data?
By Julie Steele, Noah Iliinsky | September 06, 2011
This webcast will discuss data visualization. Learn linear processes and best practices so that your message may be transmitted without interference.
By Edd Dumbill, Kathryn Dekas, Michael Hugos, Michael Nelson, Hjalmar Gislason, Bill Schmarzo | August 31, 2011
In this special online event, you'll get an inside look at some of the world's leading thinkers and innovators in the fields of business, data, and disruption.
| August 09, 2011
In this session we will be demonstrating the construction of DSN's, linking tables, views, and using stored procedures and views in pass-through queries. This will include a discussion of the benefits in using SQL Server Schemas and Synonyms.
http://oreillynet.com/images/people/weblogs/mike_halsey3-50.jpgMike Halsey
By Mike Halsey | August 04, 2011
In this webcast, Mike Halsey MVP, the author of Troubleshooting Windows 7 Inside Out will talk you though how to keep your files and data safe from even the worst disaster.
By J. Chris Anderson, Dustin Sallings | April 19, 2011
In this webinar we'll introduce you to the Membase caching and clustering architecture, and show how CouchDB is a drop-in fit as the storage and query engine.
http://oreillynet.com/images/people/weblogs/kristina_chodorow.jpgKristina Chodorow
By Kristina Chodorow | February 04, 2011
This talk is a combination of whitepaper and Magic School Bus tour of how MongoDB scales across multiple machines. For applications that outgrow the resources of a single database server, MongoDB can convert to a sharded cluster, automatically managing...
http://oreillynet.com/images/people/50/bradford_stephens-50.jpgBradford Stephens
By Bradford Stephens | January 12, 2011
Building distributed systems is painful. Many organizations are approaching the point where their data and application infrastructures are being run on many servers (in the cloud or datacenter). Our software practices don't reflect that, often with disastrous...
http://oreillynet.com/images/people/50/hadi_hariri-50.jpgHadi Hariri
By Hadi Hariri | December 21, 2010
What does that mean to a .NET Developer? How do we store and retrieve data? How do we query it? If you've been interested in document databases but do not know where to start, then this is definitely the webcast for you. We'll see what CouchDB is about...
http://oreillynet.com/images/people/50/ken_goodhope-50.jpgKen Goodhope
By Ken Goodhope | November 23, 2010
We'll use real world examples in this webcast that demonstrate how to best utilize MapReduce with Hadoop. We'll also examine the appropriate uses of special partitioners, combiners, and configuration optimizations. We'll expose some common mistakes and...
http://oreillynet.com/images/people/50/benjamin_young-50.jpgBenjamin Young
By Benjamin Young | November 17, 2010
This talk will cover the basics of the CouchDB HTTP API and how to use it from PHP with and without helper libraries. We'll discuss some architecture approaches and briefly look at things to avoid when moving from an RDBMS to a Document Database such...
http://oreillynet.com/images/people/50/c_brown-50.jpgC. Titus Brown
By C. Titus Brown | November 10, 2010
Many data analysis problems are not easily parallelizable, often because the relevant analyses require an all-by-all analysis step. Applying heuristics often requires approximation, which introduces errors, noise, and bias. Recently, in confronting the...
http://oreillynet.com/images/people/50/kyle_banker-50.jpgKyle Banker
By Kyle Banker | October 29, 2010
We all know that MongoDB is one of the most flexible and feature-rich databases available. In this session we'll discuss how you can leverage this feature set and maintain high performance with your project's massive data sets and high loads. We'll cover...
http://oreillynet.com/images/people/50/kocoloski_adam-50.jpgAdam Kocoloski
By Adam Kocoloski | October 22, 2010
This talk will cover the basics of BigCouch, including deploying and managing your first CouchDB cluster, as well as some advanced features like quorum reads/writes and design patterns for distributed couchdb. Finally, for the erlang hackers out there...
http://oreillynet.com/images/people/50/aaron_miller-50.jpgAaron Miller
By Aaron Miller | September 22, 2010
Why CouchDB on a phone is awesome, and what you can do with it Deploying existing CouchApps to Android CouchDB Using CouchDB in native Android apps
http://oreillynet.com/images/people/weblogs/kristina_chodorow.jpgKristina Chodorow
By Kristina Chodorow | September 17, 2010
MongoDB's architecture features built-in support for horizontal scalability, and high availability through replica sets. Auto-sharding allows users to easily distribute data across many nodes. Replica sets enable automatic failover and recovery of database...
http://oreillynet.com/images/people/weblogs/tom_white.jpgTom White
By Tom White | September 15, 2010
Apache Hadoop is a part of a growing ecosystem of projects for large-scale data analysis which is being used to solve problems for organizations in a wide range of disciplines. This talk will touch on what's new in the second edition of Hadoop: The Definitive...
http://oreillynet.com/images/people/weblogs/jan_jehnardt.jpgJan Lehnardt
By Jan Lehnardt | August 25, 2010
Learn how to build robust web services using CouchDB's built-in facility for near-realtime updates. We'll explore a few patterns _changes can be used for: Building custom external indexers like CouchDB-Lucene, Powering CouchDB's replication, Real-time...
http://oreillynet.com/images/people/weblogs/sean_hull.jpgSean Hull
By Sean Hull | July 27, 2010
In this webcast we'll discuss a two-node MySQL multi-master replication setup. We'll take the audience step-by-step through the process, and then uses MMM (MySQL Multi-master Manager) to manage & automate the process exposing a virtual IP address...
http://oreillynet.com/images/people/weblogs/chris_anderson_2.jpgJ. Chris Anderson
By J. Chris Anderson | July 14, 2010
CouchDB is known for having a flexible schemaless JSON storage API. But that is just the tip of the iceberg when it comes to flexibility. In this webcast we'll learn how replication can be used to share data securely, build offline-capable applications...
http://oreillynet.com/images/people/weblogs/jan_jehnardt.jpgJan Lehnardt
By Jan Lehnardt | June 22, 2010
This webcasts highlights new features and refines in the latest and upcoming release of CouchDB. It rehashes old solutions to problems that are now way easier to solve. We look at how the new features help you make your life and development work easier...
http://oreillynet.com/images/people/weblogs/chris_anderson_2.jpgJ. Chris Anderson
By J. Chris Anderson | May 20, 2010
Learn to hack jQuery CouchApps -- p2p web applications that can be deployed anywhere there's a CouchDB. Apache CouchDB can host HTML5 apps natively, serving them over HTTP. Learn how to write JavaScript CouchApps which run on both the client and ...
http://oreillynet.com/images/people/weblogs/chris_anderson_2.jpgJ. Chris Anderson
By J. Chris Anderson | April 21, 2010
CouchDB is a distributed document database accessed via HTTP and JSON and queried using JavaScript Map Reduce. CouchDB focuses on simplicity and reliability, with a data replication model that makes it well suited for mobile and offline applications...
http://oreillynet.com/images/people/weblogs/sean_hull.jpgSean Hull
By Sean Hull | January 19, 2010
DRBD has grown in popularity as an excellent low-cost high availability solution for MySQL. It provides synchronous replication of your data without MySQL having to worry too much about the details. Combined with Linux Heartbeat, and you have automatic...
http://oreillynet.com/images/people/weblogs/michael_milton.gifMichael Milton
By Michael Milton | October 28, 2009
Data analysis skills are critical to staying competitive in the 21st century economy. In this webcast the author of Head First Data Analysis, Michael Milton, provides some useful tips for common data problems that everyone faces.
http://oreillynet.com/images/people/weblogs/sean_hull.jpgSean Hull
By Sean Hull | August 04, 2009
MySQL's Clustering solution provides some pretty sophisticated functionality. In this webcast we'll take you through getting it up and running on your laptop or single node server, building a sandbox where you can play with the dials and levers and get...
http://oreillynet.com/images/people/weblogs/sean_hull.jpgSean Hull
By Sean Hull | January 22, 2009
In this live online event, Sean Hull (Oracle and Open Source) will talk about why MySQL slaves get out of sync with the master, both in terms of things that happen in the application and in MySQL's implementation of statement-based replication. He'll...