Blogs

BROWSE: Most Recent | Popular Tags |

Tags > data

Why local state is a fundamental primitive in stream processing

By Jay Kreps
July 31, 2014

One of the concepts that has proven the hardest to explain to people when I talk about Samza is the idea of fault-tolerant local state for stream processing. I think people are so used to the idea of keeping all …

Why local state is a fundamental primitive in stream processing

By Jay Kreps
July 31, 2014

One of the concepts that has proven the hardest to explain to people when I talk about Samza is the idea of fault-tolerant local state for stream processing. I think people are so used to the idea of keeping all …

Why local state is a fundamental primitive in stream processing

By Jay Kreps
July 31, 2014

One of the concepts that has proven the hardest to explain to people when I talk about Samza is the idea of fault-tolerant local state for stream processing. I think people are so used to the idea of keeping all …

New scalable solutions for data analysis with R

By Federico Castanedo
July 24, 2014

The R programming language is the most popular statistical software in use today by data scientists, according to the 2013 Rexer Analytics Data Miner survey. One of the main drawbacks of vanilla R is the inability to scale and handle …

New scalable solutions for data analysis with R

By Federico Castanedo
July 24, 2014

The R programming language is the most popular statistical software in use today by data scientists, according to the 2013 Rexer Analytics Data Miner survey. One of the main drawbacks of vanilla R is the inability to scale and handle …

New scalable solutions for data analysis with R

By Federico Castanedo
July 24, 2014

The R programming language is the most popular statistical software in use today by data scientists, according to the 2013 Rexer Analytics Data Miner survey. One of the main drawbacks of vanilla R is the inability to scale and handle …

How to build and run your first deep learning network

By Pete Warden
July 23, 2014

When I first became interested in using deep learning for computer vision I found it hard to get started. There were only a couple of open source projects available, they had little documentation, were very experimental, and relied on a …

How to build and run your first deep learning network

By Pete Warden
July 23, 2014

When I first became interested in using deep learning for computer vision I found it hard to get started. There were only a couple of open source projects available, they had little documentation, were very experimental, and relied on a …

How to build and run your first deep learning network

By Pete Warden
July 23, 2014

When I first became interested in using deep learning for computer vision I found it hard to get started. There were only a couple of open source projects available, they had little documentation, were very experimental, and relied on a …

In search of a model for modeling intelligence

By Beau Cronin
July 22, 2014

Editor’s note: this post is part of our Intelligence Matters investigation. In my last post, we saw that AI means a lot of things to a lot of people. These dueling definitions each have a deep history — ok fine, …

In search of a model for modeling intelligence

By Beau Cronin
July 22, 2014

Editor’s note: this post is part of our Intelligence Matters investigation. In my last post, we saw that AI means a lot of things to a lot of people. These dueling definitions each have a deep history — ok fine, …

In search of a model for modeling intelligence

By Beau Cronin
July 22, 2014

Editor’s note: this post is part of our Intelligence Matters investigation. In my last post, we saw that AI means a lot of things to a lot of people. These dueling definitions each have a deep history — ok fine, …

Four short links: 21 July 2014

By Nat Torkington
July 21, 2014

nupic (github) -GPL v3-licensed ode from Numenta, at last. See their patent position. Robocup — soccer robotics contest, condition of entry is that all codes are open sourced after the contest. (via The Economist) Security Data Science Paper Collection — …

Four short links: 21 July 2014

By Nat Torkington
July 21, 2014

nupic (github) -GPL v3-licensed ode from Numenta, at last. See their patent position. Robocup — soccer robotics contest, condition of entry is that all codes are open sourced after the contest. (via The Economist) Security Data Science Paper Collection — …

Four short links: 21 July 2014

By Nat Torkington
July 21, 2014

nupic (github) -GPL v3-licensed ode from Numenta, at last. See their patent position. Robocup — soccer robotics contest, condition of entry is that all codes are open sourced after the contest. (via The Economist) Security Data Science Paper Collection — …

A good nudge trumps a good prediction

By Simon Chan
July 17, 2014

Editor’s note: this is part of our investigation into analytic models and best practices for their selection, deployment, and evaluation. We all know that a working predictive model is a powerful business weapon. By translating data into insights and subsequent …

A good nudge trumps a good prediction

By Simon Chan
July 17, 2014

Editor’s note: this is part of our investigation into analytic models and best practices for their selection, deployment, and evaluation. We all know that a working predictive model is a powerful business weapon. By translating data into insights and subsequent …

A good nudge trumps a good prediction

By Simon Chan
July 17, 2014

Editor’s note: this is part of our investigation into analytic models and best practices for their selection, deployment, and evaluation. We all know that a working predictive model is a powerful business weapon. By translating data into insights and subsequent …

Four short links: 15 July 2014

By Nat Torkington
July 15, 2014

Inside Data Brokers — very readable explanation of the data brokers and how their information is used to track advertising effectiveness. Elon, I Want My Data! — Telsa don’t give you access to the data that your cars collects. Bodes …

Four short links: 15 July 2014

By Nat Torkington
July 15, 2014

Inside Data Brokers — very readable explanation of the data brokers and how their information is used to track advertising effectiveness. Elon, I Want My Data! — Telsa don’t give you access to the data that your cars collects. Bodes …

Four short links: 15 July 2014

By Nat Torkington
July 15, 2014

Inside Data Brokers — very readable explanation of the data brokers and how their information is used to track advertising effectiveness. Elon, I Want My Data! — Telsa don’t give you access to the data that your cars collects. Bodes …

Four short links: 11 July 2014

By Nat Torkington
July 11, 2014

Awesome Awesomeness — list of curated collections of frameworks and libraries in various languages that do not suck. They solve the problem of “so, I’m new to (language) and don’t want to kiss a lot of frogs before I find …

Four short links: 11 July 2014

By Nat Torkington
July 11, 2014

Awesome Awesomeness — list of curated collections of frameworks and libraries in various languages that do not suck. They solve the problem of “so, I’m new to (language) and don’t want to kiss a lot of frogs before I find …

Four short links: 11 July 2014

By Nat Torkington
July 11, 2014

Awesome Awesomeness — list of curated collections of frameworks and libraries in various languages that do not suck. They solve the problem of “so, I’m new to (language) and don’t want to kiss a lot of frogs before I find …

What is deep learning, and why should you care?

By Pete Warden
July 10, 2014

Editor’s note: this post is part of our Intelligence Matters investigation. When I first ran across the results in the Kaggle image-recognition competitions, I didn’t believe them. I’ve spent years working with machine vision, and the reported accuracy on tricky …

What is deep learning, and why should you care?

By Pete Warden
July 10, 2014

Editor’s note: this post is part of our Intelligence Matters investigation. When I first ran across the results in the Kaggle image-recognition competitions, I didn’t believe them. I’ve spent years working with machine vision, and the reported accuracy on tricky …

What is deep learning, and why should you care?

By Pete Warden
July 10, 2014

Editor’s note: this post is part of our Intelligence Matters investigation. When I first ran across the results in the Kaggle image-recognition competitions, I didn’t believe them. I’ve spent years working with machine vision, and the reported accuracy on tricky …

Four short links: 9 July 2014

By Nat Torkington
July 9, 2014

Developer Inequality (Jonathan Edwards) — The bigger injustice is that programming has become an elite: a vocation requiring rare talents, grueling training, and total dedication. The way things are today if you want to be a programmer you had best …

Four short links: 9 July 2014

By Nat Torkington
July 9, 2014

Developer Inequality (Jonathan Edwards) — The bigger injustice is that programming has become an elite: a vocation requiring rare talents, grueling training, and total dedication. The way things are today if you want to be a programmer you had best …

Four short links: 9 July 2014

By Nat Torkington
July 9, 2014

Developer Inequality (Jonathan Edwards) — The bigger injustice is that programming has become an elite: a vocation requiring rare talents, grueling training, and total dedication. The way things are today if you want to be a programmer you had best …

There are many use cases for graph databases and analytics

By Ben Lorica
July 8, 2014

The rise of sensors and connected devices will lead to applications that draw from network/graph data management and analytics. As the number of devices surpasses the number of people — Cisco estimates 50 billion connected devices by 2020 — one …

There are many use cases for graph databases and analytics

By Ben Lorica
July 8, 2014

The rise of sensors and connected devices will lead to applications that draw from network/graph data management and analytics. As the number of devices surpasses the number of people — Cisco estimates 50 billion connected devices by 2020 — one …

There are many use cases for graph databases and analytics

By Ben Lorica
July 8, 2014

The rise of sensors and connected devices will lead to applications that draw from network/graph data management and analytics. As the number of devices surpasses the number of people — Cisco estimates 50 billion connected devices by 2020 — one …

New approaches to anomaly detection

By Ellen Friedman
July 2, 2014

As new tools for distributed storage and analysis of big data are becoming more stable and widely known, there is a growing need for discovering best practices for analytics at this scale. One of the areas of widespread interest that …

New approaches to anomaly detection

By Ellen Friedman
July 2, 2014

As new tools for distributed storage and analysis of big data are becoming more stable and widely known, there is a growing need for discovering best practices for analytics at this scale. One of the areas of widespread interest that …

New approaches to anomaly detection

By Ellen Friedman
July 2, 2014

As new tools for distributed storage and analysis of big data are becoming more stable and widely known, there is a growing need for discovering best practices for analytics at this scale. One of the areas of widespread interest that …

Questioning the Lambda Architecture

By Jay Kreps
July 1, 2014

Nathan Marz wrote a popular blog post describing an idea he called the Lambda Architecture (“How to beat the CAP theorem“). The Lambda Architecture is an approach to building stream processing applications on top of MapReduce and Storm or similar …

Questioning the Lambda Architecture

By Jay Kreps
July 1, 2014

Nathan Marz wrote a popular blog post describing an idea he called the Lambda Architecture (“How to beat the CAP theorem“). The Lambda Architecture is an approach to building stream processing applications on top of MapReduce and Storm or similar …

Questioning the Lambda Architecture

By Jay Kreps
July 1, 2014

Nathan Marz wrote a popular blog post describing an idea he called the Lambda Architecture (“How to beat the CAP theorem“). The Lambda Architecture is an approach to building stream processing applications on top of MapReduce and Storm or similar …

Four short links: 1 July 2014

By Nat Torkington
July 1, 2014

word2vec — This tool provides an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words. These representations can be subsequently used in many natural language processing applications and for further research. From Google Research …

Four short links: 1 July 2014

By Nat Torkington
July 1, 2014

word2vec — This tool provides an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words. These representations can be subsequently used in many natural language processing applications and for further research. From Google Research …

Four short links: 1 July 2014

By Nat Torkington
July 1, 2014

word2vec — This tool provides an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words. These representations can be subsequently used in many natural language processing applications and for further research. From Google Research …

Four short links: 27 June 2014

By Nat Torkington
June 27, 2014

MillWheel: Fault-Tolerant Stream Processing at Internet Scale — Google Research paper on the tech underlying the new cloud DataFlow tool. Watch the video. Yow. The Integer Overflow Bug That Went to Mars — long-standing (20 year old!) bug in a …

Four short links: 27 June 2014

By Nat Torkington
June 27, 2014

MillWheel: Fault-Tolerant Stream Processing at Internet Scale — Google Research paper on the tech underlying the new cloud DataFlow tool. Watch the video. Yow. The Integer Overflow Bug That Went to Mars — long-standing (20 year old!) bug in a …

Four short links: 27 June 2014

By Nat Torkington
June 27, 2014

MillWheel: Fault-Tolerant Stream Processing at Internet Scale — Google Research paper on the tech underlying the new cloud DataFlow tool. Watch the video. Yow. The Integer Overflow Bug That Went to Mars — long-standing (20 year old!) bug in a …

Roll-your-own database architecture

By Mike Barlow
June 24, 2014

Two years ago, most of the conversations around big data had a futuristic, theoretical vibe. That vibe has been replaced with a gritty sense of practically. Today, when big data or some surrogate term arises in conversation, the talk is …

Roll-your-own database architecture

By Mike Barlow
June 24, 2014

Two years ago, most of the conversations around big data had a futuristic, theoretical vibe. That vibe has been replaced with a gritty sense of practically. Today, when big data or some surrogate term arises in conversation, the talk is …

Roll-your-own database architecture

By Mike Barlow
June 24, 2014

Two years ago, most of the conversations around big data had a futuristic, theoretical vibe. That vibe has been replaced with a gritty sense of practically. Today, when big data or some surrogate term arises in conversation, the talk is …

Four short links: 24 June 2014

By Nat Torkington
June 24, 2014

Maximum Happy Imagination (Matt Jones) — questioning the true vision of Marc Andreessen’s recent Twitter discourse on the great future that awaits us. His analogies run out in the 20th century when it comes to the political, social and economic …

Four short links: 24 June 2014

By Nat Torkington
June 24, 2014

Maximum Happy Imagination (Matt Jones) — questioning the true vision of Marc Andreessen’s recent Twitter discourse on the great future that awaits us. His analogies run out in the 20th century when it comes to the political, social and economic …


1 to 50 of 2068 Next
The Watering Hole