Blogs

BROWSE: Most Recent | Popular Tags |

Tags > big data

Four short links: 21 July 2014

By Nat Torkington
July 21, 2014

nupic (github) -GPL v3-licensed ode from Numenta, at last. See their patent position. Robocup — soccer robotics contest, condition of entry is that all codes are open sourced after the contest. (via The Economist) Security Data Science Paper Collection — …

Four short links: 21 July 2014

By Nat Torkington
July 21, 2014

nupic (github) -GPL v3-licensed ode from Numenta, at last. See their patent position. Robocup — soccer robotics contest, condition of entry is that all codes are open sourced after the contest. (via The Economist) Security Data Science Paper Collection — …

Four short links: 21 July 2014

By Nat Torkington
July 21, 2014

nupic (github) -GPL v3-licensed ode from Numenta, at last. See their patent position. Robocup — soccer robotics contest, condition of entry is that all codes are open sourced after the contest. (via The Economist) Security Data Science Paper Collection — …

Four short links: 15 July 2014

By Nat Torkington
July 15, 2014

Inside Data Brokers — very readable explanation of the data brokers and how their information is used to track advertising effectiveness. Elon, I Want My Data! — Telsa don’t give you access to the data that your cars collects. Bodes …

Four short links: 15 July 2014

By Nat Torkington
July 15, 2014

Inside Data Brokers — very readable explanation of the data brokers and how their information is used to track advertising effectiveness. Elon, I Want My Data! — Telsa don’t give you access to the data that your cars collects. Bodes …

Four short links: 15 July 2014

By Nat Torkington
July 15, 2014

Inside Data Brokers — very readable explanation of the data brokers and how their information is used to track advertising effectiveness. Elon, I Want My Data! — Telsa don’t give you access to the data that your cars collects. Bodes …

Four short links: 9 July 2014

By Nat Torkington
July 9, 2014

Developer Inequality (Jonathan Edwards) — The bigger injustice is that programming has become an elite: a vocation requiring rare talents, grueling training, and total dedication. The way things are today if you want to be a programmer you had best …

Four short links: 9 July 2014

By Nat Torkington
July 9, 2014

Developer Inequality (Jonathan Edwards) — The bigger injustice is that programming has become an elite: a vocation requiring rare talents, grueling training, and total dedication. The way things are today if you want to be a programmer you had best …

Four short links: 9 July 2014

By Nat Torkington
July 9, 2014

Developer Inequality (Jonathan Edwards) — The bigger injustice is that programming has become an elite: a vocation requiring rare talents, grueling training, and total dedication. The way things are today if you want to be a programmer you had best …

Four short links: 1 July 2014

By Nat Torkington
July 1, 2014

word2vec — This tool provides an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words. These representations can be subsequently used in many natural language processing applications and for further research. From Google Research …

Four short links: 1 July 2014

By Nat Torkington
July 1, 2014

word2vec — This tool provides an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words. These representations can be subsequently used in many natural language processing applications and for further research. From Google Research …

Four short links: 1 July 2014

By Nat Torkington
July 1, 2014

word2vec — This tool provides an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words. These representations can be subsequently used in many natural language processing applications and for further research. From Google Research …

Four short links: 27 June 2014

By Nat Torkington
June 27, 2014

MillWheel: Fault-Tolerant Stream Processing at Internet Scale — Google Research paper on the tech underlying the new cloud DataFlow tool. Watch the video. Yow. The Integer Overflow Bug That Went to Mars — long-standing (20 year old!) bug in a …

Four short links: 27 June 2014

By Nat Torkington
June 27, 2014

MillWheel: Fault-Tolerant Stream Processing at Internet Scale — Google Research paper on the tech underlying the new cloud DataFlow tool. Watch the video. Yow. The Integer Overflow Bug That Went to Mars — long-standing (20 year old!) bug in a …

Four short links: 27 June 2014

By Nat Torkington
June 27, 2014

MillWheel: Fault-Tolerant Stream Processing at Internet Scale — Google Research paper on the tech underlying the new cloud DataFlow tool. Watch the video. Yow. The Integer Overflow Bug That Went to Mars — long-standing (20 year old!) bug in a …

Four short links: 24 June 2014

By Nat Torkington
June 24, 2014

Maximum Happy Imagination (Matt Jones) — questioning the true vision of Marc Andreessen’s recent Twitter discourse on the great future that awaits us. His analogies run out in the 20th century when it comes to the political, social and economic …

Four short links: 24 June 2014

By Nat Torkington
June 24, 2014

Maximum Happy Imagination (Matt Jones) — questioning the true vision of Marc Andreessen’s recent Twitter discourse on the great future that awaits us. His analogies run out in the 20th century when it comes to the political, social and economic …

Four short links: 24 June 2014

By Nat Torkington
June 24, 2014

Maximum Happy Imagination (Matt Jones) — questioning the true vision of Marc Andreessen’s recent Twitter discourse on the great future that awaits us. His analogies run out in the 20th century when it comes to the political, social and economic …

Four short links: 20 June 2014

By Nat Torkington
June 20, 2014

Dynamo and BigTable — good preso overview of two approaches to solving availability and consistency in the event of server failure or network partition. Goals Gone Wild (PDF) — In this article, we argue that the beneficial effects of goal …

Four short links: 20 June 2014

By Nat Torkington
June 20, 2014

Dynamo and BigTable — good preso overview of two approaches to solving availability and consistency in the event of server failure or network partition. Goals Gone Wild (PDF) — In this article, we argue that the beneficial effects of goal …

Four short links: 20 June 2014

By Nat Torkington
June 20, 2014

Dynamo and BigTable — good preso overview of two approaches to solving availability and consistency in the event of server failure or network partition. Goals Gone Wild (PDF) — In this article, we argue that the beneficial effects of goal …

Four short links: 9 June 2014

By Nat Torkington
June 9, 2014

textql — execute SQL against structured text like CSV or TSV. Social Network Structure of Fake Friends — author bought 4,000 Twitter followers and studied their relationships. Hidden Biases in Big Data — with every big data set, we need …

Four short links: 3 June 2014

By Nat Torkington
June 3, 2014

Machine Learning Done Wrong — [M]ost practitioners pick the modeling algorithm they are most familiar with rather than pick the one which best suits the data. In this post, I would like to share some common mistakes (the don’t-s). Bandits …

A growing number of applications are being built with Spark

By Ben Lorica
May 31, 2014

One of the trends we’re following closely at Strata is the emergence of vertical applications. As components for creating large-scale data infrastructures enter their early stages of maturation, companies are focusing on solving data problems in specific industries rather than …

How to be agile with your big data

By Mike Barlow
May 28, 2014

Data analysis, like other pursuits, is a balancing act. The rise of big data ratchets up the pressure on the traditional enterprise data warehouse (EDW) and associated software tools to handle rapidly evolving sets of new demands posed by the …

Four short links: 26 May 2014

By Nat Torkington
May 23, 2014

Car Alarms and Smoke Alarms (Slideshare) — how to think about and draw the line between sensitivity and specificity. 101 Uses for Content Mining — between the list in the post and the comments from readers, it’s a good introduction …

Four short links: 23 May 2014

By Nat Torkington
May 23, 2014

How to Educate Users (Luke Wroblewski) — help new users in your app, not in a video. Hardware By The Numbers (Renee DiResta) — slides from her keynote at the Solid conference. The mean success rate across all sectors is …

Four short links: 22 May 2014

By Nat Torkington
May 22, 2014

Ferry — helps you create big data clusters on your local machine. Define your big data stack using YAML and share your application with Dockerfiles. Ferry supports Hadoop, Cassandra, Spark, GlusterFS, and Open MPI. What Google Told SEC — For …

Four short links: 1 May 2014

By Nat Torkington
April 30, 2014

US Providers Must Divulge from Offshore Servers (Gigaom) — A U.S. magistrate judge ruled that U.S. cloud vendors must fork over customer data even if that data resides in data centers outside the country. (via Alistair Croll) Inside Google’s Self-Driving …

Four short links: 23 April 2014

By Nat Torkington
April 23, 2014

Samsung UX (Scribd) — little shop of self-catalogued UX horrors, courtesy discovery in a lawsuit. Dated (Android G1 as competition) but rewarding to see there are signs of self-awareness in the companies that inflict unusability on the world. Tools for …

Four short links: 22 April 2014

By Nat Torkington
April 22, 2014

PourOver — NYT open source Javascript for very fast in-browser filtering and sorting of large collections. LibreSSL — OpenBSD take on OpenSSL. Unclear how sustainable this effort is, or how well adopted it will be. Competing with OpenSSL is obviously …

Four short links: 18 April 2014

By Nat Torkington
April 18, 2014

16 Interviewing Tips for User Studies — these apply to many situations beyond user interviews, too. The Backlash Against Big Data contd. (Mike Loukides) — Learn to be a data skeptic. That doesn’t mean becoming skeptical about the value of …

Four short links: 10 April 2014

By Nat Torkington
April 10, 2014

Rise of the Patent Troll: Everything is a Remix (YouTube) — primer on patent trolls, in language anyone can follow. Part of the fixpatents.org campaign. (via BoingBoing) Petabytes of Field Data (GigaOm) — Farm Intelligence using sensors and computer vision …

The backlash against big data, continued

By Mike Loukides
April 9, 2014

Yawn. Yet another article trashing “big data,” this time an op-ed in the Times. This one is better than most, and ends with the truism that data isn’t a silver bullet. It certainly isn’t. I’ll spare you all the links (most of …

The backlash against big data, continued

By Mike Loukides
April 8, 2014

Yawn. Yet another article trashing “big data,” this time an op-ed in the Times. This one is better than most, and ends with the truism that data isn’t a silver bullet. It certainly isn’t. I’ll spare you all the links (most of …

5 Fun Facts about HBase that you didn’t know

By Ben Lorica
April 6, 2014

With HBaseCon right around the corner, I wanted to take stock of one of the more popular1 components in the Hadoop ecosystem. Over the last few years, many more companies have come to rely on HBase to run key products …

Four short links: 2 April 2014

By Nat Torkington
April 2, 2014

Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing (PDF) — Berkeley research paper behind Apache Spark. (via Nelson Minar) Angular Tour — trivially add tour tips (“This is the widget basket, drag and drop for widget goodness!” type …

Wearable intelligence

By Glen Martin
April 1, 2014

The age of ubiquitous computing is accelerating, and it’s creating some interesting social turbulence, particularly where wearable hardware is concerned. Intelligent devices other than phones and screens — smart headsets, glasses, watches, bracelets — are insinuating themselves into our daily …

Four short links: 31 March 2014

By Nat Torkington
March 31, 2014

Game Programming Patterns — a book in progress. Search for the Next Platform (Fred Wilson) — Mobile is now the last thing. And all of these big tech companies are looking for the next thing to make sure they don’t …

Four short links: 28 March 2014

By Nat Torkington
March 28, 2014

WearScript — open source project putting Javascript on Glass. See story on it. (via Slashdot) Mining the World’s Data by Selling Street Lights and Farm Drones (Quartz) — Depending on what kinds of sensors the light’s owners choose to install, …

Four short links: 24 March 2014

By Nat Torkington
March 24, 2014

The Parable of Google Flu (PDF) — We explore two issues that contributed to [Google Flu Trends]’s mistakes—big data hubris and algorithm dynamics—and offer lessons for moving forward in the big data age. Overtrained and underfed? Duktape — a lightweight …

Podcast: thinking with data

By Jon Bruner
March 18, 2014

Max Shron and Jake Porway spoke with me a few weeks ago about frameworks for making reasoned arguments with data. Max’s recent O’Reilly book, Thinking with Data, outlines the crucial process of developing good questions and creating a plan to answer …

Four short links: 18 March 2014

By Nat Torkington
March 18, 2014

On Managers (Mike Migurski) — Managers might be difficult, hostile, or useless, but because they are parts of an explicit power structure they can be evaluated explicitly. Big Data: Humans Required (Sherri Hammons) — the heart of the problem with …

The dangers of data-driven list-making

By Alistair Croll
March 17, 2014

Editor’s note: this post originally appeared on Tilt the Windmill; it is republished here with permission. Startupfest’s Pamela Perotti asked for my thoughts on this great Forbes piece by Lightspeed’s Barry Eggers about using big data to build top ten …

Four short links: 13 March 2014

By Nat Torkington
March 13, 2014

Is Parallel Programming Hard? And, If So, What Can You Do About It? — book by Paul E. McKenney, on single-machine multi-CPU parallel programming. Malignant Computation — The bitcoin mining network would work just as well if it had far …

Four short links: 11 March 2014

By Nat Torkington
March 11, 2014

In-Game Graph Analysis (The Economist) — one MLB team has bought a Cray Ulrika graph-processing appliance for in-game analysis of data. Please hold, boggling. (via Courtney Nash) Disney Bets $1B on Technology (BusinessWeek) — MyMagic+ promises far more radical change. …

Big data and privacy: an uneasy face-off for government to face

By Andy Oram
March 5, 2014

Thrust into controversy by Edward Snowden’s first revelations last year, President Obama belatedly welcomed a “conversation” about privacy. As cynical as you may feel about US spying, that conversation with the federal government has now begun. In particular, the first …

The technical aspects of privacy

By Andy Oram
March 5, 2014

Thrust into controversy by Edward Snowden’s first revelations last year, President Obama belatedly welcomed a “conversation” about privacy. As cynical as you may feel about US spying, that conversation with the federal government has now begun. In particular, the first …

Healthcare Lessons from the Data Sages at Strata

By Bonnie Feldman
February 27, 2014

This article was written with Ellen M. Martin. Most healthcare clinicians don’t often think about donating or sharing data. Yet, after hearing Stephen Friend of Sage Bionetworks talk about involving citizens and patients in the field of genetic research at …

Four short links: 26 February 2014

By Nat Torkington
February 26, 2014

Librarybox 2.0 — fork of PirateBox for the TP-Link MR 3020, customized for educational, library, and other needs. Wifi hotspot with free and anonymous file sharing. v2 adds mesh networking and more. (via BoingBoing) Chicago PD’s Using Big Data to …


1 to 50 of 242 Next
The Watering Hole