Blogs

BROWSE: Most Recent | Popular Tags |

Tags > learning

Four short links: 16 May 2013

By Nat Torkington
May 16, 2013

Australian Filter Scope Creep — The Federal Government has confirmed its financial regulator has started requiring Australian Internet service providers to block websites suspected of providing fraudulent financial opportunities, in a move which appears to also open the door for …

Google I/O 2013: Android Studio, Google Play Music: All Access, and New Advances in Search

By Rachel Roumeliotis
May 16, 2013

While there was no skydiving this year to show off Google’s new wearable Glass, there were plenty of attendees wearing them proudly including me. This year hardware, however, didn’t take center stage. The focus was on new tools and upgrades …

Four short links: 15 May 2013

By Nat Torkington
May 15, 2013

Facial Recognition in Google Glass (Mashable) — this makes Glass umpty more attractive to me. It was created in a hackathon for doctors to use with patients, but I need it wired into my eyeballs. How to Price Your Hardware …

Evaluating machine learning systems: Kaggle’s not enough

By Beau Cronin
May 11, 2013

There is a tremendous amount of commercial attention on machine learning (ML) methods and applications. This includes product and content recommender systems, predictive models for churn and lead scoring, systems to assist in medical diagnosis, social network sentiment analysis, and …

A different take on data skepticism

By Beau Cronin
May 7, 2013

Recently, the Mathbabe (aka Cathy O’Neil) vented some frustration about the pitfalls in applying even simple machine learning (ML) methods like k-nearest neighbors. As data science is democratized, she worries that naive practitioners will shoot themselves in the foot because these tools can …

Scalable streaming analytics using a single-server

By Ben Lorica
May 5, 2013

For many organizations real-time1 analytics entails complex event processing systems (CEP) or newer distributed stream processing frameworks like Storm, S4, or Spark Streaming. The latter have become more popular because they are able to scale (ingest) massive amounts of data, …

White House Science Fair praises future scientists and makers

By Alex Howard
April 29, 2013

There are few ways to better judge a nation’s character than to look at how its children are educated. What values do their parents, teachers and mentors demonstrate? What accomplishments are celebrated? In a world where championship sports teams are …

10 Questions to Ask Yourself About IT Education

By Kerry Beck
April 27, 2013

When you decide to enter the IT field or to fine-tune your skills so you’ll excel at the IT job you already have, there are some specific questions to ask yourself. Answering them will help you choose the right educational … Continue reading

Simpler workflow tools enable the rapid deployment of models

By Ben Lorica
April 21, 2013

Data science often depends on data pipelines, that involve acquiring, transforming, and loading data. (If you’re fortunate most of the data you need is already in usable form.) Data needs to be assembled and wrangled, before it can be visualized …

A different take on data skepticism

By Beau Cronin
April 19, 2013

Recently, the Mathbabe (aka Cathy O’Neil) vented some frustration about the pitfalls in applying even simple machine learning (ML) methods like k-nearest neighbors. As data science is democratized, she worries that naive practitioners will shoot themselves in the foot because …

Data skepticism

By Mike Loukides
April 11, 2013

A couple of months ago, I wrote that “big data” is heading toward the trough of a hype curve as a result of oversized hype and promises. That’s certainly true. I see more expressions of skepticism about the value of …

Data skepticism

By Mike Loukides
April 11, 2013

A couple of months ago, I wrote that “big data” is heading toward the trough of a hype curve as a result of oversized hype and promises. That’s certainly true. I see more expressions of skepticism about the value of …

Four short links: 9 April 2013

By Nat Torkington
April 9, 2013

Automated Essay Grading To Come to EdX (NY Times) — shortly after we get software that writes stories for us, we get software to read them for us. AMD Calls End of Moore’s Law in Ten Years (ComputerWorld) — story …

Learning Paths for JavaScript

By Simon St. Laurent
April 9, 2013

Everyone learns and teaches JavaScript their own way, but Cody Lindley (@codylindley) has spent a lot of time with a lot of different kinds of learners. He made the jQuery Cookbook happen, finding and managing contributors as well as making …

Four short links: 1 April 2013

By Nat Torkington
April 1, 2013

MLDemos — an open-source visualization tool for machine learning algorithms created to help studying and understanding how several algorithms function and how their parameters affect and modify the results in problems of classification, regression, clustering, dimensionality reduction, dynamical systems and …

Data Science tools: Are you “all in” or do you “mix and match”?

By Ben Lorica
March 31, 2013

An integrated data stack boosts productivity As I noted in my previous post, Python programmers willing to go “all in”, have Python tools to cover most of data science. Lest I be accused of oversimplification, a Python programmer still needs …

Four short links: 25 March 2013

By Nat Torkington
March 25, 2013

Analytics for Learning — Since doing good learning analytics is hard, we often do easy learning analytics and pretend that they are good instead. But pretending doesn’t make it so. (via Dan Meyer) Reproducible Research — a list of links …

Four short links: 11 March 2013

By Nat Torkington
March 11, 2013

Adventures in the Ransom Trade — between insurance, protection, and ransoms, Sean Gourley describes it as “one of the more interesting grey markets.” (via Sean Gourley) About High School Computer Science Teachers (Selena Deckelmann) — Selena gets an education in …

Four short links: 8 March 2013

By Nat Torkington
March 8, 2013

mlcomp — a free website for objectively comparing machine learning programs across various datasets for multiple problem domains. Printing Code: Programming and the Visual Arts (Vimeo) — Rune Madsen’s talk from Heroku’s Waza. (via Andrew Odewahn) What Data Brokers Know …

Four short links: 7 March 2013

By Nat Torkington
March 7, 2013

Pharmacovigilance — Signals from The Crowd (PDF) — in the NY Times’ words: Using automated software tools to examine queries by 6 million Internet users taken from Web search logs in 2010, the researchers looked for searches relating to an …

Natural language annotation for machine learning

By Meghan Blanchette
March 4, 2013

James Pustejovsky (@jamespusto) is an O’Reilly author and professor of computer science at Brandeis. Amber Stubbs (@amber_stubbs) is an O’Reilly author and post doc at SUNY Albany. We sat down to talk about natural language annotation as it relates to …

Four short links: 26 Feb 2013

By Nat Torkington
February 26, 2013

School of Data — free online courses around data science and visualization. libshorttext — classify and analyse short-text of things like titles, questions, sentences, and short messages. MIT-style open source license, Python and C++ source. Letterboxd — a site for …

MLbase: Scalable Machine-learning made accessible

By Ben Lorica
February 22, 2013

In the course of applying machine-learning against large data sets, data scientists face a few pain points. They need to tune and compare several suitable algorithms – a process that may involve having to configure a hodgepodge of tools, requiring …

With Respect to OST’s Executive Director

By Trish Gray
February 14, 2013

A loving heart is the beginning of all knowledge. -Thomas Carlyle A few words I use to describe O’Reilly School of Technology’s Executive Director, Scott Gray: Brilliant. Determined. Loyal. Visionary. Authentic. Of course, I may be a bit biased, after … Continue reading

Four short links: 12 February 2013

By Nat Torkington
February 12, 2013

Your USB Sticks Are Made With Chopsticks (Bunnie Huang) — behind-the-scenes on how USB sticks are made. mutetab — find and kill the Chrome tab making all the damn noise! (via Nelson Minar) Visualization, Modeling, and Surprises (John D Cook) …

Distributed resilience with functional programming

By Simon St. Laurent
February 8, 2013

Functional programming has a long and distinguished heritage of great work — that was only used by a small group of programmers. In a world dominated by individual computers running single processors, the extra cost of thinking functionally limited its …

Distributed resilience with functional programming

By Simon St. Laurent
February 8, 2013

Functional programming has a long and distinguished heritage of great work — that was only used by a small group of programmers. In a world dominated by individual computers running single processors, the extra cost of thinking functionally limited its …

Welcome New Instructor and One-Man Army, John Baker

By Lorri Coey
February 2, 2013

When you hear the term one-man army, what comes to mind? When our newest HTML and JavaScript instructor, John Baker, studied under Peter Patchen at the University of Toledo, Patchen encouraged John to become a one-man army of web development. … Continue reading

Four short links: 24 January 2013

By Nat Torkington
January 24, 2013

Google’s Driverless Car is Worth Trillions (Forbes) — Much of the reporting about Google’s driverless car has mistakenly focused on its science-fiction feel. [...] In fact, the driverless car has broad implications for society, for the economy and for individual …

Why I Moved Across the Country to Join OST

By Debra Woods
January 16, 2013

I joined OST as Academic Director in March 2012. It’s been a hectic ride so far to say the least, but I’ve settled in some and the new year feels like the perfect time to introduce myself and share the … Continue reading

Four short links: 18 December 2012

By Nat Torkington
December 18, 2012

Credibility Ranking of Tweets During High Impact Events (PDF) — interesting research. Situational awareness information is information that leads to gain in the knowledge or update about details of the event, like the location, people aff ected, causes, etc. We found …

Software that keeps an eye on Grandma

By Jon Bruner
November 21, 2012

Much of health care — particularly for the elderly — is about detecting change, and, as the mobile health movement would have it, computers are very good at that. Given enough sensors, software can model an individual’s behavior patterns and …

Software that keeps an eye on Grandma

By Jon Bruner
November 15, 2012

Much of health care — particularly for the elderly — is about detecting change, and, as the mobile health movement would have it, computers are very good at that. Given enough sensors, software can model an individual’s behavior patterns and …

Strata Week: AWS and Rackspace, compared

By Jenn Webb
November 2, 2012

Here are a few stories from the data space that caught my attention this week. Rackspace vs Amazon As Rackspace continues to ramp up its services to compete with Amazon Web Services (AWS) — this week, announcing a partnership with …

Two crucial questions for the smart grid

By Jon Bruner
October 31, 2012

In a lively panel discussion at last week’s IEEE Industrial Electronics Society meeting in Montreal, two questions related to the smart grid (the prospective electrical distribution system that will set prices dynamically and let consumers sell electricity to other users …

Listening for tired machinery

By Jon Bruner
October 26, 2012

Software is making its way into places where it hasn’t usually been before, like the cutting surfaces of very fast, ultra-precise machine tools. A high-speed milling machine can run at 42,000 RPM as it fabricates high-quality machine components within tolerances …

Four short links: 18 October 2012

By Nat Torkington
October 18, 2012

Let’s Pool Our Medical Data (TED) — John Wilbanks (of Science Commons fame) gives a strong talk for creating an open, massive, mine-able database of data about health and genomics from many sources. Money quote: Facebook would never make a …

Spark 0.6 improves performance and accessibility

By Ben Lorica
October 16, 2012

In an earlier post I listed a few reasons why I’ve come to embrace and use Spark. In particular I described why Spark is well-suited for many distributed Big Data Analytics tasks such as iterative computations and interactive queries, where …

Four short links: 9 October 2012

By Nat Torkington
October 9, 2012

Finland Crowdsourcing New Laws (GigaOm) — online referenda. The Finnish government enabled something called a “citizens’ initiative”, through which registered voters can come up with new laws – if they can get 50,000 of their fellow citizens to back them …

Four short links: 4 October 2012

By Nat Torkington
October 4, 2012

As We May Think (Vannevar Bush) — incredibly prescient piece he wrote for The Atlantic in 1945. Transparency and Topic Models (YouTube) — a talk from DataGotham 2012, by Hanna Wallach. She uses latent Dirichlet allocation topic models to mine …

Four short links: 14 September 2012

By Nat Torkington
September 14, 2012

Post Lingo — automatically transcribe incoming emails from foreign tongues. (via Brian McConnell) All Briefs Should Now Be in Comic Book Form — does wonders for mass audience acceptance of the arguments. (via Andy Lester) Magic Carpet Can Detect and …

Four short links: 13 September 2012

By Nat Torkington
September 13, 2012

Patterns for Research in Machine Learning — every single piece of advice should be tattooed under the eyelids of every beginning programmer, regardless of the field. Milton Friedman’s Thermostat — Everybody knows that if you press down on the gas …

Four short links: 30 August 2012

By Nat Torkington
August 30, 2012

TOS;DR — terms of service rendered comprehensible. “Make the hard stuff easy” is a great template for good ideas, and this just nails it. Sick of Impact Factors — typically only 15% of the papers in a journal account for …

Seven reasons why I like Spark

By Ben Lorica
August 21, 2012

A large portion of this week’s Amp Camp at UC Berkeley, is devoted to an introduction to Spark – an open source, in-memory, cluster computing framework. After playing with Spark over the last month, I’ve come to consider it a …

A grisly job for data scientists

By Jon Bruner
August 13, 2012

Javier Reveron went missing from Ohio in 2004. His wallet turned up in New York City, but he was nowhere to be found. By the time his parents arrived to search for him and hand out fliers, his remains had …

Four short links: 26 July 2012

By Nat Torkington
July 26, 2012

Drones Over Somalia are Hazard to Air Traffic (Washington Post) — In a recently completed report, U.N. officials describe several narrowly averted disasters in which drones crashed into a refu­gee camp, flew dangerously close to a fuel dump and almost …

The Lean Startup Cycle - Learning

By Kevin Shockey
July 3, 2012

One trait nearly every successful startup shares is endless market research. While innovation accounting covers a broader spectrum, assumptions about your users or community are some of the most important and should be tested early and often..

In defense of frivolities and open-ended experiments

By Bradley Voytek
June 8, 2012

Before you scoff at the pointlessness of yet another social network, web app, or project, remember that we don't always do the research or build the company that is immediately useful or profitable.

Strata Week: Data prospecting with Kaggle

By Audrey Watters
June 7, 2012

In this week's data news, Kaggle launches Prospect, HP unveils its big data plans, and Cloudera releases CDH4 (the latest version of its Hadoop distribution).

Strata Week: Data prospecting with Kaggle

Strata Week: Data prospecting with Kaggle
By Audrey Watters
June 7, 2012

In this week's data news, Kaggle launches Prospect, HP unveils its big data plans, and Cloudera releases CDH4 (the latest version of its Hadoop distribution).


1 to 50 of 132 Next
The Watering Hole