|
|
|||
BlogsTags > algorithmsSimpler workflow tools enable the rapid deployment of modelsBy Ben LoricaApril 21, 2013 Data science often depends on data pipelines, that involve acquiring, transforming, and loading data. (If you’re fortunate most of the data you need is already in usable form.) Data needs to be assembled and wrangled, before it can be visualized … Four short links: 8 March 2013By Nat TorkingtonMarch 8, 2013 mlcomp — a free website for objectively comparing machine learning programs across various datasets for multiple problem domains. Printing Code: Programming and the Visual Arts (Vimeo) — Rune Madsen’s talk from Heroku’s Waza. (via Andrew Odewahn) What Data Brokers Know … Four short links: 11 January 2013By Nat TorkingtonJanuary 11, 2013 How to Redesign Your App Without Pissing Everybody Off (Anil Dash) — the basic straightforward stuff that gets your users on-side. Anil’s making a career out of being an adult. Clockwork Raven (Twitter) — open source project to send data … Four short links: 17 October 2012By Nat TorkingtonOctober 17, 2012 Beyond Goods and Services: The Unmeasured Rise of the Data-Driven Economy — excellent points about data as neither good nor service, and how data use goes unmeasured by economists and thus doesn’t influence policy. According to statistics from the Bureau … Four short links: 16 October 2012By Nat TorkingtonOctober 16, 2012 cir.ca — news app for iPhone, which lets you track updates and further news on a given story. (via Andy Baio) DataWrangler (Stanford) — an interactive tool for data cleaning and transformation. Spend less time formatting and more time analyzing … Four short links: 8 October 2012
By Nat TorkingtonOctober 8, 2012 Beware the Drones (Washington Times) — the temptation to send difficult to detect, unmanned aircraft into foreign airspace with perceived impunity means policymakers will naturally incline towards aggressive use of drones and hyperactive interventionism, leading us to a future that … Wall Street’s robots are not out to get youBy Renee DiRestaAugust 17, 2012 Technology is critical to today’s financial markets. It’s also surprisingly controversial. In most industries, increasing technological involvement is progress, not a problem. And yet, people who believe that computers should drive cars suddenly become Luddites when they talk about computers … Can Future Advisor be the self-driving car for financial advice?
By Alex HowardJune 4, 2012 Given the turmoil in financial markets and uncertainty abroad, good financial advice has never been more valuable. Startup Future Advisor looks to democratize personalized financial advice using the Internet, data and algorithms. Can Future Advisor be the self-driving car for financial advice?By Alex HowardJune 4, 2012 Given the turmoil in financial markets and uncertainty abroad, good financial advice has never been more valuable. Startup Future Advisor looks to democratize personalized financial advice using the Internet, data and algorithms. Four short links: 21 May 2012
By Nat TorkingtonMay 21, 2012 Objectivist C -- very clever. In Objectivist-C, each program is free to acquire as many resources as it can, without interference from the operating system. (via Tim O'Reilly) Zynga and Facebook Stock Oddities (The Atlantic) -- signs of robotrading, a reminder that we're surrounded by algorithms and only notice them when they go awry. The Final ROFLcon and Mobile's... Four short links: 2 May 2012
By Nat TorkingtonMay 2, 2012 Punting on SxSW (Brad Feld) -- I came across this old post and thought: if you can make money by being a dick, or make money by being a caring family person, why would you choose to be a dick? As far as I can tell, being a dick is optional. Brogrammers, take note. Be more like Brad Feld,... Unstructured data is worth the effort when you've got the right toolsBy Suzanne AxtellFebruary 7, 2012 Alyona Medelyan and Anna Divoli are inventing tools to help companies contend with vast quantities of fuzzy data. They discuss their work and what lies ahead for big data in this interview. Unstructured data is worth the effort when you've got the right toolsBy Suzanne AxtellFebruary 7, 2012 Alyona Medelyan and Anna Divoli are inventing tools to help companies contend with vast quantities of fuzzy data. They discuss their work and what lies ahead for big data in this interview. Four short links: 25 July 2011
By Nat TorkingtonJuly 25, 2011 Anonymity in Bitcoin -- TL;DR: Bitcoin is not inherently anonymous. It may be possible to conduct transactions is such a way so as to obscure your identity, but, in many cases, users and their transactions can be identified. We have performed an analysis of anonymity in the Bitcoin system and published our results in a preprint on arXiv. (via... Four short links: 13 July 2011
By Nat TorkingtonJuly 13, 2011 Freebase in Node.js (github) -- handy library for interacting with Freebase from node code. (via Rob McKinnon) Formalize -- CSS library to provide a standard style for form elements. (via Emma Jane Hogbin) Suggesting More Friends Using the Implicit Social Graph (PDF) -- Google paper on the algorithm behind Friend Suggest. Related: Katango. (via Big Data) Dyslexia -- a... Four short links: 13 May 2011
By Nat TorkingtonMay 13, 2011 Mathematical Intimidation: Driven by the Data (PDF) -- excellent article from Notices of the American Mathematical Society about the flaws in "value-added modelling", the latest fad whereby data about students' results in different classes are analysed to identify the effect of each teacher. People recognize that tests are an imperfect measure of educational success, but when sophisticated mathematics is... Trading on sentimentBy Ciara ByrneMay 4, 2011 Sorting through thousands of news stories and categorizing information based on mood and tone creates useful data points for financial systems. Strata Week: Overcharging algorithmsBy Audrey WattersApril 28, 2011 In this Strata Week: Algorithm pricing on Amazon pushes the price of a biology book to astronomical levels, Yahoo weighs the future of Hadoop, and the Supreme Court hears arguments about a Vermont law restricting the data mining of prescription records. Four short links: 12 April 2011
By Nat TorkingtonApril 12, 2011 The Email Game -- game mechanics to get you answering email more efficiently. Can't wait to hear that conversation with corporate IT. "You want us to install what on the Exchange server?" (via Demo Day Wrapup) Stratified B-trees and versioning dictionaries -- A classic versioned data structure in storage and computer science is the copy-on-write (CoW) B-tree -- it... In the future we'll be talking, not typingBy Jenn WebbMarch 23, 2011 Stephan Spencer, co-author of "The Art of SEO," says searching the Internet of the future will be like talking to a human being. Smarter search looks for influence rather than linksBy Jenn WebbMarch 2, 2011 A search algorithm being developed by Princeton University researchers parses language to determine relevance. Academic application is one possibility, but this type of algorithm could also extend to news recommendations. Smarter search looks for influence rather than linksBy Jenn WebbMarch 2, 2011 A search algorithm being developed by Princeton University researchers parses language to determine relevance. Academic application is one possibility, but this type of algorithm could also extend to news recommendations. Four short links: 26 January 2011
By Nat TorkingtonJanuary 26, 2011 Find Communities -- algorithm for uncovering communities in networks of millions of nodes, for producing identifiable subgroups as in LinkedIn InMaps. (via Matt Biddulph's Delicious links) Seven Ways to Think Like The Web (Jon Udell) -- seven principles that will head off a lot of mistakes. They should be seared into the minds of anyone working in the web.... Four short links: 16 December 2010
By Nat TorkingtonDecember 16, 2010 On Compressing Social Networks (PDF) -- paper looking at the theory and practice of compressing social network graphs. Our main innovation here is to come up with a quick and useful method for generating an ordering on the social network nodes so that nodes with lots of common neighbors are near each other in the ordering, a property which... Four short links: 7 December 2010
By Nat TorkingtonDecember 7, 2010 Synopsis Data Structures for Massive Data Sets (PDF) -- survey of data structures that reduce the problem space when dealing with large data sets. (via Pete Warden) Optimism -- you build what you're thinking of. Time to figure out the optimistic future and build that. "Work as if you lived in the early days of a better nation.” --attributed... Four short links: 30 November 2010
By Nat TorkingtonNovember 30, 2010 libgit2 -- a linkable git library. Ruby and Python bindings. Open Data: How Not to Cock It Up -- Tom Steinberg lays it out. Algorithm and Crowd are Not Enough -- My point isn’t that Google, Netflix, Amazon, Yelp or any of the others are doomed. But I do think there’s an opportunity brewing for entrepreneurs, websites and companies... Four short links: 8 November 2010
By Nat TorkingtonNovember 8, 2010 xkcd 576 Made Real -- a Kiwi coder used the API to the local equivalent of eBay to build a real-life version of xkcd cartoon 576. You can follow its daily purchases on Twitter. Linus Torvalds' Greatest Invention (Dominus) -- slides to a talk Mark Jason Dominus gave to the Philadelphia Linux User Group. I like to see informed... Four short links: 22 September 2010
By Nat TorkingtonSeptember 22, 2010 The Rise of Amazon Web Services -- Stephen O'Grady points out that Amazon has become an enterprise sales company but we don't treat it as such because we think of it as a retail company that's dabbling in technology. I think of Amazon as an automation company: they automate and optimize everything, and a data center is just a... Four short links: 30 August 2010
By Nat TorkingtonAugust 30, 2010 Free as in Smokescreen (Mike Shaver) -- H.264, one of the ways video can be delivered in HTML5, is covered by patents. This prevents Mozilla from shipping an H.264 player, which fragments web video. The MPEG LA group who manage the patents for H.264 did a great piece of PR bullshit, saying "this will be permanently royalty-free to consumers".... Four short links: 27 July 2010
By Nat TorkingtonJuly 27, 2010 Digital Continuity Conference Proceedings -- proceedings from a New Zealand conference on digital archiving, preservation, and access for archives, museums, libraries, etc. What Are The Scaling Issues to Keep in Mind While Developing a Social Network Feed? (Quora) -- insight into why you see the failwhale. (via kellan on Twitter) Fan Feeding Frenzy -- Amanda Palmer sells $15k in... Manifold Learning, Calculus & Friendship, and Other Math Links
By Ben LoricaJanuary 17, 2010 One of the largest gatherings of mathematicians, the joint meetings of the AMS/MAA/SIAM, took place last week in San Francisco. Knowing that there were going to be over 6,000 pure and applied mathematicians at Moscone West, I took some time off from work and attended several sessions. Below are a few (somewhat technical) highlights. (It's the only conference I've attended... May Column: Multi-threaded Algorithm Implementations
By George T. HeinemanJune 19, 2009 Many of the algorithms presented in the book can take advantage of multi-core chips by using threads. In this column we explore effective uses of threads and report on situations where using threads will not help. April Column: Computational Geometry
By George T. HeinemanMay 15, 2009 Computational Geometry is the study of geometric algorithms and the data structures that enable their efficient implementation. We present algorithms that solve the Line Segment Intersection problem and describe challenges in validating their implementation March Column: Network Flow Algorithms
By George T. HeinemanApril 6, 2009 Many real-world transportation networks (i.e., highways, power grids, water subsystems) can be represented as Flow Networks. The Ford-Fulkerson technique describes an approach to solve numerous cost optimization problems. February Column: Improving Performance Of Algorithms
By George T. HeinemanMarch 3, 2009 You need two different techniques for optimizing the performance of an algorithm. Sometimes you improve upon the algorithm; sometimes you improve the way the algorithm is implemented. Both techniques are essential to applying algorithms properly. January Column: Algorithm to Solve FreeCell Solitaire Games
By George T. HeinemanJanuary 18, 2009 We describe how to use recursive backtracking to solve a search problem. We design a staged deepening algorithm to solve FreeCell solitaire games. This material is not drawn from the book, but rather helps to place the graph searching algorithms in a better context. December Column: Searching Algorithms
By George T. HeinemanDecember 4, 2008 In the second monthly column for the blog associated with the Algorithms in a Nutshell, we investigate searching algorithms and reaffirm the ability for hash-based search to provide an efficient implementation. November Column: Welcome to Algorithms in a Nutshell
By George T. HeinemanOctober 25, 2008 This is the first of a series of monthly columns in the blog associated with the Algorithms in a Nutshell, published October 2008 by O'Reilly Media, Inc. All algorithms are fully implemented and available in the online code repository associated with the book. 1 to 38 of 38 |
|||
|