Ted Malaska

Ted Malaska is a solutions architect at Cloudera who has worked on close to 100 clusters for two to three dozen clients, covering hundreds of use cases. Ted has 18 years of professional experience working for startups, the US government, a number of the world’s largest banks, commercial firms, bio firms, retail firms, hardware appliance firms, and the largest nonprofit financial regulator in the US. He has architecture experience across topics such as Hadoop, Web 2.0, mobile, SOA (ESB, BPM), and big data. Ted is a regular contributor to the Hadoop, HBase, and Spark projects, a regular committer to Flume, Avro, Pig, and YARN, and the coauthor of O’Reilly Media’s Hadoop Application Architectures.

Content

Running a word count application using Spark

May 26, 2017

How to use Apache Spark’s Resilient Distributed Dataset (RDD) API.

Shared nothing architectures: Giving Hadoop’s data processing frameworks scalability and fault tolerance

October 7, 2016

A look at the tools and patterns for accessing and processing data in Hadoop.

Best practices for streaming applications

August 11, 2016

Mark Grover and Ted Malaska offer an overview of projects for streaming applications, including Kafka, Flume, and Spark Streaming, and discuss the architectural schemas available, such as Lambda and Kappa.

Modeling your big data enterprise architecture after the human body

June 24, 2016

How decoupling, optimization, and specialization resemble connective systems in our bodies.

What I learned about architecture from running marathons

April 12, 2016

Ted Malaska explains how long hours of training, blisters, and shin splints relate to life-changing lessons in software architecture.

What I learned about software architecture from running a marathon

February 17, 2016

Good code comes from motivation and fresh minds.

Architecting Hadoop Applications

October 27, 2015

In this O'Reilly training video, the "Hadoop Application Architectures" authors present an end-to-end case study of a clickstream analytics engine to provide a concrete example of how to architect and implement a complete solution with Hadoop. In this segment, they provide an overview of the complete architecture. Presenters: Mark Grover, Gwen Shapira, Jonathan Seidman, Ted Malaska