Book description
Master the intricacies of Apache Storm and develop real-time stream processing applications with ease
About This Book
- Exploit the various real-time processing functionalities offered by Apache Storm such as parallelism, data partitioning, and more
- Integrate Storm with other Big Data technologies like Hadoop, HBase, and Apache Kafka
- An easy-to-understand guide to effortlessly create distributed applications with Storm
Who This Book Is For
If you are a Java developer who wants to enter into the world of real-time stream processing applications using Apache Storm, then this book is for you. No previous experience in Storm is required as this book starts from the basics. After finishing this book, you will be able to develop not-so-complex Storm applications.
What You Will Learn
- Understand the core concepts of Apache Storm and real-time processing
- Follow the steps to deploy multiple nodes of Storm Cluster
- Create Trident topologies to support various message-processing semantics
- Make your cluster sharing effective using Storm scheduling
- Integrate Apache Storm with other Big Data technologies such as Hadoop, HBase, Kafka, and more
- Monitor the health of your Storm cluster
In Detail
Apache Storm is a real-time Big Data processing framework that processes large amounts of data reliably, guaranteeing that every message will be processed. Storm allows you to scale your data as it grows, making it an excellent platform to solve your big data problems. This extensive guide will help you understand right from the basics to the advanced topics of Storm.
The book begins with a detailed introduction to real-time processing and where Storm fits in to solve these problems. You’ll get an understanding of deploying Storm on clusters by writing a basic Storm Hello World example. Next we’ll introduce you to Trident and you’ll get a clear understanding of how you can develop and deploy a trident topology. We cover topics such as monitoring, Storm Parallelism, scheduler and log processing, in a very easy to understand manner. You will also learn how to integrate Storm with other well-known Big Data technologies such as HBase, Redis, Kafka, and Hadoop to realize the full potential of Storm.
With real-world examples and clear explanations, this book will ensure you will have a thorough mastery of Apache Storm. You will be able to use this knowledge to develop efficient, distributed real-time applications to cater to your business needs.
Style and approach
This easy-to-follow guide is full of examples and real-world applications to help you get an in-depth understanding of Apache Storm. This book covers the basics thoroughly and also delves into the intermediate and slightly advanced concepts of application development with Apache Storm.
Table of contents
- Preface
- Real-Time Processing and Storm Introduction
- Storm Deployment, Topology Development, and Topology Options
- Storm Parallelism and Data Partitioning
- Trident Introduction
- Trident Topology and Uses
- Storm Scheduler
- Monitoring of Storm Cluster
- Integration of Storm and Kafka
- Storm and Hadoop Integration
- Storm Integration with Redis, Elasticsearch, and HBase
-
Apache Log Processing with Storm
- Apache log processing elements
- Producing Apache log in Kafka using Logstash
- Splitting the Apache log line
- Identifying country, operating system type, and browser type from the log file
- Calculate the search keyword
- Persisting the process data
- Kafka spout and define topology
- Deploy topology
- MySQL queries
- Summary
- Twitter Tweet Collection and Machine Learning
Product information
- Title: Mastering Apache Storm
- Author(s):
- Release date: August 2017
- Publisher(s): Packt Publishing
- ISBN: 9781787125636
You might also like
book
Mastering Apache Cassandra - Second Edition
Build, manage, and configure high-performing, reliable NoSQL database for your application with Cassandra In Detail With …
book
Learning Apache Cassandra - Second Edition
Build a scalable, fault-tolerant and highly available data layer for your applications using Apache Cassandra About …
book
Getting Started with Storm
Even as big data is turning the world upside down, the next phase of the revolution …
book
Storm Applied
Storm Applied is a practical guide to using Apache Storm for the real-world tasks associated with …