Data streaming systems

Data streaming systems are a computerized system that is built with the purpose of managing and processing the data in motion. The sheer size, variety, and velocity of big data add further challenges to these systems. Such systems are designed to manage relatively simple computations, such as one record at a time or a set of objects in a short time window of the most recent data:

Figure 7.4: Illustration of data streaming system 

In a data-steaming system, the computations are done in near real-time, sometimes in memory, and as independent computations. The processing components often subscribe to a system, or a stream ...

Get Hands-On Big Data Modeling now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.