Book description
Perform fast interactive analytics against different data sources using the Trino high-performance distributed SQL query engine. In the second edition of this practical guide, you'll learn how to conduct analytics on data where it lives, whether it's a data lake using Hive, a modern lakehouse with Iceberg or Delta Lake, a different system like Cassandra, Kafka, or SingleStore, or a relational database like PostgreSQL or Oracle.
Analysts, software engineers, and production engineers learn how to manage, use, and even develop with Trino and make it a critical part of their data platform. Authors Matt Fuller, Manfred Moser, and Martin Traverso show you how a single Trino query can combine data from multiple sources to allow for analytics across your entire organization.
- Explore Trino's use cases, and learn about tools that help you connect to Trino for querying and processing huge amounts of data
- Learn Trino's internal workings, including how to connect to and query data sources with support for SQL statements, operators, functions, and more
- Deploy and secure Trino at scale, monitor workloads, tune queries, and connect more applications
- Learn how other organizations apply Trino successfully
Publisher resources
Table of contents
- Foreword
- Preface
- I. Getting Started with Trino
-
1. Introducing Trino
- The Problems with Big Data
- Trino to the Rescue
-
Trino Use Cases
- One SQL Analytics Access Point
- Access Point to Data Warehouse and Source Systems
- Provide SQL-Based Access to Anything
- Federated Queries
- Semantic Layer for a Virtual Data Warehouse
- Data Lake Query Engine
- SQL Conversions and ETL
- Better Insights Due to Faster Response Times
- Big Data, Machine Learning, and Artificial Intelligence
- Other Use Cases
- Trino Resources
- A Brief History of Trino
- Conclusion
- 2. Installing and Configuring Trino
- 3. Using Trino
- II. Diving Deeper into Trino
- 4. Trino Architecture
- 5. Production-Ready Deployment
-
6. Connectors
- Configuration
- RDBMS Connector Example: PostgreSQL
- Trino TPC-H and TPC-DS Connectors
- Hive Connector for Distributed Storage Data Sources
- Modern Distributed Storage Management and Analytics
- Non-Relational Data Sources
- Trino JMX Connector
- Black Hole Connector
- Memory Connector
- Other Connectors
- Conclusion
- 7. Advanced Connector Examples
-
8. Using SQL in Trino
- Trino Statements
- Trino System Tables
- Catalogs
- Schemas
- Information Schema
- Tables
- Views
- Session Information and Configuration
- Data Types
- SELECT Statement Basics
- WHERE Clause
- GROUP BY and HAVING Clauses
- ORDER BY and LIMIT Clauses
- JOIN Statements
- UNION, INTERSECT, and EXCEPT Clauses
- Grouping Operations
- WITH Clause
- Subqueries
- Deleting Data from a Table
- Conclusion
-
9. Advanced SQL
- Functions and Operators Introduction
- Scalar Functions and Operators
- Boolean Operators
- Logical Operators
- Range Selection with the BETWEEN Statement
- Value Detection with IS (NOT) NULL
- Mathematical Functions and Operators
- Trigonometric Functions
- Constant and Random Functions
- String Functions and Operators
- Strings and Maps
- Unicode
- Regular Expressions
- Unnesting Complex Data Types
- JSON Functions
- Date and Time Functions and Operators
- Histograms
- Aggregate Functions
- Window Functions
- Lambda Expressions
- Geospatial Functions
- Prepared Statements
- Conclusion
- III. Trino in Real-World Uses
- 10. Security
- 11. Integrating Trino with Other Tools
- 12. Trino in Production
- 13. Real-World Examples
- Conclusion
- Index
- About the Authors
Product information
- Title: Trino: The Definitive Guide, 2nd Edition
- Author(s):
- Release date: October 2022
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781098137236
You might also like
book
Trino: The Definitive Guide
Perform fast interactive analytics against different data sources using the Trino high-performance distributed SQL query engine. …
book
Designing Data-Intensive Applications, 2nd Edition
Data is at the center of many challenges in system design today. Difficult issues such as …
book
Learning Go, 2nd Edition
Go has rapidly become the preferred language for building web services. Plenty of tutorials are available …
book
Kafka: The Definitive Guide, 2nd Edition
Every enterprise application creates data, whether it consists of log messages, metrics, user activity, or outgoing …