Book description
Perform fast interactive analytics against different data sources using the Trino high-performance distributed SQL query engine. With this practical guide, you'll learn how to conduct analytics on data where it lives, whether it's Hive, Cassandra, a relational database, or a proprietary data store. Analysts, software engineers, and production engineers will learn how to manage, use, and even develop with Trino.
Initially developed by Facebook, open source Trino is now used by Amazon, Google, LinkedIn, Lyft, Netflix, Pinterest, Salesforce, Shopify, and many other companies. Matt Fuller, Manfred Moser, and Martin Traverso show you how a single Trino query can combine data from multiple sources to allow for analytics across your entire organization.
- Get started: Explore Trino's use cases and learn about tools that will help you connect to Trino and query data
- Go deeper: Learn Trino's internal workings, including how to connect to and query data sources with support for SQL statements, operators, functions, and more
- Put Trino in production: Secure Trino, monitor workloads, tune queries, and connect more applications; learn how other organizations apply Trino
Publisher resources
Table of contents
- Foreword
- Preface
- I. Getting Started with Trino
-
1. Introducing Trino
- The Problems with Big Data
- Trino to the Rescue
-
Trino Use Cases
- One SQL Analytics Access Point
- Access Point to Data Warehouse and Source Systems
- Provide SQL-Based Access to Anything
- Federated Queries
- Semantic Layer for a Virtual Data Warehouse
- Data Lake Query Engine
- SQL Conversions and ETL
- Better Insights Due to Faster Response Times
- Big Data, Machine Learning, and Artificial Intelligence
- Other Use Cases
- Trino Resources
- A Brief History of Trino
- Conclusion
- 2. Installing and Configuring Trino
- 3. Using Trino
- II. Diving Deeper into Trino
- 4. Trino Architecture
- 5. Production-Ready Deployment
- 6. Connectors
- 7. Advanced Connector Examples
-
8. Using SQL in Trino
- Trino Statements
- Trino System Tables
- Catalogs
- Schemas
- Information Schema
- Tables
- Views
- Session Information and Configuration
- Data Types
- SELECT Statement Basics
- WHERE Clause
- GROUP BY and HAVING Clauses
- ORDER BY and LIMIT Clauses
- JOIN Statements
- UNION, INTERSECT, and EXCEPT Clauses
- Grouping Operations
- WITH Clause
- Subqueries
- Deleting Data from a Table
- Conclusion
-
9. Advanced SQL
- Functions and Operators Introduction
- Scalar Functions and Operators
- Boolean Operators
- Logical Operators
- Range Selection with the BETWEEN Statement
- Value Detection with IS (NOT) NULL
- Mathematical Functions and Operators
- Trigonometric Functions
- Constant and Random Functions
- String Functions and Operators
- Strings and Maps
- Unicode
- Regular Expressions
- Unnesting Complex Data Types
- JSON Functions
- Date and Time Functions and Operators
- Histograms
- Aggregate Functions
- Window Functions
- Lambda Expressions
- Geospatial Functions
- Prepared Statements
- Conclusion
- III. Trino in Real-World Uses
- 10. Security
- 11. Integrating Trino with Other Tools
- 12. Trino in Production
- 13. Real-World Examples
- 14. Conclusion
- Index
Product information
- Title: Trino: The Definitive Guide
- Author(s):
- Release date: April 2021
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781098107710
You might also like
book
Trino: The Definitive Guide, 2nd Edition
Perform fast interactive analytics against different data sources using the Trino high-performance distributed SQL query engine. …
book
Spark: The Definitive Guide
Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the …
book
Kafka: The Definitive Guide, 2nd Edition
Every enterprise application creates data, whether it consists of log messages, metrics, user activity, or outgoing …
book
gRPC: Up and Running
Get a comprehensive understanding of gRPC fundamentals through real-world examples. With this practical guide, you’ll learn …