Book description
With the explosion of data, computing power, and cloud data warehouses, SQL has become an even more indispensable tool for the savvy analyst or data scientist. This practical book reveals new and hidden ways to improve your SQL skills, solve problems, and make the most of SQL as part of your workflow.
You'll learn how to use both common and exotic SQL functions such as joins, window functions, subqueries, and regular expressions in new, innovative ways--as well as how to combine SQL techniques to accomplish your goals faster, with understandable code. If you work with SQL databases, this is a must-have reference.
- Learn the key steps for preparing your data for analysis
- Perform time series analysis using SQL's date and time manipulations
- Use cohort analysis to investigate how groups change over time
- Use SQL's powerful functions and operators for text analysis
- Detect outliers in your data and replace them with alternate values
- Establish causality using experiment analysis, also known as A/B testing
Publisher resources
Table of contents
- Preface
- 1. Analysis with SQL
- 2. Preparing Data for Analysis
- 3. Time Series Analysis
- 4. Cohort Analysis
- 5. Text Analysis
- 6. Anomaly Detection
- 7. Experiment Analysis
- 8. Creating Complex Data Sets for Analysis
- 9. Conclusion
- Index
Product information
- Title: SQL for Data Analysis
- Author(s):
- Release date: September 2021
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781492088783
You might also like
book
SQL for Data Analytics
Take your first steps to become a fully qualified data analyst by learning how to explore …
book
SQL Queries for Mere Mortals: A Hands-On Guide to Data Manipulation in SQL, 4th Edition
The #1 Easy, Common-Sense Guide to SQL Queries—Updated with More Advanced Techniques and Solutions Foreword by …
book
Data Analysis with Python and PySpark
Think big about your data! PySpark brings the powerful Spark big data processing engine to the …
book
Analytics Engineering with SQL and dbt
With the shift from data warehouses to data lakes, data now lands in repositories before it's …