Book description
Learn a modern approach to data analysis using Python to harness the power of programming and AI across your data. Detailed case studies bring this modern approach to life across visual data, social media, graph algorithms, and time series analysis.
Key Features
- Bridge your data analysis with the power of programming, complex algorithms, and AI
- Use Python and its extensive libraries to power your way to new levels of data insight
- Work with AI algorithms, TensorFlow, graph algorithms, NLP, and financial time series
- Explore this modern approach across with key industry case studies and hands-on projects
Book Description
Data Analysis with Python offers a modern approach to data analysis so that you can work with the latest and most powerful Python tools, AI techniques, and open source libraries. Industry expert David Taieb shows you how to bridge data science with the power of programming and algorithms in Python. You'll be working with complex algorithms, and cutting-edge AI in your data analysis. Learn how to analyze data with hands-on examples using Python-based tools and Jupyter Notebook. You'll find the right balance of theory and practice, with extensive code files that you can integrate right into your own data projects.
Explore the power of this approach to data analysis by then working with it across key industry case studies. Four fascinating and full projects connect you to the most critical data analysis challenges you're likely to meet in today. The first of these is an image recognition application with TensorFlow ? embracing the importance today of AI in your data analysis. The second industry project analyses social media trends, exploring big data issues and AI approaches to natural language processing. The third case study is a financial portfolio analysis application that engages you with time series analysis - pivotal to many data science applications today. The fourth industry use case dives you into graph algorithms and the power of programming in modern data science. You'll wrap up with a thoughtful look at the future of data science and how it will harness the power of algorithms and artificial intelligence.
What you will learn
- A new toolset that has been carefully crafted to meet for your data analysis challenges
- Full and detailed case studies of the toolset across several of today's key industry contexts
- Become super productive with a new toolset across Python and Jupyter Notebook
- Look into the future of data science and which directions to develop your skills next
Who this book is for
This book is for developers wanting to bridge the gap between them and data scientists. Introducing PixieDust from its creator, the book is a great desk companion for the accomplished Data Scientist. Some fluency in data interpretation and visualization is assumed. It will be helpful to have some knowledge of Python, using Python libraries, and some proficiency in web development.
Table of contents
-
Data Analysis with Python
- Table of Contents
- Data Analysis with Python
- Contributors
- Preface
-
1. Programming and Data Science – A New Toolset
- What is data science
- Is data science here to stay?
- Why is data science on the rise?
- What does that have to do with developers?
- Putting these concepts into practice
- Deep diving into a concrete example
- Data pipeline blueprint
- What kind of skills are required to become a data scientist?
- IBM Watson DeepQA
- Back to our sentiment analysis of Twitter hashtags project
- Lessons learned from building our first enterprise-ready data pipeline
- Data science strategy
- Jupyter Notebooks at the center of our strategy
- Summary
-
2. Python and Jupyter Notebooks to Power your Data Analysis
- Why choose Python?
- Introducing PixieDust
- SampleData – a simple API for loading data
- Wrangling data with pixiedust_rosie
- Display – a simple interactive API for data visualization
- Filtering
- Bridging the gap between developers and data scientists with PixieApps
- Architecture for operationalizing data science analytics
- Summary
-
3. Accelerate your Data Analysis with Python Libraries
-
Anatomy of a PixieApp
- Routes
- Generating requests to routes
- A GitHub project tracking sample application
- Displaying the search results in a table
- Invoking the PixieDust display() API using pd_entity attribute
- Invoking arbitrary Python code with pd_script
- Making the application more responsive with pd_refresh
- Creating reusable widgets
- Summary
-
Anatomy of a PixieApp
- 4. Publish your Data Analysis to the Web - the PixieApp Tool
-
5. Python and PixieDust Best Practices and Advanced Concepts
- Use @captureOutput decorator to integrate the output of third-party Python libraries
- Increase modularity and code reuse
- Run Node.js inside a Python Notebook
- Summary
- 6. Analytics Study: AI and Image Recognition with TensorFlow
-
7. Analytics Study: NLP and Big Data with Twitter Sentiment Analysis
- Getting started with Apache Spark
- Twitter sentiment analysis application
- Part 1 – Acquiring the data with Spark Structured Streaming
- Part 2 – Enriching the data with sentiment and most relevant extracted entity
- Part 3 – Creating a real-time dashboard PixieApp
- Part 4 – Adding scalability with Apache Kafka and IBM Streams Designer
- Summary
- 8. Analytics Study: Prediction - Financial Time Series Analysis and Forecasting
-
9. Analytics Study: Graph Algorithms - US Domestic Flight Data Analysis
- Introduction to graphs
- Getting started with the networkx graph library
- Part 1 – Loading the US domestic flight data into a graph
- Part 2 – Creating the USFlightsAnalysis PixieApp
- Part 3 – Adding data exploration to the USFlightsAnalysis PixieApp
- Part 4 – Creating an ARIMA model for predicting flight delays
- Summary
- 10. The Future of Data Analysis and Where to Develop your Skills
- A. PixieApp Quick-Reference
- Other Books You May Enjoy
- Index
Product information
- Title: Data Analysis with Python
- Author(s):
- Release date: December 2018
- Publisher(s): Packt Publishing
- ISBN: 9781789950069
You might also like
book
Python Data Analysis - Third Edition
Understand data analysis pipelines using machine learning algorithms and techniques with this practical guide Key Features …
book
Hands-On Exploratory Data Analysis with Python
Discover techniques to summarize the characteristics of your data using PyPlot, NumPy, SciPy, and pandas Key …
book
Python Data Analysis - Second Edition
Learn how to apply powerful data analysis techniques with popular open source Python modules About This …
book
Python for Data Analysis
Python for Data Analysis is concerned with the nuts and bolts of manipulating, processing, cleaning, and …