Video description
Web scraping is the process of extracting desired data from websites. In this course, you will learn and master web scraping using Python and Scrapy through a step-by-step, in-depth guide.
The course starts by introducing the web scraping process with infographics and no code, showing how data is scraped from websites and where Scrapy fits in. With the basics clear, you will build an actual web scraper using Python and the Scrapy framework and see first-hand how web scraping works. You will then look at the essential concepts of web scraping and Scrapy. Knowing how to scrape websites and understanding the essentials already makes you a complete web scraper, but you will go further and learn advanced web scraping techniques to become an expert.
Advanced topics are explained thoroughly: crawling multiple pages and extracting data (pagination), scraping data using regular expressions (RegEx), and scraping dynamic or JavaScript-rendered websites using Scrapy Playwright. Finally, you will complete three projects: Champions League Table [ESPN], Product Tracker [Amazon], and Scraper Application [GUI].
By the end of this course, you will have learned how to do web scraping using Python and Scrapy.
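As a rough illustration of the regular-expression scraping mentioned above, the sketch below pulls prices out of an HTML fragment using only Python's standard `re` module. The markup, the `price` class name, and the `extract_prices` helper are assumptions made up for this example; they are not taken from the course.

```python
import re

# Hypothetical HTML fragment; the markup and class name are illustrative only.
html = (
    '<li>Widget <span class="price">$19.99</span></li>'
    '<li>Gadget <span class="price">$5.00</span></li>'
)

# Capture the numeric part of a dollar amount inside a class="price" element.
PRICE_RE = re.compile(r'class="price">\$(\d+\.\d{2})<')


def extract_prices(text):
    """Return every matched price in the text as a float."""
    return [float(match) for match in PRICE_RE.findall(text)]


prices = extract_prices(html)
```

Regular expressions work best for small, predictable patterns like this one; for navigating document structure, CSS selectors or XPath are usually the more robust choice.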
What You Will Learn
- Send a request to a URL to scrape websites using Scrapy Spider
- Get the HTML Response from the URL and parse it for web scraping
- Use Scrapy shell commands to test and verify CSS Selectors or XPath
- Export and save scraped data to online databases such as MongoDB
- Scrape data from multiple web pages using Scrapy pagination
- Log in to websites using Scrapy FormRequest with CSRF tokens
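The request, parse, and paginate steps in the list above can be sketched roughly as follows. This is not Scrapy code: it uses only Python's standard library (`html.parser` and `urllib.parse`) as a stand-in, and an in-memory dictionary replaces real HTTP requests so the example runs offline. The URLs, the `title` and `next` class names, and the `ListingParser` and `crawl` names are all hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Hypothetical in-memory "site" mapping URL -> HTML; stands in for HTTP responses.
PAGES = {
    "https://example.com/page/1": (
        '<h2 class="title">Alpha</h2><h2 class="title">Beta</h2>'
        '<a class="next" href="/page/2">Next</a>'
    ),
    "https://example.com/page/2": '<h2 class="title">Gamma</h2>',
}


class ListingParser(HTMLParser):
    """Collects the text of <h2 class="title"> and the href of <a class="next">."""

    def __init__(self):
        super().__init__()
        self.titles = []
        self.next_href = None
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if tag == "h2" and "title" in classes:
            self._in_title = True
            self.titles.append("")
        elif tag == "a" and "next" in classes:
            self.next_href = attrs.get("href")

    def handle_endtag(self, tag):
        if tag == "h2":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.titles[-1] += data


def crawl(start_url, fetch=PAGES.get):
    """Follow "next" links from start_url, gathering titles from each page."""
    titles, url, seen = [], start_url, set()
    while url and url not in seen:
        seen.add(url)  # guard against pagination loops
        html = fetch(url)
        if html is None:  # unknown page: stop crawling
            break
        parser = ListingParser()
        parser.feed(html)
        parser.close()
        titles.extend(t.strip() for t in parser.titles)
        # Resolve a relative "next" link against the current page URL.
        url = urljoin(url, parser.next_href) if parser.next_href else None
    return titles


all_titles = crawl("https://example.com/page/1")
```

In a Scrapy spider, the same loop would typically live in a `parse` callback that yields items and then a follow-up request for the next page, with the framework handling scheduling and duplicate filtering.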
Audience
This course is ideal for beginner Python developers who want to master web scraping and for freelance web scrapers looking to polish their skills. It also suits individuals and college students who are working on projects and want to master web scraping using Python and the Scrapy module. A basic understanding of Python programming is a must; elementary knowledge of HTML is a plus but not mandatory.
About The Author
Rahul Mula: Rahul Mula is a developer specializing in Python, Flutter, and web development. He was intrigued the first time he learned about programming and realized what could be done with it. He loves to explore different technologies and build new applications. He has developed Keyviz, a free and open-source tool that visualizes keystrokes in real time. He has written books and created courses on Python programming, teaching thousands of students.
Table of contents
- Chapter 1 : Introduction to the Course
- Chapter 2 : Scrapy Installation
- Chapter 3 : Scrapy Spider
- Chapter 4 : CSS Selectors
- Chapter 5 : XPath
- Chapter 6 : Scrapy Shell
- Chapter 7 : Scrapy Items
- Chapter 8 : Exporting Data
- Chapter 9 : Scrapy Item Pipeline
- Chapter 10 : Pagination
- Chapter 11 : Following Links
- Chapter 12 : Scraping Tables
- Chapter 13 : Logging into Websites
- Chapter 14 : Scraping JavaScript Rendered Websites
- Chapter 15 : Scrapy Playwright
- Chapter 16 : API Endpoints
- Chapter 17 : Settings
- Chapter 18 : User Agents and Proxies
- Chapter 19 : Tips and Tricks
- Chapter 20 : Project 1: Champions League Table from ESPN.com
- Chapter 21 : Project 2: Amazon Product Rank
- Chapter 22 : Project 3: Extending Scraper with GUI
Product information
- Title: Web Scraping Tutorial with Scrapy and Python for Beginners
- Author(s): Rahul Mula
- Release date: November 2022
- Publisher(s): Packt Publishing
- ISBN: 9781804615317