Scraping the Web for Fun and Profit

Scraping the Web for Fun and Profit

English | MP4 | AVC 1920×1080 | AAC 44KHz 2ch | 16 Lessons (6h 33m) | 1.95 GB

This course will teach you how to use web scraping to create profitable projects. You will learn the basics of scraping using Python libraries (e.g. Beautiful Soup), script acceleration techniques, blocking bypass, scraping secure sites, and working with unofficial APIs. Included are over 10 videos with real examples, source code, and material to be added in the future. The course is flexible and allows you to choose lessons depending on your needs.

You’ll learn:

  • The basics of web scraping using Python libraries such as Beautiful Soup.
  • Effective strategies to speed up your scraping scripts and reduce error rates.
  • How to avoid getting blocked.
  • Advanced screen scraping methods that will allow you to scrape websites that require a login.
  • How to reverse engineer browser requests.
  • How to find undocumented APIs and use them to extract information that’s not publicly available.
  • What I’ve learned about building and selling data products.
Table of Contents

1 Quick Introduction and Overview
2 What I learned about selling data products
3 Bypassing Anti-Scraping Measures – Headers, Rotating Proxies, Scraping APIs, Javascript Rendering
4 Scraping RallyRd – Advanced Screen Scraping w- Selenium
5 Scraping RallyRd – Advanced Scraping of Data Behind a Login
6 Scraping Instagram Leads via Duck Duck Go
7 Scraping Shopify Sites, Reddit, Indeed, Upwork – Alternative Formats JSON, RSS
8 Scraping Messari – GraphQL Scraping and Data Flattening
9 Scraping YC Companies and Cryptocurrencies – Using Algolia
10 Scraping Udemy Courses – Leveraging Undocumented Internal APIs
11 Scraping SearchMySite.com – Post Requests and the Curl Convert Trick
12 Scraping all Pitchbook Profiles – Method 3 Sitemap Scraping
13 Scraping all Pitchbook Profiles – Method 2 Search Engine Scraping
14 Scraping all Pitchbook Profiles – Approach 1 Brute Force
15 Scraping Goodreads Part 2 – (try-except, iterating over pages)
16 Scraping GoodReads Quotes – (Requests and BeautifulSoup Basics)

Homepage