| 1. Introduction and sample project (classifieds ads scraping).mp4 | MP4 | 140.2 MB |
| 1. Introduction and sample project (classifieds ads scraping).srt | SRT | 29.3 KB |
| 1. Introduction.mp4 | MP4 | 61.5 MB |
| 1. Introduction.srt | SRT | 13.7 KB |
| 1. Login to websites.mp4 | MP4 | 45.9 MB |
| 1. Login to websites.srt | SRT | 15.3 KB |
| 1. Phone-models project and spider rate-limiting.mp4 | MP4 | 124.2 MB |
| 1. Phone-models project and spider rate-limiting.srt | SRT | 24.2 KB |
| 1. Storing scraped data in MongoDB.mp4 | MP4 | 66 MB |
| 1. Storing scraped data in MongoDB.srt | SRT | 19.5 KB |
| 1. What is Selenium.mp4 | MP4 | 38.1 MB |
| 1. What is Selenium.srt | SRT | 16.7 KB |
| 1. What is Splash.mp4 | MP4 | 32.5 MB |
| 1. What is Splash.srt | SRT | 10.9 KB |
| 1. What is a web bot Is it ethical.mp4 | MP4 | 48.4 MB |
| 1. What is a web bot Is it ethical.srt | SRT | 14.4 KB |
| 1. Xpath 101 node types.mp4 | MP4 | 39 MB |
| 1. Xpath 101 node types.srt | SRT | 13.7 KB |
| 1.1 Classifieds Ads project.html | HTML | 102.4 B |
| 1.1 Login bot.html | HTML | 102.4 B |
| 1.1 MongoDB pipeline.html | HTML | 102.4 B |
| 1.1 Phone Models Project.html | HTML | 102.4 B |
| 1.1 Resources.html | HTML | 102.4 B |
| 1.1 xpath_node_types.png | PNG | 1.2 MB |
| 2. Changing the user-agent.mp4 | MP4 | 26.8 MB |
| 2. Changing the user-agent.srt | SRT | 6.5 KB |
| 2. Introduction to Docker (optional).mp4 | MP4 | 54.9 MB |
| 2. Introduction to Docker (optional).srt | SRT | 18.9 KB |
| 2. Removing ads with duplicate titles.mp4 | MP4 | 42.3 MB |
| 2. Removing ads with duplicate titles.srt | SRT | 8.6 KB |
| 2. Revisiting infinitely-scrolling pages (medium.com).mp4 | MP4 | 174.9 MB |
| 2. Revisiting infinitely-scrolling pages (medium.com).srt | SRT | 36.8 KB |
| 2. Rotating user-agents middleware.mp4 | MP4 | 52.2 MB |
| 2. Rotating user-agents middleware.srt | SRT | 10.4 KB |
| 2. Scrapy installation.html | HTML | 2.8 KB |
| 2. Storing scraped data in MySQL.mp4 | MP4 | 74.9 MB |
| 2. Storing scraped data in MySQL.srt | SRT | 16.6 KB |
| 2. The Scrapy Shell.mp4 | MP4 | 68.5 MB |
| 2. The Scrapy Shell.srt | SRT | 19.4 KB |
| 2. Xpath 102 basic syntax.mp4 | MP4 | 70.9 MB |
| 2. Xpath 102 basic syntax.srt | SRT | 22.1 KB |
| 2.1 MySQL Pipeline.html | HTML | 102.4 B |
| 2.1 Remove duplicates pipeline.html | HTML | 102.4 B |
| 2.1 Rotating user-agents project.html | HTML | 102.4 B |
| 2.1 XPath 102 Cheat Sheet.pdf | PDF | 64 KB |
| 2.1 firefox-how-to.pdf | PDF | 29.4 KB |
| 2.2 Removing duplicates pipeline.html | HTML | 102.4 B |
| 2.2 Revisiting infinitely-scrolling pages (medium.com).html | HTML | 102.4 B |
| 3. Clicking buttons (Yahoo Finance).mp4 | MP4 | 120.8 MB |
| 3. Clicking buttons (Yahoo Finance).srt | SRT | 24.7 KB |
| 3. Creating your first Scrapy project.mp4 | MP4 | 69.3 MB |
| 3. Creating your first Scrapy project.srt | SRT | 18 KB |
| 3. Handling AJAX requests 1.mp4 | MP4 | 70.2 MB |
| 3. Handling AJAX requests 1.srt | SRT | 17.2 KB |
| 3. Removing ads with no phone numbers.mp4 | MP4 | 28 MB |
| 3. Removing ads with no phone numbers.srt | SRT | 5.5 KB |
| 3. Rotating proxies middleware.mp4 | MP4 | 83.5 MB |
| 3. Rotating proxies middleware.srt | SRT | 16 KB |
| 3. Test-driving Splash.mp4 | MP4 | 43.7 MB |
| 3. Test-driving Splash.srt | SRT | 10.4 KB |
| 3. Using Vault to sore sensitive Scrapy settings.mp4 | MP4 | 73 MB |
| 3. Using Vault to sore sensitive Scrapy settings.srt | SRT | 17.3 KB |
| 3. XPath 103 Axes (Node Relations).mp4 | MP4 | 43.4 MB |
| 3. XPath 103 Axes (Node Relations).srt | SRT | 14.4 KB |
| 3.1 Clicking buttons (Yahoo Finance).html | HTML | 102.4 B |
| 3.1 Create your own Scrapy project.html | HTML | 102.4 B |
| 3.1 Dropping Ads with no phones pipeline.html | HTML | 102.4 B |
| 3.1 Handling AJAX requests.html | HTML | 102.4 B |
| 3.1 Rotating proxies.html | HTML | 102.4 B |
| 3.1 Using Vault to store sensitive data for Scrapy.html | HTML | 102.4 B |
| 3.1 XPath 103 Cheat Sheet Axes (node relations).pdf | PDF | 51.5 KB |
| 4. Creating your first Scrapy spider.mp4 | MP4 | 88.2 MB |
| 4. Creating your first Scrapy spider.srt | SRT | 19.2 KB |
| 4. Handling AJAX requests 2.mp4 | MP4 | 45.9 MB |
| 4. Handling AJAX requests 2.srt | SRT | 9.2 KB |
| 4. Integrating Scrapy with Splash.mp4 | MP4 | 112 MB |
| 4. Integrating Scrapy with Splash.srt | SRT | 22.1 KB |
| 4. Revisiting our real-estate web scraping example.mp4 | MP4 | 76.1 MB |
| 4. Revisiting our real-estate web scraping example.srt | SRT | 14.1 KB |
| 4. Storing data to AWS S3 bucket.mp4 | MP4 | 58.1 MB |
| 4. Storing data to AWS S3 bucket.srt | SRT | 14.6 KB |
| 4.1 Create your own Scrapy spider.html | HTML | 102.4 B |
| 4.1 Handling AJAX requests.html | HTML | 102.4 B |
| 4.1 S3 Pipeline.html | HTML | 102.4 B |
| 4.1 Wikipedia with Splash.html | HTML | 102.4 B |
| 5. Dealing with infinitely-scrolling pages using Splash.mp4 | MP4 | 123.7 MB |
| 5. Dealing with infinitely-scrolling pages using Splash.srt | SRT | 27.8 KB |
| 5. Handling AJAX requests 3.mp4 | MP4 | 38.2 MB |
| 5. Handling AJAX requests 3.srt | SRT | 7.1 KB |
| 5. Handling combined queries using the getall() method.mp4 | MP4 | 60.3 MB |
| 5. Handling combined queries using the getall() method.srt | SRT | 10.8 KB |
| 5. Using Amazon Glue and Athena to query the data from S3 (extra lecture).mp4 | MP4 | 59.8 MB |
| 5. Using Amazon Glue and Athena to query the data from S3 (extra lecture).srt | SRT | 13.1 KB |
| 5.1 Combining XPath queries.html | HTML | 102.4 B |
| 5.1 Handling AJAX requests.html | HTML | 102.4 B |
| 5.1 Handling scrolling pages with Splash.html | HTML | 102.4 B |
| 6. Caching responses.mp4 | MP4 | 60.9 MB |
| 6. Caching responses.srt | SRT | 13 KB |
| 6. Data cleansing using Item Loaders.mp4 | MP4 | 114.5 MB |
| 6. Data cleansing using Item Loaders.srt | SRT | 27.5 KB |
| 6.1 Item Loaders.html | HTML | 102.4 B |
| 6.2 The Scrapy project.html | HTML | 102.4 B |
| 7. Image harvesting.mp4 | MP4 | 147.1 MB |
| 7. Image harvesting.srt | SRT | 24.1 KB |
| 7. Pagination and link-following using Crawl Spiders.mp4 | MP4 | 99 MB |
| 7. Pagination and link-following using Crawl Spiders.srt | SRT | 17.9 KB |
| 7.1 Crawl Spiders.html | HTML | 102.4 B |
| 8. Scraped images storage in FTP and AWS S3.mp4 | MP4 | 43.6 MB |
| 8. Scraped images storage in FTP and AWS S3.srt | SRT | 9.7 KB |
| 8.1 Images storage to S3 and FTP.html | HTML | 102.4 B |
| Bonus Resources.txt | TXT | 409.6 B |
| Get Bonus Downloads Here.url | URL | 204.8 B |