You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Scrapy is a Python web scraping framework, but it also offers a lot of encapsulated async data processing functionality independent of actual web scraping. Since I have now done the same data processing with a Scrapy pipeline and without, and since we want to standardize the role of Scrapy in our data work, I want to reflect on each implementation option and the strengths and tradeoffs of each.
Proposal
Planning to spend an afternoon typing up a document of notes.
The text was updated successfully, but these errors were encountered:
Overview
Scrapy is a Python web scraping framework, but it also offers a lot of encapsulated async data processing functionality independent of actual web scraping. Since I have now done the same data processing with a Scrapy pipeline and without, and since we want to standardize the role of Scrapy in our data work, I want to reflect on each implementation option and the strengths and tradeoffs of each.
Proposal
Planning to spend an afternoon typing up a document of notes.
The text was updated successfully, but these errors were encountered: