Awesome client
Spider orchestrated by Airflow
Scrapper quotes from famous authors using the Scrapy and Apache Airflow as an orchestration tool.
Project: https://github.com/octadelsueldo/scrapy-crawl-airflow.
This is the repository where you will find a scrapper for the 'http://quotes.toscrape.com/' website using the Python Scrapy library and Apache Airflow as an orchestration and production tool. Last but not least, the extracted information has been stored in an SQLite database for later analytical treatment.
"The process of initialize Airflow locally using Docker was, by far, the most dificult part of the project".