Awesome client

Spider orchestrated by Airflow

Scrapper quotes from famous authors using the Scrapy and Apache Airflow as an orchestration tool.
Project image

Project: https://github.com/octadelsueldo/scrapy-crawl-airflow.

This is the repository where you will find a scrapper for the 'http://quotes.toscrape.com/' website using the Python Scrapy library and Apache Airflow as an orchestration and production tool. Last but not least, the extracted information has been stored in an SQLite database for later analytical treatment.

"The process of initialize Airflow locally using Docker was, by far, the most dificult part of the project".