The project developed in the fourth module (Data Extraction I) of the Santander Coders program aims to acquire data from the News API and ensure a reliable, continuous and stable data pipeline.
First of all, make sure you have Git installed on your machine. Then, open Git Bash (Windows) or Terminal (Linux/macOS) in the folder of your choice and type:
-
Access Databricks Community.
-
Import the file "SantanderCoders_Project_Group_3.dbc".
-
The required notebooks will be uploaded to your environment within Databricks.
-
Execute the files in the numerical sequence.