Skip to content

Files

Latest commit

 

History

History

Web Scrapping

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 

Automation Tool for Web Scraping

🎯 Goal

The goal of this project is to access the HTML structure of a particular webpage and extract useful information or data from it using Python. This project focuses on scraping book information from Books to Scrape, a mock online bookstore website.

🧾 Description

This project demonstrates the implementation of web scraping in Python. The script scrapes book titles, prices, and availability status from Books to Scrape and saves the extracted data into a CSV file for further analysis.

🧮 Features Implemented

  • Scrapes book titles, prices, and availability status from Books to Scrape.
  • Iterates through multiple pages to collect data from the entire catalog.
  • Saves the scraped data into a structured CSV file for further use or analysis.

📚 Libraries Needed

  • BeautifulSoup: To parse the HTML content and extract useful information.
  • Requests: To fetch the HTML content from web pages.
  • Pandas: To store the extracted data and save it into a CSV file.

📊 Example Output:

The output CSV file books_data.csv will contain data in the following structure:

Title,Price,Availability
A Light in the Attic,£51.77,In stock
Tipping the Velvet,£53.74,In stock
Soumission,£50.10,In stock
...,...,...

AYESHA NAZNIN
GitHub | LinkedIn