Web Scrapping

History

Name		Name	Last commit message	Last commit date
parent directory ..
linkedin_scraper/linkedin_scraper		linkedin_scraper/linkedin_scraper
README.md		README.md
WebScrapping.ipynb		WebScrapping.ipynb
books_data.csv		books_data.csv

README.md

Automation Tool for Web Scraping

🎯 Goal

The goal of this project is to access the HTML structure of a particular webpage and extract useful information or data from it using Python. This project focuses on scraping book information from Books to Scrape, a mock online bookstore website.

🧾 Description

This project demonstrates the implementation of web scraping in Python. The script scrapes book titles, prices, and availability status from Books to Scrape and saves the extracted data into a CSV file for further analysis.

🧮 Features Implemented

Scrapes book titles, prices, and availability status from Books to Scrape.
Iterates through multiple pages to collect data from the entire catalog.
Saves the scraped data into a structured CSV file for further use or analysis.

📚 Libraries Needed

BeautifulSoup: To parse the HTML content and extract useful information.
Requests: To fetch the HTML content from web pages.
Pandas: To store the extracted data and save it into a CSV file.

📊 Example Output:

The output CSV file books_data.csv will contain data in the following structure:

Title,Price,Availability
A Light in the Attic,£51.77,In stock
Tipping the Velvet,£53.74,In stock
Soumission,£50.10,In stock
...,...,...

AYESHA NAZNIN
|

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

Web Scrapping

Web Scrapping

README.md

Automation Tool for Web Scraping

🎯 Goal

🧾 Description

🧮 Features Implemented

📚 Libraries Needed

📊 Example Output:

Files

Web Scrapping

Directory actions

More options

Directory actions

More options

Latest commit

History

Web Scrapping

Folders and files

parent directory

README.md

Automation Tool for Web Scraping

🎯 Goal

🧾 Description

🧮 Features Implemented

📚 Libraries Needed

📊 Example Output: