This repository has been archived by the owner on Jul 28, 2024. It is now read-only.

A series of Python data scrapers that extract static information from the Vinted source code, which makes them a natural complement to a Vinted API wrapper.


ThalosDev/vinted-db-feeder


Vinted data scraper

📣📣 DEPRECATED: this project has been integrated as a subdirectory of the main repository, vinted-rs.

A series of Python data scrapers that extract static information from the Vinted source code, which makes them a natural complement to a Vinted API wrapper.

Install & Run

Requirements

  • Pip
  • Python 3

  1. Create and activate a virtual environment

    python3 -m venv pyvenv/
    source pyvenv/bin/activate
  2. Install the dependencies listed in requirements.txt

    pip3 install -r requirements.txt
  3. Run main.py

Extracted data categories

| Element | Fields Returned |
| --- | --- |
| Brands | Names, Ids |
| Materials | Id, Name |
| Colors | Id, Color, Hex Code |
| Sizes | Id, Title, Size_Type, Category_id |
| Categories | Id, Title, Code, Parent Id, URL, URL EN, Item Count |
| Categories Children | Category Id, Child Id |
| Countries | Id, French_name, local_name, ISO_code, flag_emoji |

Materials and sizes

  • Available languages: 🇪🇸 🇫🇷 🇺🇸
    • More languages can be added by including the corresponding HTML file in data/raw/materials or data/raw/sizes
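As a rough illustration of how the set of languages could follow from the files present in those directories, here is a minimal sketch. It assumes, hypothetically, that each raw file is named after its language code (for example `es.html`); the actual naming scheme and the helper `available_languages` are not taken from this repository.

```python
from pathlib import Path
import tempfile


def available_languages(raw_dir):
    """Return the language codes found in raw_dir, one per HTML file.

    Assumption: files are named "<language code>.html" (hypothetical scheme).
    """
    return sorted(path.stem for path in Path(raw_dir).glob("*.html"))


# Demonstration on a throwaway directory that stands in for data/raw/materials.
with tempfile.TemporaryDirectory() as tmp:
    for code in ("es", "fr", "en"):
        (Path(tmp) / f"{code}.html").write_text("<html></html>")
    (Path(tmp) / "notes.txt").write_text("not an HTML file, so it is skipped")
    languages = available_languages(tmp)

print(languages)
```

Under that assumption, dropping a new HTML file into the directory is enough for the extra language to be picked up, with no code change.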

Categories - Catalogs

  • Debug mode: Builds the full decision tree

  • Exec mode: Returns 2 CSVs:

    • categories.csv: Table of all the available categories and their attributes

    • categories_children.csv: Dictionary that models the Category->Children list relationship
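The Category->Children relationship can be consumed as an adjacency list. The sketch below is only an illustration: the header names `category_id` and `child_id`, the sample ids, and the helpers `load_children` and `descendants` are assumptions, not the project's actual schema or code.

```python
import csv
import io
from collections import defaultdict


def load_children(csv_file):
    """Map each category id to the list of its direct child ids."""
    children = defaultdict(list)
    for row in csv.DictReader(csv_file):
        # Column names are assumed; adjust them to the real CSV header.
        children[int(row["category_id"])].append(int(row["child_id"]))
    return dict(children)


def descendants(children, root):
    """Return every category below root in depth-first (pre-order) order."""
    stack = list(reversed(children.get(root, [])))
    found = []
    while stack:
        node = stack.pop()
        found.append(node)
        stack.extend(reversed(children.get(node, [])))
    return found


# Made-up sample standing in for categories_children.csv.
sample = io.StringIO("category_id,child_id\n1,2\n1,3\n2,4\n")
tree = load_children(sample)
print(descendants(tree, 1))
```

Keeping the relationship in its own file means categories.csv stays one row per category, while the tree can be rebuilt on demand from the pairs.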

Performance

  • Without brands search:

    real    0m1,941s
    user    0m1,225s
    sys     0m0,037s
  • Brands validation process:

    real    14m14,211s
    user    0m19,753s
    sys     0m1,229s
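To read these figures, it helps to compare CPU time (user + sys) with wall-clock time (real). The sketch below parses the durations as printed above, including the comma decimal separator, and computes that share. The helper names `parse_time` and `cpu_share` are illustrative only.

```python
import re


def parse_time(value):
    """Convert a `time` duration such as '14m14,211s' to seconds."""
    match = re.fullmatch(r"(\d+)m(\d+)[.,](\d+)s", value)
    if match is None:
        raise ValueError(f"unrecognised duration: {value!r}")
    minutes, seconds, fraction = match.groups()
    return int(minutes) * 60 + float(f"{seconds}.{fraction}")


def cpu_share(real, user, sys):
    """Fraction of wall-clock time spent on the CPU (user + sys)."""
    return (parse_time(user) + parse_time(sys)) / parse_time(real)


fast = cpu_share("0m1,941s", "0m1,225s", "0m0,037s")
slow = cpu_share("14m14,211s", "0m19,753s", "0m1,229s")
print(f"without brands: {fast:.1%} CPU, brands validation: {slow:.1%} CPU")
```

The brands validation run spends only a small fraction of its wall-clock time on the CPU, which suggests that most of it is spent waiting (for example on network requests) rather than computing.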

Authors

Álvaro Cabo

Pepe Márquez
