Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Stale Cache system #834

Open
T0nd0Tara opened this issue Jan 13, 2025 · 0 comments
Open

Stale Cache system #834

T0nd0Tara opened this issue Jan 13, 2025 · 0 comments

Comments

@T0nd0Tara
Copy link

Hey I'm using colly for a personal project of mine, and I'd like to not spam the web page I'm scraping.
I saw there was a CacheDir function to colly, but what happens if the webpage changes? Maybe it has some new data.

How can I set a timeout that a cached page is stale after a day or so, and colly would automatically fetch the current one?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant