Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature request: A way to not save duplicate files #13

Open
Madeiner opened this issue Jan 5, 2025 · 0 comments
Open

Feature request: A way to not save duplicate files #13

Madeiner opened this issue Jan 5, 2025 · 0 comments

Comments

@Madeiner
Copy link

Madeiner commented Jan 5, 2025

Hello,

i'm starting to use this fantastic app to download and archive my huge amount of reddit posts that i saved.
The idea was to run this monthly or yearly, in order to keep the data in case one day reddit is lost.
I noticed by default old downloaded html of the same post are kept (which is good: people might delete content from the web or add to old threads). I wonder if there is an easy way to delete identical saved html.
Right now it's not easy since there is a timestamp and other data in the html content that changes each time, so a simple search, compare and delete does not work.
Is there any chance you could implement a check between downloads and if the content is identical, keep only one file? Thanks

@Madeiner Madeiner changed the title Any way to remove duplicate downloads Feature request: A way to not save duplicate files Jan 5, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant