## Slurp
### About
Takes an RSS feed (designed for use with Google Reader feeds such as "starred items") and downloads the first image found in each entry, generating a companion text file with the source URL and any text from the entry.
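
As a rough illustration of the per-entry step described above, the following sketch pulls the first image URL and the plain text out of an entry's HTML using only the Python 3 standard library. It is not taken from slurp.py: the names `EntryParser` and `parse_entry` are hypothetical, and the script itself may parse entries differently.

```python
from html.parser import HTMLParser


class EntryParser(HTMLParser):
    """Collects the first <img> src and the visible text of one entry."""

    def __init__(self):
        super().__init__()
        self.first_image = None
        self.text_parts = []

    def handle_starttag(self, tag, attrs):
        # Only the first image with a src is kept; later ones are ignored.
        if tag == "img" and self.first_image is None:
            self.first_image = dict(attrs).get("src")

    def handle_data(self, data):
        # Keep non-empty text fragments for the companion text file.
        if data.strip():
            self.text_parts.append(data.strip())


def parse_entry(entry_html):
    """Return (first_image_url_or_None, entry_text) for an entry's HTML."""
    parser = EntryParser()
    parser.feed(entry_html)
    parser.close()
    return parser.first_image, " ".join(parser.text_parts)
```

The returned text, together with the entry's source URL, is what would go into the companion text file.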
### Usage
python slurp.py
### Options
There are a few variables you can set near the top of the script:
+ "_filedir" sets the directory to save files in (it must already exist)
+ "userid" is the Google Reader user ID; you can get it from the URL on the Google Reader site
+ "retrieveNum" is the number of items to download from the Google Reader feed. The limit is applied by the feed, not by this script
+ "blacklist" is a string array. Put in the domains you want to ignore; each one is used in a simple substring check
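
A minimal sketch of how a substring-based check like the one described for "blacklist" might look. The function name `is_blacklisted` and the example domains are assumptions for illustration, not code from slurp.py.

```python
# Hypothetical example values; the real script defines its own list.
blacklist = ["ads.example.com", "tracker.example.net"]


def is_blacklisted(url, blacklist):
    """Return True if any blacklisted string occurs anywhere in the URL."""
    return any(domain in url for domain in blacklist)


for url in ("http://ads.example.com/pixel.gif",
            "http://photos.example.org/cat.jpg"):
    status = "skip" if is_blacklisted(url, blacklist) else "download"
    print(status, url)
```

Because this is a plain substring match, an entry such as "example.com" would also match "notexample.com".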
### Notes
+ Only takes the first image from each entry
+ Images without an extension in the URL are left "as-is", i.e. the downloaded file won't have one either
+ Would like to support a minimum-size option, to avoid downloading 1x1 images that are just for tracking, but this apparently requires installing PIL
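
The last two notes can be sketched as follows. This is an illustration under stated assumptions, not the script's code: the function names are hypothetical, and instead of PIL the size check reads the width and height directly from the file header, which only works for GIF and PNG (other formats are reported as unknown and kept).

```python
import os
import struct
from urllib.parse import urlparse


def filename_from_url(url):
    """Return the last path segment of the URL, with or without an extension."""
    return os.path.basename(urlparse(url).path)


def image_size(data):
    """Return (width, height) for GIF or PNG bytes, or None if unrecognised."""
    if data[:6] in (b"GIF87a", b"GIF89a") and len(data) >= 10:
        # GIF: little-endian 16-bit width and height follow the signature.
        return struct.unpack("<HH", data[6:10])
    if (data[:8] == b"\x89PNG\r\n\x1a\n" and data[12:16] == b"IHDR"
            and len(data) >= 24):
        # PNG: big-endian 32-bit width and height open the IHDR chunk.
        return struct.unpack(">II", data[16:24])
    return None


def is_too_small(data, min_width=2, min_height=2):
    """Return True for images below the minimum size, e.g. 1x1 trackers."""
    size = image_size(data)
    if size is None:
        return False  # unknown format: keep the image rather than guess
    width, height = size
    return width < min_width or height < min_height
```

With PIL (or Pillow) installed, the same check could cover every format that library can open, at the cost of the extra dependency.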