forked from Modified/MHTifier
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
1 changed file
with
13 additions
and
11 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,13 +1,15 @@ | ||
# MHTifier | ||
Un/packs an MHT (MHTML) archive into/from separate files, writing/reading them in directories to match their Content-Location. | ||
# MHTifier 2 | ||
Un/packs a MHT (MHTML) archive into/from separate files, writing/reading them | ||
in directories to match their Content-Location. | ||
|
||
Whole [story](http://decodecode.net/elitist/2013/01/mhtifier/) is in my devlog. | ||
A fork from [Modified/MHTifier](https://github.com/Modified/MHTifier). | ||
|
||
# Issues | ||
1. Cleanest would've been to use stdin/out, but turned out inconvenient, annoying even, so added command line options. | ||
2. Python's stdlib module's performance (premature optimization?): | ||
`email.message_from_bytes(mht.read()) # Parser is "conducive to incremental parsing of email messages, such as would be necessary when reading the text of an email message from a source that can block", so I guess it's more efficient to have it read stdin directly, rather than buffering.` | ||
3. Encodings (ascii, UTF-8) and de/coding was painful, and probably still buggy. | ||
4. base64 encoded binaries: my editor, Geany, suffocates, I think, when wrapping these long lines? | ||
1. Verify index.html is present!? | ||
1. A few un/Pythonisms, idioms,I guess. | ||
Under development, so no public API should be assumed. | ||
|
||
## Known Issues | ||
1. Cleanest would've been to use stdin/out, but turned out inconvenient, | ||
annoying even, so added command line options. | ||
2. Verify index.html is present!? | ||
3. A few un/Pythonisms, idioms,I guess. | ||
4. Rewrite whole program to provide stable public APIs. Preferably, keep it as | ||
a single file with no dependency other than Python 3 and its standard libraries. |