Releases · adbar/htmldate

11 Feb 16:21

adbar

v0.8.0

573e5a3

htmldate-0.8.0

dateparser and regex modules fully integrated
patterns added for coverage
smarter HTML doc loading

04 Jan 12:45

adbar

v0.7.3

25d89a7

htmldate-0.7.3

dependencies updated and reduced: switch from requests to bare urllib3, make chardet standard and cchardet optional
fixes: downloads, OverflowError in extraction
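
The "cchardet optional" change above follows a common optional-dependency pattern. The sketch below illustrates that pattern only and is not htmldate's actual code. The helper name `decode_bytes`, the alias `chardet_module`, and the fallback order are assumptions made for illustration.

```python
# Illustrative sketch only: not htmldate's actual code.
try:
    import cchardet as chardet_module  # optional, faster C implementation
except ImportError:
    try:
        import chardet as chardet_module  # pure-Python detector
    except ImportError:
        chardet_module = None  # neither installed: rely on the fallbacks below


def decode_bytes(data: bytes) -> str:
    """Decode raw bytes to text (hypothetical helper)."""
    # Try UTF-8 first: strict decoding raises on most other encodings.
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError:
        pass
    # Ask whichever detector was imported for a guess.
    if chardet_module is not None:
        guess = chardet_module.detect(data).get("encoding")
        if guess:
            try:
                return data.decode(guess)
            except (UnicodeDecodeError, LookupError):
                pass
    # Last resort: Latin-1 maps every byte to a character, so it cannot fail.
    return data.decode("latin-1")
```

Because both detectors expose a `detect()` function that returns a dictionary with an `encoding` key, the rest of the code does not need to know which one was imported.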

20 Oct 14:38

adbar

v0.7.2

6abc31f

htmldate-0.7.2

compatibility with Python 3.9
better speed and accuracy

14 Sep 14:53

adbar

v0.7.1

386cacf

htmldate-0.7.1

technical release: package requirements and docs wording

29 Jul 17:19

adbar

v0.7.0

1547051

htmldate-0.7.0

code base and performance improved
minimum date available as option
support for Turkish patterns and CMS idiosyncrasies (thanks @evolutionoftheuniverse)
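
A minimum date works as a lower bound that discards implausibly old candidates. The sketch below shows that idea under stated assumptions and is not htmldate's implementation. The function name `earliest_plausible`, the upper bound, and the ISO-formatted input strings are all hypothetical.

```python
from datetime import date
from typing import Iterable, Optional


# Illustrative sketch only: names and logic are hypothetical, not htmldate's code.
def earliest_plausible(
    candidates: Iterable[str], min_date: date, max_date: date
) -> Optional[str]:
    """Return the earliest YYYY-MM-DD candidate within [min_date, max_date]."""
    valid = []
    for text in candidates:
        try:
            parsed = date.fromisoformat(text)
        except ValueError:
            continue  # skip strings that are not valid ISO dates
        if min_date <= parsed <= max_date:
            valid.append(parsed)
    return min(valid).isoformat() if valid else None
```

For example, with bounds of 1995-01-01 and 2020-01-01, a stray `1970-01-01` (a typical Unix-epoch artifact) is rejected while `2016-12-23` is kept.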

26 May 10:34

adbar

v0.6.3

3c0a550

htmldate-0.6.3

more efficient code
additional evaluation data

29 Apr 10:55

adbar

v0.6.2

1d83a7e

htmldate-0.6.2

v0.6.2

roundup + version bump

17 Jan 12:37

adbar

v0.6.1

65532e5

htmldate-0.6.1

htmldate finds original and updated publication dates of any web page. All the steps needed from web page download to HTML parsing, scraping and text analysis are included.

In a nutshell, with Python:

>>> from htmldate import find_date
>>> find_date('http://blog.python.org/2016/12/python-360-is-now-available.html')
'2016-12-23'
>>> find_date('https://netzpolitik.org/2016/die-cider-connection-abmahnungen-gegen-nutzer-von-creative-commons-bildern/', original_date=True)
'2016-06-23'

On the command-line:

$ htmldate -u http://blog.python.org/2016/12/python-360-is-now-available.html
'2016-12-23'

Releases are used in production and are meant to be archived on Zenodo for reproducibility and citability.

For more information see htmldate.readthedocs.io

24 Sep 14:49

adbar

v0.5.6

4755ef0

First stable release for Zenodo

This is the first release used in production. It is meant to be archived on Zenodo for reproducibility and citability.
