Skip to content

Commit

Permalink
Merge branch 'master' of https://github.com/sotorrent/db-scripts
Browse files Browse the repository at this point in the history
# Conflicts:
#	sotorrent/gh-references/retrieve-gh-references.sh
  • Loading branch information
sbaltes committed Nov 9, 2020
2 parents e65953e + b3fbc60 commit aabdce7
Showing 1 changed file with 7 additions and 2 deletions.
9 changes: 7 additions & 2 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,12 +5,17 @@ All notable changes to the SOTorrent dataset project will be documented in this

## [Upcoming]

* Update import script to use new SQL dumps
* Extract language information from Stack Snippets and link individual snippets to their predecessors
* Update database schema on website
* Add historical user reputation
* Remove foreign key constraints, switch to SQLite, make it possible to only party import SOTorrent
* Replace XML by CSV files
* Automate import of tables `PostTags` and `PostViews`

## [2020-08-31] - First release based on SO data dump 2020-06-02
* Update escaping of newline characters (related to [this](https://github.com/sotorrent/db-scripts/issues/19) issue )
* Now using MySQL dumps, newline characters are not espaced anymore in the BigQuery version of the dataset
* This also fixes a bug in the export script (for tables `PostVersionUrl` and `CommentUrl`, column `LinkAnchor` was identical to column `FullMatch`)
* Fix bug in creation of table `Threads` (now using correct dataset version)

## [2020-03-15] - First release based on SO data dump 2020-03-02

Expand Down

0 comments on commit aabdce7

Please sign in to comment.