Skip to content

jeffreyhorner/Wikipedia

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Wikipedia Data Scrape

Tools to create data sets that mimic the SKEW and DISTINCT files from:

Askitis, Nikolas, and Justin Zobel. "Redesigning the string hash table, burst trie, and bst to exploit cache." Journal of Experimental Algorithmics (JEA) 15 (2010): 1-7.

Configure and Build

  1. Execute make all
  2. Execute R --vanilla < create_data_sets.R

Data and code are licenced under the Creative Commons Attribution-Share-Alike License 3.0.

About

Tools to collect Wikipedia string data.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published