Skip to content

ambanum/TOSBack-CGUs-bridge

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

97 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TOSBack CGUs bridge

Usage:

  • get a server with Ubuntu 20.04, 16Gb memory, and 8 CPUs.
  • ssh into it
  • Run tmux new so that you can disconnect and come back later with tmux ls ; tmux attach
  • Run:
curl -o- https://raw.githubusercontent.com/ambanum/TOSBack-CGUs-bridge/master/prepare.sh | bash
cd TOSBack-CGUs-bridge
sh ./prepare2.sh
export DATABASE_URL=...
sh ./run.sh
  • You can now disconnect from your tmux session and come back several hours later (note to self: started 15:30)
  • Check out the import-123456789 and rebased-123456789 branches for:

Report

This script was run once, in October 2020. The result is here: #2 (comment)

A report of what we were able to import from each of the 1711 tosback2 crawl files was generated using report-21.js, the result is in report-21.txt.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published