TessScraper

This is a prototype Python script that scrapes HTML from websites and uploads the content to a CKAN installation.

Installation

git clone git@github.com:ElixirUK/TessScraper.git
cd TessScraper
sudo pip install -r requirements.txt
cp example_uploader_config.txt uploader_config.txt

You will need to edit the uploader configuration file, which is called 'uploader_config.txt' by default and must reside in the root directory of TessScraper. An example file, 'example_uploader_config.txt', is provided as a starting point.

On your TeSS instance, locate the API key on your user account page and copy it into the configuration file. The entry should look like this:

auth = 2204e6c5-d011-4aec-8005-5b1243159aed
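Before running a scraper it can help to confirm that the key was actually pasted into the file. The following is a minimal sketch in Python 3, not part of TessScraper: the helper names parse_config and looks_like_api_key are hypothetical, the parser only assumes plain 'key = value' lines like the one above, and the UUID check assumes that real keys have the same shape as the example, which may not hold for every TeSS deployment.

```python
import uuid


def parse_config(text):
    """Collect 'key = value' lines into a dict.

    Blank lines, comment lines (# or ;) and section headers ([...])
    are skipped, since the exact layout of the file is an assumption.
    """
    settings = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith(("#", ";", "[")):
            continue
        key, sep, value = line.partition("=")
        if sep:
            settings[key.strip()] = value.strip()
    return settings


def looks_like_api_key(value):
    """Return True if value parses as a UUID, like the example key."""
    try:
        uuid.UUID(value)
    except (ValueError, AttributeError, TypeError):
        return False
    return True
```

For example, reading 'uploader_config.txt', passing its text to parse_config, and checking looks_like_api_key(settings.get("auth")) would flag a missing or placeholder key before any upload is attempted.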

If you are not using the https://tess.oerc.ox.ac.uk deployment of TeSS, be sure to set the correct URLs and protocols for your deployment in the configuration file.
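A common mistake when pointing the uploader at another deployment is leaving out the protocol. The sketch below (Python 3) checks a URL string before it goes into the configuration file. The function name check_base_url is hypothetical, and the sketch makes no assumption about which configuration keys hold the URL.

```python
from urllib.parse import urlsplit


def check_base_url(url):
    """Return 'scheme://host[:port]' or raise ValueError if the URL is incomplete."""
    parts = urlsplit(url.strip())
    if parts.scheme not in ("http", "https"):
        raise ValueError("URL must start with http:// or https://: %r" % url)
    if not parts.netloc:
        raise ValueError("URL has no host name: %r" % url)
    # Drop any path, query or fragment so only the deployment root remains.
    return "%s://%s" % (parts.scheme, parts.netloc)
```

With this, check_base_url("https://tess.oerc.ox.ac.uk/") gives the deployment root, while a bare host name such as "tess.oerc.ox.ac.uk" is rejected.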

Usage

python goblet_scraper.py
python soc_scraper.py
python genome3d_scraper.py
python ebi_scraper.py
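Each scraper is run as a separate command. To run all four in sequence, a small wrapper along these lines could be used. This is a sketch under assumptions, not a script shipped with TessScraper: the names SCRAPERS, build_commands and run_all are hypothetical, the interpreter is assumed to be available as 'python' as in the commands above, and the wrapper is assumed to be started from the TessScraper root directory.

```python
import subprocess

# The four scripts listed under Usage, in the same order.
SCRAPERS = [
    "goblet_scraper.py",
    "soc_scraper.py",
    "genome3d_scraper.py",
    "ebi_scraper.py",
]


def build_commands(scripts=SCRAPERS, interpreter="python"):
    """Return one argument list per script, e.g. ['python', 'ebi_scraper.py']."""
    return [[interpreter, script] for script in scripts]


def run_all(commands, runner=subprocess.call):
    """Run each command in order and return {script_name: exit_code}.

    A failing scraper does not stop the remaining ones; inspect the
    returned exit codes to see which runs need attention.
    """
    results = {}
    for command in commands:
        results[command[-1]] = runner(command)
    return results
```

Calling run_all(build_commands()) would then start the scrapers one after another, and any non-zero value in the result points at a run that failed.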
