This is an example Frontera project, combining Scrapy's quotes tutorial with Frontera's cluster-setup
example (links: https://doc.scrapy.org/en/latest/intro/tutorial.html
and http://frontera.readthedocs.io/en/v0.7.1/topics/cluster-setup.html).
First, open a handful of terminal windows; seven should be enough (tmux is great for this).
Run the following commands to start HBase, Kafka, and ZooKeeper, and to create the HBase namespace used by the crawler:
docker-compose -f docker-compose-backend.yml up -d
echo "waiting for hbase startup (30 seconds)"
sleep 30
echo "create_namespace 'crawler'" | docker exec -i docker_hbase /hbase/bin/hbase shell
In a separate terminal window, start the DB worker that only generates new batches (--no-incoming disables spider log processing):

python -m frontera.worker.db --config frontier.dbworker --no-incoming

In another terminal window, start the DB worker that only processes incoming messages (--no-batches disables batch generation):

python -m frontera.worker.db --no-batches --config frontier.dbworker
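A sketch of what frontier/dbworker.py might contain, assuming it builds on the shared settings above; the values for MAX_NEXT_REQUESTS and NEW_BATCH_DELAY are illustrative, not the project's actual configuration:

# frontier/dbworker.py -- DB worker settings (a sketch)
from frontier.common import *  # noqa: F401,F403 -- reuse the shared settings above

MAX_NEXT_REQUESTS = 512   # upper bound on requests per generated batch
NEW_BATCH_DELAY = 30.0    # seconds to wait before generating a new batch per partition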
In a separate terminal window, start the first strategy worker:

python -m frontera.worker.strategy --config frontier.sworker --partition-id 0

In another terminal window, start the second strategy worker:

python -m frontera.worker.strategy --config frontier.sworker --partition-id 1
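Similarly, a sketch of frontier/sworker.py, again assuming it builds on the shared settings; the strategy class path below points at Frontera's built-in breadth-first strategy, but verify the exact path for your Frontera version:

# frontier/sworker.py -- strategy worker settings (a sketch)
from frontier.common import *  # noqa: F401,F403

# built-in BFS strategy shipped with Frontera; path is an assumption to verify
CRAWLING_STRATEGY = 'frontera.worker.strategies.bfs.CrawlingStrategy'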
Finally, in two more terminal windows, start one spider per partition:
scrapy crawl quotes -L INFO -s SPIDER_PARTITION_ID=0
scrapy crawl quotes -L INFO -s SPIDER_PARTITION_ID=1
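For reference, the spider itself can stay very close to the Scrapy tutorial version linked above. A sketch (selectors taken from the tutorial; in a Frontera setup the scheduler, not the spider, decides which discovered requests actually get crawled, and the project's Scrapy settings are assumed to plug in Frontera's scheduler and middlewares as described in the cluster-setup guide):

import scrapy


class QuotesSpider(scrapy.Spider):
    name = 'quotes'
    start_urls = ['http://quotes.toscrape.com/']

    def parse(self, response):
        # extract one item per quote block
        for quote in response.css('div.quote'):
            yield {
                'text': quote.css('span.text::text').extract_first(),
                'author': quote.css('small.author::text').extract_first(),
            }
        # discovered links are handed to the Frontera scheduler via the spider log
        for href in response.css('li.next a::attr(href)').extract():
            yield response.follow(href, callback=self.parse)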