WG Gesucht Crawler

Python web crawler / scraper for WG-Gesucht. Crawls the WG-Gesucht site for new apartment listings and send a message to the poster, based off your saved filters and saved text template.

Installation

$ pip install wg-gesucht-crawler-cli

Or, if you have virtualenvwrapper installed:

$ mkvirtualenv wg-gesucht-crawler-cli
$ pip install wg-gesucht-crawler-cli

Use

Can be run directly from the command line with:

$ wg-gesucht-crawler-cli --help

Or if you want to use it in your own project:

from wg_gesucht.crawler import WgGesuchtCrawler

Just make sure to save at least one search filter as well as a template text on your wg-gesucht account.

Free software: MIT license
Documentation: https://wg-gesucht-crawler-cli.readthedocs.org.

Features

Searches https://wg-gesucht.de for new WG ads based off your saved filters
Sends your saved template message and applies to all matching listings
Reruns every ~5 minutes
Run on a RPi or free EC2 micro instance 24/7 to always be one of the first to apply for new listings

Getting Caught with reCAPTCHA

I've made the crawler sleep for 5-8 seconds between each request to try and avoid their reCAPTCHA, but if the crawler does get caught, you can sign into your wg-gesucht account manually through the browser and solve the reCAPTCHA, then start the crawler again. If it continues to happen, you can also increase the sleep time in the get_page() function in wg_gesucht.py

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
docs		docs
tests		tests
wg_gesucht		wg_gesucht
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.travis.yml		.travis.yml
AUTHORS.rst		AUTHORS.rst
CONTRIBUTING.rst		CONTRIBUTING.rst
HISTORY.rst		HISTORY.rst
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.rst		README.rst
requirements-in.txt		requirements-in.txt
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py
tox.ini		tox.ini
versioneer.py		versioneer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WG Gesucht Crawler

Installation

Use

Features

About

Releases

Packages

Languages

License

cami0/wg-gesucht-crawler-cli

Folders and files

Latest commit

History

Repository files navigation

WG Gesucht Crawler

Installation

Use

Features

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages