Skip to content

Building a po' boy's spatial data pipeline from scratch with Dagster, geoparquet, & R2. (This is an example)

Notifications You must be signed in to change notification settings

alexlowellmartin/poboy-pipeline-example

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 

Repository files navigation

DAG preview image

poboy-pipeline-example

Overview

Here I build a simple geospatial data pipeline to fetch data from ArcGIS Feature Servers and write / read it to GeoParquet in Cloudflare R2. (This is an example - please read my write-up for a thorough walk through of my rationale and development process.)

External links

Getting started

Visit the Dagster Getting started page. Dagster supports Python 3.8 through 3.12. Ensure you have one of the supported Python versions installed before proceeding.

First, Clone this Dagster repository.

git clone https://github.com/alexlowellmartin/poboy-pipeline-example && cd poboy-pipeline-example

Second, install your Dagster code location as a Python package. By using the --editable flag, pip will install your Python package in "editable mode" so that as you develop, local code changes will automatically apply.

pip install -e ".[dev]"

Then, start the Dagster UI web server:

dagster dev

Open http://localhost:3000 with your browser to see the project.

You can start writing assets in poboy_pipeline_example/assets.py. The assets are automatically loaded into the Dagster code location as you define them.

About

Building a po' boy's spatial data pipeline from scratch with Dagster, geoparquet, & R2. (This is an example)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages