Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OONI Data, OONI Pipeline split #60

Merged
merged 25 commits into from
Apr 15, 2024
Merged

OONI Data, OONI Pipeline split #60

merged 25 commits into from
Apr 15, 2024

Conversation

hellais
Copy link
Member

@hellais hellais commented Mar 17, 2024

This PR introduces a major refactoring of oonidata into two separate packages:

  • oonidata, which is the end user pip installable package to download and parse measurements (should have minimal dependencies and not require additional components to run, eg. clickhouse)
  • oonipipeline, which is the thing that actually performs the analysis and processing of data. This does require external dependencies to run.

Eventually we might want to move them into their own respective repos.

This fixes the following issues:

@hellais
Copy link
Member Author

hellais commented Mar 17, 2024

Note: you might see many removed lines and wonder like me what caused that. The cause for that is we dropped poetry in place of hatch and remove the poetry.lock which was 3.5k lines.

@hellais hellais merged commit 546f40e into main Apr 15, 2024
6 checks passed
@hellais hellais deleted the ooni-datapipeline branch April 15, 2024 11:19
hellais added a commit that referenced this pull request Aug 30, 2024
* v5.0.0-rc.0:
  Add simple redirector
  Tidy up the layout of the analysis viewer
  Add an observations viewer
  Get rid of all the dataviz that isn't the analysis visualizer
  Release/5.0.0 alpha3 (#81)
  Offset analysis schedule by 6 hours
  Add support for temporal cloud (#79)
  fix: support for sorting network_events using transaction_id (#51)
  Optimize performance of table writers and refactor table model (#74)
  Improvements related to deployment (#69)
  Add .codecov file
  Update jsonl sync example
  Setup workflow to publish ooni data docs (#73)
  OONI Pipeline v5 alpha (#64)
  Fix codecov (#62)
  Temporal workflows (#61)
  OONI Data, OONI Pipeline split (#60)
  Add support for caching netinfodb
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant