Tweak the join algorithm to avoid OOM issues
xref: #105
hellais committed Jan 24, 2025
1 parent 409bec8 commit 8cdf9bd
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion oonipipeline/src/oonipipeline/analysis/web_analysis.py
@@ -504,6 +504,7 @@ def format_query_analysis_web_fuzzy_logic(
GROUP BY hostname, measurement_day
) as other
ON ctrl.hostname = other.hostname AND ctrl.measurement_day = other.measurement_day
+SETTINGS join_algorithm = 'grace_hash', grace_hash_join_initial_buckets = 8
) as full_ctrl
ON full_ctrl.hostname = experiment.hostname AND full_ctrl.measurement_day = experiment.measurement_day
GROUP BY domain,
@@ -514,7 +515,7 @@ def format_query_analysis_web_fuzzy_logic(
measurement_start_time,
measurement_uid,
ooni_run_link_id
-SETTINGS join_algorithm = 'grace_hash'
+SETTINGS join_algorithm = 'grace_hash', grace_hash_join_initial_buckets = 8
"""
return SQL, q_params
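
For readers unfamiliar with the setting, here is a minimal Python sketch of the general idea behind a grace hash join, and of why starting with more buckets can lower peak memory. It is an illustration only, not ClickHouse's implementation: the function name `grace_hash_join`, the `num_buckets` parameter, and the sample rows are hypothetical, and everything stays in memory here, whereas a real engine would typically spill the buckets it is not currently working on to disk.

```python
from collections import defaultdict


def grace_hash_join(left, right, keys, num_buckets=8):
    """Inner-join two lists of dicts on `keys`, one bucket pair at a time.

    Hypothetical helper for illustration; `num_buckets` plays the role that
    grace_hash_join_initial_buckets plays in the query above.
    """

    def key_of(row):
        return tuple(row[k] for k in keys)

    # Phase 1: partition both inputs by a hash of the join key, so rows
    # that can match each other always land in the same bucket number.
    left_buckets = defaultdict(list)
    right_buckets = defaultdict(list)
    for row in left:
        left_buckets[hash(key_of(row)) % num_buckets].append(row)
    for row in right:
        right_buckets[hash(key_of(row)) % num_buckets].append(row)

    # Phase 2: join each bucket pair independently. Only one bucket's hash
    # table exists at a time, which is what bounds peak memory.
    joined = []
    for bucket_id, left_rows in left_buckets.items():
        table = defaultdict(list)
        for row in right_buckets.get(bucket_id, []):
            table[key_of(row)].append(row)
        for left_row in left_rows:
            for right_row in table.get(key_of(left_row), []):
                joined.append({**right_row, **left_row})
    return joined


# Made-up rows that mirror the join keys used in the query.
ctrl = [
    {"hostname": "a.org", "measurement_day": "2025-01-01", "ctrl_ok": 1},
    {"hostname": "b.org", "measurement_day": "2025-01-01", "ctrl_ok": 0},
]
experiment = [
    {"hostname": "a.org", "measurement_day": "2025-01-01", "uid": "m1"},
    {"hostname": "a.org", "measurement_day": "2025-01-02", "uid": "m2"},
    {"hostname": "b.org", "measurement_day": "2025-01-01", "uid": "m3"},
]
result = grace_hash_join(experiment, ctrl, keys=("hostname", "measurement_day"))
```

With more buckets, each per-bucket hash table holds a smaller share of the rows, at the cost of more partitions to manage. As the setting name suggests, 8 is only the initial bucket count for the query.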
