Support safe concurrent dbt runs #420

Open · BentsiLeviav opened this issue Feb 13, 2025 · 2 comments
Labels
bug Something isn't working

Comments

@BentsiLeviav (Contributor)

Description

There have been multiple issues related to concurrent dbt runs in ClickHouse, where one run attempts to query an intermediate/temp table that has already been dropped by a previous run.

Several PRs have attempted to fix this by appending invocation_id or UUIDs to table names or by implementing conditional drops based on specific criteria. However, each change was applied to different parts of the codebase based on individual use cases, resulting in an inconsistent and partial solution.

Proposed Solution

Introduce dedicated macros, clickhouse__make_intermediate_relation and clickhouse__make_temp_relation, that would:

  1. Include a prefix parameter with a default value using invocation_id, ensuring unique table names for each dbt run.
  2. Introduce a feature flag (disabled by default) that allows users to opt into this behavior. When enabled, invocation_id will be automatically appended to intermediate and temp table names.
  3. Standardize the handling of temp and intermediate relations across the dbt-clickhouse adapter, avoiding ad-hoc solutions in different parts of the code.
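
For concreteness, a minimal sketch of what the first macro might look like (the flag name `enable_unique_tmp_names` and the exact relation construction are assumptions for illustration, not the adapter's current API):

```jinja
{% macro clickhouse__make_intermediate_relation(base_relation, suffix='__dbt_tmp') %}
  {# Hypothetical sketch: the feature flag below is an assumed name, disabled by default #}
  {% if var('enable_unique_tmp_names', false) %}
    {# invocation_id is dbt's per-run UUID; replace dashes so the identifier stays valid #}
    {% set suffix = suffix ~ '_' ~ (invocation_id | replace('-', '_')) %}
  {% endif %}
  {{ return(base_relation.incorporate(path={'identifier': base_relation.identifier ~ suffix})) }}
{% endmacro %}
```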

✅ Benefits

  • Prevents table name collisions in parallel dbt runs.
  • Ensures a consistent approach to naming intermediate and temp tables.
  • Provides an opt-in feature flag for backward compatibility.
  • Reduces the need for future patches addressing the same issue in different ways.

❌ Potential Downsides

  • Increased risk of hanging tables:
    • Currently, if a temp/intermediate table is not dropped, it is often cleaned up in the next run. With this change, the risk of orphaned tables may increase.
    • Potential solution: Implement a cleanup mechanism that drops stale intermediate/temp tables based on their creation time or last usage.
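
A cleanup mechanism could be a run-operation macro along these lines (a sketch only; the macro name, LIKE pattern, and age threshold are all illustrative assumptions):

```jinja
{% macro drop_stale_dbt_tmp_tables(max_age_hours=24) %}
  {# Hypothetical cleanup: find temp tables whose metadata is older than the threshold #}
  {# Note: '_' is a LIKE wildcard; a real implementation should escape it #}
  {% set stale = run_query(
      "SELECT database, name FROM system.tables"
      ~ " WHERE name LIKE '%__dbt_tmp%'"
      ~ " AND metadata_modification_time < now() - INTERVAL " ~ max_age_hours ~ " HOUR"
  ) %}
  {% for row in stale.rows %}
    {% do run_query("DROP TABLE IF EXISTS " ~ row['database'] ~ "." ~ row['name']) %}
  {% endfor %}
{% endmacro %}
```

This could be invoked out of band, e.g. via `dbt run-operation drop_stale_dbt_tmp_tables`.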

This is not a heavy lift, but I would love to get the community's opinion on it.

@BentsiLeviav (Contributor, Author)

Related PRs: #373, #365, #353

@stephen-up commented Feb 13, 2025

Hi, I think this is a good idea and it would help my situation. Two things to think about, and one outstanding problem.

To aid cleanup of hanging tables, the temporary tables could use timestamps rather than invocation_id; it could be the dbt run's start time. That way a cleanup process can more easily scan for table names matching the pattern where the timestamp is older than one day, etc.
E.g. my_table_tmp_1739484798591, with an epoch on the end.
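
In dbt Jinja that suffix could be derived from run_started_at (a real dbt context variable); the identifier scheme itself is just an illustration:

```jinja
{# Hypothetical: build the temp name from the run's start time in epoch milliseconds #}
{% set epoch_ms = (run_started_at.timestamp() * 1000) | int %}
{% set tmp_identifier = base_relation.identifier ~ '_tmp_' ~ epoch_ms %}
{# => e.g. my_table_tmp_1739484798591 #}
```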

Re naming with prefixes: I'm not sure exactly what you mean, but suffixes might be better than prefixes.
Prefixes like the ones below don't sort as well as suffixes. Also, the adapter is already using suffixes for __dbt_tmp tables.

  • ab2344_my_first_table
  • cd12345_my_second_table
  • xy142_my_first_table

vs

  • my_first_table_ab2344
  • my_first_table_xy142
  • my_second_table_cd12345

Out of order runs
Out-of-order model runs and their completions could be a problem.
Consider a race condition where there are two attempts to refresh the same table with different versions of code. If the first run takes longer to complete than the second, then when the first run completes it will run EXCHANGE TABLES between its temporary table and the final table, rolling back the second run's change.
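
To make the interleaving concrete, here is a hypothetical timeline (EXCHANGE TABLES is ClickHouse's atomic table swap; the temp-table names are illustrative):

```sql
-- t0: run A builds my_table__dbt_tmp_A  (old code)
-- t1: run B builds my_table__dbt_tmp_B  (new code)
-- t2: run B: EXCHANGE TABLES my_table__dbt_tmp_B AND my_table;  -- new code goes live
-- t3: run A: EXCHANGE TABLES my_table__dbt_tmp_A AND my_table;  -- reverts to old code
```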

Currently, these concurrent refreshes would each try to delete the __dbt_tmp table. The second run would kill the first run's attempt, and the first run would get an error about the table it's writing to no longer existing. So by trying to support concurrent runs we might be making the impact of race conditions worse.

I don't have a great solution for this that fits well with existing components, and I don't think solving it needs to be a blocker. But it is worth considering as a downside.
