[SUPPORT] use flink1.16 write hudi0.15.0 MOR table, have same primary key data #12903

screamedddd · 2025-03-03T14:53:51Z

Data like this :
_hoodie_record_key _hoodie_partition_path _hoodie_file_name
835735 20250101 0001.parquet
835735 20250201 0002.parquet

hudi table had compaction, same primary key can query(use snap-shot or read-optimize) many diff partition data.
use MOR table how can I achieve the same primary key, even if the data is in different partitions, only one piece of data can be queried?

Expected behavior

A clear and concise description of what you expected to happen.

Environment Description

Hudi version : 0.15.0
Spark version : 3.3.2
Hive version : 2.1.1
Hadoop version : 3.0.0
Storage (HDFS/S3/GCS..) : HDFS
Running on Docker? (yes/no) : no

Additional context

Add any other context about the problem here.

Stacktrace

Add the stacktrace of the error.

The text was updated successfully, but these errors were encountered:

danny0405 · 2025-03-03T23:30:34Z

what kind of index did you use then?

screamedddd · 2025-03-04T00:36:37Z

what kind of index did you use then?

bucket index

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SUPPORT] use flink1.16 write hudi0.15.0 MOR table, have same primary key data #12903

[SUPPORT] use flink1.16 write hudi0.15.0 MOR table, have same primary key data #12903

screamedddd commented Mar 3, 2025

danny0405 commented Mar 3, 2025

screamedddd commented Mar 4, 2025

[SUPPORT] use flink1.16 write hudi0.15.0 MOR table, have same primary key data #12903

[SUPPORT] use flink1.16 write hudi0.15.0 MOR table, have same primary key data #12903

Comments

screamedddd commented Mar 3, 2025

danny0405 commented Mar 3, 2025

screamedddd commented Mar 4, 2025