Replacing get_shape() #16786

jbedichekTT · 2025-01-15T22:00:00Z

Ticket

None

Problem description

Replacing the legacy Tensor::get_shape() with new Tensor::get_padded_shape and Tensor::get_logical_shape from Moreh folder.

What's changed

Remove usage of Tensor::get_shape() in more places, replacing with Tensor::get_logical_shape() or Tensor::get_padded_shape()
Shift from copying a value, to getting a const reference to avoid any copies

Checklist

Post commit CI passes
Device performance regression CI testing passes (if applicable)
New/Existing tests provide coverage for changes

ttnn/cpp/ttnn/operations/moreh/moreh_cumsum/device/moreh_cumsum_device_operation.cpp

ttnn/cpp/ttnn/operations/moreh/moreh_dot_backward/device/moreh_dot_backward_program_factory.cpp

ttnn/cpp/ttnn/operations/moreh/moreh_getitem/device/moreh_getitem_device_operation.cpp

ayerofieiev-tt · 2025-01-16T19:34:02Z

ttnn/cpp/ttnn/operations/moreh/moreh_cumsum/device/moreh_cumsum_device_operation.cpp

    for (int i = 0; i < input_shape.rank(); ++i) {
        TT_FATAL(
            input_shape[i] == output_shape[i],
            "Input shape must match output shape. Received input_shape = {} and output_shape = {}.",
            input_shape[i],
            output_shape[i]);
-        TT_FATAL(


@razorback3 can someone from the team please check?

ayerofieiev-tt · 2025-01-16T19:34:49Z

ttnn/cpp/ttnn/operations/moreh/moreh_cumsum/device/moreh_cumsum_device_operation.cpp

+    // Unnecessary? ->
+    //const auto input_shape_wo_padding = input.get_logical_shape();
+    //const auto output_shape_wo_padding = input.get_logical_shape();


My recommendation is to remove things you believe are not needed, instead of commenting them
You can then ping related parties in the PR to clarify with them.

ayerofieiev-tt · 2025-01-16T19:36:11Z

ttnn/cpp/ttnn/operations/moreh/moreh_getitem/device/moreh_getitem_device_operation.cpp

+        uint32_t index_size = index.get_logical_shape()[-1];
+        uint32_t index_size_without_padding = index.get_logical_shape()[-1];


@razorback3 , both values are without a padding here. I think someone from Moreh team will have to take a look

ayerofieiev-tt · 2025-01-16T19:37:24Z

ttnn/cpp/ttnn/operations/moreh/moreh_getitem/device/moreh_getitem_rm_factory.cpp

    }

-    uint32_t index_size = index_tensors.front().get_shape().value[-1];
+    const uint32_t& index_size = index_tensors.front().get_logical_shape()[-1];


get_shape().value is returns a LegacyShape. its operator[] will give padded value.
So the proper replacement here is get_padded_shape()

ayerofieiev-tt · 2025-01-16T19:38:23Z

ttnn/cpp/ttnn/operations/moreh/moreh_getitem/device/moreh_getitem_tilized_factory.cpp

+    const auto& input_shape = input.get_padded_shape();
+    const auto& input_shape_without_padding = input.get_logical_shape()


@razorback3 , another place where there seem to be an issue. get_shape and input_shape.value.without_padding() are basically returning the same thing. One can probably be removed.

ayerofieiev-tt · 2025-01-16T19:39:21Z

ttnn/cpp/ttnn/operations/moreh/moreh_getitem/device/moreh_getitem_tilized_factory.cpp

-    auto input_shape_without_padding = input_shape.value.without_padding();
-    auto output_shape = output.get_shape();
-    auto output_shape_without_padding = output_shape.value.without_padding();
+    const auto& input_shape = input.get_padded_shape();


@jbedichekTT input_shape variable here seem to have had a semantics of get_logical_shape()

auto input_shape = input.get_shape();

ayerofieiev-tt · 2025-01-16T19:39:55Z

ttnn/cpp/ttnn/operations/moreh/moreh_group_norm/device/moreh_group_norm_device_operation.cpp

@@ -31,34 +31,37 @@ void MorehGroupNormOperation::validate_tensors(
    check_tensor(beta, "moreh_group_norm", "beta");

    // input (N, C, H, W)
-    auto C = input.get_shape().value[1];
+    auto C = input.get_logical_shape()[1];


get_padded_shape()

ayerofieiev-tt · 2025-01-16T19:40:00Z

ttnn/cpp/ttnn/operations/moreh/moreh_group_norm/device/moreh_group_norm_device_operation.cpp

    TT_FATAL(C % num_groups == 0, "input_shape[1] must be divisible by num_groups.");
    // output (N, C, H, W)
    if (output.has_value()) {
-        C = output.value().get_shape().value[1];
+        C = output.value().get_logical_shape()[1];


get_padded_shape()

ayerofieiev-tt · 2025-01-16T19:40:37Z

ttnn/cpp/ttnn/operations/moreh/moreh_group_norm/device/moreh_group_norm_device_operation.cpp

        TT_FATAL(C % num_groups == 0, "beta_shape[-1] must be divisible by num_groups.");
    }

    // mean (1, 1, N, num_groups)
    if (mean.has_value()) {
        TT_FATAL(
-            mean.value().get_shape().value.without_padding()[-1] == num_groups,
+            //"mean_shape[-1] must match num_groups.");
+            mean.value().get_logical_shape()[-1] == num_groups,


mean->get_logical_shape()[-1]

ayerofieiev-tt · 2025-01-16T19:40:41Z

ttnn/cpp/ttnn/operations/moreh/moreh_group_norm/device/moreh_group_norm_device_operation.cpp

        TT_FATAL(C % num_groups == 0, "beta_shape[-1] must be divisible by num_groups.");
    }

    // mean (1, 1, N, num_groups)
    if (mean.has_value()) {
        TT_FATAL(
-            mean.value().get_shape().value.without_padding()[-1] == num_groups,
+            //"mean_shape[-1] must match num_groups.");


ayerofieiev-tt · 2025-01-16T19:41:38Z

...up_norm_backward/device/input_grad/moreh_group_norm_backward_input_grad_device_operation.cpp

    TT_FATAL(C % num_groups == 0, "input_shape[1] must be divisible by num_groups.");
    // input_grad (N, C, H, W)
    if (input_grad.has_value()) {
-        C = input_grad.value().get_shape().value[1];
+        C = input_grad.value().get_logical_shape()[1];


get_padded_value

ayerofieiev-tt · 2025-01-16T19:42:40Z

ttnn/cpp/ttnn/operations/moreh/moreh_layer_norm/device/moreh_layer_norm_program_factory.cpp

+    const auto& input_shape = input.get_padded_shape();
+    const auto& input_shape_without_padding = input.get_logical_shape();


this is flipped. I recommend to rename/refactor variable names.
input_shape --> input_shape_padded
input_shape_without_padding --> input_shape

ayerofieiev-tt · 2025-01-16T19:43:05Z

...reh_layer_norm_backward/device/moreh_layer_norm_backward_gamma_beta_grad_program_factory.cpp

+    const auto& mean_rstd_shape = mean.get_padded_shape();
+    const auto mean_rstd_shape_without_padding = mean.get_logical_shape();


name mismatch

ayerofieiev-tt · 2025-01-16T19:43:22Z

...eh/moreh_layer_norm_backward/device/moreh_layer_norm_backward_input_grad_program_factory.cpp

-    const auto mean_rstd_shape = mean.get_shape().value;
-    const auto mean_rstd_shape_without_padding = mean_rstd_shape.without_padding();
+
+    const auto& mean_rstd_shape_without_padding = mean.get_logical_shape();


name mismatch

ayerofieiev-tt · 2025-01-16T19:43:50Z

ttnn/cpp/ttnn/operations/moreh/moreh_matmul/device/moreh_matmul_device_operation.cpp

+    const auto& input_wo_shape = input.get_logical_shape();
+    const auto& other_wo_shape = other.get_logical_shape();


should just be input_shape and other_shape

ayerofieiev-tt · 2025-01-16T19:44:19Z

ttnn/cpp/ttnn/operations/moreh/moreh_matmul/device/moreh_matmul_device_operation.cpp

+    const auto& input_shape = tensor_args.input.get_padded_shape();
+    const auto& other_shape = tensor_args.other.get_padded_shape();


should be input_shape_padded and other_shape_padded

ayerofieiev-tt · 2025-01-16T19:44:27Z

ttnn/cpp/ttnn/operations/moreh/moreh_matmul/device/moreh_matmul_device_operation.cpp

+    const auto& input_shape_wo_padding = tensor_args.input.get_logical_shape();
+    const auto& other_shape_wo_padding = tensor_args.other.get_logical_shape();


Suggested change

const auto& input_shape_wo_padding = tensor_args.input.get_logical_shape();

const auto& other_shape_wo_padding = tensor_args.other.get_logical_shape();

const auto& input_shape = tensor_args.input.get_logical_shape();

const auto& other_shape = tensor_args.other.get_logical_shape();

ayerofieiev-tt · 2025-01-16T19:44:53Z

ttnn/cpp/ttnn/operations/moreh/moreh_matmul/device/moreh_matmul_program_factory.cpp

+    const auto& a_shape = tensor_a.get_logical_shape();
+    const auto& b_shape = tensor_b.get_logical_shape();


get_padded_shape()

ayerofieiev-tt · 2025-01-16T19:45:01Z

ttnn/cpp/ttnn/operations/moreh/moreh_matmul/device/moreh_matmul_program_factory.cpp

+    const auto& input_shape = input.get_padded_shape();
+    const auto& input_shape_wo_padding = input.get_logical_shape();


name mismatch

ayerofieiev-tt · 2025-01-16T19:46:38Z

...cpp/ttnn/operations/moreh/moreh_mean_backward/device/moreh_mean_backward_program_factory.cpp

    auto rank = shape.rank();
+    // Replace? ->
    auto padding = shape.value.padding();


this likely does not compile.
Please revert this change. Someone will later make a pass here.

CC @sminakov-tt

ayerofieiev-tt · 2025-01-16T19:47:20Z

ttnn/cpp/ttnn/operations/moreh/moreh_sum_backward/device/moreh_sum_backward_program_factory.cpp

-    const auto& input_grad_shape = input_grad.get_shape();
-    const auto& input_grad_shape_wo_padding = input_grad_shape.value.without_padding();
+    const auto& input_grad_shape = input_grad.get_logical_shape();
+    //const auto& input_grad_shape_wo_padding = input_grad_shape.value.without_padding();


ayerofieiev-tt · 2025-01-23T18:35:16Z

ttnn/cpp/ttnn/operations/moreh/moreh/moreh_abs_pow/device/kernels/moreh_abs_pow_kernel.cpp

why is this file changed?

#0: Replacing get_shape()

8ccd319

jbedichekTT requested review from razorback3, dongjin-na, cfjchu, ayerofieiev-tt and dmakoviichuk-tt as code owners January 15, 2025 22:00

jbedichekTT changed the title ~~#0: Replacing get_shape()~~ Replacing get_shape() Jan 15, 2025