Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add upsample_bilinear2d channels last kernel implementation #950

Open
wants to merge 4 commits into
base: main
Choose a base branch
from

Conversation

majing921201
Copy link
Contributor

@majing921201 majing921201 commented Sep 30, 2024

We got 2x perf improvement in backward but no change for forward,
For non-CL forward kernel, we observed little perf improvement. There is a inner loop optimization in forward kernel


                                               Name    Self CPU %      Self CPU   CPU total %     CPU total  CPU time avg      Self XPU    Self XPU %     XPU total  XPU time avg    # of Calls

        UpsampleBilinear2dBackwardnhwcKerne...         0.00%       0.000us         0.00%       0.000us       0.000us       3.582ms        94.79%       3.582ms       3.582ms             1

        UpsampleBilinear2dnhwcKernelFunctor...         0.00%       0.000us         0.00%       0.000us       0.000us     184.800us         4.89%     184.800us     184.800us             1

@majing921201 majing921201 changed the title Aadd upsample_bilinear2d channels last kernel implementation Add upsample_bilinear2d channels last kernel implementation Sep 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants