LLVM and SPIRV-LLVM-Translator pulldown (WW04 2025) #16781

iclsrc · 2025-01-26T16:03:06Z

LLVM: llvm/llvm-project@915f3ed
SPIRV-LLVM-Translator: KhronosGroup/SPIRV-LLVM-Translator@cec12d6cf46306d

- Added support for AArch64-specific build attributes. - Print AArch64 build attributes to assembly. - Parse AArch64 build attributes from assembly. - Emit AArch64 build attributes to ELF. Specification: ARM-software/abi-aa#230

…ap. (#123813) Currently we make two memory allocations for each PyOperation: a Python object, and the PyOperation class itself. With some care we can allocate the PyOperation inline inside the Python object, saving us a malloc() call per object and perhaps improving cache locality.

@leewei05

This PR replaces some instances of `undef` with `function argument value` or `poison` or `concrete values` in several tests under `llvm/test/Transforms/` directory. These changes align with modern LLVM standards for better-defined behavior and test determinism. If this small PR is okay and gets merged, I will work on the rest. This is inspired by [this project](https://discourse.llvm.org/t/gsoc-2024-remove-undefined-behavior-from-tests/77236/29), work done on this by @leewei05

…width size

…ame (#123275) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/828965

Users of the PlayStation SDK aren't given the means to create or run static executables. Uses of `-static` are limited a few specialized cases within SIE. A `--build-id` isn't wanted in those cases. SIE tracker: TOOLCHAIN-16704

Summary: Previously, managed variables didn't work in rdc mode using the new driver because we just didn't register them. This was previously ignored because we didn't have enough space in the current struct format. This patch amends that by just emitting a struct pair for the two variables and using the single pointer. In the future, a more extensible entry format would be nice, but that can be done later.

This reverts commit 43177b5.

…gn-comprison (#122127) - add an option `EnableQtSupport`, that makes C++17 `q20::cmp_*` alternative available for Qt-based applications.

No test changes with this removed and it appears to be obsolete.

…#123803) This fixes a compile-time regression caused by #116645, where an entry basic block with a very large number of allocas and other instructions caused SROA to take ~100× its expected runtime, as every alloca (with ~2 uses) now calls this method to find the order of those few instructions, rescanning the very large basic block every single time. Since this code was originally written, Instructions now have ordering numbers available to determine relative order without unnecessarily scanning the basic block.

It is sufficient to just use `HAVE_DLOPEN`.

This patch fixes: mlir/lib/Dialect/Tosa/Transforms/TosaInferShapes.cpp:309:7: error: variable 'errs' set but not used [-Werror,-Wunused-but-set-variable]

…827) This commit addresses some uncertainty raised in 84fa175 as to which features Apple M4 has.

…… (#120566) …the distributed IR case. This patch allows `nd_load` and `nd_store` to preserve the tensor descriptor shape during distribution to SIMT. The validation now expects the distributed instruction to retain the `sg_map` attribute and uses it to verify the consistency.

I think the std::begin/end were to work around an old gcc bug. Hopefully we don't need them anymore.

This holds a physical register unit or virtual register and mask. While I was here I've used emplace_back and removed an unneeded use of a template.

…t r… (#122726)" This reverts commit c3ba6f3. We are seeing performance regressions of up to 40% on some compilations with this patch, we will investigate and reland after fixing performance issues.

CONFLICT (content): Merge conflict in libclc/clc/include/clc/clcmacro.h CONFLICT (content): Merge conflict in libclc/generic/lib/common/mix.cl CONFLICT (content): Merge conflict in libclc/generic/lib/common/mix.inc CONFLICT (content): Merge conflict in libclc/generic/lib/math/mad.cl CONFLICT (modify/delete): libclc/generic/lib/math/mad.inc deleted in c8eb865 and modified in HEAD. Version HEAD of libclc/generic/lib/math/mad.inc left in tree. CONFLICT (modify/delete): libclc/generic/lib/math/sincospiF_piby4.h deleted in HEAD and modified in c8eb865. Version c8eb865 of libclc/generic/lib/math/sincospiF_piby4.h left in tree. CONFLICT (content): Merge conflict in libclc/libspirv/lib/generic/math/clc_exp10.cl CONFLICT (content): Merge conflict in libclc/libspirv/lib/generic/math/clc_hypot.cl CONFLICT (content): Merge conflict in libclc/libspirv/lib/generic/math/clc_pow.cl

MrSidims · 2025-01-30T15:28:42Z

I'm fine with Update spirv-headers-tag.conf despite revert of dd33e595 and [SYCL][E2E] XFAIL multisource.cpp for now

frasercrmck · 2025-01-30T15:43:46Z

clang/lib/Driver/ToolChains/Cuda.cpp

@@ -517,7 +517,7 @@ void NVPTX::Assembler::ConstructJob(Compilation &C, const JobAction &JA,
 static bool shouldIncludePTX(const ArgList &Args, StringRef InputArch) {
  // The new driver does not include PTX by default to avoid overhead.
  bool includePTX = !Args.hasFlag(options::OPT_offload_new_driver,
-                                  options::OPT_no_offload_new_driver, true);
+                                  options::OPT_no_offload_new_driver, false); // INTEL


I can't really answer to what's going on here, sorry. I suspect that this indicates we're not passing the right flag to control the new offload driver? The false should essentially be equivalent to us explicitly passing -fno-offload-new-driver to the driver.

Perhaps this is okay for now but we need to investigate this properly.

The new offload driver is currently not enabled by default for intel/llvm. The plan is to move to the new model this year.

Makes sense, thanks. But might it be easier to explicitly disable the new offload driver by passing the option, rather than have to change the default values of various hasFlag checks?

Yes, this is the easiest workaround to let cuda sycl use new offload driver for now. Once we switch the default to new offload driver, we should remove this workaround.

jsji · 2025-01-30T16:09:16Z

@intel/llvm-gatekeepers I think this is ready for merge. Last CI run was success, the new changes after that are mostly NFC (I have tested locally for NVPTX codegen tests). The current CI is broken, so please merge when CI is fixed.

sarnex · 2025-01-30T16:13:30Z

CI should be fixed, so ping me when CI passes and this is ready for merge

jsji · 2025-01-31T01:19:04Z

CI should be fixed, so ping me when CI passes and this is ready for merge

This is ready for merge now. @sarnex The failure in post-commit e2e-line intel arc are common to others.

sarnex · 2025-01-31T14:45:17Z

/merge

bb-sycl · 2025-01-31T14:45:47Z

Fri 31 Jan 2025 02:45:47 PM UTC --- Start to merge the commit into sycl branch. It will take several minutes.

bb-sycl · 2025-01-31T14:51:03Z

Fri 31 Jan 2025 02:51:02 PM UTC --- Merge the branch in this PR to base automatically. Will close the PR later.

sivan-shani and others added 30 commits January 22, 2025 14:23

[SLP][NFC]Add a test with potential alternate node, marked for minbit…

ccd7795

…width size

[X86][AVX10.2-SATCVT][NFC] Remove NE from intrinsic and instruction n…

4f40b07

…ame (#123275) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/828965

[clangd][NFC] Delete a pessimizing move

a2063ba

Remove references to mips within Android (#123856)

a7a8694

[gn] port 6aeffcd

4170d61

[gn] fix mistake in d7fb4a2

d0a89e7

Revert "[GISel] Add more FP opcodes to CSE (#123624)" (#123954)

c938436

This reverts commit 43177b5.

[clang-tidy] Add EnableQtSupport option to modernize-use-integer-si…

aa580c2

…gn-comprison (#122127) - add an option `EnableQtSupport`, that makes C++17 `q20::cmp_*` alternative available for Qt-based applications.

AMDGPU: Delete FillMFMAShadowMutation (#123861)

93d35ad

No test changes with this removed and it appears to be obsolete.

[Clang][Arch] Disable mve.fp when explicit -mfpu option (#123028)

6b486f4

[CMake] Remove HAVE_DLFCN_H and HAVE_DLADDR (#123879)

58c6d44

It is sufficient to just use `HAVE_DLOPEN`.

[mlir] Fix a warning

5a9b74d

This patch fixes: mlir/lib/Dialect/Tosa/Transforms/TosaInferShapes.cpp:309:7: error: variable 'errs' set but not used [-Werror,-Wunused-but-set-variable]

[llvm][AArch64] apple-m4 does not have FEAT_{SPEv1p2,SEL2,MPAM} (#123…

75ce2dc

…827) This commit addresses some uncertainty raised in 84fa175 as to which features Apple M4 has.

[X86] Simplify ArrayRef construction. NFC (#123899)

13d09df

I think the std::begin/end were to work around an old gcc bug. Hopefully we don't need them anymore.

[CodeGen] Rename RegisterMaskPair to VRegMaskOrUnit. NFC (#123799)

9e6494c

This holds a physical register unit or virtual register and mask. While I was here I've used emplace_back and removed an unneeded use of a template.

Revert "[Modules] Delay deserialization of preferred_name attribute a…

f63e8ed

…t r… (#122726)" This reverts commit c3ba6f3. We are seeing performance regressions of up to 40% on some compilations with this patch, we will investigate and reland after fixing performance issues.

[X86] var-permute-256.ll - regenerate VPTERNLOG comments

16298e4

[X86] add/sub signed sat vec tests - regenerate VPTERNLOG comments

603529b

[X86] avx512-broadcast-unfold.ll - regenerate VPTERNLOG comments

e6c7d6a

[X86] avx512 intrinsics tests - regenerate VPTERNLOG comments

bb754f2

[X86] vector rotate tests - regenerate VPTERNLOG comments

a25f2cb

[X86] vector reduction tests - regenerate VPTERNLOG comments

44f3168

jsji had a problem deploying to WindowsCILock January 30, 2025 14:25 — with GitHub Actions Failure

[NFC] Add comments about early exiting in populateKernels

5a6655a

jsji had a problem deploying to WindowsCILock January 30, 2025 14:30 — with GitHub Actions Failure

jsji had a problem deploying to WindowsCILock January 30, 2025 14:30 — with GitHub Actions Error

jsji closed this Jan 30, 2025

jsji reopened this Jan 30, 2025

jsji had a problem deploying to WindowsCILock January 30, 2025 15:02 — with GitHub Actions Failure

jsji had a problem deploying to WindowsCILock January 30, 2025 15:03 — with GitHub Actions Failure

MrSidims approved these changes Jan 30, 2025

View reviewed changes

frasercrmck reviewed Jan 30, 2025

View reviewed changes

frasercrmck approved these changes Jan 30, 2025

View reviewed changes

mdtoguchi approved these changes Jan 30, 2025

View reviewed changes

jsji closed this Jan 30, 2025

jsji reopened this Jan 30, 2025

jsji temporarily deployed to WindowsCILock January 30, 2025 16:11 — with GitHub Actions Inactive

jsji had a problem deploying to WindowsCILock January 30, 2025 16:12 — with GitHub Actions Failure

jsji had a problem deploying to WindowsCILock January 30, 2025 17:13 — with GitHub Actions Failure

jsji temporarily deployed to WindowsCILock January 30, 2025 17:54 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock January 30, 2025 18:09 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock January 30, 2025 19:09 — with GitHub Actions Inactive

bb-sycl approved these changes Jan 31, 2025

View reviewed changes

bb-sycl merged commit 0a81741 into sycl Jan 31, 2025
61 of 91 checks passed

aelovikov-intel deleted the llvmspirv_pulldown branch January 31, 2025 17:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LLVM and SPIRV-LLVM-Translator pulldown (WW04 2025) #16781

LLVM and SPIRV-LLVM-Translator pulldown (WW04 2025) #16781

iclsrc commented Jan 26, 2025

MrSidims commented Jan 30, 2025

frasercrmck Jan 30, 2025

mdtoguchi Jan 30, 2025

frasercrmck Jan 30, 2025

jsji Jan 30, 2025

jsji commented Jan 30, 2025 •

edited

Loading

sarnex commented Jan 30, 2025

jsji commented Jan 31, 2025

sarnex commented Jan 31, 2025

bb-sycl commented Jan 31, 2025

bb-sycl commented Jan 31, 2025

LLVM and SPIRV-LLVM-Translator pulldown (WW04 2025) #16781

LLVM and SPIRV-LLVM-Translator pulldown (WW04 2025) #16781

Conversation

iclsrc commented Jan 26, 2025

MrSidims commented Jan 30, 2025

frasercrmck Jan 30, 2025

Choose a reason for hiding this comment

mdtoguchi Jan 30, 2025

Choose a reason for hiding this comment

frasercrmck Jan 30, 2025

Choose a reason for hiding this comment

jsji Jan 30, 2025

Choose a reason for hiding this comment

jsji commented Jan 30, 2025 • edited Loading

sarnex commented Jan 30, 2025

jsji commented Jan 31, 2025

sarnex commented Jan 31, 2025

bb-sycl commented Jan 31, 2025

bb-sycl commented Jan 31, 2025

jsji commented Jan 30, 2025 •

edited

Loading