Update ARM CPU experimental kernels from AO to leverage pip install #1458

metascroy · 2025-01-15T21:14:19Z

torchao experimental CPU kernels are now installed and loaded automatically by pip.
Switch quantization to use new quantize_ API

pytorch-bot · 2025-01-15T21:14:22Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1458

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures

As of commit 67dd729 with merge base e5cf6e5 ():

NEW FAILURES - The following jobs have failed:

pull / test-torchao-experimental-cpp (macos-14-xlarge) (gh)
Process completed with exit code 134.
pull / test-torchao-experimental-et (macos-14-xlarge) (gh)
AttributeError: '_OpNamespace' 'torchao' object has no attribute '_pack_8bit_act_4bit_weight'

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Jack-Khuu

Thanks for making the pip install work with the subclass APIs!!

docs/quantization.md

torchchat/utils/quantize.py

Jack-Khuu · 2025-01-15T21:30:12Z

cc: @manuelcandales

Can have please for MPS? 🥺🥺 (Separate PR)

Jack-Khuu · 2025-01-18T00:26:07Z

Awaiting pytorch/executorch#7759

metascroy · 2025-01-28T21:48:35Z

@Jack-Khuu did you update the version AO uses in ET?

Jack-Khuu · 2025-01-29T01:32:41Z

Yup pytorch/executorch@9836b39 Points to pytorch/ao@11333ba

Co-authored-by: Jack-Khuu <[email protected]>

nikhil-arm · 2025-02-20T12:49:19Z

Hello @metascroy @Jack-Khuu , what is the plan to get this in mainline? We would like to use KleidiAI kernels from aten via this quantizer path. Let us know if we need to raise a new PR ?

Jack-Khuu · 2025-02-20T18:56:52Z

Hi @nikhil-arm, we're still planning to land this

Can you share the specific commit hashes y'all need?

Jack-Khuu · 2025-02-26T21:46:08Z

@nikhil-arm We've bumped the AO pin on main.
Please let me know if you there's any additional support needed to unblock KleidiAI kernels

install/install_requirements.sh

Jack-Khuu · 2025-02-27T01:53:58Z

After a suite of rebases, pinbumps, and splitting up tests up we know what we're tackling:

test-torchao-experimental-cpp (macos-14-xlarge): Tests the AOTI runner and likely failing (also in main) due to not linking to the LibOMP from torch as @malfet mentioned in Bump PT 2025131 and ET pins 20250209 #1493.
test-torchao-experimental-et (macos-14-xlarge): Tests the ET runner; looks like a install bug where USE_CPP isn't set, but will likely run into the same LibOMP issue as above

metascroy · 2025-03-05T23:42:50Z

Hello @metascroy @Jack-Khuu , what is the plan to get this in mainline? We would like to use KleidiAI kernels from aten via this quantizer path. Let us know if we need to raise a new PR ?

Sorry about the delay @nikhil-arm.

@Jack-Khuu let's try to get this landed within the next week. Bumping the ao pin in torchchat had various conflicts with the CI, but I think we can dedicate to making this work.

I think it does make sense to first land pytorch/ao#1836 in torchao before bumping because they've already deprecated the old quantizers in quantize_.

metascroy requested a review from Jack-Khuu January 15, 2025 21:14

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jan 15, 2025

Jack-Khuu added the Quantization Issues related to Quantization or torchao label Jan 15, 2025

Jack-Khuu approved these changes Jan 15, 2025

View reviewed changes

docs/quantization.md Outdated Show resolved Hide resolved

torchchat/utils/quantize.py Outdated Show resolved Hide resolved

torchchat/utils/quantize.py Show resolved Hide resolved

torchchat/utils/quantize.py Outdated Show resolved Hide resolved

Jack-Khuu changed the title ~~update experimental kernels in torchchat~~ Update ARM CPU experimental kernels from AO to leverage pip install Jan 15, 2025

metascroy and others added 6 commits January 29, 2025 17:15

update experimental kernels in torchchat

bdac616

Update docs/quantization.md

74363e4

Co-authored-by: Jack-Khuu <[email protected]>

Update torchchat/utils/quantize.py

48f568d

Co-authored-by: Jack-Khuu <[email protected]>

Update torchchat/utils/quantize.py

525701d

Co-authored-by: Jack-Khuu <[email protected]>

Fixing import typo in quantize.py

f9a7bb9

Bump ET pin to pick up AO changes

0abe175

metascroy force-pushed the new-intx-quantizer branch from 8ebf63f to 0abe175 Compare January 30, 2025 01:15

Merge branch 'main' into new-intx-quantizer

95304b8

Jack-Khuu mentioned this pull request Feb 11, 2025

Bump PT 2025131 and ET pins 20250209 #1493

Merged

Jack-Khuu added 2 commits February 11, 2025 11:32

Bump torchao-pin to match ET and torchchat

76e8ec5

Merge branch 'main' into new-intx-quantizer

c2108d6

Jack-Khuu added 2 commits February 24, 2025 11:45

Merge branch 'main' into new-intx-quantizer

4ee1b96

Merge branch 'main' into new-intx-quantizer

61a1c62

metascroy commented Feb 27, 2025

View reviewed changes

install/install_requirements.sh Show resolved Hide resolved

Jack-Khuu added 2 commits February 26, 2025 16:13

Update torchao-pin.txt

3e04645

Split up AOTI and ET tests

94fcd9a

Jack-Khuu added 3 commits February 26, 2025 17:55

Bump ET pin to 2-26-25 with new AO pin

7e56c55

Undo et pin bump; fails basic install

77e8a62

Merge branch 'main' into new-intx-quantizer

67dd729

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update ARM CPU experimental kernels from AO to leverage pip install #1458

Update ARM CPU experimental kernels from AO to leverage pip install #1458

metascroy commented Jan 15, 2025

pytorch-bot bot commented Jan 15, 2025 •

edited

Loading

Jack-Khuu left a comment

Jack-Khuu commented Jan 15, 2025

Jack-Khuu commented Jan 18, 2025

metascroy commented Jan 28, 2025

Jack-Khuu commented Jan 29, 2025

nikhil-arm commented Feb 20, 2025

Jack-Khuu commented Feb 20, 2025

Jack-Khuu commented Feb 26, 2025

Jack-Khuu commented Feb 27, 2025

metascroy commented Mar 5, 2025

Update ARM CPU experimental kernels from AO to leverage pip install #1458

Are you sure you want to change the base?

Update ARM CPU experimental kernels from AO to leverage pip install #1458

Conversation

metascroy commented Jan 15, 2025

pytorch-bot bot commented Jan 15, 2025 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1458

❌ 2 New Failures

Jack-Khuu left a comment

Choose a reason for hiding this comment

Jack-Khuu commented Jan 15, 2025

Jack-Khuu commented Jan 18, 2025

metascroy commented Jan 28, 2025

Jack-Khuu commented Jan 29, 2025

nikhil-arm commented Feb 20, 2025

Jack-Khuu commented Feb 20, 2025

Jack-Khuu commented Feb 26, 2025

Jack-Khuu commented Feb 27, 2025

metascroy commented Mar 5, 2025

pytorch-bot bot commented Jan 15, 2025 •

edited

Loading