
Bug fix: Enable fast to override quantize json #1377

Merged
merged 2 commits into from
Nov 20, 2024
Conversation

Jack-Khuu
Contributor

Follow-up fix for: 4697764

We want "fast" to work as a manual override of a --quantize accelerator config; the previous PR disabled this while fixing a different bug.
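
"fast" is resolved to a concrete backend before any comparison with the JSON config happens. A minimal sketch of that resolution, assuming a helper shaped like torchchat's get_device_str (the real implementation may differ):

    import torch

    def get_device_str(device: str) -> str:
        # "fast" picks the best available backend on this machine;
        # any other value passes through unchanged.
        if device == "fast":
            if torch.cuda.is_available():
                return "cuda"
            if torch.backends.mps.is_available():
                return "mps"
            return "cpu"
        return device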


No override

python3 torchchat.py generate llama3.1 --quantize '{"precision": {"dtype":"float16"}, "executor":{"accelerator":"mps"}}'

NumExpr defaulting to 10 threads.
PyTorch version 2.6.0.dev20241002 available.
lm_eval is not installed, GPTQ may not be usable
Using device=mps

Override with device arg

python3 torchchat.py generate llama3.1 --quantize '{"precision": {"dtype":"float16"}, "executor":{"accelerator":"mps"}}' --device cpu

overriding json-specified device mps with cli device cpu
NumExpr defaulting to 10 threads.
PyTorch version 2.6.0.dev20241002 available.
lm_eval is not installed, GPTQ may not be usable
Using device=cpu Apple M1 Max

Override with fast arg

python3 torchchat.py generate llama3.1 --quantize '{"precision": {"dtype":"float16"}, "executor":{"accelerator":"cpu"}}' --device fast

overriding json-specified device cpu with cli device mps
NumExpr defaulting to 10 threads.
PyTorch version 2.6.0.dev20241002 available.
lm_eval is not installed, GPTQ may not be usable
Using device=mps

Run with fast arg (same as running without device arg)

python3 torchchat.py generate llama3.1 --device fast

NumExpr defaulting to 10 threads.
PyTorch version 2.6.0.dev20241002 available.
lm_eval is not installed, GPTQ may not be usable
Using device=mps
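
Taken together, the four runs exercise one precedence rule: an explicit --device (including "fast", which is resolved first) beats the accelerator in the --quantize JSON, which beats the default. A hypothetical summary, reusing the get_device_str sketch above (resolve_device is illustrative, not torchchat's API):

    def resolve_device(cli_device, quantize_cfg, default_device="fast"):
        # CLI flag wins; otherwise fall back to the JSON executor config,
        # then to the default. "fast" is resolved in either path.
        json_device = (quantize_cfg or {}).get("executor", {}).get("accelerator")
        if cli_device is not None:
            return get_device_str(cli_device)
        return get_device_str(json_device or default_device)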


pytorch-bot bot commented Nov 15, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1377

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:
✅ No Failures

As of commit 8db4b72 with merge base 4697764:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label (managed by the Meta Open Source bot) on Nov 15, 2024
@Jack-Khuu
Contributor Author

cc: @mikekgfb

@Jack-Khuu requested review from byjlw, Gasoonjia, and vmpuri and removed the review request for byjlw on November 15, 2024 02:37
    args.device = get_device_str(
        args.quantize.get("executor", {}).get("accelerator", default_device)
    )
else:
    args.device = get_device_str(args.device)
    executor_handler = args.quantize.get("executor", None)
    if executor_handler:
Consider merging this if statement with the if below it:

Suggested change
-    if executor_handler:
+    if executor_handler and executor_handler["accelerator"] != args.device:
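
Applied in context, the merged condition folds the override into a single guard. A sketch of how that would read (the print and assignment lines are inferred from the "overriding json-specified device ..." output above, not copied from the PR diff):

    executor_handler = args.quantize.get("executor", None)
    # Only override when the JSON specifies an accelerator that differs
    # from the device the CLI resolved to.
    if executor_handler and executor_handler["accelerator"] != args.device:
        print(
            f"overriding json-specified device {executor_handler['accelerator']}"
            f" with cli device {args.device}"
        )
        executor_handler["accelerator"] = args.device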

@Jack-Khuu merged commit b809b69 into main on Nov 20, 2024
54 checks passed