
AOTI filesize regression: *.pt2 filesize is bigger than *.so #1365

Open
metascroy opened this issue Nov 11, 2024 · 2 comments
Assignees: Jack-Khuu
Labels: actionable, bug, Compile / AOTI, Known Gaps

Comments

@metascroy
Contributor

🐛 Describe the bug

Exported the model to both .pt2 and .so formats; the .pt2 file is 2x larger:

llama31_1bit.pt2 filesize: 3.09GB
llama31_1bit.so filesize: 1.55GB
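
As a rough way to see where the extra size goes, here is a minimal sketch that lists the largest entries in the package. It assumes the .pt2 follows the PT2 archive format (a standard zip file), and uses the llama31_1bit.pt2 path from this report:

# Sketch: print the ten largest entries in the .pt2 package, assuming it is a zip archive.
import zipfile

with zipfile.ZipFile("llama31_1bit.pt2") as z:
    entries = sorted(z.infolist(), key=lambda i: i.file_size, reverse=True)
    for info in entries[:10]:
        # Note whether each entry is stored uncompressed or deflated.
        kind = "stored" if info.compress_type == zipfile.ZIP_STORED else "compressed"
        print(f"{info.file_size / 1e6:10.1f} MB  ({kind})  {info.filename}")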

pt2 command:

OMP_NUM_THREADS=6 python torchchat.py export llama3.1 --device cpu --dtype float32 --quantize '{"embedding:wx": {"bitwidth": 1, "groupsize": 32}, "linear:a8wxdq": {"bitwidth": 1, "groupsize": 256, "has_weight_zeros": false}}' --output-aoti-package-path llama31_1bit.pt2

so command:

OMP_NUM_THREADS=6 python torchchat.py export llama3.1 --device cpu --dtype float32 --quantize '{"embedding:wx": {"bitwidth": 1, "groupsize": 32}, "linear:a8wxdq": {"bitwidth": 1, "groupsize": 256, "has_weight_zeros": false}}' --output-dso llama31_1bit.so

Versions

Collecting environment information...
PyTorch version: 2.6.0.dev20241007
Is debug build: False
CUDA used to build PyTorch: None
ROCM used to build PyTorch: N/A

OS: macOS 14.7 (arm64)
GCC version: Could not collect
Clang version: 16.0.0 (clang-1600.0.26.3)
CMake version: version 3.30.5
Libc version: N/A

Python version: 3.10.0 (default, Mar 3 2022, 03:54:28) [Clang 12.0.0 ] (64-bit runtime)
Python platform: macOS-14.7-arm64-arm-64bit
Is CUDA available: False
CUDA runtime version: No CUDA
CUDA_MODULE_LOADING set to: N/A
GPU models and configuration: No CUDA
Nvidia driver version: No CUDA
cuDNN version: No CUDA
HIP runtime version: N/A
MIOpen runtime version: N/A
Is XNNPACK available: True

CPU:
Apple M1 Pro

Versions of relevant libraries:
[pip3] executorch==0.5.0a0+72b3bb3
[pip3] numpy==1.26.4
[pip3] torch==2.6.0.dev20241007
[pip3] torchao==0.5.0
[pip3] torchaudio==2.5.0.dev20241007
[pip3] torchsr==1.0.4
[pip3] torchtune==0.4.0.dev20241010+cpu
[pip3] torchvision==0.20.0.dev20241007
[conda] executorch 0.5.0a0+72b3bb3 pypi_0 pypi
[conda] numpy 1.26.4 pypi_0 pypi
[conda] torch 2.6.0.dev20241007 pypi_0 pypi
[conda] torchao 0.5.0 pypi_0 pypi
[conda] torchaudio 2.5.0.dev20241007 pypi_0 pypi
[conda] torchsr 1.0.4 pypi_0 pypi
[conda] torchtune 0.4.0.dev20241010+cpu pypi_0 pypi
[conda] torchvision 0.20.0.dev20241007 pypi_0 pypi

@Jack-Khuu Jack-Khuu added the bug, Known Gaps, actionable, and Compile / AOTI labels on Nov 12, 2024
@Jack-Khuu
Contributor

Yup yup, this is a known issue/bug in pytorch/pytorch

It'll be solved when this lands: pytorch/pytorch#140022

@Jack-Khuu Jack-Khuu self-assigned this Nov 16, 2024
@Jack-Khuu
Contributor

Will revisit after the pin bump (#1367)
