Add 24 compressor #167

rahul-tuli · 2024-09-26T15:42:35Z

This PR adds Sparse24Compressor, for 2:4 sparse models
The code is based off #129

Depends on:

This implements Part 3 of the Design doc: https://www.notion.so/Design-Document-24-Compressor-25ac643aee604c298f2bb12a6c220861?pvs=4

Class Hierarchy:

BaseCompressor (Abstract Class)
    |
    +-- BaseSparsityCompressor (Abstract Class)
           |
           +-- Sparse24Compressor

File Structure:

compressors/
└── sparse_compressors/
    ├── __init__.py
    ├── base.py                 <-- Contains BaseSparsityCompressor
    ├── dense.py
    ├── sparse_bitmask.py
    └── sparse24.py             <-- New file for Sparse24Compressor

horheynm

Very clean.
lgtm after tests
!!

Move tests Remove unused import

The base branch was changed.

markurtz

Overall code looks simple. I'd like to reformulate the scope, though. Specifically, I'm not following why we are restricting to just 2:4 right now when we could easily expand this to handle all sparsity cases and detect whether it is 2:4 format, some type of structured pruning, and if not any then set as unstructured. cc @dsikka

dsikka

testing?

rahul-tuli changed the base branch from main to update-folder-structure-compressors September 26, 2024 15:42

rahul-tuli force-pushed the add-24-compressor branch from 2015e71 to dea129e Compare September 26, 2024 15:45

rahul-tuli mentioned this pull request Sep 26, 2024

Add Sparse24Compressor #129

Closed

horheynm previously approved these changes Sep 26, 2024

View reviewed changes

rahul-tuli force-pushed the update-folder-structure-compressors branch 2 times, most recently from 2f69d16 to fc4b23c Compare October 2, 2024 20:56

rahul-tuli force-pushed the add-24-compressor branch from dea129e to 68ca6c3 Compare October 2, 2024 20:58

rahul-tuli force-pushed the update-folder-structure-compressors branch from fc4b23c to dd16499 Compare October 2, 2024 21:02

Update folder structure

7155e61

Move tests Remove unused import

rahul-tuli force-pushed the update-folder-structure-compressors branch from dd16499 to 7155e61 Compare October 2, 2024 21:06

Add: Sparse24Compressor

6636872

rahul-tuli force-pushed the add-24-compressor branch from 68ca6c3 to 6636872 Compare October 2, 2024 21:08

Base automatically changed from update-folder-structure-compressors to main October 3, 2024 00:43

Merge branch 'main' into add-24-compressor

729dfe5

mgoin mentioned this pull request Oct 4, 2024

[WIP] Example for 2:4 sparsity with w8a8 vllm-project/llm-compressor#775

Closed

Fix sync

e56bf72

markurtz reviewed Oct 18, 2024

View reviewed changes

markurtz mentioned this pull request Oct 18, 2024

No model size reduction seen vllm-project/llm-compressor#790

Open

dsikka requested changes Oct 25, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add 24 compressor #167

Add 24 compressor #167

rahul-tuli commented Sep 26, 2024 •

edited

Loading

horheynm left a comment

markurtz left a comment

dsikka left a comment

Add 24 compressor #167

Are you sure you want to change the base?

Add 24 compressor #167

Conversation

rahul-tuli commented Sep 26, 2024 • edited Loading

horheynm left a comment

Choose a reason for hiding this comment

markurtz left a comment

Choose a reason for hiding this comment

dsikka left a comment

Choose a reason for hiding this comment

rahul-tuli commented Sep 26, 2024 •

edited

Loading