Skip to content

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Add recipe check vllm e2e
#929 opened Nov 21, 2024 by horheynm Loading…
[1/2] Expand e2e testing to prepare for lm-eval ready When a PR is ready for review
#922 opened Nov 17, 2024 by dsikka Loading…
[Bugfix] Support model offloading SparseGPTQ
#918 opened Nov 16, 2024 by kylesayrs Loading…
Implement HooksMixin
#917 opened Nov 14, 2024 by kylesayrs Loading…
Kylesayrs/gptq partition
#914 opened Nov 13, 2024 by kylesayrs Draft
fix consecutive oneshot
#898 opened Nov 5, 2024 by horheynm Loading…
Allow Shortcutting Min-max Observer
#887 opened Nov 1, 2024 by kylesayrs Loading…
FSDP utils cleanup
#854 opened Oct 19, 2024 by kylesayrs Loading…
[Bugfix] DisableKVCache Context
#834 opened Oct 9, 2024 by kylesayrs Loading…
Awq re implementation
#824 opened Oct 7, 2024 by rahul-tuli Draft
Enable Sparse compression
#822 opened Oct 7, 2024 by rahul-tuli Loading…
1 of 3 tasks
e2e tests
#742 opened Oct 1, 2024 by horheynm Loading…
Add: Weight clipping to AWQModifier
#184 opened Sep 18, 2024 by rahul-tuli Loading…
1 task
AWQ modifier implementation
#183 opened Sep 18, 2024 by rahul-tuli Loading…
Add: Some utility functions needed for AWQ Modifier
#182 opened Sep 18, 2024 by rahul-tuli Loading…
3 tasks
Feature Branch for AWQ Modifier
#181 opened Sep 18, 2024 by rahul-tuli Loading…
4 tasks
[KV Cache] kv-cache end to end tests ready When a PR is ready for review
#141 opened Sep 4, 2024 by horheynm Loading…
ProTip! Adding no:label will show everything without a label.