Replaced grouped_gemm with vLLM's Fused MoE Kernel for Inference Optimization #111

Triggered via pull request November 12, 2024 09:45
@xffxffxffxff opened #66
vllm
Status: Success
Total duration: 19s

format.yml

on: pull_request
Matrix: build

Annotations

1 warning
build (3.10)
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/setup-python@v3. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
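
A warning like this is usually cleared by moving to a release of the action that targets Node 20. The excerpt below is a minimal sketch of how the relevant part of format.yml might look after that change. The actual contents of format.yml are not shown on this page, so the runner, the checkout step, and the matrix layout are assumptions. Only the pull_request trigger, the job name build, the 3.10 matrix entry, and the use of actions/setup-python come from the run summary.

```yaml
# Hypothetical excerpt of format.yml; the real file is not shown on this page.
on: pull_request

jobs:
  build:
    runs-on: ubuntu-latest            # assumption: runner is not stated in the summary
    strategy:
      matrix:
        python-version: ["3.10"]      # quoted so YAML does not parse it as the float 3.1
    steps:
      - uses: actions/checkout@v4     # assumption: checkout step and version
      - uses: actions/setup-python@v5 # was @v3; v5 runs on Node 20
        with:
          python-version: ${{ matrix.python-version }}
      # ... formatting steps from the original workflow would follow here
```

Pinning the major version tag (@v5) keeps minor and patch updates flowing without further edits. Pinning a full commit SHA is the stricter alternative if reproducibility matters more.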