Replaced grouped_gemm with vLLM's Fused MoE Kernel for Inference Optimization #111

Annotations

1 warning

build (3.10)

succeeded Nov 12, 2024 in 11s