Replaced grouped_gemm
with vLLM's Fused MoE Kernel for Inference Optimization
#111
Job | Run time |
---|---|
11s | |
11s |
grouped_gemm
with vLLM's Fused MoE Kernel for Inference Optimization
#111
Job | Run time |
---|---|
11s | |
11s |