Actions: ggerganov/llama.cpp

Server

8,059 workflow runs

Rename Olmo1124 to Olmo2
Server #8376: Pull request #10500 opened by 2015aroras
November 25, 2024 16:59 1h 33m 30s 2015aroras:rename-olmo-nov
vulkan: get the first command buffer submitted sooner
Server #8375: Pull request #10499 opened by jeffbolznv
November 25, 2024 16:54 1h 31m 15s jeffbolznv:overlap
Add some minimal optimizations for CDNA
Server #8374: Pull request #10498 opened by IMbackK
November 25, 2024 16:41 1h 12m 56s IMbackK:master
llama : accept a list of devices to use to offload a model
Server #8373: Pull request #10497 synchronize by slaren
November 25, 2024 16:33 28m 8s sl/llama-dev-selection
llama : accept a list of devices to use to offload a model
Server #8372: Pull request #10497 synchronize by slaren
November 25, 2024 16:24 9m 37s sl/llama-dev-selection
llama : accept a list of devices to use to offload a model
Server #8371: Pull request #10497 synchronize by slaren
November 25, 2024 16:16 8m 27s sl/llama-dev-selection
llama : accept a list of devices to use to offload a model
Server #8370: Pull request #10497 opened by slaren
November 25, 2024 16:14 1m 28s sl/llama-dev-selection
Add download chat feature to server chat (#10481)
Server #8369: Commit a9a678a pushed by ngxson
November 25, 2024 16:11 1h 29m 33s master
vulkan: fix group_norm
Server #8368: Pull request #10496 opened by jeffbolznv
November 25, 2024 15:22 2h 15m 2s jeffbolznv:group_norm
imatrix-combine-only idea
Server #8367: Pull request #10492 synchronize by robbiemu
November 25, 2024 14:51 Action required robbiemu:imatrix-combine-only
imatrix-combine-only idea
Server #8366: Pull request #10492 opened by robbiemu
November 25, 2024 14:32 Action required robbiemu:imatrix-combine-only
server : add speculative decoding support (#10455)
Server #8365: Commit 9ca2e67 pushed by ggerganov
November 25, 2024 14:31 1h 46m 26s master
metal : enable mat-vec kernels for bs <= 4
Server #8364: Pull request #10491 opened by ggerganov
November 25, 2024 14:27 1h 32m 37s gg/metal-enable-mv
Add download chat feature to server chat
Server #8363: Pull request #10481 synchronize by ngxson
November 25, 2024 14:20 1h 30m 50s brucepro:llama-server-chat-enhancements
ggml : add support for dynamic loading of backends (#10469)
Server #8362: Commit 5931c1f pushed by slaren
November 25, 2024 14:13 1h 4m 59s master
tests : fix compile warning
Server #8361: Commit f6d12e7 pushed by ggerganov
November 25, 2024 13:17 59m 35s master
metal : use F16 math in mul_mat kernels
Server #8360: Pull request #10220 synchronize by ggerganov
November 25, 2024 13:16 42m 45s gg/metal-mul-mat-f16
metal : minor code formatting
Server #8359: Commit b756441 pushed by ggerganov
November 25, 2024 13:08 6m 39s master
server : add speculative decoding support
Server #8355: Pull request #10455 synchronize by ggerganov
November 25, 2024 08:17 8m 35s gg/speculative-server
server : add speculative decoding support
Server #8353: Pull request #10455 synchronize by ggerganov
November 25, 2024 08:05 7m 0s gg/speculative-server
server : add speculative decoding support
Server #8352: Pull request #10455 synchronize by ggerganov
November 25, 2024 07:51 8m 42s gg/speculative-server