Skip to content

Actions: ggerganov/llama.cpp

Publish Docker image

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
9,849 workflow runs
9,849 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

server: (web UI) Add samplers sequence customization (#10255)
Publish Docker image #14718: Commit bcdb7a2 pushed by ngxson
November 16, 2024 13:26 36m 58s master
November 16, 2024 13:26 36m 58s
vulkan: Optimize some mat-vec mul quant shaders (#10296)
Publish Docker image #14717: Commit 772703c pushed by 0cc4m
November 16, 2024 06:27 38m 43s master
November 16, 2024 06:27 38m 43s
ggml : optimize Q4_0 into Q4_0_X_Y repack (#10324)
Publish Docker image #14716: Commit 1e58ee1 pushed by slaren
November 16, 2024 00:53 1h 31m 39s master
November 16, 2024 00:53 1h 31m 39s
llama : save number of parameters and the size in llama_model (#10286)
Publish Docker image #14715: Commit 89e4caa pushed by slaren
November 16, 2024 00:42 54m 54s master
November 16, 2024 00:42 54m 54s
Make updates to fix issues with clang-cl builds while using AVX512 fl…
Publish Docker image #14714: Commit 74d73dc pushed by slaren
November 15, 2024 21:27 1h 34m 20s master
November 15, 2024 21:27 1h 34m 20s
ggml : fix some build issues
Publish Docker image #14713: Commit 883d206 pushed by ggerganov
November 15, 2024 19:45 2h 8m 26s master
November 15, 2024 19:45 2h 8m 26s
cmake : fix ppc64 check (whisper/0)
Publish Docker image #14712: Commit 09ecbcb pushed by ggerganov
November 15, 2024 13:44 5h 40m 50s master
November 15, 2024 13:44 5h 40m 50s
AVX BF16 and single scale quant optimizations (#10212)
Publish Docker image #14711: Commit 1842922 pushed by slaren
November 15, 2024 11:48 5h 46m 54s master
November 15, 2024 11:48 5h 46m 54s
sycl: Update Intel docker images to use DPC++ 2025.0 (#10305)
Publish Docker image #14710: Commit 57f8355 pushed by ggerganov
November 15, 2024 11:10 5h 2m 1s master
November 15, 2024 11:10 5h 2m 1s
server : (web UI) add copy button for code block, fix api key (#10242)
Publish Docker image #14709: Commit 9901068 pushed by ngxson
November 15, 2024 09:48 3h 51m 51s master
November 15, 2024 09:48 3h 51m 51s
cann: dockerfile and doc adjustment (#10302)
Publish Docker image #14708: Commit 231f936 pushed by hipudding
November 15, 2024 07:09 1h 3m 26s master
November 15, 2024 07:09 1h 3m 26s
sycl: Use syclcompat::dp4a (#10267)
Publish Docker image #14707: Commit 5a54af4 pushed by airMeng
November 15, 2024 03:09 5m 10s master
November 15, 2024 03:09 5m 10s
backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (#9921)
Publish Docker image #14706: Commit 1607a5e pushed by slaren
November 15, 2024 00:28 30m 52s master
November 15, 2024 00:28 30m 52s
ggml : build backends as libraries (#10256)
Publish Docker image #14705: Commit ae8de6d pushed by slaren
November 14, 2024 17:04 59m 39s master
November 14, 2024 17:04 59m 39s
CUDA: no -sm row for very small matrices (#10185)
Publish Docker image #14704: Commit 4a8ccb3 pushed by JohannesGaessler
November 14, 2024 12:00 31m 46s master
November 14, 2024 12:00 31m 46s
speculative : fix out-of-bounds access (#10289)
Publish Docker image #14703: Commit 2a82891 pushed by ggerganov
November 14, 2024 09:44 1h 11m 44s master
November 14, 2024 09:44 1h 11m 44s
vulkan: Optimize binary ops (#10270)
Publish Docker image #14702: Commit af148c9 pushed by 0cc4m
November 14, 2024 05:22 32m 55s master
November 14, 2024 05:22 32m 55s
vulkan: Use macros to make the mat mul pipeline creation more concise…
Publish Docker image #14701: Commit 66798e4 pushed by 0cc4m
November 13, 2024 20:59 2h 30m 52s master
November 13, 2024 20:59 2h 30m 52s
llama : propagate the results of graph_compute (#9525)
Publish Docker image #14700: Commit fb4a0ec pushed by ggerganov
November 13, 2024 18:00 57m 18s master
November 13, 2024 18:00 57m 18s
server : fix incorrect res in validate_model_chat_template (#10272)
Publish Docker image #14699: Commit 0e712a5 pushed by ggerganov
November 13, 2024 11:15 36m 23s master
November 13, 2024 11:15 36m 23s
sycl : Fixes to broken builds and test-backend-ops (#10257)
Publish Docker image #14698: Commit 2e82ffa pushed by Alcpz
November 13, 2024 09:40 32m 37s master
November 13, 2024 09:40 32m 37s
vulkan: Optimize contiguous copies (#10254)
Publish Docker image #14697: Commit 80dd7ff pushed by 0cc4m
November 13, 2024 06:59 32m 52s master
November 13, 2024 06:59 32m 52s
vulkan: Throttle the number of shader compiles during the build step.…
Publish Docker image #14696: Commit 54ef9cf pushed by 0cc4m
November 11, 2024 17:13 31m 41s master
November 11, 2024 17:13 31m 41s
metal : more precise Q*K in FA vec kernel (#10247)
Publish Docker image #14695: Commit b0cefea pushed by ggerganov
November 11, 2024 06:39 1h 21m 40s master
November 11, 2024 06:39 1h 21m 40s
server : enable KV cache defrag by default (#10233)
Publish Docker image #14694: Commit b141e5f pushed by ggerganov
November 11, 2024 06:38 32m 50s master
November 11, 2024 06:38 32m 50s