Skip to content

Actions: VJHack/llama.cpp

Server

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
29 workflow runs
29 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

server : (UI) Improve messages bubble shape in RTL (#11220)
Server #29: Commit 504af20 pushed by VJHack
January 14, 2025 00:27 5m 58s master
January 14, 2025 00:27 5m 58s
llama: add support for QRWKV6 model architecture (#11001)
Server #28: Commit ee7136c pushed by VJHack
January 10, 2025 05:10 5m 53s master
January 10, 2025 05:10 5m 53s
ci : use actions from ggml-org (#11140)
Server #27: Commit f7cd133 pushed by VJHack
January 8, 2025 16:31 5m 42s master
January 8, 2025 16:31 5m 42s
sync : ggml
Server #26: Commit 99a3755 pushed by VJHack
January 8, 2025 12:36 6m 29s master
January 8, 2025 12:36 6m 29s
llama-run : fix context size (#11094)
Server #25: Commit dc7cef9 pushed by VJHack
January 6, 2025 23:51 5m 55s master
January 6, 2025 23:51 5m 55s
common : add missing env var for speculative (#10801)
Server #24: Commit 9fdb124 pushed by VJHack
December 12, 2024 16:17 6m 46s master
December 12, 2024 16:17 6m 46s
docs: update server streaming mode documentation (#9519)
Server #23: Commit 5555c0c pushed by VJHack
December 11, 2024 23:20 4m 34s master
December 11, 2024 23:20 4m 34s
Update README.md (#10772)
Server #22: Commit 1a31d0d pushed by VJHack
December 11, 2024 17:17 4m 55s master
December 11, 2024 17:17 4m 55s
CUDA: fix shared memory access condition for mmv (#10740)
Server #21: Commit 26a8406 pushed by VJHack
December 10, 2024 02:16 6m 3s master
December 10, 2024 02:16 6m 3s
convert : add custom attention mapping
Server #20: Commit c5ede38 pushed by VJHack
December 6, 2024 20:38 4m 32s master
December 6, 2024 20:38 4m 32s
ggml-cpu: replace AArch64 NEON assembly with intrinsics in ggml_gemv_…
Server #19: Commit 0c39f44 pushed by VJHack
November 30, 2024 20:34 5m 38s master
November 30, 2024 20:34 5m 38s
Introduce llama-run (#10291)
Server #18: Commit 0cc6375 pushed by VJHack
November 25, 2024 23:35 8m 15s master
November 25, 2024 23:35 8m 15s
readme : update hot topics
Server #17: Commit ba6f62e pushed by VJHack
November 1, 2024 17:13 9m 8s master
November 1, 2024 17:13 9m 8s
llama : switch KQ multiplication to F32 precision by default (#10015)
Server #16: Commit 8841ce3 pushed by VJHack
October 27, 2024 20:09 7m 5s master
October 27, 2024 20:09 7m 5s
readme : update bindings list (#9918)
Server #15: Commit 3752217 pushed by VJHack
October 17, 2024 14:36 26m 21s master
October 17, 2024 14:36 26m 21s
allow disable context shift for sever
Server #14: Commit 5688864 pushed by VJHack
September 19, 2024 00:34 11m 14s master
September 19, 2024 00:34 11m 14s
ggml : fix n_threads_cur initialization with one thread (#9538)
Server #13: Commit 64c6af3 pushed by VJHack
September 18, 2024 22:17 24m 27s master
September 18, 2024 22:17 24m 27s
[SYCL]set context default value to avoid memory issue, update guide (…
Server #12: Commit faf67b3 pushed by VJHack
September 18, 2024 01:25 10m 23s master
September 18, 2024 01:25 10m 23s
ggml : move common CPU backend impl to new header (#9509)
Server #11: Commit 23e0d70 pushed by VJHack
September 17, 2024 01:49 10m 39s master
September 17, 2024 01:49 10m 39s
py : add "LLaMAForCausalLM" conversion support (#9485)
Server #10: Commit 3c7989f pushed by VJHack
September 15, 2024 14:27 22m 51s master
September 15, 2024 14:27 22m 51s
made loading message more descriptive
Server #9: Commit 739ea75 pushed by VJHack
September 13, 2024 04:14 16m 3s master
September 13, 2024 04:14 16m 3s
removed print statement
Server #8: Commit df9f167 pushed by VJHack
September 13, 2024 04:04 10m 34s master
September 13, 2024 04:04 10m 34s
eol fix
Server #7: Commit cd80fce pushed by VJHack
September 13, 2024 03:16 11m 13s master
September 13, 2024 03:16 11m 13s
Merge branch 'ggerganov:master' into master
Server #6: Commit 69c97bb pushed by VJHack
September 13, 2024 03:14 2m 54s master
September 13, 2024 03:14 2m 54s
precommit corrections
Server #5: Commit 42abdd0 pushed by VJHack
September 13, 2024 03:04 11m 38s master
September 13, 2024 03:04 11m 38s