Releases: ggml-org/llama.cpp

b4821

04 Mar 17:16
56d7a9f
main: allow preloading conversation with -p and add -st / --single-tu…

b4820

04 Mar 07:05
1a24c46
`server`: fix deadly typo in response_format.json_schema.schema handl…

b4819

03 Mar 22:00
becade5
HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (#12032)

Adds GGML_HIP_ROCWMMA_FATTN and rocwmma header check
Adds rocWMMA support to fattn-wmma-f16

---

Signed-off-by: Carl Klemm <[email protected]>
Co-authored-by: Johannes Gäßler <[email protected]>
Co-authored-by: Ben Jackson <[email protected]>

b4818

03 Mar 17:01
sync : ggml

ggml-ci

b4806

03 Mar 14:49
c43af92
tts: add speaker file support (#12048)

* tts: add speaker file support

Signed-off-by: dm4 <[email protected]>

* tts: handle outetts-0.3

* tts : add new line in error message

---------

Signed-off-by: dm4 <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>

b4805

03 Mar 14:44
d5c63cd
test-backend-ops : add option -p to filter by op params (#12155)

b4804

03 Mar 14:43
9660ffe
ggml : fix kleidiai build (#12159)

The libggml API has changed, but the KleidiAI code had not been updated to match.

b4803

03 Mar 14:25
c950a1f
Adding UTF-8 support to llama.cpp (#12111)

For emojis, non-alpha characters, etc.

Signed-off-by: Eric Curtin <[email protected]>

b4801

03 Mar 10:47
ece9745
SYCL: Move CPY kernels to a separate file and add few missing kernels…

b4800

02 Mar 21:50
cc473ca
ggml-backend : keep paths in native string type when possible (#12144)