Releases: ggml-org/llama.cpp
b4821
main: allow preloading conversation with -p and add -st / --single-tu…
b4820
`server`: fix deadly typo in response_format.json_schema.schema handl…
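For context, the field this fix concerns sits two levels deep in an OpenAI-compatible request body. A minimal sketch of reading it (illustrative Python, not the server's actual C++ code; the request payload and helper name are hypothetical):

```python
# Sketch (not llama.cpp server code): in an OpenAI-style request body the
# response-constraining schema lives under response_format.json_schema.schema,
# which is the nested field the typo fix concerns.
def extract_json_schema(body: dict):
    """Return the schema constraining the model's output, or None."""
    rf = body.get("response_format") or {}
    if rf.get("type") != "json_schema":
        return None
    # One level further down: json_schema.schema, not json_schema itself.
    return (rf.get("json_schema") or {}).get("schema")

request = {
    "messages": [{"role": "user", "content": "Name a color."}],
    "response_format": {
        "type": "json_schema",
        "json_schema": {
            "schema": {
                "type": "object",
                "properties": {"color": {"type": "string"}},
                "required": ["color"],
            }
        },
    },
}
```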
b4819
HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (#12032)
  * Adds GGML_HIP_ROCWMMA_FATTN and rocwmma header check
  * Adds rocWMMA support to fattn-wmma-f16
  Signed-off-by: Carl Klemm <[email protected]>
  Co-authored-by: Johannes Gäßler <[email protected]>
  Co-authored-by: Ben Jackson <[email protected]>
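A build sketch enabling the new path. Only the GGML_HIP_ROCWMMA_FATTN flag comes from the release note above; the surrounding cmake invocation is an assumption about a typical HIP build and may differ for your toolchain:

```shell
# Assumed HIP build sketch; GGML_HIP_ROCWMMA_FATTN is the flag this release adds.
cmake -B build -DGGML_HIP=ON -DGGML_HIP_ROCWMMA_FATTN=ON
cmake --build build --config Release
```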
b4818
sync : ggml ggml-ci
b4806
tts: add speaker file support (#12048)
  * tts: add speaker file support
  * tts: handle outetts-0.3
  * tts: add new line in error message
  Signed-off-by: dm4 <[email protected]>
  Co-authored-by: Georgi Gerganov <[email protected]>
b4805
test-backend-ops : add option -p to filter by op params (#12155)
b4804
ggml : fix kleidiai build (#12159). The libggml API had changed, but the kleidiai code had not been updated to match.
b4803
Adding UTF-8 support to llama.cpp (#12111). For emojis, non-alpha characters, etc.
  Signed-off-by: Eric Curtin <[email protected]>
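As an aside on why this matters: one user-visible character can span several UTF-8 bytes, so byte-at-a-time input handling can split or mangle emojis and accented letters. A small illustration (plain Python, unrelated to llama.cpp's implementation):

```python
# Illustration only: a single code point may encode to multiple UTF-8 bytes,
# which is what byte-oriented input handling gets wrong.
s = "héllo \N{WAVING HAND SIGN}"
data = s.encode("utf-8")
# 7 code points, but 11 bytes: 'é' encodes to 2 bytes, the emoji to 4.
assert len(s) == 7
assert len(data) == 11
```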
b4801
SYCL: Move CPY kernels to a separate file and add a few missing kernels…
b4800
ggml-backend : keep paths in native string type when possible (#12144)