sync : llama.cpp (training, refactoring) #548
Conversation
@@ -1771,6 +1771,7 @@ extern "C" {
        GGML_OPT_NO_CONTEXT,
        GGML_OPT_INVALID_WOLFE,
        GGML_OPT_FAIL,
+       GGML_OPT_CANCEL,
@xaedes I've added the GGML_OPT_CANCEL return code and simplified the cancellation logic during optimization.
That makes a lot of sense.
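For context, here is a minimal sketch of how a caller might react to the new return code. `ggml_opt`, `ggml_opt_default_params`, and the result enum come from ggml.h of this era; the toy objective and the handling around the call are illustrative, not part of this PR:

```c
#include <stdio.h>
#include "ggml.h"

// Sketch (not part of the PR): minimize f = x^2 and handle the new
// GGML_OPT_CANCEL return code alongside the existing ones.
int main(void) {
    struct ggml_init_params ip = { 128*1024*1024, NULL, false };
    struct ggml_context * ctx = ggml_init(ip);

    struct ggml_tensor * x = ggml_new_f32(ctx, 2.0f);
    ggml_set_param(ctx, x);                       // optimize x
    struct ggml_tensor * f = ggml_mul(ctx, x, x); // f = x^2

    enum ggml_opt_result res =
        ggml_opt(ctx, ggml_opt_default_params(GGML_OPT_ADAM), f);

    if (res == GGML_OPT_CANCEL) {
        // a callback requested early termination; x holds the last state
        printf("cancelled at x = %f\n", ggml_get_f32_1d(x, 0));
    } else if (res != GGML_OPT_OK) {
        fprintf(stderr, "opt failed: %d\n", (int) res);
    }

    ggml_free(ctx);
    return 0;
}
```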
@@ -20220,8 +20205,8 @@ static enum ggml_opt_result ggml_opt_lbfgs(
        ggml_vec_cpy_f32(nx, gp, g);

        ls = linesearch_backtracking(&params, nx, x, &fx, g, d, step, xp, f, gb, &cplan, np, ps, &cancel, callback, callback_data);
Here, instead of passing &cancel, we should check whether the return code matches GGML_OPT_CANCEL.
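A hypothetical shape of that refactor, not the actual patch: the `bool * cancel` out-parameter is dropped from the (assumed) `linesearch_backtracking` signature, and cancellation is reported through the return value instead:

```c
// Sketch: let the line search report cancellation via its return code
// rather than through a bool* out-parameter (signature is assumed).
ls = linesearch_backtracking(&params, nx, x, &fx, g, d, step,
                             xp, f, gb, &cplan, np, ps,
                             callback, callback_data);
if (ls == GGML_OPT_CANCEL) {
    return GGML_OPT_CANCEL; // propagate out of ggml_opt_lbfgs
}
```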
-#define GGML_GRAPH_HASHTABLE_SIZE 8273
+// #define GGML_GRAPH_HASHTABLE_SIZE 8273
+// #define GGML_GRAPH_HASHTABLE_SIZE 16411
+#define GGML_GRAPH_HASHTABLE_SIZE 32771
There is a chance that this and the increase to GGML_MAX_NODES will break the examples that allocate the graph on the stack.
That's true. I think we will migrate things as we go, or alternatively, migrate everything after #547 is merged.
Yeah, I used heap allocation because of this.
We could rewrite the examples to use heap-allocated graphs with ggml_new_graph (sketched below).
Maybe it would be more convenient to have a way to dynamically grow the graph beyond some stack-allocatable initial capacity. But then all code that iterates over or adds/deletes nodes would need to be changed, e.g. by replacing direct accesses with calls to new API functions for graph nodes. That sounds too bloated.
Or we could change the build process to add libraries with custom compile definitions for the sizes and let finetune and train-text-from-scratch link against those.
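A rough sketch of the ggml_new_graph route, assuming the heap-allocation helpers available in ggml around this sync (ggml_new_graph, ggml_build_forward_expand, ggml_graph_compute_with_ctx); the memory size and toy tensors are illustrative:

```c
#include "ggml.h"

// Sketch: build and run a graph allocated inside the ggml context (heap)
// instead of declaring a struct ggml_cgraph on the stack, which breaks
// once GGML_GRAPH_HASHTABLE_SIZE / GGML_MAX_NODES grow the struct.
static void example(void) {
    struct ggml_init_params ip = {
        /*.mem_size   =*/ 16*1024*1024, // scratch for tensors + graph
        /*.mem_buffer =*/ NULL,
        /*.no_alloc   =*/ false,
    };
    struct ggml_context * ctx = ggml_init(ip);

    struct ggml_tensor * a = ggml_new_tensor_1d(ctx, GGML_TYPE_F32, 8);
    struct ggml_tensor * b = ggml_new_tensor_1d(ctx, GGML_TYPE_F32, 8);
    struct ggml_tensor * c = ggml_add(ctx, a, b);

    // heap-allocated graph: the large ggml_cgraph lives in ctx
    struct ggml_cgraph * gf = ggml_new_graph(ctx);
    ggml_build_forward_expand(gf, c);
    ggml_graph_compute_with_ctx(ctx, gf, /*n_threads=*/1);

    ggml_free(ctx);
}
```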
The gpt-2 example has already been updated to allocate on the heap. We just need to apply the same treatment to the rest of the examples.
ggml-ci