
Feature Request: introduce Tool Call API in server mode #9031

Closed
4 tasks done
tybalex opened this issue Aug 14, 2024 · 5 comments · May be fixed by #9254
Labels
enhancement New feature or request stale

Comments

@tybalex

tybalex commented Aug 14, 2024

Prerequisites

  • I am running the latest code. Mention the version if possible as well.
  • I carefully followed the README.md.
  • I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  • I reviewed the Discussions, and have a new and useful enhancement to share.

Feature Description

In server mode, users should be able to use the OAI API for tool calling, just as they do for tool calling with gpt-4. Today this is not supported yet: https://github.com/ggerganov/llama.cpp/blob/master/examples/server/utils.hpp#L394
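For context, a minimal sketch of the OpenAI-compatible request shape the server would need to accept. Field names follow the public OpenAI chat completions API; the model name and the `get_weather` function are illustrative placeholders, not part of any existing llama.cpp API:

```python
import json

# Sketch of an OpenAI-compatible /v1/chat/completions request body with tools.
# The model name and the get_weather function are hypothetical examples.
request_body = {
    "model": "llama-3-8b-instruct",  # placeholder model name
    "messages": [
        {"role": "user", "content": "What's the weather in Paris?"}
    ],
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_weather",  # hypothetical tool
                "description": "Get the current weather for a city",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }
    ],
    "tool_choice": "auto",
}

print(json.dumps(request_body, indent=2))
```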

Motivation

There are more and more open-source small models (7B, 30B, 70B) out there that support tool calling. Is it possible for llama.cpp to start supporting those? I also created my own tool calling models: https://huggingface.co/rubra-ai/Meta-Llama-3-8B-Instruct-GGUF, but they require extra preprocessing and postprocessing steps to handle function calling requests, so I had to create a fork, tools.cpp, and implement custom logic.

I wonder if it is possible to create something like a standard tool-calling template that tool-calling models can follow. Basically it needs to cover 3 things:

  1. Convert the model's tool-calling output to the OAI JSON format.
  2. Convert OAI-format input function definitions into a proper system prompt for the local model.
  3. Convert OAI-format input chat messages with role tool_call or tools into a format that a tool-calling model can support.
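Step 1 above can be sketched as follows. The `<tool_call>...</tool_call>` tag format is an assumption for illustration — the actual markers vary per model, which is exactly why a per-model template layer is being requested:

```python
import json
import re
import uuid

# Sketch of step 1: turning raw model output into the OpenAI tool_calls shape.
# The <tool_call>...</tool_call> wrapper is a hypothetical model convention;
# real models each use their own markers.
TOOL_CALL_RE = re.compile(r"<tool_call>\s*(\{.*?\})\s*</tool_call>", re.DOTALL)

def parse_tool_calls(model_output: str) -> list:
    calls = []
    for match in TOOL_CALL_RE.finditer(model_output):
        call = json.loads(match.group(1))
        calls.append({
            "id": f"call_{uuid.uuid4().hex[:8]}",
            "type": "function",
            "function": {
                "name": call["name"],
                # OpenAI returns arguments as a JSON-encoded string
                "arguments": json.dumps(call.get("arguments", {})),
            },
        })
    return calls

raw = '<tool_call>{"name": "get_weather", "arguments": {"city": "Paris"}}</tool_call>'
print(parse_tool_calls(raw))
```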

Possible Implementation

No response

@tybalex tybalex added the enhancement New feature or request label Aug 14, 2024
@qnixsynapse
Contributor

cc: @ngxson

@ngxson
Collaborator

ngxson commented Aug 16, 2024

Tool calls are not just a chat template, so it's not trivial to implement. The problem is that it must be done per model. I think we can build on the same infrastructure provided by #5695

@segmond

segmond commented Aug 27, 2024

Tool calling is no longer optional or a fancy thing to have. An LLM without tool calling is much less useful. I look forward to this.

@crashr
Contributor

crashr commented Aug 28, 2024

Exactly what @segmond wrote.

This issue was closed because it has been inactive for 14 days since being marked as stale.
