Skip to content

llama-cli: fix passing chat_template_kwargs and reasoning_format params#25201

Open
percontation wants to merge 1 commit into
ggml-org:masterfrom
percontation:cli_template_kwargs
Open

llama-cli: fix passing chat_template_kwargs and reasoning_format params#25201
percontation wants to merge 1 commit into
ggml-org:masterfrom
percontation:cli_template_kwargs

Conversation

@percontation

Copy link
Copy Markdown
Contributor

Overview

llama-cli and llama-completion will now respect --chat-template-kwargs and --reasoning-format args.

llama-cli was previously accepting these arguments, but ignoring them.

llama-completion was previously accepting & ignoring --reasoning and --reasoning-format, and not accepting --chat-template-kwargs. It now respects all three.

Requirements

llama-cli and llama-completion will now respect --chat-template-kwargs
and --reasoning-format args.
@percontation percontation requested review from a team, ggerganov and ngxson as code owners July 1, 2026 12:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant