Add LLaMA 3.1 and 3.2 tool calling support #5518
Conversation
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 5d6615bc4d
```python
if tokenizer.chat_template in [llama3_1_chat_template, llama3_2_chat_template]:
    tokenizer.response_schema = llama3_schema
```
Handle Llama 3.1 builtin tool-call format separately
`add_response_schema` now assigns `llama3_schema` to both the 3.1 and 3.2 templates, but `llama3_1.jinja` has a builtin-tools path that renders calls as `<|python_tag|>...<|eom_id|>` rather than the JSON form `{"name": ..., "parameters": ...}<|eot_id|>`. Because `llama3_schema` only matches the JSON form, builtin tool outputs are parsed as plain content instead of `tool_calls`, so tool execution is skipped for Llama 3.1 sessions that use `builtin_tools`.
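A minimal sketch of the distinction the review is pointing at. This is not TRL's actual schema code; the regexes and the `parse_llama3_response` helper are illustrative assumptions about how the two output forms could be told apart:

```python
import json
import re

# The two completion forms described above (hypothetical parser, not TRL's):
#   JSON tool call:  {"name": ..., "parameters": ...}<|eot_id|>
#   Builtin tools:   <|python_tag|>...<|eom_id|>   (Llama 3.1 only)
JSON_CALL_RE = re.compile(r'^\{.*\}\s*(?:<\|eot_id\|>)?$', re.DOTALL)
BUILTIN_RE = re.compile(r'^<\|python_tag\|>(?P<body>.*?)(?:<\|eom_id\|>)?$', re.DOTALL)

def parse_llama3_response(text):
    """Classify a raw completion as a builtin call, a JSON tool call, or plain content."""
    stripped = text.strip()
    m = BUILTIN_RE.match(stripped)
    if m:
        # A JSON-only schema would miss this branch and fall through to "content".
        return {"type": "builtin_call", "code": m.group("body")}
    if JSON_CALL_RE.match(stripped):
        payload = json.loads(stripped.removesuffix("<|eot_id|>"))
        return {"type": "tool_call", "name": payload["name"], "arguments": payload["parameters"]}
    return {"type": "content", "text": text}
```

Under this sketch, only the JSON branch yields a `tool_call`, which matches the review's claim that builtin-tool outputs would need a separate pattern to be recognized.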
@qgallouedec it's really cool, but is there a reason to put the templates/schema in TRL and not in the Hub repos?
Because it's very unlikely that models like Llama 3.1 would merge this kind of change. I don't have the link, but I remember that the Qwen team pushed back on adding a generation marker in their template. Response schemas are needed for RL training, so if we just wait for the labs to add them, it's impossible for us to do RL. The idea is to make these features (generation marker and schema) more popular and better tested, and hope they'll see greater adoption in the future.
We should still try! I'll see if I can make PRs and try to get them merged after this PR is merged.
Merging this one with no review. It's been open for a while and nothing critical here.
- `llama3_1.jinja` and `llama3_2.jinja` templates for identity matching in `add_response_schema`
- `TestAddResponseSchema` and `TestParseResponse` test parametrizations

Part of #5460
cc @Rocketknight1
Note
Medium Risk
Changes response parsing behavior based on chat-template identity matching, which could affect tool-call extraction for Llama variants if templates or regex matching diverge from real model outputs.
Overview
Adds Llama 3.1/3.2 tool-calling response parsing by introducing a dedicated `llama3_schema` and wiring it into `add_response_schema` via new identity-matching templates (`llama3_1.jinja`, `llama3_2.jinja`). The schema handles Llama's bare JSON tool-call format by converting `parameters` into standard `arguments`, while reflecting template limitations (single tool call, no content alongside a tool call). Updates the test suite to cover the new tiny Llama 3.1/3.2 fixtures and adjusts expectations/skips for unsupported behaviors (inline `reasoning_content`, multiple tool calls, tool call + content). Documentation for GRPO agent training now lists Llama 3.1 and 3.2 as supported models, and the chat-template README documents the new templates and their constraints.

Reviewed by Cursor Bugbot for commit 9a7b9d9.
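The `parameters`-to-`arguments` conversion the overview mentions can be sketched as follows. This is a hypothetical helper, not TRL's implementation; the exact message shape is an assumption modeled on the common chat-message tool-call convention:

```python
def to_tool_call_message(payload):
    """Normalize a parsed Llama tool call ({"name": ..., "parameters": ...})
    into a chat message using the standard "arguments" key.

    Hypothetical sketch; field names beyond "name"/"parameters" are assumptions.
    """
    return {
        "role": "assistant",
        # Per the template limitations noted above: no content alongside a tool call.
        "content": None,
        "tool_calls": [
            {
                "type": "function",
                "function": {
                    "name": payload["name"],
                    # Llama emits "parameters"; the standard key is "arguments".
                    "arguments": payload["parameters"],
                },
            }
        ],
    }
```

The single-element `tool_calls` list mirrors the stated template limitation that only one tool call can be emitted per turn.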