fix: preserve tool-call context in tokenization #527

corbt · 2026-01-21T18:51:08Z

Summary

Avoid dropping tool calls from assistant messages in context by only inserting sentinels for trainable assistant messages.
Preserve tool_calls in the chat template for trainable assistant messages and fail fast if a message carrying tool_calls would be tokenized via content-only splicing.
Proposal: drop support for allow_training_without_logprobs (importance sampling needs logprobs; this mode complicates paths and enables subtle tool-call bugs).
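To illustrate the first two points, here is a minimal sketch of the intended behavior, not the code in this PR. The names `SENTINEL`, `build_template_messages`, and `check_content_only_splice`, and the use of a set of trainable message indices, are all assumptions made for the example.

```python
from typing import Any

# Hypothetical placeholder; the project's real sentinel value may differ.
SENTINEL = "\u0001"


def build_template_messages(
    messages: list[dict[str, Any]], trainable: set[int]
) -> list[dict[str, Any]]:
    """Swap content for a sentinel only on trainable assistant messages."""
    out: list[dict[str, Any]] = []
    for i, message in enumerate(messages):
        templated = dict(message)  # shallow copy keeps tool_calls and other keys
        if message.get("role") == "assistant" and i in trainable:
            templated["content"] = SENTINEL
        # Non-trainable messages pass through unchanged, tool calls included.
        out.append(templated)
    return out


def check_content_only_splice(message: dict[str, Any]) -> None:
    """Fail fast if splicing only the content would discard tool calls."""
    if message.get("tool_calls"):
        raise ValueError("tool_calls would be dropped by content-only splicing")
```

In this sketch an assistant message that only serves as context keeps both its content and its tool_calls, so the chat template still renders the tool call that a later tool message responds to.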

Test plan

Not run (tokenization-only change)


fix: preserve tool-call context in tokenization

2d329a2

Only splice trainable assistant spans and keep tool_calls in the template; error if tool_calls would be dropped.

bradhilton approved these changes Jan 21, 2026


Fix type errors in tool-call tokenization

25ffbaf

- Use .get() instead of direct [] access for tool_calls to handle message types that don't have this key
- Cast message to dict[str, Any] when appending to token_template_messages
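The two type fixes can be pictured roughly as follows. This is a sketch under assumptions, not the committed code: `has_tool_calls` and `append_template_message` are hypothetical helpers, and only `token_template_messages` is a name taken from the commit message.

```python
from typing import Any, Mapping, cast

token_template_messages: list[dict[str, Any]] = []


def has_tool_calls(message: Mapping[str, Any]) -> bool:
    # .get() returns None for message types without a "tool_calls" key,
    # where message["tool_calls"] would raise KeyError.
    return bool(message.get("tool_calls"))


def append_template_message(message: Mapping[str, Any]) -> None:
    # cast() only informs the type checker; it does not copy or convert.
    token_template_messages.append(cast(dict[str, Any], message))
```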

corbt mentioned this pull request Jan 21, 2026

RFC: Deprecate allow_training_without_logprobs option #528

Open

corbt merged commit b220140 into main Jan 21, 2026
2 checks passed


corbt commented Jan 21, 2026 • edited by vivekkalyan
