fix(go): defer template rendering in LoadPrompt to execution time #3925

Zereker · 2025-12-11T12:28:03Z

Summary

Fix template variable substitution bug in LoadPrompt where variables were replaced with empty values at load time
Defer template rendering to execution time using WithMessagesFn
Add convertDotpromptMessages helper function
Add regression test TestLoadPromptTemplateVariableSubstitution

Problem

When using LoadPrompt to load .prompt files, the template was rendered at load time with an empty DataArgument. This caused all template variables (like {{name}}, {{topic}}, etc.) to be replaced with empty values immediately.

As a result, subsequent calls to Execute() or Render() with actual input values had no effect - the template was already "baked" with empty values.

Example

// greeting.prompt content:
// Hello {{name}}, welcome to {{place}}!

prompt := genkit.LookupPrompt(g, "greeting")

// BUG: Variables not substituted!
result, _ := prompt.Execute(ctx, ai.WithInput(map[string]any{
    "name":  "Alice",
    "place": "Wonderland",
}))
// Expected: "Hello Alice, welcome to Wonderland!"
// Actual: "Hello , welcome to !"  (empty values)

Solution

Defer template rendering to execution time by using WithMessagesFn. The closure:

Captures the raw template text at load time
Compiles and renders the template with actual input values at execution time
Properly handles multi-role messages (<<<dotprompt:role:XXX>>> markers)
Properly handles history insertion (<<<dotprompt:history>>> markers)

Test Plan

Added TestLoadPromptTemplateVariableSubstitution regression test
Verified TestMultiMessagesRenderPrompt still passes (multi-role support)
All existing ai package tests pass

Fixes #3924

hugoaguirre · 2025-12-15T20:25:16Z

Hi @Zereker
Thanks for both of your contributions (here and Dotprompt). I'll take a look at them.

We are making improvements in the core which are causing merge conflicts with your contribution. Would it be possible if you address the conflicts?

Zereker · 2025-12-16T03:41:55Z

Hi @hugoaguirre, I've rebased on the latest main and resolved the conflicts. Ready for review!

hugoaguirre · 2025-12-16T21:14:47Z

Hi @Zereker, I've tried to reproduce the issue with the latest changes in main and I was able to see the prompt rendering correctly. This is the code that I used to reproduce the issue:

greeting.prompt contents:

---
description: "A greeting prompt with variables"
---
Hello {{name}}, welcome to {{place}}!

Genkit code:

func PromptFromZereker(ctx context.Context, g *genkit.Genkit) {
	prompt := genkit.LoadPrompt(g, "./prompts/greeting.prompt", "greetings")
	if prompt == nil {
		log.Fatal("empty prompt")
	}

	resp, err := prompt.Execute(ctx,
		ai.WithInput(map[string]any{
			"name":  "Alice",
			"place": "Wonderland",
		}))
	if err != nil {
		log.Fatalf("error executing prompt: %v", err)
	}
	fmt.Printf("request: %#v\n", resp.Request.Messages[0].Text())
	log.Print(resp.Text())
}

Output:

request: "Hello Alice, welcome to Wonderland!"
2025/12/16 21:11:11 Thank you for the warm welcome! What an intriguing place to find myself. I'm already feeling a delightful sense of wonder and perhaps a touch of delightful confusion, which I hear is quite common here.

So, tell me, where shall our adventure begin? Are there any White Rabbits I should follow, or perhaps a curious riddle to solve? I'm quite ready for whatever Wonderland has in store!

Could you point me in the right direction to reproduce the issue you are reporting?

You can copy paste this sample in go/samples/prompts/main.go and run it

Zereker · 2025-12-17T03:18:46Z

Hi @hugoaguirre,

Thanks for testing! I found that the {{role "system"}} Handlebars syntax is already used in the codebase:

go/samples/prompts/prompts/multi-msg.prompt
go/samples/coffee-shop/main.go
go/ai/prompt_test.go (TestMultiMessagesPrompt)

However, there's an issue with how this syntax is handled in LoadPrompt.

How to Reproduce

Add this test to go/ai/prompt_test.go:

func TestHandlebarsRoleMarkers(t *testing.T) {
    tempDir := t.TempDir()
    mockPromptFile := filepath.Join(tempDir, "test.prompt")
    content := `---
model: test/chat
---
{{role "system"}}
You are a helpful assistant.

{{role "user"}}
Hello {{name}}, welcome to {{place}}!
`
    if err := os.WriteFile(mockPromptFile, []byte(content), 0644); err != nil {
        t.Fatal(err)
    }

    prompt := LoadPrompt(registry.New(), tempDir, "test.prompt", "test")
    opts, err := prompt.Render(context.Background(), map[string]any{
        "name":  "Alice",
        "place": "Wonderland",
    })
    if err != nil {
        t.Fatal(err)
    }

    // Verify messages are correctly separated
    if len(opts.Messages) != 2 {
        t.Errorf("Expected 2 messages, got %d", len(opts.Messages))
    }
    if opts.Messages[0].Role != RoleSystem {
        t.Errorf("Expected first message to be system role")
    }
}

Expected:

Messages: 2
[0] Role: system, Text: "You are a helpful assistant."
[1] Role: user, Text: "Hello Alice, welcome to Wonderland!"

Actual (on main):

Messages: 1
[0] Role: user, Text: "\nYou are a helpful assistant.\n\n\nHello Alice, welcome to Wonderland!"

Comparison

The existing test TestMultiMessagesRenderPrompt uses <<<dotprompt:role:system>>> format (internal markers), which works correctly. But the Handlebars syntax {{role "system"}} that users write does not work properly - all roles are merged into a single user message.

Format	Messages	Result
`<<<dotprompt:role:system>>>`	2	✅ Correctly separated
`{{role "system"}}`	1	❌ Merged into single user message

Root Cause

In LoadPrompt, line 711 calls:

dpMessages, err := dotprompt.ToMessages(parsedPrompt.Template, &dotprompt.DataArgument{})

This renders the template at load time with an empty DataArgument, before the Handlebars {{role "..."}} syntax is processed into internal markers.

My Fix

My fix defers template rendering to execution time by using WithMessagesFn:

Compile the template at load time (but don't render)
Render at execution time with actual input values via WithMessagesFn closure
Convert the rendered dotprompt.Message list to ai.Message with correct roles

This ensures both template variables ({{name}}) and role markers ({{role "system"}}) are properly processed.

hugoaguirre · 2025-12-19T21:00:07Z

Hi @Zereker,
Thanks for the clarification. I'll make some internal validations and will get back to you.

mcicoria · 2026-01-02T19:48:23Z

Note: This is resolved by syncing to the latest version of the main branch at cd3835a but I'm leaving it here in case it's helpful

I'm bumping this rather than reporting a separate issue. This should resolve another issue which I don't see explicitly reported where rendering a prompt multiple times with different inputs does not update the rendered template after the first input.

How to Reproduce

// TestPromptMultipleRenders tests that the prompt correctly handles
// multiple sequential renders with different inputs, ensuring each render uses
// the correct input value.
func TestPromptMultipleRenders(t *testing.T) {
	ctx := context.Background()

	tempDir := t.TempDir()
	mockPromptFile := filepath.Join(tempDir, "test.prompt")
	content := `---
model: test/chat
input:
  schema:
    input: string
---
Here is the input: {{input}}
`
	if err := os.WriteFile(mockPromptFile, []byte(content), 0644); err != nil {
		t.Fatal(err)
	}

	g := genkit.Init(ctx, genkit.WithPromptDir(tempDir))

	prompt := genkit.LookupPrompt(g, "test")
	if prompt == nil {
		t.Fatal("Prompt 'test.prompt' not found")
	}

	// Test multiple sequential renders with different inputs
	inputs := []string{
		"input-test-abc-1",
		"input-test-def-2",
		"input-test-ghi-3",
	}

	for i, input := range inputs {
		inputMap := map[string]any{
			"input": input,
		}

		actionOpts, err := prompt.Render(ctx, inputMap)
		if err != nil {
			t.Fatalf("Failed to render prompt with input %d (%q): %v", i+1, input, err)
		}

		if actionOpts == nil {
			t.Fatalf("Render() returned nil action options for input %d", i+1)
		}

		if len(actionOpts.Messages) == 0 {
			t.Fatalf("Render() returned no messages for input %d", i+1)
		}

		// Collect all text
		var renderedText strings.Builder
		for _, msg := range actionOpts.Messages {
			for _, part := range msg.Content {
				if part.IsText() {
					renderedText.WriteString(part.Text)
					renderedText.WriteString(" ")
				}
			}
		}
		text := renderedText.String()

		// Verify current input appears
		if !strings.Contains(text, input) {
			t.Errorf("Input %d (%q) not found in render %d. Text snippet: %q", i+1, input, i+1, text[:min(200, len(text))])
		}

		// Verify previous inputs do NOT appear
		for j, prevInput := range inputs {
			if j < i && strings.Contains(text, prevInput) {
				t.Errorf("BUG: Previous input %d (%q) found in render %d when it should only contain input %d (%q).",
					j+1, prevInput, i+1, i+1, input)
			}
		}
	}
}

Expected:

[0] Here is the input: input-test-abc-1
[1] Here is the input: input-test-def-2
[2] Here is the input: input-test-ghi-3

Actual:

[0] Here is the input: input-test-abc-1
[1] Here is the input: input-test-abc-1
[2] Here is the input: input-test-abc-1

Zereker · 2026-01-04T03:06:11Z

Hi @mcicoria, thanks for bumping this and providing the detailed reproduction case!

The issue you reported (multiple renders always using the first input) is caused by a template sharing bug in the dotprompt Compile() method. I've already fixed this in a separate PR: google/dotprompt#363 (merged on 2025-12-29).

After updating genkit to use the latest dotprompt version (v0.0.0-20251229072418-d79986469d4c), your test case passes:

=== RUN   TestPromptMultipleRenders
--- PASS: TestPromptMultipleRenders (0.00s)

Note: This PR (#3925) addresses a different issue — the {{role "..."}} Handlebars syntax not being processed correctly in LoadPrompt, where all messages get merged into a single user message instead of being properly separated by role.

Add TestLoadPromptTemplateVariableSubstitution to verify that template variables are correctly substituted with actual input values at execution time, not load time. This test covers: 1. Single role prompts with template variables 2. Multi-role prompts (system + user) with template variables 3. Verification that consecutive renders use their own input values Related to firebase#3924 (fixed by firebase#4035)

Zereker · 2026-01-08T11:57:42Z

Update: Rebased and Simplified

This PR has been rebased on the latest main branch. Since PR #4035 has already fixed the core issue (deferring template rendering to execution time), this PR now only adds a regression test.

Why This Test is Still Valuable

The existing tests in #4035 verify that:

Multi-role messages work correctly (TestMultiMessagesPrompt)
Template variables are substituted (TestDeferredSchemaResolution, TestDataPromptExecute)

However, none of them test the core scenario of issue #3924:

What happens when the same prompt is rendered multiple times with different input values?

Our TestLoadPromptTemplateVariableSubstitution test specifically covers this:

// First render
actionOpts1, _ := prompt.Render(ctx, map[string]any{"name": "Alice", "place": "Wonderland"})

// Second render with DIFFERENT values
actionOpts2, _ := prompt.Render(ctx, map[string]any{"name": "Bob", "place": "Paradise"})

// Critical assertion: second render must NOT contain first render's values
if strings.Contains(text2, "Alice") {
    t.Errorf("BUG: Second render contains 'Alice' from first input!")
}

This test ensures that if someone accidentally regresses the fix (e.g., pre-rendering templates at load time), the test will catch it immediately.

Test Coverage

Sub-test	Scenario
`single role`	Template variables + consecutive renders + isolation verification
`multi role`	Multi-role + template variables + consecutive renders + isolation verification

I believe this regression test adds value as a safeguard against future regressions of issue #3924.

Zereker · 2026-01-08T12:47:42Z

@hugoaguirre Could you please approve the workflow runs when you have a chance? This PR adds a regression test for the template variable substitution fix. Thanks!

github-project-automation bot added this to Genkit Backlog Dec 11, 2025

github-actions bot added go fix labels Dec 11, 2025

Zereker force-pushed the fix/go-loadprompt-template-rendering branch from 3b4f001 to bf10054 Compare December 11, 2025 14:46

hugoaguirre self-requested a review December 15, 2025 20:23

Zereker force-pushed the fix/go-loadprompt-template-rendering branch from bf10054 to 64ef676 Compare December 16, 2025 03:41

Zereker force-pushed the fix/go-loadprompt-template-rendering branch from 64ef676 to 24977e3 Compare January 8, 2026 11:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(go): defer template rendering in LoadPrompt to execution time #3925

fix(go): defer template rendering in LoadPrompt to execution time #3925

Uh oh!

Zereker commented Dec 11, 2025

Uh oh!

hugoaguirre commented Dec 15, 2025

Uh oh!

Zereker commented Dec 16, 2025

Uh oh!

hugoaguirre commented Dec 16, 2025 •

edited

Loading

Uh oh!

Zereker commented Dec 17, 2025

Uh oh!

hugoaguirre commented Dec 19, 2025

Uh oh!

mcicoria commented Jan 2, 2026 •

edited

Loading

Uh oh!

Zereker commented Jan 4, 2026

Uh oh!

Zereker commented Jan 8, 2026

Uh oh!

Zereker commented Jan 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix(go): defer template rendering in LoadPrompt to execution time #3925

Are you sure you want to change the base?

fix(go): defer template rendering in LoadPrompt to execution time #3925

Uh oh!

Conversation

Zereker commented Dec 11, 2025

Summary

Problem

Example

Solution

Test Plan

Uh oh!

hugoaguirre commented Dec 15, 2025

Uh oh!

Zereker commented Dec 16, 2025

Uh oh!

hugoaguirre commented Dec 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Zereker commented Dec 17, 2025

How to Reproduce

Comparison

Root Cause

My Fix

Uh oh!

hugoaguirre commented Dec 19, 2025

Uh oh!

mcicoria commented Jan 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

How to Reproduce

Uh oh!

Zereker commented Jan 4, 2026

Uh oh!

Zereker commented Jan 8, 2026

Update: Rebased and Simplified

Why This Test is Still Valuable

Test Coverage

Uh oh!

Zereker commented Jan 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

hugoaguirre commented Dec 16, 2025 •

edited

Loading

mcicoria commented Jan 2, 2026 •

edited

Loading