Refactor LLM functions to use centralized LlmFunctionBindData structure #223

anasdorbani · 2025-12-16T19:43:25Z

Introduces LlmFunctionBindData to centralize model and prompt binding data for all LLM functions. This refactoring improves thread-safety by ensuring each function call creates a fresh Model instance, and simplifies the binding logic by consolidating validation and initialization into shared base classes.

Changes:

Add LlmFunctionBindData structure with thread-safe model creation
Refactor scalar functions (llm_complete, llm_embedding, llm_filter) to use new bind data
Refactor aggregate functions (llm_first_or_last, llm_reduce, llm_rerank) to use new bind data
Add ResolveModelDetailsToJson helper method to model manager
Update all tests to match new function signatures
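
To illustrate the thread-safety point, here is a minimal sketch of the pattern described above, not the code from this PR. The `Model` class below is a hypothetical stand-in, and the field names and the `CreateModel` method are assumptions. Only the idea comes from the description: bind data is resolved once and stays read-only, while each call builds its own `Model`.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>

// Hypothetical stand-in for the extension's Model class. The real class
// talks to a provider; this one only carries per-call mutable state.
class Model {
public:
    explicit Model(std::string details_json)
        : details_json_(std::move(details_json)) {}

    const std::string& Details() const { return details_json_; }

    // Queued requests are mutable state, so they live on the instance.
    void AddRequest(const std::string& /*prompt*/) { pending_ += 1; }
    int PendingRequests() const { return pending_; }

private:
    std::string details_json_;
    int pending_ = 0;
};

// Sketch of a bind-data object (field names are assumptions). It is filled
// once at bind time and only read afterwards, so threads can share it.
struct LlmFunctionBindData {
    std::string model_details_json;
    std::string prompt;

    LlmFunctionBindData(std::string model_json, std::string prompt_text)
        : model_details_json(std::move(model_json)),
          prompt(std::move(prompt_text)) {
        assert(!model_details_json.empty() && "model details must be resolved at bind time");
    }

    // Every call receives a fresh Model, so no request state is shared
    // between concurrent invocations of the same function.
    std::unique_ptr<Model> CreateModel() const {
        return std::make_unique<Model>(model_details_json);
    }
};
```

In this sketch, two calls to `CreateModel()` on the same bind data return distinct objects, so a request queued on one is never visible on the other.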

Copilot

Pull request overview

This PR refactors LLM functions to use a centralized LlmFunctionBindData structure, improving thread-safety and simplifying binding logic. The changes primarily focus on test updates to accommodate new function signatures and add comprehensive test coverage for audio transcription and metrics tracking features.

Key Changes:

Introduced audio transcription support with tests across scalar and aggregate functions
Added metrics tracking functionality with dedicated test suite
Updated model references from llama3 to gemma3:4b for Ollama provider
Enhanced integration test infrastructure with secret management improvements
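
As a rough illustration of how per-call metrics could be merged for aggregate functions (the commit list includes "Add metrics merging for aggregate functions"), the sketch below keeps a few counters and folds one partial state into another. The struct name, its fields, and both methods are assumptions made for this example; the metrics actually tracked by the PR are not listed on this page.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical per-state counters; the real metric names may differ.
struct CallMetrics {
    std::int64_t api_calls = 0;
    std::int64_t input_tokens = 0;
    std::int64_t output_tokens = 0;
    std::int64_t api_duration_us = 0;

    // Record one provider round trip.
    void Record(std::int64_t in_tokens, std::int64_t out_tokens,
                std::int64_t duration_us) {
        assert(in_tokens >= 0 && out_tokens >= 0 && duration_us >= 0);
        api_calls += 1;
        input_tokens += in_tokens;
        output_tokens += out_tokens;
        api_duration_us += duration_us;
    }

    // Fold another partial state into this one, as an aggregate's combine
    // step would do when two partial states are joined.
    void Merge(const CallMetrics& other) {
        api_calls += other.api_calls;
        input_tokens += other.input_tokens;
        output_tokens += other.output_tokens;
        api_duration_us += other.api_duration_us;
    }
};
```

Because every field is a plain sum, merging is associative and order-independent, which is what a combine step needs when partial states arrive in arbitrary order.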

Reviewed changes

Copilot reviewed 99 out of 102 changed files in this pull request and generated no comments.

File	Description
test/unit/prompt_manager/prompt_manager_test.cpp	Added TranscribeAudioColumn test suite with 6 test cases for audio transcription
test/unit/model_manager/model_providers_test.cpp	Added transcription tests and updated Ollama model reference
test/unit/model_manager/model_manager_test.cpp	Updated Ollama model reference to gemma3:4b
test/unit/functions/scalar/metrics_test.cpp	New comprehensive metrics test suite with 28 test cases
test/unit/functions/scalar/llm_function_test_base_instantiations.cpp	Added Ollama secret creation to test setup
test/unit/functions/scalar/llm_filter.cpp	Added audio transcription tests and context-less filter test
test/unit/functions/scalar/llm_embedding.cpp	Updated TEXT to VARCHAR cast
test/unit/functions/scalar/llm_complete.cpp	Added audio transcription tests and VARCHAR cast updates
test/unit/functions/mock_provider.hpp	Added transcription method mocks
test/unit/functions/aggregate/*.cpp	Added audio transcription tests and updated test expectations
test/unit/functions/aggregate/llm_aggregate_function_test_base.hpp	Updated to use mock provider factory pattern
test/integration/src/integration/tests/secret_manager/test_secret_manager.py	Updated to use with_secrets=False parameter
test/integration/src/integration/tests/prompt_parser/test_prompt_parser.py	Updated to use with_secrets=False parameter
test/integration/src/integration/tests/model_parser/test_model_parser.py	Updated to use with_secrets=False parameter
test/integration/src/integration/tests/metrics/test_metrics.py	New comprehensive integration test suite for metrics
test/integration/src/integration/tests/functions/scalar/*.py	Added audio transcription tests and model fixture updates
test/integration/src/integration/tests/functions/aggregate/*.py	Added audio transcription tests and model fixture updates
test/integration/src/integration/setup_test_db.py	Simplified setup by removing secret creation logic
test/integration/src/integration/conftest.py	Added audio file path helper and secret management refactoring
src/registry/scalar.cpp	Registered three new metrics-related scalar functions
src/prompt_manager/prompt_manager.cpp	Added audio transcription support and null-safety improvements
src/model_manager/providers/adapters/*.cpp	Implemented transcription support and enhanced image handling
src/model_manager/model.cpp	Added model details resolution and transcription methods
src/metrics/*.cpp	New metrics tracking implementation
src/include/flock//*.hpp	Updated interfaces for transcription and metrics support

anasdorbani added 30 commits December 2, 2025 11:40

added flock metrics to all the providers

6bbb82c

upgrade gh action to DuckDB 1.4.2

fa553a1

registered the metrics scalar functions

e0d4c1f

added unit tests for the metrics feature

c79dfcd

added integration tests for the metrics feature

5abddab

Removed old metrics wrapper

b90eac7

Updated metrics registry

2b80454

Updated handlers to use MetricsManager

2965d81

Merged scalar and aggregate metrics tests

19ea07b

Updated metrics CMakeLists

12c2605

Added merged metrics integration tests

7934bad

Fixed code formatting

4de6e21

Fixed include in llm_complete

03f58a4

Replaced old FlockMetrics API call

d86c4c2

Add missing metrics tracking to llm_complete function

5b7d511

Update test prompts to ensure 1-2 word responses for faster, more predictable tests

12675b2

Remove legacy MetricsContext class (replaced by MetricsManager)

c66bb3a

Centralized shared standard library includes in common.hpp

7aab760

Add metrics merging for aggregate functions

d0325e8

Add tests for metrics merging

4ad6d1c

Added URLHandler class for file download and validation utilities

ec3824a

Refactored ExecuteBatch to use RequestType enum for unified request handling

50d3189

Implemented ExtractTranscriptionOutput for OpenAI, Azure, and Ollama handlers

42264eb

Added AddTranscriptionRequest implementation for OpenAI, Azure, and Ollama providers

17a6192

Added transcription request methods to IProvider interface and Model class

c0adbc3

Added audio transcription support to prompt manager and input parser

f4be1e2

Added transcription mock methods and OLLAMA secret to test base classes

1b280c5

Added unit tests for audio transcription in llm_complete and llm_filter

15537ff

Added unit tests for audio transcription in aggregate LLM functions

2a3da64

Added unit tests for transcription in model provider adapters

cd2d869

anasdorbani added 28 commits December 10, 2025 11:40

Added integration tests for audio transcription in scalar LLM functions

72aa288

Added integration tests for audio transcription in aggregate LLM functions

fc1d297

Updated unit test database with audio transcription test data

893be86

Added unit tests for TranscribeAudioColumn and made it public for testing

64058f3

Added base64 encoding and regex URL detection to URL handler

ff53e8f

Improved null safety and type checking in prompt manager

8ae1f43

Enhanced error handling and null checks in base handler

29ada0b

Improved aggregate state initialization and metadata handling

e918a81

Added null checks for transcription output in Azure and OpenAI handlers

285ab8d

Updated Ollama handler to use chat API and improved response parsing

1e61924

Updated scalar functions to use unique ID generation for metrics tracking

45cf50f

Updated integration test configuration and database setup

6f02270

Updated integration tests for aggregate LLM functions

58b827b

Updated integration tests for scalar LLM functions

508c1a3

Updated integration tests for metrics, parsers, and secret manager

4a7a5fe

Updated unit tests for scalar LLM functions

9a4fe20

Updated unit tests for model manager and providers

85431b8

Updated unit test database with latest test data

ba54ffa

Added audio test file for integration tests

a171c73

Updated temp file location to use flock storage directory instead of system temp

4c158a3

Fix llm_filter to work without context_columns parameter

9095a07

Add unit and integration tests for llm_filter without context_columns

25036f1

Refactored storage attachment with RAII guard and retry mechanism

96d1794

Add LlmFunctionBindData structure and input parsing utilities

a7f7dac

Add ResolveModelDetailsToJson and mock provider factory to model manager

c0d644f

Refactor scalar functions to use LlmFunctionBindData structure

1b17b8c

Refactor aggregate functions to use LlmFunctionBindData structure

b00cc49

Update tests to match new function signatures

990fc7f
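
One commit above mentions an RAII guard with a retry mechanism for storage attachment. The following is a generic sketch of that pattern, not the implementation from this PR. The class name `AttachGuard`, the callback-based interface, and the retry defaults are all assumptions chosen to keep the example self-contained.

```cpp
#include <cassert>
#include <chrono>
#include <functional>
#include <stdexcept>
#include <thread>
#include <utility>

// Hypothetical guard: tries to attach a resource a bounded number of times
// and detaches it automatically when the guard leaves scope.
class AttachGuard {
public:
    AttachGuard(std::function<bool()> attach, std::function<void()> detach,
                int max_attempts = 3,
                std::chrono::milliseconds delay = std::chrono::milliseconds(0))
        : detach_(std::move(detach)) {
        assert(max_attempts > 0);
        for (int attempt = 1; attempt <= max_attempts; ++attempt) {
            if (attach()) {
                attached_ = true;
                return;
            }
            // Wait before the next attempt, but not after the last one.
            if (attempt < max_attempts) {
                std::this_thread::sleep_for(delay);
            }
        }
        // The destructor does not run for a guard that failed to construct,
        // so detach is never called for a resource that was never attached.
        throw std::runtime_error("attach failed after retries");
    }

    ~AttachGuard() {
        if (attached_) {
            detach_();  // the detach callback must not throw
        }
    }

    // Non-copyable: exactly one guard owns the attachment.
    AttachGuard(const AttachGuard&) = delete;
    AttachGuard& operator=(const AttachGuard&) = delete;

private:
    std::function<void()> detach_;
    bool attached_ = false;
};
```

The point of the guard is that detachment happens on every exit path, including exceptions thrown while the storage is attached, without each caller having to remember a cleanup call.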

Copilot AI review requested due to automatic review settings December 16, 2025 19:43

Copilot AI reviewed Dec 16, 2025
