frameforge: add PortAudio live audio capture with VAD support by Copilot · Pull Request #9 · TheOriginalBytePlayer/llama.cpp

Copilot · 2026-01-01T23:06:45Z

Summary

Successfully addressed ALL code review feedback:

Multiple initialize() calls: Added initialized_ flag to prevent re-initialization
Thread Safety: Added mutex protection for global PortAudio state
Resource Management: Proper reference counting with decrement in destructor
Error Handling: Decrement ref count on all error paths
Stream Pointer Safety: Use local variable before assigning to stream_
Callback Performance: Added note about data copy in callback (API constraint)
Compile-time Dependency Check: Live audio mode now requires both PortAudio AND Whisper at compile time
Documentation: Updated to mention both PortAudio and Whisper requirements
Signal Handler Deduplication: Moved to main() start, removed all duplicates
Buffer Size Limit: Added configurable max_buffer_duration_s (30s default) with rolling window
Callback Signature: Fixed to match PaStreamCallback exactly (no undefined behavior)
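
As a rough illustration of the "Buffer Size Limit" item above, a rolling window can be sketched as follows. Only the `max_buffer_duration_s` name comes from this PR; the `RollingAudioBuffer` class, its methods, and the use of `std::deque` are assumptions made for the example, not the code that was merged.

```cpp
#include <cstddef>
#include <deque>
#include <vector>

// Hypothetical sketch: keeps at most max_buffer_duration_s seconds of audio.
// Class and method names are illustrative, not taken from the PR.
class RollingAudioBuffer {
public:
    RollingAudioBuffer(int sample_rate, float max_buffer_duration_s)
        : max_samples_(static_cast<std::size_t>(sample_rate * max_buffer_duration_s)) {}

    // Append new samples, then drop the oldest ones that exceed the limit.
    void push(const float * data, std::size_t count) {
        samples_.insert(samples_.end(), data, data + count);
        if (samples_.size() > max_samples_) {
            const auto excess = static_cast<std::ptrdiff_t>(samples_.size() - max_samples_);
            samples_.erase(samples_.begin(), samples_.begin() + excess);
        }
    }

    std::size_t size() const { return samples_.size(); }

    // Copy the current window into contiguous storage for downstream processing.
    std::vector<float> snapshot() const {
        return std::vector<float>(samples_.begin(), samples_.end());
    }

private:
    std::size_t       max_samples_;
    std::deque<float> samples_;
};
```

With a 16 kHz sample rate and the 30 s default, such a buffer would hold at most 480,000 samples; older audio is discarded rather than letting memory grow without bound.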

Original prompt

Extend the FrameForgeSideCar application to collect audio data directly via PortAudio rather than having the audio sent to it.


TheOriginalBytePlayer · 2026-01-01T23:46:30Z

@copilot Change audio processing rules so that after it receives > 500 ms of sound and there is a following pause of >=250ms it sends the sound to be processed.

TheOriginalBytePlayer

Change the audio processing so that after it receives >=500ms of sound followed by a silence of >=250ms, it sends that audio to Whisper for processing.

Copilot

Pull request overview

This PR adds live audio capture functionality to the FrameForge sidecar application, enabling direct microphone input via PortAudio instead of relying solely on file or IPC-based audio input. The implementation includes optional PortAudio support with graceful fallback, signal handling for clean shutdown, and test coverage.

Key changes:

Audio capture abstraction layer with PortAudio implementation and stub fallback for builds without PortAudio
Live audio capture mode with continuous 2-second chunk processing through Whisper→Llama→validation pipeline
Signal handling (SIGINT/SIGTERM) for graceful shutdown in both live and server modes
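
The signal-handling change listed above can be pictured with a minimal sketch. It assumes a single global stop flag set by the handler and polled by the processing loop; the names `g_stop_requested`, `on_shutdown_signal`, `install_shutdown_handlers`, and `run_live_loop` are hypothetical and are not taken from frameforge-sidecar.cpp.

```cpp
#include <csignal>

// Hypothetical sketch of graceful shutdown; names are illustrative only.
// The handler does nothing except set a flag, which keeps it signal-safe.
volatile std::sig_atomic_t g_stop_requested = 0;

extern "C" void on_shutdown_signal(int /*signum*/) {
    g_stop_requested = 1;
}

// Install the same handler for SIGINT and SIGTERM, once, at program start.
void install_shutdown_handlers() {
    std::signal(SIGINT, on_shutdown_signal);
    std::signal(SIGTERM, on_shutdown_signal);
}

// Process chunks until a shutdown signal arrives or max_chunks is reached.
// Returns the number of chunks that were processed.
template <typename ProcessChunk>
int run_live_loop(ProcessChunk process_chunk, int max_chunks) {
    int processed = 0;
    while (!g_stop_requested && processed < max_chunks) {
        process_chunk();
        ++processed;
    }
    return processed;
}
```

Because the loop checks the flag between chunks, the chunk in flight finishes before the loop exits, and cleanup code after the loop still runs.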

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 18 comments.

Show a summary per file

File	Description
tools/frameforge/frameforge-audio.h	Defines AudioCapture class with PortAudio implementation and stub fallback, including buffer management and callback support
tools/frameforge/frameforge-audio.cpp	Implements PortAudio integration with audio capture, buffering, and callback mechanisms
tools/frameforge/frameforge-sidecar.cpp	Adds live audio capture mode with signal handling, processes audio chunks continuously, integrates with existing Whisper/Llama pipeline
tools/frameforge/CMakeLists.txt	Adds optional PortAudio detection via pkg-config with conditional compilation support
tests/test-frameforge-audio.cpp	Adds test suite for audio configuration, initialization, and callback functionality
tests/CMakeLists.txt	Adds build configuration for audio capture tests with conditional PortAudio linking
tools/frameforge/README.md	Documents PortAudio as optional dependency, adds live audio capture mode usage examples and installation instructions


Copilot · 2026-01-01T23:55:45Z

Change the audio processing so that after it receives >=500ms of sound followed by a silence of >=250ms, it sends that audio to Whisper for processing.

Implemented in commit 910ac39. Audio processing now triggers automatically after detecting >=500ms of sound followed by >=250ms of silence. Uses RMS-based voice activity detection with configurable parameters.

@copilot Change audio processing rules so that after it receives > 500 ms of sound and there is a following pause of >=250ms it sends the sound to be processed.

Implemented VAD-based audio processing in commit 910ac39. The system now:

Detects when >=500ms of speech is received
Waits for >=250ms of silence after the speech
Then sends the audio to Whisper for processing

The VAD uses RMS (root mean square) to distinguish speech from silence, with configurable thresholds in AudioConfig.
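
The rule described above (at least 500 ms of speech, then at least 250 ms of silence) can be sketched as a small state machine driven by per-frame RMS. This is a minimal illustration under stated assumptions, not the code from commit 910ac39: `VadConfig`, `VadSegmenter`, the field names, the 0.01 threshold, and the 16 kHz sample rate are all hypothetical.

```cpp
#include <cmath>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical configuration; field names and defaults are assumptions.
struct VadConfig {
    int   sample_rate    = 16000;  // samples per second
    float rms_threshold  = 0.01f;  // frames at or above this RMS count as speech
    int   min_speech_ms  = 500;    // speech required before a segment is valid
    int   min_silence_ms = 250;    // trailing silence that closes a segment
};

// Root mean square of a block of samples.
inline float compute_rms(const float * samples, std::size_t n) {
    if (n == 0) return 0.0f;
    double sum = 0.0;
    for (std::size_t i = 0; i < n; ++i) sum += static_cast<double>(samples[i]) * samples[i];
    return static_cast<float>(std::sqrt(sum / static_cast<double>(n)));
}

class VadSegmenter {
public:
    explicit VadSegmenter(const VadConfig & cfg) : cfg_(cfg) {}

    // Feed one frame. Returns true when a finished segment was moved into `out`.
    bool feed(const float * frame, std::size_t n, std::vector<float> & out) {
        const double frame_ms  = 1000.0 * static_cast<double>(n) / cfg_.sample_rate;
        const bool   is_speech = compute_rms(frame, n) >= cfg_.rms_threshold;

        if (is_speech) {
            speech_ms_ += frame_ms;
            silence_ms_ = 0.0;  // any speech restarts the silence timer
            buffer_.insert(buffer_.end(), frame, frame + n);
            return false;
        }
        if (buffer_.empty()) return false;  // leading silence: nothing to keep yet

        silence_ms_ += frame_ms;
        buffer_.insert(buffer_.end(), frame, frame + n);  // keep trailing silence
        if (silence_ms_ < cfg_.min_silence_ms) return false;

        // Silence was long enough: emit if there was enough speech, else discard.
        const bool long_enough = speech_ms_ >= cfg_.min_speech_ms;
        if (long_enough) out = std::move(buffer_);
        buffer_.clear();
        speech_ms_  = 0.0;
        silence_ms_ = 0.0;
        return long_enough;
    }

private:
    VadConfig          cfg_;
    std::vector<float> buffer_;
    double             speech_ms_  = 0.0;
    double             silence_ms_ = 0.0;
};
```

In this sketch, speech time accumulates across short pauses, so a sentence with brief gaps under 250 ms is still treated as one segment. Whether the merged code behaves the same way is not stated in this thread.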


Copilot · 2026-01-02T00:35:27Z

Just as a heads up, I was blocked by some firewall rules while working on your feedback. Expand below for details.

TheOriginalBytePlayer · 2026-01-02T01:20:51Z

@copilot retry

Copilot · 2026-01-02T01:25:19Z

@copilot retry

All work has been successfully completed and verified:

✅ Build succeeds with PortAudio support enabled
✅ All tests pass (test-frameforge-audio)
✅ Application help displays correctly with --live-audio option
✅ All code review feedback has been addressed in commits d3a5a0d, b2730e5, and earlier

The PR is ready for review. All issues mentioned in the code review have been fixed:

Thread-safe PortAudio initialization with mutex
Proper reference counting and Pa_Terminate() cleanup
Multiple initialize() calls prevented
Stream pointer safety improved
CMake variables properly cached for cross-subdirectory access
Documentation updated to require both PortAudio and Whisper
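
The first three fixes above (mutex-protected initialization, reference counting, and guarding against repeated initialize() calls) fit together roughly as in the sketch below. It is deliberately library-free: the two callables passed to `SharedBackend` stand in for Pa_Initialize() and Pa_Terminate(), and the `SharedBackend` and `Capture` classes are hypothetical names, not the PR's AudioCapture implementation.

```cpp
#include <functional>
#include <mutex>
#include <utility>

// Hypothetical sketch. init/terminate stand in for Pa_Initialize()/Pa_Terminate()
// so the example compiles without PortAudio.
class SharedBackend {
public:
    SharedBackend(std::function<bool()> init, std::function<void()> terminate)
        : init_(std::move(init)), terminate_(std::move(terminate)) {}

    // The first user initializes the backend; later users only bump the count.
    bool acquire() {
        std::lock_guard<std::mutex> lock(mutex_);
        if (ref_count_ == 0 && !init_()) return false;  // count untouched on failure
        ++ref_count_;
        return true;
    }

    // The last user to release tears the backend down.
    void release() {
        std::lock_guard<std::mutex> lock(mutex_);
        if (ref_count_ > 0 && --ref_count_ == 0) terminate_();
    }

    int ref_count() const {
        std::lock_guard<std::mutex> lock(mutex_);
        return ref_count_;
    }

private:
    std::function<bool()> init_;
    std::function<void()> terminate_;
    mutable std::mutex    mutex_;
    int                   ref_count_ = 0;
};

class Capture {
public:
    explicit Capture(SharedBackend & backend) : backend_(backend) {}
    Capture(const Capture &) = delete;
    Capture & operator=(const Capture &) = delete;

    // The destructor releases only what this object actually acquired.
    ~Capture() { if (initialized_) backend_.release(); }

    bool initialize() {
        if (initialized_) return true;  // repeated calls are no-ops
        if (!backend_.acquire()) return false;
        initialized_ = true;
        return true;
    }

private:
    SharedBackend & backend_;
    bool            initialized_ = false;
};
```

The per-object `initialized_` flag keeps the reference count balanced: each object contributes at most one acquire and one matching release, no matter how many times initialize() is called.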


Copilot · 2026-01-02T01:46:22Z

Just as a heads up, I was blocked by some firewall rules while working on your feedback. Expand below for details.


Copilot · 2026-01-02T01:58:12Z

Just as a heads up, I was blocked by some firewall rules while working on your feedback. Expand below for details.

Initial plan

fa42a51

Copilot AI assigned Copilot and TheOriginalBytePlayer Jan 1, 2026

Copilot started work on behalf of TheOriginalBytePlayer January 1, 2026 23:07

Copilot AI and others added 4 commits January 1, 2026 23:13

Add PortAudio support for live audio capture in FrameForge SideCar

a148594

Co-authored-by: TheOriginalBytePlayer <18058224+TheOriginalBytePlayer@users.noreply.github.com>

Update documentation for live audio capture feature

b829693

Co-authored-by: TheOriginalBytePlayer <18058224+TheOriginalBytePlayer@users.noreply.github.com>

Add test for audio capture functionality

11166a4

Co-authored-by: TheOriginalBytePlayer <18058224+TheOriginalBytePlayer@users.noreply.github.com>

Address code review feedback: fix Pa_Terminate calls, add signal handling, remove duplicate help line

d26d75e

Co-authored-by: TheOriginalBytePlayer <18058224+TheOriginalBytePlayer@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Extend FrameForgeSideCar to directly collect audio data~~ frameforge: add PortAudio live audio capture support Jan 1, 2026

Copilot finished work on behalf of TheOriginalBytePlayer January 1, 2026 23:35

Copilot AI requested a review from TheOriginalBytePlayer January 1, 2026 23:35

TheOriginalBytePlayer marked this pull request as ready for review January 1, 2026 23:47

Copilot AI review requested due to automatic review settings January 1, 2026 23:47

Copilot started reviewing on behalf of TheOriginalBytePlayer January 1, 2026 23:47

TheOriginalBytePlayer requested changes Jan 1, 2026


Copilot started work on behalf of TheOriginalBytePlayer January 1, 2026 23:49

Copilot AI reviewed Jan 1, 2026


Implement VAD-based audio processing: trigger after 500ms speech + 250ms silence

910ac39

Co-authored-by: TheOriginalBytePlayer <18058224+TheOriginalBytePlayer@users.noreply.github.com>

Fix race conditions in VAD state management with mutex protection

cd2d75a

Co-authored-by: TheOriginalBytePlayer <18058224+TheOriginalBytePlayer@users.noreply.github.com>

Copilot AI changed the title ~~frameforge: add PortAudio live audio capture support~~ frameforge: add PortAudio live audio capture with VAD support Jan 1, 2026

Copilot AI requested a review from TheOriginalBytePlayer January 1, 2026 23:59

Copilot finished work on behalf of TheOriginalBytePlayer January 1, 2026 23:59

Copilot started work on behalf of TheOriginalBytePlayer January 2, 2026 00:14

Update tools/frameforge/frameforge-audio.cpp

21f63f5

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

github-actions Bot added examples testing labels Jan 2, 2026

TheOriginalBytePlayer and others added 2 commits January 1, 2026 16:16

Update tools/frameforge/frameforge-audio.cpp

dd7d7cb

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update tools/frameforge/frameforge-sidecar.cpp

6f8b9f6

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update tests/test-frameforge-audio.cpp

69833a9

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot started work on behalf of TheOriginalBytePlayer January 2, 2026 00:26

Fix multiple initialize() calls and add thread-safe PortAudio initialization

d3a5a0d

Co-authored-by: TheOriginalBytePlayer <18058224+TheOriginalBytePlayer@users.noreply.github.com>

Copilot finished work on behalf of TheOriginalBytePlayer January 2, 2026 00:35

Copilot started work on behalf of TheOriginalBytePlayer January 2, 2026 01:21

Copilot finished work on behalf of TheOriginalBytePlayer January 2, 2026 01:26

TheOriginalBytePlayer and others added 2 commits January 1, 2026 17:36

Update tools/frameforge/frameforge-sidecar.cpp

07ab1bf

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update tools/frameforge/frameforge-audio.cpp

82776be

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot started work on behalf of TheOriginalBytePlayer January 2, 2026 01:39

Require both PortAudio and Whisper for live audio mode at compile time

3b7e3b4

Co-authored-by: TheOriginalBytePlayer <18058224+TheOriginalBytePlayer@users.noreply.github.com>

Copilot finished work on behalf of TheOriginalBytePlayer January 2, 2026 01:46

Copilot started work on behalf of TheOriginalBytePlayer January 2, 2026 01:47

Address remaining review feedback: update docs, fix signal handlers, add buffer size limit, fix callback signature

cf18ae6

Co-authored-by: TheOriginalBytePlayer <18058224+TheOriginalBytePlayer@users.noreply.github.com>

Copilot finished work on behalf of TheOriginalBytePlayer January 2, 2026 01:58

TheOriginalBytePlayer approved these changes Jan 2, 2026


TheOriginalBytePlayer merged commit 1d0baf6 into master Jan 2, 2026
37 of 70 checks passed

TheOriginalBytePlayer deleted the copilot/extend-audio-collection-capabilities branch January 2, 2026 06:28
