add resume_false_interruption and pause/resume the audio output #3109

longcw · 2025-08-08T01:57:34Z

No description provided.

longcw · 2025-08-12T03:32:43Z

@theomonnom can you review this one?

theomonnom

this lg, I'm just not sure about the scenario where a SpeechHandle created using say/generate_reply should be "restarted".

theomonnom · 2025-08-13T01:35:26Z

Just saw one of those examples in dz's PR:

        await self.session.say("connecting you to the customer now.")
        await self.session_manager.merge_calls()

longcw · 2025-08-13T01:36:18Z

this lg, I'm just not sure about the scenario where a SpeechHandle created using say/generate_reply should be "restarted".

we can add the source back to the event, and skip retry for source say in the default callback.

theomonnom · 2025-08-26T03:31:01Z

livekit-agents/livekit/agents/voice/room_io/_output.py


+        self._audio_buf = utils.aio.Chan[rtc.AudioFrame]()
+        self._audio_bstream = utils.audio.AudioByteStream(
+            sample_rate, num_channels, samples_per_channel=sample_rate // 20


The tricky thing if splitting the frames in smaller size, is that as soon as the asyncio loop is slower or the cpu usage is high, the audio may be stuttering.

You should be able to see that when using the stress cmd on mac

I remember having a lot of issues around that when initially developing agents. This is why we can push faster than realtime from the Python side (from this PR)

but we still have a reasonable buffer (200ms) for the AudioSource, the chunking here is just to ensure a single frame is not too big that we paused too late.

I see, so the interruptions has a min of 200ms latency here? Would be good to check if 200ms is enough. The only real reason we need big buffers is because users may write slow code on the event loop

we can use 100ms or 200ms, but I am wondering, what will happen if we call audio_source.clear_queue() when await audio_source.capture(frame) didn't return, how can we get the queued audio back from the audio_source?

if we can get the audio back, then the queue size and frame size don't matter, I can implement the pause in a different way when pause is called.

There is no easy way to get the audio back. But it's a complete OK assumption to assume that the playout was realtime. So 2s passed = 2s to discard

right now we will have a max of frame_size latency to pause the audio, and will drop max to 200ms (audio_source queue size) of audio when pause.

maybe 200ms is completely fine in both naturalness and cpu stress

longcw · 2025-08-26T13:37:36Z

livekit-agents/livekit/agents/voice/agent_activity.py

+                logger.debug("resumed false interrupted speech", extra={"timeout": timeout})
+
+            self._session.emit(
+                "agent_false_interruption", AgentFalseInterruptionEvent(resumed=resumed)


keep this event or replace it with a callback?

We should deprecate it for now, to avoid breaking changes, logging a warning is fine for now

theomonnom · 2025-08-26T17:16:07Z

livekit-agents/livekit/agents/voice/agent_session.py

        video_sampler: NotGivenOr[_VideoSampler | None] = NOT_GIVEN,
        user_away_timeout: float | None = 15.0,
-        agent_false_interruption_timeout: float | None = 4.0,
+        agent_false_interruption_timeout: float | None = 2.0,


Suggested change

agent_false_interruption_timeout: float | None = 2.0,

false_interruption_timeout: float | None = 2.0,

I think it's fine to call it this way, user_false_interruption_timeout wouldn't make a ton of sense anyway

this will be a breaking change, should we change a name and log deprecate warning if user set agent_false_interruption_timeout?

theomonnom

lgtm!!

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

longcw requested a review from a team August 8, 2025 01:57

theomonnom reviewed Aug 13, 2025

View reviewed changes

longcw requested a review from theomonnom August 13, 2025 07:33

longcw force-pushed the longc/resume-interrupted-agent-cb branch from 2db6dc1 to 3a0ec93 Compare August 20, 2025 03:22

longcw changed the title ~~add resume_false_interruption callback to AgentSession~~ add resume_false_interruption and pause/resume the audio output Aug 20, 2025

longcw added 7 commits August 22, 2025 09:33

add resume_false_interruption callback to AgentSession

0acfee1

update basic agent example

13b79ba

ignore interruption on session.say

6b39e90

add pause and resume

6f7dd1c

clean

4a2812f

fix await interruption issues

292ec5f

update example

bd23bf3

longcw force-pushed the longc/resume-interrupted-agent-cb branch from 6a951eb to bd23bf3 Compare August 22, 2025 01:33

longcw and others added 6 commits August 22, 2025 10:31

ignore empty final transcripts

9ecd3c4

fix none check

9fe1c96

add pause for cli audio output

4ccf232

clear paused buf

363cedd

chunk frames sent to audio output

ca7b3af

move chunk to room io

0a2d250

theomonnom reviewed Aug 26, 2025

View reviewed changes

longcw added 3 commits August 26, 2025 16:51

add agent_false_interruption event back

ce2fa36

fix for cli output

c36a781

add warning for avatar

b75e1db

longcw commented Aug 26, 2025

View reviewed changes

theomonnom reviewed Aug 26, 2025

View reviewed changes

theomonnom approved these changes Aug 26, 2025

View reviewed changes

refactor chat cli pause

1c9c24c

longcw added 3 commits August 27, 2025 11:36

rename to false_interruption_timeout

945cb6b

fix test

8ad6629

fix chat cli mark empty

33e51a8

longcw merged commit 2c84fc9 into main Aug 27, 2025
25 checks passed

longcw deleted the longc/resume-interrupted-agent-cb branch August 27, 2025 07:19

akshaym1shra pushed a commit to akshaym1shra/agents that referenced this pull request Aug 28, 2025

add resume_false_interruption and pause/resume the audio output (live…

26ef3fe

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

akshaym1shra pushed a commit to akshaym1shra/agents that referenced this pull request Aug 28, 2025

add resume_false_interruption and pause/resume the audio output (live…

c2ca097

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

akshaym1shra pushed a commit to akshaym1shra/agents that referenced this pull request Sep 5, 2025

add resume_false_interruption and pause/resume the audio output (live…

07b5b80

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

akshaym1shra pushed a commit to akshaym1shra/agents that referenced this pull request Sep 5, 2025

add resume_false_interruption and pause/resume the audio output (live…

d2cbf9d

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

akshaym1shra pushed a commit to akshaym1shra/agents that referenced this pull request Sep 19, 2025

add resume_false_interruption and pause/resume the audio output (live…

a8f6915

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

akshaym1shra pushed a commit to akshaym1shra/agents that referenced this pull request Sep 19, 2025

add resume_false_interruption and pause/resume the audio output (live…

a50c32d

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

akshaym1shra pushed a commit to akshaym1shra/agents that referenced this pull request Sep 29, 2025

add resume_false_interruption and pause/resume the audio output (live…

8bfdb3a

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

akshaym1shra pushed a commit to akshaym1shra/agents that referenced this pull request Sep 29, 2025

add resume_false_interruption and pause/resume the audio output (live…

c90462b

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

akshaym1shra pushed a commit to akshaym1shra/agents that referenced this pull request Nov 20, 2025

add resume_false_interruption and pause/resume the audio output (live…

0bbf6a9

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

akshaym1shra pushed a commit to akshaym1shra/agents that referenced this pull request Nov 20, 2025

add resume_false_interruption and pause/resume the audio output (live…

939d717

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

akshaym1shra pushed a commit to akshaym1shra/agents that referenced this pull request Nov 28, 2025

add resume_false_interruption and pause/resume the audio output (live…

6d290cd

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

akshaym1shra pushed a commit to akshaym1shra/agents that referenced this pull request Nov 28, 2025

add resume_false_interruption and pause/resume the audio output (live…

f0ffbb5

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

longcw mentioned this pull request Dec 15, 2025

add pause support for ConsoleAudioOutput #4251

Merged

akshaym1shra pushed a commit to akshaym1shra/agents that referenced this pull request Jan 11, 2026

add resume_false_interruption and pause/resume the audio output (live…

022dd8b

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

akshaym1shra pushed a commit to akshaym1shra/agents that referenced this pull request Jan 11, 2026

add resume_false_interruption and pause/resume the audio output (live…

dcd2e52

…kit#3109) Co-authored-by: David Zhao <dz@livekit.io>

	agent_false_interruption_timeout: float \| None = 2.0,
	false_interruption_timeout: float \| None = 2.0,

add resume_false_interruption and pause/resume the audio output #3109

add resume_false_interruption and pause/resume the audio output #3109

Uh oh!

Conversation

longcw commented Aug 8, 2025

Uh oh!

longcw commented Aug 12, 2025

Uh oh!

theomonnom left a comment

Choose a reason for hiding this comment

Uh oh!

theomonnom commented Aug 13, 2025

Uh oh!

longcw commented Aug 13, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

longcw Aug 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

theomonnom left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

longcw Aug 26, 2025 •

edited

Loading