feat: answering machine detection by chenghao-mou · Pull Request #4906 · livekit/agents

chenghao-mou · 2026-02-20T13:35:29Z

AMDResult with categories: human, machine-ivr, machine-vm, machine-unavailable, and uncertain
amd.execute() API for agents to await detection results
Example in ‎examples/telephony/amd.py

Usage

await session.start(
    agent=MyAgent(),
    room=ctx.room,
)

async with AMD(session, llm="openai/gpt-5-mini") as detector:
    result = await detector.execute()

    if result.category == "human":
        logger.info("human answered the call, proceeding with normal conversation")
    elif result.category == "machine-ivr":
        logger.info("ivr menu detected, starting navigation")
    elif result.category == "machine-vm":
        logger.info("voicemail detected, leaving a message")
        speech_handle = session.generate_reply(
            instructions=(
                "You've reached voicemail. Leave a brief message asking "
                "the customer to call back."
            ),
        )
        await speech_handle.wait_for_playout()
        session.shutdown()
    elif result.category == "machine-unavailable":
        logger.info("mailbox unavailable, ending call")
        session.shutdown()

Co-authored-by: devin-ai-integration[bot] <158243242+devin-ai-integration[bot]@users.noreply.github.com>

devin-ai-integration

Devin Review found 2 new potential issues.

View 11 additional findings in Devin Review.

devin-ai-integration

Devin Review found 2 new potential issues.

View 15 additional findings in Devin Review.

devin-ai-integration · 2026-04-03T22:08:21Z

+    def _on_first_audio(self) -> None:
+        """Start AMD on the first audio frame and pause speech authorization."""
+        if self._classifier is None or self._classifier.started:
+            return
+        self._classifier.start()
+        if self._session is not None and self._session._activity is not None:
+            self._session._activity._pause_authorization()


🟡 AMD authorization pause not applied to new AgentActivity created during agent handoff

When AMD pauses authorization via _pause_authorization() on the current AgentActivity, and then an agent handoff occurs (e.g., via update_agent), a new AgentActivity is created with _authorization_allowed initialized as set (livekit-agents/livekit/agents/voice/agent_activity.py:155-156). The AMD's _on_first_audio only fires once (it checks self._classifier.started at detector.py:139 and returns), so _pause_authorization() is never called on the new activity. This means the new activity's speech will bypass AMD's authorization gate, defeating the purpose of holding speech until AMD resolves.

Scenario

AMD starts, calls _pause_authorization() on current activity

Agent handoff occurs (e.g. user calls session.update_agent()) while AMD is still pending

New AgentActivity is created with _authorization_allowed already set

Speech on the new activity proceeds without waiting for AMD result

Prompt for agents

The AMD detector pauses authorization on the current AgentActivity, but if an agent handoff creates a new AgentActivity while AMD is still pending, the new activity won't have authorization paused. To fix this, either: (1) propagate the AMD authorization pause state to new AgentActivity instances when they are created in _update_activity (in agent_session.py), e.g. by checking if session._amd is pending and calling _pause_authorization() on the new activity; or (2) have the AMD store a reference to the session rather than the activity and apply the pause on whatever is the current activity at any given time, checking this in the activity's authorization wait path.

Was this helpful? React with 👍 or 👎 to provide feedback.

devin-ai-integration · 2026-04-03T22:08:22Z

+        result = self._result
+
+        if result.is_machine and self._interrupt_on_machine:
+            await self._session.interrupt(force=True)
+
+        if result.category == AMDCategory.MACHINE_IVR and self._ivr_detection:
+            await self._session._start_ivr_detection(transcript=result.transcript)
+
+        # eagerly resume so agent can speak immediately to a human
+        if self._session._activity is not None:
+            self._session._activity._resume_authorization()
+
+        return result


🟡 execute() does not resume authorization if an exception occurs before _resume_authorization()

In execute(), if self._session.interrupt(force=True) (line 105) or self._session._start_ivr_detection(...) (line 108) raises an exception, the _resume_authorization() call at line 112 is skipped. When execute() is used inside the async with AMD(...) context manager, __aexit__ → aclose() will resume authorization as a fallback. However, if execute() is called directly (without the context manager), authorization remains permanently paused, deadlocking all subsequent speech generation.

Suggested change

result = self._result

if result.is_machine and self._interrupt_on_machine:

await self._session.interrupt(force=True)

if result.category == AMDCategory.MACHINE_IVR and self._ivr_detection:

await self._session._start_ivr_detection(transcript=result.transcript)

# eagerly resume so agent can speak immediately to a human

if self._session._activity is not None:

self._session._activity._resume_authorization()

return result

result = self._result

try:

if result.is_machine and self._interrupt_on_machine:

await self._session.interrupt(force=True)

if result.category == AMDCategory.MACHINE_IVR and self._ivr_detection:

await self._session._start_ivr_detection(transcript=result.transcript)

finally:

# eagerly resume so agent can speak immediately to a human

if self._session._activity is not None:

self._session._activity._resume_authorization()

return result

Was this helpful? React with 👍 or 👎 to provide feedback.

* upstream/main: fix: add PARTICIPANT_KIND_CONNECTOR to default participant kinds (livekit#5339) feat: expose service_tier in CompletionUsage from OpenAI Responses API (livekit#5341) feat: answering machine detection (livekit#4906) fix: wait_for_participant waits until participant is fully active (livekit#5271) (gemini realtime): add warnings in update_chat_ctx and update_instructions (livekit#5332) fix: convert oneOf to anyOf in strict schema for discriminated unions (livekit#5324) fix(voice): make function call history preservation configurable in AgentTask (livekit#5288)

* fix(voice): make function call history preservation configurable in AgentTask (livekit#5288) * fix: convert oneOf to anyOf in strict schema for discriminated unions (livekit#5324) * (gemini realtime): add warnings in update_chat_ctx and update_instructions (livekit#5332) * fix: wait_for_participant waits until participant is fully active (livekit#5271) * feat: answering machine detection (livekit#4906) Co-authored-by: devin-ai-integration[bot] <158243242+devin-ai-integration[bot]@users.noreply.github.com> * feat: expose service_tier in CompletionUsage from OpenAI Responses API (livekit#5341) * fix: add PARTICIPANT_KIND_CONNECTOR to default participant kinds (livekit#5339) --------- Co-authored-by: Gopal Bagaswar <67310594+GopalGB@users.noreply.github.com> Co-authored-by: Long Chen <longch1024@gmail.com> Co-authored-by: Tina Nguyen <72938484+tinalenguyen@users.noreply.github.com> Co-authored-by: David Zhao <dz@livekit.io> Co-authored-by: Chenghao Mou <chenghao.mou@livekit.io> Co-authored-by: devin-ai-integration[bot] <158243242+devin-ai-integration[bot]@users.noreply.github.com> Co-authored-by: Piyush Gambhir <90608533+piyush-gambhir@users.noreply.github.com> Co-authored-by: Anunay Maheshwari <anunaym14@gmail.com>

Co-authored-by: devin-ai-integration[bot] <158243242+devin-ai-integration[bot]@users.noreply.github.com>

chenghao-mou added 9 commits February 9, 2026 15:24

add AMD interface

ce60b71

add amd event emitter

26d3aaa

update event emitter

5672b37

add amd hold

7e67624

refactor amd result and example

1786e0c

Merge branch 'main' into feat/amd

1131225

minor fixes

b69884e

clean up example

44d845f

clean up and refactoring

a3222bd

chenghao-mou marked this pull request as ready for review March 3, 2026 17:37

chenghao-mou requested a review from a team March 3, 2026 17:37

This comment was marked as resolved.

Sign in to view

fix example

404bc2b

This comment was marked as resolved.

Sign in to view

disable AMD after first use

3be962f

This comment was marked as resolved.

Sign in to view

chenghao-mou marked this pull request as draft March 9, 2026 22:18

theomonnom reviewed Mar 11, 2026

View reviewed changes

Comment thread livekit-agents/livekit/agents/voice/amd/base.py Outdated

chenghao-mou and others added 4 commits March 15, 2026 16:41

Merge branch 'main' into feat/amd

8517423

refactoring

0ee8791

use function call for prediction

1b78498

exclude silence from speech duration calculation

9dc196c

Co-authored-by: devin-ai-integration[bot] <158243242+devin-ai-integration[bot]@users.noreply.github.com>

chenghao-mou marked this pull request as ready for review March 15, 2026 20:02

chenghao-mou requested a review from a team March 15, 2026 20:02

This comment was marked as resolved.

Sign in to view

chenghao-mou added 5 commits March 16, 2026 14:03

minor refactoring

56c0b38

more fixes

9baf7df

Merge branch 'main' into feat/amd

865737a

fix type issues

3c845c3

move amd out of session

348e1a3

This comment was marked as resolved.

Sign in to view

chenghao-mou added 2 commits April 2, 2026 22:54

add docstring

772b731

update to use str enum

0b4bd23

This comment was marked as resolved.

Sign in to view

use string match in example instead

8790091

davidzhao reviewed Apr 3, 2026

View reviewed changes

Comment thread examples/telephony/amd.py

Comment thread livekit-agents/livekit/agents/voice/amd/detector.py

rename amd -> machine detection/detector

a7925e2

This comment was marked as resolved.

Sign in to view

address comments

382c9b4

This comment was marked as resolved.

Sign in to view

clean up autocomplete error

efa2286

theomonnom approved these changes Apr 3, 2026

View reviewed changes

rename to amd

265bdf1

This comment was marked as resolved.

Sign in to view

address comments

91dadc0

chenghao-mou changed the title ~~feat: automatic machine detection~~ feat: answering machine detection Apr 3, 2026

chenghao-mou added needs-js needs-documentation labels Apr 3, 2026

rename

4cfcae9

devin-ai-integration Bot reviewed Apr 3, 2026

View reviewed changes

Comment thread livekit-agents/livekit/agents/voice/amd/classifier.py Outdated

Comment thread livekit-agents/livekit/agents/voice/amd/detector.py Outdated

chenghao-mou added 2 commits April 3, 2026 22:48

address more comments

6751e9a

fix type issues

8bed855

devin-ai-integration Bot reviewed Apr 3, 2026

View reviewed changes

chenghao-mou mentioned this pull request Apr 3, 2026

feat(amd): answering machine detection livekit/agents-js#1204

Open

chenghao-mou merged commit 2f604bc into main Apr 3, 2026
20 of 22 checks passed

chenghao-mou deleted the feat/amd branch April 3, 2026 22:41

chenghao-mou mentioned this pull request Apr 9, 2026

feat(voice): add answering machine detection helper livekit/agents-js#1215

Merged

russellmartin-livekit pushed a commit that referenced this pull request Apr 13, 2026

feat: answering machine detection (#4906)

6f785f1

Co-authored-by: devin-ai-integration[bot] <158243242+devin-ai-integration[bot]@users.noreply.github.com>

Conversation

chenghao-mou commented Feb 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

chenghao-mou commented Feb 20, 2026 •

edited

Loading