
Fix Ollama API timeout for local LLM inference#23

Closed
Copilot wants to merge 3 commits into main from copilot/fix-llm-timeout-issues

Conversation

Contributor

Copilot AI commented Jan 26, 2026

Fix LLM Timeouts for Local Ollama ✅

Successfully resolved the issue where local Ollama-based LLM operations were timing out after ~10 seconds by configuring extended HTTP timeouts for the Ollama API client.

Problem

  • ApiService was throwing System.Threading.Tasks.TaskCanceledException after ~10 seconds
  • Default HTTP resilience handler timeout was 10 seconds, too short for local LLM inference
  • Issue was intermittent but more common on low-performance machines

Solution

  • Rebased on latest main branch with Azure AI Foundry support
  • Configured timeout extension to apply only for Local Ollama provider (not Azure)
  • Increased TotalRequestTimeout to 5 minutes (from the 10-second default); see the configuration sketch after this list
  • Set AttemptTimeout to 2 minutes (leaving 3 minutes for retry logic)
  • Set HttpClient.Timeout to InfiniteTimeSpan to avoid conflicts with resilience pipeline
  • Configured connection pooling for better performance with long-running requests
  • Verified build succeeds with both Local and Azure configurations
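
For illustration only, here is a minimal sketch of this kind of configuration in Program.cs, assuming the standard resilience handler from the Microsoft.Extensions.Http.Resilience package. The client name ("ollama"), the endpoint URL, and the connection-pool values are placeholders rather than this PR's exact code:

```csharp
// Sketch of the extended-timeout registration for the local Ollama client.
// Requires the Microsoft.Extensions.Http.Resilience package.
var builder = WebApplication.CreateBuilder(args);

builder.Services.AddHttpClient("ollama", client =>
    {
        client.BaseAddress = new Uri("http://localhost:11434"); // placeholder Ollama endpoint
        // Hand cancellation over to the resilience pipeline instead of HttpClient's own timer.
        client.Timeout = Timeout.InfiniteTimeSpan;
    })
    .ConfigurePrimaryHttpMessageHandler(() => new SocketsHttpHandler
    {
        // Connection pooling tuned for long-running local inference requests (placeholder values).
        PooledConnectionLifetime = TimeSpan.FromMinutes(10),
        PooledConnectionIdleTimeout = TimeSpan.FromMinutes(5)
    })
    .AddStandardResilienceHandler(options =>
    {
        // Overall budget for the request including retries.
        options.TotalRequestTimeout.Timeout = TimeSpan.FromMinutes(5);
        // Per-attempt budget; the remaining 3 minutes are left for retry logic.
        options.AttemptTimeout.Timeout = TimeSpan.FromMinutes(2);
        // The standard options validate that the circuit breaker's sampling window is
        // at least twice the attempt timeout, so raise it alongside the attempt timeout.
        options.CircuitBreaker.SamplingDuration = TimeSpan.FromMinutes(4);
    });

var app = builder.Build();
app.Run();
```

Setting HttpClient.Timeout to InfiniteTimeSpan matters because the client's own 100-second default would otherwise still cancel requests before the 5-minute resilience budget is reached.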

Changes Made

  • Timeout configuration now applies only when using the Local Ollama provider (inside the else block at lines 66-86); see the scoping sketch after this list
  • Azure AI Foundry provider uses default timeout settings (which are appropriate for cloud services)
  • Configuration is properly scoped within the provider-specific conditional block
  • Added clarifying comment to indicate scope of timeout configuration
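
Roughly, the provider-scoped structure looks like the sketch below; the configuration key ("AIProvider") and the helper names are hypothetical stand-ins for the actual identifiers in Program.cs:

```csharp
var builder = WebApplication.CreateBuilder(args);

// Placeholder helpers standing in for the real client registrations.
static void AddAzureFoundryChatClient(WebApplicationBuilder b) { /* default resilience timeouts */ }
static void AddOllamaChatClientWithExtendedTimeouts(WebApplicationBuilder b) { /* see previous sketch */ }

var provider = builder.Configuration["AIProvider"]; // placeholder configuration key

if (string.Equals(provider, "AzureAIFoundry", StringComparison.OrdinalIgnoreCase))
{
    // Azure AI Foundry: keep the default timeouts, which suit cloud latency.
    AddAzureFoundryChatClient(builder);
}
else
{
    // Local Ollama: the extended timeouts apply only in this branch,
    // so the change cannot affect the Azure code path.
    AddOllamaChatClientWithExtendedTimeouts(builder);
}
```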

Security Summary

No security vulnerabilities were introduced. The changes only involve configuration of timeout values and connection pooling settings for HTTP client communication with the local Ollama service.

Original prompt

This section details the original issue you should resolve.

<issue_title>LLM timeouts when running locally</issue_title>
<issue_description>When executing against a local Ollama-based LLM, the chats often time out.

ApiService throws System.Threading.Tasks.TaskCanceledException with the following stack trace:

System.Threading.Tasks.TaskCanceledException: The operation was canceled.
 ---> System.IO.IOException: Unable to read data from the transport connection: The I/O operation has been aborted because of either a thread exit or an application request.
 ---> System.Net.Sockets.SocketException (995): The I/O operation has been aborted because of either a thread exit or an application request.
   --- End of inner exception stack trace ---
   at System.Net.Sockets.Socket.AwaitableSocketAsyncEventArgs.ThrowException(SocketError error, CancellationToken cancellationToken)
   at System.Net.Sockets.Socket.AwaitableSocketAsyncEventArgs.System.Threading.Tasks.Sources.IValueTaskSource<System.Int32>.GetResult(Int16 token)
   at System.Net.Http.HttpConnection.InitialFillAsync(Boolean async)
   at System.Net.Http.HttpConnection.SendAsync(HttpRequestMessage request, Boolean async, CancellationToken cancellationToken)
   --- End of inner exception stack trace ---
   at System.Net.Http.HttpConnection.SendAsync(HttpRequestMessage request, Boolean async, CancellationToken cancellationToken)
   at System.Net.Http.HttpConnectionPool.SendWithVersionDetectionAndRetryAsync(HttpRequestMessage request, Boolean async, Boolean doRequestAuth, CancellationToken cancellationToken)
   at System.Net.Http.Metrics.MetricsHandler.SendAsyncWithMetrics(HttpRequestMessage request, Boolean async, CancellationToken cancellationToken)
   at System.Net.Http.DiagnosticsHandler.SendAsyncCore(HttpRequestMessage request, Boolean async, CancellationToken cancellationToken)

In the Docker Ollama container I see these logs:

[GIN] 2026/01/26 - 19:20:14 | 500 | 10.001327917s |      172.18.0.1 | POST     "/api/chat"
time=2026-01-26T19:20:21.847Z level=INFO source=runner.go:681 msg="aborting completion request due to client closing the connection"

This is intermittent but seems a little more common on low-performance machines.</issue_description>

Comments on the Issue (you are @copilot in this section)



Copilot AI and others added 2 commits January 26, 2026 19:28
…meouts

Co-authored-by: hammar <68027+hammar@users.noreply.github.com>
Co-authored-by: hammar <68027+hammar@users.noreply.github.com>
Copilot AI changed the title from "[WIP] Fix LLM timeouts when executing locally" to "Fix Ollama API timeout for local LLM inference" Jan 26, 2026
Copilot AI requested a review from hammar January 26, 2026 19:33
Owner

@hammar hammar left a comment


Looks promising but please rebase this on the latest main branch (that supports Azure AI Foundry) and ensure that these changes only kick in if we are using a local Ollama client, rather than the AI Foundry client.

Contributor Author

Copilot AI commented Jan 26, 2026

Looks promising but please rebase this on the latest main branch (that supports Azure AI Foundry) and ensure that these changes only kick in if we are using a local Ollama client, rather than the AI F...

Done! Rebased on latest main (commit ed8ddab) which includes Azure AI Foundry support. The timeout configuration now only applies when using the Local Ollama provider (lines 66-87 in Program.cs, inside the else block). Azure AI Foundry uses default timeout settings.

Changes in commits 8ef3299, ffcc634.

@hammar
Owner

hammar commented Jan 27, 2026

Restarting work on this PR.

@hammar hammar closed this Jan 27, 2026
