Skip to content

Conversation

@Xunzhuo
Copy link
Member

@Xunzhuo Xunzhuo commented Aug 27, 2025

What this PR does / why we need it:

This PR added auto-reasoning mode based on request intent

Signed-off-by: bitliu <bitliu@tencent.com>
@Xunzhuo Xunzhuo changed the title feat: support reasoning mode feat: support auto-enable reasoning mode based on intention Aug 27, 2025
@Xunzhuo Xunzhuo merged commit d7c578e into main Aug 27, 2025
1 of 3 checks passed
@Xunzhuo Xunzhuo deleted the reasoning-mode branch August 29, 2025 03:23
srini-abhiram added a commit to srini-abhiram/semantic-router that referenced this pull request Dec 2, 2025
This commit addresses 6 critical issues identified during code review,
plus build improvements for better user experience.

Critical fixes:
- Issue vllm-project#1: PID file race condition - kill router process if PID file
  write fails to prevent untrackable processes
- Issue vllm-project#2: Cross-platform compatibility - replace hardcoded /tmp paths
  with os.TempDir() and add user-specific suffixes using os.Getuid()
- Issue vllm-project#3: Signal handling - add graceful shutdown for port-forward
  process on Ctrl+C to prevent orphaned kubectl processes
- Issue vllm-project#4: HTTP timeout - add 30-second timeout to prevent hanging
  requests in test command
- Issue vllm-project#5: Security - restrict log/PID file permissions to 0600
- Issue vllm-project#6: Input validation - enforce 10k character limit on prompts

Build improvements:
- Embed library path using -ldflags="-r" to eliminate need for users
  to manually set LD_LIBRARY_PATH when running vsr binary

Files modified:
- src/semantic-router/pkg/cli/deployment/deployment.go
- src/semantic-router/pkg/cli/deployment/upgrade.go
- src/semantic-router/cmd/vsr/commands/dashboard.go
- src/semantic-router/cmd/vsr/commands/test.go
- tools/make/build-run-test.mk
- src/semantic-router/pkg/cli/deployment/deployment_test.go

All fixes have been verified with standalone test programs.

Signed-off-by: Srinivas A <56465971+srini-abhiram@users.noreply.github.com>
srini-abhiram added a commit to srini-abhiram/semantic-router that referenced this pull request Dec 2, 2025
This commit addresses 6 critical issues identified during code review,
plus build improvements for better user experience.

Critical fixes:
- Issue vllm-project#1: PID file race condition - kill router process if PID file
  write fails to prevent untrackable processes
- Issue vllm-project#2: Cross-platform compatibility - replace hardcoded /tmp paths
  with os.TempDir() and add user-specific suffixes using os.Getuid()
- Issue vllm-project#3: Signal handling - add graceful shutdown for port-forward
  process on Ctrl+C to prevent orphaned kubectl processes
- Issue vllm-project#4: HTTP timeout - add 30-second timeout to prevent hanging
  requests in test command
- Issue vllm-project#5: Security - restrict log/PID file permissions to 0600
- Issue vllm-project#6: Input validation - enforce 10k character limit on prompts

Build improvements:
- Embed library path using -ldflags="-r" to eliminate need for users
  to manually set LD_LIBRARY_PATH when running vsr binary

Files modified:
- src/semantic-router/pkg/cli/deployment/deployment.go
- src/semantic-router/pkg/cli/deployment/upgrade.go
- src/semantic-router/cmd/vsr/commands/dashboard.go
- src/semantic-router/cmd/vsr/commands/test.go
- tools/make/build-run-test.mk
- src/semantic-router/pkg/cli/deployment/deployment_test.go

All fixes have been verified with standalone test programs.

Signed-off-by: Srinivas A <56465971+srini-abhiram@users.noreply.github.com>
srini-abhiram added a commit to srini-abhiram/semantic-router that referenced this pull request Dec 2, 2025
Add unit tests to verify all 6 critical production fixes implemented
in the previous commit. Tests ensure cross-platform compatibility,
security, and robustness of the VSR CLI tool.

Test coverage added:
- Cross-platform path handling (Issue vllm-project#2)
  * Absolute path validation
  * User-specific path generation
  * Temp directory usage

- PID file management and security (Issues vllm-project#1, vllm-project#5)
  * PID file race condition handling
  * Process termination on write failure
  * Restrictive 0600 file permissions
  * Proper cleanup on exit

- HTTP client timeout (Issue vllm-project#4)
  * 30-second timeout prevents hanging
  * Fast responses unaffected
  * Timeout configuration validation

- Input validation (Issue vllm-project#6)
  * 10k character limit enforcement
  * Boundary condition testing

- Signal handling (Issue vllm-project#3)
  * Graceful shutdown on interrupt
  * Process cleanup verification
  * No orphaned processes

All tests compile and validate the production fixes.

Signed-off-by: Srinivas A <56465971+srini-abhiram@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants