[Feature] Add automatic GPU memory cleanup fixture for ROCm tests #181

eppaneamd wants to merge 4 commits into ROCm:amd-integration from
Conversation
Pull request overview
This PR introduces automatic GPU memory cleanup for ROCm tests by adding a pytest fixture that monitors and conditionally clears GPU cache after each test based on a configurable memory threshold.
Changes:
- Added `_maybe_clear_gpu_memory()` helper function to check GPU memory usage and clear the cache when usage exceeds the threshold
- Added `clear_gpu_memory` autouse pytest fixture to automatically run cleanup after each test
- Introduced `FLASHINFER_TEST_MEMORY_THRESHOLD` environment variable for a configurable threshold (default: 0.75)
```python
def _maybe_clear_gpu_memory(device: torch.device) -> None:
    total_memory = torch.cuda.get_device_properties(device).total_memory
    reserved_memory = torch.cuda.memory_reserved()
```
The `torch.cuda.memory_reserved()` call is missing the device parameter. Without it, the call defaults to the current device, which may not match the device passed to this function. This should be `torch.cuda.memory_reserved(device)` to ensure consistency with the device used for getting total memory on line 65.
```diff
-    reserved_memory = torch.cuda.memory_reserved()
+    reserved_memory = torch.cuda.memory_reserved(device)
```
```python
def _maybe_clear_gpu_memory(device: torch.device) -> None:
    total_memory = torch.cuda.get_device_properties(device).total_memory
    reserved_memory = torch.cuda.memory_reserved()

    # FLASHINFER_TEST_MEMORY_THRESHOLD: threshold for PyTorch reserved memory usage (default: 0.75)
    threshold = float(os.environ.get("FLASHINFER_TEST_MEMORY_THRESHOLD", "0.75"))

    if reserved_memory > threshold * total_memory:
        gc.collect()
        torch.cuda.empty_cache()
```
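The threshold logic itself can be exercised without a GPU. The sketch below isolates the comparison in a hypothetical `should_clear` helper (not part of the PR) that takes memory figures as plain integers in place of the `torch.cuda` queries:

```python
import os


def should_clear(reserved_memory: int, total_memory: int) -> bool:
    # Hypothetical helper mirroring the PR's check: clear the cache when
    # reserved memory exceeds threshold * total memory.
    threshold = float(os.environ.get("FLASHINFER_TEST_MEMORY_THRESHOLD", "0.75"))
    return reserved_memory > threshold * total_memory


# With the default 0.75 threshold (assuming the env var is unset):
print(should_clear(6 * 1024**3, 8 * 1024**3))  # 6/8 = 0.75, not strictly above
print(should_clear(7 * 1024**3, 8 * 1024**3))  # 7/8 = 0.875, above the threshold
```

Note that the comparison is strict (`>`), so a test sitting exactly at the threshold does not trigger a cache clear.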
This function duplicates the logic of `clear_cuda_cache` in `tests/test_helpers/test_helpers.py`, with two differences: (1) the default threshold is 0.75 here vs 0.9 there, and (2) the existing function has the same bug of a missing device parameter in `memory_reserved()`. Consider either reusing the existing function or keeping the two implementations consistent. The lower threshold (0.75) means more aggressive cleanup, which may affect test performance differently than in the existing tests.
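One way to address the duplication, sketched below as a suggestion rather than code from the PR: factor the check into a single shared helper and inject the memory queries and the default threshold, so both call sites (0.9 for the existing `clear_cuda_cache`, 0.75 for the ROCm fixture) share one implementation. All names here are hypothetical:

```python
import gc
import os
from typing import Callable


def clear_cache_if_needed(
    get_reserved: Callable[[], int],
    get_total: Callable[[], int],
    empty_cache: Callable[[], None],
    default_threshold: float = 0.9,
) -> bool:
    # Shared helper: each call site passes its own default threshold,
    # while the env var still overrides both. Returns True if it cleared.
    threshold = float(
        os.environ.get("FLASHINFER_TEST_MEMORY_THRESHOLD", str(default_threshold))
    )
    if get_reserved() > threshold * get_total():
        gc.collect()
        empty_cache()
        return True
    return False


# Fake memory readings stand in for the torch.cuda queries, so the
# helper's behavior can be checked without a GPU.
calls = []
cleared = clear_cache_if_needed(
    get_reserved=lambda: 95,
    get_total=lambda: 100,
    empty_cache=lambda: calls.append("empty_cache"),
)
```

In real use, the callables would be `lambda: torch.cuda.memory_reserved(device)`, `lambda: torch.cuda.get_device_properties(device).total_memory`, and `torch.cuda.empty_cache`, which also forces the device parameter to be passed consistently at both call sites.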
📌 Description

Introduces an automatic pytest fixture that monitors and cleans GPU memory after each test in the ROCm test suite.

- `_maybe_clear_gpu_memory()` helper function to conditionally clear the GPU cache based on a memory threshold
- `clear_gpu_memory` fixture with `autouse=True` to run automatically after each test
- `FLASHINFER_TEST_MEMORY_THRESHOLD` environment variable (default: 0.75)
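Since the fixture is autouse, no test changes are needed; the threshold can be tuned per run through the environment variable. The invocation below is illustrative (the test path is not taken from the PR):

```shell
# Relax cleanup: only clear the cache when more than 90% of device
# memory is reserved, instead of the 0.75 default.
FLASHINFER_TEST_MEMORY_THRESHOLD=0.9 pytest tests/
```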