[Serve] fix video analysis release test by abrarsheikh · Pull Request #61324 · ray-project/ray

abrarsheikh · 2026-02-25T21:16:18Z

regression caused due to upgrade in transformer lib version, specifically there is a behavior change caused by huggingface/transformers#42564

Signed-off-by: abrar <abrar@anyscale.com>

gemini-code-assist

Code Review

This pull request addresses a regression in the video analysis release test caused by an upgrade in the transformers library. The change in doc/source/serve/tutorials/video-analysis/deployments/encoder.py correctly adapts to the new return type of model.get_image_features. Previously, this method returned a tensor directly, but now it returns a BaseModelOutputWithPooling object. The fix correctly accesses the pooler_output attribute of this object to retrieve the frame embeddings before normalization. The change is correct and necessary to fix the regression. I have added one suggestion to improve a code comment.

gemini-code-assist · 2026-02-25T21:19:04Z

-
-                # L2 normalize on GPU (faster than CPU numpy)
-                frame_embeddings = torch.nn.functional.normalize(outputs, p=2, dim=1)
+                # get_image_features returns BaseModelOutputWithPooling; use pooler_output for embeddings


While the new comment is helpful for understanding the change in the transformers library, the removed comment # L2 normalize on GPU (faster than CPU numpy) provided a useful performance rationale. It would be beneficial to retain this information for future maintainers, especially in a tutorial. Consider combining both pieces of information.

Suggested change

# get_image_features returns BaseModelOutputWithPooling; use pooler_output for embeddings

# get_image_features returns BaseModelOutputWithPooling; use pooler_output for embeddings.

# The embeddings are then L2 normalized on GPU (faster than CPU numpy).

[Serve] fix video analysis release test

f0110b7

Signed-off-by: abrar <abrar@anyscale.com>

abrarsheikh requested a review from a team as a code owner February 25, 2026 21:16

abrarsheikh added the go add ONLY when ready to merge, run all tests label Feb 25, 2026

akyang-anyscale approved these changes Feb 25, 2026

View reviewed changes

gemini-code-assist Bot reviewed Feb 25, 2026

View reviewed changes

Merge branch 'master' into abrar-vid-analysis-fix

d958785

aslonnie merged commit 73b5266 into master Feb 26, 2026
5 of 6 checks passed

aslonnie deleted the abrar-vid-analysis-fix branch February 26, 2026 01:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Serve] fix video analysis release test#61324

[Serve] fix video analysis release test#61324
aslonnie merged 2 commits intomasterfrom
abrar-vid-analysis-fix

abrarsheikh commented Feb 25, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Feb 25, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	# get_image_features returns BaseModelOutputWithPooling; use pooler_output for embeddings
	# get_image_features returns BaseModelOutputWithPooling; use pooler_output for embeddings.
	# The embeddings are then L2 normalized on GPU (faster than CPU numpy).

Conversation

abrarsheikh commented Feb 25, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants