Implemented fast image processor for VitPose #40149

Open
msminhas93 wants to merge 4 commits into huggingface:main from msminhas93:fast-image-processor-vitpose

@msminhas93

What does this PR do?

This PR implements a fast PyTorch image processor for VitPose (VitPoseImageProcessorFast) that extends the existing BaseImageProcessorFast class. The new processor provides optimized image processing capabilities specifically designed for pose estimation tasks using the VitPose model.

Key Features

  • Fast PyTorch-native implementation: Leverages PyTorch operations for efficient image processing
  • Affine transformations: Supports image cropping and resizing based on bounding boxes
  • Channel dimension handling: Automatic detection and conversion between different channel formats (CHW vs HWC)
  • Keypoint processing: Includes methods for heatmap-based keypoint extraction and refinement
  • Post-processing utilities: DARK unbiased data processing (UDP) and coordinate transformation functions
  • Optimized normalization: Handles variable channel counts (1, 3, 4 channels) with proper mean/std scaling
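
To make the box-based cropping and heatmap-based keypoint extraction listed above more concrete, here is a minimal pure-Python sketch. It is not the code from this PR: the function names (`box_to_center_size`, `heatmap_argmax`, `to_image_coords`), the `padding` default, and the plain nested-list heatmap format are all assumptions chosen for illustration. It also ignores rotation and sub-pixel refinement, and the PR itself operates on PyTorch tensors.

```python
from typing import List, Sequence, Tuple

# One (height x width) score map per keypoint; a stand-in for a tensor.
Heatmap = Sequence[Sequence[float]]


def box_to_center_size(
    box: Tuple[float, float, float, float],
    aspect_ratio: float,
    padding: float = 1.25,  # assumed value, for illustration only
) -> Tuple[Tuple[float, float], Tuple[float, float]]:
    """Turn an (x, y, w, h) box into a center and a padded crop size.

    The shorter side is grown so that width / height equals aspect_ratio,
    which keeps the person undistorted when the crop is resized.
    """
    x, y, w, h = box
    center = (x + w / 2.0, y + h / 2.0)
    if w > aspect_ratio * h:
        h = w / aspect_ratio
    else:
        w = aspect_ratio * h
    return center, (w * padding, h * padding)


def heatmap_argmax(heatmaps: Sequence[Heatmap]) -> List[Tuple[int, int, float]]:
    """Return (x, y, score) of the highest-scoring cell of each heatmap."""
    keypoints = []
    for heatmap in heatmaps:
        best = (0, 0, float("-inf"))
        for y, row in enumerate(heatmap):
            for x, score in enumerate(row):
                if score > best[2]:
                    best = (x, y, score)
        keypoints.append(best)
    return keypoints


def to_image_coords(
    x: float,
    y: float,
    heatmap_size: Tuple[int, int],
    center: Tuple[float, float],
    size: Tuple[float, float],
) -> Tuple[float, float]:
    """Map a heatmap cell (x, y) back to original-image pixel coordinates."""
    hm_w, hm_h = heatmap_size
    crop_w, crop_h = size
    # Top-left corner of the crop in the original image.
    left = center[0] - crop_w / 2.0
    top = center[1] - crop_h / 2.0
    return (left + x * crop_w / hm_w, top + y * crop_h / hm_h)
```

In this sketch, `box_to_center_size` would run before cropping, while `heatmap_argmax` and `to_image_coords` would run on the model output to place each keypoint back in the source image.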

Before submitting

Who can review?

@yonigozlan

@msminhas93 msminhas93 closed this Aug 14, 2025
@msminhas93 msminhas93 reopened this Aug 14, 2025
@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto, vitpose

@msminhas93
Author

msminhas93 commented Aug 14, 2025

Hi @yonigozlan! Could you please take a look? Thank you.
