Skip to content

[Infer] Colossal-Inference serving example w/ TorchServe (single GPU case)#4771

Merged
yuanheng-zhao merged 12 commits intohpcaitech:mainfrom
yuanheng-zhao:colossal-inference/torchserve
Oct 2, 2023
Merged

[Infer] Colossal-Inference serving example w/ TorchServe (single GPU case)#4771
yuanheng-zhao merged 12 commits intohpcaitech:mainfrom
yuanheng-zhao:colossal-inference/torchserve

Conversation

@yuanheng-zhao
Copy link
Copy Markdown
Contributor

@yuanheng-zhao yuanheng-zhao commented Sep 21, 2023

📌 Checklist before creating the PR

  • I have created an issue for this PR for traceability
  • The title follows the standard format: [doc/gemini/tensor/...]: A concise description
  • I have added relevant tags if possible for us to better distinguish different PRs

🚨 Issue number

📝 What does this PR do?

  • This PR adds necessary files for Colossal Inference serving with TorchServe. Steps of deployment were added into README.
  • This demo is used for testing and demonstrating the usage of Colossal Inference from colossalai.inference.tensor_parallel with TorchServe. For now, serving for single-gpu inference is supported.

💥 Checklist before requesting a review

  • I have linked my PR to an issue (instruction)
  • My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
  • I have performed a self-review of my code
  • I have added thorough tests.
  • I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

  • 🌝 Yes, I do.
  • 🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

@yuanheng-zhao yuanheng-zhao marked this pull request as ready for review September 22, 2023 09:04
@yuanheng-zhao yuanheng-zhao added the example example-related issuer or pull request label Sep 26, 2023
Comment thread examples/inference/serving/torch_serve/Colossal_Inference_Handler.py Outdated
@tiandiao123 tiandiao123 changed the title [Infer] Colossal-Inference serving example w/ TorchServe [Infer] Colossal-Inference serving example w/ TorchServe (single GPU case) Sep 26, 2023
Comment thread examples/inference/serving/torch_serve/Colossal_Inference_Handler.py Outdated
Comment thread examples/inference/serving/torch_serve/Colossal_Inference_Handler.py Outdated
FoolPlayer
FoolPlayer previously approved these changes Sep 28, 2023
@FoolPlayer FoolPlayer dismissed their stale review September 28, 2023 10:00

wrong click

@yuanheng-zhao yuanheng-zhao merged commit 3a74eb4 into hpcaitech:main Oct 2, 2023
@yuanheng-zhao yuanheng-zhao deleted the colossal-inference/torchserve branch October 2, 2023 09:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

example example-related issuer or pull request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants