Skip to content

[Infer] Serving example w/ ray-serve (multiple GPU case)#4841

Merged
yuanheng-zhao merged 12 commits intohpcaitech:mainfrom
yuanheng-zhao:colossal-infer/ray-serve
Oct 2, 2023
Merged

[Infer] Serving example w/ ray-serve (multiple GPU case)#4841
yuanheng-zhao merged 12 commits intohpcaitech:mainfrom
yuanheng-zhao:colossal-infer/ray-serve

Conversation

@yuanheng-zhao
Copy link
Copy Markdown
Contributor

@yuanheng-zhao yuanheng-zhao commented Sep 29, 2023

📌 Checklist before creating the PR

  • I have created an issue for this PR for traceability
  • The title follows the standard format: [doc/gemini/tensor/...]: A concise description
  • I have added relevant tags if possible for us to better distinguish different PRs

🚨 Issue number

📝 What does this PR do?

  • This PR adds necessary files for Colossal Inference serving with Ray Serve. Steps of deployment were added into README.
  • This demo is used for testing and demonstrating the usage of Colossal Inference from colossalai.inference.tensor_parallel with ray serve. Serving for multiple-gpu inference is supported (e.g. tensor parallel with colossalai).

💥 Checklist before requesting a review

  • I have linked my PR to an issue (instruction)
  • My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
  • I have performed a self-review of my code
  • I have added thorough tests.
  • I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

  • 🌝 Yes, I do.
  • 🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

@yuanheng-zhao yuanheng-zhao added the example example-related issuer or pull request label Sep 29, 2023
Comment thread colossalai/kernel/triton/__init__.py Outdated
@yuanheng-zhao yuanheng-zhao marked this pull request as ready for review September 29, 2023 08:01
@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Oct 2, 2023

The code coverage for the changed files is 61%.

Click me to view the complete report
Name                                                        Stmts   Miss  Cover
-------------------------------------------------------------------------------
colossalai/inference/tensor_parallel/modeling/__init__.py       4      4     0%
colossalai/kernel/triton/__init__.py                           14      3    79%
-------------------------------------------------------------------------------
TOTAL                                                          18      7    61%

@yuanheng-zhao yuanheng-zhao requested a review from Xu-Kai October 2, 2023 03:44
Comment thread examples/inference/serving/ray_serve/Colossal_Inference_rayserve.py Outdated
Comment thread colossalai/inference/tensor_parallel/modeling/__init__.py
@yuanheng-zhao yuanheng-zhao merged commit 573f270 into hpcaitech:main Oct 2, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

example example-related issuer or pull request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants