declare required ACP version for knative operator #129
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -2,15 +2,14 @@ | |
| weight: 35 | ||
| --- | ||
|
|
||
| # Install Alauda AI Generative | ||
| # Install Alauda Build of KServe | ||
|
|
||
| **Alauda AI Generative** is a cloud-native component built on **KServe** for serving generative AI models. As an extension of the Alauda AI ecosystem, it specifically optimizes for **Large Language Models (LLMs)**, offering essential features such as inference orchestration, streaming responses, and resource-based auto-scaling for generative workloads. | ||
| **Alauda Build of KServe** is a cloud-native component built on **KServe** for serving generative AI models. As an extension of the Alauda AI ecosystem, it specifically optimizes for **Large Language Models (LLMs)**, offering essential features such as inference orchestration, streaming responses, and resource-based auto-scaling for generative workloads. | ||
|
|
||
| --- | ||
|
|
||
| ## Prerequisites | ||
|
|
||
| Before installing **Alauda AI Generative**, you need to ensure the following dependencies are installed: | ||
| Before installing **Alauda Build of KServe**, you need to ensure the following dependencies are installed: | ||
|
|
||
| ### Required Dependencies | ||
|
|
||
|
|
@@ -28,20 +27,20 @@ Before installing **Alauda AI Generative**, you need to ensure the following dep | |
|
|
||
| | Dependency | Type | Description | | ||
| |------------|------|-------------| | ||
| | GIE | Built-in | Integrated GIE (gateway-api-inference-extension) for enhanced AI capabilities. Can be enabled through the Alauda AI Generative UI. | | ||
| | GIE | Built-in | Integrated GIE (gateway-api-inference-extension) for enhanced AI capabilities. Can be enabled through the Alauda Build of KServe UI. | | ||
| | Alauda AI | Operator | Required only if you need to use KServe Predictive AI functionality. Can be disabled if you only need LLM Generative AI functionality. | | ||
|
|
||
| ### Installation Notes | ||
|
|
||
| 1. **Required Dependencies**: All three required dependencies must be installed before installing Alauda AI Generative. | ||
| 2. **GIE Integration**: If you want to use GIE, you can enable it during the installation process by selecting the "Integrated GIE" option in the Alauda AI Generative UI. | ||
| 1. **Required Dependencies**: All three required dependencies must be installed before installing Alauda Build of KServe. | ||
| 2. **GIE Integration**: If you want to use GIE, you can enable it during the installation process by selecting the "Integrated GIE" option in the Alauda Build of KServe UI. | ||
| 3. **Alauda AI Integration**: If you don't need KServe Predictive AI functionality and only want to use LLM Generative AI, you can disable the "Integrated With Alauda AI" option during installation. | ||
|
|
||
| ## Downloading Cluster Plugin | ||
|
|
||
| :::info | ||
|
|
||
| `Alauda AI Generative` cluster plugin can be retrieved from Customer Portal. | ||
| `Alauda Build of KServe` cluster plugin can be retrieved from Customer Portal. | ||
|
|
||
| Please contact Consumer Support for more information. | ||
|
|
||
|
|
@@ -51,9 +50,9 @@ Please contact Consumer Support for more information. | |
|
|
||
| For more information on uploading the cluster plugin, please refer to <ExternalSiteLink name="acp" href="ui/cli_tools/index.html#uploading-cluster-plugins" children="Uploading Cluster Plugins" /> | ||
|
|
||
| ## Installing Alauda AI Generative | ||
| ## Installing Alauda Build of KServe | ||
|
|
||
| 1. Go to the `Administrator` -> `Marketplace` -> `Cluster Plugin` page, switch to the target cluster, and then deploy the `Alauda AI Generative` Cluster plugin. | ||
| 1. Go to the `Administrator` -> `Marketplace` -> `Cluster Plugin` page, switch to the target cluster, and then deploy the `Alauda Build of KServe` Cluster plugin. | ||
|
|
||
| 2. In the deployment form, configure the following parameters as needed: | ||
|
|
||
|
|
@@ -99,7 +98,7 @@ For more information on uploading the cluster plugin, please refer to <ExternalS | |
|
|
||
| 4. Verify result. You can see the status of "Installed" in the UI. | ||
|
|
||
| ## Upgrading Alauda AI Generative | ||
| ## Upgrading Alauda Build of KServe | ||
|
|
||
| 1. Upload the new version for package of **Alauda AI Generative** plugin to ACP. | ||
| 2. Go to the `Administrator` -> `Clusters` -> `Target Cluster` -> `Functional Components` page, then click the `Upgrade` button, and you will see the `Alauda AI Generative` can be upgraded. | ||
| 1. Upload the new version for package of **Alauda Build of KServe** plugin to ACP. | ||
| 2. Go to the `Administrator` -> `Clusters` -> `Target Cluster` -> `Functional Components` page, then click the `Upgrade` button, and you will see the `Alauda Build of KServe` can be upgraded. | ||
|
Comment on lines +103 to +104: Tighten the upgrade wording.
||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -46,9 +46,9 @@ The diagram below illustrates the architecture of the Alauda AI platform. | |
|
|
||
| | Component | Description | Type | License | | ||
| | --- | --- | --- | --- | | ||
| | Kserve (Alauda AI Model Serving/Alauda Generative AI) | Kubernetes-native model serving framework | Open source | Apache Version 2.0 | | ||
| | vLLM (Alauda AI Model Serving/Alauda Generative AI) | High-performance model inference engine for large language models | Open source | Apache Version 2.0 | | ||
| | llm-d (Alauda Generative AI) | Distributed inference engine for large language models | Open source | Apache Version 2.0 | | ||
| | Kserve (Alauda AI/Alauda Build of KServe) | Kubernetes-native model serving framework | Open source | Apache Version 2.0 | | ||
| | vLLM (Alauda AI/Alauda Build of KServe) | High-performance model inference engine for large language models | Open source | Apache Version 2.0 | | ||
|
Review comment (analysis chain), scripts executed against repository alauda/aml-docs:
Script executed: `cat -n docs/en/overview/architecture.mdx | sed -n '45,55p'` (first, check the architecture.mdx file around line 50; length of output: 1199)
Script executed: `cat -n docs/en/model_inference/inference_service/functions/inference_service.mdx | sed -n '38,52p'` (check the inference_service.mdx file around the mentioned lines; length of output: 903)
Script executed: `rg -n "vLLM" docs/ --type markdown` (search for other references to vLLM in the documentation; length of output: 41)
Clarify the … The label …
||
| | llm-d (Alauda Build of KServe) | Distributed inference engine for large language models | Open source | Apache Version 2.0 | | ||
| | Model as a Service (Alauda build of Envoy AI Gateway) | API gateway for serving AI models as a service | Open source | Apache Version 2.0 | | ||
| | Fine-tuning | Tools integrated with the workbench for fine-tuning large language models, e.g. transformers, accelerate, llama-factory etc. | Open source | - | | ||
| | Training (Alauda support for Kubeflow Trainer v2) | Kubernetes-native training job management | Open source | Apache Version 2.0 | | ||
|
|
||
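Since this change renames a product across several pages, a scan for leftover occurrences of the old names can help confirm that none were missed. The following is a minimal sketch and not part of the PR: the function name `find_stale_names`, the choice of old names (copied from the removed lines in the diffs above), and the assumption that the docs are `.md`/`.mdx` files under a single directory are all illustrative.

```python
from pathlib import Path

# Old product names that the diffs above replace (assumed list, taken from
# the removed lines; extend it if other spellings exist in the repository).
OLD_NAMES = ("Alauda AI Generative", "Alauda Generative AI")


def find_stale_names(root, names=OLD_NAMES, suffixes=(".md", ".mdx")):
    """Return (path, line_number, line) for each line that still has an old name."""
    hits = []
    for path in sorted(Path(root).rglob("*")):
        # Only look at regular documentation files.
        if not path.is_file() or path.suffix not in suffixes:
            continue
        text = path.read_text(encoding="utf-8", errors="replace")
        for number, line in enumerate(text.splitlines(), start=1):
            if any(name in line for name in names):
                hits.append((str(path), number, line.strip()))
    return hits
```

Calling `find_stale_names("docs")` from the repository root would return one tuple per remaining match. An empty list suggests the rename is complete for the names listed, though it says nothing about spellings not in `OLD_NAMES`.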