Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
46 commits
Select commit Hold shift + click to select a range
96f4f76
fix(inference): attach private network (#4524)
Laure-di Feb 19, 2025
a9f7fc9
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
dda7dce
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
83824a8
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
4da9473
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
52bae61
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
2737607
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
71720dd
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
d20db76
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
710dfca
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
d7c8592
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
d3a0fd9
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
e1ce1e3
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
33dc6bb
feat(audit_trail): add account user and organization resources (#4717)
scaleway-bot May 15, 2025
e8b8f21
chore: bump sdk (#4751)
scaleway-bot May 22, 2025
97c89b1
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
efc1c59
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
6504774
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
18c3e07
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
fa894a0
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
0ffe456
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
7fddefc
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
6bed35e
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
c3be94e
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
a53458a
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
d6c2c33
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
62d092a
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
6298daa
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
fc62f7e
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
43fe8c7
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
eb1a3d0
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
3d036e5
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
7f027f3
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
5bb3c60
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
b0f5cd2
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
8d6f3f9
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
f4e66fc
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
fa2c5a2
chore(deps): bump golang.org/x/text from 0.22.0 to 0.23.0 (#4567)
dependabot[bot] Mar 6, 2025
35ff803
feat(secret_manager): display restore methods (#4597)
scaleway-bot Mar 19, 2025
948838b
fix golangci linter
Laure-di May 27, 2025
90444d2
feat(inference): add support inference v1
Laure-di May 27, 2025
da52d36
update documentation
Laure-di May 27, 2025
de501ca
update goldens all usage
Laure-di May 27, 2025
05bacb0
update mainUsage
Laure-di May 27, 2025
7a305cc
remove custom_deployment
Laure-di May 28, 2025
d70503c
Merge branch 'master' into add-support-v1-inference
jremy42 May 28, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
Expand Up @@ -8,20 +8,19 @@ USAGE:
ARGS:
name=<generated> Name of the deployment
[project-id] Project ID to use. If none is passed the default project ID will be used
model-name Name of the model to use
model-id ID of the model to use
[accept-eula] Accept the model's End User License Agreement (EULA).
node-type Name of the node type to use
node-type-name Name of the node type to use
[tags.{index}] List of tags to apply to the deployment
[min-size] Defines the minimum size of the pool
[max-size] Defines the maximum size of the pool
[endpoints.{index}.is-public=false] Will configure your public endpoint if true
[endpoints.{index}.private-network.private-network-id] ID of the Private Network
[endpoints.{index}.private-network.private-network-id]
[endpoints.{index}.disable-auth=false] Disable the authentication on the endpoint.
[quantization.bits] The number of bits each model parameter should be quantized to. The quantization method is chosen based on this value.
[region=fr-par] Region to target. If none is passed will use default region from the config (fr-par)

FLAGS:
-h, --help help for create
-w, --wait wait until the deployment is ready

GLOBAL FLAGS:
-c, --config string The path to the config file
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,6 @@ ARGS:

FLAGS:
-h, --help help for delete
-w, --wait wait until the deployment is ready

GLOBAL FLAGS:
-c, --config string The path to the config file
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -6,12 +6,14 @@ USAGE:
scw inference deployment update <deployment-id ...> [arg=value ...]

ARGS:
deployment-id ID of the deployment to update
[name] Name of the deployment
[tags.{index}] List of tags to apply to the deployment
[min-size] Defines the new minimum size of the pool
[max-size] Defines the new maximum size of the pool
[region=fr-par] Region to target. If none is passed will use default region from the config (fr-par)
deployment-id ID of the deployment to update
[name] Name of the deployment
[tags.{index}] List of tags to apply to the deployment
[min-size] Defines the new minimum size of the pool
[max-size] Defines the new maximum size of the pool
[model-id] Id of the model to set to the deployment
[quantization.bits] The number of bits each model parameter should be quantized to. The quantization method is chosen based on this value.
[region=fr-par] Region to target. If none is passed will use default region from the config (fr-par)

FLAGS:
-h, --help help for update
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -7,8 +7,7 @@ USAGE:

ARGS:
deployment-id ID of the deployment to create the endpoint for
[endpoint.is-public=false] Will configure your public endpoint if true
[endpoint.private-network.private-network-id] ID of the Private Network
[endpoint.private-network.private-network-id]
[endpoint.disable-auth=false] Disable the authentication on the endpoint.
[region=fr-par] Region to target. If none is passed will use default region from the config (fr-par)

Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,19 @@
🎲🎲🎲 EXIT CODE: 0 🎲🎲🎲
🟥🟥🟥 STDERR️️ 🟥🟥🟥️
Delete an existing model from your model library.

USAGE:
scw inference model delete <model-id ...> [arg=value ...]

ARGS:
model-id ID of the model to delete
[region=fr-par] Region to target. If none is passed will use default region from the config (fr-par)

FLAGS:
-h, --help help for delete

GLOBAL FLAGS:
-c, --config string The path to the config file
-D, --debug Enable debug mode
-o, --output string Output format: json or human, see 'scw help output' for more info (default "human")
-p, --profile string The config profile to use
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
🎲🎲🎲 EXIT CODE: 0 🎲🎲🎲
🟥🟥🟥 STDERR️️ 🟥🟥🟥️
Import a new model to your model library.

USAGE:
scw inference model import [arg=value ...]

ARGS:
name=<generated> Name of the model
[project-id] Project ID to use. If none is passed the default project ID will be used
[source.url]
[source.secret]
[region=fr-par] Region to target. If none is passed will use default region from the config (fr-par)

FLAGS:
-h, --help help for import

GLOBAL FLAGS:
-c, --config string The path to the config file
-D, --debug Enable debug mode
-o, --output string Output format: json or human, see 'scw help output' for more info (default "human")
-p, --profile string The config profile to use
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,9 @@ USAGE:
scw inference model <command>

AVAILABLE COMMANDS:
delete Delete a model
get Get a model
import Import a model
list List models

FLAGS:
Expand Down
3 changes: 1 addition & 2 deletions cmd/scw/testdata/test-all-usage-inference-usage.golden
Original file line number Diff line number Diff line change
@@ -1,12 +1,11 @@
🎲🎲🎲 EXIT CODE: 0 🎲🎲🎲
🟥🟥🟥 STDERR️️ 🟥🟥🟥️
This API allows you to manage your Inference services.
This API allows you to handle your Managed Inference services.

USAGE:
scw inference <command>

AVAILABLE COMMANDS:
acl Access Control List (ACL) management commands
deployment Deployment commands
endpoint Endpoint management commands
model Models commands
Expand Down
4 changes: 2 additions & 2 deletions cmd/scw/testdata/test-main-usage-usage.golden
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,8 @@
USAGE:
scw <command>

AI COMMANDS:
inference This API allows you to manage your Inference services
AVAILABLE COMMANDS:
inference This API allows you to handle your Managed Inference services

BAREMETAL COMMANDS:
apple-silicon Apple silicon API
Expand Down
2 changes: 1 addition & 1 deletion commands/commands.go
Original file line number Diff line number Diff line change
Expand Up @@ -25,7 +25,7 @@ import (
function "github.com/scaleway/scaleway-cli/v2/internal/namespaces/function/v1beta1"
"github.com/scaleway/scaleway-cli/v2/internal/namespaces/help"
iam "github.com/scaleway/scaleway-cli/v2/internal/namespaces/iam/v1alpha1"
inference "github.com/scaleway/scaleway-cli/v2/internal/namespaces/inference/v1beta1"
inference "github.com/scaleway/scaleway-cli/v2/internal/namespaces/inference/v1"
"github.com/scaleway/scaleway-cli/v2/internal/namespaces/info"
initNamespace "github.com/scaleway/scaleway-cli/v2/internal/namespaces/init"
"github.com/scaleway/scaleway-cli/v2/internal/namespaces/instance/v1"
Expand Down
154 changes: 53 additions & 101 deletions docs/commands/inference.md
Original file line number Diff line number Diff line change
@@ -1,12 +1,7 @@
<!-- DO NOT EDIT: this file is automatically generated using scw-doc-gen -->
# Documentation for `scw inference`
This API allows you to manage your Inference services.
This API allows you to handle your Managed Inference services.

- [Access Control List (ACL) management commands](#access-control-list-(acl)-management-commands)
- [Add new ACLs](#add-new-acls)
- [Delete an existing ACL](#delete-an-existing-acl)
- [List your ACLs](#list-your-acls)
- [Set new ACL](#set-new-acl)
- [Deployment commands](#deployment-commands)
- [Create a deployment](#create-a-deployment)
- [Delete a deployment](#delete-a-deployment)
Expand All @@ -19,101 +14,14 @@ This API allows you to manage your Inference services.
- [Delete an endpoint](#delete-an-endpoint)
- [Update an endpoint](#update-an-endpoint)
- [Models commands](#models-commands)
- [Delete a model](#delete-a-model)
- [Get a model](#get-a-model)
- [Import a model](#import-a-model)
- [List models](#list-models)
- [Node types management commands](#node-types-management-commands)
- [List available node types](#list-available-node-types)


## Access Control List (ACL) management commands

Access Control List (ACL) management commands.


### Add new ACLs

Add new ACL rules for a specific deployment.

**Usage:**

```
scw inference acl add <deployment-id ...> [arg=value ...]
```


**Args:**

| Name | | Description |
|------|---|-------------|
| deployment-id | Required | ID of the deployment to add ACL rules to |
| acls.{index}.ip | | IP address to be allowed |
| acls.{index}.description | | Description of the ACL rule |
| region | Default: `fr-par`<br />One of: `fr-par` | Region to target. If none is passed will use default region from the config |



### Delete an existing ACL

Delete an existing ACL.

**Usage:**

```
scw inference acl delete <acl-id ...> [arg=value ...]
```


**Args:**

| Name | | Description |
|------|---|-------------|
| acl-id | Required | ID of the ACL rule to delete |
| region | Default: `fr-par`<br />One of: `fr-par` | Region to target. If none is passed will use default region from the config |



### List your ACLs

List ACLs for a specific deployment.

**Usage:**

```
scw inference acl list <deployment-id ...> [arg=value ...]
```


**Args:**

| Name | | Description |
|------|---|-------------|
| deployment-id | Required | ID of the deployment to list ACL rules for |
| region | Default: `fr-par`<br />One of: `fr-par`, `all` | Region to target. If none is passed will use default region from the config |



### Set new ACL

Set new ACL rules for a specific deployment.

**Usage:**

```
scw inference acl set <deployment-id ...> [arg=value ...]
```


**Args:**

| Name | | Description |
|------|---|-------------|
| deployment-id | Required | ID of the deployment to set ACL rules for |
| acls.{index}.ip | | IP address to be allowed |
| acls.{index}.description | | Description of the ACL rule |
| region | Default: `fr-par`<br />One of: `fr-par` | Region to target. If none is passed will use default region from the config |



## Deployment commands

Deployment commands.
Expand All @@ -136,15 +44,15 @@ scw inference deployment create [arg=value ...]
|------|---|-------------|
| name | Required<br />Default: `<generated>` | Name of the deployment |
| project-id | | Project ID to use. If none is passed the default project ID will be used |
| model-name | Required | Name of the model to use |
| model-id | Required | ID of the model to use |
| accept-eula | | Accept the model's End User License Agreement (EULA). |
| node-type | Required | Name of the node type to use |
| node-type-name | Required | Name of the node type to use |
| tags.{index} | | List of tags to apply to the deployment |
| min-size | | Defines the minimum size of the pool |
| max-size | | Defines the maximum size of the pool |
| endpoints.{index}.is-public | Default: `false` | Will configure your public endpoint if true |
| endpoints.{index}.private-network.private-network-id | | ID of the Private Network |
| endpoints.{index}.private-network.private-network-id | | |
| endpoints.{index}.disable-auth | Default: `false` | Disable the authentication on the endpoint. |
| quantization.bits | | The number of bits each model parameter should be quantized to. The quantization method is chosen based on this value. |
| region | Default: `fr-par`<br />One of: `fr-par` | Region to target. If none is passed will use default region from the config |


Expand Down Expand Up @@ -254,6 +162,8 @@ scw inference deployment update <deployment-id ...> [arg=value ...]
| tags.{index} | | List of tags to apply to the deployment |
| min-size | | Defines the new minimum size of the pool |
| max-size | | Defines the new maximum size of the pool |
| model-id | | Id of the model to set to the deployment |
| quantization.bits | | The number of bits each model parameter should be quantized to. The quantization method is chosen based on this value. |
| region | Default: `fr-par`<br />One of: `fr-par` | Region to target. If none is passed will use default region from the config |


Expand All @@ -279,8 +189,7 @@ scw inference endpoint create <deployment-id ...> [arg=value ...]
| Name | | Description |
|------|---|-------------|
| deployment-id | Required | ID of the deployment to create the endpoint for |
| endpoint.is-public | Default: `false` | Will configure your public endpoint if true |
| endpoint.private-network.private-network-id | | ID of the Private Network |
| endpoint.private-network.private-network-id | | |
| endpoint.disable-auth | Default: `false` | Disable the authentication on the endpoint. |
| region | Default: `fr-par`<br />One of: `fr-par` | Region to target. If none is passed will use default region from the config |

Expand Down Expand Up @@ -332,6 +241,26 @@ scw inference endpoint update <endpoint-id ...> [arg=value ...]
Models commands.


### Delete a model

Delete an existing model from your model library.

**Usage:**

```
scw inference model delete <model-id ...> [arg=value ...]
```


**Args:**

| Name | | Description |
|------|---|-------------|
| model-id | Required | ID of the model to delete |
| region | Default: `fr-par`<br />One of: `fr-par` | Region to target. If none is passed will use default region from the config |



### Get a model

Get the model for the given ID.
Expand All @@ -352,6 +281,29 @@ scw inference model get <model-id ...> [arg=value ...]



### Import a model

Import a new model to your model library.

**Usage:**

```
scw inference model import [arg=value ...]
```


**Args:**

| Name | | Description |
|------|---|-------------|
| name | Required<br />Default: `<generated>` | Name of the model |
| project-id | | Project ID to use. If none is passed the default project ID will be used |
| source.url | | |
| source.secret | | |
| region | Default: `fr-par`<br />One of: `fr-par` | Region to target. If none is passed will use default region from the config |



### List models

List all available models.
Expand Down
9 changes: 9 additions & 0 deletions internal/namespaces/inference/v1/custom.go
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
package inference

import "github.com/scaleway/scaleway-cli/v2/core"

func GetCommands() *core.Commands {
cmds := GetGeneratedCommands()

return cmds
}
Loading