Convert user intent to a GCP Batch job
In a typical workflow, GCP Batch users must manually translate their intent into a configuration file that the Batch API understands before creating a Batch job.
```mermaid
graph TD;
UserIntent("User Intent") -->|Manually Crafting| BatchJobConfig("Batch Job Config");
BatchJobConfig --> BatchAPI("Batch API");
BatchAPI --> ComputeResources("Compute Resources");
```
We can leverage the Gemini API to convert user intent into a job configuration, and then feed that configuration directly to the Batch API.
```mermaid
graph TD;
Prompt("Prompt") -.->|Examples & Rules| GeminiAPI("Gemini API");
UserIntent("User Intent") --> GeminiAPI;
GeminiAPI --> BatchJobConfig("Batch Job Config");
BatchJobConfig --> BatchAPI("Batch API");
BatchAPI --> ComputeResources("Compute Resources");
```
Make sure you have Python and gcloud installed in your local environment.
- Clone this repo.
- Enable the Vertex AI API in your GCP project.
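For example, you can enable it from the command line:
```sh
gcloud services enable aiplatform.googleapis.com
```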
- Configure gcloud to use the same project as the default project.
```sh
gcloud config set project PROJECT_ID
```
- Create a Python virtual environment.
```sh
cd intent-2-batch
virtualenv intent-2-batch
source intent-2-batch/bin/activate
```
- Install the required library.
```sh
pip install --upgrade google-cloud-aiplatform
```
- Run it!
```sh
python intent2batch.py
```
- Provide your batch job description. Example:
```
--------------------------------------------------------------------------------
Describe your desired batch job: I want to create a test batch job which is not mission critical, but I want to create 5 different tasks to run the same test script, please generate the JSON.
--------------------------------------------------------------------------------
Generated content:
{
  "taskGroups": [
    {
      "taskSpec": {
        "runnables": [
          {
            "script": {
              "text": "echo Hello world! This is task ${BATCH_TASK_INDEX}. This job has a total of ${BATCH_TASK_COUNT} tasks."
            }
          }
        ],
        "computeResource": {
          "cpuMilli": 2000,
          "memoryMib": 2000
        },
        "maxRetryCount": 1,
        "maxRunDuration": "3600s"
      },
      "taskCount": 5,
      "parallelism": 5
    }
  ],
  "allocationPolicy": {
    "instances": [
      {
        "policy": { "machineType": "e2-standard-4", "provisioningModel": "SPOT" }
      }
    ]
  },
  "logsPolicy": {
    "destination": "CLOUD_LOGGING"
  }
}
```
- Update your batch job.
Each time a job config is generated, the program asks whether you are happy with it. If you select N (not satisfied), the program lets you describe the fields you want to update, as in the example below:
```
Are you happy with the job config: Y/N
N
--------------------------------------------------------------------------------
Describe the field you want to update: I want to update machine type as e2-medium.
--------------------------------------------------------------------------------
Generated job config:
{
  "taskGroups": [
    {
      "taskSpec": {
        "runnables": [
          {
            "script": {
              "text": "echo Hello world! This is task ${BATCH_TASK_INDEX}. This job has a total of ${BATCH_TASK_COUNT} tasks."
            }
          }
        ],
        "computeResource": {
          "cpuMilli": 2000,
          "memoryMib": 2000
        },
        "maxRetryCount": 1,
        "maxRunDuration": "3600s"
      },
      "taskCount": 5,
      "parallelism": 5
    }
  ],
  "allocationPolicy": {
    "instances": [
      {
        "policy": { "machineType": "e2-medium", "provisioningModel": "SPOT" }  # changed from "e2-standard-4" to "e2-medium"
      }
    ]
  },
  "logsPolicy": {
    "destination": "CLOUD_LOGGING"
  }
}
```
You can update one field at a time or multiple fields at once. For example, you can also describe the fields you want to update as "I want to update task count to be 10, machine type to be e2-medium, and max retry count to be 3."
- Submit the batch job once you are satisfied.
If you select "Y" to confirm you are happy with the generated job config, the program asks for the job name and location, and then submits the job for you through gcloud:
```
--------------------------------------------------------------------------------
Submitting batch job...
gcloud batch jobs submit example-ai-job-2 --location us-central1 --config job_config.json
--------------------------------------------------------------------------------
```
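Once the job is submitted, you can check its status with gcloud, for example (using the job name and location above):
```sh
gcloud batch jobs describe example-ai-job-2 --location us-central1
```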
- Enter "exit" to exit the program.
A prompt includes Input, Context, and Examples (Reference).
We run a multi-turn generation between the client and the Gemini API. In the first turn we provide Input, Context, and Examples, loaded from prompt.md. In all following turns we provide only Input, expecting (praying) that the API gives us meaningful configurations based on its existing knowledge and what we have taught it.
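As a rough sketch (not the actual intent2batch.py implementation; the model name, project ID, and prompt wiring here are assumptions), the multi-turn loop with the Vertex AI SDK looks something like this:

```python
# Hypothetical sketch of the multi-turn loop described above; the real
# logic lives in intent2batch.py. PROJECT_ID, the region, and the model
# name are placeholders to adjust for your environment.
import vertexai
from vertexai.generative_models import GenerativeModel

vertexai.init(project="PROJECT_ID", location="us-central1")

# The Context and Examples preamble from prompt.md is sent only on the first turn.
with open("prompt.md") as f:
    preamble = f.read()

# A lower temperature tends to give more deterministic configs; worth experimenting.
model = GenerativeModel("gemini-1.0-pro", generation_config={"temperature": 0.2})
chat = model.start_chat()

# First turn: preamble + the user's intent.
intent = input("Describe your desired batch job: ")
print(chat.send_message(preamble + "\n\nInput: " + intent).text)

# Follow-up turns send only the new instruction; the chat history
# carries the context and examples forward.
update = input("Describe the field you want to update: ")
print(chat.send_message(update).text)
```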
It's interesting to see that the Gemini API uses not just our training data (the examples in prompt.md) but also knowledge from the pre-trained model. For example, the API generates different values for machineType and provisioningModel even though we haven't included these in our training data. There are many more things we can explore and improve!
- Try different parameters/models.
- How much does the prompt help? We can compare our result with the response from the plain Gemini API and see whether our current prompt really makes a difference.
- If we raise the temperature, will the output be better or worse?
- What about other models, such as the Codey API models? I did a few manual tests and Gemini seems to return better results.
- Improve prompt.md
- Will more training examples improve the response quality?
- Can we experiment with more rules, for example always generating a script job config instead of a container job config? (See the first sketch after this list for the two styles.)
- Can we prompt Gemini to handle more complex logic? For example, choosing between `policy` and `instanceTemplate` when defining the underlying compute resources (see the second sketch after this list).
- Help customer support
- Can Gemini help answer customer questions, for example those on https://www.googlecloudcommunity.com?
- General exploration of the Gemini API's capabilities, to see what interesting configurations it can produce.
- Make the tool more practical.
- Asking for the target GCP project ID and job name before creating the job.
- Introducing an extra step to allow users to modify the generated config before submitting through gcloud.
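For the script-vs-container rule above: the two styles differ only in the `runnables` entry. A minimal illustration following the public Batch API schema (the image URI and commands are placeholders):

```json
"runnables": [
  { "script": { "text": "echo Hello from a script runnable" } }
]
```

versus

```json
"runnables": [
  { "container": { "imageUri": "gcr.io/PROJECT_ID/IMAGE", "commands": ["echo", "Hello from a container runnable"] } }
]
```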
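And for the `policy` vs `instanceTemplate` choice: per the Batch API reference, each entry of `allocationPolicy.instances[]` takes either an inline `policy` or the name of an existing Compute Engine instance template (the template name below is a placeholder):

```json
"allocationPolicy": {
  "instances": [
    { "policy": { "machineType": "e2-medium", "provisioningModel": "SPOT" } }
  ]
}
```

versus

```json
"allocationPolicy": {
  "instances": [
    { "instanceTemplate": "INSTANCE_TEMPLATE_NAME" }
  ]
}
```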