Model loading is slow / Time before generating #968

### Describe the bug

I ran a simple example on a T4 GPU (Google Cloud):
```
import torch
from diffusers import StableDiffusionPipeline

# Hugging Face access token (left empty here)
access_token = ""

# Load the fp16 weights of Stable Diffusion v1.4
pipe = StableDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4",
    revision="fp16",
    torch_dtype=torch.float16,
    use_auth_token=access_token,
)
# Move the whole pipeline to the GPU
pipe = pipe.to("cuda")

# Generate a single image from the prompt and save it
prompt = "a photo of an astronaut riding a horse on mars"
image = pipe(prompt).images[0]

image.save("astronaut_rides_horse_1.png")
```

Generation takes 10-15 sec per image, which is a good result.
The problem is the time before generation starts: pre-loading the models and files can take ~30 sec.
For example:
![20221024-penj-42kb](https://user-images.githubusercontent.com/7080230/197623318-0b0bfa75-54bf-40af-8ce6-7e22549f3839.jpg)
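
To separate the two numbers above, the load step and the generation step could be timed independently. This is only a minimal sketch: `timed`, `timings`, `run_once`, and the `load_pipeline` callable are hypothetical names introduced for illustration and are not part of diffusers. `load_pipeline` is assumed to wrap the `from_pretrained(...)` and `.to("cuda")` calls from the snippet above.

```python
import time
from contextlib import contextmanager

# Wall-clock seconds per phase, filled in by `timed` (hypothetical helper).
timings = {}


@contextmanager
def timed(label):
    """Record the wall-clock time spent inside the with-block under `label`."""
    start = time.perf_counter()
    try:
        yield
    finally:
        timings[label] = time.perf_counter() - start


def run_once(load_pipeline, prompt):
    """Time pipeline loading and image generation separately.

    `load_pipeline` is an assumed zero-argument callable that returns a
    ready-to-use pipeline (e.g. from_pretrained(...) followed by .to("cuda")).
    """
    with timed("load"):
        pipe = load_pipeline()
    with timed("generate"):
        image = pipe(prompt).images[0]
    return image
```

After calling `run_once(...)`, `timings["load"]` and `timings["generate"]` hold the two durations, which would show how much of the total is spent before generation starts.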


How can we speed up the preloading?

### Reproduction

_No response_

### Logs

_No response_

### System Info

Google Cloud - T4
