Hi,
I usually run a model with device_map="auto" option to utilize all the cards I have.
output = model.generate(
inputs,
max_length=100,
num_return_sequences=1,
device_map="auto",
attention_mask=torch.ones_like(inputs),
pad_token_id=tokenizer.eos_token_id,
)
But it seems like using llm = Syncode(...); llm.infer(...) API, there is no direct way to turn the option on.
Is there any way to do so in the current setting?
Thank you
Hi,
I usually run a model with
device_map="auto"option to utilize all the cards I have.But it seems like using
llm = Syncode(...); llm.infer(...)API, there is no direct way to turn the option on.Is there any way to do so in the current setting?
Thank you