Skip to content
This repository was archived by the owner on Jun 24, 2024. It is now read-only.
This repository was archived by the owner on Jun 24, 2024. It is now read-only.

Metal Prompt Feeding #403

@jafioti

Description

@jafioti

I'm trying to run llama on mac using metal, but I noticed on the accelerators doc it states metal cannot be used for feeding in a prompt with more than 1 token. Is this an underlying limitation with ggml, or llm?

I'd love to help enable this, but I'm not sure where to begin.

Metadata

Metadata

Assignees

No one assigned

    Labels

    issue:bugSomething isn't workingtopic:metalhttps://developer.apple.com/metal/ support

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions