feature: unbuffered token stream #109

@mudler

This should now be fairly easy, at least for the llama.cpp backend. Thanks to @noxer's contribution in go-skynet/go-llama.cpp#28 ( ❤️ ), it's just a matter of wiring things up in the SSE callback here in the server.

  • go-llama.cpp
  • gpt4all.cpp
  • gpt2.cpp
  • rwkv.cpp
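
For illustration, here is a rough sketch of what "wiring things up in the SSE callback" could look like using only the Go standard library. It is not the server's actual code: `tokenCallback`, `predictFunc`, `formatSSE`, `streamHandler` and `fakePredict` are all hypothetical names, and the callback signature exposed by go-llama.cpp may well differ. The idea is simply that each token is written and flushed as soon as the backend hands it over, instead of buffering the full completion.

```go
package main

import (
	"fmt"
	"net/http"
	"strings"
)

// tokenCallback receives each token as soon as the model produces it.
// Returning false asks the backend to stop generating.
// (Hypothetical type; the real backend callback may look different.)
type tokenCallback func(token string) bool

// predictFunc is a hypothetical stand-in for a backend's predict call.
type predictFunc func(prompt string, onToken tokenCallback) error

// formatSSE encodes one token as a Server-Sent Events message.
// A payload containing newlines needs one "data:" field per line,
// and a blank line terminates the event.
func formatSSE(token string) string {
	var b strings.Builder
	for _, line := range strings.Split(token, "\n") {
		b.WriteString("data: " + line + "\n")
	}
	b.WriteString("\n")
	return b.String()
}

// streamHandler adapts a predictFunc to an unbuffered SSE endpoint.
func streamHandler(predict predictFunc) http.HandlerFunc {
	return func(w http.ResponseWriter, r *http.Request) {
		flusher, ok := w.(http.Flusher)
		if !ok {
			http.Error(w, "streaming unsupported", http.StatusInternalServerError)
			return
		}
		w.Header().Set("Content-Type", "text/event-stream")
		w.Header().Set("Cache-Control", "no-cache")

		err := predict(r.URL.Query().Get("prompt"), func(token string) bool {
			if _, werr := fmt.Fprint(w, formatSSE(token)); werr != nil {
				return false // client is gone, stop generating
			}
			flusher.Flush() // push the token to the client immediately
			return r.Context().Err() == nil
		})
		if err != nil {
			fmt.Fprint(w, formatSSE("[error] "+err.Error()))
			flusher.Flush()
		}
	}
}

// fakePredict stands in for a model: it emits the prompt word by word.
func fakePredict(prompt string, onToken tokenCallback) error {
	for _, word := range strings.Fields(prompt) {
		if !onToken(word + " ") {
			break
		}
	}
	return nil
}
```

To try it, one could register the handler from a `main` function with `http.Handle("/stream", streamHandler(fakePredict))` followed by `http.ListenAndServe(":8080", nil)` (path and port are arbitrary), then watch tokens arrive with `curl -N`. The same handler shape should work for any backend in the list above once it exposes a per-token callback.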
