llama.go

Go bindings to llama.cpp

Installation

make sure you have git golang cmake gcc make installed on the system to build.

Build from source

~ git clone https://github.com/Qitmeer/llama.go.git
~ cd llama.go
~ make

Get model

Manually download the model:Hugging Face Qwen3-8B-GGUF
Please first set the storage location of the model file, which can be done using environment variables LLAMAGO_MODEL_DIR or command-line parameters model-dir
Default model files directory is ./data/models

~ ./llama --model-dir=<your_model_files_directory>
or
~ export LLAMAGO_MODEL_DIR=<your_model_files_directory>

As the startup of the server

~ ./llama --model=qwen2.5-0.5b-q8_0.gguf serve
or
~ ./llama --model=gpt-oss-20b-mxfp4.gguf --jinja serve

client:

~ ./llama run 天空为什么是蓝的

Or enable interactive mode to run:

~ ./llama run

Download Model by CLI:

~ ./llama pull gte-small-Q8_0-GGUF

or

~ ./llama pull gte-small-Q8_0-GGUF:gte-small-q8_0.gguf

or

~ ./llama pull llamago/gte-small-Q8_0-GGUF:gte-small-q8_0.gguf

Support REST API:

~ curl -s -k -X POST -H 'Content-Type: application/json' --data '{"prompt":"天空为什么是蓝的"}' http://127.0.0.1:8081/api/generate

WebUI

Enter this address http://127.0.0.1:8081 in the browser

Embedding

Local mode:

~ ./llama --model=qwen2.5-0.5b-q8_0.gguf embedding 天空为什么是蓝的 --output-file=./embs.json

Server mode:

~ curl -s -k -X POST -H 'Content-Type: application/json' --data '{"input":["天空","蓝色"]}' http://127.0.0.1:8081/api/embed
~ curl -s -k -X POST -H 'Content-Type: application/json' --data '{"prompt":"天空为什么是蓝的"}' http://127.0.0.1:8081/api/embeddings

Whisper

Firstly, you need to download the model from this address https://huggingface.co/ggerganov/whisper.cpp and then place it in LLAMAGO_MODEL_DIR or model-dir

~ ./llama --model=ggml-base.en.bin whisper --input="./your-voice.wav"

Name		Name	Last commit message	Last commit date
Latest commit History 170 Commits
.github/workflows		.github/workflows
api		api
app		app
cmd/llama		cmd/llama
common		common
config		config
core		core
format		format
model		model
runner		runner
scripts		scripts
server		server
system		system
version		version
wrapper		wrapper
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

llama.go

Installation

Get model

As the startup of the server

client:

Download Model by CLI:

WebUI

Embedding

Whisper

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

Qitmeer/llama.go

Folders and files

Latest commit

History

Repository files navigation

llama.go

Installation

Get model

As the startup of the server

client:

Download Model by CLI:

WebUI

Embedding

Whisper

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages