docs: document LLMLingua sidecar setup, ship sidecar/ in npm package #8

Open
jajanet wants to merge 1 commit into sliday:main from jajanet:docs/sidecar-llmlingua-setup

@jajanet jajanet commented Apr 20, 2026

The llmlingua stage is enabled by default, but silently no-ops without the Python sidecar.

I had to do some debugging to figure out why this was the case, and I'd imagine other people would want this set up and working out of the box. LLMLingua's up to 20x token compression with minimal performance loss (Jiang et al.) is pretty big. I use a Mac, though, so I'm not sure whether the issue is specific to me.

This adds setup instructions and ships sidecar/ in the npm package, so npm installs have the files needed. Also documents TAMP_LLMLINGUA_URL env var to work around the startup race condition when the sidecar is still loading the model.
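
For anyone hitting the same startup race, here is a minimal sketch of one way to wait for the sidecar before sending traffic through it. It is not what tamp itself does. `TAMP_LLMLINGUA_URL` is the variable from this PR; the fallback URL, the function names, and the assumption that any HTTP response means the model has finished loading are all hypothetical.

```python
import os
import time
import urllib.error
import urllib.request

# Hypothetical fallback; the real default address is not stated in this PR.
FALLBACK_URL = "http://127.0.0.1:8000"


def sidecar_url() -> str:
    """Read the sidecar address from TAMP_LLMLINGUA_URL, else use the fallback."""
    return os.environ.get("TAMP_LLMLINGUA_URL", FALLBACK_URL)


def http_probe(url: str, timeout: float = 2.0) -> bool:
    """Return True if anything answers at url, even with an HTTP error status."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server is listening; it just rejected this particular path.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused or timed out: the sidecar is not reachable yet.
        return False


def wait_for_sidecar(url, probe=http_probe, attempts=30, delay=1.0, sleep=time.sleep):
    """Call probe(url) up to `attempts` times, sleeping `delay` seconds between tries."""
    for attempt in range(attempts):
        if probe(url):
            return True
        if attempt < attempts - 1:
            sleep(delay)
    return False
```

Calling `wait_for_sidecar(sidecar_url())` before starting the proxy would block for at most roughly `attempts * delay` seconds plus probe timeouts, and a `False` result is a hint that the llmlingua stage would likely no-op.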
