- Directly Prompting Off-The-Shelf LLMs
- Prompt engineering
- Multiple prompts (agentic)
- Give it questions to answer to build a "train of thought", then prompt it to actually give the action to take
- Can we use reasoning models like Qwen, GPT-OSS to accomplish the above for better performance?
- Fine-tuning
- LoRA, UnSloth (more control, lower resource usage), Axlotl (potentially better for beginners), TorchTune
- Fine-tuning based on game walkthroughs (Jericho provides it)
- Fine-tuning based on dataset from Q*BERT for question-answers (qa-jericho)
- Q*BERT testing
- Reinforcement Learning
- Input as word-embeddings? How do we do action space?
- Stable Baselines (try it)
- RAG
- Get room prompt
- Turn into embedding vector
- Use vector to access vector DB for relevant info we've learned
- Build out our action prompt by taking room prompt, any relevant info from DB, and whatever final prompt we want to give the LLM
- Semantically split room prompt and put into database
- Send prompt to LLM and take action
- Vector databases
- Redis
- Postgres with pgvector
- sqlite with sqlite-vec <- Tyson's favorite idea
- Lots of specialized options, like FAISS, QDrant, Chroma
-
Notifications
You must be signed in to change notification settings - Fork 0
lewark/cs542-adventure
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published