- FSDP Experiments
- RoPE Embeddings
- MoE Architecture
- MLA Latent Attention
- Grouped Query Attention
- Pre-Training
- LLM Evaluation
- SFT Training.
- RLHF Data + Training
- RL based on rule-based reward
Akhilez/lexical_lab
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|