An open-source (Apache-2.0) framework for fine-tuning and running open-weight models with custom CUDA kernels — roughly 2x faster training and large VRAM savings, so 7B–13B models fit on a single consumer GPU. Free tier runs on Colab/Kaggle or locally; Pro and Enterprise tiers add multi-GPU and multi-node speedups. Exports to GGUF/Safetensors for llama.cpp, vLLM, and Ollama.
Fine-tuning · Unsloth AI
Unsloth
Fine-tune open LLMs 2x faster with far less VRAM. Open source.
FREEMIUMOpen sourceLocalCLILinuxWindowsmacOS
Model support
Multi-model
- Llama
- Qwen
- Gemma
- DeepSeek
- Mistral
- gpt-oss
Fine-tunes open-weight models (LoRA/QLoRA + full) with reduced memory use.
Where it runs
- CLI
- Linux
- Windows
- macOS
Tags
- #fine-tuning
- #lora
- #open-source
- #training
Related in Fine-tuning
View OpenPipe details Fine-tuningFREEMIUMOpenPipe
OpenPipe
Replace frontier-model spend with a fine-tuned small model.
Captures your production OpenAI / Anthropic calls, builds a dataset, fine-tunes a small open-weights model on your traffic, then serves the swap behind your existing SDK. The pitch: 10x cost reduction at parity.
- fine-tuning
- cost-reduction
- drop-in
- open-weights