Fine-tuning · Unsloth AI

Unsloth

Fine-tune open LLMs 2x faster with far less VRAM. Open source.

FREEMIUMOpen sourceLocalCLILinuxWindowsmacOS

An open-source (Apache-2.0) framework for fine-tuning and running open-weight models with custom CUDA kernels — roughly 2x faster training and large VRAM savings, so 7B–13B models fit on a single consumer GPU. Free tier runs on Colab/Kaggle or locally; Pro and Enterprise tiers add multi-GPU and multi-node speedups. Exports to GGUF/Safetensors for llama.cpp, vLLM, and Ollama.

Model support

Multi-model

Llama
Qwen
Gemma
DeepSeek
Mistral
gpt-oss

Fine-tunes open-weight models (LoRA/QLoRA + full) with reduced memory use.

Where it runs

CLI
Linux
Windows
macOS

Multi-model

OpenPipe