Unsloth AI's Avatar

Unsloth AI

@unsloth.ai

Open source LLM fine-tuning! πŸ¦₯ Github: http://github.com/unslothai/unsloth Discord: https://discord.gg/unsloth

648
Followers
6
Following
24
Posts
19.11.2024
Joined
Posts Following

Latest posts by Unsloth AI @unsloth.ai

Post image

You can now fine-tune Qwen3.5 with our free notebook! πŸ”₯

You just need 5GB VRAM to train Qwen3.5-2B LoRA locally!

Unsloth trains Qwen3.5 1.5x faster with 50% less VRAM.

GitHub: github.com/unslothai/un...
Guide: unsloth.ai/docs/models/...

Qwen3.5-4B Colab: colab.research.google.com/github/unslo...

03.03.2026 14:54 πŸ‘ 42 πŸ” 5 πŸ’¬ 0 πŸ“Œ 4
Post image

Qwen releases 4 new Qwen3.5 Small models!

Qwen3.5: 0.8B β€’ 2B β€’ 4B β€’ 9B

Run Qwen3.5-0.8B, 2B and 4B on your phone. Run 9B on 6GB RAM.

The vision reasoning LLMs perform better than models 4x their size.

GGUFs: huggingface.co/collections/...
Guide: unsloth.ai/docs/models/...

02.03.2026 13:39 πŸ‘ 52 πŸ” 4 πŸ’¬ 0 πŸ“Œ 3

apparently Unsloth is on bluesky?

10.02.2026 19:01 πŸ‘ 53 πŸ” 3 πŸ’¬ 2 πŸ“Œ 0
Post image

You can now train MoE models 12Γ— faster with 35% less VRAM via our new Triton kernels (no accuracy loss).

Train gpt-oss locally on 12.8GB VRAM.

In collab with @hf.co, Unsloth trains DeepSeek, Qwen3, GLM faster.

Repo: github.com/unslothai/un...
Blog: unsloth.ai/docs/new/fas...

10.02.2026 15:37 πŸ‘ 60 πŸ” 3 πŸ’¬ 0 πŸ“Œ 1
Post image

You can now fine-tune embedding models in our free notebook!

Improve retrieval and RAG with better semantic search & similarity.

Unsloth trains 2x faster, 20% less VRAM, 2x context & no accuracy loss

Blog: unsloth.ai/docs/new/emb...
EmbeddingGemma (300M): colab.research.google.com/github/unslo...

22.01.2026 16:16 πŸ‘ 4 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

You can now train LLMs 3Γ— faster with no accuracy loss, via our new RoPE and MLP kernels.

Our Triton kernels plus smart auto packing delivers ~3Γ— faster training & 30% less VRAM vs optimized FA3 setups.

Train Qwen3-4B 3x faster on just 3.9GB VRAM.

Blog: docs.unsloth.ai/new/3x-faste...

10.12.2025 14:56 πŸ‘ 3 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Post image

You can now train OpenAI gpt-oss with Reinforcement Learning in our free notebook!

This notebook automatically creates faster kernels via RL.

Unsloth RL achieves the fastest inference & lowest VRAM vs. any setup - 0 accuracy loss

gpt-oss-20b GRPO Colab: colab.research.google.com/github/unslo...

26.09.2025 16:06 πŸ‘ 9 πŸ” 3 πŸ’¬ 0 πŸ“Œ 1
Post image

We're teaming up with Mistral and NVIDIA for an Unsloth event on Tues, Oct 21 at Y Combinator's office! πŸ¦₯

Join us in San Francisco for a night of talks, merch and more.

Food & drinks provided. RSVP required!
⭐ lu.ma/unsloth-yc

22.09.2025 14:20 πŸ‘ 5 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Post image

Run DeepSeek-V3.1 locally on 170GB RAM with our Dynamic 1-bit GGUFs!πŸ‹

The 715GB model gets reduced to 170GB (-80% size) by smartly quantizing layers.

The 1-bit GGUF passes all our code tests & we fixed the chat template.

Guide: docs.unsloth.ai/basics/deeps...
GGUF: huggingface.co/unsloth/Deep...

22.08.2025 20:13 πŸ‘ 5 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

Learn to fine-tune OpenAI gpt-oss with our new step-by-step guide! ✨

Learn about:
β€’ Local gpt-oss training + inference FAQ & tips
β€’ Evaluation, hyperparameters & overfitting
β€’ Reasoning effort, Data prep
β€’ Run & saving your model to llama.cpp GGUF, HF etc.

πŸ”—Guide: docs.unsloth.ai/basics/gpt-o...

18.08.2025 14:20 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

We made a complete Guide on Reinforcement Learning (RL) for LLMs!

Learn about:
β€’ RL's goal & why it's key to building intelligent AI agents
β€’ Why o3, Claude 4 & R1 use RL
β€’ GRPO, RLHF, DPO, reward functions
β€’ Training your own local R1 model with Unsloth

πŸ”— docs.unsloth.ai/basics/reinf...

17.06.2025 15:09 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

You can now run DeepSeek-R1-0528 locally with our Dynamic 1-bit GGUFs! πŸ‹

We shrank the full 715GB model to just 168GB (-80% size).

We achieve optimal accuracy by selectively quantizing layers.

DeepSeek-R1-0528-Qwen3-8B is also supported.

GGUFs: huggingface.co/unsloth/Deep...

30.05.2025 20:34 πŸ‘ 5 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Post image

Introducing Unsloth Dynamic v2.0 GGUFs!

v2.0 sets new benchmarks on 5-shot MMLU + KL Divergence. So, you can now run quantized LLMs with minimal accuracy loss.

For benchmarks, we built an evaluation framework to match official MMLU scores of Llama 4 & Gemma 3

Blog: docs.unsloth.ai/basics/dynam...

24.04.2025 19:12 πŸ‘ 5 πŸ” 1 πŸ’¬ 0 πŸ“Œ 1
Post image

You can now run Llama 4 on your local device! πŸ¦™

We shrank Maverick (402B) from 400GB to 122GB (-70%).

Our Dynamic 1.78-bit iMatrix GGUFs ensures optimal accuracy & size by selectively quantizing layers.

Scout + Maverick GGUFs: huggingface.co/collections/...
Guide: docs.unsloth.ai/basics/tutor...

08.04.2025 20:39 πŸ‘ 14 πŸ” 2 πŸ’¬ 0 πŸ“Œ 1

The 1.58-bit quant fits in 131GB VRAM (2Γ— H100s) for fast throughput inference at ~140 tokens/s.

For best results, use 2.51-bit Dynamic quant & at least 160GB+ combined VRAM + RAM.

Basic 1-bit & 2-bit quantization causes the model to produce repetition and poor code. Our dynamic quants solve this.

25.03.2025 23:54 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
How to Run Deepseek-V3-0324 Locally DeepSeek's V3-0324 model is the most powerful open-source model rivalling GPT 4.5 and Claude 3.7. Learn to run the model with Unsloth Dynamic quants.

You can now Run DeepSeek-V3-0324 locally using our 1.58 & 2.51-bit Dynamic GGUF!πŸ‹

We shrank the 720GB model to 131GB (-80%) by selectively quantizing layers for the best performance. Fixes basic quants breaking & bad output issues: www.unsloth.ai/blog/deepsee...

GGUF huggingface.co/unsloth/Deep...

25.03.2025 23:54 πŸ‘ 12 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
Post image

We teamed up with πŸ€—Hugging Face to release a free notebook for fine-tuning Gemma 3 with GRPO

Learn to:
β€’ Enable reasoning in Gemma 3 (1B)
β€’ Prepare/understand reward functions
β€’ Make GRPO work for tiny LLMs

Notebook: colab.research.google.com/github/unslo...
Details: huggingface.co/reasoning-co...

19.03.2025 16:31 πŸ‘ 16 πŸ” 6 πŸ’¬ 0 πŸ“Œ 0
Post image

We made a Guide to teach you how to Fine-tune LLMs correctly!

Learn about:
β€’ Choosing the right parameters & training method
β€’ RL, GRPO, DPO, CPT
β€’ Data prep, Overfitting, Evaluation
β€’ Training with Unsloth & deploy on vLLM, Ollama, Open WebUI

πŸ”—https://docs.unsloth.ai/get-started/fine-tuning-guide

10.03.2025 16:30 πŸ‘ 19 πŸ” 5 πŸ’¬ 0 πŸ“Œ 0
Post image

Tutorial: Train your own Reasoning LLM for free!

Make Llama 3.1 (8B) have chain-of-thought with DeepSeek's GRPO algorithm. Unsloth enables 90% less VRAM use.

Learn about:
β€’ Reward Functions + dataset prep
β€’ Training on free Colab GPUs
β€’ Running + Evaluating

Guide: docs.unsloth.ai/basics/reaso...

25.02.2025 18:32 πŸ‘ 10 πŸ” 3 πŸ’¬ 0 πŸ“Œ 0
Post image

For our benchmarks, a standard GRPO QLoRA setup (TRL + FA2) for Llama 3.1 (8B) at 20K context required 510.8GB VRAM. Unsloth’s GRPO algorithms reduces this to just 54.3GB.

The 5GB VRAM requirement for Qwen2.5 (1.5B) is down from 7GB in our previous GRPO release two weeks ago!

20.02.2025 18:41 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

Today, we’re launching new algorithms that enable 10x longer context lengths & 90% less VRAM for training Reasoning Models (GRPO).

Using Unsloth, you can now train your own reasoning model with just 5GB VRAM for Qwen2.5-1.5B with no accuracy loss.

Blog: unsloth.ai/blog/grpo

20.02.2025 18:41 πŸ‘ 11 πŸ” 3 πŸ’¬ 1 πŸ“Œ 0
Post image

You can now reproduce DeepSeek-R1's reasoning on your own local device!

Experience the "Aha" moment with just 7GB VRAM.

Unsloth reduces GRPO training memory use by 80%.

15GB VRAM can transform Llama-3.1 (8B) & Phi-4 (14B) into reasoning models.

Blog: unsloth.ai/blog/r1-reas...

06.02.2025 18:09 πŸ‘ 34 πŸ” 8 πŸ’¬ 0 πŸ“Œ 1
Preview
Finetune Phi-4 with Unsloth Fine-tune Microsoft's new Phi-4 model with Unsloth! We've also found & fixed 4 bugs in the model.

You can finetune Phi-4 for free on Colab now!

Unsloth finetunes LLMs 2x faster, with 70% less VRAM, 12x longer context - with no accuracy loss

Documentation: docs.unsloth.ai
We also fixed 4 bugs in Phi-4: unsloth.ai/blog/phi4

Phi-4 Colab: colab.research.google.com/github/unslo...

10.01.2025 19:55 πŸ‘ 15 πŸ” 3 πŸ’¬ 0 πŸ“Œ 0
Post image

Introducing Unsloth Dynamic 4-bit Quantization!

Naive quantization often harms accuracy but we avoid quantizing certain parameters. This achieves higher accuracy using only <10% more VRAM than BnB 4bit

Read our Blog: unsloth.ai/blog/dynamic...
Quants on Hugging Face: huggingface.co/collections/...

04.12.2024 19:51 πŸ‘ 15 πŸ” 6 πŸ’¬ 1 πŸ“Œ 2
Preview
Llama 3.2 Vision Fine-tuning with Unsloth Fine-tune Meta's Llama 3.2 Vision, Llava, Qwen 2.5 Vision models open-source 2x faster via Unsloth! Beginner friendly.

You can finetune Llama-3.2-Vision-11B for free on Colab now!

Unsloth finetunes VLMs 2x faster, with 50% less VRAM, 6x longer context - with no accuracy loss.

Other VLMs like Qwen2-VL, Llava & Pixtral are supported.

Blog: unsloth.ai/blog/vision

Fine-tuning Colabs: docs.unsloth.ai/get-started/...

21.11.2024 19:14 πŸ‘ 17 πŸ” 3 πŸ’¬ 0 πŸ“Œ 0