Llama 3.3 (70B) Finetuning - now with 90K context length and fits in <41GB VRAM.
What changed in the latest Unsloth version (2025.2.15) vs 2025.1.5 that could cause training failures?
Day 6: One More Thing, DeepSeek-V3/R1 Inference System Overview
Phi-4-mini Bug Fixes + GGUFs
DeepSeek Release 5th Bomb! Cluster Bomb Again! 3FS (distributed file system) & smallpond (a lightweight data processing framework)
Microsoft announces Phi-4-multimodal and Phi-4-mini
[P] Train your own Reasoning model - GRPO works on just 5GB VRAM
DeepSeek Release 4th Bomb! DualPipe, an innovative bidirectional pipeline parallelism algorithm
What would you like to see in Unsloth for 2025?
You can now train your own Reasoning model with just 5GB VRAM
DeepSeek Release 3rd Bomb! DeepGEMM, a library for efficient FP8 General Matrix Multiplication
DeepSeek 2nd OSS package - DeepEP - Expert parallel FP8 MoE kernels
DeepSeek Release 2nd Bomb! DeepEP, a communication library tailored for MoE models
FlashMLA - Day 1 of OpenSourceWeek
OpenAI Triton Course/Tutorial Recommendations
Perplexity R1 Llama 70B Uncensored GGUFs & Dynamic 4bit quant
10x longer contexts for reasoning training - 90% less memory GRPO in Unsloth
We GRPO-ed a 1.5B model to test LLM Spatial Reasoning by solving MAZE
R1-1776 Dynamic GGUFs by Unsloth
PerplexityAI releases R1-1776, a DeepSeek-R1 finetune that removes Chinese censorship while maintaining reasoning capabilities
Unsloth is the #1 trending repo on GitHub!
[P] GRPO fits in 8GB VRAM - DeepSeek R1 Zero's recipe
You can now train your own DeepSeek-R1 model on your local device!
Train your own Reasoning model - 80% less VRAM - GRPO now in Unsloth (7GB VRAM min.)