Learning guide

LoRA 101

Start with the mental model, then learn why low-rank adapters work, how LoRA training differs from full fine-tuning, and when QLoRA or full fine-tuning makes more sense.

Start guide →

LessonsAdd future lessons as MDX files

What LoRA Is

Start with the core LoRA mental model: a frozen base model plus a small trainable adapter.

4 min 02

The Low-Rank Trick

Understand how two small matrices can represent a useful model update and why rank controls adapter capacity.

5 min 03

How LoRA Training Works

Follow the training loop and see why only adapter weights update while the frozen base model still runs.

5 min 04

Using LoRA In Practice

Learn how adapters are saved, loaded, swapped, merged, and placed into target modules.

5 min 05

QLoRA And Tradeoffs

Learn how quantization changes the memory picture and when LoRA is a strong fit versus the wrong tool.

5 min

Feedback

Each lesson has a quick usefulness check. I only show the public useful count; written notes stay private and help shape future revisions.

Sources

LoRA 101

What LoRA Is

The Low-Rank Trick

How LoRA Training Works

Using LoRA In Practice

QLoRA And Tradeoffs

LoRA: Low-Rank Adaptation of Large Language Models ↗

QLoRA: Efficient Finetuning of Quantized LLMs ↗

Hugging Face PEFT LoRA Guide ↗

Thinking Machines LoRA Primer ↗

Thinking Machines: LoRA Without Regret ↗