LLMs from Scratch

Repository and accompanying book materials that guide readers to build a working LLM from first principles.

Author: Sebastian Raschka

Added Date: 2025-10-04

Open Source Since: 2023-07-23

Overview

LLMs from Scratch is the official code repository accompanying the book “Build a Large Language Model (From Scratch)”. It walks readers through implementing, pretraining, and fine-tuning GPT-like models using clear, educational code and explanations.

Key features

Chapter-aligned code examples covering tokenization, attention mechanisms, model implementation, and training loops.
Exercises, notebooks, and optional setup guides for running experiments on local machines and GPUs.
Focus on clarity and reproducibility for learning and research prototyping.

Use cases

Teaching and self-study to understand LLM internals.
Reference implementation for prototyping model components and training recipes.

Technical notes

Implemented primarily in PyTorch with attention to numerical stability and engineering practices.
Includes scripts for pretraining, fine-tuning, and evaluation, as well as optional performance improvements.

LLMs from Scratch

Overview

Key features

Use cases

Technical notes

Resource Info

Related Resources

A Curated List of ML System Design Case Studies

EdgeAI for Beginners

Context Engineering