Build a small language model from scratch: Data pre-processing
0
0
0 Views
Published on 12/14/25 / In
How-to & Learning
Dr. Raj Dandekar (MIT PhD) conducted a 7-hour small language model workshop. This is part 1 of that workshop.
In this, we cover the following four aspects:
1. 00:00 Introduction to SLM
2. 40:15 Dataset loading
3. 53:36 Tokenization
4. 01:24:42 Creating input-output pairs
If you want to get access to the detailed lecture notes, register here: https://vizuara.ai/courses/bui....ld-slm-from-scratch-
Show more
0 Comments
sort Sort By
Generative AI