Build a small language model from scratch: Data pre-processing

6 Views · 12/14/25
Generative AI

Dr. Raj Dandekar (MIT PhD) conducted a 7-hour workshop on building a small language model (SLM). This is part 1 of that workshop.

This part covers the following four topics:

1. 00:00 Introduction to SLM
2. 40:15 Dataset loading
3. 53:36 Tokenization
4. 01:24:42 Creating input-output pairs
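
As a rough illustration of the last topic, here is a minimal sketch of how input-output pairs are commonly built for next-token prediction: the target is the input window shifted one token to the right. This is not taken from the workshop itself. The function name make_input_output_pairs, the context_length parameter, and the toy whitespace "tokenizer" are all assumptions made for the example, and the workshop may use a different tokenizer and batching scheme.

```python
def make_input_output_pairs(token_ids, context_length):
    """Slide a fixed-size window over token_ids.

    Each target is the corresponding input shifted one position right,
    which is the usual next-token prediction setup (assumed here).
    """
    pairs = []
    for start in range(len(token_ids) - context_length):
        x = token_ids[start : start + context_length]
        y = token_ids[start + 1 : start + context_length + 1]
        pairs.append((x, y))
    return pairs


# Toy stand-in for a real tokenizer (hypothetical): one ID per distinct word.
text = "the cat sat on the mat"
words = text.split()
vocab = {word: idx for idx, word in enumerate(sorted(set(words)))}
token_ids = [vocab[w] for w in words]

pairs = make_input_output_pairs(token_ids, context_length=4)
for x, y in pairs:
    print("input:", x, "-> target:", y)
```

With six tokens and a context length of 4, this yields two pairs. A real pipeline would typically use a subword tokenizer and sample windows in batches rather than materializing every pair.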

To access the detailed lecture notes, register here: https://vizuara.ai/courses/bui....ld-slm-from-scratch-
