Build a small language model from scratch: Assemble the model architecture
0
0
0 Views
Published on 12/14/25 / In
How-to & Learning
Dr. Raj Dandekar (MIT PhD) conducted a 7-hour small language model workshop. This is part 2 of that workshop.
In this, we cover the following four aspects:
1. 00:00 Recap of part 1: data pre-processing, tokenization, input-output pairs
2. 10:55 Building blocks of the SLM architecture
3 01:08:34 Attention Mechanism Explained
If you want to get access to the detailed lecture notes, register here: https://vizuara.ai/courses/bui....ld-slm-from-scratch-
Show more
0 Comments
sort Sort By
Generative AI