Up next


Transformer Neural Networks - EXPLAINED! (Attention is all you need)

2,355,852 Views
AI Lover
3
Published on 12/17/22 / In How-to & Learning

Please subscribe to keep me alive: https://www.youtube.com/c/Code....Emporium?sub_confirm

BLOG: https://medium.com/@dataemporium

MATH COURSES (7 day free trial)
πŸ“• Mathematics for Machine Learning: https://imp.i384100.net/MathML
πŸ“• Calculus: https://imp.i384100.net/Calculus
πŸ“• Statistics for Data Science: https://imp.i384100.net/AdvancedStatistics
πŸ“• Bayesian Statistics: https://imp.i384100.net/BayesianStatistics
πŸ“• Linear Algebra: https://imp.i384100.net/LinearAlgebra
πŸ“• Probability: https://imp.i384100.net/Probability

OTHER RELATED COURSES (7 day free trial)
πŸ“• ⭐ Deep Learning Specialization: https://imp.i384100.net/Deep-Learning
πŸ“• Python for Everybody: https://imp.i384100.net/python
πŸ“• MLOps Course: https://imp.i384100.net/MLOps
πŸ“• Natural Language Processing (NLP): https://imp.i384100.net/NLP
πŸ“• Machine Learning in Production: https://imp.i384100.net/MLProduction
πŸ“• Data Science Specialization: https://imp.i384100.net/DataScience
πŸ“• Tensorflow: https://imp.i384100.net/Tensorflow

REFERENCES
[1] The main Paper: https://arxiv.org/abs/1706.03762
[2] Tensor2Tensor has some code with a tutorial: https://www.tensorflow.org/tut....orials/text/transfor
[3] Transformer very intuitively explained - Amazing: http://jalammar.github.io/illustrated-transformer/
[4] Medium Blog on intuitive explanation: https://medium.com/inside-mach....ine-learning/what-is
[5] Pretrained word embeddings: https://nlp.stanford.edu/projects/glove/
[6] Intuitive explanation of Layer normalization: https://mlexplained.com/2018/1....1/30/an-overview-of-
[7] Paper that gives even better results than transformers (Pervasive Attention): https://arxiv.org/abs/1808.03867
[8] BERT uses transformers to pretrain neural nets for common NLP tasks. : https://ai.googleblog.com/2018..../11/open-sourcing-be
[9] Stanford Lecture on RNN: http://cs231n.stanford.edu/sli....des/2018/cs231n_2018
[10] Colah’s Blog: https://colah.github.io/posts/....2015-08-Understandin
[11] Wiki for timeseries of events: https://en.wikipedia.org/wiki/....Transformer_(machine

Show more
0 Comments sort Sort By

Up next