How to Build LLM from Scratch (34 Pages)

Original price was: ₹34.00.Current price is: ₹0.00.

Description

By limiting the use of libraries and focusing on the math and the code, I could build the architecture up from first principles. I also put together a few handwritten notes along the way. (The handwriting is kinda cringy!)

The GPT architecture looks complex, but studied properly, everything comes down to three areas:

👉 Matrices and tensors: You should be comfortable handling tensors and understanding their dimensions.
👉 Probability and Statistics: Softmax, Layer normalization, and the multinomial distribution play a very important role in building GPT.
👉 Calculus: To train GPT, we need to run backpropagation. The chain rule is the core of this.
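
To make the probability pieces concrete, here is a minimal sketch in plain Python (standard library only, in keeping with the low-library approach). It is not taken from the notes: the function names, the three-token `logits` list, and the fixed random seed are all assumptions made for illustration, and the layer normalization shown leaves out the learned scale and shift parameters.

```python
import math
import random

def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def layer_norm(x, eps=1e-5):
    """Normalize a vector to zero mean and (roughly) unit variance.

    Assumption: no learned scale/shift parameters, to keep the sketch short.
    """
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]

def sample_token(probs, rng):
    """Draw one index from a categorical (single-trial multinomial) distribution."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]

logits = [2.0, 1.0, 0.1]      # hypothetical scores for a 3-token vocabulary
probs = softmax(logits)       # probabilities over the vocabulary
normed = layer_norm(logits)   # same vector, re-centered and re-scaled
rng = random.Random(0)        # fixed seed so the draw is repeatable
next_token = sample_token(probs, rng)
```

Sampling from the softmax output, rather than always taking the largest probability, is what lets a trained model produce varied text.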

My notes cover all the steps involved in building GPT-2 from scratch:

👉 Learning about LLMs (Large Language Models)
👉 Stages of building an LLM
👉 Data preprocessing
👉 Cleaning and tokenizing text
👉 Transformer architecture
👉 Attention mechanisms (Multi Head Attention)
👉 Coding & Training Model
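
As a rough illustration of the multi-head attention step in the list above, the sketch below implements causal scaled dot-product attention in plain Python and splits each input vector across heads. It is a simplified sketch, not the code from the notes: the learned query, key, value, and output projection matrices that GPT-2 uses are left out, and the function names and the toy input `x` are assumptions.

```python
import math

def softmax(row):
    """Turn a list of scores into weights that sum to 1."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def causal_attention(q, k, v):
    """Scaled dot-product attention where position i only sees positions <= i."""
    d = len(q[0])
    out = []
    for i, qi in enumerate(q):
        # Score against positions 0..i only; later positions are masked out.
        scores = [sum(a * b for a, b in zip(qi, k[j])) / math.sqrt(d)
                  for j in range(i + 1)]
        weights = softmax(scores)
        # Weighted average of the visible value vectors.
        out.append([sum(w * v[j][c] for j, w in enumerate(weights))
                    for c in range(len(v[0]))])
    return out

def multi_head_attention(x, num_heads):
    """Split each vector into num_heads slices, attend per head, concatenate.

    Assumption: no learned Q/K/V/output projections; each head uses its
    slice of the input directly as query, key, and value.
    """
    d_model = len(x[0])
    assert d_model % num_heads == 0, "d_model must divide evenly across heads"
    d_head = d_model // num_heads
    heads = []
    for h in range(num_heads):
        part = [row[h * d_head:(h + 1) * d_head] for row in x]
        heads.append(causal_attention(part, part, part))
    # Concatenate the head outputs back to d_model values per position.
    return [sum((head[i] for head in heads), []) for i in range(len(x))]

# Hypothetical input: 3 token positions, each a 4-dimensional vector.
x = [[1.0, 0.0, 0.5, 0.5],
     [0.0, 1.0, 0.5, -0.5],
     [1.0, 1.0, 0.0, 0.0]]
out = multi_head_attention(x, num_heads=2)
```

Because of the causal mask, the first position can only attend to itself, so the first row of `out` matches the first row of `x`; later rows blend information from earlier positions.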
