Z2H Video 2, round 2, finished watching [Post #15, Day 32]

February 6, 2025•89 words

Just finished watching Video 2, now attempting the exercises.

E01: train a trigram language model, i.e. take two characters as an input to predict the 3rd one. Feel free to use either counting or a neural net. Evaluate the loss; Did it improve over a bigram model?

This is not an easy one! I started with trying the neural net framework, going to switch to the explicit counts method now.

I have also been watching a new video that Andrej just published yesterday: Deep Dive into LLMs like ChatGPT.

👍❤️🫶👏👌🤯🤔😂😍😭😢😡😮

More from A Civil Engineer to AI Software Engineer 🤖
All posts

Z2H Video 2, round 2 [Post #14, Day 30]

February 4, 2025•93 words

PyTorch is a deep learning neural network framework. Part 1: bigram language modeling, explicit (statistical) approach Intro to makemore, "makemore takes one text file as input, where each line is assumed to be one training thing, and generates more things like it" Names data set, quick analysis of data set Divide up all the bigrams in the names data set and keep counts of them in a dictionary PyTorch tensor to store bigram counts instead (27 x 27 tensor) Summary Part 2: bigram language mo...

Read post

Z2H Video 2, exercise E01 [Post #16, Day 33]

February 7, 2025•133 words

I have worked my way through building and "training" my trigram language model using the counting method. Here are some fun names output from my model (I searched through the data set of names and some of these like Samiyah, Kaley, Aviyah, and Glen are actually in the data set, hmmm, is that a problem?): Ce Bra Jalius Rochityharlonimittain Luwak Ka Da Samiyah Javer Gotai Moriellavoji Preda Kaley Maside En Aviyah Folspihiliven Tahlas Kashruban Glen Qualitatively speaking, they seem a little...

Read post

Z2H Video 2, round 2, finished watching [Post #15, Day 32]

More from A Civil Engineer to AI Software Engineer 🤖All posts

Z2H Video 2, round 2 [Post #14, Day 30]

Z2H Video 2, exercise E01 [Post #16, Day 33]

More from A Civil Engineer to AI Software Engineer 🤖
All posts