2024-12-22 at 16:21 3b1b
the cost function is the average of the per-example costs over all training examples
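in symbols (my notation: C_k = cost on training example k, n = number of training examples):

$$C(w) = \frac{1}{n}\sum_{k=1}^{n} C_k(w)$$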
backprop gives you the gradient of the cost C(w1, w2, ...) with respect to every weight and bias
(how?)
but to calculate that exactly, you'd need to go through all the examples
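(because the gradient of an average is the average of the per-example gradients:

$$\nabla C(w) = \frac{1}{n}\sum_{k=1}^{n} \nabla C_k(w)$$

so the exact gradient means one backprop pass per example, all 50,000 of them, before taking a single step)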
instead, we use only a few examples at a time and compute not the exact gradient but a stochastic estimate of it, running backprop on just those few examples. This makes sense because it's also how humans learn: we don't retrain on 50,000 examples before adjusting our strategy/intuition/whatever (whether consciously or subconsciously). We adjust as we go, just not after every single example. For an MNIST-type situation, we need maybe 10-50+ examples before adjusting. (For situations like reasoning through physics practice problems, we only need about one example before adjusting, but that's an entirely different kind of learning.)
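a quick numpy sketch of the minibatch idea (my own toy example, not from the video; linear regression instead of MNIST to keep it tiny):

```python
import numpy as np

rng = np.random.default_rng(0)

# toy "dataset": 50,000 examples, true weights we want to recover
n, d = 50_000, 5
true_w = rng.normal(size=d)
X = rng.normal(size=(n, d))
y = X @ true_w + 0.1 * rng.normal(size=n)

def grad(w, Xb, yb):
    # gradient of the mean squared error over just the batch (Xb, yb):
    # C(w) = mean_k (x_k . w - y_k)^2  =>  grad C = (2/m) X^T (X w - y)
    m = len(yb)
    return (2.0 / m) * Xb.T @ (Xb @ w - yb)

w = np.zeros(d)
lr, batch_size = 0.1, 32  # hyperparameters picked arbitrarily

for step in range(1000):
    # stochastic gradient: estimated from 32 random examples, not all 50,000
    idx = rng.integers(0, n, size=batch_size)
    w -= lr * grad(w, X[idx], y[idx])

print("distance from true weights:", np.linalg.norm(w - true_w))
```

each step is noisy (it only saw 32 examples) but unbiased, so on average it points downhill and w still converges. that's the "adjust as we go, but not off a single example" idea.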