2024-12-17 at 16:09
why is everyone being weird about scaling test-time compute vs. training compute?
https://x.com/ClementDelangue/status/1868740932251844806
or maybe that's just tweakers on AI Twitter and not people in actual research communities
===
also I'm curious how the phenomenon of "for every additional 10x of training compute, ~15x of inference compute can be eliminated" maps onto humans
re:
https://yellow-apartment-148.notion.site/AI-Search-The-Bitter-er-Lesson-44c11acd27294f4495c3de778cd09c8d
"Moreover, the brilliant Scaling Scaling Laws with Board Games show that “for each additional 10× of train-time compute, about 15× of test-time compute can be eliminated” even down to single-neuron models. Recall that Stockfish beat Leela with a model 3 orders of magnitude smaller."