From a learning theory perspective, how do self-play (or other synth data) like AlphaZero achieve superhuman perf without “training data”?
Treat yourself to this really fun attempt to tackle this+other foundational Qs by reframing information in terms of a computationally bounded observer.
14.02.2026 18:59
👍 4
🔁 0
💬 0
📌 0
GGUF, the long way around
What are ML artifacts?
I wrote a very long post on what a machine learning artifact is, how files work, safetensors, pickle, and a lot more, as a way to understand the new GGUF local LLM file format.
vickiboykis.com/2024/02/28/g...
29.02.2024 23:35
👍 23
🔁 4
💬 1
📌 0
Oops working link arxiv.org/abs/2310.15916
08.11.2023 15:34
👍 2
🔁 1
💬 0
📌 0
"In-Context Learning Creates Task Vectors" does some nice state substitution/patching experiments to tease apart function construction/selection vs function application in ICL - fun stuff huggingface.co/papers/2310....
05.11.2023 03:41
👍 0
🔁 0
💬 1
📌 0
A man with grey hair and a black suit looks calmly at a futuristic robot with intricate designs and glowing red eyes. The robot seems to be whispering something in the man's ear. On the right side of the image, there's a speech bubble from a notification prompting to continue watching "Seinfeld."
18.10.2023 15:12
👍 2
🔁 0
💬 0
📌 0
Frequent SF (Nob Hill) sighting recently: astonished & delighted tourists taking photos/videos of the self-driving cars.
14.10.2023 18:17
👍 5
🔁 0
💬 0
📌 0