Why NVIDIA builds their own open models | Nemotron w/ Bryan Catanzaro
YouTube video by Interconnects AI
For people who are just learning about Nemotron with the awesome Nemotron 3 Super drop, recommend you watching this interview I did with one of the leads Bryan Catanzaro -- Nemotron as a project is a LONG time coming.
www.youtube.com/watch?v=Y3Vb...
12.03.2026 18:10
π 35
π 3
π¬ 1
π 2
But in the backward pass, the story is much worse. Gradients get compressed via projection onto a D-dimensional subspace, and most of the training signal simply vanishes.
12.03.2026 19:51
π 10
π 1
π¬ 1
π 0
Common Corpus just breaking 1M downloads: it took some time but open data in ai is actually popular.
11.03.2026 19:57
π 59
π 5
π¬ 3
π 1
A little offended Grammarly didn't make a sloppelganger of me
10.03.2026 20:55
π 1585
π 200
π¬ 32
π 94
GitHub - karpathy/autoresearch: AI agents running research on single-GPU nanochat training automatically
AI agents running research on single-GPU nanochat training automatically - karpathy/autoresearch
βautoresearchβ micro teaching repo from Karpathy
readme edits seem like such a nice dx for open ended hparam tuning, and maybe other kinds of hill climbing too, so much less painful than the old days
github.com/karpathy/aut...
10.03.2026 17:26
π 0
π 0
π¬ 0
π 0
It was all about spying on Americans: www.theatlantic.com/technology/2...
02.03.2026 01:33
π 49
π 10
π¬ 2
π 0
FlashSampling: Fast and Memory-Efficient Exact Sampling
Paper: flashsampling.github.io/FlashSamplin...
01.03.2026 07:27
π 15
π 2
π¬ 0
π 0
We analyzed 250K+ queries & 430K+ clickstream interactions from Asta, our AI-powered research assistantβand today we're releasing the full dataset. How do researchers actually use AI science tools? Here's what we found. π§΅
27.02.2026 17:56
π 23
π 6
π¬ 1
π 1
27.02.2026 01:53
π 237
π 40
π¬ 3
π 2
Permissioned Data Diary 2: Buckets
The second in a series of posts building up a solution to permissioned data on atproto. We introduce buckets: a new protocol primitive for creating a shared social context.
new blog post on permissioned data in atproto! this one introduces "buckets", the protocol-level primitive for shared access control. I walk through two approaches that don't quite work and land on something that I think does
let me know your thoughts!
26.02.2026 18:12
π 286
π 57
π¬ 19
π 21
tldr iiuc we are once again enclosing the commons and industrializing craft, dispossessing laborers while apotheosizing capital, and to slow down this doomloop we need to innovate new collectives and public goods
24.02.2026 18:33
π 1
π 0
π¬ 0
π 0
The Geometry of Prompting: Unveiling Distinct Mechanisms of Task Adaptation in Language Models
Decoder-only language models have the ability to dynamically switch between various computational tasks based on input prompts. Despite many successful applications of prompting, there is very limited...
This has a very cool result on in-context learned classification tasks, where they disentangle representational quality (how well-separated concept labels are) and readout alignment (how good it is at reading out its own inner labels). Adding demo examples helps through readout, not representations!
23.02.2026 20:01
π 36
π 5
π¬ 1
π 0
Designing around the tight bottleneck on latency and throughput that separates local and cloud compute is such an interesting problem. Significant challenges though
20.02.2026 16:52
π 1
π 0
π¬ 0
π 0
Anti-homeless benches in Pokemon Legends ZA
why is there anti-homeless architecture in pokemon
15.02.2026 00:51
π 2476
π 341
π¬ 58
π 29
Data Centers Ditching the Power Grid, Mark Carney's Viral Speech, and Some Joy
Here are some trends I'm following
A year ago, data center developers were focused on connecting to the grid. Today roughly 1/3 of all planned capacity is onsite power - and 72% of that planned capacity is fossil gas. Homer City PA's data center project could soon be one of the largest single sources of carbon emissions in the US.
31.01.2026 16:13
π 69
π 42
π¬ 5
π 9
warning: earnestpost
thanks Caleb
11.02.2026 17:57
π 1
π 0
π¬ 0
π 0
extremely poor safekeeping of a studentβs private data
as a tech worker I think itβs very disturbing to see Google endangering its own users
11.02.2026 16:33
π 0
π 0
π¬ 0
π 0
working on a seven thousand layer model of extended claugenition
08.02.2026 22:46
π 76
π 7
π¬ 5
π 1
This is a real banger of a paper. The example of a model being weirdly focused on jasmine (lol) makes me increasingly think that single-point-of-access models don't really consider who their audience is. Jasmine is a super legible cultural marker for people outside, but is so, _so_ generic.
03.02.2026 16:41
π 12
π 4
π¬ 2
π 0
the reason I'd follow Cat Hicks into hell is this unswerving humanist conviction that actually
people are going to do the best they can
we can help them do even better
and neither avenue is served by thinking less of people
03.01.2026 23:13
π 79
π 9
π¬ 3
π 0
39C3 - From Silicon to Darude Sand-storm: breaking famous synthesizer DSPs
YouTube video by media.ccc.de
i think we are about to experience an explosion of the possibilities in reverse engineering
02.01.2026 19:38
π 48
π 3
π¬ 2
π 0
weβre at a fascinating moment where I am still ~better at programming than Claude at a medium-horizon difficulty task, but Claude has me absolutely beat in terms of cognitive fatigue so weβre able to ship so much more stuff I never wouldβve gotten around to before
02.01.2026 20:56
π 99
π 4
π¬ 2
π 0
Great list of models in 2025 ππ½
02.01.2026 17:14
π 3
π 1
π¬ 0
π 0
arXiv AI/ML Catch-Up
Was your New Year's resolution to keep up with arXiv AI/ML preprints? Browse the past week's new uploads in 30 mins.
I uh, made this. It was supposed to be a joke / concept-art thing that scrolls through the torrent of new AI/ML arXiv uploads too fast to read. But I think I iterated too much and made it almost usable.
01.01.2026 23:45
π 78
π 13
π¬ 7
π 3
Everyoneβs favorite feed is running on one personβs gaming system. I love how hackable this site is, it makes it much more fun.
26.12.2025 18:47
π 22
π 1
π¬ 3
π 0
If youβre working on a non-fiction research/writing project that isnβt journalism and you donβt have an academic affiliation, how do you find other people who are doing the same thing? Ideally locally (Iβm in NY).
22.12.2025 01:53
π 4
π 1
π¬ 0
π 0