I’m on vscode/phpstorm, running mostly Claude Code with Codex as a verifier. Mostly building internal tooling in our WordPress CRM, so within an auth boundary. 8 months ago Claude was a helpful tool that needed close attention for small tasks. Now it’s breezing through complex refactors.
22.02.2026 17:54
👍 1
🔁 0
💬 0
📌 0
Sure, it’s possible it won’t get there. But so far, progress continues. The frontier models are starting to make modest contributions to maths and physics. I don’t think it’s obvious that ‘coding and some help in maths’ is the natural end point, and it seems plausible it can extend to other domains
22.02.2026 15:45
👍 1
🔁 0
💬 0
📌 0
If you are trying to build a cancer researcher, not a classifier of x-rays, there is 15y of research that the more human data you put in, the more coherent the outputs.
22.02.2026 15:29
👍 0
🔁 0
💬 1
📌 0
There have been plenty of philosophical arguments that LLMs will inherently miss some part of being human, and maybe that’s right. But it is simultaneously plausible that they will get _good enough_ to make medical advances anyway.
22.02.2026 13:15
👍 0
🔁 0
💬 0
📌 0
Ok, you keep stating things as facts that I do not agree are facts (and prob vice versa), so we have prob got as far as we can here.
22.02.2026 12:26
👍 0
🔁 0
💬 0
📌 0
All right, all I can say at this point is that this isn’t the (tentative) conclusion I had come to from the same data.
22.02.2026 12:25
👍 0
🔁 0
💬 0
📌 0
But a year ago this wasn’t possible. It might indeed plateau now, who knows. But that argument has been made for the last 3y and yet things continue to improve. Is it not worth taking seriously the possibility that this is coming?
22.02.2026 12:23
👍 0
🔁 0
💬 1
📌 0
Are you expecting that coding is a specific domain and that the same targeting RL and other techniques won’t scale to other areas? Maybe you’re correct! I just don’t see why…
22.02.2026 12:15
👍 0
🔁 0
💬 1
📌 0
ChatGPT Pro is being used for Erdos problems in maths. ChatGPT Pro is a model scaled on copyrighted works, amongst many other optimisations. Isn’t it?
22.02.2026 12:14
👍 0
🔁 0
💬 1
📌 0
Sure, fair enough. I would only implore you to keep an eye on the progress being made in coding, maths, and physics, and reevaluate from time to time.
22.02.2026 12:12
👍 0
🔁 0
💬 1
📌 0
But the machines making frontier progress in physics and maths are LLMs. Maybe there’s a reason this wouldn’t extend to medical research, but it’s not obvious to me what that is.
22.02.2026 12:09
👍 0
🔁 0
💬 1
📌 0
Because the aim is to make a human-like intelligence. The research of the last 15y shows that the more data you feed in, the more it pattern matches on all available human thought. The idea is that this leads to an ‘intelligence’ which can at improve medical pattern matching, and hopefully innovate.
22.02.2026 12:07
👍 0
🔁 0
💬 2
📌 0
You can declare this to be true, and maybe you’re right. But the moral case being made is by thousands of machine learning engineers who disagree, is my point.
22.02.2026 12:06
👍 0
🔁 0
💬 2
📌 0
(fwiw I think the case is ‘if you think you have a pathway to a machine that can cure cancer by ingesting all IP then you should do it and take the consequences’, but I know that isn’t satisfactory)
22.02.2026 11:25
👍 1
🔁 0
💬 2
📌 0
Some behaviour isn’t good, for sure. But it’s coming nonetheless and getting better rapidly. The original sin of copyright is a major issue for many, but the maths is just out there now. The more help from smart liberals the better off we’ll all be.
22.02.2026 11:25
👍 0
🔁 0
💬 1
📌 0
Smart people like you can have major impacts this early in the process - especially because the left are leaving the whole field open for the right to impose their desires.
22.02.2026 10:06
👍 0
🔁 0
💬 1
📌 0
There is a pathway here to a tech that can make major medical advances, and it would be nice if that pathway could work well for everyone.
22.02.2026 10:06
👍 0
🔁 0
💬 1
📌 0
The tech is now good enough for plenty of meaningful use cases, while still being bad at others. That’s sufficient. The world really needs the help of smart people like you to be thinking how to deal with the real possibility of job losses, or preventing people using it lazily for important things.
22.02.2026 10:06
👍 0
🔁 0
💬 1
📌 0
You are a good guy acting in good faith, and I am a fan of your work. But a lot has changed in AI since November - there’s been a coding explosion. And it couldn’t count fingers for 2 months, 2 years ago.
22.02.2026 10:06
👍 1
🔁 0
💬 1
📌 0
A Guide to Which AI to Use in the Agentic Era
It's not just chatbots anymore
Every few months, I write an updated, idiosyncratic guide on which AIs to use right now.
My new version has the most changes ever, since AI is no longer just about chatbots. To use AI you need to understand how to think about models, apps, and harnesses. open.substack.com/pub/oneusefu...
18.02.2026 01:50
👍 129
🔁 30
💬 5
📌 8
Claude Code users: if you have a mouse with a bunch of buttons I highly recommend:
—setting one to toggle+paste SuperWhisper (or speech->text app of your choice)
—mapping another to Enter
Feels absolutely decadent, but so pleasing.
12.02.2026 19:14
👍 0
🔁 0
💬 0
📌 0
Yes exactly. It’s not a given. But I’m just kinda assuming that if you can train a model on code outputs to be as good as Opus, you can do it on things like ‘analyse this doc for legal problems’. Especially when the bar isn’t perfection, it’s just ‘better than the average humans’.
11.02.2026 15:03
👍 1
🔁 0
💬 0
📌 0
There’s this odd cross-purposes thing where some people are talking about intelligence and some people are talking about utility. I’m doing the latter. My point would be that predictive text - or whatever wording you want to use - is enough.
11.02.2026 12:24
👍 1
🔁 0
💬 1
📌 0
Sure. But I would feel happier if we could quantify ‘a lot’.
11.02.2026 11:31
👍 0
🔁 0
💬 0
📌 0
I suppose to me that doesn’t feel like how markets work. Compute and tokens have only got cheaper over time. If there’s demand, someone will supply.
11.02.2026 11:29
👍 1
🔁 0
💬 1
📌 0
I suppose I see the 90% as jagged. There are some specialties where that’s true. But there are a lot of developers churning out entirely normal stuff like internal tooling and standard apps (me among them) for whom LLMs are speeding them up 5x.
11.02.2026 11:27
👍 0
🔁 0
💬 1
📌 0
But the claim isn’t that it can do _everything_. There is a lot of software that is less critical than financial systems. It’s that the tech is already good enough to do a lot, and is only improving.
11.02.2026 11:24
👍 0
🔁 0
💬 1
📌 0
But if you have a model capable of doing 90% of everyday spreadsheet management and administrative processes, are employers not going to seek it out and use it?
11.02.2026 11:06
👍 0
🔁 0
💬 1
📌 0
I agree that salespeople are annoying. But do the capabilities of Opus 4.6 and Codex 5.3 not give you pause? The claim isn’t that they are perfect, it’s that they are good enough. And the implication is that anything with verifiable outputs can be modelled in the same way.
11.02.2026 10:59
👍 2
🔁 0
💬 3
📌 0
Sure. But there’s plenty of administrative work that is managing spreadsheets and running processes too.
11.02.2026 10:57
👍 0
🔁 0
💬 2
📌 0