New chapter: Agentic manual testing - about how having agents "manually" try out code is a useful way to help them spot issues that might not have been caught by their automated tests simonwillison.net/guides/agent...
New chapter: Agentic manual testing - about how having agents "manually" try out code is a useful way to help them spot issues that might not have been caught by their automated tests simonwillison.net/guides/agent...
I get a version of that by saying "write a plan for what you're going to do and wait for me to review it"
Many of my prompts start with me feeding in the entirety of my existing code - hard to see how that decision relating to image generation applies to LLMs for code
I'm very frustrated at how little useful discourse there has been over that issue though - how much do you need to prompt Claude Code before you can claim copyright over the result? Has anyone even tried to answer that question yet?
I think that affects images generated from a single prompt, I haven't seen anything that suggests it affects LLM generated code that's had any hint of human interaction with the coding process at all
That was a total miss, I'm sorry! Spent the whole 2.5 days in Manhattan
I went to the New York Earth Room! It's 280,000 pounds of soil in a loft in SoHo that's been there mostly unchanged since 1977
www.niche-museums.com/117
I started a new chapter of my Agentic Engineering Patternw guide about anti-patterns - things NOT to do
So far I only have one: Inflicting unreviewed code on collaborators, aka dumping a thousand line PR without even making sure it works first simonwillison.net/guides/agent...
Published some notes on the situation at Qwen - they released the Qwen 3.5 family (an outstanding family of open weight models) but now their lead researcher and several others all appear to have resigned within the past 24 hours simonwillison.net/2026/Mar/4/q...
I hope it's OK!
I started a new section of my Agentic Engineering guide for annotated versions of prompts I've used for projects - the first is a prompt I used to have Claude Code for web build me a web UI for compressing GIFs using a WebAssembly build of Gifsicle simonwillison.net/guides/agent...
5. "No new chicks for four years (due to a lack of fruiting rimu trees)" The phrasing "lack of fruiting rimu trees" is slightly imprecise. The issue isn't that rimu trees failed to fruit at all, but that there was no mass fruiting (masting) event, which is the specific trigger for kΔkΔpΕ breeding. Consider "due to a lack of rimu masting" or "due to a lack of mass rimu fruiting."
Sent the February edition of my sponsors-only newsletter - a summary of my last month of blogging for people who want to pay for a shorter version
I use Claude as a proofreader and fact checker, was delighted that it called me out on this KΔkΔpΕ detail: simonwillison.net/2026/Mar/2/f...
This is why I'm a heavy user of Claude Code for web, Anthropic's cloud hosted version of Claude
I don't care what happens in the sandbox if it's running on Anthropic's machines and not mine
New chapter of my Agentic Engineering Patterns guide. This one is about having coding agents build custom interactive and animated explanations to help fight back against cognitive debt simonwillison.net/guides/agent...
{ "recordWidth": 256, "totalRecords": 299382, "totalBytes": 76641792, "signposts": [ { "idx": 0, "cp": 0 }, { "idx": 37422, "cp": 38452 }, { "idx": 74845, "cp": 81958 }, { "idx": 112268, "cp": 150305 }, { "idx": 149691, "cp": 187764 }, { "idx": 187113, "cp": 1001839 }, { "idx": 224536, "cp": 1039262 }, { "idx": 261959, "cp": 1076687 } ] }
It has a very tiny index to speed up the first few hits, but the point of the exercise was to do a binary search over HTTP ranges so optimizing it further would break that goal!
When D3 was released in February 2011, it transformed how we build data visualizations for the web and ushered in a new age of unbridled creativity in information design, data journalism, and beyond. Today, it remains the backbone of modern, interactive data viz.
Happy 15th birthday, D3 π
d3js.org
Go for it!
I worry about the crossover with the GOAT acronym, that's what came to mind for me first before spotting the scapegoating equivalence
Built a fun prototype this morning of binary search using HTTP range requests, in this case to lookup characters in ~77MB of unicode data simonwillison.net/2026/Feb/27/...
Yes, that's definitely on the list
Today's chapter of Agentic Engineering Patterns is some good general career advice which happens to also help when working with coding agents: Hoard things you know how to do simonwillison.net/guides/agent...
"Google says" annoyed me a whole lot more than "Wikipedia says", I think because the Wikipedia one at least gives me enough provenance to save me a click
If I wanted to know what ChatGPT thought I'd fucking well ask ChatGPT!
I'm asking YOU because you're a human and you have experience!
Nothing to open Xcode /even once/ for my presentation app project was delightful
I like Ethan Mollick's perspective on AI outside of just computer programmers, Drew Breunig for deep prompting nerdery, Armin Ronacher for software engineering with AI, Jack Clark for AI policy with an Anthropic angle... most of the rest I get from Hacker News and various Discords
No I've not tried that yet - I'm really just using them for my own confirmation right now
Brief notes on Claude Code Remote and Cowork scheduled tasks - both of which overlap with OpenClaw, and both of which require you to leave your computer powered on somewhere simonwillison.net/2026/Feb/25/...
I've been having good results recently asking coding agents to provide "a linear walkthrough of the code that explains how it all works in detail" - I demonstrated that against this vibe coded Swift codebase and wrote up the technique here: simonwillison.net/guides/agent...