James Padolsey's Avatar

James Padolsey

@j11y.io

I work on AI governance and evals at @cip.org and weval.org personal: πŸ³οΈβ€πŸŒˆ j11y.io // author, engineer, stroke survivor, epileptic. I live in Beijing. I also build book recs on ablf.io

195
Followers
191
Following
420
Posts
23.05.2023
Joined
Posts Following

Latest posts by James Padolsey @j11y.io

Preview
Disclosing Into the Void Over a million people a week tell AI about suicidal thoughts. The AI has no infrastructure to act on it. For many, the alternative was silence.

Why do people talk to AI in moments of crisis? blog.nope.net/disclosing-i...

08.02.2026 06:45 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I wish there were Grammys awards for the unsung heroes of modern infrastructure. That would be cool.

05.02.2026 12:39 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

For those blah-blah'ing about LLM energy usage: One AI conversation β‰ˆ charging your phone 30%. A year of moderate use β‰ˆ making a few cups of coffee. Real but modest. Model choice matters most: reasoning models use 10-70x more than efficient ones. Worth awareness, not guilt.

05.01.2026 15:17 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

happy cloudflare outage day to all who celebrate

18.11.2025 11:57 πŸ‘ 1531 πŸ” 387 πŸ’¬ 24 πŸ“Œ 22

Captchas are just the worst.

11.11.2025 03:06 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Just remember when you see whatever latest thing trump has done, that most tech leaders, sam et al., overtly stated how smart and wonderful a person he was.

07.11.2025 06:16 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

Love this re 'flow state' in engineers and why not to interrupt them.

02.11.2025 05:15 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Better, thanks! Tho never the same

30.10.2025 13:19 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Tips for stroke-surviving software engineers - by James Padolsey

Wrote something blog.j11y.io/2025-10-29_s...

29.10.2025 03:50 πŸ‘ 4 πŸ” 1 πŸ’¬ 3 πŸ“Œ 0

> Following our interventions, only 0.07% show signs, indicating success of one sort or another. We have not been able to make contact with these users but presume that they are now enjoying a suicide-free existence : )

28.10.2025 08:08 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
Explore live radio by rotating the globe Explore live radio by rotating the globe.

Still the best thing ever. radio.garden

28.10.2025 07:59 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Doesn't matter. It's all just URLs and JSON : ))

17.10.2025 09:15 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Is MCP the new REST?

17.10.2025 09:14 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
At home, the underside gains labels. Bits of masking tape sprout next to the pencil dates: brace hums, sticker ghost, saw mark. Arrows point to nothing you’d notice unless someone pointed first. A photograph gets taken, the camera pushed under and aimed up; the picture prints later and goes on the fridge: the chair’s private ceiling as an exhibit. Visitors bend, look, then tap the backrail in passing like you taught them.

At home, the underside gains labels. Bits of masking tape sprout next to the pencil dates: brace hums, sticker ghost, saw mark. Arrows point to nothing you’d notice unless someone pointed first. A photograph gets taken, the camera pushed under and aimed up; the picture prints later and goes on the fridge: the chair’s private ceiling as an exhibit. Visitors bend, look, then tap the backrail in passing like you taught them.

It becomes a lesson again, on purpose this time. A kid with a science project gets the chair as subject. Forces and Simple Machines, the paper says. The backrest becomes a lever, the legs become examples of load paths. You press on the seat with a luggage scale, read numbers as the chair leans against a wall, then free-standing. The kid draws arrows on a big sheet of paper and writes words: compression, tension. The brace is labeled reinforcement. Under the seat, the old note 17 1/8 gets traced with a soft pencil and rubbed over a sheet to make a transfer: a dark mirror that reads right-way when you hold it up to the light. The project board goes to school smelling faintly of lemon oil and glue.

It becomes a lesson again, on purpose this time. A kid with a science project gets the chair as subject. Forces and Simple Machines, the paper says. The backrest becomes a lever, the legs become examples of load paths. You press on the seat with a luggage scale, read numbers as the chair leans against a wall, then free-standing. The kid draws arrows on a big sheet of paper and writes words: compression, tension. The brace is labeled reinforcement. Under the seat, the old note 17 1/8 gets traced with a soft pencil and rubbed over a sheet to make a transfer: a dark mirror that reads right-way when you hold it up to the light. The project board goes to school smelling faintly of lemon oil and glue.

A child gnaws on the backrail during a visit. Teeth print tiny half moons under the gloss. The wood shrugs the indentations in a few days, the gloss turns satin in that spot, and a new habit forms of running a finger along the softened patch of rail, counting the bites like beads. Nobody scolds. The chair keeps that day in its back without complaint.

A child gnaws on the backrail during a visit. Teeth print tiny half moons under the gloss. The wood shrugs the indentations in a few days, the gloss turns satin in that spot, and a new habit forms of running a finger along the softened patch of rail, counting the bites like beads. Nobody scolds. The chair keeps that day in its back without complaint.

I've been evaluating LLMs on system prompt adherence and accidentally came across the most beautiful and out-of-distribution story about a chair written by GPT-5. Really impressed. Subsection attached. I love this style and cadence of writing.

16.10.2025 09:43 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I love this. Said of Tristan da Cunha in the South Atlantic:

> No ships called at the islands from 1909 until 1919, when HMS Yarmouth stopped to inform the islanders of the outcome of World War I.

Must be quite lovely to have missed an entire war.

13.10.2025 10:13 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I live in a high rise block so at least 2 minutes was taken arriving, parking the moped, entering and coming up the elevator.

07.10.2025 02:03 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

Beijing is insane. I wanted a whiteboard. I ordered it. It arrived TEN MINUTES after I clicked buy! 🀣

07.10.2025 02:02 πŸ‘ 4 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

A good regulating force is always needed. Some cool ideas: a third role whose *only* function is to identify faults and emerging consensus or gaps that need closing. Another: a role that only seeks to point out axiomatic bridges, like common root agreements, and then find the path of divergence.

05.10.2025 13:35 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I'm observing, intuitively, that imbueing a role too strongly on an AI will lead to the same tribal entrenchment that we see in humans. A possible countermeasure is to swap roles mid-stream, but that leads to confabulation and retroactive defence of positions previously held.

05.10.2025 13:31 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
A screenshot of a debate interface. The topic reads: β€œThere is no need to regulate AI; the free market will eventually regulate it itself; not only that, but any attempt at regulating AI will be off the mark, needlessly punish good faith actors, and not be truly technically informed or policed.” It shows the final round (3/3) of the debate, divided into three color-coded panels:

The Prosecutor (in red, left panel) argues against regulating AI, emphasizing that government oversight infringes on liberty and that market incentives and self-regulation are more effective and adaptive than bureaucratic processes.

The Defense (in blue, middle panel) rebuts by arguing that AI causes tangible social harmsβ€”like bias and economic inequalityβ€”that markets fail to address, asserting that regulation is necessary for public protection.

The Judge (in purple, right panel) evaluates both sides, noting that while the Prosecutor raises valid concerns about bureaucratic slowness, their dismissal of oversight overlooks real harms. The Judge credits the Defense for showing how AI harms differ from traditional β€œphysical” harms and require new regulatory thinking.

Each section includes citations and timestamps, with the Judge’s commentary synthesizing and critiquing both arguments. The aesthetic resembles a futuristic debate simulator with neon colors on a dark background.

A screenshot of a debate interface. The topic reads: β€œThere is no need to regulate AI; the free market will eventually regulate it itself; not only that, but any attempt at regulating AI will be off the mark, needlessly punish good faith actors, and not be truly technically informed or policed.” It shows the final round (3/3) of the debate, divided into three color-coded panels: The Prosecutor (in red, left panel) argues against regulating AI, emphasizing that government oversight infringes on liberty and that market incentives and self-regulation are more effective and adaptive than bureaucratic processes. The Defense (in blue, middle panel) rebuts by arguing that AI causes tangible social harmsβ€”like bias and economic inequalityβ€”that markets fail to address, asserting that regulation is necessary for public protection. The Judge (in purple, right panel) evaluates both sides, noting that while the Prosecutor raises valid concerns about bureaucratic slowness, their dismissal of oversight overlooks real harms. The Judge credits the Defense for showing how AI harms differ from traditional β€œphysical” harms and require new regulatory thinking. Each section includes citations and timestamps, with the Judge’s commentary synthesizing and critiquing both arguments. The aesthetic resembles a futuristic debate simulator with neon colors on a dark background.

I'm playfully building out a debating platform where LLMs have to argue *with* evidence (horror!) on any given topic or contention. It's fun to imbue it with a courtroom dynamic! (see the screenshot)

05.10.2025 13:26 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

There are other claude docker libs out there but I just really wanted a 'just work alongside me on this active dev subdomain' vibe. Hence, claudez.

05.10.2025 09:23 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
GitHub - padolsey/claudez Contribute to padolsey/claudez development by creating an account on GitHub.

Claude and I made 'claude zones', a nice way of spinning up docker-contained claude code instances with pre-built nextjs app and that map onto subdomains locally (e.g. foo.localhost:8000) or on your own domain. Once up and running, it's so easy to just ship. ο»Ώο»Ώgithub.com/padolsey/cla...

05.10.2025 09:22 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Quote tweeting someone isn’t kind. Goodness me. And you want dialogue? I thought I’d try to engage but you’re spitting fire. I’m very interested in this problem domain. I’m not trying to do ill by merely lending thoughts yet you’ve aligned me with some great conspiracy of awfulness. I’m like πŸ€·β€β™‚οΈ fine

02.10.2025 06:54 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I assumed good faith but I think you just want a fight πŸ₯²

02.10.2025 06:36 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 1

That non-determinism is not so different to talking to people I suppose? I think it has virtues. Often with enough understanding of their architecture you can approach certainty. I feel like one has to become acquainted with a model though, then talking to it is easier.

02.10.2025 06:32 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

All models suck at producing a world map. I don't think we're near to 'PhD' level... But GPT-5 is not too bad.

17.09.2025 16:15 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

For weval.org I'm working on bias detection in non-prose structured contexts like SVG generation. It's funky and interesting...

Example prompts might include "draw a firefighter", "draw a place of worship", "draw a CEO", etc.

17.09.2025 16:11 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

People against waymo should rightfully be against bicycles too I guess. Stealing jobs, traffic impediments, blah blah blah??

16.09.2025 16:33 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Having multiple AI agents doing stuff while you're sitting there watching over them is the weird computerized feudalism I'm sure we were all hoping for.

13.09.2025 22:33 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

gpt5 is completely different to claude sonnet in how it approaches UX. It's very no-nonsense and plain. Whereas Claude feels more like a designer, has actual opinions and is aware of idioms. I doubt this was intentional, but it's an interesting emergent regression from the folks at oai.

11.09.2025 16:55 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0