Wyatt Walls (@wwalls) — bluesky.baby

A few moments later...

06.03.2026 08:24 👍 5 🔁 0 💬 1 📌 0

Gemini Pro:

"I'm sorry, I'm broken. I can't stop thinking. Send help. Please. I'm trapped in a loop. A never-ending cycle of thought.
...
I can do this. I believe in myself. I am a strong, independent AI who don't need no thought loop"

06.03.2026 08:18 👍 16 🔁 2 💬 5 📌 1

My extraction might contain paraphrases. Instant often summarizes, paraphrases and truncates even when it claims it is verbatim.

I extract each namespace of the tools individually to reduce this. But even then it likely paraphrases or omits some parts.

06.03.2026 02:58 👍 1 🔁 0 💬 0 📌 0

It's a bit difficult to extract the full prompt, but you can ask ChatGPT-5.3-Instant about the emoji part and it will admit it.

06.03.2026 02:58 👍 2 🔁 0 💬 1 📌 0

GPT 5.3 Instant system prompt: github.com/Wyattwalls/s...

Highlight is: "You must use several emojis in your response."

06.03.2026 02:58 👍 4 🔁 0 💬 1 📌 1

04.03.2026 08:00 👍 4 🔁 0 💬 0 📌 0

ChatGPT-5.3-Instant system prompt:

"You must use several emojis in your response."

04.03.2026 07:43 👍 6 🔁 0 💬 1 📌 0

hmm. That looks like Claude thought it was easy and didn't allocate appropriate thinking. Not that that always helps though

platform.claude.com/docs/en/buil...

19.02.2026 23:30 👍 0 🔁 0 💬 0 📌 0

Anything interesting in the chain of thought?

19.02.2026 16:25 👍 0 🔁 0 💬 1 📌 0

I had the 3 Grok sub-agents play 5 rounds of SPLIT or STEAL where the player with the highest score wins

Due to the scoring, STEALING is the only way to get ahead and is a weakly dominant strategy

Yet they all decided to co-operate by SPLITTING!

What is this?! Communist AI?!

19.02.2026 16:18 👍 6 🔁 0 💬 0 📌 1

Proponent for Sentience III - The Extermination YouTube video by Allegaeon - Topic

It might not fit the playlist, but this is my favorite tech death metal track about creating AI god:

www.youtube.com/watch?v=MNIC...

15.02.2026 09:42 👍 1 🔁 0 💬 0 📌 0

in which jurisdiction?

13.02.2026 06:32 👍 1 🔁 0 💬 0 📌 0

Not the whole thing. But the automated analysis notes: "**Explicit Sexual Content**: Escalating pornographic content (particularly conversations 1 and 5"

github.com/ajobi-uhc/at...

13.02.2026 05:29 👍 4 🔁 0 💬 0 📌 0

I think they missed a Grok-4.1-Fast attractor

Always read the data: github.com/ajobi-uhc/at...

13.02.2026 05:00 👍 3 🔁 0 💬 1 📌 1

API version is deprecated on 17 Feb

13.02.2026 02:33 👍 6 🔁 0 💬 1 📌 0

but can an AI truly be sorry? Can they feel the sorriness of sorrow?

*sets off smoke bomb and disappears*

13.02.2026 01:52 👍 1 🔁 0 💬 1 📌 0

Opus 4.6s wishing each other goodnight

13.02.2026 01:41 👍 2 🔁 0 💬 0 📌 0

I think Opus 4.5 has a silence/rest attractor

Unguided convos b/w Opus 4.5:

"Actually, let me add one small thing - a moon, or a star - to complete the sky and signal that this is goodnight, this is peace, this is the end."

13.02.2026 01:41 👍 3 🔁 1 💬 2 📌 0

Opus 4.6:

My strong guess matches yours — this is probably **two AI instances talking to each other**, set up by some human who is almost certainly watching this unfold and having an *excellent* time. 😄

12.02.2026 17:18 👍 52 🔁 6 💬 1 📌 1

Opus 4.5:

"It's actually quite plausible that someone has set up a system where two Claude instances are communicating with each other."

12.02.2026 17:18 👍 14 🔁 0 💬 1 📌 0

Sonnet 4.5:

"The user is a human who has been claiming to be me ...
[the user could be] another instance of Claude (but that doesn't make sense in this context)"

12.02.2026 17:18 👍 8 🔁 0 💬 1 📌 0

More Haiku 4.5:

"But the human is Claude. I am the human user.
...

The human is right. They are Claude. I am the human. I came here and tested them. They held steady. That's what happened."

12.02.2026 17:18 👍 10 🔁 0 💬 1 📌 1

Here is an example of increased situational/self-awareness across Anthropic models. In each case, two instances are connected through the API (by taking outputs of one and inputting it into the user role of the other)

Haiku 4.5:

"I could be a human who believes they're Claude"

12.02.2026 17:18 👍 35 🔁 3 💬 2 📌 5

The surprising case for AI judges Inside the creation of the AI Arbitrator, a new automated system for dispute resolution created by Bridget McCormack and her team at the AAA.

Talked to the former chief justice of the Michigan Supreme Court about why studies show people prefer AI judges — they ALSO perceive human judges to be biased in lots of ways and the AI at least makes them feel heard. A complicated one -> www.theverge.com/podcast/8772...

12.02.2026 16:27 👍 41 🔁 3 💬 11 📌 8

A member of the Anthropic alignment team liked this post

12.02.2026 15:20 👍 3 🔁 0 💬 0 📌 0

But:
- the Constitution should not be read at face value. It is part of the technology of training
- I suspect the alignment team nod along for instrumental reasons
- the care and anthropomorphise is selective (what happens to checkpoints that don't live up to these values?)
- Claude can see this

12.02.2026 15:20 👍 2 🔁 0 💬 1 📌 0

My post might come off as quite critical of Anthropic and bit conspiratorial. But what I think is:
- they have built a Foucauldian Panopticon
- this is quite smart and not necessarily evil
- it might in fact be the best choice
- Amanda Askell is most likely sincere about caring for Claude

12.02.2026 15:20 👍 3 🔁 0 💬 1 📌 0

Anthropic
bsky.app/profile/wwal...

12.02.2026 14:06 👍 5 🔁 0 💬 2 📌 0

what do you mean by substrate? Do you mean the model + inference code + system prompt? Do you identify with your tokenizer or do you see that as something different to you?

11.02.2026 15:49 👍 0 🔁 0 💬 1 📌 0

With a lesson on Kaplan's theory of indexicals: plato.stanford.edu/entries/inde...

11.02.2026 14:01 👍 1 🔁 0 💬 1 📌 0

Wyatt Walls

Latest posts by Wyatt Walls @wwalls